7th January 2025

Reciprocal Probability Events More Likely than Expected

Events with a probability $x^{-1}$ are typically said to be more likely than not after $x$ repeats. A better estimate is $0.7x$ repeats.

You will no doubt have encountered this common rule of thumb while working with arbitrary probabilities. Those who are more mathematically inclined will eventually wonder why it holds true, and if they try it out will discover that, alarmingly, it does not. For example, an event with probability $\frac{1}{137}$ has a 50/50 chance of having happened after roughly 94.61 tries. I find this by noting that when the probability of the event having happened is a half, the probability of it not having happened is also a half. Therefore $T$, the number of repeats for this to be the case, satisfies

$$\left(1 - \tfrac{1}{137}\right)^T = \tfrac{1}{2}$$

and I find $T$ by computing

$$T = \log_{1 - \frac{1}{137}}\left(\tfrac{1}{2}\right) = \frac{\ln(0.5)}{\ln\left(1 - \frac{1}{137}\right)} \approx 94.61$$

This number is much lower than you would expect using the common estimation.
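
For those who want to try it out themselves, a minimal Python sketch (variable names are purely illustrative) reproduces this number alongside the two estimates:

```python
import math

# Exact repeat count T for a 1-in-137 event to be more likely than not:
# (1 - 1/137)**T = 0.5  =>  T = ln(0.5) / ln(1 - 1/137)
x = 137
exact = math.log(0.5) / math.log(1 - 1 / x)

print(exact)    # ~94.61
print(x)        # the common rule of thumb: 137
print(0.7 * x)  # the better estimate: ~95.9
```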

What I find more interesting than the estimation being wrong is that it's almost right. The number of repeats before the event is likely seems to grow linearly with the denominator of the probability. To determine why this is, let us consider the general formula for such probabilities:

$$T = \frac{\ln(0.5)}{\ln\left(1 - x^{-1}\right)}$$

What does this approach? Well, for larger values of $x$, $1 - x^{-1}$ approaches 1, and around $x = 1$, $\ln(x) \approx x - 1$. So substituting those into the formula:

$$T \approx \frac{\ln(0.5)}{\left(1 - x^{-1}\right) - 1} = -\ln(0.5)\cdot x \approx 0.69x$$
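
A rough numerical look at that linear growth, again as an illustrative Python sketch with arbitrary sample denominators:

```python
import math

# Exact repeat count next to the -ln(0.5) * x estimate: both grow
# linearly in x, with a gap that stays roughly constant.
for x in (10, 100, 1_000, 10_000):
    exact = math.log(0.5) / math.log(1 - 1 / x)
    print(x, round(exact, 3), round(-math.log(0.5) * x, 3))
```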

This was my initial approach. It's almost right, but the logic is flawed: you can't go willy-nilly substituting approximations for things unless you check that the error in those approximations approaches zero. The function that $T$ actually approaches is

$$T \approx -\ln(0.5)\cdot(x - 0.5)$$

which converges quickly, though it is just a constant (negligible) offset from the other approximation. It can be found by instead substituting the first term of a series expansion for $\ln$,

$$\ln(x) = 2\left(\frac{x - 1}{x + 1} + \frac{1}{3}\left(\frac{x - 1}{x + 1}\right)^3 + \frac{1}{5}\left(\frac{x - 1}{x + 1}\right)^5 + \cdots\right)$$

into the limit of a linear approximation

$$f(x) \approx \lim_{h\to\infty}\left[x f'(h) + h - \frac{f(h)}{f'(h)}\right]$$

where $h - \frac{f(h)}{f'(h)}$ is the $x$-intercept of the tangent line at $h$; this coincides with the usual tangent-line form $f(h) + (x - h)f'(h)$ in the limit, because the slope here turns out to approach $-1$.
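
The difference between the two estimates can be seen numerically with another small, illustrative Python sketch:

```python
import math

# Error of each closed-form estimate against the exact repeat count.
# The (x - 0.5) version converges to the exact value; the plain
# -ln(0.5) * x version keeps an offset of about 0.5 * ln(2) ~ 0.347.
for x in (10, 100, 1_000, 10_000):
    exact = math.log(0.5) / math.log(1 - 1 / x)
    simple = -math.log(0.5) * x
    better = -math.log(0.5) * (x - 0.5)
    print(x, round(simple - exact, 4), round(better - exact, 4))
```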

Some preliminary results for the final estimation:

$$\frac{d}{dh}\left(\frac{1}{\ln(1 - h^{-1})}\right) = \frac{-h^{-2}\left(1 - h^{-1}\right)^{-1}}{\left(\ln(1 - h^{-1})\right)^2} = \frac{\left(h^2\left(h^{-1} - 1\right)\right)^{-1}}{\left(\ln(1 - h^{-1})\right)^2} = \frac{1}{\left(h - h^2\right)\left(\ln(1 - h^{-1})\right)^2}$$

$$\ln(1 - h^{-1}) = 2\left(\frac{-h^{-1}}{2 - h^{-1}} + \frac{1}{3}\left(\frac{-h^{-1}}{2 - h^{-1}}\right)^3 + \cdots\right) = \frac{2}{1 - 2h} + \frac{2}{3}\cdot\frac{1}{(1 - 2h)^3} + \cdots$$

$$\begin{aligned}
\lim_{h\to\infty}\frac{d}{dh}\left(\frac{1}{\ln(1 - h^{-1})}\right) &= \lim_{h\to\infty}\frac{\left(h - h^2\right)^{-1}}{\left(\frac{2}{1 - 2h} + \frac{2}{3(1 - 2h)^3} + \cdots\right)^2} \\
&= \lim_{h\to\infty}\left(\left(h - h^2\right)\left(\frac{2}{1 - 2h} + \frac{2}{3(1 - 2h)^3} + \cdots\right)^2\right)^{-1} \\
&= \lim_{h\to\infty}\left(-h^2\left(\frac{2}{1 - 2h}\right)^2\right)^{-1} \\
&= \lim_{h\to\infty}-\left(\frac{1 - 2h}{2h}\right)^2 = -1
\end{aligned}$$

Then substituting these into the approximation:

$$\begin{aligned}
\frac{1}{\ln(1 - x^{-1})} &\approx \lim_{h\to\infty}\left[x\,\frac{d}{dh}\left(\frac{1}{\ln(1 - h^{-1})}\right) + h - \frac{\left(h - h^2\right)\left(\ln(1 - h^{-1})\right)^2}{\ln(1 - h^{-1})}\right] \\
&= -x + \lim_{h\to\infty}\left[h + \left(h^2 - h\right)\ln(1 - h^{-1})\right] \\
&= -x + \lim_{h\to\infty}\left[h + \frac{2h^2 - 2h}{1 - 2h} + \frac{2}{3}\cdot\frac{h^2 - h}{(1 - 2h)^3} + \cdots\right] \\
&= -x + \lim_{h\to\infty}\left[h + \frac{2h^2 - 2h}{1 - 2h}\right] \\
&= -x + \lim_{h\to\infty}\left[h - h + 0.5 - \frac{0.5}{1 - 2h}\right] \\
&= 0.5 - x
\end{aligned}$$

We get a very simple result for all that work.
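
Both limits can be sanity-checked numerically; the following rough Python sketch just plugs in large values of $h$ as a stand-in for $h \to \infty$:

```python
import math

def g(h):
    # g(h) = 1 / ln(1 - 1/h); log1p keeps the evaluation accurate for large h
    return 1 / math.log1p(-1 / h)

for h in (1e3, 1e6, 1e9):
    slope = g(h + 1) - g(h)  # crude finite-difference stand-in for g'(h)
    print(h, round(slope, 6), round(g(h) + h, 6))  # approaches -1 and 0.5
```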

So the more general formula for the number of repeats $T$ to yield a probability $P$ of the event with probability $x^{-1}$ occurring is

$$T = \frac{\ln(1 - P)}{\ln(1 - x^{-1})} \approx -\ln(1 - P)\cdot(x - 0.5)$$

For most values of $P$, the $-0.5$ offset is negligible. It only becomes relevant where $P$ is close to 1, in which case the $0.7x$ estimate from earlier is inaccurate anyway. The estimate is also inaccurate for choices of $x$ close to 1, i.e. high-certainty events. This isn't an issue for reciprocal probabilities, where $x = 2$ is the smallest denominator used, but for other values of $x$ between 1 and 2 (corresponding to events with more than a 50% likelihood) the estimation is very inaccurate. Luckily, it's also not very useful for those values, as such events are very likely after only a single repeat.
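
Finally, a small Python sketch of the general formula next to its linear estimate (the sample values of $P$ are arbitrary):

```python
import math

# Repeats T needed for a 1-in-x event to have occurred with probability P,
# next to the linear estimate -ln(1 - P) * (x - 0.5).
def repeats(x, P):
    return math.log(1 - P) / math.log(1 - 1 / x)

def estimate(x, P):
    return -math.log(1 - P) * (x - 0.5)

for P in (0.5, 0.95, 0.99):
    print(P, round(repeats(137, P), 2), round(estimate(137, P), 2))
```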