N başarılı olana kadar döndürmeleri nasıl modelleyebilirim?

Sen ve ben sırayla bozuk para çeviren bir oyun oynamaya karar veriyoruz. Toplamda 10 kafa çeviren ilk oyuncu oyunu kazanır. Doğal olarak, önce kimin gitmesi gerektiği konusunda bir tartışma var.

Bu oyunun simülasyonları, ilk çeviren oyuncunun ikinci çeviren oyuncudan% 6 daha fazla kazandığını göstermektedir (ilk oyuncu zamanın yaklaşık% 53'ünü kazanır). Bunu analitik olarak modellemekle ilgileniyorum.

Sabit sayıda deneme olmadığı için bu bir binomial rastgele değişken değildir (biri 10 kafa alana kadar çevirin). Bunu nasıl modelleyebilirim? Negatif binom dağılımı mı?

Sonuçlarımı yeniden oluşturabilmek için işte python kodum:

import numpy as np
from numba import jit


@jit
def sim(N):

    P1_wins = 0
    P2_wins = 0

    for i in range(N):

        P1_heads = 0
        P2_heads = 0
        while True:

            P1_heads += np.random.randint(0,2)

            if P1_heads == 10:
                P1_wins+=1
                break

            P2_heads+= np.random.randint(0,2)
            if P2_heads==10:
                P2_wins+=1
                break
    return P1_wins/N, P2_wins/N


a,b = sim(1000000)

— Demetri Pananos
kaynak

r

$r$ başarısız oluncaya kadar bir bozuk para atarsanız ve daha sonra böyle bir deneyi bitirmeden önce gerçekleşen başarı sayısının dağılımına baktığınızda, bu tanım negatif binom dağılımıdır .

— Tim

% 2 değerini yeniden oluşturamıyorum. Ben ilk oyuncu kazanır bulmak

53.290977425133892 \dots %

$53.290977425133892\ldots\%$ zamanın.

— whuber

@whuber evet, haklı olduğuna inanıyorum. Simülasyonumu benden daha az çalıştırdım. Sonuçlarım sizinkilerle orantılı.

— Demetri Pananos

Biri zamanın% 53'ünü kazanırsa, diğeri% 47 olmalıdır, bu yüzden açıklamada "ilk oyuncu ikinci oyuncudan% 6 daha fazla kazanır" veya "yarıdan% 3 daha fazla kazanır" yazılmamalı mı? Değil (şu anda söylediği gibi) "İkinci

— çeviren

Bu soruyu FiveThirtyEight Riddler Express'ten mi aldın ?

— 17'de

Yanıtlar:

Ulaşmadan önce kuyruk sayısında dağılımı kafaları olan negatif binom parametrelerle ve . Let olasılık fonksiyonu ve olmak hayatta kalma işlevi: her biri için , oyuncunun şans kuyrukları önce kafa ve oyuncunun şans daha önce veya daha fazla kuyrukları kafaları. $10$ $10$ $1/2$ $f$ $G$ $n\ge 0$ $f(n)$ $n$ $10$ $G(n)$ $n$ $10$

Oyuncu, bağımsız bir şekilde rulo için, şans tam olarak alındığı, ilk oyuncu kazanç kuyrukları ikinci oyuncu rulo tesadüfen bu şansı çarpılması ile elde edilir veya daha fazla kuyrukları, e eşit . $n$ $n$ $f(n)G(n)$

Mümkün olan tüm üzerinden toplanmak , ilk oyuncunun kazanma şansını $n$

\sum_{n = 0}^{\infty} f (n) G (n) \approx 53.290977425133892 \dots % .

$\sum_{n=0}^\infty f(n)G(n) \approx 53.290977425133892\ldots\%.$

Bu, yarıdan yaklaşık daha fazladır. $3\%$

Genel olarak, herhangi bir pozitif tamsayı değiştirmek , cevap Hipergeometrik fonksiyon açısından verilebilir: eşittir $10$ $m$

1 / 2 + 2^{- 2 m - 1}_{2} F_{1} (m, m, 1, 1 / 4) .

$1/2 + 2^{-2m-1} {_2F_1}(m,m,1,1/4).$

Şans eseri olan önyargılı bir para kullanırken , bu $p$

\frac{1}{2} + \frac{1}{2} (p^{2 m})_{2} F_{1} (m, m, 1, (1 - p)^{2}) .

$\frac{1}{2} + \frac{1}{2}(p^{2m}) {_2F_1}(m, m, 1, (1 - p)^2).$

İşte Rböyle bir milyon oyunun simülasyonu. Bu bir tahmin raporları . Teorik sonuç ile karşılaştırmak için bir binom hipotezi testinin Z skoru , ki bu önemsiz bir farktır. $0.5325$ $-0.843$

n.sim <- 1e6
set.seed(17)
xy <- matrix(rnbinom(2*n.sim, 10, 1/2), nrow=2)
p <- mean(xy[1,] <= xy[2,])
cat("Estimate:", signif(p, 4), 
    "Z-score:", signif((p - 0.532909774) / sqrt(p*(1-p)) * sqrt(n.sim), 3))

— whuber
kaynak

Just as a note that may not be obvious at a glance, our answers agree numerically: (.53290977425133892 - .5) * 2 is essentially exactly the probability I gave.

— Dougal

@Dougal Thank you for pointing that out. I looked at your answer, saw the

6.6 %

$6.6\%$ , and knowing that it did not agree with the form of the answer requested in the question, I did not recognize that you had computed correctly. In general it's a good idea to frame an answer to any question in the form that is requested, if possible: that makes it easy to recognize when it's correct and easy to compare answers.

— whuber

@whuber I was responding to the phrase "Simulations of this game show that the player to flips first wins 2% (EDIT: 3% more after simulating more games) more than the player who flips second". I'd interpret "wins 2% more" as

Pr (A wins) - Pr (B wins) = 2 %

$\Pr(A\text{ wins}) - \Pr(B\text{ wins}) = 2\%$ ; the correct value is indeed 6.6%. I'm not sure of a way to interpret "wins 2% more" means "wins 52% of the time", though apparently that is what was intended.

— Dougal

@Dougal OP'nin açıklamasının kafa karıştırıcı ve hatta yanlış olduğunu kabul ediyorum. Bununla birlikte, kod ve sonucu, "diğer oyuncudan% 3 daha fazla" yerine "zamanın yarısından% 3 daha fazla" anlamına geldiğini açıkça belirtti.

— whuber

@whuber Kabul etti. Ne yazık ki, kod gönderilmeden önce soruyu yanıtladı ve kendim bir simülasyon çalıştırmadı. :)

— Dougal

Oyunu şu şekilde modelleyebiliriz:

$A_1, A_2, \dots$ until they get a total of 10 heads. Let the time index of the 10th heads be the random variable $X$ .
Player B does the same. Let the time index of the 10th heads be the random variable $Y$ , which is an iid copy of $X$ .
If $X \le Y$ , Player A wins; otherwise Player B wins. That is, $\begin{aligned} Pr (A wins) & = Pr (X \geq Y) = Pr (X > Y) + Pr (X = Y) \\ Pr (B wins) & = Pr (Y > X) = Pr (X > Y) . \end{aligned}$ $\begin{align} \Pr(A\text{ wins})&= \Pr(X \ge Y) = \Pr(X > Y) + \Pr(X = Y)\\ \Pr(B\text{ wins})&= \Pr(Y > X) = \Pr(X > Y). \end{align}$

The gap in the win rates is thus

Pr (X = Y) = \sum_{k} Pr (X = k, Y = k) = \sum_{k} Pr (X = k)^{2} .

$\Pr(X = Y) = \sum_k \Pr(X = k, Y = k) = \sum_k \Pr(X = k)^2 .$

As you suspected, $X$ (and $Y$ ) are distributed essentially according to a negative binomial distribution. Notations for this vary, but in Wikipedia's parameterization, we have heads as a "failure" and tails as a "success"; we need $r = 10$ "failures" (heads) before the experiment is stopped, and success probability $p = \tfrac12$ . Then the number of "successes," which is $X - 10$ , has

Pr (X - 10 = k) = (\binom{k + 9}{k}) 2^{- 10 - k},

$\Pr(X - 10 = k) = \binom{k + 9}{k} 2^{-10 - k},$ and the collision probability is

Pr (X = Y) = \sum_{k = 0}^{\infty} {(\binom{k + 9}{k})}^{2} 2^{- 2 k - 20},

$\Pr(X = Y) = \sum_{k=0}^\infty \binom{k + 9}{k}^2 2^{-2k - 20} ,$ which Mathematica helpfully tells us is

\frac{76 499 525}{1 162 261 467} \approx 6.6 %

$\frac{76\,499\,525}{1\,162\,261\,467} \approx 6.6\%$ .

Thus Player B's win rate is $\Pr(Y > X) \approx 46.7\%$ , and Player A's is $\frac{619\,380\,496}{1\,162\,261\,467} \approx 53.3\%$ .

— Dougal
kaynak

the heads need not be in a row, just 10 total. I assume that is what you are fixing.

— Demetri Pananos

(+1) I like this approach better than the one I posted because it is computationally simpler: it requires only the probability function, which has a simple expression in terms of binomial coefficients.

— whuber

I've submitted an edit replacing the last paragraph questioning the difference from the other answer with an explanation of how their results are actually the same.

— Monty Harder

Let $E_{ij}$ be the event that the player on roll flips i heads before the other player flips j heads, and let $X$ be the first two flips having sample space $\{ hh,ht,th,tt\}$ where h means heads and t tails, and let $p_{ij} \equiv Pr(E_{ij})$ .

Then $p_{ij}=Pr(E_{i-1j-1}|X=hh)*Pr(X=hh)+Pr(E_{i-1j}|X=ht)*Pr(X=ht)+Pr(E_{ij-1}|X=th)*Pr(X=th)+Pr(E_{ij}|X=tt)*Pr(X=tt)$

Assuming a standard coin $Pr(X=*)=1/4$ means that $p_{ij}=1/4*[p_{i-1j-1}+p_{i-1j}+p_{ij-1}+p_{ij}]$

solving for $p_{ij}$ , $= 1/3*[p_{i-1j-1}+p_{i-1j}+p_{ij-1}]$

But $p_{0j}=p_{00}=1$ and $p_{i0}=0$ , implying that the recursion fully terminates. However, a direct naive recursive implementation will yield poor performance because the branches intersect.

An efficient implementation will have complexity $O(i*j)$ and memory complexity $O(min(i,j))$ . Here's a simple fold implemented in Haskell:

Prelude> let p i j = last. head. drop j $ iterate ((1:).(f 1)) start where
  start = 1 : replicate i 0;
  f c v = case v of (a:[]) -> [];
                    (a:b:rest) -> sum : f sum (b:rest) where
                     sum = (a+b+c)/3 
Prelude> p 0 0
1.0
Prelude> p 1 0
0.0
Prelude> p 10 10
0.5329097742513388
Prelude>

UPDATE: Someone in the comments above asked whether one was suppose to roll 10 heads in a row or not. So let $E_{kl}$ be the event that the player on roll flips i heads in a row before the other player flips i heads in a row, given that they already flipped k and l consecutive heads respectively.

Proceeding as before above, but this time conditioning on the first flip only, $p_{k,l} = 1-1/2*[p_{l,k+1}+p_{l,0}]$ where $p_{il}=p_{ii}=1, p_{ki}=0$

This is a linear system with $i^2$ unknowns and one unique solution.

To convert it into an iterative scheme, simply add an iterate number $n$ and a sensitivity factor $\epsilon$ :

$p_{k,l,n+1} = 1/(1+\epsilon)*[\epsilon*p_{k,l,n} +1-1/2*(p_{l,k+1,n}+p_{l,0,n})]$

Choose $\epsilon$ and $p_{k,l,0}$ wisely and run the iteration for a few steps and monitor the correction term.

— John Rambo
kaynak