Number of trials to draw every element in a setProbability distribution for sampling an element $k$ times after $x$ independent random samplings with replacementExpected number of trials to see x unique values out of N total valuesCard game probability question?Probability for Magic TrickMontmort's card matching problem: Distribution of the number of matching cards?Probability of drawing no aces with replacement of cards?Suppose you draw a five-card hand randomly from the deck and get four cards that that would make a straight if you could replace the fifth card…Odds of nonconsecutive number drawGeometric distribution and lottery problemfrom 34 pick 5, repeat 11 times. Expected number of matches between the 11 trials?

Do we have to expect a queue for the shuttle from Watford Junction to Harry Potter Studio?

How to draw a matrix with arrows in limited space

What are some good ways to treat frozen vegetables such that they behave like fresh vegetables when stir frying them?

How to get directions in deep space?

What is Cash Advance APR?

How much theory knowledge is actually used while playing?

Which was the first story featuring espers?

awk assign to multiple variables at once

What is going on with gets(stdin) on the site coderbyte?

Make a Bowl of Alphabet Soup

Will number of steps recorded on FitBit/any fitness tracker add up distance in PokemonGo?

Does Doodling or Improvising on the Piano Have Any Benefits?

The IT department bottlenecks progress, how should I handle this?

Is there any evidence that Cleopatra and Caesarion considered fleeing to India to escape the Romans?

Is it ethical to recieve stipend after publishing enough papers?

15% tax on $7.5k earnings. Is that right?

Is there a RAID 0 Equivalent for RAM?

Is this toilet slogan correct usage of the English language?

"It doesn't matter" or "it won't matter"?

A Trivial Diagnosis

Why is so much work done on numerical verification of the Riemann Hypothesis?

Is it allowed to activate the ability of multiple planeswalkers in a single turn?

What (the heck) is a Super Worm Equinox Moon?

Why Shazam when there is already Superman?



Number of trials to draw every element in a set


Probability distribution for sampling an element $k$ times after $x$ independent random samplings with replacementExpected number of trials to see x unique values out of N total valuesCard game probability question?Probability for Magic TrickMontmort's card matching problem: Distribution of the number of matching cards?Probability of drawing no aces with replacement of cards?Suppose you draw a five-card hand randomly from the deck and get four cards that that would make a straight if you could replace the fifth card…Odds of nonconsecutive number drawGeometric distribution and lottery problemfrom 34 pick 5, repeat 11 times. Expected number of matches between the 11 trials?













3












$begingroup$


Problem:
Suppose you have a set of p elements and during each trial, you randomly and uniformly select r elements from that set with replacement. What is the probability distribution for the number of trials it would take for every element of that set to have been selected at least once?



Example:
You have a standard deck of 52 cards, and during each trial you randomly select 5 cards from the deck. What is the probability distribution for the number of trials it would take for you to have held every card in the deck?



Notes:
I've worked out that the probability of drawing i new elements when you've already seen $p-t$ elements is



$$dfrac binomp-tr-i binom t i binompr$$



But I'm not sure how this could be used to get the distribution for the number of trials it would take for there to be no new elements. I suspect that this probability distribution is geometric.










share|cite|improve this question











$endgroup$
















    3












    $begingroup$


    Problem:
    Suppose you have a set of p elements and during each trial, you randomly and uniformly select r elements from that set with replacement. What is the probability distribution for the number of trials it would take for every element of that set to have been selected at least once?



    Example:
    You have a standard deck of 52 cards, and during each trial you randomly select 5 cards from the deck. What is the probability distribution for the number of trials it would take for you to have held every card in the deck?



    Notes:
    I've worked out that the probability of drawing i new elements when you've already seen $p-t$ elements is



    $$dfrac binomp-tr-i binom t i binompr$$



    But I'm not sure how this could be used to get the distribution for the number of trials it would take for there to be no new elements. I suspect that this probability distribution is geometric.










    share|cite|improve this question











    $endgroup$














      3












      3








      3


      1



      $begingroup$


      Problem:
      Suppose you have a set of p elements and during each trial, you randomly and uniformly select r elements from that set with replacement. What is the probability distribution for the number of trials it would take for every element of that set to have been selected at least once?



      Example:
      You have a standard deck of 52 cards, and during each trial you randomly select 5 cards from the deck. What is the probability distribution for the number of trials it would take for you to have held every card in the deck?



      Notes:
      I've worked out that the probability of drawing i new elements when you've already seen $p-t$ elements is



      $$dfrac binomp-tr-i binom t i binompr$$



      But I'm not sure how this could be used to get the distribution for the number of trials it would take for there to be no new elements. I suspect that this probability distribution is geometric.










      share|cite|improve this question











      $endgroup$




      Problem:
      Suppose you have a set of p elements and during each trial, you randomly and uniformly select r elements from that set with replacement. What is the probability distribution for the number of trials it would take for every element of that set to have been selected at least once?



      Example:
      You have a standard deck of 52 cards, and during each trial you randomly select 5 cards from the deck. What is the probability distribution for the number of trials it would take for you to have held every card in the deck?



      Notes:
      I've worked out that the probability of drawing i new elements when you've already seen $p-t$ elements is



      $$dfrac binomp-tr-i binom t i binompr$$



      But I'm not sure how this could be used to get the distribution for the number of trials it would take for there to be no new elements. I suspect that this probability distribution is geometric.







      probability probability-distributions






      share|cite|improve this question















      share|cite|improve this question













      share|cite|improve this question




      share|cite|improve this question








      edited Mar 14 at 15:04







      abaresk

















      asked Mar 14 at 14:58









      abareskabaresk

      183




      183




















          1 Answer
          1






          active

          oldest

          votes


















          1












          $begingroup$

          I'm making the assumption that each trial's sample of size $r$ is conducted with replacement and there is replacement between each trial, I believe that's the intention. Note that I'll use $n$ to mean the number of elements ($p$ will be used for probability).



          The special case where $r=1$ is known as the Coupon Collector's Problem. More details can be found here: https://en.wikipedia.org/wiki/Coupon_collector%27s_problem. In short, each individual trial is geometrically distributed but the whole process (number of trails conducted to select each unique item) is not geometrically distributed as the probability updates with each trial (sample). While the expectation of the geometric distribution is $E(X)=1/p$ the expectation of the Coupon Collector's Problem is $E(T)=n times (1/1 + 1/2 + ... + 1/n)$ where $T=$ number of trials. If we plug in the deck of cards example but set $r=1$ we see that $E(T)=52 times (1/1 + 1/2 + ... + 1/52) approx 235.98$ trials.



          For cases of $r>1$ we're effectively still running the same Coupon Collector's Problem but conducting our samples in batches. Assuming $n=52$, the impact, roughly speaking, is that the expectation is reduced by $1/r$, or, $E(T) approx 52/r times (1/1 + 1/2 + ... + 1/52) approx 47.20$. The reason the result is approximate and not exact is the error introduced during the final trial. Imagine the unique (and highly unlikely) case that during the first 10 trials we selected only unique cards with $r=5$. This means we only need 2 more unique cards during trial 11 but there are several unique paths that can get us those 2 cards when we go to draw 5 times. This effectively boosts the probability of achieving our result during the final trial.



          I ran a simulation of the deck of cards example using the R language with results aligned to the expected values noted above using $r=1$ and $r=5$. The distribution has asymptotic tails (obviously truncated on the left) and is heavily right-skewed. Adjusting $r$ simply slides (shifts) the distribution left or right (holding the same shape) as expected.






          share|cite|improve this answer









          $endgroup$












            Your Answer





            StackExchange.ifUsing("editor", function ()
            return StackExchange.using("mathjaxEditing", function ()
            StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
            StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
            );
            );
            , "mathjax-editing");

            StackExchange.ready(function()
            var channelOptions =
            tags: "".split(" "),
            id: "69"
            ;
            initTagRenderer("".split(" "), "".split(" "), channelOptions);

            StackExchange.using("externalEditor", function()
            // Have to fire editor after snippets, if snippets enabled
            if (StackExchange.settings.snippets.snippetsEnabled)
            StackExchange.using("snippets", function()
            createEditor();
            );

            else
            createEditor();

            );

            function createEditor()
            StackExchange.prepareEditor(
            heartbeatType: 'answer',
            autoActivateHeartbeat: false,
            convertImagesToLinks: true,
            noModals: true,
            showLowRepImageUploadWarning: true,
            reputationToPostImages: 10,
            bindNavPrevention: true,
            postfix: "",
            imageUploader:
            brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
            contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
            allowUrls: true
            ,
            noCode: true, onDemand: true,
            discardSelector: ".discard-answer"
            ,immediatelyShowMarkdownHelp:true
            );



            );













            draft saved

            draft discarded


















            StackExchange.ready(
            function ()
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3148108%2fnumber-of-trials-to-draw-every-element-in-a-set%23new-answer', 'question_page');

            );

            Post as a guest















            Required, but never shown

























            1 Answer
            1






            active

            oldest

            votes








            1 Answer
            1






            active

            oldest

            votes









            active

            oldest

            votes






            active

            oldest

            votes









            1












            $begingroup$

            I'm making the assumption that each trial's sample of size $r$ is conducted with replacement and there is replacement between each trial, I believe that's the intention. Note that I'll use $n$ to mean the number of elements ($p$ will be used for probability).



            The special case where $r=1$ is known as the Coupon Collector's Problem. More details can be found here: https://en.wikipedia.org/wiki/Coupon_collector%27s_problem. In short, each individual trial is geometrically distributed but the whole process (number of trails conducted to select each unique item) is not geometrically distributed as the probability updates with each trial (sample). While the expectation of the geometric distribution is $E(X)=1/p$ the expectation of the Coupon Collector's Problem is $E(T)=n times (1/1 + 1/2 + ... + 1/n)$ where $T=$ number of trials. If we plug in the deck of cards example but set $r=1$ we see that $E(T)=52 times (1/1 + 1/2 + ... + 1/52) approx 235.98$ trials.



            For cases of $r>1$ we're effectively still running the same Coupon Collector's Problem but conducting our samples in batches. Assuming $n=52$, the impact, roughly speaking, is that the expectation is reduced by $1/r$, or, $E(T) approx 52/r times (1/1 + 1/2 + ... + 1/52) approx 47.20$. The reason the result is approximate and not exact is the error introduced during the final trial. Imagine the unique (and highly unlikely) case that during the first 10 trials we selected only unique cards with $r=5$. This means we only need 2 more unique cards during trial 11 but there are several unique paths that can get us those 2 cards when we go to draw 5 times. This effectively boosts the probability of achieving our result during the final trial.



            I ran a simulation of the deck of cards example using the R language with results aligned to the expected values noted above using $r=1$ and $r=5$. The distribution has asymptotic tails (obviously truncated on the left) and is heavily right-skewed. Adjusting $r$ simply slides (shifts) the distribution left or right (holding the same shape) as expected.






            share|cite|improve this answer









            $endgroup$

















              1












              $begingroup$

              I'm making the assumption that each trial's sample of size $r$ is conducted with replacement and there is replacement between each trial, I believe that's the intention. Note that I'll use $n$ to mean the number of elements ($p$ will be used for probability).



              The special case where $r=1$ is known as the Coupon Collector's Problem. More details can be found here: https://en.wikipedia.org/wiki/Coupon_collector%27s_problem. In short, each individual trial is geometrically distributed but the whole process (number of trails conducted to select each unique item) is not geometrically distributed as the probability updates with each trial (sample). While the expectation of the geometric distribution is $E(X)=1/p$ the expectation of the Coupon Collector's Problem is $E(T)=n times (1/1 + 1/2 + ... + 1/n)$ where $T=$ number of trials. If we plug in the deck of cards example but set $r=1$ we see that $E(T)=52 times (1/1 + 1/2 + ... + 1/52) approx 235.98$ trials.



              For cases of $r>1$ we're effectively still running the same Coupon Collector's Problem but conducting our samples in batches. Assuming $n=52$, the impact, roughly speaking, is that the expectation is reduced by $1/r$, or, $E(T) approx 52/r times (1/1 + 1/2 + ... + 1/52) approx 47.20$. The reason the result is approximate and not exact is the error introduced during the final trial. Imagine the unique (and highly unlikely) case that during the first 10 trials we selected only unique cards with $r=5$. This means we only need 2 more unique cards during trial 11 but there are several unique paths that can get us those 2 cards when we go to draw 5 times. This effectively boosts the probability of achieving our result during the final trial.



              I ran a simulation of the deck of cards example using the R language with results aligned to the expected values noted above using $r=1$ and $r=5$. The distribution has asymptotic tails (obviously truncated on the left) and is heavily right-skewed. Adjusting $r$ simply slides (shifts) the distribution left or right (holding the same shape) as expected.






              share|cite|improve this answer









              $endgroup$















                1












                1








                1





                $begingroup$

                I'm making the assumption that each trial's sample of size $r$ is conducted with replacement and there is replacement between each trial, I believe that's the intention. Note that I'll use $n$ to mean the number of elements ($p$ will be used for probability).



                The special case where $r=1$ is known as the Coupon Collector's Problem. More details can be found here: https://en.wikipedia.org/wiki/Coupon_collector%27s_problem. In short, each individual trial is geometrically distributed but the whole process (number of trails conducted to select each unique item) is not geometrically distributed as the probability updates with each trial (sample). While the expectation of the geometric distribution is $E(X)=1/p$ the expectation of the Coupon Collector's Problem is $E(T)=n times (1/1 + 1/2 + ... + 1/n)$ where $T=$ number of trials. If we plug in the deck of cards example but set $r=1$ we see that $E(T)=52 times (1/1 + 1/2 + ... + 1/52) approx 235.98$ trials.



                For cases of $r>1$ we're effectively still running the same Coupon Collector's Problem but conducting our samples in batches. Assuming $n=52$, the impact, roughly speaking, is that the expectation is reduced by $1/r$, or, $E(T) approx 52/r times (1/1 + 1/2 + ... + 1/52) approx 47.20$. The reason the result is approximate and not exact is the error introduced during the final trial. Imagine the unique (and highly unlikely) case that during the first 10 trials we selected only unique cards with $r=5$. This means we only need 2 more unique cards during trial 11 but there are several unique paths that can get us those 2 cards when we go to draw 5 times. This effectively boosts the probability of achieving our result during the final trial.



                I ran a simulation of the deck of cards example using the R language with results aligned to the expected values noted above using $r=1$ and $r=5$. The distribution has asymptotic tails (obviously truncated on the left) and is heavily right-skewed. Adjusting $r$ simply slides (shifts) the distribution left or right (holding the same shape) as expected.






                share|cite|improve this answer









                $endgroup$



                I'm making the assumption that each trial's sample of size $r$ is conducted with replacement and there is replacement between each trial, I believe that's the intention. Note that I'll use $n$ to mean the number of elements ($p$ will be used for probability).



                The special case where $r=1$ is known as the Coupon Collector's Problem. More details can be found here: https://en.wikipedia.org/wiki/Coupon_collector%27s_problem. In short, each individual trial is geometrically distributed but the whole process (number of trails conducted to select each unique item) is not geometrically distributed as the probability updates with each trial (sample). While the expectation of the geometric distribution is $E(X)=1/p$ the expectation of the Coupon Collector's Problem is $E(T)=n times (1/1 + 1/2 + ... + 1/n)$ where $T=$ number of trials. If we plug in the deck of cards example but set $r=1$ we see that $E(T)=52 times (1/1 + 1/2 + ... + 1/52) approx 235.98$ trials.



                For cases of $r>1$ we're effectively still running the same Coupon Collector's Problem but conducting our samples in batches. Assuming $n=52$, the impact, roughly speaking, is that the expectation is reduced by $1/r$, or, $E(T) approx 52/r times (1/1 + 1/2 + ... + 1/52) approx 47.20$. The reason the result is approximate and not exact is the error introduced during the final trial. Imagine the unique (and highly unlikely) case that during the first 10 trials we selected only unique cards with $r=5$. This means we only need 2 more unique cards during trial 11 but there are several unique paths that can get us those 2 cards when we go to draw 5 times. This effectively boosts the probability of achieving our result during the final trial.



                I ran a simulation of the deck of cards example using the R language with results aligned to the expected values noted above using $r=1$ and $r=5$. The distribution has asymptotic tails (obviously truncated on the left) and is heavily right-skewed. Adjusting $r$ simply slides (shifts) the distribution left or right (holding the same shape) as expected.







                share|cite|improve this answer












                share|cite|improve this answer



                share|cite|improve this answer










                answered Mar 14 at 18:59









                Ron AppenzellerRon Appenzeller

                566




                566



























                    draft saved

                    draft discarded
















































                    Thanks for contributing an answer to Mathematics Stack Exchange!


                    • Please be sure to answer the question. Provide details and share your research!

                    But avoid


                    • Asking for help, clarification, or responding to other answers.

                    • Making statements based on opinion; back them up with references or personal experience.

                    Use MathJax to format equations. MathJax reference.


                    To learn more, see our tips on writing great answers.




                    draft saved


                    draft discarded














                    StackExchange.ready(
                    function ()
                    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3148108%2fnumber-of-trials-to-draw-every-element-in-a-set%23new-answer', 'question_page');

                    );

                    Post as a guest















                    Required, but never shown





















































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown

































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown







                    Popular posts from this blog

                    Moe incest case Sentencing See also References Navigation menu"'Australian Josef Fritzl' fathered four children by daughter""Small town recoils in horror at 'Australian Fritzl' incest case""Victorian rape allegations echo Fritzl case - Just In (Australian Broadcasting Corporation)""Incest father jailed for 22 years""'Australian Fritzl' sentenced to 22 years in prison for abusing daughter for three decades""RSJ v The Queen"

                    Who is our nearest planetary neighbor, on average?Santa Claus flies to the South PoleSeven Spheres of Unequal Mass, a weighing problem with a twistDescribe a large integerFast Mental Calculation of $7.5^7$Math in Space (without the help of celebrities)Find the value of $bigstar$: Puzzle 8 - InequalityWho drinks beer while running anyway?A Crucial DeliveryRanking And AverageHow long will my money last at roulette?

                    Daza language Contents Vocabulary Phonology References External links Navigation menudaza1242Daza"Dazaga"eeee178086576