Number of trials to draw every element in a setProbability distribution for sampling an element $k$ times after $x$ independent random samplings with replacementExpected number of trials to see x unique values out of N total valuesCard game probability question?Probability for Magic TrickMontmort's card matching problem: Distribution of the number of matching cards?Probability of drawing no aces with replacement of cards?Suppose you draw a five-card hand randomly from the deck and get four cards that that would make a straight if you could replace the fifth card…Odds of nonconsecutive number drawGeometric distribution and lottery problemfrom 34 pick 5, repeat 11 times. Expected number of matches between the 11 trials?
Do we have to expect a queue for the shuttle from Watford Junction to Harry Potter Studio?
How to draw a matrix with arrows in limited space
What are some good ways to treat frozen vegetables such that they behave like fresh vegetables when stir frying them?
How to get directions in deep space?
What is Cash Advance APR?
How much theory knowledge is actually used while playing?
Which was the first story featuring espers?
awk assign to multiple variables at once
What is going on with gets(stdin) on the site coderbyte?
Make a Bowl of Alphabet Soup
Will number of steps recorded on FitBit/any fitness tracker add up distance in PokemonGo?
Does Doodling or Improvising on the Piano Have Any Benefits?
The IT department bottlenecks progress, how should I handle this?
Is there any evidence that Cleopatra and Caesarion considered fleeing to India to escape the Romans?
Is it ethical to recieve stipend after publishing enough papers?
15% tax on $7.5k earnings. Is that right?
Is there a RAID 0 Equivalent for RAM?
Is this toilet slogan correct usage of the English language?
"It doesn't matter" or "it won't matter"?
A Trivial Diagnosis
Why is so much work done on numerical verification of the Riemann Hypothesis?
Is it allowed to activate the ability of multiple planeswalkers in a single turn?
What (the heck) is a Super Worm Equinox Moon?
Why Shazam when there is already Superman?
Number of trials to draw every element in a set
Probability distribution for sampling an element $k$ times after $x$ independent random samplings with replacementExpected number of trials to see x unique values out of N total valuesCard game probability question?Probability for Magic TrickMontmort's card matching problem: Distribution of the number of matching cards?Probability of drawing no aces with replacement of cards?Suppose you draw a five-card hand randomly from the deck and get four cards that that would make a straight if you could replace the fifth card…Odds of nonconsecutive number drawGeometric distribution and lottery problemfrom 34 pick 5, repeat 11 times. Expected number of matches between the 11 trials?
$begingroup$
Problem:
Suppose you have a set of p elements and during each trial, you randomly and uniformly select r elements from that set with replacement. What is the probability distribution for the number of trials it would take for every element of that set to have been selected at least once?
Example:
You have a standard deck of 52 cards, and during each trial you randomly select 5 cards from the deck. What is the probability distribution for the number of trials it would take for you to have held every card in the deck?
Notes:
I've worked out that the probability of drawing i new elements when you've already seen $p-t$ elements is
$$dfrac binomp-tr-i binom t i binompr$$
But I'm not sure how this could be used to get the distribution for the number of trials it would take for there to be no new elements. I suspect that this probability distribution is geometric.
probability probability-distributions
$endgroup$
add a comment |
$begingroup$
Problem:
Suppose you have a set of p elements and during each trial, you randomly and uniformly select r elements from that set with replacement. What is the probability distribution for the number of trials it would take for every element of that set to have been selected at least once?
Example:
You have a standard deck of 52 cards, and during each trial you randomly select 5 cards from the deck. What is the probability distribution for the number of trials it would take for you to have held every card in the deck?
Notes:
I've worked out that the probability of drawing i new elements when you've already seen $p-t$ elements is
$$dfrac binomp-tr-i binom t i binompr$$
But I'm not sure how this could be used to get the distribution for the number of trials it would take for there to be no new elements. I suspect that this probability distribution is geometric.
probability probability-distributions
$endgroup$
add a comment |
$begingroup$
Problem:
Suppose you have a set of p elements and during each trial, you randomly and uniformly select r elements from that set with replacement. What is the probability distribution for the number of trials it would take for every element of that set to have been selected at least once?
Example:
You have a standard deck of 52 cards, and during each trial you randomly select 5 cards from the deck. What is the probability distribution for the number of trials it would take for you to have held every card in the deck?
Notes:
I've worked out that the probability of drawing i new elements when you've already seen $p-t$ elements is
$$dfrac binomp-tr-i binom t i binompr$$
But I'm not sure how this could be used to get the distribution for the number of trials it would take for there to be no new elements. I suspect that this probability distribution is geometric.
probability probability-distributions
$endgroup$
Problem:
Suppose you have a set of p elements and during each trial, you randomly and uniformly select r elements from that set with replacement. What is the probability distribution for the number of trials it would take for every element of that set to have been selected at least once?
Example:
You have a standard deck of 52 cards, and during each trial you randomly select 5 cards from the deck. What is the probability distribution for the number of trials it would take for you to have held every card in the deck?
Notes:
I've worked out that the probability of drawing i new elements when you've already seen $p-t$ elements is
$$dfrac binomp-tr-i binom t i binompr$$
But I'm not sure how this could be used to get the distribution for the number of trials it would take for there to be no new elements. I suspect that this probability distribution is geometric.
probability probability-distributions
probability probability-distributions
edited Mar 14 at 15:04
abaresk
asked Mar 14 at 14:58
abareskabaresk
183
183
add a comment |
add a comment |
1 Answer
1
active
oldest
votes
$begingroup$
I'm making the assumption that each trial's sample of size $r$ is conducted with replacement and there is replacement between each trial, I believe that's the intention. Note that I'll use $n$ to mean the number of elements ($p$ will be used for probability).
The special case where $r=1$ is known as the Coupon Collector's Problem. More details can be found here: https://en.wikipedia.org/wiki/Coupon_collector%27s_problem. In short, each individual trial is geometrically distributed but the whole process (number of trails conducted to select each unique item) is not geometrically distributed as the probability updates with each trial (sample). While the expectation of the geometric distribution is $E(X)=1/p$ the expectation of the Coupon Collector's Problem is $E(T)=n times (1/1 + 1/2 + ... + 1/n)$ where $T=$ number of trials. If we plug in the deck of cards example but set $r=1$ we see that $E(T)=52 times (1/1 + 1/2 + ... + 1/52) approx 235.98$ trials.
For cases of $r>1$ we're effectively still running the same Coupon Collector's Problem but conducting our samples in batches. Assuming $n=52$, the impact, roughly speaking, is that the expectation is reduced by $1/r$, or, $E(T) approx 52/r times (1/1 + 1/2 + ... + 1/52) approx 47.20$. The reason the result is approximate and not exact is the error introduced during the final trial. Imagine the unique (and highly unlikely) case that during the first 10 trials we selected only unique cards with $r=5$. This means we only need 2 more unique cards during trial 11 but there are several unique paths that can get us those 2 cards when we go to draw 5 times. This effectively boosts the probability of achieving our result during the final trial.
I ran a simulation of the deck of cards example using the R language with results aligned to the expected values noted above using $r=1$ and $r=5$. The distribution has asymptotic tails (obviously truncated on the left) and is heavily right-skewed. Adjusting $r$ simply slides (shifts) the distribution left or right (holding the same shape) as expected.
$endgroup$
add a comment |
Your Answer
StackExchange.ifUsing("editor", function ()
return StackExchange.using("mathjaxEditing", function ()
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
);
);
, "mathjax-editing");
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "69"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
noCode: true, onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3148108%2fnumber-of-trials-to-draw-every-element-in-a-set%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
$begingroup$
I'm making the assumption that each trial's sample of size $r$ is conducted with replacement and there is replacement between each trial, I believe that's the intention. Note that I'll use $n$ to mean the number of elements ($p$ will be used for probability).
The special case where $r=1$ is known as the Coupon Collector's Problem. More details can be found here: https://en.wikipedia.org/wiki/Coupon_collector%27s_problem. In short, each individual trial is geometrically distributed but the whole process (number of trails conducted to select each unique item) is not geometrically distributed as the probability updates with each trial (sample). While the expectation of the geometric distribution is $E(X)=1/p$ the expectation of the Coupon Collector's Problem is $E(T)=n times (1/1 + 1/2 + ... + 1/n)$ where $T=$ number of trials. If we plug in the deck of cards example but set $r=1$ we see that $E(T)=52 times (1/1 + 1/2 + ... + 1/52) approx 235.98$ trials.
For cases of $r>1$ we're effectively still running the same Coupon Collector's Problem but conducting our samples in batches. Assuming $n=52$, the impact, roughly speaking, is that the expectation is reduced by $1/r$, or, $E(T) approx 52/r times (1/1 + 1/2 + ... + 1/52) approx 47.20$. The reason the result is approximate and not exact is the error introduced during the final trial. Imagine the unique (and highly unlikely) case that during the first 10 trials we selected only unique cards with $r=5$. This means we only need 2 more unique cards during trial 11 but there are several unique paths that can get us those 2 cards when we go to draw 5 times. This effectively boosts the probability of achieving our result during the final trial.
I ran a simulation of the deck of cards example using the R language with results aligned to the expected values noted above using $r=1$ and $r=5$. The distribution has asymptotic tails (obviously truncated on the left) and is heavily right-skewed. Adjusting $r$ simply slides (shifts) the distribution left or right (holding the same shape) as expected.
$endgroup$
add a comment |
$begingroup$
I'm making the assumption that each trial's sample of size $r$ is conducted with replacement and there is replacement between each trial, I believe that's the intention. Note that I'll use $n$ to mean the number of elements ($p$ will be used for probability).
The special case where $r=1$ is known as the Coupon Collector's Problem. More details can be found here: https://en.wikipedia.org/wiki/Coupon_collector%27s_problem. In short, each individual trial is geometrically distributed but the whole process (number of trails conducted to select each unique item) is not geometrically distributed as the probability updates with each trial (sample). While the expectation of the geometric distribution is $E(X)=1/p$ the expectation of the Coupon Collector's Problem is $E(T)=n times (1/1 + 1/2 + ... + 1/n)$ where $T=$ number of trials. If we plug in the deck of cards example but set $r=1$ we see that $E(T)=52 times (1/1 + 1/2 + ... + 1/52) approx 235.98$ trials.
For cases of $r>1$ we're effectively still running the same Coupon Collector's Problem but conducting our samples in batches. Assuming $n=52$, the impact, roughly speaking, is that the expectation is reduced by $1/r$, or, $E(T) approx 52/r times (1/1 + 1/2 + ... + 1/52) approx 47.20$. The reason the result is approximate and not exact is the error introduced during the final trial. Imagine the unique (and highly unlikely) case that during the first 10 trials we selected only unique cards with $r=5$. This means we only need 2 more unique cards during trial 11 but there are several unique paths that can get us those 2 cards when we go to draw 5 times. This effectively boosts the probability of achieving our result during the final trial.
I ran a simulation of the deck of cards example using the R language with results aligned to the expected values noted above using $r=1$ and $r=5$. The distribution has asymptotic tails (obviously truncated on the left) and is heavily right-skewed. Adjusting $r$ simply slides (shifts) the distribution left or right (holding the same shape) as expected.
$endgroup$
add a comment |
$begingroup$
I'm making the assumption that each trial's sample of size $r$ is conducted with replacement and there is replacement between each trial, I believe that's the intention. Note that I'll use $n$ to mean the number of elements ($p$ will be used for probability).
The special case where $r=1$ is known as the Coupon Collector's Problem. More details can be found here: https://en.wikipedia.org/wiki/Coupon_collector%27s_problem. In short, each individual trial is geometrically distributed but the whole process (number of trails conducted to select each unique item) is not geometrically distributed as the probability updates with each trial (sample). While the expectation of the geometric distribution is $E(X)=1/p$ the expectation of the Coupon Collector's Problem is $E(T)=n times (1/1 + 1/2 + ... + 1/n)$ where $T=$ number of trials. If we plug in the deck of cards example but set $r=1$ we see that $E(T)=52 times (1/1 + 1/2 + ... + 1/52) approx 235.98$ trials.
For cases of $r>1$ we're effectively still running the same Coupon Collector's Problem but conducting our samples in batches. Assuming $n=52$, the impact, roughly speaking, is that the expectation is reduced by $1/r$, or, $E(T) approx 52/r times (1/1 + 1/2 + ... + 1/52) approx 47.20$. The reason the result is approximate and not exact is the error introduced during the final trial. Imagine the unique (and highly unlikely) case that during the first 10 trials we selected only unique cards with $r=5$. This means we only need 2 more unique cards during trial 11 but there are several unique paths that can get us those 2 cards when we go to draw 5 times. This effectively boosts the probability of achieving our result during the final trial.
I ran a simulation of the deck of cards example using the R language with results aligned to the expected values noted above using $r=1$ and $r=5$. The distribution has asymptotic tails (obviously truncated on the left) and is heavily right-skewed. Adjusting $r$ simply slides (shifts) the distribution left or right (holding the same shape) as expected.
$endgroup$
I'm making the assumption that each trial's sample of size $r$ is conducted with replacement and there is replacement between each trial, I believe that's the intention. Note that I'll use $n$ to mean the number of elements ($p$ will be used for probability).
The special case where $r=1$ is known as the Coupon Collector's Problem. More details can be found here: https://en.wikipedia.org/wiki/Coupon_collector%27s_problem. In short, each individual trial is geometrically distributed but the whole process (number of trails conducted to select each unique item) is not geometrically distributed as the probability updates with each trial (sample). While the expectation of the geometric distribution is $E(X)=1/p$ the expectation of the Coupon Collector's Problem is $E(T)=n times (1/1 + 1/2 + ... + 1/n)$ where $T=$ number of trials. If we plug in the deck of cards example but set $r=1$ we see that $E(T)=52 times (1/1 + 1/2 + ... + 1/52) approx 235.98$ trials.
For cases of $r>1$ we're effectively still running the same Coupon Collector's Problem but conducting our samples in batches. Assuming $n=52$, the impact, roughly speaking, is that the expectation is reduced by $1/r$, or, $E(T) approx 52/r times (1/1 + 1/2 + ... + 1/52) approx 47.20$. The reason the result is approximate and not exact is the error introduced during the final trial. Imagine the unique (and highly unlikely) case that during the first 10 trials we selected only unique cards with $r=5$. This means we only need 2 more unique cards during trial 11 but there are several unique paths that can get us those 2 cards when we go to draw 5 times. This effectively boosts the probability of achieving our result during the final trial.
I ran a simulation of the deck of cards example using the R language with results aligned to the expected values noted above using $r=1$ and $r=5$. The distribution has asymptotic tails (obviously truncated on the left) and is heavily right-skewed. Adjusting $r$ simply slides (shifts) the distribution left or right (holding the same shape) as expected.
answered Mar 14 at 18:59
Ron AppenzellerRon Appenzeller
566
566
add a comment |
add a comment |
Thanks for contributing an answer to Mathematics Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
Use MathJax to format equations. MathJax reference.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3148108%2fnumber-of-trials-to-draw-every-element-in-a-set%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown