Quote:
Originally Posted by Third Eye
(Post 7797384)
My best guess, given the wording, is that we are working with a geometric distribution with p=.25 and trying to determine the minimum number of trials it will take for a first success that occurs more than 2.57 standard deviation away from the mean. The mean for a geometric distribution is 1/p, so 1/.25=4. The variance is given by (1-p)/p^2 or .75/.0625=12. Thus the sd is 12^.5 or approximately 3.464. 2.57 sds is approximately equal to 8.9. So, to be at least 2.57 sds away from the mean you need at least 4 + 9 rolls, or 13.
|
Very nice. This could well be an answer to the original problem that Fat Elvis is trying to solve and it's the sort of alternative answer that I had in mind when I mentioned negative binomial distributions (of which the geometric distribution is a special case). It's why it's important to know what that problem's wording really is, because the problem as translated to the topic thread header is a little bit weird and seems to be lacking some crucial details.
The last sentence of the problem as posed by Fat Elvis is
"How rolls would it take to say that a number did not appear for 2.57 standard deviations?"
Let me just make a couple of comments about where you have interpreted that sentence of the problem differently than I have.
You are interpreting the problem to mean that the problem concerns the situation where the die roller has a particular value for the die in mind (e.g. a value of 3 dots) and is concerned about the number of throws of the die that could occur before that number first comes up. This is a reasonable interpretation. It leads to the crucial decision that the relevant standard deviation is for a geometric distribution with parameter p=0.25 and not the standard deviation for a discrete uniform distribution on the integers from 1 to 4. This matters a lot of course because those two distributions have different standard deviations. Your calculation of the standard deviation for the distribution you picked is indeed correct. Now keep in mind that there are two different versions of geometrically distributed variables, one counts just the number of failures until the first success, the other counts the total number of throws, including the throw with the success, so it is always bigger by 1. You are using the latter version, the one that Wikipedia denotes with the letter X to distinguish it from the other version, which it labels with a Y:
http://en.wikipedia.org/wiki/Geometric_distribution
X & Y have different means, of course, because X is Y+1, but they have the same standard deviation. This is pertinent to Fat Elvis problem because you are assuming that you are also assuming that the target value is 2.57 sd away from the mean, not just simply 2.57 sd. Note that the problem as translated by Fat Elvis doesn't have the phrase "away from the mean" in it. Still, I think this could be that the original problem, the one that Fat Elvis translated to the topic threader could have included that phrase. If it didn't, then under your interpretation of the problem one would simply report 9 as the value, not 9 + 4 as the value, an easy fix.
Man, no wonder statistics is so confusing to students. The wording of the problem really matters a lot! ;)
|