Originally Posted by DanT
In answer to your questions, you have to bear in mind that there is an important distinction between the population mean and the population standard deviation and the sample mean and the same standard deviation. The parameters that describe the population distribution are fixed constants that may or may not be known to the investigator. You asked me to suppose that an event has a 0.25 probability of occurring. OK, well you are now telling me the relevant POPULATION distribution--a Bernoulli with a probability of success of 0.25. Once I know that, I can compute the population mean, which will be 0.25 and the population standard deviation, which will be the square root of the variance, which equals the square root of 0.25 * ( 1 - 0.25 ).
Now, in real life, we often don't know what the parameters are for the relevant population distribution. We simply have access to a sample of observations from that distribution. We can use the sample to produce estimates of the unknown population parameters. Two such estimates are the sample mean and the sample standard deviation. These estimators have sampling distributions associated with them and those distributions do indeed depend on the sample size. Sample means based on a sample size of, say, 10, will vary more from sample to sample than would sample means based on a sample size of, say, 1000. The Central Limit Theorem pertains to the sampling distribution of the sample mean. It says that if the sample is from a population distribution that has a fixed finite mean and a finite population standard deviation, then the sampling distribution for the sample means can be approximated by a normal distribution, as the sample size gets larger and larger. So, for example, the Bernoulli distrbution has finite population means and standard deviations, so the Central Limit Theorem would apply to the sampling distribution of sample means for samples taken from that distribution. The sampling distribution for sample means based on a sample size of 10 will look sorta like a bell curve, if the population mean for the Bernoulli distribution is somewhere between, say, 0.30 and 0.70, but if you use sample sizes of 1,000 or more, then the sampling distribution for the means will really look very much like a bell curve, except for population means close to the edges, very low probability or very high probability events.
|