- Make sure mathematical assumptions made are reasonable.
- Get a good sample.
- Compute statistic from sample.
- Determine the quality of that statistic as an estimate for the
parameter.
- confidence level
- confidence interval, margin of error

We estimate the parameter to be between ______ and ______,

and this interval will contain the true value of the parameter

approximately _______% of the times we use this method.

- For this method to be valid, certain assumptions must be met:
- The parameter must exist and be fixed for the population of interest.
- The sample must be good enough. (The theory is based on a simple random sample, but if another sampling method is used that provides a good approximation to a simple random sample, then the results will still be reliable.)
- The sample must be large enough. The larger the sample, the more precise the results. In any case, there should be at least 5 individuals in the sample with each value (for a proportion, at least 5 with the trait and at least 5 without).
- The population must be large enough. The population should be at least 10 times the size of the sample.
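The two numerical rules of thumb above can be checked mechanically. The following is a minimal sketch, not part of the original notes: the function name `check_conditions` and the example counts are hypothetical, and "at least 5 individuals with each value" is interpreted here as at least 5 with the trait and at least 5 without. The other assumptions (a fixed parameter, a good sample) cannot be verified by arithmetic.

```python
def check_conditions(n_with_trait, n_sample, population_size):
    """Check the two rule-of-thumb size conditions for a sample proportion.

    Assumption: "each value" means the two values of a yes/no trait.
    """
    n_without = n_sample - n_with_trait
    return {
        # At least 5 individuals with the trait and at least 5 without.
        "at_least_5_each": n_with_trait >= 5 and n_without >= 5,
        # Population at least 10 times the size of the sample.
        "population_10x_sample": population_size >= 10 * n_sample,
    }


# Hypothetical example: 40 of 100 sampled individuals have the trait,
# drawn from a population of 5000.
checks = check_conditions(n_with_trait=40, n_sample=100, population_size=5000)
print(checks)
```

If either entry is `False`, the normal-based method described here should be used with caution.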

- Compute p_hat (the proportion of the sample with the trait of interest).
**p_hat** = (number with trait) / (total number in sample)
- The **sampling distribution** of p_hat **is approximately normal** with a **mean equal to p** (the true population proportion) and a **standard deviation equal to the square root of (p(1-p)/n)**, where *n* is the sample size. If we don't know p (and usually we don't; that is what we are trying to estimate, after all), we have two choices:

- (a) Use **square root of ((.5)(.5)/n)**. This will always be at least as large as the true value. But if p is fairly small (or fairly large), this will cause us to give a less precise result than would be possible with method (b).

- (b) Use **square root of (p_hat(1-p_hat)/n)**. With reasonable sample sizes, p and p_hat will be close, so this will be a good estimate of the actual standard deviation.

- We can fill in the blanks by using our knowledge of normal
distributions.
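As an illustration of filling in the blanks, here is a minimal sketch that computes p_hat, estimates the standard deviation by either of the two choices described above, and builds the interval from the normal distribution. The function name `proportion_interval`, its parameters, and the example counts (40 of 100) are assumptions made for illustration; they do not come from the original notes.

```python
from math import sqrt
from statistics import NormalDist


def proportion_interval(n_with_trait, n_sample, confidence=0.95, conservative=False):
    """Return (low, high) for a normal-based interval around p_hat."""
    p_hat = n_with_trait / n_sample
    # Choice (a): use .5 in place of p; choice (b): use p_hat in place of p.
    p_for_sd = 0.5 if conservative else p_hat
    sd = sqrt(p_for_sd * (1 - p_for_sd) / n_sample)
    # Number of standard deviations that captures the middle
    # `confidence` fraction of a normal distribution (about 1.96 for 95%).
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    margin_of_error = z * sd
    return p_hat - margin_of_error, p_hat + margin_of_error


# Hypothetical sample: 40 of 100 individuals have the trait.
low, high = proportion_interval(40, 100)
wide_low, wide_high = proportion_interval(40, 100, conservative=True)
print(f"Using p_hat: between {low:.3f} and {high:.3f}")
print(f"Using .5:    between {wide_low:.3f} and {wide_high:.3f}")
```

With these numbers the first interval runs from roughly 0.304 to 0.496, and the interval based on .5 is slightly wider, as expected.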

## Hypothesis Testing

- Make sure mathematical assumptions made are reasonable.
- Determine the null hypothesis and the alternative hypothesis.
- Collect data.
- Compute test statistic.
The test statistic measures how well the data seem to support the null hypothesis. For 2-way tables, we use the Chi-squared statistic. For hypotheses about proportions, we use the Z statistic.

- Determine the likelihood of such an extreme test
statistic if the null hypothesis is true (the p-value).
This is done by using a table, a computer, or simulation to approximate the p-value from the value of the test statistic.

- Make a decision.
The decision depends on both the p-value and the level of confidence required. If the p-value is sufficiently small, then we say we "reject the null hypothesis" because if it were true, our data would be very "unusual".

Note that statistical evidence is never absolute proof, but it does provide a measure of its own level of certainty (the p-value or confidence level).
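The steps above can be sketched for a hypothesis about a proportion using the Z statistic. This is an illustrative sketch under stated assumptions: the function name `proportion_z_test`, the two-sided alternative, the example data (40 of 100), the null value 0.5, and the cutoff 0.05 are all hypothetical choices, not values from the original notes.

```python
from math import sqrt
from statistics import NormalDist


def proportion_z_test(n_with_trait, n_sample, p_null):
    """Return (z, p_value) for a two-sided test of H0: p = p_null."""
    p_hat = n_with_trait / n_sample
    # Standard deviation of p_hat if the null hypothesis is true.
    sd_null = sqrt(p_null * (1 - p_null) / n_sample)
    # Test statistic: how many standard deviations p_hat lies from p_null.
    z = (p_hat - p_null) / sd_null
    # Probability of a statistic at least this extreme, in either direction.
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value


# Hypothetical data: 40 of 100 have the trait; null hypothesis p = 0.5.
z, p_value = proportion_z_test(40, 100, p_null=0.5)
alpha = 0.05  # hypothetical cutoff for "sufficiently small"
decision = "reject the null hypothesis" if p_value < alpha else "do not reject the null hypothesis"
print(f"z = {z:.2f}, p-value = {p_value:.4f}: {decision}")
```

Here z is about -2 and the p-value is about 0.0455, so at this cutoff the null hypothesis would be rejected. A stricter cutoff such as 0.01 would lead to the opposite decision, which is why the decision depends on both the p-value and the level of confidence required.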

This page maintained by:

Randall Pruim

Department of Mathematics and Statistics

Calvin College

rpruim@calvin.edu

Last Modified: Thursday, 30-Aug-2001 12:17:13 EDT
