Review: Exam 2
Statistical studies
- Study types
- Controlled experiements, observational studies, sample surveys
- Advantages/disadvantages of one type of study over another
in various settings
- Evaluating studies
(Don't memorize, just be comfortable with these ideas)
- 7 critical
components
- Pitfalls to watch out
for in various types of studies
Familiarize yourself with types of bias for observational studies,
both those mentioned in text and those on this page
- Experimental design
- Identification of factors/levels and response variables
- Use of controls
- Placebos and their purpose
- Double-blind studies
- Randomization
- Importance of randomization when assigning experimental
units to treatment groups (Note: Without this
cause-and-effect relationships could not be established
be able to explain why)
- Use of a table of random digits
- Importance of replication of experiment on many units
- Block design
- Makes group assignment a little less random, but serves
a purpose (Know what this purpose is.)
- Similar to stratification in sample surveys
- Special case: matched-pairs design two
forms:
- Each unit receives all treatments (probably only
two) in a random order
- Experimental units come in pairs and are split
up randomly so that one receives one treatment,
the partner receives the other
- Sampling
- vs. taking a census
- Sampling methods (Which ones are valid? Be able to
identify sample type given a scenario.)
- Simple random sample (SRS) of size n
- All individuals/all groups of size n
are equally-likely to be chosen
- Reasons that an SRS is often impractical
- Identifying (from descriptions) sampling
methods that do and do not result in
an SRS
- Stratified random sample
- Systematic random sample (see problem 41, p. 264)
- Convenience sample (Ex.: A polster who wants to know America's
opinion interviews people who pass by her in a nearby mall parking
lot on a Tuesday morning)
- Voluntary response sample
- Multistage sample
-
Terms to know:
-
statistical inference, anecdotal evidence, population, design, sampling frame,
exploratory data analysis, experimental units/subjects, treatment, factors,
levels, confounding/lurking variables, blocks, strata, voluntary bias,
control/treatment groups, response, nonresponse, undercoverage, parameter,
statistic, sampling variability
Probability
- Concepts/terms
-
Experiment, outcome, sample space, event
- Unions (A or B), intersections
(A and B), complements (not
A) of events (and depictions of each using a
Venn diagram note that in this applet they write
AB when they mean A and B)
-
Independence, disjointness (=
mutual exclusivity) of events
- When outcomes are equally-likely
- Randomness
- short-term unpredictability, long-term predictability
-
Law of Large
Numbers (pp. 328-332)
- What does it guarantee?
- What information does it leave out?
- What is the mistake characterized as the law of small numbers
(or the gambler's fallacy)
- why it must be a part of sample selection (how later analysis
of data depends on it)
- Variability among samples sampling distributions
-
Rules of probability models
- 0 £ p
£ 1 ; p = 0
for null events, p = 1 for
certain events
- Sum of probabilities over all outcomes is 1
- Complementation rule: P(Ac) = 1 - P(A)
- P(A and B) = P(A) P(B) when A and B
are independent
- P(A or B) = P(A) + P(B) - P(A and B); this becomes
P(A or B) = P(A) + P(B) when A and B
are disjoint
- Assessing probabilities
- of continuous random variables
- Uniform distributions: probability = area of an appropriate
rectangle (See Example 4.17 and Figure 4.10, pp.
318-319)
-
Normal
distributions N(m,
s)
- Standardizing values (converting a value of X to a
standardized value Z) and the reverse process
- Using Table A to go back and forth between probabilities
and standardized scores
-
Interpreting a probability P(a < X < b) as area
under a normal curve
- Interpretation of the standard deviation s
- Distance from center to point where inflection
occurs
- The 68-95-99.7 rule (remember, these numbers are not
exact; you should be able to tell what they are
exactly using Table A)
- of discrete random variables
- Binomial distributions via Table C, formula (learn it),
or normal approximation (when appropriate)
- Uniform distributions
-
Terms to know:
-
probability, trials (of an experiment), expected
value mX (= mean) and
standard deviation sX
of a random variable, variance
Sampling Distributions of sample statistics
- What are they (in general)?
- Relationship to population distributions (in fact, the
sampling distribution for sample size n = 1
is the population distribution)
- Some specific sample statistics
- counts
- recognize situations in which the sampling distribution is
- binomial B(n, p)
- approximately binomial (close enough)
- approximately normal (already approximately binomial)
- expected value (mean) of the distribution is np
- variance of the distribution is np(1-p)
(Remember: s.d. is the sq. root of variance.)
- determining probabilities like P(X < c),
P(c < X < d), P(X > c)
- when count X is binomial (or approximately
so, but not approx. normal (Table C)
- when count is approximately normal (use normal
approximation)
- proportion = count/(sample size)
- recognize situations in which approximately normal
(Note: same as when counts are distributed
approximately normally)
- expected value (mean) of the distribution is p
- variance of the distribution is p(1-p)/n
- sample mean
- expected value (mean) of the distribution is m, same as population
- variance of the distribution is
s2/n
- distributed normally (for every value of n) if
population is
- central limit theorem
- What does it say?
- What implications does it have for sampling distributions
for means?
- How is it related to the normal approximation to binomial
distributions? (See p. 404)
-
Terms to know:
-
count, proportion, parameter, statistic, population, sample,
population distribution, sampling distribution
Confidence intervals for means
- Purpose (Why are they used? What do they tell you?
How should they be interpreted?)
- Construction: (estimator) ±
z* × (spread of sampling distribution for estimator)
- Effect on CIs when n (sample size) or C
(level of confidence) is changed
-
Terms to know:
-
margin of error, critical value, confidence level
Back to Math 143C Class Page
This page maintained by:
Thomas L. Scofield
Department of Mathematics and Statistics
Calvin College
Last Modified:
Monday, 26-Jul-2004 13:10:08 EDT