Math 143 C/E, Spring 2001
IPS Reading Questions
Chapter 2, Section 6
A two-way table requires two (and only two) variables. Generally these variables are categorical. Nevertheless, there really is no limitation to the number of values these variables can take on, except that it must be a finite number.
A cell's column percent is found by dividing the count in that cell by the total in that column. Similar, a cell's row percent is found by dividing its count by the total inits row. One can just as easily form bar graphs for inidividual columns or rows as one can for marginal distributions, and it is done frequently enough to have a name for these types of distributions: conditional distributions.
It is that seemingly apparent relationships between
variables sometimes fall apart upon closer inspection.
Lurking variables, such as patients' conditions
from Example 2.32, once considered, may reveal a
different relationship altogether. As the authors
point out, the first table on p. 199 is an
aggregation of the data in the other two (those
other two taken together are considered to be one
three-way table - can you explain why?), and obscures
the role that patient condition plays.
An important point here: the data in the first table of Example 2.32 as well as the enhanced data of the two tables below it, though fictitious, would be the result of an observational study. Can you describe how it is that experiments make small the possibility of some unconsidered lurking variable having an important effect on results?