The data sets used in class are included here in several formats. In the CSV format, the data file is a text file with one line for each individual in the sample. The values of each variable for each individual are separated by commas. Excel will save files in this format and files in this format can easily be used by R or Stata. Each dataset is also included as an Excel (.xls) spreadsheet. The TXT format is used to save data as a simple text-readable file. It is usually evident what the format of that file is.
Homework data set - a second random sample of thirty seniors just like the above. CSV Excel TXT
Reading comprehension scores of 66 students before and after training by one of three teaching methods. There are two pretest scores and three posttest scores. This dataset is from the Data and Story Library and is orginally due to Moore and McCabe. CSV Excel TXT
This data, used on the homework of February 3, is a study of 62,000 women who participated in a breast-cancer screening study. The data is here.
This data consists of the free-throw shooting percentage as of February 4 of every MIAA men's basketball player with 10 free throw attempts or more. The data is in CSV format.
The grades of 32 students in a section of Mathematics 222 (Spring 1992) on three tests and a final exam in CSV format.
The time in seconds between scores in the Calvin-Kalamazoo homecoming basketball game of February 7, 2003. This is a text file.
This data, from Dielman's Applied Regression Analysis, contains the graduation rates of all students entering the school in 1990 for 256 colleges, in CSV format.
This data is some of the questionaire data from the course evaluation forms filled out by Calvin students during Fall 2003. Each row is a questionaire filled out by one student. This includes all forms for all 3 or 4 semester hour classes. The columns are as follows:
Survey A number identifying the class Q1 Response to question 1 (Excellent 5, Very good 4, etc)
Q2 Response to question 2 Hours Number of hours spent on class Grade Grade expected (12 Pass, 13 Credit, 14 No credit) Credits Number of hours of credit
The data is in CSV format here.
Results of the Mathematics 143/243 class survey in CSV form.
Data on the probability distribution of mortality for US males in CSV format.
The world records in track (in seconds) for each metric distance in CSV format.
The batting and pitching statistics for the 14 American League teams during the 2003 season in CSV format.
A good site for baseball data in general is at www.baseball-reference.com
The batting statistics of all major league baseball teams for the five seasons 1994-98 in CSV format. A smaller version of the same dataset with just the data for certain events on a per game basis in CSV format.
2003 Baseball Statistics.
Data on every baseball game played in 2003 is here in CSV format.
2005 MIAA Men's Basketball.
Complete player statistics for all players in the MIAA for all games (not only conference games) in CSV format.
Raisins in a Sun-Maid Raisin Box.
Students in Mathematics 243 during Spring, 2005 counted the number of raisins in 14 gm boxes of Sun-Maid Raisins. The data is here in CSV format.
Fusion time in random dot stereograms.
From the data and story library comes the results of an experiment on the effect of prior information on the time to fuse random dot sterograms. The data is here in CSV format. See the Data and Story library for the story.
Darts versus experts in choosing stocks.
The change in portfolio value using two different strategies - consulting experts or throwing darts. The data is here in CSV format. The data comes from the Chance website.
2004 Baseball Season
Team statistics on the 2004 baseball season in CSV format.
2007 Baseball Season
Team statistics on the 2007 baseball season in CSV format.
Temperatures and Precipitation in Allegan Michigan
Daily maximum and minimum temperatures as well as precipitation data in Allegan, Michigan since 1948 (CSV format)