Minitab Instructions for Chapter 10 Technology Exercise
Seatbelt Safety (Question 2)
Start a new worksheet for this problem.
- Label columns as year, seatbelt, and fatality
- Gather the information for 1985 through 2003 from Figure 1 of the June
2005 Safety Belt Usage in Illinois. This information is from the Illinois Department of Transportation and is at http://www.dot.state.il.us/trafficsafety/seatbelt%20june%202005.pdf. The data is in a chart, so you'll have to read the percents from the top of
the bars. Data is available as late as 2005, but we're only able to get information through 2003 for the next part, so we'll stop in 2003.
- Gather the information for 1985 through 2002 from the Illinois
2002 Toll of Motor Vehicles Crashes page from the National Highway Traffic Safety Administration at http://www.nhtsa.dot.gov/STSI/State_Info.cfm?Year=2002&State=IL. There is a table toward the bottom of the page that is titled "Fatalities and Fatality Rate per 100 Million VMT". You want the Total Fatality Rate column. Be sure you use the "Fatality Rate" column and not the "Fatalities" column.
After gathering the data for 1985 through 2002, click on the 2003 at the top right of the page where it says Change Year. You are looking for the "Fatality Rate per 100M VMT" for Illinois in 2003. It's the second row in the first table. The data for 2004 is not available, so we're going to stop with 2003.
Making the Fitted Line Plot (part b)
- Choose Stat / Regression / Fitted Line Plot
- The response variable is fatality
- The predictor variable is seatbelt
- Click OK
- Copy the graph and paste it into Word
The Fitted Line plot also contains the regression equation and the value
of r2, the percent of the variation that can be explained by the regression model.
The output in the session window on Minitab gives much of the same information
including an ANOVA table that contains the F test statistic and the p-value
that can be used for checking correlation.
Checking for Significant Linear Correlation (parts d & e)
While the p-value can be found from the ANOVA table, it doesn't give the
value of r, the correlation coefficient.
- Choose Stat / Basic Statistics / Correlation
- The two variables are seatbelt and fatality (order doesn't matter)
- Click OK
The output gives you the correlation coefficient first and the p-value
second. The null hypothesis is that there is no significant linear correlation.
Give the Regression Equation (part h)
If you determined that there was significant linear correlation (positive
or negative) by rejecting the null hypothesis of no significant linear correlation,
then you should use the regression equation given by the computer. This was
found when you did the fitted line plot. Your equation should look something
like "fatality = 3.03814 - 0.0230872 seatbelt" (probably not that exactly).
- Choose Stat / Regression / Regression
- The response variable is fatality
- The predictor variable is seatbelt
- Click OK
- The regression equation is given towards the top of the output, you may need to scroll up.
If, however, you decided that there was no signficant linear correlation
because you retained the null hypothesis of the correlation test, then you
should use the mean of y (y-bar) for the estimated equation. Your equation
should be something like "fatality = ####" where #### is the numerical value of the mean of the fatality variable. You'll
have to do descriptive statistics to find out what that is.
Amtrak Delays (Question 3)
Gathering the Data
Amtrak keeps data available for the last five (5) days only. Since you need at least six days of information, you will need to collect information on more than one date. Do NOT wait until this is due to start it.
- Visit the Amtrak website at http://www.amtrak.com/
- The center portion of the screen is split into two parts, Fare Finder and Train Status. Go to the Train Status section.
- Leave the Departs box empty
- Put CHI in the Arrives box
- Put 300 in the optional Train No box.
- Click Next
- Record the delay in minutes for the indicated date. If the train is early, record the delay as negative. If the delay is given in hours and minutes, you need to convert it into minutes before recording.
- Change the date to a previous day and click Resubmit
- When you are done with the 300 train, change the train number to 22 and go through the cycle with the different dates.
- When you are done with the 22 train, change the train number to 324 and repeat the cycle of dates.
Entering the Information into Minitab
When entering information into Minitab, ignore any missing data. Do not put blank rows in the Minitab data.
- Label two columns in Minitab as train and delay.
- Enter either the train number or the train name in the first column as many times as you have data for that train. That is, if you have 7 days worth of information for the 300 train, then enter 300 or "State House" 7 times into the train column. Repeat this for the other two trains.
- Enter the delay information for each of the trains into the delay column. Match up the information so that the delay is in the same row as the proper train number.
Summarize the Delay Times (part c)
- Choose Stat / Basic Statistics / Display Descriptive Statistics
- Describe the variable delay
- By the variable train
- Click on Statistics and enable any other statistics you might need
- Click OK
- Record the values for the 300, 22, and 324 trains in the appropriate columns. Note that Minitab is going to put the trains in numerical order, not the order they appear in the table.
- Choose Stat / Basic Statistics / Display Descriptive Statistics
- Describe the variable delay
- Remove train from the by variables box
- Click OK
- Record the values in the combined column of the table.
ANOVA table (parts d, f, and g)
- Choose Stat / ANOVA / One-Way (the first option, not the unstacked option)
- The response variable is delay
- The factor is the train
- Click OK
- Copy the results into the ANOVA table. Notice that the order of the columns is different than what is on your worksheet.
The MS(Total) value does not appear in the ANOVA table output in Minitab as it is not technically part of the ANOVA. However, find it by dividing the SS(Total) by df(Total).
The test statistic and p-value are the F and p-value values from the table.
The Critical F value (part e)
The F value was found by dividing the MS(Factor) by MS(Error) where the Factor in this problem is the Train. The numerator df is the df(Factor) and the denominator df is the df(Error). You'll need those values to look up the Critical F value.
- Choose Calc / Probability Distributions / F
- Choose Inverse cumulative probability
- The numerator degrees of freedom are the df(Factor)
- The denominator degrees of freedom are the df(Error)
- The input constant is 0.95 (the area to the left for an α = 0.05 significance level)
- Click OK
- The critical value is the value Minitab calls X