- Know the assumptions / properties of Pearson's Linear Correlation Coefficient.
- What values is it between?
- What does a value of zero mean / not mean?
- What happens if you change the scale of either variable?
- What happens if you switch the variables?
- What type of relationship does it measure?
- What type of distribution does it have?
- How many degrees of freedom does it have?
- What type of distribution do the ordered pairs (x,y) have?
- Know the assumptions / properties of the contingency tables.
- What is the null hypothesis?
- How are the sample data selected?
- What requirement must be met?
- What type of data is used?
- What type of distribution does it have?
- How many degrees of freedom does it have?
- What type of test is it?
- Know the assumptions / properties of multinomial experiments.
- What is the null hypothesis?
- What requirement must be met?
- What is the sample data?
- What distribution does it have?
- How many degrees of freedom does it have?
- What type of test is it?
- Know the guidelines for using the regression equation.
- Know the guidelines for using a regression equation from page 501.
- What is the equation that should be used if there is no significant linear correlation (pg 500)?
- Know the properties of multiple regression.
- When does the largest value of R-square occur?
- When does the largest value of the adjusted R-square occur?
- How is the Analysis of Variance used to test the regression equation?
- How does correlation between independent variables affect the choice of variables?
- What tools can be used to perform multiple regression.
- What are the degrees of freedom?
- Contingency Table. Use Statdisk. Write the null hypotheses. Find one (not all of them) expected frequency. Identify the degrees of freedom. Find the p-value. Write the decision and conclusion.
- Know what happens to the linear correlation coefficient when the data is manipulated. Three parts.
- Know what happens to the test statistic of a multinomial experiment when the data is manipulated. Two parts.
- Know what happens to the test statistic of a contingency table when the data is manipulated. Two parts.
- Work a chi-square goodness of fit problem using Statdisk. The observed frequencies are given to you.
- Work a linear regression problem using Statdisk. You will need to load data files and copy / paste data. Use the regression equation to estimate the dependent variable for a specific value of the independent variable.
- The value of the correlation coefficient, r, and the total variation are given. Find the coefficient of determination, explained variation, and unexplained variation.
- Software was used to test for normality. The p-value is given. Write the conclusion.
- Use software to create a graphical correlation matrix and determine which variables seem related to each other. Perform multiple regression and record the coefficients and p-values for each independent variable. Also record the r-square and adjusted r-square value. Determine which variable is least significant by looking at the p-values and re-perform the multiple regression without that variable. Did the adjusted r-square increase or decrease by eliminating the variable?
- Use software to perform linear regression. Write the null hypothesis. Find the value of the correlation coefficient and the p-value. What is the decision? What is the conclusion? Should the regression equation be used? Write the equation that should be used (either the regression equation if significant or the mean of the dependent variable if not). Look at the normal probability plots and determine if the data appears normally distributed.

- It is possible to get more than 100 points on the exam.

# | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | Total |

Pts | 7 | 6 | 6 | 5 | 6 | 12 | 6 | 4 | 4 | 8 | 10 | 6 | 2 | 10 | 14 | 106 |