Your task is to perform some real-world inferential statistics. You will take a claim that someone has made, form a hypothesis from that, collect the data necessary to test the hypothesis, perform a hypothesis test, and interpret the results. If you use pre-existing data, rather than collecting it yourself, then you will need to do more analysis to get the full points.
You should try to come up with something of interest to you instead of some contrived situation. Several groups have tested to see if their company met their sales goals. One person tested to see if 60% of patients show up for their doctor's appointment in the clinic where she worked. One waitress tested to see if the average tip was 15% and another tested to see if gender plays a role in the amount of the tip.
You may work in groups of up to three (3) persons. Pick people you can work with; part of your grade will be assigned by the people in your group, based on the work you contributed. Do not necessarily pick your friends; pick people who will do a good job.
You need to submit a proposal defining what you wish to test and how you wish to go about testing it (think back to the types of sampling). The instructor will peruse these proposals, make suggestions, and give them back to you. If your group can't decide on a project or needs help defining it, see the instructor.
Be sure to define clearly what the population being studied is (Richland Students, People listed in the Decatur phone book, People driving a car, etc).
Some of your projects will require information from chapters in the book not yet covered. See the instructor if you have trouble identifying these areas. If you read the section(s) and don't understand the material, see the instructor for an explanation. Don't wait for the class to cover the material; it may be too late.
Make sure you get the project cleared with the instructor before you go collect the data. One person wanted to conduct a telephone survey and would have run up a phone bill in the range of $100 if she had called everyone she said she was going to. The project should not cost you very much money to implement. It will take some time, however, and you should not wait until it's due to get started on it.
While you are not precluded from doing any of the given examples, it is certainly better if you can come up with something original that interests you. Topics like Pepsi vs. Coke, while acceptable, are really poor choices.
The instructor will keep a copy of your final project.
The project will consist of several parts, due at different times during the remainder of the semester.
The proposal is to make sure you're on the correct track before wasting lots of time collecting useless information. Your proposal should also include a timeline of when you will have the different components of your project completed. Include when you plan to have your data collected, when you'll run the analysis, when you'll have the rough draft completed, and when you'll have the final draft completed. Your proposal should include your population, parameter of interest, and sampling frame as well as the who, what, where, when, why, and how for your project. The proposal must be typed and printed.
An excellent proposal (5 points) will have the following components.
The main purpose of the status report is to remind you that there is a project and that you need to be working towards that because it's coming due. What I would like to see in the report is what you've accomplished towards your project so far, even if you haven't accomplished anything. Remember that in your proposal, you put down dates for completing certain tasks. One way to do the status report is to keep a journal of your project. This journal would log the activities of the group, when you worked, who participated, who was assigned what tasks, what was accomplished, etc. The status report must be typed but may be emailed.
An excellent status report (5 points) will have the following components.
This is a rough draft of the final report so the instructor can suggest corrections. The rough draft is the complete report except that you get a chance to be corrected before the final grade is assigned. Include everything you plan on including in the final report. This includes any graphs, tables, and text. I strongly urge you to make an appointment with the Student Learning Center to have someone proof your draft. The rough draft must be typed and printed. The narrative portions should be double spaced.
The grade here will be based on having the components of the final report present, not on their statistical correctness. This is your chance to make mistakes before it really affects your grade.
The final report will include a description of the problem, and why you think it is important, or what you hope to gain from testing the hypothesis. It should also include the context of the data, all data collected, and the values generated by Minitab or the calculator. A decision and conclusion should be stated. An analysis should follow with what the conclusion means in terms of the original problem. The final report should be in narrative format, must be typed, printed, and should be double spaced.
An excellent final report (60 points) will have the following components.
Give a classroom presentation of 3-5 minutes on why you picked the project you did and what your results were. There is a projector with PowerPoint on it if you would like to make a slideshow. You can also make transparencies or write on the board if needed. The class and/or instructor may ask questions about why you did something the way you did. These points will be assigned by the other class members, not by the instructor. You will be assigning point totals to the group as a whole, not to each individual member of the group. The grade you receive will be the average of the grades the class gives you. If you are not here for your presentation or the presentations of any of the other groups, you will receive a zero for this portion of the project.
Each group presentation will be rated as excellent, average, or poor in the areas of teamwork, effort in preparation of presentation, clarity of presentation, knowledge of project, and correct statistical usage.
The individual evaluation is the only part of the project that is not a group grade. Your score will be a combination of the scores given to you by each member of the group and the instructor's evaluation of your evaluation.
Turn in a summary paragraph of what each person in the group (including yourself) did and how many points out of ten you would give them for their effort. Be sure to put your name, section number, and what the assignment is at the top of the sheet. These evaluations should be typed up individually and emailed to the instructor. The evaluations may be sent in the body of an email; they do not have to be a separate attachment. The other students in the group will not see what you wrote about them, just the average score they got from all of the students.
You need to evaluate everyone in the group including yourself. If you're the only person in the group and did all of the work, you still need to evaluate yourself or you'll miss out on the participation grade.
When the instructor grades your evaluation, he is looking for things like the quantity and quality of material written about each person, whether the evaluation was submitted on time, whether the instructions were followed, etc.
Some things are easier to test than other things. The purpose of this project is not to do a full-scale PhD level research project; it is to expose you to the process of hypothesis testing in a real-world application. You may test means, proportions, or linear correlation. It is also possible (but not covered in your textbook) to test a standard deviation. You may have one or more samples. You may categorize your variables in one or two ways.
If you are dealing with one sample, then you will need some numerical value to test against. The claim "more people prefer Pepsi than Coke" becomes a claim that the proportion of Pepsi drinkers is greater than 0.5. There are not two independent samples (Pepsi drinkers / Coke drinkers), just one sample categorized in two ways. A problem with the Pepsi / Coke comparison is that it omits other soft drinks, because including them is more difficult to do. With more than two drink categories, a chi-square goodness of fit test would be more appropriate. Realize that some of the topics in the book are really lame and you should, if at all possible, come up with a claim that you have heard or that interests you rather than one out of the book.
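As a rough illustration of the one-sample case, the Pepsi claim can be tested with a right-tailed test of a single proportion. The sketch below assumes the usual large-sample normal approximation, which may differ slightly from what Minitab or your calculator reports. The counts (58 Pepsi drinkers out of 100 people) and the function name are made up for the example.

```python
from math import sqrt
from statistics import NormalDist

def one_proportion_z_test(successes, n, p0):
    """Right-tailed z test of H0: p = p0 against H1: p > p0."""
    p_hat = successes / n                 # sample proportion
    se = sqrt(p0 * (1 - p0) / n)          # standard error under H0
    z = (p_hat - p0) / se                 # test statistic
    p_value = 1 - NormalDist().cdf(z)     # area to the right of z
    return z, p_value

# Hypothetical data: 58 of 100 people surveyed prefer Pepsi to Coke.
z, p_value = one_proportion_z_test(successes=58, n=100, p0=0.5)
print(f"z = {z:.2f}, p-value = {p_value:.4f}")
```

A p-value below your significance level would lead you to reject the null hypothesis that the proportion is 0.5.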
If your data consists solely of categories and not measured quantities, then you should be looking at proportions or counts. Section 7.3 tells you how to conduct a test about a single proportion, section 8.2 tells about testing two proportions, and sections 10.2 and 10.3 talk about dealing with 3 or more categories and tests for independence (when there are two ways of breaking down the results).
Things to look for that let you know you're dealing with categorical data or proportions include: proportions, percents, counts, frequencies, fractions, or ratios.
This list is a guideline, but counts can also be used as quantitative data. You really need to think about the response that was recorded for each case (a row in Minitab terms). Did you record a yes/no response for each case or did you record a number that means something? If it was a yes/no or other categorical data, then this is the place to be.
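When the categorical response has three or more categories (for instance Pepsi, Coke, and "other"), the goodness of fit statistic compares observed counts to the counts expected under the claim. The following is a minimal sketch with invented counts and a hypothetical function name. It assumes the claim is that all three categories are equally popular. The closed-form p-value used here is valid only for 2 degrees of freedom; for other cases, use Minitab, the calculator, or a table.

```python
from math import exp

def chi_square_gof(observed, expected_props):
    """Return the chi-square goodness of fit statistic and its degrees of freedom."""
    n = sum(observed)
    expected = [n * p for p in expected_props]   # expected counts under H0
    stat = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    df = len(observed) - 1
    return stat, df

# Hypothetical counts: Pepsi, Coke, other.
observed = [45, 35, 40]
stat, df = chi_square_gof(observed, [1 / 3, 1 / 3, 1 / 3])

# For exactly 2 degrees of freedom, the right-tail area is exp(-stat / 2).
p_value = exp(-stat / 2)
print(f"chi-square = {stat:.2f}, df = {df}, p-value = {p_value:.4f}")
```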
If your data consists of measured quantities, then you will probably be testing a mean or perhaps correlation between two variables. It is possible to test a claim about a standard deviation, but that is rare, and not covered in this course.
There are four main ways to analyze means. A single mean (requiring a numerical value) is discussed in sections 7.4 and 7.5. Comparison of two independent means is discussed in section 8.3 and the comparison of two dependent or paired means is discussed in section 8.4. Finally, comparison of three or more independent means is discussed in section 10.4.
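For a single mean, the claim supplies the numerical value to test against, as in the claim that the average tip is 15%. The sketch below computes the one-sample t statistic, assuming the population standard deviation is unknown. The tip percentages and the function name are invented for illustration. The p-value is left to Minitab, the calculator, or a t table, since Python's standard library has no t distribution.

```python
from math import sqrt
from statistics import mean, stdev

def one_sample_t(data, mu0):
    """Return the t statistic and degrees of freedom for H0: mu = mu0."""
    n = len(data)
    x_bar = mean(data)                    # sample mean
    s = stdev(data)                       # sample standard deviation (n - 1 divisor)
    t = (x_bar - mu0) / (s / sqrt(n))     # test statistic
    return t, n - 1

# Hypothetical tip percentages from ten tables.
tips = [12.0, 18.5, 15.0, 20.0, 14.5, 16.0, 17.5, 13.0, 19.0, 15.5]
t, df = one_sample_t(tips, mu0=15.0)
print(f"t = {t:.3f}, df = {df}")
```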
You can also perform correlation and regression with two quantitative variables. Simple regression, with just one predictor variable, is covered in chapter 9. Multiple regression, with several predictor variables, is not covered in the textbook but is available online.
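With two quantitative variables, the correlation coefficient and the least-squares line can be computed directly from the sums of squares. This is a sketch only, with made-up data (hours studied and test scores) and a hypothetical function name. It also returns the usual t statistic for testing whether the linear correlation is zero, with n - 2 degrees of freedom.

```python
from math import sqrt
from statistics import mean

def simple_regression(x, y):
    """Return r, slope, intercept, and the t statistic for H0: rho = 0."""
    n = len(x)
    x_bar, y_bar = mean(x), mean(y)
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    syy = sum((yi - y_bar) ** 2 for yi in y)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    r = sxy / sqrt(sxx * syy)                 # linear correlation coefficient
    slope = sxy / sxx                         # least-squares slope
    intercept = y_bar - slope * x_bar         # least-squares intercept
    t = r * sqrt((n - 2) / (1 - r ** 2))      # test statistic, df = n - 2
    return r, slope, intercept, t

# Hypothetical data: hours studied and test score for five students.
hours = [1, 2, 3, 4, 5]
scores = [52, 60, 61, 70, 77]
r, slope, intercept, t = simple_regression(hours, scores)
print(f"r = {r:.3f}, score = {intercept:.1f} + {slope:.1f} * hours, t = {t:.2f}")
```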
These are some of the many projects that students have worked on before. You should not limit yourself to these topics, but they may give you guidance for picking your topic. Topics that are related to people's work usually turn out to be the best projects.
You can also get ideas from reading newspapers or online news sites. I went to MSNBC.com and typed in keywords like "average", "more likely", or "correlation" to get some of the claims I used in the examples.
Available online are some sample projects prepared by the instructor. I do not expect your projects to be as long or detailed. There are sample student projects available in the classroom on the filing cabinet.