This is a semester long project involving many of the aspects of Statistics. It can also serve as an example for the project that students will have to perform.
The class will be divided into groups of 2-4 students each. Each group will be assigned one store.
Periodically, information will be turned in for a grade. All assignments submitted for a grade should have the name of the bookstore and the name of all the students in the group at the top of the document.
All information submitted to the instructor is to be printed or typed. Sample output from the different parts of the project will be made available in Adobe PDF format as the semester progresses so that you will have some guidelines about how to proceed.
Parts of the project will be due at different times. Due dates and point values will be posted.
This will take place very early in the semester, either during the first or second class period.
Exchange names, phone numbers, email addresses, and schedules with the other people in your group.
The list of books that you are given is the top 15 books from the NY Times Best Seller List for January 14, 2001. Take the list and visit your bookstore and find the prices for those books. Record the price that store charges. If the store does not carry a book, then write N/A on the sheet.
Some stores carry more than one format of a book. Try to match the format (hard cover / paperback) and the retail price to the one shown so that we are all comparing the same thing.
After you have collected the data, use SPSS and open the file \\acad2\stats\01\books, and enter your data. If there were any books you couldn't find, then enter -1 for the price of that book and it will show up as N/A and not be used in the calculations.
After you have entered the data, save it as \\acad2\stats\01\storename, where "storename" is the name of your store (do not put the .com in the filename). The instructor will go through and merge everyone's data into one data set.
Each group should describe the data for their store only (apply a filter to the data using the select cases command).
Describe the following for the price of books at your store.
The output from SPSS should be cleaned up and annotated. After the entire group has checked the results, print and submit the report.
The number in parentheses after each question is the section number from the textbook that the material covers.
Begin by finding the mean price of all the books for all the stores.
Then select your bookstore and compare the mean for your store to the numerical value found in the first step using a one-sample t-test.
Select just your bookstore and then compare the price to the discount price using the paired samples t-test. Make this comparison for both the entire store and for each of the six categories of books (use split file).
Compare the price using the independent samples t-test. Compare the prices for the entire store and for each format of book (use split file).
This will be the same report by all of the groups.
Compare the prices for the entire store using the one-way ANOVA.
If there are significant differences in the mean prices, which stores are different (post-hoc tests)?
Which is the cheapest store? Which is the most expensive store?Here is sample output for questions 1-3 and question 4.
As a group, comment on the entire project. Address questions like
This should be typed and only one document per group submitted.
As an individual, evaluate each member of your group including yourself. Comment on how much they contributed to the group. Did they show up for all the meetings, did they participate when they showed up, did they pull their weight or did they not do anything.
In addition to a paragraph describing each person, assign them a score between 0 and 10 points for their effort in the group. Remember you are evaluating yourself, also.
* The score that you receive for this part of the project will be the mean scores given to you by each person in your group.
This may be handwritten and there should be one document per person.
|Thu, Jan 18||Groups assigned|
|Fri, Feb 9||Data collected and entered into SPSS (20 points)|
|Fri, Mar 2||Descriptive Statistics Due (20 points)|
|Thu, Apr 12||Inferential Statistics Questions 1-3 Due (30 points)|
|Thu, May 10||Project Evaluations Due (20 points)|
|Fri, May 11||Inferential Statistics Question 4 Due (10 points)|
1. Amazon.com - http://www.amazon.com/
2. BN.Com - http://www.bn.com/ (Barnes & Noble - online)
3. Bluelight.com - http://www.bluelight.com (Kmart's online store)*
4. Borders.com - http://www.borders.com/
5. Barnes & Noble (Springfield)
6. WaldenBooks (Forsyth)
* The instructor has already gathered this data and will use it for generating sample output.
The following variables will be collected for each book.
We will be using SPSS to do the statistical work with this project. SPSS is available in S137 and the machines against the wall in C239. SPSS is commercially licensed software and you are not permitted to take a copy home, so plan on allowing some time at Richland to work on this project.
It is highly recommended that you go through the tutorial to become familiar with SPSS software.
You will open and save files from the academic file server. By doing this, the data and output that you create will be available anywhere in the College and not just on the machine where you work. You may wish to have a floppy disk to make a backup copy.
When you turn on machines, it will ask for a password for Microsoft Networking. This is so that you can get access to the file and print servers. The login is "student" and the password is "richland".
The path that you will enter when you open or save files will be: \\acad2\stats\01\
Be sure to name the files you save as something distinct. Use your name or the name of your store.
There are two types of files that we will be working with. Data files have a .SAV extension and the output Viewer Documents have a .SPO extension. If you have trouble finding the file you're looking for, make sure the document type is set correctly.
If your print upstairs in C239, make sure you select the "Postcript" printer (the one with PS at the end of the name) or SPSS will crash. For this reason, as well as others, it is highly recommended that you save your files before printing.
When the instructions ask you to filter or select cases, go into Data / Select Cases. Choose IF, and then specify a condition like "store=1".
When the instructions ask you to generate output for different classification variables (format, class, online, etc), use the Data / Split File command. Use "compare groups by" and then choose the classification variable.