This project deals with simulations. There are two kinds of simulations that we're going to look at. One involves finding the chance or likelihood of something happening; the other involves finding the average or expected number of times something will take to occur.
This project will go much quicker if you work in your teams. Have two of the members perform the simulation part and the third person record their results.
This is a simulation that is looking for the average number of kids you'll have to have before having at least two of each gender. We will simulate families and determine the size of each family. We'll repeat this process until we've generated 50 families and then find the average of the fifty families.
There are two sexes to choose from, boy and girl. We'll put those genders into a column and then sample from the column.
Since boys and girls can occur more than once, we'll be sure to sample with replacement. Although theoretically possible to have 25 kids without having at least two of each gender, it's highly unlikely. Make the number larger if you don't feel like taking the risk.
Here are some sample families as generated by Minitab. The family size is the number of kids before there are at least two boys and two girls.
# | Size | Family |
---|---|---|
1 | 4 | girl, girl, boy, boy |
2 | 5 | girl, boy, girl, girl, boy |
3 | 6 | boy, boy, boy, girl, boy, girl |
Although you're only asked to find the average family size, you may also want to comment on the shape and spread of the data.
There are four couples to play games. The easiest way to do this is to just use the last name of the couples and make up four different names. Put each name in twice, but make sure you do not sample with replacement.
Assume that players 1 and 2 make one team, players 3 and 4 make another, players 5 and 6 make the third, and players 7 and 8 make the fourth team.
We're interested in the percent of "yes" results.
In the following example, each column represents a different selection of teams. The bottom row represents whether or not that selection was good (no couples on a team) or not. The "good" will be a column when you do this in Minitab.
# | Teams | Teams | Teams | Teams |
---|---|---|---|---|
1 | Smith | Jones | Anderson | Williams |
2 | Jones | Smith | Smith | Jones |
3 | Jones | Williams | Williams | Smith |
4 | Anderson | Jones | Jones | Smith |
5 | Williams | Smith | Smith | Jones |
6 | Smith | Anderson | Williams | Anderson |
7 | Anderson | Williams | Anderson | Williams |
8 | Williams | Anderson | Jones | Anderson |
good? | yes | yes | yes | no |
As you can see in the fourth simulation, team 2 (players 3 and 4) were both Smiths.
That's all the setup there is that's necessary.
The Bernoulli distribution will return a one 38.6% of the time and a 0 the other 61.4% of the time. We will let 1 stand for a shot that was made and a 0 stand for a shot that was missed.
# | streak | shots |
---|---|---|
1 | no | 0 0 1 0 0 0 0 0 0 0 1 0 0 0 0 0 |
2 | no | 0 1 0 1 1 1 0 0 0 0 0 1 0 0 1 1 |
3 | no | 1 0 0 0 1 0 1 1 0 0 1 0 0 0 1 1 |
4 | yes | 1 1 0 1 1 1 1 1 1 0 1 1 0 0 1 1 |
Notice that it has to be at least four 1's in a row. It is okay to be more, as in the fourth game simulated above. If it should happen that he has at least four shots in a row twice in a game, you still only count the overall game as "yes" once ... don't count it twice.
Let's see how you can do with this one on your own. Here are some general steps. You fill in the particulars.
We need to create a deck of cards. There are thirteen possible values since we don't care about what suit the card is from. However, we can't just put in the thirteen values and use replacement because we might possibly get five of a kind and that would be really bad for a poker hand. So, what we have to is actually enter each of the thirteen possible values four times (one for each suit).
# | pair | hand |
---|---|---|
1 | no | 3 7 8 5 ace |
2 | yes | 2 7 8 3 2 |
3 | yes | jack 2 6 queen 2 |
4 | no | 5 8 9 jack 10 |
The 2nd and 3rd rows each had exactly one pair. They both happened to be a pair of 2's, which is the worst pair you can have, but at least it was a pair. Should you happen to get data like "2 7 2 8 2", you would not call that a pair because it's three of a kind. Technically, there is a pair in three of a kind, but we're looking for the probability of getting exactly one pair.