hypothesis
The University of California, Berkeley (Cal), and Stanford University are athletic archrivals in the Pacific 10 conference. Stanford fans claim Stanford’s basketball team is better than Cal’s team; Cal fans challenge this assertion.
In 2004, Stanford University’s basketball team went nearly undefeated within the Pac 10. The records of Stanford, Cal, and the other eight teams in the conference are listed in the accompanying table. In all, there were 89 games played among the Pac 10 teams in the season.
Stanford won 17 of the 18 games it played; Cal won 9 of 18. We would like to use these data to test the Stanford fans’ claim that Stanford’s team is better than Cal’s. That is, we would like to determine whether the difference between the two teams’ performance reasonably could be attributed to chance if the Stanford and Cal teams in fact have equal skill.
Problem 1. The null hypothesis is that (Q1)? A: the Stanford and Cal teams have the same skill B: Cal has the better team C: Stanford has the better team D: the Stanford and Cal teams have different skills E: none of the above
The alternative hypothesis is that (Q2)? A: the Stanford and Cal teams have the same skill B: Cal has the better team C: Stanford has the better team D: the Stanford and Cal teams have different skills E: none of the above
To test the hypothesis, we shall make a number of simplifying assumptions. First of all, we shall ignore the fact that some of the games were played between Stanford and Cal: we shall pretend that all the games were played against other teams in the conference. One strong version of the hypothesis that the two teams have equal skill is that the outcomes of the games would have been the same had the two teams swapped schedules. That is, suppose that when Washington played Stanford on a particular day, Stanford won. Under this strong hypothesis, had Washington played Cal that day instead of Stanford, Cal would have won.
A weaker version of the hypothesis is that the outcomes of Stanford’s games are determined by independent draws from a 0-1 box that has a fraction pS of tickets labeled “1” (Stanford wins the game if the ticket drawn is labeled “1”), that the outcomes of Berkeley’s games are determined similarly, by independent draws from a 0-1 box with a fraction pC of tickets labeled “1,” and that pS = pC. This model has some shortcomings. (For instance, when Berkeley and Stanford play each other, the independence assumption breaks down, and the fraction of tickets labeled “1” would need to be 50%. Also, it seems unreasonable to think that the chance of winning does not depend on the opponent. We could refine the model, but that would require knowing more details about who played whom, and the outcome.)
Nonetheless, this model does shed some light on how surprising the records would be if the teams were, in some sense, equally skilled. This box model version allows us to use Fisher’s exact test for independent samples, considering “treatment” to be playing against Stanford, and “control” to be playing against Cal, and conditioning on the total number of wins by both teams (17 + 9 = 26).
Problem 2. The test statistic is (Q3)? A: the number of games Stanford wins B: the number of games Cal wins C: the number of times Stanford beats Cal D: the number of times Cal beats Stanford E: none of the above
If the null hypothesis is true, the test statistic has a (Q4) distribution with parameter(s) (Q5).
(Q4)? A: geometric B: binomial C: hypergeometric D: negative binomial E: none of the above
(Q5)? A: n=26, p=0.5 B: p=26/36 C: n=36, p=0.5 D: N=36, G=26, n=17 E: N=36, G=26, n=18 F: N=36, G=17, n=18 G: p=0.5 H: p=17/18, r=17 I: n=36, p=26/36 J: N=26, G=17, n=18 K: p=26/38, r=17 L: p=0.5, r=17 M: none of the above
Problem 3. The P-value for a one-sided test against the alternative hypothesis that the Stanford team is better than the Cal team is (Q6)
At a significance level of 10%, we should reject the null hypothesis. (Q7) ? A: false B: true
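A one-sided P-value for Fisher’s exact test is an upper tail probability of the conditional distribution of the test statistic. The sketch below shows one way to compute such a tail with the Python standard library. The function name hypergeom_upper_tail and the records used in the example (two teams playing 10 games each, winning 8 and 4) are made up for illustration; they are not the records in this problem.

```python
from math import comb


def hypergeom_upper_tail(N, G, n, k):
    """Chance of at least k tickets labeled "1" in n draws without
    replacement from a box of N tickets, G of which are labeled "1"."""
    # Smallest and largest counts that are actually possible.
    lowest = max(k, 0, n - (N - G))
    highest = min(n, G)
    total = comb(N, n)  # number of equally likely samples
    favorable = sum(
        comb(G, x) * comb(N - G, n - x) for x in range(lowest, highest + 1)
    )
    return favorable / total


# Hypothetical records: team A wins 8 of 10 games, team B wins 4 of 10.
# Conditioning on the 12 total wins, how surprising is it that A has 8 or more?
p_value = hypergeom_upper_tail(N=20, G=12, n=10, k=8)
print(f"one-sided P-value: {p_value:.4f}")
```

To apply the sketch to a different pair of records, replace N with the total number of games, G with the total number of wins, n with the number of games played by the team claimed to be better, and k with that team’s number of wins.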
Problem 4. If the null hypothesis is true, the expected value of the test statistic is (Q8) and the standard error of the test statistic is (Q9)
The z-score of the test statistic is (Q10)
The normal approximation to the P-value for Fisher’s exact test against the alternative that the Stanford team is better than the Berkeley team is (Q11)
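The normal approximation replaces the exact tail probability with the area under the normal curve above the z-score of the observed statistic. The following sketch assumes the usual formulas for the expected value and standard error of the number of tickets labeled “1” in a simple random sample without replacement, and it does not apply a continuity correction. The function names and the records in the example (8 wins and 4 wins in 10 games each) are hypothetical and are not the records in this problem.

```python
from math import erfc, sqrt


def hypergeom_mean_se(N, G, n):
    """Expected value and standard error of the number of "1" tickets in
    n draws without replacement from N tickets, G of them labeled "1"."""
    p = G / N
    mean = n * p
    # Finite population correction times the with-replacement SE.
    se = sqrt((N - n) / (N - 1)) * sqrt(n) * sqrt(p * (1 - p))
    return mean, se


def normal_upper_tail(z):
    """Area under the standard normal curve to the right of z."""
    return 0.5 * erfc(z / sqrt(2))


# Hypothetical records: team A wins 8 of 10 games, team B wins 4 of 10.
mean, se = hypergeom_mean_se(N=20, G=12, n=10)
z = (8 - mean) / se
approx_p = normal_upper_tail(z)
print(f"expected value {mean:.2f}, SE {se:.3f}, z {z:.3f}, approx P {approx_p:.4f}")
```

With so few games the approximation can differ noticeably from the exact tail probability, so comparing the two is a useful check.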
Now consider the z test using independent samples. We pretend that each team’s wins and losses are independent random samples with replacement from a 0-1 box in which the fraction of ones represents the probability that that team wins each game it plays. The number of tickets labeled “1” in the sample is the number of games the team wins.
Problem 5. The sample percentage of games won by Stanford is (Q12) On the assumption that the null hypothesis is true, the bootstrap estimate of the standard error of the sample percentage of games won by Stanford is (Q13)
(Hint: if the null hypothesis is true, then with the simplifications we made it is as if the teams independently draw at random with replacement from the same box of tickets. What percentage of the tickets in that box would you estimate to be labeled “1?”)
The sample percentage of games won by Cal is (Q14) On the assumption that the null hypothesis is true, the bootstrap estimate of the standard error of the sample percentage of games won by Cal is (Q15)
The difference in sample percentages of games won by Stanford and Cal is (Q16) On the assumption that the null hypothesis is true, the bootstrap estimate of the standard error of the difference in sample percentages is (Q17)
The z-score for the difference in sample percentages is (Q18)
The approximate P-value for the z test against the two-sided alternative that the Stanford and Berkeley teams have different skills is (Q19)
At a significance level of 10%, we should reject the null hypothesis. (Q20) ? A: false B: true
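The steps of the z test for independent samples can be sketched in code. This is a minimal sketch, assuming the pooled estimate suggested by the hint: under the null hypothesis both teams draw from the same box, so the fraction of tickets labeled “1” is estimated from the combined records. The function name pooled_two_sample_z and the example records (8 wins and 4 wins in 10 games each) are hypothetical and are not the records in this problem.

```python
from math import erfc, sqrt


def pooled_two_sample_z(wins_a, games_a, wins_b, games_b):
    """z-score and two-sided normal P-value for the difference in
    sample percentages, using a pooled estimate of the box percentage."""
    pooled = (wins_a + wins_b) / (games_a + games_b)
    sd_box = sqrt(pooled * (1 - pooled))  # bootstrap estimate of the SD of the box
    se_a = sd_box / sqrt(games_a)         # SE of team A's sample percentage
    se_b = sd_box / sqrt(games_b)         # SE of team B's sample percentage
    se_diff = sqrt(se_a**2 + se_b**2)     # independent samples: squared SEs add
    z = (wins_a / games_a - wins_b / games_b) / se_diff
    p_two_sided = erfc(abs(z) / sqrt(2))  # area in both tails beyond |z|
    return z, p_two_sided


# Hypothetical records: team A wins 8 of 10 games, team B wins 4 of 10.
z, p = pooled_two_sample_z(8, 10, 4, 10)
print(f"z = {z:.3f}, two-sided P-value = {p:.4f}")
```

The two-sided P-value counts both tails, so for the same records it is twice the one-sided normal tail area beyond |z|.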