The Practice of Statistics



Chapter 1: Exploring Data

Key Vocabulary:

▪ individual

▪ variable

▪ frequency table

▪ relative frequency table

▪ distribution

▪ pie chart

▪ bar graph

▪ two-way table

▪ marginal distributions

▪ conditional distributions

▪ side-by-side bar graph

▪ association

▪ dotplot

▪ stemplot

▪ histogram

▪ SOCS

▪ outlier

▪ symmetric

▪ Σ

▪ [pic]

▪ spread

▪ variability

▪ median

▪ quartiles

▪ Q1, Q3

▪ IQR

▪ five-number summary

▪ minimum

▪ maximum

▪ boxplot

▪ resistant

▪ standard deviation

▪ variance

Data Analysis: Making Sense of Data (pp.2-6)

1. Individuals are…

2. A variable is …

3. When you first meet a new data set, ask yourself:

• Who…

• What…

• Why, When, Where and How…

4. Explain the difference between a categorical variable and a quantitative variable. Give an example of each.

5. Give an example of a categorical variable that has number values.

6. Define distribution:

7. What are the four steps to exploring data?

• Begin by….

• Study relationships…

• Start with a …

• Then add…

8. Answer the two questions for the Check Your Understanding on page 5:

9. Define inference.

1. Analyzing Categorical Data (pp.8-22)

1. A frequency table displays…

2. A relative frequency table displays…

3. What type of data are pie charts and bar graphs used for?

4. Categories in a bar graph are represented by ___________ and the bar heights give the category __________________.

5. What is a two-way table?

6. Define marginal distribution.

7. What are the two steps in examining a marginal distribution?

8. Answer the two questions for the Check Your Understanding on page 14.

9. What is a conditional distribution? Give an example demonstrating how to calculate one set of conditional distributions in a two-way table.

10. What is the purpose of using a segmented bar graph?

11. Answer question one for the Check Your Understanding on page 17.

12. Describe the four steps to organizing a statistical problem:

• State…

• Plan…

• Do…

• Conclude…

13. Explain what it meant by an association between two variables.

2. Analyzing Categorical Data (pp.27-42)

1. What is a dotplot? Draw an example.

2. When examining a distribution, you can describe the overall pattern by its

S_____ O_____ C_____ S_____

3. If a distribution is symmetric, what does it look like?

4. If a distribution is skewed to the right, what does it look like?

5. If a distribution is skewed to the left, what does it look like?

6. Describe and illustrate the following distributions:

a. Unimodal

b. Bimodal

c. Multimodal

7. Answer questions 1-4 for the Check Your Understanding on page 31.

8. How are a stemplot and a histogram similar?

9. When is it beneficial to split the stems on a stemplot?

10. When is it best to use a back-to-back stemplot?

11. List the three steps involved in making a histogram.

12. Why is it advantageous to use a relative frequency histogram instead of a frequency histogram?

13. Answer questions 2-4 for the Check Your Understanding on page 35.

3. Analyzing Categorical Data (pp.50-67)

1. What is the most common measure of center?

2. Explain how to calculate the mean, [pic].

3. What is the meaning of (?

4. Explain the difference between [pic] and (.

5. Define resistant measure.

6. Explain why the mean is not a resistant measure of center.

7. What is the median of a distribution? Explain how to find it.

8. Explain why the median is a resistant measure of center?

9. How does the shape of the distribution affect the mean and median?

10. What is the range?

11. Is the range a resistant measure of spread? Explain.

12. How do you find first quartile Q1 and third quartile Q3?

13. What is the Interquartile Range (IQR)?

14. Is the IQR and the quartiles a resistant measure of spread? Explain.

15. How is the IQR used to identify outliers?

16. What is the five-number summary of a distribution?

17. Explain how to use the five-number summary to make a boxplot.

18. What does the standard deviation measure? How do we calculate it?

19. What is the relationship between variance and standard deviation?

20. What are the properties of the standard deviation as explained on page 64?

21. How should one go about choosing measures of center and spread?

Chapter 2: Modeling Distributions of Data

Key Vocabulary:

▪ percentiles

▪ cumulative relative frequency graphs

▪ z-scores

▪ transforming data

▪ density curves

▪ median of density curve

▪ transform data

▪ mean of density curve

▪ standard deviation of density curve

▪ Normal curves

▪ Normal distributions

▪ 68-95-99.7 rule

▪ N(( =__, ( = ___)

▪ standard Normal distribution

▪ standard Normal table

▪ Normal probability plot

▪ [pic] mu

▪ [pic] sigma

2.1 Describing Location in a Distribution (pp.84-103)

1. A percentile is…

1. Is there a difference between the 80th percentile and the top 80%? Explain.

2. Is there a difference between the 80th percentile and the lower 80%? Explain.

3. Refer to the “Cumulative Relative Frequency Graphs” section on page 86 to answer the following questions:

a. Explain how to find the relative frequency column.

b. Explain how to find the cumulative frequency column.

c. Explain how to find the cumulative relative frequency column.

4. Explain how to make a cumulative relative frequency graph.

5. What can a cumulative relative frequency graph be used to describe?

6. Answer the four questions for the Check Your Understanding on page 89.

7. Explain how to standardize a variable.

8. What information does a z – score provide?

9. Explain how to calculate and interpret a z- score.

10. What is the purpose of standardizing a variable?

11. Explain the effects of adding or subtracting a constant from each observation when transforming data.

12. Explain the effects of multiplying or dividing by a constant from each observation when transforming data.

13. Summarize the four steps for exploring quantitative data as outlined on page 99.

14. What is a density curve?

15. What does the area under a density curve represent?

16. Where is the median of a density curve located?

17. Where is the mean of a density curve located?

18. Answer questions 1 and 2 for the Check Your Understanding on page 103.

2. Normal Distributions (pp.110-128)

1. How would you describe the shape of a Normal curve? Draw two examples.

2. Explain how the mean and the standard deviation are related to the Normal curve.

3. Define Normal distribution and Normal curve.

4. What is the abbreviation for a Normal distribution with a mean ( and a standard deviation (?

5. Explain the 68-95-99.7 Rule. When does this rule apply?

6. Answer questions 1-3 for the Check Your Understanding on page 114.

7. What is the standard Normal distribution?

8. What information does the standard Normal table give?

9. How do you use the standard Normal table (Table A) to find the area under the standard Normal curve to the left of a given z-value? Draw a sketch.

10. How do you use Table A to find the area under the standard Normal curve to the right of a given z-value? Draw a sketch.

11. How do you use Table A to find the area under the standard Normal curve between two given z-values? Draw a sketch.

12. Summarize the steps on how to solve problems involving Normal distributions as outlined on page 120.

13. When is it appropriate to use Table A “backwards”?

14. Describe two methods for assessing whether or not a distribution is approximately Normal.

15. What is a Normal probability plot?

16. How do you interpret a Normal probability plot?

17. When is it appropriate to use the NormalCDF and Inverse Normal functions on the calculator?

Chapter 3: Describing Relationships

3.1 Scatterplots and Correlation (pp.142-156)

1. Why do we study the relationship between two quantitative variables?

2. What is the difference between a response variable and the explanatory variable?

3. How are response and explanatory variables related to dependent and independent variables?

4. When is it appropriate to use a scatterplot to display data?

5. A scatterplot shows the relationship between…

6. Which variable always appears on the horizontal axis of a scatterplot?

7. When examining a scatterplot, you can describe the overall pattern by its:

D_____ O_____ F_____ S_____

8. Explain the difference between a positive association and a negative association.

9. What is correlation r?

10. Answer the five questions for the Check Your Understanding on page 149.

11. What does correlation measure?

12. Explain why two variables must both be quantitative in order to find the correlation between them.

13. What is true about the relationship between two variables if the r-value is:

a. Near 0?

b. Near 1?

c. Near -1?

d. Exactly 1?

e. Exactly -1?

14. Is correlation resistant to extreme observations? Explain.

15. What do you need to know in order to interpret correlation?

3.2 Least-Squares Regression (pp.164-188)

1. What is a regression line?

2. In what way is a regression line a mathematical model?

3. What is the general form of a regression equation? Define each variable in the equation.

4. What is the difference between y and [pic]?

5. What is extrapolation and why is this dangerous?

6. Answer the four questions for the Check Your Understanding on page 167.

7. What is a residual? How do you interpret a residual?

8. What is a least-squares regression line?

9. What is the formula for the equation of the least-squares regression line?

10. The least-squares regression line always passes through the point ...

11. What is a residual plot? Sketch a graph of a residual plot.

12. If a least-squares regression line fits the data well, what two characteristics should the residual plot exhibit?

13. What is the standard deviation of the residuals? How is it interpreted?

14. How is the coefficient of determination defined?

15. What is the formula for calculating the coefficient of determination?

16. If r2 = 0.95, what can be concluded about the relationship between x and y?

______% of the variation in (response variable) is accounted for by the regression line.

17. When reporting a regression, should r or r2 be used describe the success of the regression? Explain.

18. Identify the slope, the y intercept, s and r2 on the computer output.

[pic]

19. What are three limitations of correlation and regression?

20. What is an outlier?

21. What is an influential point?

22. Under what conditions does an outlier become an influential observation?

23. What is a lurking variable?

24. Why does association not imply causation?

Chapter 4: Designing Studies

Key Vocabulary:

▪ sample

▪ population

▪ sample survey

▪ voluntary response sample

▪ confounded

▪ design

▪ convenience sampling

▪ biased

▪ simple random sample

▪ table of random digits

▪ probability sample

▪ stratified random sample

▪ cluster sampling

▪ inference

▪ margin of error

▪ strata

▪ undercoverage

▪ nonresponse

▪ response bias

▪ sampling frame

▪ systematic random sample

▪ observational study

▪ experimental

▪ confounding

▪ lurking variable

▪ experimental units

▪ subjects

▪ random assignment

▪ treatment

▪ factor

▪ level

▪ placebo effect

▪ single blind experiment

▪ control group

▪ completely randomized experiment

▪ randomized block design

▪ matched pair design

▪ statistically significant

▪ replication

▪ hidden bias

▪ double-blind experiment

▪ block design

▪ data ethics

4.1 Sampling and Surveys (pp.206-224)

1. Explain the difference between a population and a sample.

2. What is involved in planning a sample survey?

3. Why might convenience sampling be unreliable?

4. What is a biased study?

5. Why are voluntary response samples unreliable?

6. Define simple random sample (SRS).

7. What two properties of a table of random digits make it a good choice for creating a simple random sample?

8. State the two steps in choosing an SRS:

9. What is the difference between sampling with replacement and sampling without replacement?

10. How can you account for this difference with and without replacement when using a table of random digits or other random number generator?

11. How do you select a stratified random sample?

12. What is cluster sampling?

13. What is inference?

14. What is a margin of error?

15. What is the benefit of a larger sample size?

16. A sampling frame is…

17. Give an example of undercoverage in a sample.

18. Give an example of nonresponse bias in a sample.

19. Give an example of response bias in a sample.

20. How can the wording of questions cause bias in a sample?

21. Answer the two questions for the Check Your Understanding on page 224.

4.2 Experiments (pp.231-251)

1. Explain the differences between observational study and experiment.

2. A lurking variable is…

3. What problems can lurking variables cause?

4. Confounding occurs when…

5. Answer the four questions for the Check Your Understanding on page 233.

6. Explain the difference between experimental units and subjects.

7. Define treatment.

8. By studying the TV Advertising example on page 235, identify the factors and levels in the experiment.

9. Explain why the example, Which Works Better: Online or In-Class SAT Preparation, is a bad experiment.

10. What is random assignment?

11. What is a comparative experimental design?

12. In a completely randomized design…

13. Does using chance to assign treatments in an experiment guarantee a completely randomized design? Explain.

14. What is the significance of using a control group?

15. The basic principles of statistical design experiments are:

16. Define control, random assignment and replication in experimental design.

17. Describe the placebo effect.

18. What are the differences between a double-blind and single-blind experiement?

19. Define statistically significant.

20. What is a block?

21. What is a randomized block design?

22. When does randomization take place in a block design, and how does this differ to a completely randomized design?

23. What is the goal of a matched pairs design?

24. When is it beneficial to use a blocked/paired design? How should we choose which variables to block for?

4.3 Using Studies Wisely (pp.261-267)

1. Name the two types of inferences that can be identified based on the design of a study.

2. Name the challenges of establishing causation.

3. What are the four criteria for establishing causation when we can’t do an experiment?

4. Briefly describe the basics of data ethics.

Chapter 5: Probability: What are the chances?

Key Vocabulary:

▪ law of large numbers

▪ probability

▪ simulation

▪ two -way table

▪ sample space

▪ S = {H, T}

▪ tree diagram

▪ probability model

▪ replacement

▪ event

▪ P(A)

▪ complement AC

▪ disjoint

▪ mutually exclusive event

▪ Venn diagram

▪ union (or)

▪ intersection (and)

▪ conditional probability

▪ independent events

▪ general multiplication rule

▪ general addition rule

▪ multiplication rule

5.1 Randomness, Probability, and Simulation (pp.282-292)

1. What is the law of large numbers?

2. The probability of any outcome…

3. How do you interpret a probability?

4. Answer the two questions for the Check Your Understanding on page 286.

5. What are the two myths about randomness? Explain.

6. Define simulation.

7. Name and describe the four steps in performing a simulation:

8. What are some common errors when using a table of random digits?

5.2 Probability Rules (pp.299-308)

1. In statistics, what is meant by the term sample space?

2. In statistics, what is meant the term probability model?

3. What is an event?

4. What is the P (A) if all outcomes in the sample space are equally likely?

5. Define the complement of an event. What is the complement rule?

6. Explain why the probability of any event is a number between 0 and 1.

7. What is the sum of the probabilities of all possible outcomes?

8. Describe the probability that an event does not occur?

9. When are two events considered disjoint or mutually exclusive?

10. What is the addition rule for mutually exclusive events?

11. What is the probability of two disjoint events?

12. Summarize the five basic probability rules as outlined on page 302.

13. Answer the three questions for Check Your Understanding on page 303.

14. When is a two-way table helpful?

15. In statistics, what is meant by the word “or”?

16. When can a Venn diagram be helpful?

17. What is the general addition rule for two events?

18. What happens if the general addition rule is used for two mutually exclusive events?

19. What does the union of two or more events mean? Illustrate on a Venn diagram.

20. What does the intersection of two or more events mean? Illustrate on a Venn diagram.

5.3 Conditional Probabilty and Independence (pp.312-327)

1. What is conditional probablity? What is the notation for conditional probabilty?

2. Answer the two questions for the Check Your Understanding on page 314.

3. What are independent events?

4. What is the notation used for independent events?

5. Answer the three questions for Check Your Understanding on page 317.

6. When is a tree diagram helpful?

7. State the general multiplication rule for any two events.

8. State the multiplication rule for independent events.

9. How is the general multiplication rule different than the multiplication rule for independent events?

10. Explain the difference between mutually exclusive and independent.

11. State the formula for calculating conditional probabilities.

12. How is the conditional probability formula related to the general multiplication rule?

Chapter 6: Random Variables

Key Vocabulary:

▪ random variable

▪ discrete random variable

▪ probability distribution

▪ mean of a random variable

▪ variance of a random variable

▪ probability density curve

▪ continuous random variable

▪ standard deviation

▪ binomial setting

▪ binomial random variable

▪ binomial distribution

▪ binomial coefficient

▪ binomial probability

▪ linear transformation

▪ normal approximation

▪ geometric setting

▪ geometric distribution

▪ geometric random variable

▪ Normal approximation

▪ geometric probability

▪ factorial

▪ expected value

▪ standard deviation

▪ [pic]

▪ [pic]

▪ uniform distribution



6.1 Discrete and Continuous Random Variables (pp.341-352)

1. What is a random variable?

2. Define probability distribution.

3. What is a discrete random variable?

4. What are the two requirements for the probability distributions of discrete random variables?

5. If X is a discrete random variable, what information does the probability distribution of X give?

6. In a probability histogram what does the height of each bar represent (assuming the width of each bar is the same)?

7. In a probability histogram, what is the sum of the height of each bar?

8. What is the mean [pic]of a discrete random variable X?

9. How do you calculate the mean of a discrete random variable?

10. Define expected value. What notation is used for expected value?

11. Does the expected value of a random variable have to equal one of the possible values of the random variable? Explain.

12. Explain how to calculate the variance and standard deviation of a discrete random variable.

13. Explain the meaning of the standard deviation of a random variable X.

14. What is a continuous random variable and how is it displayed?

15. If X is a continuous random variable, how is the probability distribution of X described?

16. What is the area under a probability density curve equal to?

17. What is the difference between a discrete random variable and a continuous random variable?

18. If X is a discrete random variable, do [pic] and [pic] have the same value? Explain.

19. If X is a continuous random variable, do [pic] and [pic] have the same value? Explain.

20. How is a Normal distribution related to probability distribution?

6.2 Transforming and Combining Random Variables

(pp.358-375)

1. What is the effect on a random variable of multiplying or dividing by a constant?

2. How does multiplying by a constant effect the variance?

3. What is the effect on a random variable of adding or subtracting by a constant?

4. Define linear transformation.

5. What are the effects of a linear transformation on the mean and standard deviation?

6. Define the mean of the sum of random variables.

7. What are independent random variables?

8. Define the variance of the sum of independent random variables. What types of variables does it apply to?

9. When can you add the variances of two random variables?

10. State the equation for the mean of the difference of random variables?

11. State the formula for the variance of the difference of random variables.

12. What happens if two independent Normal random variables are combined?

13. Suppose [pic] = 5 and [pic] = 10. According to the rules for means, what is [pic]?

14. Suppose [pic] = 2. According to the rules for means, what is [pic]?

15. Suppose [pic] = 2 and [pic] = 3 and X and Y are independent random variables. According to the rules for variances, what is [pic]? What is [pic]?

16. Suppose [pic] = 4. According to the rules for variances, what is [pic]? What is [pic]?

6.3 Binomial and Geometric Random Variables (pp.382-401)

1. What is a binomial setting?

2. Describe the conditions of a binomial setting.

3. What is a binomial random variable and what are its possible values?

4. Define the parameters of a binomial distribution.

5. Explain the meaning of the binomial coefficient and state the formula.

6. Explain how to calculate binomial probabilities.

7. What commands on the calculator are used to calculate binomial probabilities?

8. Explain how to calculate the mean and standard deviation of a binomial random variable.

9. When can the binomial distribution be used to sample without replacement? Explain why this is an issue.

10. What is a geometric setting?

11. Describe the conditions for a geometric setting.

12. What is a geometric random variable and what are its possible values?

13. Describe the parameters of a geometric distribution.

14. What is the formula for geometric probability?

15. How is the mean of a geometric random variable calculated?

Chapter 7: Sampling Distributions

Key Vocabulary:

▪ parameter

▪ statistic

▪ sampling variability

▪ sampling distribution

▪ population distribution

▪ biased estimator

▪ unbiased estimator

▪ bias

▪ variability

▪ variability of a statisic

▪ sample proportion

▪ mean and standard deviation of sampling distributions

▪ central limit theorem

7.1 What Is a Sampling Distribution? (pp.414-428)

1. Explain the difference between a parameter and a statistic?

2. What is sampling variability?

3. Explain the difference between [pic] and [pic], and between p and [pic]?

4. What is meant by the sampling distribution of a statistic?

5. What is population distribution?

6. What is the difference between the distribution of the population, the distribution of the sample, and the sampling distribution of a sample statistic?

7. What is sampling variability?

8. Explain the difference between [pic] and [pic], and between p and [pic]?

9. What is meant by the sampling distribution of a statistic?

10. What is population distribution?

11. What is the difference between the distribution of the population, the distribution of the sample, and the sampling distribution of a sample statistic?

12. When is a statistic considered an unbiased estimator?

13. What is biased estimator?

14. How is the size of a sample related to the spread of the sampling distribution?

15. The variability of a statistic is…

16. Explain the difference between bias and variability.

7.2 Sample Proportions (pp.432-438)

1. What is the purpose of the sample proportion?

2. In an SRS of size n, what is true about the sampling distribution of [pic] when the sample size n increases?

3. In an SRS of size n:

a. What is the mean of the sampling distribution of [pic]?

b. What is the standard deviation of the sampling distribution of [pic]?

4. What happens to the standard deviation of [pic] as the sample size n increases?

5. When does the formula [pic] apply to the standard deviation of [pic]?

6. When the sample size n is large, the sampling distribution of [pic] is approximately Normal. What test can you use to determine if the sample is large enough to assume that the sampling distribution is approximately normal?

7.3 Sample Means (pp.442-452)

1. What are the mean and standard deviation of the sampling distribution of the sample mean [pic]? Describe the conditions for these formulas.

2. Explain how the behavior of the sample mean and standard deviation are similar to the sample proportion.

3. The mean and standard deviation of a population are parameters.

What symbols are used to represent these parameters?

4. The mean and standard deviation of a sample are statistics.

What symbols are used to represent these statistics?

5. The shape of the distribution of the sample mean depends on …

6. Because averages are less variable than individual outcomes, what is true about the standard deviation of the sampling distribution of [pic]?

7. What symbols are used to represent the mean and standard deviation of the sampling distribution of [pic]?

8. What is the mean of the sampling distribution of [pic], if [pic] is the mean of an SRS of size n drawn from a large population with mean μ and standard deviation σ?

9. What is the standard deviation of the sampling distribution of [pic], if [pic] is the mean of an SRS of size n drawn from a large population with mean μ and standard deviation σ?

10. When should you use [pic] to calculate the standard deviation of [pic]?

11. What is the Central Limit Theorem?

Chapter 8: Estimating with Confidence

Key Vocabulary:

▪ point estimator

▪ point estimate

▪ confidence interval

▪ margin of error

▪ interval

▪ confidence level

▪ random

▪ normal

▪ independent

▪ four step process

▪ level C confidence interval

▪ degrees of freedom

▪ standard error

▪ one -sample z interval

▪ t distribution

▪ t-procedures

▪ one-sample t interval

▪ robust

[pic]

8.1 Confidence Intervals: The Basics (pp.615-643)

10. A point estimator is a statistic that…

11. The value of the point estimator statistic is called a ____________________ and it is our

"best guess" at the value of the _____________________.

12. Summarize the facts about sampling distributions learned in chapter 7:

Shape

Center

Spread

13. In statistics, what is meant by a 95% confidence interval?

14. A confidence interval takes the form of : “estimate [pic] margin of error”

where: estimate =

margin of error =

15. Define a level C confidence interval.

16. What information does the margin of error provide?

17. Sketch and label a 95% confidence interval for the standard normal curve.

18. In a sampling distribution of [pic], why is the interval of numbers between [pic] called a 95% confidence interval?

19. Sketch and label a 90% confidence interval for the standard normal curve.

20. Interpret a Confidence level: "To say that we are 95% confident is shorthand for …..

21. Explain how to interpret a Confidence interval.

22. Does the confidence level tell us the chance that a particular confidence interval captures the population parameter? If not, what does it tell us?

23. What does the critical value depend on?

24. Write the form for calculating a confidence interval as shown on page 478.

25. Why do we want high confidence and a small margin of error?

26. Explain the two conditions when the margin of error gets smaller.

27. State the three conditions for constructing a confidence interval for p or [pic].

• Random

• Normal

• Independent

19. What are the two important reminders for constructing and interpreting confidence intervals?

8.2 Estimating a Population Proportion (pp.484-494)

1. In statistics, what is meant by a sample proportion:[pic]?

2. Give the mean and standard deviation for the sampling distribution of [pic]?

3. How does the standard deviation differ to to standard error for the sampling distribution of

[pic]?

4. Describe the sampling distribution of a sample proportion [pic] as learned in section 7.2.

• Shape

• Center

• Spread

5. Define standard error.

6. In general what is meant by the standard error of a statistic?

7. How do you calculate the standard error of [pic]?

8. What is the formula for a one-sample z interval for a population proportion? Describe how to construct a level C confidence interval for a population proportion.

9. Describe the four step process on how to contruct and interpret a confidence interval.

• State

• Plan

• Do

• Conclude

10. What formula is used to determine the sample size necessary for a given margin of error?

11. What conditions must be met in order to use z procedures for inference about a proportion?

12. What does z* represent?

13. What is the value of z* for a 95% confidence interval? Include a sketch.

14. What is the value of z* for a 90% confidence interval? Include a sketch.

15. What is the value of z* for a 99% confidence interval? Include a sketch.

8.3 Estimating a Population Mean (pp.499-515)

1. What is the formula for a one-sample z interval for a population mean? Describe how to construct a level C confidence interval for a population mean.

2. What is the formula for the margin of error of the confidence interval for the population mean (?

3. How can you arrange to have both high confidence and a small margin of error?

4. Describe the three steps for choosing a sample size for a desired margin of error when estimating (.

5. What happens to the margin of error as z* gets smaller? Does this result in a higher or lower confidence level?

6. What happens to the margin of error, as σ gets smaller?

7. What happens to the margin of error, as n gets larger? By how many times must the sample size n increase in order to cut the margin of error in half?

8. The formula used to determine the sample size n that will yield a confidence interval for a population mean with a specified margin of error m is [pic]. Solve for n.

9. It is the size of the ________________ that determines the margin of error. The size of the ____________________ does not influence the sample size we need.

10. Complete the Check Your Undertanding on page 501.

11. How do you calculate the degrees of freedom for a t distribution?

12. What happens to the t distribution as the degrees of freedom increase?

13. How would you construct a t distribution?

14. Describe the differences between a standard normal distribution and a t distribution.

15. Describe the similarities between a standard normal distribution and a t distribution.

16. What is the formula for the standard deviation of the sampling distribution of the sample mean [pic]?

17. What is the standard error of the sample mean [pic]?

28. Describe how to construct the one-sample t interval for a population mean?

29. Summarize the three conditions for inference about a population mean:

• Random

• Normal

• Independent

30. Inferences for proportions use ______ and inferences for means use _____.

31. What does it mean if an inference procedure is robust?

32. If the size of the SRS is less than 15, when can we use t procedures on the data?

33. If the size of the SRS is at least 15, when can we use t procedures on the data?

34. If the size of the SRS is at least 30, when can we use t procedures on the data?

35. Summarize the details of the four step procedure for estimating p:

• State

• Plan

• Do

• Conclude

Chapter 9: Testing a Claim

Key Vocabulary:

▪ Significance test

▪ Null Hypothesis

▪ Alternative Hypotheses

▪ One sided alternative

▪ Two sided alternative

▪ p-value

▪ [pic]level

▪ significance level

▪ one-sample z test

▪ test statistic

▪ one sample t test

▪ paired data

▪ four-step process

▪ statistically significant

▪ Type I Error

▪ Type II Error

▪ Power

▪ Degrees of freedom

▪ t-distribution

▪ paired t procedures

[pic]

9.1 Significance Tests: The Basics (pp.528-543)

19. What is a significance test?

20. What is the difference between a null and an alternative hypothesis? What notation is used for each?

21. Explain the differences between one-sided and two-sided hypotheses. How can you decide which one to use?

22. What form does the null and alternative hypothesis take in significance testing?

23. Hypotheses always refer to a ___________, not to a ______________.

24. In statistics, what is meant by the P-value? What does a P-value measure?

25. If a P-value is small, what do we conclude about the null hypothesis?

26. If a P-value is large, what do we conclude about the null hypothesis?

27. What are common errors students make in their conclusions of P-values?

28. On what evidence would we reject the null hypothesis?

29. On what evidence would we accept the null hypothesis (ie. fail to reject the null hypothesis)?

30. What is meant by a significance level?

31. Explain what it means to say that data are statistically significant.

32. How small should the P-value be in order to claim that a result is statistically significant?

33. When using a fixed significance level to draw a conclusion in a statistical test what can be concluded when the P value is ( ( and ((?

34. What two circumstances guide us in choosing a level of significance?





35. What is a Type I Error?

36. What is a Type II Error ?

37. Which error is worse, Type I or Type II?

38. Complete the Check Your Understanding on page 539.

39. What is the relationship between the significance level α and the probability of Type I Error?

40. How can we reduce the probability of a Type I error?

41. What is meant by the power of a significance test?

42. What is the relationship between Power and Type II Error? Will you be expected to calculate the power on the AP exam?

43. What four factors affect the power of a test? Why does this matter?

44. Describe the three influences that must be verified before deciding on how many observations are needed in a study.

• Significance Level

• Practical Importance

• Power

9.2 Tests about a Population Proportion (pp.549-561)

1. Summarize the three conditions that must be checked before carrying out significance tests:







2. State the general form of the “test statistic”.

3. What does the test statistic measure? Is this formula on the AP exam formula sheet?

4. Describe the four step process for signifigance tests. Explain what is required at each step.

• State

• Plan

• Do

• Conclude

5. What test statistic is used when testing for a population proportion? Is this on the formula sheet?

6. Summarize the one-sample z test for a proportion and sketch the three possible [pic]’s.

• Choose…

• To test…

• Find…

• Use this test…

• If Normaility is not met

7. What happens when the data does not support [pic]?

8. If asked to carry out a signifigance test and there is no ( provided, what is recommended?

9. Can you use confidence intervals to decide between two hypotheses? What is the advantage to using confidence intervals for this purpose?

10. Why don't we always use confidence intervals?

9.3 Tests about a Population Mean (pp.565-585)

1. What are the three conditions for conducting a significance test for a population mean?

2. What test statistic do we use when testing a population mean? Is this formula on the AP exam formula sheet?

3. How do you calculate p-values using the t-distributions?

4. What do you do if the degrees of freedom you need is not in table b?

5. How do you find p-values when carrying out a signifigance test about a population mean on the calculator?

6. For a one-sample t- test for a population mean, state:

• [pic]

• the three possible [pic]’s (with small sketches to illustrate)

• What is the t test statistic and how is it interpreted?

• Under what conditions can this test be used…

7. In terms of rejecting the hypothesis [pic], how is a significance test related to a confidence interval on the same population?

8. Use your calculator to find the p value (tcdf command) for the example Healthy Streams. What is that p-value?

9. When using technology for the "DO" part of the four step process, what is recommended on page 573?

10. Work through the Juicy Pineapple example on page 574. Use a calculator to find the exact P-value. Why is tcdf mulitplied by 2?

11. Why is the difference between using the calculator versus Table b when finding the p-value in this example?

12. Do we have encough evidence to reject [pic] in the Juicy Pineapple example? Explain.

13. Read the Check Your Understanding on page 577 and answer questions 1 and 2.

14. What is paired data?

15. What information would lead us to apply a paired t-test to a study, and what would be the statistic of interest?

16. In the example, Is Caffeine Dependence Real, explain the difference in the "Do" procedures for this example versus the Juicy Pineapple example.

17. Describe the four points to be aware of when interpreting signifigance tests.

Chapter 10: Comparing Two Populations or Groups

Key Vocabulary:

▪ difference between two

proportions

▪ two sample z interval for proportions

▪ two sample z test for difference between two proportions

▪ two sample z statistic

▪ two sample t statistic

▪ pooled combined sample proportion

▪ standard error

▪ randomization distribution

▪ paired t-test

▪ two sample t test for means

▪ two sample t interval for means

▪ difference between two means

▪ pooled two sample t statistic

[pic]

10.1 Comparing Two Proportions (pp. 604-618)

1. Summarize the three properties of a sampling distribution of a sample proportion:

• Shape

• Center

• Spread

2. What are the shape, center, and spread of the sampling distribution of [pic]? Provide the formulas for the mean and standard deviation.

• Shape

• Center

• Spread

3. What conditions need to be met for the sampling distribution of [pic]?

4. Give the formula for the standard error when calculating a confidence interval for[pic], and define each variable in the equation.

5. What is the confidence interval for [pic]?

6. What conditions must be met in order to use the Two-sample z Interval for a Difference between Two Proportions?

• Random

• Normal

• Independent

8. Use the example, Teens and Adults on Social Networking Sites, to outline how to construct and interpret a confidence interval for the difference between two proportions, [pic].

9. State the null hypothesis for a two proportion significance test.

10. What does [pic]represent, and how is it calculated?

11. Why do we pool the sample proportions?

12. Give the formula for the two-proportion z-statistic, and define each variable in the equation.

13. Is this on the formula sheet? What does the test statistic measure?

14. State and use diagrams to illustrate the three possible alternative hypotheses for a two proportion z-test.

15. What are the conditions for conducting a two-sample z test for a difference between proportions?

16. How are these different than the conditions for a one-sample z interval for p?

17. Describe the randomization distribution.

18. What must you be careful about when defining parameters in experiments? How can this be avoided?

19. Can you use your calculator for the Do step? Are there any drawbacks?

20. What are the calculator commands for the two-sample z test and interval for [pic]?

2. Comparing Two Means (pp.627-648)

1. Summarize the three properties of a sampling distribution of a sample mean:

• Shape

• Center

• Spread

2. What are the shape, center, and spread of the sampling distribution of [pic]? Give the formula for the mean and standard deviation.

• Shape

• Center

• Spread

3. What are the conditions for the sampling distribution of [pic]?

4. Give the formula for the two-sample t-statistic, and define each variable in the equation.

5. Is this on the formula sheet? What does it measure?

6. What is the standard error of [pic]? Is this on the formula sheet?

7. What distribution does the two-sample t statistic have?

8. Why do we use a t statistic rather than a z statistic?

9. Without using technology, how do you estimate the degrees of freedom when using two-sample t-procedures?

10. How do you calculate the confidence interval for [pic]?

11. In a two-sample t interval problem, what conditions must be met for comparing two means?

12. What are the conditions for conducting a two-sample t test for [pic]?

13. Draw a sketch of the three possible scenarios for the alternative hypothesis.

14. Describe the Normal Condition when using the two sample t procedures.

15. What calculator commands are used for a two-sample t test and interval for [pic]?

16. How do you proceed when using two-sample t procedures to check the Normal Condition in the following cases:

• Sample size less than 15

• Sample size at least 15

• Large samples

17. In a two-sample problem, must/should the two sample sizes be equal?

18. When doing two-sample t procedures, should we pool the data to estimate a common standard deviation? Is there any benefit? Are there any risks?

Chapter 11: Inference for Distributions of Categorical Data

Key Vocabulary:

▪ one way table

▪ chi-square test for goodness of fit

▪ chi-square statistic

▪ expected count

▪ observed count

▪ chi square distribution

▪ degrees of freedom

▪ chi-square distribution

▪ components of chi-square

▪ cell counts

▪ r x c table

▪ chi square test for homogeneity

▪ chi square test for association/independence

11.1 Chi-Square Goodness of Fit Test (pp.678-690)

1. What is a one-way table?

2. What is a chi-square goodness-of-fit test?

3. What is the difference between the notation X2 and χ2?

4. State the general form for the null hypotheses for a χ2 goodness of fit test.

5. State the general form for the alternative hypotheses for a χ2 goodness of fit test.

6. How do you calculate the expected counts for a chi-square goodness-of-fit test? How should you round the answer for the expected counts?

7. What is the shape of a chi-square distribution? What happens to the shape as the degrees of freedom increases? (Illustrate with a diagram)

8. Describe the center and spread of the chi-square distributions.

9. What is the chi-square test statistic? Is it on the formula sheet? What does it measure?

10. How many degrees of freedom does the chi-square distribution have?

11. What is the rule of thumb for all expected counts in a chi-square goodness of fit test?

12. What conditions must be met in order to use the goodness of fit test?

12. How do you calculate p-values using chi-square distributions?

14. Can you use your calculator to conduct a chi-square goodness-of-fit test? If yes, what are the calculator commands?

15.What is meant by a component of chi-square?

16. What does the largest component of chi-square signify?

17.Why is it necessary to perform follow-up analysis to a chi-square test?

11.2 Inference for Relationships (pp.696-721)

1. What is the hypothesis for a test of homogeneity?

2. Describe the complications with multiple comparisons? How are they overcome?

3. Explain how to calculate the expected counts for a test that compares the distribution of a categorical variable in multiple groups or populations.

4. Write the formula for the Chi-square test statistic? Is this on the AP Exam formula sheet?

5. What does the Chi-square test statistic measure?

6. What information is contained in a two-way table for a Chi-square test?

7. How many degrees of freedom does a chi-square test for a two-way table with r rows and c columns have?

8. What requirements must be checked before carrying out a Chi-square test for Homogeneity?

9. State the null and alternative hypothesis for the Chi-square test for Homogeneity?

10. Can you use your calculators to do a Chi-square test of homogeneity? If yes, what are the calculator commands?

11. Summarize how to carry out a Chi-square Test for Homogeneity of Populations:

12. Explain how and when to conduct a follow-up analysis for a test of homogeneity?

13. What does it mean if two variables have an association?

14. What does it mean if two variables are independent?

15. State the null and alternative hypotheses for a Chi-square test for Association/Independence.

16. How is a test of association/independence different than a test of homogeneity?

17. How do you calculate expected counts for a test of association/independence?

18. Summarize how to carry out a Chi-square Test for Association/Independence:

19. What are the conditions for a test of association/independence?

20. When should you use a chi-square test and when should you use a two-sample z test?

Chapter 12: More about Regression

Key Vocabulary:

▪ sample regression line

▪ true regression line

▪ t interval for slope

▪ standard error of slope

▪ t test for slope

▪ standardized test statistic

▪ standard error

▪ exponential model

▪ power model

▪ logarithmic model

12.1 Inference for Linear Regression (pp.739-757)

1. What is the difference between a sample regression line and population (true) regression line?

2. Explain the sampling distribution of b?

3. Give the equation for the true regression line, and state what each component of the equation represents.

4. Summarize the conditions for regression inference:

• L

• I

• N

• E

• R

5. Explain how to check the conditions for regression inference:

• L

• I

• N

• E

• R

6. Record the formula for the standard error of the slope? Define the variables.

7. What is the formula for the t-interval of the slope of a least-squares regression line? Is this on the AP exam formula sheet?

8. What is the formula for the t-test for the slope of the population regression line? Is this on the AP exam formula sheet?

9. Describe the distribution of the standardized test statistic [pic].

10. What is the formula for constructing a confidence interval for a slope?

11. What calculator commands are used to get the value of t*?

12. Can you use your calculator to conduct a test and confidence interval for the slope?

12.2 Transforming to Achieve Linearity (pp.765-783)

1. What does it mean to transform data?

2. What is a power model?

3. Give three examples of power models?

4. Aside from power transformations, how can you linearize an association that follows a power model in the form y = axp?

5. Describe a logarithmic model. Give two examples.

6. Describe an exponential model. Give two examples.

7. Describe the two methods used to linearize a relationship that follows an exponential model.

8. Show how to use logarithms to transform the data given by y = axp to produce a linear relationship.

9. The big idea using logarithms to transform data is that "if a variable grows __________________, its __________________ grow linearly."

10. Describe how to achieve linearity from a power model as explained on page 777.

11. After using a logarithm transformation, what does the scatter plot of the data show?

................
................

In order to avoid copyright disputes, this page is only a partial summary.

Google Online Preview   Download