Objectives - Olivia Misa



ObjectivesCompute and interpret z-scoresCompute the quartiles of a data setCompute the percentiles of a data setCompute the five-number summary for a data setUnderstand the effects of outliersConstruct boxplots to visualize the five-number summary and outliersObjective 1Compute and Interpret z-scoresZ-ScoreWho is taller, a man 73 inches tall or a woman 68 inches tall? The obvious answer is that the man is taller. However, men are taller than women on the average. Suppose the question is asked this way: Who is taller relative to their gender, a man 73 inches tall or a woman 68 inches tall? One way to answer this question is with a z-score.The z-score of an individual data value tells how many _____________________________ that value is from its population mean. For example, a value one standard deviation above the mean has a z-score of ____________ and a value two standard deviations below the mean has a z-score of ____________. -561075689Let x be a value from a population with mean μ and standard deviation σ. The z-score for x?is z = . ____________________00Let x be a value from a population with mean μ and standard deviation σ. The z-score for x?is z = . ____________________Example:A National Center for Health Statistics study states that the mean height for adult men in the U.S. is μ = 69.4 inches, with a standard deviation of σ = 3.1 inches. The mean height for adult women is μ = 63.8 inches, with a standard deviation of σ = 2.8 inches. Who is taller relative to their gender, a man 73 inches tall, or a woman 68 inches tall?Solution:Z-Scores & The Empirical RuleSince the z-score is the number of standard deviations from the mean, we can easily interpret the z-score for bell-shaped populations using The Empirical Rule.When a population has a histogram that is approximately bell-shaped, thenApproximately 68% of the data will have z-scores between –1 and 1.Approximately 95% of the data will have z-scores between –2 and 2.All, or almost all of the data will have z-scores between –3 and 3.Objective 2Compute the Quartiles of a Data SetQuartilesIn a previous section, we learned how to compute the mean and median of a data set as measures of the center. Sometimes, it is useful to compute measures of position other than the center to get a more detailed description of the distribution. Quartiles divide a data set into four approximately equal pieces.Quartiles:Every data set has three quartiles:The __________________________, denoted Q1 separates the lowest __________ of the data from the highest __________.The __________________________, denoted Q2 separates the lowest __________ of the data from the highest __________. Q2 is the same as the median.The __________________________, denoted Q3 separates the lowest __________ of the data from the highest __________.Computing QuartilesThere are several methods for computing quartiles, all of which give similar results. The following procedure is one fairly straightforward method:Step 1:Arrange the data in increasing order.Step 2:Let n be the number of values in the data set. To compute the second quartile, simply compute the median. For the first or third quartiles, proceed as follows:For the first quartile, compute L=0.25nFor the third quartile, compute L=0.75nStep 3:If L is a whole number, the quartile is the average of the number in position L and the number in position L+1.If L is not a whole number, round it up to the next higher whole number. The quartile is the number in the position corresponding to the rounded-up value.Example:The following table presents the annual rainfall, in inches, in Los Angeles during the month of February from 1969 to 2013. Compute the quartiles for the data.0.000.080.130.140.160.170.200.290.560.670.700.921.221.301.481.641.721.902.372.582.843.063.123.213.293.543.573.714.134.274.374.644.894.945.546.106.617.897.968.038.878.9111.0212.7513.68Solution:Quartiles on the TI-84 plusThe 1-Var Stats command in the TI-84 PLUS Calculator displays a list of the most common parameters and statistics for a given data set. This command is accessed by pressing STAT and then highlighting the CALC menu.Example: Quartiles on the TI-84 PLUS Step 1:Enter the data in L1.Step 2:Press STAT and highlight the CALC menu.Step 3:Select 1-Var Stats and press ENTER. Enter L1 in the List field and run the command.Note: If your calculator does not support Stat Wizards, enter L1 next to the 1-Var Stats command on the home screen and press enter to run the commandThe quartile values produced by the TI-84 PLUS may differ from results obtained by hand because it uses a slightly different procedure than the one described in the text.Visualizing the Quartiles Following is a dotplot of the Los Angeles rainfall data with the quartiles indicated. The quartiles divide the data set into four parts, with approximately 25% of the data in each part.Objective 3Compute the Percentiles of a Data Set41623517789400Quartiles describe the shape of a distribution by dividing it into fourths. Sometimes it is useful to divide a data set into a greater number of pieces to get a more detailed description of the distribution. ________________________ divide a data set into hundredths.Percentiles:For a number p between 1 and 99, the pth percentile separates the lowest p% of the data from the highest (1– p)%.Computing PercentilesThe following procedure computes the pth percentile of a data set:Step 1:Arrange the data in increasing order.Step 2:Let n be the number of values in the data set. For the pth percentile, compute L=p100n.Step 3:If L is a whole number, the pth percentile is the average of the number in position L and the number in position L+1.If L is not a whole number, round it up to the next higher whole number. The pth percentile is the number in the position corresponding to the rounded-up value.Example:The following table presents the annual rainfall, in inches, in Los Angeles during the month of February from 1969 to 2013. Compute the 60th percentile for the data.0.000.080.130.140.160.170.200.290.560.670.700.921.221.301.481.641.721.902.372.582.843.063.123.213.293.543.573.714.134.274.374.644.894.945.546.106.617.897.968.038.878.9111.0212.7513.68Solution:Computing a Percentile from a Given Data ValueSometimes we are given a value from a data set and wish to compute the percentile corresponding to that value. Following is the procedure for doing this:Step 1:Arrange the data in increasing order.Step 2:Let x be the data value whose percentile is to be computed. Use the following formula to compute the percentile:Percentile = 100?Number?of?values?less?than?x+0.5Number?of?values?in?the?data?setRound the result to the nearest whole number. This is the percentile corresponding to the value x.Example:The following table presents the annual rainfall in Los Angeles during February from 1969 to 2013. In 1989, the rainfall was 1.90. What percentile does this correspond to?0.000.080.130.140.160.170.200.290.560.670.700.921.221.301.481.641.721.902.372.582.843.063.123.213.293.543.573.714.134.274.374.644.894.945.546.106.617.897.968.038.878.9111.0212.7513.68Solution:Objective 4Compute the Five-Number Summary for a Data setThe ___________________________________________ of a data set consists of the median, the first quartile, the third quartile, the smallest value, and the largest value. These values are generally arranged in order.Example:Recall the Los Angeles annual rainfall data. Compute the five-number summary.0.000.080.130.140.160.170.200.290.560.670.700.921.221.301.481.641.721.902.372.582.843.063.123.213.293.543.573.714.134.274.374.644.894.945.546.106.617.897.968.038.878.9111.0212.7513.68Solution:When using the TI-84 PLUS Calculator, the five-number summary is given by the 1-Var Stats command.Objective 5Understand the Effects of OutliersAn _______________ is a value that is considerably larger or considerably smaller than most of the values in a data set. Some outliers result from errors; for example a misplaced decimal point may cause a number to be much larger or smaller than the other values in a data set. Some outliers are correct values, and simply reflect the fact that the population contains some extreme values.Example:The temperature in a downtown location is measured for eight consecutive days during the summer. The readings, in Fahrenheit, are81.2 85.6 89.3 91.0 83.2 8.45 79.5 87.8Which reading is an outlier? Is the outlier an error or is it possible that it is correct?Solution:Interquartile RangeOne method for detecting outliers involves a measure called the Interquartile Range.Interquartile RangeThe interquartile range is found by subtracting the __________ quartile from the __________ quartile.IQR = __________________________IQR Method for Detecting OutliersThe most frequent method used to detect outliers in a data set is the IQR Method. The procedure for the IQR Method is:Step 1:Find the first quartile Q1, and the third quartile Q3.Step 2:Compute the interquartile range: IQR = Q3-Q1.Step 3:Compute the outlier boundaries. These boundaries are the cutoff points for determining outliers:Lower Outlier Boundary = Q1 – 1.5(IQR)Upper Outlier Boundary = Q3 + 1.5(IQR)Step 4:Any data value that is less than the lower outlier boundary or greater than the upper outlier boundary is considered to be an outlier.Example:The following table presents the number of students absent in a middle school in northwestern Montana for each school day in January. Identify any outliers.65 67 71 57 51 49 44 41 59 49 42 5645 77 44 42 45 46 100 59 53 51Solution:Objective 6Construct Boxplots to Visualize the Five-Number Summary and OutliersA _____________________ is a graph that presents the five-number summary along with some additional information about a data set. There are several different kinds of boxplots. The one we describe here is sometimes called a __________________________________.Example:The following table presents the number of students absent in a middle school in northwestern Montana for each school day in January. Identify any outliers.65 67 71 57 51 49 44 41 59 49 42 5645 77 44 42 45 46 100 59 53 51Solution:Boxplots on the TI-84 PLUS The following steps will create a boxplot for the student absences data on the TI-84 PLUS.Step 1:Enter the data in L1.Step 2:Press 2nd,Y=, then 1 to access the Plot1 menu. Select On and the boxplot type.Step 3:Press Zoom, 9 to view the plot.Determining the Shape of a Data Set from a Boxplot4319270698500Boxplots can be used to determine skewness in a data set.If the median is closer to the first quartile than to the third quartile, or the upper whisker is longer than the lower whisker, the data are skewed to the right.If the median is closer to the third quartile than to the first quartile, or the lower whisker is longer than the upper whisker, the data are skewed to the left.If the median is approximately halfway between the first and third quartiles, and the two whiskers are approximately equal in length, the data are approximately symmetricYOU SHOULD KNOW …How to compute and interpret z-scoresHow to compute the quartiles of a data setHow to compute a percentile of a data setHow to compute the percentile corresponding to a given data valueHow to find the five-number summary for a data setHow to determine outliers using the IQR methodHow to construct a boxplot and use it to determine skewness ................
................

In order to avoid copyright disputes, this page is only a partial summary.

Google Online Preview   Download