ST 361 Normal Distribution



ST 361: Estimation --- Interval Estimation for [pic] (§7.2, 7.4)

Topics:

I. Interval estimation: confidence interval

II. (Two-sided) Confidence interval for estimating population mean[pic]

(a) When the population SD [pic] is known: use Z distribution (§7.2)

(b) When the population SD [pic] is NOT known: use t distribution (§7.4)

III. Two-sided confidence interval for estimating population mean difference[pic] (§7.5)

(a) when the population SD’s [pic] are known

(b) when the population SD’s [pic] are NOT unknown

----------------------------------------------------------------------------------------------------------------------

I. Interval Estimate = to use an interval of plausible values to estimate the parameter

❖ Why?

• Because of sampling variability, the point estimate is almost never exactly the correct value for the parameter

• Point estimates don’t tell us how close we are to the actual parameter

← So we use an interval call ___________________________to report the likely range for the parameter of interest. It has the general form of

❖ Confidence interval for the population mean [pic]

Consider a sample of [pic] (n [pic]30) is randomly selected from a population with mean [pic] and SD [pic]. To estimate [pic] we use the sample mean[pic].

1. Recall that [pic]~ _____________

2. Based on this normal distribution of [pic], we can show that the middle 95% of the [pic] fall within _____________________________________________________________

Notice that this is an interval centering at __________________________

3. However, in reality we don’t know [pic], and instead we only observe [pic] from the sample collected. So what we really want is an interval centering at [pic] with the same length, i.e., ________________. This interval is called _________________________________

4. Meaning of the 95% confidence interval (CI) “[pic]”:

[pic]

– If [pic]is from Region 2, then the interval [pic] _________________________

– If [pic]is from Region 1, then the interval [pic] _________________________

– If [pic]is from Region 3, then the interval [pic] _________________________

– Overall there are ______% of [pic]’s from Region 2. Hence _____________________

_______________________________________________________

– Meaning of the interval “[pic]”: ___________________________.

Thus we say that we have 95% of confidence that [pic] can be covered within the range [pic]

♣ Comment:

a) Note that a 95% CI does NOT interpret as 95% of CHANCE that μ is covered within the range [pic]!!!

b) Note that the fundamental assumption for constructing the CI for [pic] is that :

II. (a) Confidence interval for [pic] when the population SD [pic] is known (Ch7.2)

❖ If[pic] known, the CI for [pic] at a given confidence level is

Ex. What is the critical value z* at confidence level 98%?

Ex. What is the critical value z* at confidence level 95%?

► Comment: the higher the confidence level is, the wider or narrower (choose one) a CI becomes.

Ex. X is from a population with mean μ and SD σ =10. A sample of size=100 is collected and the sample mean is 5. What is the 90% CI for [pic]?

Ex. The caffeine content (in mg) was examined for a random sample of 50 cups of black coffee dispensed by a new machine and the mean was 110mg. Denote by μ the mean caffeine content for cups dispensed by the machine, and assume the population SD of caffeine content for cups dispensed by the machine is 7.1mg.

(1) Find an unbiased estimate for μ. and explain why this estimate is unbiased.

(2) Construct the 99% percent CI for μ..

II. (b) Confidence interval for [pic] when the population SD [pic] is NOT known (Ch 7.4)

❖ When [pic]unknown, the CI for [pic] at a given confidence level is

Why use t* instead?

• In practice, most of the time [pic]is not known. To calculate CI for[pic], we have to use _______ instead of [pic].[pic]

• If [pic]~Normal and [pic] is known, [pic]~ Z, and hence we use a z critical value.

• If [pic]~Normal and [pic] (instead of σ) is used in standardization, [pic] is not normally distributed. Instead, it follows a __________

--------------------------------------------------------------------------------------------------------------------------------------

❖ The t distribution:

← t distribution is similar to the standard normal distribution (the Z distribution) in many aspects:

(1) (2) (3)

← However, it has _____________________than the Z distribution. Different sample size results in different thickness of the tail in a t distribution. The smaller the sample size is, ________________________________________

← Each t distribution has a degree of freedom (df) associated with it. The df is defined as ___________ and the corresponding t distribution is denoted by ____________

❖ Use t- table to find the critical value

Page 566 Table IV

Ex. Use the t table to find 95% and 99% t-critical value for each of the following sample size:

|Sample size |Degree of freedom (df) = n-1 |t* (i.e., t-critical value) |

|n | | |

| | |95% |99% |

|3 | | | |

|6 |5 |2.571 |4.032 |

|12 |11 |2.201 |3.106 |

|65 | | | |

|[pic] | | | |

Note: When df>30, there is little difference in the area under the curve between the t curves and the z curve. Therefore the textbook recommends to use a z critical value when df>30. But in our class, I’ll ask you always to use t critical value when σ is unknown. When n is large, t* will become z* automatically.

Ex7. X~Normal distribution. n=25, [pic]=8 and s=2. What is the 95% CI for the population mean [pic]?

Ex8. X= # of claims received (per week) by an insurance company. Based on 41 weeks of samples, [pic] and s=20.0. What is the 95% CI for [pic]

[pic]

❖ Interval Length and Margin of Error (ch7.2)

a) Interval Length = ____________________________

b) Margin of Error (MOE) , aka _________________________________________, is defined as ___________________________________________

EX. Heights of the NCSU undergraduates is normally distributed with mean [pic] and SD [pic] One took a sample of size n=100 and measured the heights of each individual. The sample mean is [pic]. What is the margin of error at the confidence level of 95%?

– With a fixed confidence level, say, 95%, a wider CI or a narrower CI is better?

– With a fixed confidence level, how can one reduce MOE?

Ex. (Continued from the example above) John wishes to shorten the margin of error by half without compromising on the level of confidence, which is set to be 95%. What is the sample size John should use?

-----------------------

• When the sample size is very large (i.e., >120), t(n-1)o˜p˜q˜r˜t˜u˜w˜x˜y˜{˜€˜Ú˜Û˜î˜ï˜ð˜ñ˜™™)™íÖÁ°Ÿ°‘€°n°Á°\EÁ°Á°-j©­hnTÂhO

ØCJEHúÿOJ[?]QJ[?]U[pic]^J[?]aJ#j |’E[pic]hnTÂhO

ØCJU[pic]V[pic]aJ#hnTÂhO

Ø5?CJOJ[?]≈ Z !!

................
................

In order to avoid copyright disputes, this page is only a partial summary.

Google Online Preview   Download