7 The Laws of Large Numbers - Duke University
STA 205: Probability & Measure Theory
Robert L. Wolpert
7
The Laws of Large Numbers
The traditional interpretation of the probability of an event E is its asymptotic frequency: the limit as n ¡ú ¡Þ of the fraction of n repeated, similar,
and independent trials in which E occurs. Similarly the ¡°expectation¡± of a
random variable X is taken to be its asymptotic average, the limit as n ¡ú ¡Þ
of the average of n repeated, similar, and independent replications of X. For
statisticians trying to make inference about the underlying probability distribution f (x|¦È) governing observed random variables Xi , this suggests that
we should be interested in the probability P
distribution for large n of quantities like the average of the RV¡¯s, X?n ¡Ô n1 ni=1 Xi .
Three of the most celebrated theorems of probability theory concern this
sum. For independent random variables Xi , all with the same probability
distribution
satisfying E|Xi |3 < ¡Þ, set ? = EXi , ¦Ò 2 = E|Xi ? ?|2 , and
Pn
Sn = i=1 Xi . The three main results are:
Laws of Large Numbers:
Sn ? n?
?¡ú 0
¦Òn
Central Limit Theorem:
Sn ? n?
¡Ì
=? No(0, 1)
¦Ò n
Law of the Iterated Logarithm:
Sn ? n?
= 1.0
lim sup ¡À ¡Ì
¦Ò 2n log log n
n¡ú¡Þ
(i.p. and a.s.)
(in dist.)
(a.s.)
Together these three give a clear picture of how quickly and in what sense
1
n Sn tends to ?. We begin with the Law of Large Numbers (LLN), in its
¡°weak¡± form (asserting convergence i.p.) and in its ¡°strong¡± form (convergence a.s.). There are several versions of both theorems. The simplest
requires the Xi to be IID and L2 ; stronger results allow us to weaken (but
not eliminate) the independence requirement, permit non-identical distributions, and consider what happens if we relax the L3 requirement and allow
the RV¡¯s to be only L2 or L1 (or worse!).
The text covers these things well; to complement it I am going to: (1)
Prove the simplest version, and with it the Borel-Cantelli theorems; and (2)
1
STA 205
Week 7
R L Wolpert
Show what happens with Cauchy random variables, which don¡¯t satisfy the
requirements (the LLN fails).
7.1
Proofs of the Weak and Strong Laws
Here are two simple versions (one Weak, one Strong) of the Law of Large
Numbers; first we prove an elementary but very useful result:
Proposition 1 (Markov¡¯s Inequality) Let ¦Õ(x) ¡Ý 0 be non-decreasing
on R+ . For any random variable X ¡Ý 0 and constant a ¡Ê R+ ,
P[X ¡Ý a] ¡Ü P[¦Õ(X) ¡Ý ¦Õ(a)] ¡Ü E[¦Õ(X)]/¦Õ(a)
To see this, set Y = ¦Õ(a)1A for the event A = {¦Õ(X) ¡Ý ¦Õ(a)} and note
Y ¡Ü ¦Õ(X) so EY ¡Ü E¦Õ(X).
Theorem 1 (L2 WLLN) Let {Xn } be independent random variables with
2
the same mean ? = E[Xn ] and uniformly bounded
P variance E(Xn ? ?) ¡Ü B
for some fixed bound B < ¡Þ. Set Sn =
j¡Ün Xj and X?n ¡Ô Sn /n =
1 P
j¡Ün Xj . Then:
n
(?? > 0)
P[|X?n ? ?| > ?] ¡ú 0.
(1)
Proof.
E(Sn ? n?)2 =
n
X
i=1
E(Xi ? ?)2 ¡Ü n B
so for ? > 0
P[|X?n ? ?| > ?] = P[(Sn ? n?)2 > (n?)2 ]
¡Ü E[(Sn ? n?)2 ]/n2 ?2
¡Ü B/n ?2 ¡ú 0
as n ¡ú ¡Þ.
This Law of Large Numbers is called weak because its conclusion is only
that X?n converges to zero in probability (Eqn (1)); the strong Law of Large
Numbers asserts convergence of a stronger sort, called almost sure convergence (Eqn (2) below). If P[|X?n ? ?| > ?] were summable then by B-C
Page 2
July 12, 2011
STA 205
Week 7
R L Wolpert
we could conclude almost-sure convergence; unfortunately we have only the
bound P[|X?n ? ?| > ?] < c/n which tends to zero but isn¡¯t summable. It
is summable along the subsequence n2 , however; our approach to proving a
strong LLN is to show that |Sk ? Sn2 | isn¡¯t too big for any n2 < k < (n + 1)2 .
Theorem 2 (L2 SLLN) Under the same conditions,
P[X?n ¡ú ?] = 1.
(2)
Proof. Without loss of generalityPtake ? = 0 (otherwise subtract ? from
each Xn ), and fix ? > 0. Set Sn ¡Ô j¡Ün Xj . Then
P[|Sn | > n?] ¡Ü E|Sn |2 /(n?)2 ¡Ü nB/n2 ?2 = B/n?2
P[|Sn2 | > n2 ? i.o.] = 0 by B-C ? Sn2 /n2 ¡ú 0 a.s. Set
Dn ¡Ô
EDn2
max
n2 ¡Ük 1; Xi ¡Ê
/ L1 ,
n2 log n
Sn /n 6¡ú 0 i.p. or a.s.
but Sn /n ¡ú 0 i.p. and not
D. Medians: for ANY RV¡¯s Xn ¡ú X¡Þ i.p., then mn ¡ú m¡Þ if m¡Þ is
unique.
Page 4
July 12, 2011
STA 205
Week 7
R L Wolpert
Let Xi be iid standard Cauchy RV¡¯s, with
P[X1 ¡Ü t] =
Z
t
dx¦Ð[1 + x2 ] =
?¡Þ
1 1
+ arctan(t)
2 ¦Ð
and characteristic function
E ei¦ØX1 =
Z
¡Þ
ei¦Øx
?¡Þ
dx
= e?|¦Ø| ,
¦Ð[1 + x2 ]
so Sn /n has characteristic function
i¦ØSn /n
Ee
[X1 +¡¤¡¤¡¤+Xn ]
i¦Ø
n
= Ee
n
¦Ø
i¦Ø
X1
n
= Ee
= (e?| n | )n = e?|¦Ø|
and Sn /n also has the standard Cauchy distribution with P[Sn /n ¡Ü t] =
1
1
2 + ¦Ð arctan(t); in particular, Sn /n does not converge almost surely, or even
in probability.
7.2
An LLN for Correlated Sequences
In many applications we would like a Law of Large Numbers for sequences of
random variables that are not independent; for example, in Markov Chain
Monte Carlo integration, we have a stationary Markov chain {Xt } (this
means that the distribution of Xt is the same for all t and that the conditional distribution of Xu for u > t, given {Xs |s ¡Ü t}, depends only on Xt )
and want to estimate the population mean E[¦Õ(Xt )] for some function ¦Õ(¡¤)
by the sample mean
T ?1
1 X
¦Õ(Xt ).
E[¦Õ(Xt )] ¡Ö
T t=0
Even though they are identically distributed, the random variables Yt ¡Ô
¦Õ(Xt ) won¡¯t be independent if the Xt aren¡¯t independent, so the LLN we
already have doesn¡¯t quite apply.
A sequence of random variables Yt is called stationary if each Yt has the same
probability distribution and, moreover, each finite set (Yt1 +h , Yt2 +h , ..., Ytk +h )
has a joint distribution that doesn¡¯t depend on h. The sequence is called
¡°L2 ¡± if each Yt has a finite variance ¦Ò 2 (and hence also a well-defined mean
?); by stationarity it also follows that the covariance
¦Ãst = E[(Ys ? ?)(Yt ? ?)]
is finite and depends only on the absolute difference |t ? s|.
Page 5
July 12, 2011
................
................
In order to avoid copyright disputes, this page is only a partial summary.
To fulfill the demand for quickly locating and searching documents.
It is intelligent file search solution for home and business.
Related download
- a model of non belief in the law of large numbers
- section 6 3 law large numbers university of iowa
- 01 23 4 5 1 67 4 2 7 1 ed
- yuma county community health needs assessment
- 7 the laws of large numbers duke university
- notes 4 laws of large numbers department of mathematics
- law of large numbers central limit theorem university of california
- uniform laws of large numbers stanford university
- reading 6b central limit theorem and the law of large numbers
- the law of large numbers
Related searches
- the laws of the us
- all the laws of america
- duke university nonprofit certificate program
- duke university nonprofit management program
- duke university certificate in nonprofit
- is duke university a 501c3
- duke university executive leadership program
- what are the laws of logic
- duke university nonprofit leadership
- formulas for the laws of motion
- what are the laws of planetary motion
- duke university ein number