<?xml version="1.0"?>
<!-- name="generator" content="blosxom/2.0" -->
<!DOCTYPE rss PUBLIC "-//Netscape Communications//DTD RSS 0.91//EN" "http://my.netscape.com/publish/formats/rss-0.91.dtd">

<rss version="0.91">
  <channel>
    <title>Notebooks   </title>
    <link>http://bactra.org/notebooks</link>
    <description>Cosma's Notebooks</description>
    <language>en</language>

  <item>
    <title>Empirical Process Theory</title>
    <link>http://bactra.org/notebooks/2010/01/04#empirical-process-theory</link>
    <description>
&lt;P&gt;(I first used the next few paragraphs as part of
a &lt;a href=&quot;../weblog/algae-2008-07.html#pollard&quot;&gt;review&lt;/a&gt; of Pollard's book
of lecture notes.  I have no shame about self-plagiarism.)

&lt;P&gt;The simplest sort of empirical process arises when trying to estimate a
probability distribution from sample data.  The difference between the
&lt;a href=&quot;http://en.wikipedia.org/wiki/Empirical_distribution_function&quot;&gt;empirical
distribution function&lt;/a&gt; &lt;em&gt;F&lt;sub&gt;n&lt;/sub&gt;&lt;/em&gt;(&lt;em&gt;x&lt;/em&gt;) and the true
distribution function &lt;em&gt;F&lt;/em&gt;(&lt;em&gt;x&lt;/em&gt;) converges to zero everywhere (by
the law of large numbers), and &amp;mdash; this is non-trivial &amp;mdash;
the &lt;em&gt;maximum&lt;/em&gt; difference between the empirical and true distribution
functions converges to zero, too (by
the &lt;a href=&quot;http://en.wikipedia.org/wiki/Glivenko-Cantelli_theorem&quot;&gt;Glivenko-Cantelli&lt;/a&gt;
theorem, a &lt;em&gt;uniform&lt;/em&gt; law of large numbers).  The &quot;empirical
process&quot; &lt;em&gt;E&lt;sub&gt;n&lt;/sub&gt;&lt;/em&gt;(&lt;em&gt;x&lt;/em&gt;) is the re-scaled
difference, &lt;em&gt;n&lt;/em&gt;&lt;sup&gt;1/2&lt;/sup&gt;[&lt;em&gt;F&lt;sub&gt;n&lt;/sub&gt;&lt;/em&gt;(&lt;em&gt;x&lt;/em&gt;)
- &lt;em&gt;F&lt;/em&gt;(&lt;em&gt;x&lt;/em&gt;)], and &lt;em&gt;it&lt;/em&gt; converges to a Gaussian stochastic
process that only depends on the true distribution (by
the &lt;a href=&quot;http://en.wikipedia.org/wiki/Donsker%27s_theorem&quot;&gt;functional
central limit theorem&lt;/a&gt;).  Empirical process theory is concerned with
generalizing this sort of material to other stochastic processes determined by
random samples, and indexed by infinite classes (like the real line, or the
class of all Borel sets on the line, or some space parameterizing
a &lt;a href=&quot;regression.html&quot;&gt;regression&lt;/a&gt; model).  The typical objects of
concern are proving uniform limit theorems, and with establishing
distributional limits.  (For instance, one might one want to prove that the
errors of &lt;em&gt;all&lt;/em&gt; possible regression models in some class will come close
to their expected errors, so that maximum-likelihood or least-squares
estimation is consistent.  [For more on that line of thought,
see &lt;a href=&quot;http://www.stat.math.ethz.ch/~geer/&quot;&gt;Sara van de Geer's&lt;/a&gt;
book.])  This endeavor is closely linked to
&lt;a href=&quot;learning-theory.html&quot;&gt;Vapnik-Chervonenkis-style learning theory&lt;/a&gt;,
and in fact one can see VC theory as an application of empirical process
theory.

&lt;P&gt;As usual, I am most interested in results
for &lt;a href=&quot;dependent-learning.html&quot;&gt;dependent data&lt;/a&gt;.


&lt;ul&gt;Recommended:
	&lt;li&gt;Bruce E. Hansen, &quot;The Likelihood Ratio Test Under Nonstandard
Conditions: Testing the Markov Switching Model of GNP&quot;, &lt;cite&gt;Journal of
Applied Econometrics&lt;/cite&gt;
&lt;strong&gt;7&lt;/strong&gt; (1992): S61--S82 [I very much like the approach of treating
the likelihood ratio as an empirical process; why haven't I seen it before?
(Also, the state-of-the-art in simulating Gaussian processes must be much
better now than what Hansen had in '92, which would make this even more
practical.) &lt;a href=&quot;http://www.ssc.wisc.edu/~bhansen/papers/jae_92.pdf&quot;&gt;PDF
reprint&lt;/a&gt;.]
	&lt;li&gt;&lt;a href=&quot;http://web.me.com/pascal.massart/Site/Home.html&quot;&gt;Pascal Massart&lt;/a&gt;, &lt;cite&gt;Concentration Inequalities and Model
Selection&lt;/cite&gt; [Using empirical process theory to get finite-sample, i.e.,
non-asymptotic, risk bounds for various forms
of &lt;a href=&quot;model-selection.html&quot;&gt;model selection&lt;/a&gt;.  Available for free as
a &lt;a href=&quot;http://eprints.pascal-network.org/archive/00002827/&quot;&gt;large PDF
preprint&lt;/a&gt;.]
	&lt;li&gt;David Pollard
		&lt;ul&gt;
		&lt;li&gt;&quot;Asymptotics via Empirical Processes&quot;,
&lt;cite&gt;Statistical Science&lt;/cite&gt; &lt;strong&gt;4&lt;/strong&gt; (1989): 341--354
		&lt;li&gt;&lt;cite&gt;Convergence of Stochastic Processes&lt;/cite&gt;
		&lt;li&gt;&lt;cite&gt;Empirical Processes: Theory and Applications&lt;/cite&gt; [&lt;a href=&quot;../weblog/algae-2008-07.html#pollard&quot;&gt;Mini-review&lt;/a&gt;]
		&lt;/ul&gt;
	&lt;li&gt;Sara van de Geer, &lt;cite&gt;Empirical Processes in M-Estimation&lt;/cite&gt;
	&lt;li&gt;Mathukumalli Vidyasagar, &lt;cite&gt;A Theory of Learning and
Generalization: With Applications to Neural Networks and Control Systems&lt;/cite&gt;
[&lt;a href=&quot;../weblog/algae-209-01.html#vidyasagar&quot;&gt;Mini-review&lt;/a&gt;]
	&lt;/ul&gt;

&lt;ul&gt;To read:
	&lt;li&gt;Radoslaw Adamczak, &quot;A tail inequality for suprema of unbounded
empirical processes with applications to Markov
chains&quot;, &lt;a href=&quot;http://arxiv.org/abs/0709.3110&quot;&gt;arxiv:0709.3110&lt;/a&gt;
	&lt;li&gt;Herold Dehling (ed.), &lt;cite&gt;Empirical Process Techniques for
Dependent Data&lt;/cite&gt; [&lt;a href=&quot;http://www.springer.com/978-0-8176-4201-3&quot;&gt;blurb&lt;/a&gt;]
	&lt;li&gt;Michael R. Kosorok, &lt;cite&gt;Introduction to Empirical Processes and
Semiparametric Inference&lt;/cite&gt;
[&lt;a href=&quot;http://www.bios.unc.edu/~kosorok/current.pdf&quot;&gt;PDF preprint&lt;/a&gt;;
&lt;a href=&quot;http://www.springer.com/978-0-387-74977-8&quot;&gt;blurb&lt;/a&gt;]
	&lt;li&gt;D. Marinucci, &quot;The Empirical Process for Bivariate Sequences with
Long Memory&quot;, &lt;a
href=&quot;http://dx.doi.org/10.1007/s11203-004-2790-9&quot;&gt;&lt;cite&gt;Statistical Inference
for Stochastic Processes&lt;/cite&gt; &lt;strong&gt;8&lt;/strong&gt; (2005): 205--224&lt;/a&gt;
	&lt;li&gt;Richard Samworth and Oliver Johnson, &quot;The empirical process in
Mallows distance, with application to goodness-of-fit tests&quot;, &lt;a
href=&quot;http://arxiv.org/abs/math.ST/0504424&quot;&gt;math.ST/0504424&lt;/a&gt;
	&lt;li&gt;Aad W. van der Vaart, Jon A. Wellner
		&lt;ul&gt;
		&lt;li&gt;&quot;Empirical processes indexed
by estimated
functions&quot;, &lt;a href=&quot;http://arxiv.org/abs/07-9.1013&quot;&gt;arxiv:0709.1013&lt;/a&gt; [&quot;We
consider the convergence of empirical processes indexed by functions that
depend on an estimated parameter $\eta$ and give several alternative conditions
under which the ``estimated parameter'' $\eta_n$ can be replaced by its natural
limit $\eta_0$ uniformly in some other indexing set $\Theta$&quot;]
		&lt;li&gt;&lt;cite&gt;Weak Convergence and Empirical Processes: With
Applications to Statistics&lt;/cite&gt;
		&lt;/ul&gt;
	&lt;li&gt;Bin Yu, &quot;Rates of Convergence for Empirical Processes of Stationary
Mixing Sequences,&quot; &lt;a href=&quot;http://projecteuclid.org/euclid.aop/1176988849&quot;&gt;&lt;cite&gt;Annals of Probability&lt;/cite&gt; &lt;strong&gt;22&lt;/strong&gt;
(1994): 94--116&lt;/a&gt;
	&lt;/ul&gt;
</description>
  </item>
  </channel>
</rss>