<?xml version="1.0"?>
<!-- name="generator" content="blosxom/2.0" -->
<!DOCTYPE rss PUBLIC "-//Netscape Communications//DTD RSS 0.91//EN" "http://my.netscape.com/publish/formats/rss-0.91.dtd">

<rss version="0.91">
  <channel>
    <title>Notebooks   </title>
    <link>http://bactra.org/notebooks</link>
    <description>Cosma's Notebooks</description>
    <language>en</language>

  <item>
    <title>Forecasting Non-Stationary Processes</title>
    <link>http://bactra.org/notebooks/2011/06/28#non-stationary-forecasting</link>
    <description>
&lt;P&gt;Some non-stationary processes are in fact easy to forecast: periodic ones,
for example, are strictly speaking not stationary.  An ergodic Markov chain
started far from its invariant distribution is also non-stationary, but easy to
predict (it will approach the stationary distribution).  Both of these cases
are conditionally stationary, which I think is all that's really needed.

&lt;P&gt;What's more interesting is the problem of so to speak &lt;em&gt;really&lt;/em&gt;
non-stationary processes.  It's hard to imagine that there is any way to truly
predict an &lt;em&gt;arbitrary&lt;/em&gt; non-stationary process.  (Basically: as soon as
you think you have established a trend-line, the Adversary can always reverse
the trend, without creating any problems of consistency with earlier data.)  If
you can constrain the class of allowable non-stationary processes, however,
then something might be possible.  Alternately, one might lower expectations,
not to actually predicting well, but to predicting with low regret.

&lt;P&gt;I actually have an Idea about using model averaging here, but need to find
the time to work on it.

&lt;P&gt;See also:
	&lt;a href=&quot;ensemble-ml.html&quot;&gt;Ensemble Methods in Machine Learning&lt;/a&gt;;
	&lt;a href=&quot;time-series.html&quot;&gt;Time Series&lt;/a&gt;;
	&lt;a href=&quot;universal-prediction.html&quot;&gt;Universal Prediction&lt;/a&gt;

&lt;ul&gt;Recommended (very misc):
	&lt;li&gt;S. Caires and J. A. Ferreira, &quot;On the Non-parametric Prediction of
Conditionally Stationary Sequences&quot;, &lt;a
href=&quot;http://dx.doi.org/10.1007/s11203-004-0383-2&quot;&gt;&lt;cite&gt;Statistical Inference
for Stochastic Processes&lt;/cite&gt; &lt;strong&gt;8&lt;/strong&gt; (2005): 151--184&lt;/a&gt;
	&lt;li&gt;R. Dahlhaus, &quot;Fitting Time Series Models to Nonstationary
Processes&quot;,
&lt;a href=&quot;http://projecteuclid.org/euclid.aos/1034276620&quot;&gt;&lt;cite&gt;Annals of Statistics&lt;/cite&gt; &lt;strong&gt;25&lt;/strong&gt; (1997): 1--37&lt;/a&gt;
	&lt;li&gt;Mark Herbster and Manfred K. Warmuth, &quot;Tracking the Best
Expert&quot;, &lt;a href=&quot;http://dx.doi.org/10.1023/A:1007424614876&quot;&gt;&lt;cite&gt;Machine Learning&lt;/cite&gt; &lt;strong&gt;32&lt;/strong&gt; (1998): 151--178&lt;/a&gt;
[&lt;a href=&quot;http://www.cse.ucsc.edu/~mark/papers/track-long.ps&quot;&gt;PS version&lt;/a&gt;
via &lt;a href=&quot;http://www.cs.ucl.ac.uk/staff/m.herbster/&quot;&gt;Dr. Herbster&lt;/a&gt;]
	&lt;li&gt;Elad Hazan and Satyen Kale, &quot;Extracting certainty from uncertainty: regret bounded by variation in costs&quot;, &lt;a href=&quot;http://dx.doi.org/10.1007/s10994-010-5175-x&quot;&gt;&lt;cite&gt;Machine Learning&lt;/cite&gt; &lt;strong&gt;80&lt;/strong&gt; (2010): 165--188&lt;/a&gt;
	&lt;LI&gt;Jeremy Zico Kolter and Marcus A. Maloof
		&lt;ul&gt;
		&lt;li&gt;&quot;Dynamic Weighted Majority: An
Ensemble Method for Drifting
Concepts&quot;, &lt;a href=&quot;http://jmlr.csail.mit.edu/papers/v8/kolter07a.html&quot;&gt;&lt;cite&gt;Journal
of Machine Learning Research&lt;/cite&gt; &lt;strong&gt;8&lt;/strong&gt; (2007): 2755--2790&lt;/a&gt;
		&lt;li&gt;&quot;Using Additive Expert
Ensembles to Cope with Concept Drift&quot;, ICML 2005
[&lt;a href=&quot;http://ai.stanford.edu/~kolter/lib/exe/fetch.php?media=pubs:kolter-icml05.pdf&quot;&gt;PDF
reprint&lt;/a&gt; via Kolter]
		&lt;/ul&gt;
	&lt;li&gt;Claire Monteleoni and  Tommi S. Jaakkola,
&quot;Online Learning of Non-stationary Sequences&quot;, &lt;a href=&quot;http://books.nips.cc/papers/files/nips16/NIPS2003_LT04.pdf&quot;&gt;pp. 1093--1100 in
NIPS 2003 (vol. 16)&lt;/a&gt; [Figuring out at what rate to switch between experts]
	&lt;li&gt;Maxim Raginsky, Roummel F. Marcia, Jorge Silva and Rebecca M.
Willett
		&lt;ul&gt;
		&lt;li&gt;&quot;Sequential Probability Assignment via Online Convex Programming
Using Exponential Families&quot; [ISIT 2009; &lt;a href=&quot;http://people.ee.duke.edu/~willett/papers/raginsky_marcia_silva_willett_ISIT09.pdf&quot;&gt;PDF&lt;/a&gt;]
		&lt;li&gt;&quot;Sequential anomaly detection in the presence of noise and limited feedback&quot;, &lt;a href=&quot;http://arxiv.org/abs/0911.2904&quot;&gt;arxiv:0911.2904&lt;/a&gt;
		&lt;/ul&gt;
	&lt;li&gt;Kyupil Yeon, Moon Sup Song, Yongdai Kim, Hosik Choi, Cheolwoo
Park, &quot;Model averaging via penalized regression for tracking concept
drift&quot;, &lt;a href=&quot;http://dx.doi.org/10.1198/jcgs.2010.08104&quot;&gt;Journal of Computational and Graphical
Statistics&lt;/cite&gt; &lt;strong&gt;online before print&lt;/strong&gt; (2010)&lt;/a&gt;
	&lt;/ul&gt;

&lt;ul&gt;Modesty forbids me to recommend:
	&lt;li&gt;CRS, Abigail Z. Jacobs, Kristina Lisa Klinkner and Aaron Clauset, &quot;Adapting to Non-stationarity with Growing Expert Ensembles&quot;, &lt;a href=&quot;http://arxiv.org/abs/1103.0949&quot;&gt;arxiv:1103.0949&lt;/a&gt;
	&lt;/ul&gt;

&lt;ul&gt;To read:
	&lt;li&gt;Istv&amp;aacute;n Berkes, Lajos Horv&amp;aacute;th and Shiqing Ling, &quot;Estimation in nonstationary random coefficient autoregressive models&quot;, 
&lt;a href=&quot;http://dx.doi.org/10.1111/j.1467-9892.2009.00615.x&quot;&gt;&lt;cite&gt;Journal of Time Series Analysis&lt;/cite&gt; &lt;strong&gt;30&lt;/strong&gt; (2009): 395--416&lt;/a&gt; [&quot;the unit root problem does not exist in the RCA model&quot;!]
	&lt;li&gt;Satish T. S. Bukkapatnam and Changqing Cheng, &quot;Forecasting the evolution of nonlinear and nonstationary systems using recurrence-based local Gaussian process models&quot;, &lt;a href=&quot;http://dx.doi.org/10.1103/PhysRevE.82.056206&quot;&gt;&lt;cite&gt;Physical Review E&lt;/cite&gt;
&lt;strong&gt;82&lt;/strong&gt; (2010): 056206&lt;/a&gt;
	&lt;li&gt;Alexey Chernov, Vladimir Vovk, &quot;Prediction with Advice of Unknown Number of Experts&quot;, &lt;a href=&quot;http://arxiv.org/abs/1006.0475&quot;&gt;arxiv:1006.0475&lt;/a&gt;
	&lt;li&gt;Michael P. Clements and David F. Hendry, &lt;cite&gt;Forecasting Non-Stationary Economic Time
Series&lt;/cite&gt;
	&lt;li&gt;Rainer Dahlhaus and Wolfgang Polonik, &quot;Empirical spectral processes for locally stationary time series&quot;, &lt;cite&gt;Bernoulli&lt;/citE&gt; &lt;Strong&gt;15&lt;/strong&gt;
(2009): 1--39, &lt;a href=&quot;http://arxiv.org/abs/902.1448&quot;&gt;arxiv:902.1448&lt;/a&gt;
	&lt;li&gt;Ching-Kang Ing, Jin-Lung Lin, Shu-Hui Yu, &quot;Toward optimal multistep
forecasts in non-stationary
autoregressions&quot;, &lt;cite&gt;Bernoulli&lt;/cite&gt; &lt;strong&gt;15&lt;/strong&gt; (2009): 402--437
= &lt;a href=&quot;http://arxiv.org/abs/0906.2266&quot;&gt;arxiv:0906.2266&lt;/a&gt; [&quot;Optimal&quot;
assuming that you know you are facing a linear AR model.]
	&lt;li&gt;C. T. Jose, B. Ismail, S. Jayasekhar, &quot;Trend, Growth Rate, and Change Point Analysis: A Data Driven Approach&quot;, &lt;a href=&quot;http://dx.doi.org/10.1080/03610910701812477&quot;&gt;&lt;cite&gt;Communications in Statistics: Simulation and Computation&lt;/citE&gt; &lt;strong&gt;37&lt;/strong&gt; (2008): 498--506&lt;/a&gt;
	&lt;li&gt;Yan Karklin and Michael S. Lewicki, &quot;A Hierarchical Bayesian Model
for Learning Nonlinear Statistical Regularities in Nonstationary Natural
Signals&quot;, &lt;a
href=&quot;http://neco.mitpress.org/cgi/content/abstract/17/2/397&quot;&gt;&lt;cite&gt;Neural
Computation&lt;/cite&gt; &lt;strong&gt;17&lt;/strong&gt; (2005): 397--423&lt;/a&gt;
	&lt;li&gt;Zudi Lu, Dag Johan Steinskog, Dag Tjostheim and Qiwei Yao,
&quot;Adaptively Varying-Coefficient Spatiotemporal Models&quot;, &lt;a href=&quot;http://dx.doi.org/10.1111/j.1467-9868.2009.00710.x&quot;&gt;&lt;citE&gt;Journal of the Royal Statistical Society&lt;/cite&gt; B &lt;strong&gt;71&lt;/strong&gt; (2009): 859--880&lt;/a&gt; [&lt;a href=&quot;http://stats.lse.ac.uk/q.yao/qyao.links/paper/spatioVLM.pdf&quot;&gt;PDF preprint&lt;/a&gt;]
	&lt;li&gt;Joaquin Quinonero-Candela, Masashi Sugiyama, Anton Schwaighofer and
Neil D. Lawrence (eds.), &lt;cite&gt;Dataset Shift in Machine Learning&lt;/cite&gt;
	&lt;li&gt;Joshua W. Robinson, Alexander J. Hartemink, &quot;Learning Non-Stationary Dynamic Bayesian Networks&quot;, &lt;a href=&quot;http://jmlr.csail.mit.edu/papers/v11/robinson10a.html&quot;&gt;Journal of Machine Learning Research&lt;/cite&gt; &lt;strong&gt;11&lt;/strong&gt; (2010): 3647--3680&lt;/a&gt;
	&lt;li&gt;P. F. Verdes, P. M. Granitto and H. A. Ceccatto, &quot;Overembedding
Method for Modeling Nonstationary Systems&quot;, &lt;a
href=&quot;http://dx.doi.org/10.1103/PhysRevLett.96.118701&quot;&gt;&lt;cite&gt;Physical Review
Letters&lt;/cite&gt; &lt;strong&gt;96&lt;/strong&gt; (2006): 118701&lt;/a&gt;
	&lt;li&gt;Ou Zhao, Michael Woodroofe, &quot;Estimating a monotone trend&quot;,
&lt;a href=&quot;http://arxiv.org/abs/0812.3188&quot;&gt;arxiv:0812.3188&lt;/a&gt;
	&lt;/ul&gt;


&lt;ul&gt;To write:
	&lt;li&gt;CRS + co-conspirators to be named later, &quot;This Time Is Different&quot;
	&lt;/ul&gt;
</description>
  </item>
  </channel>
</rss>
