<?xml version="1.0"?>
<!-- name="generator" content="blosxom/2.0" -->
<!DOCTYPE rss PUBLIC "-//Netscape Communications//DTD RSS 0.91//EN" "http://my.netscape.com/publish/formats/rss-0.91.dtd">

<rss version="0.91">
  <channel>
    <title>Notebooks   </title>
    <link>http://bactra.org/notebooks</link>
    <description>Cosma's Notebooks</description>
    <language>en</language>

  <item>
    <title>The Thomson Sampling Model</title>
    <link>http://bactra.org/notebooks/2010/06/27#thomson-sampling-model</link>
    <description>
&lt;P&gt;An alternative to the factor model in psychometrics (and potentially other
applications of factor analysis).  I have written about it at great length
&lt;a href=&quot;../weblog/523.html&quot;&gt;here&lt;/a&gt;, and in
my &lt;a href=&quot;http://www.stat.cmu.edu/~cshalizi/350&quot;&gt;lecture notes&lt;/a&gt;
on &lt;a href=&quot;data-mining.html&quot;&gt;data mining&lt;/a&gt;, which I hereby incorporate by
reference.  I'll just use this to record some ideas for possible work; if anyone wants to take them up before I get around to them, drop me a line, or at least put me in the acknowledgments!

&lt;P&gt;&lt;em&gt;Thomson vs. Erdos-Renyi&lt;/em&gt;. Thomson's original model sampled &quot;bonds&quot;
or &quot;abilities&quot; (i.e., latent variables) without replacement.  It's much easier
to analyze, however, if you use simple Bernoulli sampling, and naturally the
two come to much the same thing in the large-size limit.  This is reminiscent
to me of the two versions of the Erdos-Renyi random graph model, where you fix
either the number of edges (so sampling without replacement) or the probability
of an edge (Bernoulli sampling); is there something to this connection --- say
the appearance of a single general factor corresponding to the emergence of a
giant component?

&lt;P&gt;&lt;em&gt;Geometry vs. covariance&lt;/em&gt;.  Thomson's model produces the same
patterns of correlations as factor models (more exactly, can be made to come
arbitrarily close with arbitrarily high probability).  This naturally raises
the question of how one might distinguish between the two simply from the data,
as opposed to actual scientific knowledge of causal mechanisms.  Correlations,
clearly, won't do the job.  But: if we have &lt;i&gt;p&lt;/i&gt; observables, and &lt;i&gt;q&lt;/i&gt;
&amp;lt; &lt;i&gt;p&lt;/i&gt; factors, then the expected values of the observables must always
lie on a &lt;i&gt;q&lt;/i&gt;-dimensional linear subspace of the full &lt;i&gt;p&lt;/i&gt;-dimensional
space.  Unless I am missing something, however, if I have &lt;i&gt;q&lt;/i&gt;
&amp;gt; &lt;i&gt;p&lt;/i&gt; abilities in the Thomson model, there is no geometric constraint
on the expected values of observable vectors.  (Maybe there's something subtle
I'm missing from the sampling process?)  Might this provide a test?  In both
models our data equals expected vectors plus noise, so the factor model doesn't
predict that observations will fall &lt;em&gt;exactly&lt;/em&gt; on a hyper-plane, but
perhaps something could be done with this.

&lt;P&gt;See also:
	&lt;a href=&quot;iq.html&quot;&gt;IQ&lt;/a&gt;


&lt;ul&gt;Recommended:
	&lt;li&gt;David J. Bartholomew, Ian J. Deary and Martin Lawn, &quot;A New Lease on Life for Thomson's Bonds Model of Intelligence&quot;, &lt;a href=&quot;http://dx.doi.org/10.1037/a0016262&quot;&gt;&lt;cite&gt;Psychological Review&lt;/cite&gt; &lt;strong&gt;116&lt;/strong&gt; (2009): 567--579&lt;/a&gt; [Though they are shockingly naive about things like the interpretation of fMRI data]
	&lt;li&gt;Godfrey H. Thomson
		&lt;ul&gt;
		&lt;li&gt;&quot;A Hierarchy without a General Factor&quot;, &lt;cite&gt;British Journal of Psychology&lt;/cite&gt; &lt;strong&gt;8&lt;/strong&gt; (1916): 271--281
		&lt;li&gt;&quot;On the Cause of Hierarchical Order among the
Correlation Coefficients of a Number of Variates Taken in
Pairs&quot;, &lt;cite&gt;Proceedings of the Royal Society of London A&lt;/cite&gt; &lt;strong&gt;95&lt;/strong&gt; (1919): 400--408 [&lt;a href=&quot;http://www.jstor.org/pss/93637&quot;&gt;JSTOR&lt;/a&gt;]
		&lt;li&gt;&lt;cite&gt;The Factorial Analysis of Human Ability&lt;/cite&gt;
[&lt;a href=&quot;http://www.archive.org/details/factorialanalysi032965mbp&quot;&gt;Full text free online&lt;/a&gt;.  I &lt;em&gt;strongly&lt;/em&gt; recommend this book.]
		&lt;/ul&gt;
	&lt;/ul&gt;
</description>
  </item>
  </channel>
</rss>
