<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Polimath</title>
	<atom:link href="http://haptonstahl.org/polimath/?feed=rss2" rel="self" type="application/rss+xml" />
	<link>http://haptonstahl.org/polimath</link>
	<description>Patterns in Politics</description>
	<lastBuildDate>Wed, 11 Apr 2012 01:34:21 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>Review of &#8220;Building Data Science Teams&#8221; by DJ Patil</title>
		<link>http://haptonstahl.org/polimath/?p=220</link>
		<comments>http://haptonstahl.org/polimath/?p=220#comments</comments>
		<pubDate>Wed, 11 Apr 2012 01:34:21 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[DDG]]></category>
		<category><![CDATA[reviews]]></category>

		<guid isPermaLink="false">http://haptonstahl.org/polimath/?p=220</guid>
		<description><![CDATA[Having recently committed myself to earning my living as a Data Scientist, I&#8217;ve been reading anything I can find to guide my self-education. So I just spent the last hour reading and mulling over DJ Patil&#8216;s article/report Building Data Science Teams (BDST henceforth) which is available free from various outlets; I read the Kindle version.  [...]]]></description>
			<content:encoded><![CDATA[<p>Having recently committed myself to earning my living as a Data Scientist, I&#8217;ve been reading anything I can find to guide my self-education. So I just spent the last hour reading and mulling over <a href="http://twitter.com/dpatil">DJ Patil</a>&#8216;s article/report <a href="http://radar.oreilly.com/2011/09/building-data-science-teams.html">Building Data Science Teams</a> (BDST henceforth) which is available free from various outlets; I read the Kindle version.  (Disclaimer: DJ is a friend and occasional drinking buddy.)</p>
<p>Patil takes up the necessary and generally thankless task of writing a &#8220;big-think piece&#8221;.  It&#8217;s necessary because, with all the recent talk about Data Scientists, it would be easy for some to see &#8220;Data Science&#8221; as a recent entry in the long history of business fads.  Business fads tend to indulge in one of two sins: oversimplifying, or being so general that anything counts.  Both are sins to which Data Science advocates are susceptible.  An advocate of Data Science might oversimplify by giving a recipe or IT shopping list for doing Data Science; Patil avoids this triteness by emphasizing the wide diversity of the good teams he&#8217;s built or worked with.  Alternatively, one could sin by making anything count as Data Science.  Commenter &#8220;Verbose&#8221; anticipates this problem with his erudite critique of BDST: &#8220;<a href="http://radar.oreilly.com/2011/09/building-data-science-teams.html#comment-10852153">Same technical data analysis, new bullshit name</a>&#8220;.  Patil avoids this and provides a blueprint for others to do the same.</p>
<p>When discussing the etymology of &#8220;Data Scientist&#8221; Patil writes that &#8220;Research Scientist&#8221; would not be as appropriate a term for this profession:</p>
<blockquote><p>&#8220;Research scientist&#8221; was a reasonable job title used by companies like Sun, HP, Xerox, Yahoo, and IBM. However, we felt that most research scientists worked on projects that were futuristic and abstract, and the work was done in labs that were isolated from the product development teams. It might take years for lab research to affect key products, if it ever did. Instead, <strong>the focus of our teams was to work on data applications that would have an immediate and massive impact on the business</strong>. (emphasis added) The term that seemed to fit best was data scientist: those who use both data and science to create something new.</p></blockquote>
<p>Elsewhere Patil provides a solid definition of Data Scientist, but this paragraph encapsulates the concept just as well:  A Data Scientist uses data and science to have an immediate and massive impact on the business.  Just moving data around?  Not data science.  Have an impact in the vague future?  Not data science.  Improving entirely on the margins?  Not (all of) data science.  Holding the Data Scientist&#8217;s feet to the fire &#8212; asking &#8220;How does this immediately and massively impact our business?&#8221; &#8212; provides accountability and hence focus for the team.</p>
<p>Writing a &#8220;big-think piece&#8221; is also a thankless task: the breadth of the topic means that it&#8217;s easy for critics to find something to criticize as being presented too simply.  This ignores the contribution of providing a view of the new discipline from space, showing all of it as a piece, and showing (if briefly) how the disparate parts fit together.  I was glad for a look at a map for this road I&#8217;m traveling.</p>
<p>Overall BDST is short, shorter than I would have liked.  (I&#8217;m glad that Patil is sharing more of his experience through other venues.)  The advice Patil gives about Data Science, forging teams, and hiring Data Scientists seems both specific and useful; I&#8217;ll post again as I have occasion to use his advice.</p>
<p><strong>Recommendation</strong>: &#8220;<a href="http://radar.oreilly.com/2011/09/building-data-science-teams.html">Building Data Science Teams</a>&#8221; is short, but with enough good ideas as to be required for anyone in business intelligence, internal data analysis, or applied computational modeling and prediction.</p>
]]></content:encoded>
			<wfw:commentRss>http://haptonstahl.org/polimath/?feed=rss2&#038;p=220</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Planned Serendipity</title>
		<link>http://haptonstahl.org/polimath/?p=192</link>
		<comments>http://haptonstahl.org/polimath/?p=192#comments</comments>
		<pubDate>Mon, 05 Sep 2011 20:22:21 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[planned serendipity]]></category>

		<guid isPermaLink="false">http://haptonstahl.org/polimath/?p=192</guid>
		<description><![CDATA[Yesterday I got back from a great APSA in Seattle.  My undergraduate students were despondent at me having to cancel class Thursday so I could attend.  A few were curious about what happens at a scientific conference and asked about the structure.  I explained that there would be several thousand political scientists at this conference and that [...]]]></description>
			<content:encoded><![CDATA[<p>Yesterday I got back from a great <a href="http://www.apsanet.org/content_65547.cfm">APSA</a> in Seattle.  My undergraduate students were despondent at me having to cancel class Thursday so I could attend.  A few were curious about what happens at a scientific conference and asked about the structure.  I explained that there would be several thousand political scientists at this conference and that most of the planned interaction would take place in panels.</p>
<p>In theory, a panel takes place in a room where 3-300 (median = 10) people watch three to five papers get presented by their authors.  Then a discussant, who reads the papers in advance, comments on the papers both to draw connections among them and to stimulate conversation among the attendees.  Then the audience asks questions and offers feedback to the authors.  The whole panel takes about 1 3/4 hours.</p>
<p>Although panels comprise most of the scheduled events at a conference, they are not the best reason for scientists to attend conferences and they are far from the most rewarding part of a conference.  Panels are often poorly attended.  The papers in a panel often have very little to do with each other.  The discussant may not receive the papers until days or moments before the panel, if at all, and even so the comments may focus more on typography than on big ideas.</p>
<p>Panels are a party game. They are an excuse to get smart people, who are interested in similar things, together in a room talking. Put a bunch of clever folks together and strange, wonderful, unpredictable things happen.  A conference is <strong>mass planned serendipity</strong>.</p>
<p>The largest conference benefits to my research have happened when I have not been at panels: between panels, skipping panels, into the evening and the night. It&#8217;s the networking, but not &#8220;networking&#8221; in the Machiavelian, sales-person sense.  It&#8217;s the comment on my paper that someone was a little too shy to offer in front of everyone, the comment that helps me recast the paper so it will place higher.  It&#8217;s running into the same person at three panels and finally discovering we would love to work together on some research.  It&#8217;s the dinner outing that leads to an invited talk or an interview.  It&#8217;s the shared coffee followed up with a Facebook friending that leads to a new real friendship.</p>
<p>All of this is made possible by panels, but it&#8217;s not the direct result of the panels.  So when someone tells me they didn&#8217;t go to a lot of panels, I understand that they probably got a lot of professional good out of the conference.</p>
<p>Also, I had a lot of fun at the Space Needle.</p>
]]></content:encoded>
			<wfw:commentRss>http://haptonstahl.org/polimath/?feed=rss2&#038;p=192</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Regain your confidence (intervals)</title>
		<link>http://haptonstahl.org/polimath/?p=76</link>
		<comments>http://haptonstahl.org/polimath/?p=76#comments</comments>
		<pubDate>Sun, 12 Jun 2011 04:40:59 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Bayes]]></category>
		<category><![CDATA[wrong question]]></category>

		<guid isPermaLink="false">http://haptonstahl.org/polimath/?p=76</guid>
		<description><![CDATA[Next time you see someone &#8220;misinterpret&#8221; a confidence interval, wait a second.  They&#8217;re actually probably okay. It is regular sport for Bayesians to criticize frequentist confidence intervals as unintuitive, usually misinterpreted, and based on what are usually unjustifiable assumptions.  There are good reasons for this:  they are unintuitive, usually misinterpreted, and based on what are [...]]]></description>
			<content:encoded><![CDATA[<p>Next time you see someone &#8220;misinterpret&#8221; a confidence interval, wait a second.  They&#8217;re actually probably okay.<span id="more-76"></span></p>
<p>It is regular <a title="See page 3" href="http://www.stat.ufl.edu/~casella/Talks/BayesRefresher.pdf">sport</a> for Bayesians to criticize frequentist confidence intervals as <a href="http://web1.graphpad.com/help/Prism5/prism5help.html?stat_more_about_confidence_interval.htm">unintuitive</a>, <a href="http://www.ippp.dur.ac.uk/old/Workshops/02/statistics/proceedings/karlen.pdf">usually misinterpreted</a>, and based on what are usually unjustifiable assumptions.  There are good reasons for this:  they are unintuitive, usually misinterpreted, and based on what are usually unjustifiable assumptions.</p>
<p>Confidence intervals take into account the uncertainty we have in trying to describe a population given that we only observe a random sample.  Maybe our sample is representative of the population, maybe not.  The smaller the sample, the more likely it is that, through random variation, we got a sample that suggests a relationship (causes us to reject the null hypothesis) when really there is no relationship (the null is true.)  If we take 100 random samples of the same size and for each construct a (different) confidence interval, how many will contain the true value of the parameter?  We expect that 95 of those constructed intervals will contain the true value.</p>
<p>We don&#8217;t have 100 samples of the same size, so this is not the question we want to answer.  Here&#8217;s what we want to know:  What is the probability that the true value is in the interval?</p>
<p>There are a variety of very good, sound reasons why the frequentist approach does not make sense. I understand the difference between believing that the true parameter is &#8220;fixed and known to God&#8221; (per the frequentist assumption) and a random variable (the Bayesian assumption.)  I agree completely that a confidence interval answers the wrong question.</p>
<p>It doesn&#8217;t matter.</p>
<p>For large samples and given the regularity conditions of maximum likelihood estimators, the marginal posterior distribution for a single parameter is <a href="http://books.google.com/books?id=eJO1abQ4l3wC&amp;lpg=PP1&amp;dq=james%20press%20subjective%20objective&amp;pg=PA173#v=onepage&amp;q=limiting%20large-sample%20normal%20distribution&amp;f=false">approximately norma</a>l with a mean equal to the MLE and standard deviation equal to the standard error.  Under these conditions, maximum likelihood and Bayesian estimators give you the same inferences.</p>
<p><strong>Bayesian researcher perspective:</strong> Suppose I want a credible interval for a parameter where I have a lot of data and the model is well-behaved but I don&#8217;t have convenient code for drawing posteriors.  I could estimate the MLE and use the resulting confidence interval as an approximate credible interval. My friend and colleague Jeff Gill calls this the &#8220;lazy Bayesian&#8221; approach.  Lazy, efficient, tomato, tomahto.</p>
<p><strong>Bayesian consumer perspective:</strong> Suppose you are reading an article where there is plenty of data and a well-behaved model, but the author provides frequentist confidence intervals.  You can treat them as approximate credible intervals.  Easy peasy.</p>
<p><strong>Pragmatist perspective:</strong> As long as the conditions are met, you can go on &#8220;misinterpreting&#8221; confidence intervals.</p>
<p>This &#8220;lazy Bayesian&#8221; approach is not limited to making simple inferences about single parameters.  <a title="Must have access to JSTOR" href="http://www.jstor.org/stable/2669316">King, Tomz, and Wittenberg</a> explain how to generate draws from the approximate posterior (as opposed to approximate draws from the actual posterior via MCMC) and make any inference a Bayesian can using the output from maximum likelihood estimation.</p>
<p>As a Bayesian, I advocate strongly for increasing the use of Bayesian methods.  However, we should be careful to avoid overselling the advantages.</p>
]]></content:encoded>
			<wfw:commentRss>http://haptonstahl.org/polimath/?feed=rss2&#038;p=76</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Bayes fixes small n, doesn&#8217;t it?</title>
		<link>http://haptonstahl.org/polimath/?p=135</link>
		<comments>http://haptonstahl.org/polimath/?p=135#comments</comments>
		<pubDate>Thu, 03 Mar 2011 17:09:29 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://haptonstahl.org/polimath/?p=135</guid>
		<description><![CDATA[What is a methods-careful practitioner to do when the number of observations () is small?  I don&#8217;t know how many times I&#8217;ve been told by a well-meaning Bayesian some variation of Bayesian estimation addresses the &#8220;small problem&#8221; This is right and wrong. Maximum likelihood estimators (MLEs) leverage large in two ways. Making inferences about parameters. [...]]]></description>
			<content:encoded><![CDATA[<p>What is a methods-careful practitioner to do when the number of observations (<img src='http://s.wordpress.com/latex.php?latex=n&#038;bg=T&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' />) is small?  I don&#8217;t know how many times I&#8217;ve been told by a well-meaning Bayesian some variation of</p>
<blockquote><p>Bayesian estimation addresses the &#8220;small <img src='http://s.wordpress.com/latex.php?latex=n&#038;bg=T&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> problem&#8221;</p></blockquote>
<p>This is right and wrong.<span id="more-135"></span></p>
<p>Maximum likelihood estimators (MLEs) leverage large <img src='http://s.wordpress.com/latex.php?latex=n&#038;bg=T&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> in two ways.</p>
<ol>
<li>Making inferences about parameters.</li>
<li>Checking model fit.</li>
</ol>
<p>The goal of #1 is to answer questions like &#8220;Is <img src='http://s.wordpress.com/latex.php?latex=%5Cbeta%3E0&#038;bg=T&#038;fg=000000&#038;s=0' alt='\beta&gt;0' title='\beta&gt;0' class='latex' />?&#8221; or &#8220;What is the shortest interval that has a 95% change of containing <img src='http://s.wordpress.com/latex.php?latex=%5Cbeta&#038;bg=T&#038;fg=000000&#038;s=0' alt='\beta' title='\beta' class='latex' />?&#8221;  MLEs let us take a stab at answering* this. A large number of observations means we can apply the Central Limit Theorem, which means we can use the standard errors to test simple hypotheses and build confidence intervals.  The goal of #2 is to justify our choice of model after the fact.  Presumably we had a good, substantively-informed theory to justify our choice of model, but it&#8217;s nice to be able to say afterward &#8220;See? The models fits well, so we weren&#8217;t crazy to choose this model.&#8221;</p>
<p>How many is a &#8220;small&#8221; number of observations?  It&#8217;s difficult to say.  <a href="http://www.amazon.com/dp/0803973748/">Long (1997)</a> gives some guidance about this, suggesting that 100 is a minimal number for an MLE, and that one needs at least 10 observations per parameter in the model.  However, we never really know if this is enough to be able to trust our inferences and fit.</p>
<p>What happens if we have too few observations?  Both #1 and #2 become unreliable.  We have too little data to assume that the Central Limit Theorem has &#8220;kicked in,&#8221; so our point estimates have more uncertainty, which means our standard errors are too small**, and therefore any inferences are unreliable.  Our tests for model fit are similarly starved for information, so any post-hoc model justification will be difficult.</p>
<p>What <strong>does</strong> Bayes fix?  Bayes estimators are finite-data estimators:  more data gives more accurate estimates, but the <em>measures of uncertainty of those estimates are reliable regardless of the amount of data</em>.  This means that inferences are reliable regardless of the size of the data set.  Want to know if <img src='http://s.wordpress.com/latex.php?latex=%5Cbeta%3E0&#038;bg=T&#038;fg=000000&#038;s=0' alt='\beta&gt;0' title='\beta&gt;0' class='latex' /> but our $\latex n$ is small?  No problem for a Bayesian.</p>
<p>What about model fit?  Bayesians have the same tools as frequentists for checking model fit <em>and</em> they also have the numerical and graphical analysis of <a href="http://www.wjh.harvard.edu/soc/faculty/western/pdfs/smr.pdf">posterior predictive distributions</a>.  As with inferences about parameters, more data is more information and thus is preferable.  However, any tests of model fit are still reliable.  As a purely practical matter, if we have a very small data set we probably will not be able to conclude anything about model fit from the data.  This puts the burden back on the substantive/theoretical argument for the model form.  If we understand the data-generating process very well, great; if not, then this part of our argument will need extra scrutiny.</p>
<p><strong>Bottom line:</strong> Bayesian estimators don&#8217;t create more information.  However, they do let us correctly identify how sure we are about the inferences we draw.  That&#8217;s a clear improvement.  Other Bayesians aren&#8217;t helping by overselling it.</p>
<p>===</p>
<p>* <a href="http://en.wikiquote.org/wiki/History_of_the_World:_Part_I#The_French_Revolution">Don&#8217;t get saucy with me</a> about how frequentist confidence intervals (CIs) either contain or don&#8217;t contain <img src='http://s.wordpress.com/latex.php?latex=%5Cbeta&#038;bg=T&#038;fg=000000&#038;s=0' alt='\beta' title='\beta' class='latex' />, implying that it&#8217;s meaningless to talk about the probability of <img src='http://s.wordpress.com/latex.php?latex=%5Cbeta&#038;bg=T&#038;fg=000000&#038;s=0' alt='\beta' title='\beta' class='latex' /> being in the interval. CIs are better than commonly described &#8212; details in my next post.</p>
<p>** In theory our standard errors could be too small, too large, or correct.  We can appropriately account for this additional uncertainty by increasing the size of our standard errors (think &#8220;put a confidence interval around our confidence interval&#8221;) except we can&#8217;t know how much to increase them.  See &#8220;reasoning, circular&#8221;.</p>
]]></content:encoded>
			<wfw:commentRss>http://haptonstahl.org/polimath/?feed=rss2&#038;p=135</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Truth and Choices: Computational v. Analytical formal models</title>
		<link>http://haptonstahl.org/polimath/?p=74</link>
		<comments>http://haptonstahl.org/polimath/?p=74#comments</comments>
		<pubDate>Fri, 03 Dec 2010 17:20:19 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[computational modeling]]></category>
		<category><![CDATA[theory]]></category>

		<guid isPermaLink="false">http://haptonstahl.org/polimath/?p=74</guid>
		<description><![CDATA[How do we show a statement about politics is true? Analytic formal modelers suggest one way: The way to be sure that a statement about politics is true is to make a formal argument: explicitly state the assumptions and use rigorous logic to create a 100% valid argument from assumptions to conclusions.  Euclidian geometry was [...]]]></description>
			<content:encoded><![CDATA[<p><strong>How do we show a statement about politics is true?</strong> Analytic formal modelers suggest one way:</p>
<p style="text-align: center;"><span id="more-74"></span><a href="http://xkcd.com/263/"><img class="alignnone" src="http://imgs.xkcd.com/comics/certainty.png" alt="" width="740" height="312" /></a></p>
<p><em>The </em>way to be <em>sure</em> that a statement about politics is true is to make a formal argument: explicitly state the assumptions and use rigorous logic to create a 100% valid argument from assumptions to conclusions.  Euclidian geometry was a popular touchstone of Truth<sup>TM</sup> for two thousand years&#8230;until Gauss proved that <a href="http://mathworld.wolfram.com/ParallelPostulate.html">one of Euclid&#8217;s assumptions was arbitrary</a>.</p>
<p>Oddly enough, most of the mathematicians I know do not assert that math reveals Truth<sup>TM</sup>.  For a set of statements to be true, they must not disagree.  Gödel&#8217;s <a href="http://www.miskatonic.org/godel.html">showed</a> us that we <em>cannot</em> be sure that math does not contradict itself.  Accepting this means abandoning the claim that 100% truth can be generated analytically or using any logical system. For many mathematicians (not including probabilists or statisticians) once something is uncertain, all bets are off.</p>
<p>Analytic formal modelers know this, but assert that deduction (assumptions + logic =&gt; conclusion)  is better than statistical induction (theory + data =&gt; conclusion).  Let&#8217;s take a look at this analytically, shall we?  Consider this statement that is true simply as an application of the definition of <a href="http://en.wikipedia.org/wiki/Conditional_probability">conditional probability</a>:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Ctext%7BPr%7D%28%5Ctext%7Bconclusion%20is%20true%7D%29%3D%5Cunderbrace%7B%5Ctext%7BPr%7D%28%5Ctext%7Bconclusion%20true%7D%7C%5Ctext%7Bassumptions%7D%29%7D_%7B%5Ctext%7Bchain%20of%20reasoning%7D%7D%5Ccdot%5Ctext%7BPr%7D%28%5Ctext%7Bassumptions%20are%20true%7D%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\text{Pr}(\text{conclusion is true})=\underbrace{\text{Pr}(\text{conclusion true}|\text{assumptions})}_{\text{chain of reasoning}}\cdot\text{Pr}(\text{assumptions are true})' title='\text{Pr}(\text{conclusion is true})=\underbrace{\text{Pr}(\text{conclusion true}|\text{assumptions})}_{\text{chain of reasoning}}\cdot\text{Pr}(\text{assumptions are true})' class='latex' /></p>
<p style="text-align: left;">Analytic modelers use a deductive chain of reasoning, which has a (conditional) probability of being true that approaches 1, so the probability of the conclusion being true is essentially the same as the the probability that the assumptions are true.  One critique of this approach is that <a href="http://en.wikiquote.org/wiki/George_Box">all assumptions, if pushed far enough, are false</a>.  However, that attitude underestimates the usefulness of analytic results.  If the assumptions are <em>close</em> to being true and the chain of reasoning is &#8220;well-behaved&#8221; (robust to small errors in inputs) then the conclusions will be close to true.</p>
<p style="text-align: left;">Computational (including agent-based) modelers start with clear, explicit assumptions but use an inductive chain of reasoning which is therefore less likely to be conditionally true.  However, the original assumptions are often much more realistic:</p>
<table style="margin-left: auto; margin-right: auto;" border="1">
<tbody>
<tr>
<td><strong>Common analytic assumptions</strong></td>
<td><strong>Common computational assumptions</strong></td>
</tr>
<tr>
<td>1, 2, or infinitely many actors</td>
<td style="text-align: left;">5, 50, 200 actors</td>
</tr>
<tr>
<td>static discounting</td>
<td>dynamic discounting</td>
</tr>
<tr>
<td>&#8220;Common knowledge&#8221; about the state of the world, ie a kind of omniscience</td>
<td>Local/limited knowledge</td>
</tr>
</tbody>
</table>
<p style="text-align: left;">
<p style="text-align: left;"><strong>If the assumptions made in a computational model are more likely to be true (or nearly true) and the chain of reasoning is strong, then the conclusions are more likely to be true than those &#8220;proven&#8221; using an analytic formal model.</strong></p>
<p style="text-align: left;">Of course, that&#8217;s a tall order to fill.  Your mileage may vary.  Whether analytic or computational approaches is more appropriate depends on the specific assumptions being made and the strength of the induction used in the computational approach.  This is a trade off that should be made mindfully, not dogmatically.</p>
]]></content:encoded>
			<wfw:commentRss>http://haptonstahl.org/polimath/?feed=rss2&#038;p=74</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>We all carry the scars</title>
		<link>http://haptonstahl.org/polimath/?p=101</link>
		<comments>http://haptonstahl.org/polimath/?p=101#comments</comments>
		<pubDate>Thu, 11 Nov 2010 18:09:15 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[not politics]]></category>

		<guid isPermaLink="false">http://haptonstahl.org/polimath/?p=101</guid>
		<description><![CDATA[I served in the US Navy for a few months in 1986, five years in the early 90s, and another year and a half in the reserves. I was never asked to shoot someone. I never pulled a trigger when the weapon was aimed at a person. I served during, but not &#8220;in&#8221; the first [...]]]></description>
			<content:encoded><![CDATA[<p>I served in the US Navy for a few months in 1986, five years in the early 90s, and another year and a half in the reserves.  I was never asked to shoot someone.  I never pulled a trigger when the weapon was aimed at a person.  I served during, but not &#8220;in&#8221; the first Gulf War.  I served during &#8220;peacetime&#8221;, or at least that&#8217;s how I thought about it.  However, over the last few months I have been thinking more about my time in uniform, realizing the lasting and deep effects that experience had on me.<span id="more-101"></span></p>
<p>Three nights ago I had a dream in which I was being assaulted.  One of the two offenders in my dream was someone from my division on the ship, someone I have not seen in over fifteen years.  I woke up wondering what that was all about, but I gave it little thought as I went on through my day, my happy, productive day.  Everyone has weird bad dreams sometimes.  Civilians just have ones with less gray paint.</p>
<p>When I was on the ship I lived in, essentially, a locker room. My 38 roommates and I slept in 13 triple-stacked bunks in a berthing space the size of my current living and family rooms.  Before the Navy I hated locker rooms.  I wasn&#8217;t frail, but I was a geek in high school and no athlete.  I hated the inevitable comparisons.  In the Navy I hated showering next to a 198-lb functionally retarded man who could bench press 500 lbs and who, because he had joined many years before and outranked me, treated me like a child.  I hated the regular joking about sailors being gay, complete with accompanying grabbing of each others&#8217; bodies.  Yesterday when changing at the gym I was completely comfortable in the locker room, thinking about the class I was going to teach a little later.</p>
<p>Here&#8217;s something weird:  I never heard about any crime on the ship more serious than a regular, playground-style fight.  I never heard about a sexual assault.  I never saw any sign of any actual homosexuals. (My gaydar wasn&#8217;t completely oblivious, as I had been around the &#8220;family&#8221; a lot in my previous college years.)  I never saw anyone shoot or get shot in the line of duty or otherwise.  I never really faced an enemy.  I never had to deal with a fellow sailor who was obviously a &#8220;bad guy&#8221;, someone who would, in the boring movie version of my life, be seen as any kind of villain.</p>
<p>There are a lot of people I know now who I like, but don&#8217;t trust.  There were some people I served with whom I really didn&#8217;t like, but I trust them, and would still, with my life.  And afterward it wouldn&#8217;t change how I feel about them.</p>
<p>Three guys I played poker with died in a plane crash.  When you deploy 10,000 sailors and soldiers on metal ships with guns, bombs, torpedoes, fuel, and high voltage equipment on indifferent seas for an indefinite length of time, some don&#8217;t come back.  It doesn&#8217;t matter whether it&#8217;s &#8220;peacetime&#8221; or not.  They were on a E-2C Hawkeye reconnaissance plane coming in for a landing on the carrier when they got waived off.  They started to gain altitude for another pass, but stalled out right after clearing the flight deck.  A carrier has four screws.  We never found a piece of the plane larger than a surfboard.  I don&#8217;t remember their names.</p>
<p>My ship was in the Adriatic Sea enforcing the no-fly zone over the former Republic of Yugoslavia.  We left to go home, relieved by another AEGIS cruiser who was, like us, fully provisioned and armed with Tomahawk missiles.  We didn&#8217;t even make it home before we found out that the relieving ship got to fire some of her missiles.  Lucky bastards!  We had trained and trained &#8212; we were ready! &#8212; but we never got to fire a shot.  Of course, nobody wants to kill people.  Even then I knew I was lucky not to have to live with the burden of having killed.</p>
<p>What I didn&#8217;t realize is that, in training, in making myself mentally and emotionally ready to pull the trigger, I had already lost something.  When we hurt others, we are diminished. Soldiers who are willing to do this, are willing to diminish themselves, do diminish themselves.  The sacrifice is admirable, but it is a sacrifice just to prepare to kill.  It does change you to prepare for war.  Some changes can be more obvious, some are less so.</p>
<p>I am fine.  I am healthy, physically and mentally.  But I did not serve in combat.  Many, many men and women are serving in combat now or have served in combat before. My experience was approximately 1.4 million times easier than their experience.  My experience was also at least a thousand times easier than that of my wife and every other military spouse. Preparing for war takes your mind off the dangers involved; if it didn&#8217;t few people could follow through. Staying at home waiting for a loved one to return provides no such distraction.  Thank you, my precious wife Dawn, for your sacrifice.</p>
<p>These days, I am living the dream, living the good life, the sweet retreat.  I think of my father who helped identify targets for bombers during Vietnam, who now is a retired minister.  I look at my children, and I hope they will serve others.  If called, and am confident they will respond and serve with honor.  However, I dearly hope they will never have to carry a weapon.  I wish that nobody did, but some people must.</p>
<blockquote><p>The price of freedom is eternal vigilance.</p>
<p>&#8211; Thomas Jefferson</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://haptonstahl.org/polimath/?feed=rss2&#038;p=101</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Change of Intuition about the Definition of Insanity</title>
		<link>http://haptonstahl.org/polimath/?p=82</link>
		<comments>http://haptonstahl.org/polimath/?p=82#comments</comments>
		<pubDate>Thu, 21 Oct 2010 17:19:06 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://haptonstahl.org/polimath/?p=82</guid>
		<description><![CDATA[My dad and I went to the recent Brown/Whitman California gubernatorial debate here at UC Davis. It was fun seeing &#8220;democracy&#8221; live and up close. One of the candidates twice repeated an old saw: One definition of insanity is doing the same thing over and over and expecting different results. The quote was misattributed to [...]]]></description>
			<content:encoded><![CDATA[<p>My dad and I went to the recent Brown/Whitman California gubernatorial debate here at UC Davis.  It was fun seeing &#8220;democracy&#8221; live and up close.  One of the candidates twice repeated an old saw:</p>
<blockquote><p><strong>One definition of insanity is doing the same thing over and over and expecting different results.</strong></p></blockquote>
<p><span id="more-82"></span>The quote was <a href="http://en.wikiquote.org/wiki/Rita_Mae_Brown">misattributed</a> to Einstein, and it&#8217;s often misattributed to Ben Franklin, Mark Twain, and others. <a href="http://rodgerv.wordpress.com/2007/02/10/insanity-doing-the-same-thing-over-and-over-again-and-expecting-different-results/">Various</a> <a href="http://blogs.oreilly.com/digitalmedia/2006/10/the-definition-of-insanity-is.html">folks</a> take issue with this pop-culture &#8220;definition&#8221; in different ways.  As a recovering mathematician, I&#8217;ve repeated it myself, and it used to make some sense to me.</p>
<p>However, as I&#8217;ve tried to move from brittle assumptions (knife-edge mixed equilibria, trembling hand-perfection) in my models to more robust assumptions (quantal response equilibria, non-parametric tests) this truism has rung less true.</p>
<p>Suppose you hit a subject with a treatment.</p>
<p><strong>A physical scientist</strong> tends to have objects of study that have simple internal states.  The aggregate systems are complex, but the smallest observable parts of the system are very simple.  The simplicity of the set of states means that I have a good chance of being able to choose a sample from those states that is sufficiently random so that there is little or no correlation between states and my variables of interest.  No correlation means that we get unbiased estimates of the parameters for models connecting my variables of interest.  I might run a physical experiment, say, <em>n</em> times to assure that we have a good sample and to compensate for measurement uncertainty.  I don&#8217;t really expect running it another <em>n</em> times to show me anything new; that <em>would</em> be crazy.</p>
<p><strong>A social scientist</strong> has objects of study that are vastly more complex. A person has far more possible internal states than an atom, a mole of gas, or a cell.  More germane, we can&#8217;t easily be sure of sampling from those internal states randomly, so we constantly face the threat of internal states being correlated with our variables of interest, which leads to bias if not statistically corrected.</p>
<p><strong>Can we just set <em>n</em> higher?  No.</strong> If we had a better understanding of the internal states of a person, we might be better able to sample randomly from those states.  At least for now, however, we do not grok the data-generating processes within people as well as we do those in chemistry.  Samples would have to astronomically large to overcome the selection bias in our samples of the internal states of a person.</p>
<p>I&#8217;ve heard this &#8220;definition&#8221; many, many times.  Each time before I kept my mouth shut.  This time I had to speak out.  Keeping silent would have driven <em>me</em> crazy.</p>
]]></content:encoded>
			<wfw:commentRss>http://haptonstahl.org/polimath/?feed=rss2&#038;p=82</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Dimensionality matters: three implications of ideology being multidimensional</title>
		<link>http://haptonstahl.org/polimath/?p=75</link>
		<comments>http://haptonstahl.org/polimath/?p=75#comments</comments>
		<pubDate>Wed, 25 Aug 2010 15:18:20 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[dimensionality]]></category>

		<guid isPermaLink="false">http://haptonstahl.org/polimath/?p=75</guid>
		<description><![CDATA[Left or right? Liberal or conservative? Blue or red?  We know the terms bandied about in the punditverse, but it&#8217;s easy to forget that there is more than one way to divide the world into two political ideologies. If you categorize political ideology into two camps, you are implicitly using a one-dimensional model, a model [...]]]></description>
			<content:encoded><![CDATA[<p>Left or right? Liberal or conservative? Blue or red?  We know the terms bandied about in the punditverse, but it&#8217;s easy to forget that there is more than one way to divide the world into two political ideologies.</p>
<p><span id="more-75"></span>If you categorize political ideology into two camps, you are implicitly using a <strong>one-dimensional model</strong>, a model that puts everyone on a single line from left to right.</p>
<p><a href="http://www.politicalcompass.org/"><img class="aligncenter" src="http://www.politicalcompass.org/images/leftright.gif" alt="" width="400" height="21" /></a></p>
<p>If you think there are two basic issues &#8212; how much the government should influence the economy and how much the government should influence social issues &#8212; then you are using a two-dimensional model, a model that locates each person on a plane.</p>
<p><a href="http://www.politicalcompass.org/"><img class="aligncenter" src="http://www.politicalcompass.org/images/bothaxes.gif" alt="" width="400" height="400" /></a></p>
<p>So here&#8217;s a question:  <strong>How many dimensions are there in political ideology?</strong> This question is a lot like the question &#8220;<a href="http://dx.doi.org/10.1126/science.156.3775.636">How long is the coast of Britain?</a>&#8221; (<a href="http://www.math.yale.edu/mandelbrot/web_pdfs/howLongIsTheCoastOfBritain.pdf">alt</a>): the answer depends on how you&#8217;re going to use the answer.</p>
<p>If you want to explain variance in the US House and Senate, <a href="http://www.amazon.com/dp/019514242X/">one dimension is plenty in most cases, and two dimensions is sufficient the rest of the time</a>. If you are looking at an entire (two-year) Congress or a longer period, one dimension will do a good job most of the time</p>
<p>If you are trying to explain a single bill &#8212; or each of a set of single bills &#8212; the story is more complicated. On a given bill, some legislators vote for ideological reasons, some for procedural reasons, some because of interest groups or constituent pressure, some because of bribes, some because they are taking cues from other legislators, and some just make mistakes. <strong>When you are interested in distinguishing among legislators based on aggregate voting, few dimensions are needed. When you want to explain the path a bill takes, more dimensions may be required.</strong></p>
<p>This has implications in three kinds of research we conduct based on the analysis of roll call votes:</p>
<ol>
<li><strong>Theorizing about ideology</strong>:  When building theories about ideology, we cannot blindly assume that ideology is one-dimensional.  Arguably there are one-dimensional policy areas, legislators, and specific bills, and the one-dimensional case is worth considering, but it is not anywhere near universal.</li>
<li><strong>Ideology as an explained variable in empirical work</strong>:  How do parties, interest groups, constituent pressure, and other factors affect votes? When exploring factors that affect ideology &#8212; using statistical models where ideal points on the &#8220;left-hand side&#8221; &#8212; we should not ignore that there may be multiple dimensions to explain.</li>
<li><strong>Ideology as an explanatory variable</strong>: How does ideology affect interactions with the president, the courts, and the bureaucracy? Even scholars of institutions other than Congress should remember that when ideology is on the right-hand side of the statistical model we might want to include more than one dimension.</li>
</ol>
<p>To indulge in the <a href="http://en.wikipedia.org/wiki/Snowclone">snowclone</a>, <strong>dimensionality matters</strong> when considering ideology. Make a <a href="http://www.amazon.com/dp/0201523418/">mindful</a> choice when modeling it.</p>
]]></content:encoded>
			<wfw:commentRss>http://haptonstahl.org/polimath/?feed=rss2&#038;p=75</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>The better the question, the worse the answer</title>
		<link>http://haptonstahl.org/polimath/?p=64</link>
		<comments>http://haptonstahl.org/polimath/?p=64#comments</comments>
		<pubDate>Wed, 12 May 2010 14:04:38 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[theory]]></category>
		<category><![CDATA[wrong question]]></category>

		<guid isPermaLink="false">http://haptonstahl.org/polimath/?p=64</guid>
		<description><![CDATA[Justin Wolfers wrote recently about the level of interaction between economics and other social sciences.  In particular, he wonders why economic work is not well represented in a list of the books most cited in social science research.  It&#8217;s a good question: I find many of the tools and techniques developed by economists are useful [...]]]></description>
			<content:encoded><![CDATA[<p>Justin Wolfers <a href="http://freakonomics.blogs.nytimes.com/2010/04/16/are-economists-influential-enough/">wrote</a> recently about the level of interaction between economics and other social sciences.  In particular, he wonders why economic work is not well represented in a list of the books most cited in social science research.  It&#8217;s a good question: I find many of the tools and techniques developed by economists are useful in my works studying political phenomena, and I do cite economic research.</p>
<p>One particularly thoughtful <a href="http://freakonomics.blogs.nytimes.com/2010/04/16/are-economists-influential-enough/#comment-552991">commenter</a> on Wolfers&#8217; post notes that economics combines the controversy of addressing everyday issues with the general inaccessibility of chemistry.  This conflict may make some people resist the conclusions of economists, ie. strong prior + incomprehensible evidence = small amount of updating.</p>
<p><span id="more-64"></span>The comment continues:</p>
<blockquote><p>However, the inaccessibility of economics does not merely arise through  inadvertence.  As many jokes attest, economists are not merely  unsentimental, they are ANTI-sentimental.  An economist will often revel  in the opportunity to rub people’s noses in the conclusion that their  pre-conceptions are fluffy-headed poppycock.  To many people (including  some economists, I fear), economics appears to be less a social science  than a religion, revealing to a chosen few the mighty counter-intuitive  truths by which to pass judgment on a sinful world.</p>
<p>If I’m an accomplished scholar in another discipline, to what extent am I  open and receptive to this kind of intellectual upbraiding?</p></blockquote>
<p>Another <a href="http://freakonomics.blogs.nytimes.com/2010/04/16/are-economists-influential-enough/#comment-553049">commenter</a> notes</p>
<blockquote><p>My guess is at least part of this effect is reciprocity.  Economists are  famously bad when it comes to citing relevant research from the other  social sciences.  And I am not even talking about humanistic research:  experiments from social psychology and statistical work from sociology  are often ignored when economists do related (or even nearly identical)  work.  There is a widely held perception in the other social sciences  that economists are, at best, disinterested in having interdisciplinary  conversations–or, at worst, regularly tolerate cross-disciplinary  plagiarism.  My guess is that a culture of arrogance is the ultimate  cause.</p></blockquote>
<p>These are legitimate concerns, and there are reasonable rejoinders, but I think there is a more compelling explanation than &#8220;economists are a$$holes.&#8221;</p>
<p>When we develop a theory, we throw away some of the details about the world in order to make the theory simple enough to understand.  This abstraction is an art: there are no hard rules and too few guidelines on how to make useful theories.</p>
<p>Suppose that there is only one choice: how much detail to discard.  One can (grossly over-)generalize by saying that economists throw away more detail, ignoring things like irrational behavior, while other social scientists throw away less detail.  By simplifying more, the economists gain the ability to apply very technical tools to generate results that are very reliable <em>given the assumptions they made</em>.  By simplifying less, other social scientists are left with questions that more closely resemble reality but are harder to analyze.</p>
<p>Internal validity versus external validity.  Tractability versus verisimilitude.  We&#8217;ve heard this song and dance before.  The implied question is something like Heisenberg&#8217;s <a href="http://en.wikipedia.org/wiki/Uncertainty_principle">Uncertainty Principle</a>:</p>
<p style="padding-left: 30px;">Hypothesis:  (time to analyze) * (problems with applicability) &gt;= (some constant)</p>
<p>Does this hold?  Actually, I am optimistic that it doesn&#8217;t.  Regardless, we all face choices when trying tell a <a href="http://www.jstor.org/stable/1448820">story</a> that explains something.</p>
]]></content:encoded>
			<wfw:commentRss>http://haptonstahl.org/polimath/?feed=rss2&#038;p=64</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>You&#8217;re Asking the Wrong Question, Fortunately</title>
		<link>http://haptonstahl.org/polimath/?p=55</link>
		<comments>http://haptonstahl.org/polimath/?p=55#comments</comments>
		<pubDate>Tue, 20 Apr 2010 11:49:24 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[wrong question]]></category>

		<guid isPermaLink="false">http://haptonstahl.org/polimath/?p=55</guid>
		<description><![CDATA[Today I got up, finishing a decision I started last night about how much to sleep before today.  I will choose my attire to fit the weather and strike the right tone in the classes I will teach. I will go to work and spend the day at work making optimal decisions about how to [...]]]></description>
			<content:encoded><![CDATA[<p>Today I got up, finishing a decision I started last night about how much to sleep before today.  I will choose my attire to fit the weather and strike the right tone in the classes I will teach. I will go to work and spend the day at work making optimal decisions about how to allocate my time and effort considering my immediate goals, teaching effectively and  preparing for an experiment, and longer term goals like getting along with my peers and building my tenure packet.  I will come home along a route that balances safety, convenience, fuel economy, and curiosity.  I will talk with my wife, play with my daughter, read to my son, all with an eye toward building both their individual lives and my relationships with them.  I may make a few allocation decisions about improving our house or saving for retirement.  I will decide whether to work out tomorrow morning, then  begin the decision about how much to sleep before tomorrow.</p>
<p><span id="more-55"></span>I don&#8217;t know if I&#8217;ll make the right decisions. I cannot know, even afterward, whether I did.  It turns out that it would take a very long time to check.  <a href="http://people.csail.mit.edu/costis/">Constantinos Daskalakis</a> has <a href="http://people.csail.mit.edu/costis/simplified.pdf">shown</a> that it would take a computer as complex as the entire universe longer than the lifetime of the universe to solve for Nash equilibria of many common games, like how to invest for retirement. There all I am trying to do is optimize money.  When I read to my son I am doing something harder:  I am trying to influence his education, his confidence, his happiness, and my relationship with him, all without being so bored that I fall asleep.  Most of the decisions we make all day are too hard to solve.</p>
<p>Of course, I can simplify each of these decisions to make them tractable. Any simplifying assumptions I make are wrong, and therefore I am answering the wrong question.  I can make specific, certainly wrong assumptions, I can approximate a bunch of decision characteristics by combining them into one or more stochastic elements in a simpler game.  Either way, I am limiting my rationality, or really acknowledging that my rationality is bounded. When I make one of my decisions I am not answering the question before me but rather a different, simpler, but wrong question.</p>
<p>You might object.  &#8220;Once we give up rationality, we give up prediction.  There are no limits to how we can be irrational.&#8221;  How does the joke go?  &#8220;The difference between genius and stupidity is   that genius has its limits.&#8221;  There is no limit on the answers we can get if we allow our models of human behavior to admit irrational behavior.  If you allow for heuristics, anything could happen, so we know nothing!</p>
<p>True, anything <em>could</em> happen, but we can still note that some things happen more than other things.  Data can guide us.  <a href="http://www.jstor.org/pss/1914185">We can examine which ways of simplifying are likely</a> and which are unlikely.  Assigning likelihood statements to different behaviors is the statistician&#8217;s strategy.  Some steps along this route are taken by <a href="http://en.wikipedia.org/wiki/Quantal_response_equilibrium">quantal response theory</a>. However, there is still much work to be done.</p>
<p>Acknowledging that when we model we always lose essential details, George Box said &#8220;All models are false but some models are useful.&#8221;  At some point we all have to decide to answer the wrong question, whether in the economy or just in deciding when to go to bed.  Otherwise we wouldn&#8217;t be able to get up in the morning.</p>
]]></content:encoded>
			<wfw:commentRss>http://haptonstahl.org/polimath/?feed=rss2&#038;p=55</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

