<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Anne Z.</title>
	<atom:link href="http://annezelenka.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://annezelenka.com</link>
	<description>notes on numbers and other randomness</description>
	<lastBuildDate>Sun, 22 Jan 2012 23:00:29 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='annezelenka.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>Anne Z.</title>
		<link>http://annezelenka.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://annezelenka.com/osd.xml" title="Anne Z." />
	<atom:link rel='hub' href='http://annezelenka.com/?pushpress=hub'/>
		<item>
		<title>Links for January 20, 2012</title>
		<link>http://annezelenka.com/2012/01/20/links-for-january-20-2012/</link>
		<comments>http://annezelenka.com/2012/01/20/links-for-january-20-2012/#comments</comments>
		<pubDate>Fri, 20 Jan 2012 13:51:39 +0000</pubDate>
		<dc:creator>Anne Z.</dc:creator>
				<category><![CDATA[links]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[for-profit schools]]></category>
		<category><![CDATA[higher education]]></category>
		<category><![CDATA[program evaluation]]></category>

		<guid isPermaLink="false">http://annezelenka.com/?p=1881</guid>
		<description><![CDATA[Big data market survey: Hadoop solutions [Edd Dumbill/O'Reilly Radar]. Apache Hadoop is unquestionably the center of the latest iteration of big data solutions. At its heart, Hadoop is a system for distributing computation among commodity servers. It is often used &#8230; <a href="http://annezelenka.com/2012/01/20/links-for-january-20-2012/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=annezelenka.com&amp;blog=6010417&amp;post=1881&amp;subd=mizzee&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><strong><a href="http://radar.oreilly.com/2012/01/big-data-ecosystem.html">Big data market survey: Hadoop solutions [Edd Dumbill/O'Reilly Radar]</a>.</strong></p>
<blockquote><p>Apache Hadoop is unquestionably the center of the latest iteration of big data solutions. At its heart, Hadoop is a system for distributing computation among commodity servers. It is often used with the Hadoop Hive project, which layers data warehouse technology on top of Hadoop, enabling ad-hoc analytical queries.</p></blockquote>
<p>I&#8217;m starting my first ever project with Hadoop this week&#8211;a prototype of an analytics warehouse using <a href="http://aws.amazon.com/elasticmapreduce/">Amazon Elastic MapReduce</a>. Colleagues have told me EMR is a great way to get your head around Hadoop-based data processing.</p>
<p><strong><a href="http://www.theatlantic.com/business/archive/2012/01/cbo-report-medicare-pilot-programs-dont-control-health-care-costs/251613/">CBO Report: Medicare pilot programs don&#8217;t control health-care costs [Megan McArdle/The Atlantic blogs]</a>.</strong> McArdle describes what happened with a housing-project demolition program whose pilot studies suggested  much better effects than were actually seen at scale:</p>
<blockquote><p>The initial study was small and involved highly screened people with a lot of support. And it seems to have suffered from publication bias&#8211;the most spectacular results got the most attention, even though these might just have been outliers.</p>
<p>This is distressingly common&#8211;not just in government or social-do-gooding research, but in organizations of all kinds&#8211;including corporations.</p></blockquote>
<p>Programs at scale often don&#8217;t show results as good as pilot studies of those programs. More generally in program evaluation, it&#8217;s hard to find evidence of strong (or even weak) effects of interventions. Social systems are complex; factors other than those targeted by the intervention often determine outcomes. This is something I need to communicate regularly to my colleagues and our partners&#8211;student learning is largely determined by factors other than what we have control over. That&#8217;s not to say we shouldn&#8217;t improve our course design, teaching practices, and so forth but it is to say that there aren&#8217;t many easy pickings out there for improving student outcomes.</p>
<p><strong><a href="http://blogs.reuters.com/felix-salmon/2012/01/16/for-profits-vs-not-for-profits/">For-profits vs not-for-profits [Felix Salmon/Reuters blog]</a>.</strong></p>
<blockquote><p>I know full well that a lot of not-for-profit organizations are run in a dreadful fashion; I’m just not convinced that introducing a profit motive is always or even often the best way to fix that problem&#8230;. I very much doubt that for-profit education is ever a good idea. I just don’t see how the incentives there could possibly be aligned.</p></blockquote>
<p>But the profit motive can&#8217;t provide optimal outcomes if there isn&#8217;t consumer discipline along with it. For-profit higher education is subsidized by the government in the form of grants and low-interest loans (and note that nonprofit education is subsidized in additional ways as well, in the case of public institutions). Would-be students do not have an incentive to seriously evaluate whether the education they are purchasing is worth what they pay, because there is a third-party payer involved. The situation is much like health care. Good discussion in post of the issues and controversy over for-profit higher education.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/mizzee.wordpress.com/1881/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/mizzee.wordpress.com/1881/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/mizzee.wordpress.com/1881/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/mizzee.wordpress.com/1881/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/mizzee.wordpress.com/1881/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/mizzee.wordpress.com/1881/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/mizzee.wordpress.com/1881/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/mizzee.wordpress.com/1881/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/mizzee.wordpress.com/1881/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/mizzee.wordpress.com/1881/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/mizzee.wordpress.com/1881/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/mizzee.wordpress.com/1881/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/mizzee.wordpress.com/1881/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/mizzee.wordpress.com/1881/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=annezelenka.com&amp;blog=6010417&amp;post=1881&amp;subd=mizzee&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://annezelenka.com/2012/01/20/links-for-january-20-2012/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/a17837517aa34572e477fe0ddc4d4995?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Anne Z.</media:title>
		</media:content>
	</item>
		<item>
		<title>Links for January 15, 2012</title>
		<link>http://annezelenka.com/2012/01/15/links-for-january-15-2012/</link>
		<comments>http://annezelenka.com/2012/01/15/links-for-january-15-2012/#comments</comments>
		<pubDate>Sun, 15 Jan 2012 15:30:29 +0000</pubDate>
		<dc:creator>Anne Z.</dc:creator>
				<category><![CDATA[links]]></category>
		<category><![CDATA[causation]]></category>
		<category><![CDATA[college]]></category>
		<category><![CDATA[open plan offices]]></category>
		<category><![CDATA[productivity]]></category>
		<category><![CDATA[work from home]]></category>

		<guid isPermaLink="false">http://annezelenka.com/?p=1865</guid>
		<description><![CDATA[The rise of the new group think [Susan Cain/New York Times]. Virtually all American workers now spend time on teams and some 70 percent inhabit open-plan offices, in which no one has “a room of one’s own.” During the last &#8230; <a href="http://annezelenka.com/2012/01/15/links-for-january-15-2012/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=annezelenka.com&amp;blog=6010417&amp;post=1865&amp;subd=mizzee&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><strong><a href="http://www.nytimes.com/2012/01/15/opinion/sunday/the-rise-of-the-new-groupthink.html?_r=4&amp;hp=&amp;adxnnl=1&amp;adxnnlx=1326636006-gtHHyVlOpux024X%202L/Akg&amp;pagewanted=all">The rise of the new group think [Susan Cain/New York Times]</a></strong>.</p>
<blockquote><p>Virtually all American workers now spend time on teams and some 70 percent inhabit open-plan offices, in which no one has “a room of one’s own.” During the last decades, the average amount of space allotted to each employee shrank 300 square feet, from 500 square feet in the 1970s to 200 square feet in 2010&#8230;.</p>
<p>Privacy also makes us productive. In a fascinating study known as the Coding War Games, consultants Tom DeMarco and Timothy Lister compared the work of more than 600 computer programmers at 92 companies. They found that people from the same companies performed at roughly the same level — but that there was an enormous performance gap between organizations. What distinguished programmers at the top-performing companies wasn’t greater experience or better pay. It was how much privacy, personal workspace and freedom from interruption they enjoyed. Sixty-two percent of the best performers said their workspace was sufficiently private compared with only 19 percent of the worst performers. Seventy-six percent of the worst programmers but only 38 percent of the best said that they were often interrupted needlessly.</p></blockquote>
<p>I work in an open-plan office and I rather like it, mainly because my coworkers are fun and because my clean, small, mostly quiet work area is such a nice change from my sprawling, messy, mostly noisy house. We work on a puzzle together when we&#8217;re taking a break from work and wear headphones when we want uninterrupted time. I wonder, though, if I&#8217;d be more productive with a private office or even a cubicle. I don&#8217;t achieve flow as much I&#8217;d like at work. Not sure if that&#8217;s because the job is relatively new to me or because the work environment is an obstacle.</p>
<p><strong><a href="http://www.ritholtz.com/blog/2012/01/hume-causation-science/">Hume, causation &amp; science [Barry Ritholtz/The Big Picture]</a></strong>. &#8220;We humans love a grossly over-simplified narrative.&#8221; Determining when we can attribute causation to a correlation is one of the major challenges of research design and statistical analysis.</p>
<p><strong><a href="http://www.fastcompany.com/1806307/how-to-work-from-home-like-you-mean-it">How to work from home like you mean it [Kevin Purdy/Fast Company]</a></strong>. I&#8217;m thinking of working one day a week at home to achieve some of that flow I&#8217;ve been missing. If I do, I&#8217;ll follow some of these tips so it doesn&#8217;t devolve into eight hours of Internet surfing.</p>
<p><strong><a href="http://www.washingtonpost.com/blogs/on-small-business/post/lack-of-interest-and-aptitude-keeps-students-out-of-stem-majors/2012/01/06/gIQAoDzRfP_blog.html">Lack of interest and aptitude keeps students out of STEM majors [Olga Khazan/Washington Post On Small Business blog]</a></strong>. &#8220;A <a href="http://www.washingtonpost.com/business/economy/new-study-shows-architecture-arts-degrees-yield-highest-unemployment/2012/01/03/gIQAwpaXZP_story.html" target="_blank">study released this week </a>by Georgetown University’s Center on Education and the Workforce found that recent graduates in computer science, mathematics and engineering all had unemployment rates below 9 percent (with the rates dropping below 6 percent among those who had some experience.) Conversely, the rates for graduates in architecture and the arts were 13.9 and 11.1 percent, respectively.&#8221;</p>
<p><strong><a href="http://opinionator.blogs.nytimes.com/2012/01/11/what-is-college-for-part-2/?src=recg">What is college for? (Part 2) [Gary Gutting/The New York Times]</a></strong>.</p>
<blockquote><p>Concretely, students graduating from high school should, to cite one plausible model, be able to read with understanding classic literature (from, say, Austen and Browning to Whitman and Hemingway) and write well-organized and grammatically sound essays; they should know the basic outlines of American and European history, have a good beginner’s grasp of at least two natural sciences as well as pre-calculus mathematics, along with a grounding in a foreign language.</p>
<p>Students with this sort of education would be excellent candidates for many satisfying and well-paying jobs in, for example, sales and service industries, except for those that require highly specialized skills. From the standpoint of employment, high school graduates would have no need of college unless they wanted to be accountants or engineers, pursue pre-professional programs leading to law or medical school or train for doctoral work in science or the humanities. Apart from this, the only good reason they would have for going to college would be for its intellectual culture.</p></blockquote>
<p>Compelling idea, but seems unlikely to happen because (1) our high schools are mostly incapable of providing such an education and (2) our culture is overly invested in the idea of college as the basic ticket to success in today&#8217;s economy. E.g.: <a href="http://www.joannejacobs.com/2012/01/d-c-may-require-college-application-for-all/">D.C. may require college application for all [Joanne Jacobs]</a>.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/mizzee.wordpress.com/1865/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/mizzee.wordpress.com/1865/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/mizzee.wordpress.com/1865/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/mizzee.wordpress.com/1865/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/mizzee.wordpress.com/1865/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/mizzee.wordpress.com/1865/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/mizzee.wordpress.com/1865/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/mizzee.wordpress.com/1865/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/mizzee.wordpress.com/1865/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/mizzee.wordpress.com/1865/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/mizzee.wordpress.com/1865/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/mizzee.wordpress.com/1865/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/mizzee.wordpress.com/1865/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/mizzee.wordpress.com/1865/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=annezelenka.com&amp;blog=6010417&amp;post=1865&amp;subd=mizzee&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://annezelenka.com/2012/01/15/links-for-january-15-2012/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/a17837517aa34572e477fe0ddc4d4995?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Anne Z.</media:title>
		</media:content>
	</item>
		<item>
		<title>How data science is like magic</title>
		<link>http://annezelenka.com/2012/01/07/how-data-science-is-like-magic/</link>
		<comments>http://annezelenka.com/2012/01/07/how-data-science-is-like-magic/#comments</comments>
		<pubDate>Sun, 08 Jan 2012 00:34:26 +0000</pubDate>
		<dc:creator>Anne Z.</dc:creator>
				<category><![CDATA[books]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[data science]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[the magicians]]></category>

		<guid isPermaLink="false">http://annezelenka.com/?p=1853</guid>
		<description><![CDATA[In The Magicians[1], Lev Grossman describes magic as it might exist, but he could as well be describing the real-world practice of statistical analysis or software development: As much as it was like anything, magic was like a language. And &#8230; <a href="http://annezelenka.com/2012/01/07/how-data-science-is-like-magic/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=annezelenka.com&amp;blog=6010417&amp;post=1853&amp;subd=mizzee&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>In <em><a href="http://www.amazon.com/Magicians-Novel-Lev-Grossman/dp/0452296293/ref=sr_1_1?ie=UTF8&amp;qid=1325980870&amp;sr=8-1">The Magicians</a>[1]</em>, Lev Grossman describes magic as it might exist, but he could as well be describing the real-world practice of statistical analysis or software development:</p>
<blockquote><p>As much as it was like anything, magic was like a language. And like a language, textbooks and teachers treated it as an orderly system for the purposes of teaching it, but in reality it was complex and chaotic and organic. It obeyed rules only to the extent that it felt like it, and there were almost as many special cases and one-time variations as there were rules. These Exceptions were indicated by rows of asterisks and daggers and other more obscure typographical fauna which invited the reader to peruse the many footnotes that cluttered up the margins of magical reference books like Talmudic commentary.</p>
<p>It was Mayakovsky&#8217;s [the teacher's] intention to make them memorize all these minutiae, and not only to memorize them but to absorb and internalize them. The very best spellcasters had talent, he told his captive, silent audience, but they also had unusual under-the-hood mental machinery, the delicate but powerful correlating and cross-checking engines necessary to access and manipulate and manage this vast body of information. (p149)</p></blockquote>
<p>To be a good data scientist, whether using traditional statistical techniques or machine learning algorithms (or both), you must know all the rules and approach it first as an orderly system. Then you begin to learn all the special cases and one-time variations and you study and study and practice and practice until you can almost unconsciously adjust to each unique situation that arises.</p>
<p>When I took <a href="http://en.wikipedia.org/wiki/Analysis_of_variance">ANOVA</a> in my Ph.D. program, I could hardly believe there was entire course devoted to it. But it was much like Grossman&#8217;s description above. Each week we learned new special cases and one-time variations. I did ANOVA in so many different Circumstances that now I have absorbed and internalized its application as well as the design of studies that would usefully be analyzed with it or with some more flexible variation of it (e.g., <a href="http://en.wikipedia.org/wiki/Hierarchical_linear_modeling">hierarchical linear modeling</a>). It felt cookbook at the beginning, but at the end of the course, I felt like I&#8217;d begun to develop that &#8220;unusual under-the-hood mental machinery&#8221; that Grossman suggested an effective magician in his imagined world would need.</p>
<p>That&#8217;s not to say that there aren&#8217;t important universal principles and practices and foundational knowledge to understand if you are to be an effective statistician or data miner or machine learner programmer; it&#8217;s not to say that awareness of Circumstances and methodical practice are all you need. It is to say that data science is ultimately a <strong>practice</strong> not a philosophy and you reach expertise in it through doing things over and over again, each time in slightly different ways.</p>
<p>In <em>The Magicians</em>, protagonist Quentin practices Legrand&#8217;s Hammer Charm, under thousands of different Circumstances:</p>
<blockquote><p>Page by page the Circumstances listed in the book became more and more esoteric and counterfactual. He cast Legrand&#8217;s Hammer Charm at noon and at midnight, in summer and winter, on mountaintops and a thousand yards beneath the earth&#8217;s surface. He cast the spell underwater and on the surface of the moon. He cast it in early evening during a blizzard on a beach on the island of Mangareva, which would almost certainly never happen since Mangareva is part of French Polynesia, in the South Pacific. He cast the spell as a man, as a woman, and once&#8211;was this really relevant?&#8211;as a hermaphrodite. He cast it in anger, with ambivalence, and with bitter regret. (pp150-151)</p></blockquote>
<p>Sometimes I feel like I have fit logistic regression in all these situations (perhaps not as a hermaphrodite). The next logistic regression I fit, I will say to myself &#8220;<a href="http://www.youtube.com/watch?v=3PycZtfns_U">Wax on, wax off</a>&#8221; as Quentin did when faced with a new spell that he had to practice according to each set of Circumstances.</p>
<hr />
<p><small>[1]Highly recommended, but with caveats. Read it last summer &#8212; loved it &#8212; sent it to my 15-year-old son at camp. He loved it too and bought me the sequel for Christmas. After reading the second one, I had to re-read the first. It&#8217;s <a href="http://www.amazon.com/Magicians-Novel-Lev-Grossman/product-reviews/0452296293/ref=dp_top_cm_cr_acr_txt?ie=UTF8&amp;showViewpoints=1">a polarizing book</a>. Don&#8217;t pick it up if you are offended by heavy drinking, gratuitous sex, and a wandering plot. Do pick it up if you felt like your young adulthood was marked by heavy drinking, gratuitous sex, a wandering plot, and not nearly enough magic. My son tends to read adult books so I didn&#8217;t hesitate to share it with him, but it probably would not be appropriate for most teenagers.</small></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/mizzee.wordpress.com/1853/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/mizzee.wordpress.com/1853/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/mizzee.wordpress.com/1853/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/mizzee.wordpress.com/1853/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/mizzee.wordpress.com/1853/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/mizzee.wordpress.com/1853/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/mizzee.wordpress.com/1853/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/mizzee.wordpress.com/1853/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/mizzee.wordpress.com/1853/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/mizzee.wordpress.com/1853/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/mizzee.wordpress.com/1853/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/mizzee.wordpress.com/1853/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/mizzee.wordpress.com/1853/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/mizzee.wordpress.com/1853/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=annezelenka.com&amp;blog=6010417&amp;post=1853&amp;subd=mizzee&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://annezelenka.com/2012/01/07/how-data-science-is-like-magic/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/a17837517aa34572e477fe0ddc4d4995?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Anne Z.</media:title>
		</media:content>
	</item>
		<item>
		<title>Links for January 7, 2012</title>
		<link>http://annezelenka.com/2012/01/07/links-for-january-7-2012/</link>
		<comments>http://annezelenka.com/2012/01/07/links-for-january-7-2012/#comments</comments>
		<pubDate>Sat, 07 Jan 2012 23:47:29 +0000</pubDate>
		<dc:creator>Anne Z.</dc:creator>
				<category><![CDATA[links]]></category>
		<category><![CDATA[analytics as a service]]></category>
		<category><![CDATA[for-profit schools]]></category>
		<category><![CDATA[health]]></category>
		<category><![CDATA[intermittent fasting]]></category>
		<category><![CDATA[online learning]]></category>

		<guid isPermaLink="false">http://annezelenka.com/?p=1848</guid>
		<description><![CDATA[Nutrition advice: The vitamin D-lemma [Amy Maxmen/Nature]. &#8220;The difficulty of distilling strong advice from weak evidence.&#8221; This is a key challenge for researchers/statisticians/data scientists in any domain, not just in health. Will Amazon offer analytics as a service? [Quentin Hardy/Bits]. &#8230; <a href="http://annezelenka.com/2012/01/07/links-for-january-7-2012/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=annezelenka.com&amp;blog=6010417&amp;post=1848&amp;subd=mizzee&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.nature.com/news/2011/110706/full/475023a.html">Nutrition advice: The vitamin D-lemma [Amy Maxmen/Nature]</a>. &#8220;The difficulty of distilling strong advice from weak evidence.&#8221; This is a key challenge for researchers/statisticians/data scientists in any domain, not just in health.</p>
<p><a href="http://bits.blogs.nytimes.com/2012/01/04/will-amazon-offer-analytics-as-a-service/">Will Amazon offer analytics as a service? [Quentin Hardy/Bits]</a>. Interesting to get an idea what that might look like. I don&#8217;t think, though, this would compete with SAS and similar software as the post implies. Would someone looking to implement a product recommendation engine implement it in SAS? Probably not. For example, <a href="http://www.dataspora.com/2009/02/predictive-analytics-using-r/">Google is said to use R for model exploration and prototyping</a>, then puts them into production using Python or C++. I feel a &#8220;choosing your analytics tool&#8221; post coming on.</p>
<p><a href="http://www.huffingtonpost.com/2011/12/30/community-college-for-profit-college_n_1174243.html">Community college budget cuts drive students to for-profit school [Chris Kirkham/Huffington Post]</a>. Balanced coverage of why students turn to for-profit schools and the pros and cons of such choices. My observation: community college tuition is artificially low due to government subsidization while for-profit tuition is artificially high, again because of government interference (in the form of financial aid). No market forces to bring about a reasonable balance between supply and demand. The big losers are students (and taxpayers).</p>
<p><a href="http://techcrunch.com/2012/01/05/benchprep/">Benchprep is codecademy for any subject, high school to med school [Josh Constine/TechCrunch]</a>. &#8220;Eventually, publishers might get a clue that interactive digital education is going to destroy their paper book business. If they’re smart they’ll start developing their own courses or raise licensing fees. Until then though, BenchPrep will be the savior of anyone frustrated by the static book-learning experience.&#8221; I&#8217;m pretty certain some big textbook publishers <a href="http://pearsonhighered.com/educator/mylabmastering/index.page">see that already</a>.</p>
<p><a href="http://ideas.time.com/2011/12/14/forget-dieting-try-intermittent-fasting/">Forget dieting, try intermittent fasting [Josh Ozersky/Time Ideas]</a>. &#8220;And that’s why instead of eating healthier, I’m going for longer stretches without eating so I can actually enjoy a whole meal. I don’t starve myself; I drink a protein shake if I get hungry and consume endless glasses of diet iced tea. People tell me this is bad, that I will soon gain back all the weight I’ve lost – and these rejoinders are always given with a smug malice, as if the people uttering them actually despise me for trying to compensate for the pleasures of the plate.&#8221;</p>
<p>I fast most days at work until about 2 or 3 pm, then have a small snack. I eat whatever I want once I get home from work around 5 pm. I find this allows me to eat generally what I want while maintaining my weight at a level I&#8217;m happy with. I have found, like Josh, that people get really upset about this plan, almost offended that I would eat this way. Funny how everyone thinks they know what is healthy and what is not, despite the difficulties in determining that (see first link in this post).</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/mizzee.wordpress.com/1848/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/mizzee.wordpress.com/1848/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/mizzee.wordpress.com/1848/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/mizzee.wordpress.com/1848/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/mizzee.wordpress.com/1848/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/mizzee.wordpress.com/1848/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/mizzee.wordpress.com/1848/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/mizzee.wordpress.com/1848/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/mizzee.wordpress.com/1848/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/mizzee.wordpress.com/1848/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/mizzee.wordpress.com/1848/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/mizzee.wordpress.com/1848/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/mizzee.wordpress.com/1848/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/mizzee.wordpress.com/1848/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=annezelenka.com&amp;blog=6010417&amp;post=1848&amp;subd=mizzee&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://annezelenka.com/2012/01/07/links-for-january-7-2012/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/a17837517aa34572e477fe0ddc4d4995?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Anne Z.</media:title>
		</media:content>
	</item>
		<item>
		<title>Links for December 30, 2011</title>
		<link>http://annezelenka.com/2011/12/30/links-for-december-30-2011/</link>
		<comments>http://annezelenka.com/2011/12/30/links-for-december-30-2011/#comments</comments>
		<pubDate>Fri, 30 Dec 2011 23:54:28 +0000</pubDate>
		<dc:creator>Anne Z.</dc:creator>
				<category><![CDATA[links]]></category>

		<guid isPermaLink="false">http://annezelenka.com/?p=1828</guid>
		<description><![CDATA[Yes, and&#8230; [W.P. McNeill/Corner Cases]. Living by the &#8220;yes, and&#8221; ethos of improvisational comedy. Always build on what the other person said&#8211;stay open to their insight and direction. Be a pliable weed not a concrete pylon. Don&#8217;t get mired in &#8230; <a href="http://annezelenka.com/2011/12/30/links-for-december-30-2011/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=annezelenka.com&amp;blog=6010417&amp;post=1828&amp;subd=mizzee&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><a href="http://cornercases.wordpress.com/2011/12/22/yes-and/">Yes, and&#8230; [W.P. McNeill/Corner Cases]</a>. Living by the &#8220;yes, and&#8221; ethos of improvisational comedy. Always build on what the other person said&#8211;stay open to their insight and direction. Be a pliable weed not a concrete pylon. Don&#8217;t get mired in dogma. I&#8217;m thinking this would work equally well in interactions with coworkers as with kids.</p>
<p><a href="http://marginalrevolution.com/marginalrevolution/2011/11/college-has-been-oversold.html">College has been oversold [Alex Tabarrok/Marginal Revolution]</a>. The total number of students graduating from college is way up, but the numbers graduating with STEM degrees haven&#8217;t increased. That&#8217;s bad for individuals and bad for the economy. &#8220;An argument can be made for subsidizing students in fields with potentially large spillovers, such as microbiology, chemical engineering, nuclear physics and computer science. There is little justification for subsidizing sociology, dance and English majors.&#8221;</p>
<p><a href="http://timkastelle.org/blog/2010/03/you-have-to-break-connections-to-get-your-ideas-to-spread/">You have to break connections to get your ideas to spread [Tim Kastelle/Innovation Leadership Network</a>. Innovation requires disruption. "When you come up with a great new idea, you need to think about this economic network in two ways. The first is: how can I connect to all of the complementary parts of the economy that are needed to get my idea to work? The second is: if I’m going to get my idea to spread, which of these existing connections need to be broken?"</p>
<p><a href="https://www.mckinseyquarterly.com/The_second_economy_2853">The second economy [W. Brian Arthur/McKinsey Quarterly]</a>. We are in the process of building out the economy&#8217;s neural system, what Arthur calls &#8220;the second economy&#8221; growing up alongside the first economy, the industrial economy. Downside: loss of jobs as computers take over.</p>
<p><a href="http://www.stats.ox.ac.uk/~ripley/Nelder80.pdf">Selecting amongst large classes of models [Brian D. Ripley] (pdf)</a>. We have the data and the computational resources to &#8220;trawl through literally thousands of models (and in some cases many more).&#8221; How to pick among them? A subject I intend to learn a lot more about in 2012.</p>
<p><a href="http://www.forbes.com/sites/danwoods/2011/12/07/curing-the-big-data-storage-fetish/">Curing the big data storage fetish [Dan Woods/Forbes]</a>. &#8220;One popular way to express lust for big data for its own sake is to create a gargantuan Hadoop cluster.&#8221; Not enough to just store the data, need to build a data-driven culture. &#8220;But how do you create  a company culture like CapitalOne or <a href="http://www.forbes.com/companies/google/">Google</a> or eBay or Zynga or LinkedIn, where data is essentially part of the management team? At all of these companies there are data scientists, the elite professionals, but there are also swarms of data enthusiasts, people who are eager to use data to help do their jobs better.&#8221;</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/mizzee.wordpress.com/1828/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/mizzee.wordpress.com/1828/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/mizzee.wordpress.com/1828/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/mizzee.wordpress.com/1828/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/mizzee.wordpress.com/1828/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/mizzee.wordpress.com/1828/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/mizzee.wordpress.com/1828/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/mizzee.wordpress.com/1828/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/mizzee.wordpress.com/1828/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/mizzee.wordpress.com/1828/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/mizzee.wordpress.com/1828/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/mizzee.wordpress.com/1828/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/mizzee.wordpress.com/1828/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/mizzee.wordpress.com/1828/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=annezelenka.com&amp;blog=6010417&amp;post=1828&amp;subd=mizzee&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://annezelenka.com/2011/12/30/links-for-december-30-2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/a17837517aa34572e477fe0ddc4d4995?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Anne Z.</media:title>
		</media:content>
	</item>
		<item>
		<title>Honey, I shrunk the statistician</title>
		<link>http://annezelenka.com/2011/12/29/honey-i-shrunk-the-statistician/</link>
		<comments>http://annezelenka.com/2011/12/29/honey-i-shrunk-the-statistician/#comments</comments>
		<pubDate>Fri, 30 Dec 2011 01:34:09 +0000</pubDate>
		<dc:creator>Anne Z.</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://annezelenka.com/?p=1798</guid>
		<description><![CDATA[In my Ph.D. program, I learned all about how to analyze small data. I learned rules of thumb for how much data I needed to run a particular analysis, and what to do if I didn&#8217;t have enough. I worked with &#8230; <a href="http://annezelenka.com/2011/12/29/honey-i-shrunk-the-statistician/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=annezelenka.com&amp;blog=6010417&amp;post=1798&amp;subd=mizzee&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>In my Ph.D. program, I learned all about how to analyze <strong>small data</strong>. I learned rules of thumb for how much data I needed to run a particular analysis, and what to do if I didn&#8217;t have enough. I worked with what seemed now (and even seemed then) to be toy data sets, except they weren&#8217;t toys, because when you&#8217;re running a psychological experiment you might be lucky to have 30 participants and when you&#8217;re analyzing the whole world&#8217;s math performance (think <a href="http://nces.ed.gov/timss/">TIMSS</a>), you can do it with a data set less than a gigabyte in size.</p>
<div id="attachment_1803" class="wp-caption alignright" style="width: 229px"><a style="color:#df0000;line-height:23px;font-size:14px;" href="http://www.victorianweb.org/art/illustration/tenniel/alice/1.4.html"><img class="size-medium wp-image-1803 " style="line-height:23px;font-size:14px;border-color:initial;border-style:initial;" title="Alice in Wonderland" src="http://mizzee.files.wordpress.com/2011/12/alice.jpg?w=219&#038;h=300" alt="" width="219" height="300" /></a><p class="wp-caption-text">image courtesy Victorian Web</p></div>
<p>I did some program evaluation while I finished my degree and sometimes colleagues would lament, &#8220;we don&#8217;t have enough data!&#8221; Usually, we did. We could thank <a href="http://en.wikipedia.org/wiki/Ronald_Fisher">Sir Ronald Aylmer Fisher</a> for that. He worked with small data in agricultural settings and gave us the venerable ANOVA technique, which works fine with just a handful of cases per group (assuming balanced group sizes, normality, and homoscedasticity). Maybe we might give a nod to <a href="http://en.wikipedia.org/wiki/William_Sealy_Gosset">William Sealy Gosset</a>, too, for introducing Student&#8217;s t distribution, which helps when the Central Limit Theorem hasn&#8217;t kicked in yet and brought us to normality.</p>
<p>But Sir Ronald and Student can&#8217;t help me now. I&#8217;m down a rabbit hole&#8230; in some sort of web-scale wonderland of <strong>big data</strong>. I feel like Alice after drinking the magic potion, too small to reach the key on the table. The data is so much broader and bigger than I am, so much broader and bigger than my puny methods and my puny desktop R environment that wants to suck everything into memory in order to analyze it.</p>
<p>I stay awake at night thinking how to analyze all this data and deliver on its promise, how to analyze across schools and courses and so many, many students, not to mention all their clickstreams. How can I get through the locked door and experience the rest of wonderland when I&#8217;m so small and the data&#8217;s so big? I could sample the data, I think, and then I&#8217;d be in the realm where I&#8217;m comfortable, dealing with sampling distributions and generalizing to a population and applying the small-data methods I know already. Perhaps I can extract it by subset &#8212; by subsets of like courses, perhaps, or by school (I&#8217;m doing that already&#8211;not scalable and doesn&#8217;t address some of the most interesting questions). What about trying out <a href="http://www.revolutionanalytics.com/products/enterprise-big-data.php">Revolutions&#8217; big data support for R</a>? Or maybe I can apply haute big-data techniques: Hadoop-ify it (<a href="http://hive.apache.org/">Hive</a>, <a href="http://pig.apache.org/">Pig</a>, <a href="http://hbase.apache.org/">HBase</a>???) then use simplistic (<a href="http://en.wikipedia.org/wiki/Embarrassingly_parallel">embarrassingly parallel</a>) algorithms with MapReduce. Problem is, none of the methods I like to use and seem appropriate for educational settings (multilevel modeling for example) are easily parallelized. I&#8217;m stumped.</p>
<p>It&#8217;s okay to be stumped, I think &#8212; part of creation is living with uncertainty:</p>
<blockquote><p>Most people who are not consummate creators avoid tension. They want quick answers. They don’t like living in the realm of not knowing something they want to know. They have an intolerance for those moments in the creative process in which you have no idea how to get from where you are to where you want to be. Actually, this is one of the very best moments there are. This is when something completely original can be born, when you go beyond your usual ways of addressing similar situations, where you can drive the creative process into high gear. [Robert Fritz on <a href="http://www.robertfritz.com/index.php?content=writingnr&amp;news_id=181">supercharging the creative process</a>]</p></blockquote>
<p>Alice ate some cake that made her bigger. Is there some cake that will make me and my methods big enough to answer the questions I want answered? For now I&#8217;m in the realm of not knowing but I hope in 2012 I will have some answers: first, answers about how to make myself big again, and second, answers from the data.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/mizzee.wordpress.com/1798/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/mizzee.wordpress.com/1798/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/mizzee.wordpress.com/1798/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/mizzee.wordpress.com/1798/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/mizzee.wordpress.com/1798/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/mizzee.wordpress.com/1798/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/mizzee.wordpress.com/1798/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/mizzee.wordpress.com/1798/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/mizzee.wordpress.com/1798/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/mizzee.wordpress.com/1798/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/mizzee.wordpress.com/1798/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/mizzee.wordpress.com/1798/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/mizzee.wordpress.com/1798/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/mizzee.wordpress.com/1798/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=annezelenka.com&amp;blog=6010417&amp;post=1798&amp;subd=mizzee&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://annezelenka.com/2011/12/29/honey-i-shrunk-the-statistician/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/a17837517aa34572e477fe0ddc4d4995?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Anne Z.</media:title>
		</media:content>

		<media:content url="http://mizzee.files.wordpress.com/2011/12/alice.jpg?w=219" medium="image">
			<media:title type="html">Alice in Wonderland</media:title>
		</media:content>
	</item>
		<item>
		<title>Dissertation topic: Constructing predictive indexes</title>
		<link>http://annezelenka.com/2011/08/14/dissertation-topic/</link>
		<comments>http://annezelenka.com/2011/08/14/dissertation-topic/#comments</comments>
		<pubDate>Sun, 14 Aug 2011 16:41:22 +0000</pubDate>
		<dc:creator>Anne Z.</dc:creator>
				<category><![CDATA[diary of a doctoral student]]></category>
		<category><![CDATA[psychometrics]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://annezelenka.com/?p=1762</guid>
		<description><![CDATA[The actual working title of my dissertation is: Modeling Social Participation as Predictive of Life Satisfaction and Social Connectedness: Scale or Index? When I tell people my topic, I usually start with the domain area: social participation as related to &#8230; <a href="http://annezelenka.com/2011/08/14/dissertation-topic/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=annezelenka.com&amp;blog=6010417&amp;post=1762&amp;subd=mizzee&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>The actual working title of my dissertation is: <em>Modeling Social Participation as Predictive of Life Satisfaction and Social Connectedness: Scale or Index?</em></p>
<p>When I tell people my topic, I usually start with the domain area: social participation as related to life satisfaction in older U.S. adults (my data set is people age 65 and over from the <a href="http://hrsonline.isr.umich.edu/">Health and Retirement Study</a>), but really, the topic is a statistical and measurement one. Participation happens to be something I&#8217;m personally interested in and fits the statistical problem area, but I could do this same project in a variety of domains with a range of constructs. Maybe I ought to change my elevator speech to start with the statistical/measurement part.</p>
<p>Most <a href="http://annezelenka.com/2010/07/28/data-science-dont-forget-psychometrics/">psychometrics</a> concerns itself with the measurement of latent psychological constructs like attitudes, intelligence, academic achievement and so forth. Psychometricians have developed sophisticated means of constructing instruments (surveys or assessments, for example) that can measure these latent constructs. The approach taken is often based on either classical test theory or item response theory. Either way, the assumption is that observed data (such as a student&#8217;s answers to test questions or a subject&#8217;s survey responses) are caused by whatever unobserved trait is intended to be measured.</p>
<p>However, there are some things we want to measure that don&#8217;t fit this model. Social participation is one of them. Participation instruments generally ask the respondent to report his or her level of participation in various activities. In a latent factor setting, you would then assume some underlying level of participation that gave rise to the observed frequencies of participation. That&#8217;s not quite right though. If someone increases their participation in some area &#8212; say by joining an investment club &#8212; their overall level of participation goes up. The increase in participation in the investment club seems causally prior to the increase in overall participation. This is the opposite direction of causality than that proposed by traditional psychometric models.</p>
<p><img class="alignleft size-medium wp-image-1766" style="border-color:initial;border-style:initial;" title="formative vs reflective" src="http://mizzee.files.wordpress.com/2011/08/formative-vs-reflective.jpg?w=300&#038;h=220" alt="" width="300" height="220" /></p>
<p>Some people call a measurement instrument developed by some sort of summation of disparate items an index rather than a scale, where a scale follows the latent factor model. The development of such indexes follows a so-called formative measurement model, where what you&#8217;re trying to measure is formed of what you observe, in contrast to the development of scales that follows a reflective measurement model, where what you observe reflects the underlying latent factor of interest. In the diagram, the first figure represents formative measurement (observed indicators x1-x3 cause the latent construct eta 1) and the second figure represents reflective (observed indicators y1 to y3 reflect the level of the latent construct).</p>
<p>There has been plenty of <a href="http://neoacademic.com/2011/04/25/stats-and-methods-urban-legends-1-formative-measurement/">criticism of formative measurement</a>, but I think it can be made useful, and that&#8217;s the aim of my dissertation project. I&#8217;m now at the analysis stage and just beginning to really understand the usefulness and potential of formative indexes.</p>
<p>As an aside, I don&#8217;t like to call formative measurement &#8220;measurement.&#8221; I prefer to think of it as &#8220;modeling.&#8221; I think what you&#8217;re doing with index development is constructing a one- or few-number summary of a lot of individual data items in a way that predicts outcomes of interest. Think of the Apgar score as a good example. It gives you a one number summary of the health of the baby and its likelihood to survive and thrive, but you&#8217;re not measuring one thing in particular about the baby. Well, maybe you are measuring overall health. Hmmmm.</p>
<p><em>To be continued&#8230;</em></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/mizzee.wordpress.com/1762/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/mizzee.wordpress.com/1762/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/mizzee.wordpress.com/1762/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/mizzee.wordpress.com/1762/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/mizzee.wordpress.com/1762/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/mizzee.wordpress.com/1762/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/mizzee.wordpress.com/1762/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/mizzee.wordpress.com/1762/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/mizzee.wordpress.com/1762/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/mizzee.wordpress.com/1762/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/mizzee.wordpress.com/1762/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/mizzee.wordpress.com/1762/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/mizzee.wordpress.com/1762/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/mizzee.wordpress.com/1762/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=annezelenka.com&amp;blog=6010417&amp;post=1762&amp;subd=mizzee&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://annezelenka.com/2011/08/14/dissertation-topic/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/a17837517aa34572e477fe0ddc4d4995?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Anne Z.</media:title>
		</media:content>

		<media:content url="http://mizzee.files.wordpress.com/2011/08/formative-vs-reflective.jpg?w=300" medium="image">
			<media:title type="html">formative vs reflective</media:title>
		</media:content>
	</item>
		<item>
		<title>Big ideas require social connection</title>
		<link>http://annezelenka.com/2011/08/14/big-ideas/</link>
		<comments>http://annezelenka.com/2011/08/14/big-ideas/#comments</comments>
		<pubDate>Sun, 14 Aug 2011 15:38:40 +0000</pubDate>
		<dc:creator>Anne Z.</dc:creator>
				<category><![CDATA[that&#039;s random]]></category>

		<guid isPermaLink="false">http://annezelenka.com/?p=1748</guid>
		<description><![CDATA[Neal Gabler in The Elusive Big Idea thinks online connectedness implies a &#8220;post-idea world&#8221;: It is certainly no accident that the post-idea world has sprung up alongside the social networking world. Even though there are sites and blogs dedicated to &#8230; <a href="http://annezelenka.com/2011/08/14/big-ideas/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=annezelenka.com&amp;blog=6010417&amp;post=1748&amp;subd=mizzee&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Neal Gabler in <a href="http://www.nytimes.com/2011/08/14/opinion/sunday/the-elusive-big-idea.html">The Elusive Big Idea</a> thinks online connectedness implies a &#8220;post-idea world&#8221;:</p>
<blockquote><p>It is certainly no accident that the post-idea world has sprung up alongside the social networking world. Even though there are sites and blogs dedicated to ideas, Twitter, Facebook, Myspace, Flickr, etc., the most popular sites on the Web, are basically information exchanges, designed to feed the insatiable information hunger, though this is hardly the kind of information that generates ideas. It is largely useless except insofar as it makes the possessor of the information feel, well, informed. Of course, one could argue that these sites are no different than conversation was for previous generations, and that conversation seldom generated big ideas either, and one would be right.</p>
<p>But the analogy isn’t perfect. For one thing, social networking sites are the primary form of communication among young people, and they are supplanting print, which is where ideas have typically gestated. For another, social networking sites engender habits of mind that are inimical to the kind of deliberate discourse that gives rise to ideas. Instead of theories, hypotheses and grand arguments, we get instant 140-character tweets about eating a sandwich or watching a TV show. While social networking may enlarge one’s circle and even introduce one to strangers, this is not the same thing as enlarging one’s intellectual universe. Indeed, the gab of social networking tends to shrink one’s universe to oneself and one’s friends, while thoughts organized in words, whether online or on the page, enlarge one’s focus.</p></blockquote>
<p>[<a href="http://www.stoweboyd.com/post/8906225520/the-elusive-big-idea-neal-gabler">via Stowe Boyd</a>]</p>
<p>But big ideas have always rested on some sort of social connection across people and they haven’t always required the written word. Big ideas don’t issue forth from the head of one really smart person, working alone, reading, reading, reading then Eureka! No, they arise from the synthesis of multiple ideas and practical knowledge that together are greater than the sum of their parts. What’s required for this? That people be in some sort of social connection with each other.</p>
<p><strong>Big idea: Cliff dwelling</strong></p>
<p>Last weekend, we visited <a href="http://www.nps.gov/meve/index.htm">Mesa Verde</a>, an archeological site and national park that shows how the Ancestral Puebloans lived in the time period from roughly 550 to 1300 A.D. While the cliff dwellings themselves inspired awe (and sometimes required special tickets and patience to tour), it was more interesting to me to see the progression of architecture at sites along the Mesa Top Loop Road. Here’s where you could see the Puebloans build towards <a href="http://www.nps.gov/meve/forteachers/upload/ancestral_puebloans.pdf">the big idea of cliff dwellings</a>. From 550 to 750 A.D., they mostly lived in pithouses on top of mesas but sometimes in cliff alcoves, then progressed to adobe houses clustered into villages. By 1000 A.D. they had developed stone masonry techniques for constructing buildings two or three stories high with 50 or more rooms. It was about 1200 A.D. that they moved their buildings into cliff alcoves, a spectacular form of architecture with practical and aesthetic benefits.</p>
<p><a href="http://mizzee.files.wordpress.com/2011/08/spruce-tree-house.jpg"><img class="alignleft size-medium wp-image-1752" title="Spruce Tree House" src="http://mizzee.files.wordpress.com/2011/08/spruce-tree-house.jpg?w=300&#038;h=176" alt="" width="300" height="176" /></a>Who came up with the big idea to build a multi-story stone dwelling in the shaded and protected alcove of a rocky cliff? It certainly wasn&#8217;t one person acting in isolation and doing a lot of reading and writing. This big idea did not rest upon the written word and lots of deep thought but rather on social knowledge and practice of stone masonry construction and on ideas and experience about where the best place to live was (mesa top or cliff alcove?). If the Ancestral Puebloans did have Twitter, I could imagine them using it share tips about how best to construct a beautiful and sturdy multi-story dwelling, sited for protection and convenience. Most big ideas rest upon a wealth of little bits of knowledge that don&#8217;t need to be written up as a journal article or book.</p>
<p><strong>Many ways to connect</strong></p>
<p>How we connect differs depending on where the people we want to connect with are. In the 17<sup>th</sup> and 18<sup>th</sup> centuries, French intellectuals gathered in salons. In the 20<sup>th</sup> century in the U.S. and other western countries, connecting through higher education channels and journal articles and at government-funded research centers was a good bet. Now, some people connect online using tools like Twitter and Facebook and Google+. This doesn&#8217;t necessarily replace long-form writing but it does complement it &#8212; making the generation of big ideas more, not less, likely.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/mizzee.wordpress.com/1748/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/mizzee.wordpress.com/1748/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/mizzee.wordpress.com/1748/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/mizzee.wordpress.com/1748/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/mizzee.wordpress.com/1748/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/mizzee.wordpress.com/1748/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/mizzee.wordpress.com/1748/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/mizzee.wordpress.com/1748/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/mizzee.wordpress.com/1748/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/mizzee.wordpress.com/1748/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/mizzee.wordpress.com/1748/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/mizzee.wordpress.com/1748/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/mizzee.wordpress.com/1748/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/mizzee.wordpress.com/1748/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=annezelenka.com&amp;blog=6010417&amp;post=1748&amp;subd=mizzee&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://annezelenka.com/2011/08/14/big-ideas/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/a17837517aa34572e477fe0ddc4d4995?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Anne Z.</media:title>
		</media:content>

		<media:content url="http://mizzee.files.wordpress.com/2011/08/spruce-tree-house.jpg?w=300" medium="image">
			<media:title type="html">Spruce Tree House</media:title>
		</media:content>
	</item>
		<item>
		<title>links for 2011-03-05</title>
		<link>http://annezelenka.com/2011/03/05/links-for-2011-03-05/</link>
		<comments>http://annezelenka.com/2011/03/05/links-for-2011-03-05/#comments</comments>
		<pubDate>Sat, 05 Mar 2011 16:01:56 +0000</pubDate>
		<dc:creator>Anne Z.</dc:creator>
				<category><![CDATA[links]]></category>

		<guid isPermaLink="false">http://mizzee.wordpress.com/2011/03/05/links-for-2011-03-05/</guid>
		<description><![CDATA[Richard Hamming: You and Your Research &#34;I claim that some of the reasons why so many people who have greatness within their grasp don&#039;t succeed are: they don&#039;t work on important problems, they don&#039;t become emotionally involved, they don&#039;t try &#8230; <a href="http://annezelenka.com/2011/03/05/links-for-2011-03-05/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=annezelenka.com&amp;blog=6010417&amp;post=1738&amp;subd=mizzee&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<ul class="delicious">
<li>
<div class="delicious-link"><a href="http://www.paulgraham.com/hamming.html">Richard Hamming: You and Your Research</a></div>
<div class="delicious-extended">&quot;I claim that some of the reasons why so many people who have greatness within their grasp don&#039;t succeed are: they don&#039;t work on important problems, they don&#039;t become emotionally involved, they don&#039;t try and change what is difficult to some other situation which is easily done but is still important, and they keep giving themselves alibis why they don&#039;t. They keep saying that it is a matter of luck.&quot;</div>
<div class="delicious-tags">(tags: <a href="http://www.delicious.com/annez/research">research</a> <a href="http://www.delicious.com/annez/productivity">productivity</a> <a href="http://www.delicious.com/annez/science">science</a> <a href="http://www.delicious.com/annez/motivation">motivation</a> <a href="http://www.delicious.com/annez/inspiration">inspiration</a>)</div>
</li>
</ul>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/mizzee.wordpress.com/1738/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/mizzee.wordpress.com/1738/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/mizzee.wordpress.com/1738/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/mizzee.wordpress.com/1738/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/mizzee.wordpress.com/1738/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/mizzee.wordpress.com/1738/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/mizzee.wordpress.com/1738/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/mizzee.wordpress.com/1738/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/mizzee.wordpress.com/1738/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/mizzee.wordpress.com/1738/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/mizzee.wordpress.com/1738/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/mizzee.wordpress.com/1738/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/mizzee.wordpress.com/1738/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/mizzee.wordpress.com/1738/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=annezelenka.com&amp;blog=6010417&amp;post=1738&amp;subd=mizzee&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://annezelenka.com/2011/03/05/links-for-2011-03-05/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/a17837517aa34572e477fe0ddc4d4995?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Anne Z.</media:title>
		</media:content>
	</item>
		<item>
		<title>links for 2011-03-04</title>
		<link>http://annezelenka.com/2011/03/04/links-for-2011-03-04/</link>
		<comments>http://annezelenka.com/2011/03/04/links-for-2011-03-04/#comments</comments>
		<pubDate>Fri, 04 Mar 2011 16:02:52 +0000</pubDate>
		<dc:creator>Anne Z.</dc:creator>
				<category><![CDATA[links]]></category>

		<guid isPermaLink="false">http://mizzee.wordpress.com/2011/03/04/links-for-2011-03-04/</guid>
		<description><![CDATA[Richard Hamming: You and Your Research &#34;I claim that some of the reasons why so many people who have greatness within their grasp don&#039;t succeed are: they don&#039;t work on important problems, they don&#039;t become emotionally involved, they don&#039;t try &#8230; <a href="http://annezelenka.com/2011/03/04/links-for-2011-03-04/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=annezelenka.com&amp;blog=6010417&amp;post=1737&amp;subd=mizzee&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<ul class="delicious">
<li>
<div class="delicious-link"><a href="http://www.paulgraham.com/hamming.html">Richard Hamming: You and Your Research</a></div>
<div class="delicious-extended">&quot;I claim that some of the reasons why so many people who have greatness within their grasp don&#039;t succeed are: they don&#039;t work on important problems, they don&#039;t become emotionally involved, they don&#039;t try and change what is difficult to some other situation which is easily done but is still important, and they keep giving themselves alibis why they don&#039;t. They keep saying that it is a matter of luck.&quot;</div>
<div class="delicious-tags">(tags: <a href="http://www.delicious.com/annez/research">research</a> <a href="http://www.delicious.com/annez/productivity">productivity</a> <a href="http://www.delicious.com/annez/science">science</a> <a href="http://www.delicious.com/annez/motivation">motivation</a> <a href="http://www.delicious.com/annez/inspiration">inspiration</a>)</div>
</li>
</ul>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/mizzee.wordpress.com/1737/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/mizzee.wordpress.com/1737/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/mizzee.wordpress.com/1737/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/mizzee.wordpress.com/1737/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/mizzee.wordpress.com/1737/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/mizzee.wordpress.com/1737/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/mizzee.wordpress.com/1737/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/mizzee.wordpress.com/1737/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/mizzee.wordpress.com/1737/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/mizzee.wordpress.com/1737/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/mizzee.wordpress.com/1737/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/mizzee.wordpress.com/1737/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/mizzee.wordpress.com/1737/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/mizzee.wordpress.com/1737/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=annezelenka.com&amp;blog=6010417&amp;post=1737&amp;subd=mizzee&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://annezelenka.com/2011/03/04/links-for-2011-03-04/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/a17837517aa34572e477fe0ddc4d4995?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Anne Z.</media:title>
		</media:content>
	</item>
	</channel>
</rss>
