<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Research Pipeline Blog &#187; data visualization</title>
	<atom:link href="http://www.researchpipeline.com/wordpress/category/data-visualization/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.researchpipeline.com/wordpress</link>
	<description>Notes about data and analysis</description>
	<lastBuildDate>Tue, 20 Dec 2011 04:35:15 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Earthquakes vs. Time of Day</title>
		<link>http://www.researchpipeline.com/wordpress/2011/12/08/earthquakes-vs-time-of-day/</link>
		<comments>http://www.researchpipeline.com/wordpress/2011/12/08/earthquakes-vs-time-of-day/#comments</comments>
		<pubDate>Fri, 09 Dec 2011 05:28:56 +0000</pubDate>
		<dc:creator>Lyndie Chiou</dc:creator>
				<category><![CDATA[data mining]]></category>
		<category><![CDATA[data visualization]]></category>
		<category><![CDATA[datasets]]></category>
		<category><![CDATA[earth]]></category>
		<category><![CDATA[earthquakes]]></category>

		<guid isPermaLink="false">http://www.researchpipeline.com/wordpress/?p=297</guid>
		<description><![CDATA[
Time and tide wait for no one. Add to that: earthquakes. I live in the San Francisco Bay Area, a.k.a. &#8220;earthquake country&#8221;, in a small house built in the 1950s before earthquake building codes had been created. Within the next 30 years, the USGS tells us we can expect a &#8220;big one&#8221; in the East [...]]]></description>
			<content:encoded><![CDATA[
<div id="attachment_444" class="wp-caption aligncenter" style="width: 510px"><a href="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/12/world-map-of-earthquakes-small-version.png"><img class="size-full wp-image-444" title="Map of world earthquakes magnitude 5 or greater" src="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/12/world-map-of-earthquakes-small-version.png" alt="" width="500" height="258" /></a><p class="wp-caption-text">66,725 Earthquakes from 1973 - 2011 with color indicating magnitude</p></div>
<p>Time and tide wait for no one. Add to that: earthquakes. I live in the San Francisco Bay Area, a.k.a. &#8220;earthquake country&#8221;, in a small house built in the 1950s before earthquake building codes had been created. Within the next 30 years, the USGS tells us we can expect a &#8220;big one&#8221; in the East Bay right along the fault where I dwell.</p>
<p>So here&#8217;s my question: If there&#8217;s a 30-year window for the next big one to occur, can I at least know the most likely time of day? This is not a crazy question. The time of day is really just a way of expressing where the sun is located with respect to your geographical location. The moon is responsible for sweeping the tides all around the Earth. So it seems reasonable to think that the moon or the sun may at least be an influence in &#8220;Earth tides&#8221; which might act as a trigger for earthquakes. Here&#8217;s a quick sketch showing my hypothesis:</p>
<p><a href="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/12/hypothesis1.png"><img class="aligncenter size-full wp-image-449" title="Hypothesis for earth tide quakes" src="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/12/hypothesis1.png" alt="" width="450" height="406" /></a></p>
<p>This question turned out to be fairly straightforward to answer, and I&#8217;ll cut to the chase and say: no, not really. Some hours are a teeny bit more earthquake-prone than others, but the variations proved not to be statistically significant.</p>
<h3><strong>The Method</strong></h3>
<p>I downloaded all earthquakes of magnitude 5 or greater from 1973 through mid-2011 from the <a href="http://earthquake.usgs.gov/earthquakes/eqarchives/epic/epic_global.php" target="_blank">USGS Global Earthquake Search website</a>. This gave me a list of 66,725 earthquakes, a reasonably sized dataset. I mapped the positions and color-coded magnitudes of all 66,725 earthquakes (green = mag 5.0 up through red = mag 9.1), shown at the top of this blog entry.</p>
<p>It&#8217;s an interesting fact that these earthquakes span a time period of 338,117 hours, which works out to an average of about 0.2 earthquakes (mag 5 or greater) per hour, or roughly a 20% chance during any given hour. For magnitude 6 or greater, the chance during any hour drops to only 1.6%. By the time you get to magnitude 7 or greater, it&#8217;s much less than 1% per hour.</p>
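<p>For anyone who wants to check this arithmetic, here is a minimal Python sketch using only the two figures quoted above. The Poisson step is an assumption added for illustration (real earthquakes cluster into aftershock sequences), and the function names are hypothetical, not part of the original analysis.</p>

```python
import math

# Figures quoted in the post.
N_QUAKES = 66_725      # magnitude 5+ earthquakes, 1973 to mid-2011
SPAN_HOURS = 338_117   # hours covered by the catalog


def hourly_rate(n_events: int, span_hours: int) -> float:
    """Average number of events per hour over the whole span."""
    return n_events / span_hours


def prob_at_least_one(rate_per_hour: float) -> float:
    """Chance of one or more events in a given hour.

    Assumes events arrive independently at a constant rate (Poisson).
    """
    return 1.0 - math.exp(-rate_per_hour)


rate = hourly_rate(N_QUAKES, SPAN_HOURS)
print(f"average rate: {rate:.3f} quakes per hour")
print(f"P(at least one quake in an hour): {prob_at_least_one(rate):.3f}")
```

<p>Under that assumption, the chance of at least one earthquake in an hour comes out slightly below the raw ratio, because some hours contain more than one event.</p>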
<p>The next step was to calculate the right ascension of the sun and moon and translate the longitudinal position of each earthquake into right ascension to match. The illustration below shows what I&#8217;m describing.</p>
<p><a href="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/12/longitude-sketch-small.png"><img class="aligncenter size-full wp-image-445" title="Map of world earthquakes magnitude 5 or greater" src="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/12/longitude-sketch-small.png" alt="" width="500" height="360" /></a></p>
<p>The important thing to note is that the latitude of the sun, moon and earthquake can all be different and I am calculating the difference with respect solely to longitude. This is because I&#8217;m wondering about the most earthquake-prone time of day so the longitudes are the relevant quantities rather than the latitudes.</p>
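<p>One common way to translate a longitude and a UTC time into a right ascension is local sidereal time, which gives the right ascension currently crossing the meridian at that longitude. The sketch below uses a widely quoted linear approximation for Greenwich mean sidereal time. It is an illustration under that assumption and not necessarily how the numbers in this post were produced; the event time and longitude are made up.</p>

```python
from datetime import datetime, timezone

# Reference epoch J2000.0 (2000-01-01 12:00), treated here as UTC.
J2000 = datetime(2000, 1, 1, 12, 0, 0, tzinfo=timezone.utc)


def gmst_hours(when_utc: datetime) -> float:
    """Approximate Greenwich mean sidereal time in hours, from 0 up to 24.

    `when_utc` must be a timezone-aware datetime.
    """
    days = (when_utc - J2000).total_seconds() / 86400.0
    return (18.697374558 + 24.06570982441908 * days) % 24.0


def meridian_ra_hours(when_utc: datetime, lon_east_deg: float) -> float:
    """Right ascension (hours) on the local meridian at this longitude."""
    return (gmst_hours(when_utc) + lon_east_deg / 15.0) % 24.0


# Hypothetical event, not taken from the USGS catalog.
quake_time = datetime(2011, 3, 1, 6, 30, tzinfo=timezone.utc)
quake_lon = -122.0  # degrees east, so a negative value means west
print(round(meridian_ra_hours(quake_time, quake_lon), 2))
```

<p>Dividing the longitude by 15 converts degrees to hours, since the Earth turns through 15 degrees of longitude per hour.</p>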
<p>After these coordinates had been calculated for all 66,725 earthquakes, I found the difference of the position of the sun/moon with respect to each earthquake in terms of right ascension. Following that, I grouped the differences by &#8220;relative hour&#8221; (by which I mean the relative position as described above) and graphed the resulting histogram.</p>
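<p>The grouping step can be sketched in a few lines of Python. The sign convention (body minus earthquake) and the sample values are assumptions chosen for illustration; the post does not say which convention was used.</p>

```python
from collections import Counter


def relative_hour(ra_body_hours: float, ra_quake_hours: float) -> float:
    """Right-ascension difference wrapped into the range 0 to 24 hours."""
    return (ra_body_hours - ra_quake_hours) % 24.0


def histogram_by_hour(pairs):
    """Count events in each of 24 one-hour bins.

    `pairs` is an iterable of (ra_body_hours, ra_quake_hours) tuples.
    The extra `% 24` guards against a wrapped value rounding up to 24.0.
    """
    counts = Counter(int(relative_hour(b, q)) % 24 for b, q in pairs)
    return [counts.get(hour, 0) for hour in range(24)]


# Hypothetical (body RA, earthquake RA) pairs in hours.
sample = [(1.0, 23.5), (12.0, 0.0), (0.0, 0.5)]
hist = histogram_by_hour(sample)
print(hist)
```

<p>If earthquakes ignored the sun and moon entirely, each of the 24 bins would be expected to hold about the same count.</p>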
<h3><strong>Position of the Sun vs. Earthquake Time</strong></h3>
<p>The histogram of the earthquakes relative to the position of the sun looked like this.</p>
<p><a href="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/12/RA-lag-for-sun-smaller.png"><img class="aligncenter size-full wp-image-439" title="Histogram showing sun position relative to accumulated earthquakes" src="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/12/RA-lag-for-sun-smaller.png" alt="" width="500" height="365" /></a></p>
<p>The blue bars show what the histogram should look like if there were an equal probability of having an earthquake regardless of the position of the sun, and the red bars show the actual data. You can see the bars are very close in size. There&#8217;s a small peak around the 17th hour. But is it statistically significant?</p>
<p><a href="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/12/showing-RA-lag-for-sun-small.png"><img class="aligncenter size-full wp-image-437" title="defining the RA lag for sun with respect to an earthquake" src="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/12/showing-RA-lag-for-sun-small.png" alt="" width="381" height="282" /></a></p>
<p>The mean number of earthquakes during any given relative hour was 2780, with a standard deviation of +/- 60 earthquakes. The number of earthquakes in the 17th relative hour was 2902, just outside two standard deviations (2780 + 2 &#215; 60 = 2900), which makes it an outlier by 2 earthquakes. Not a strong outlier! In fact, the p-value using the Watson U-squared test is 0.48, nowhere near the usual 0.05 threshold for a significant result. Translation: not publishable!</p>
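<p>The numbers in this paragraph can be put in context with a short Python sketch. It computes how many standard deviations the 17th-hour count sits from the mean, the bin-to-bin scatter expected if all 24 hours were equally likely, and the chance that at least one of 24 bins lands that far out by luck alone. Treating the bins as independent and normally distributed is a simplifying assumption, and this is not the Watson U-squared test used above.</p>

```python
import math

# Figures quoted in the post.
N_QUAKES = 66_725
N_BINS = 24
MEAN_PER_BIN = 2780
STD_PER_BIN = 60
PEAK_COUNT = 2902  # 17th relative hour


def z_score(count: float, mean: float, std: float) -> float:
    """Distance of a bin count from the mean, in standard deviations."""
    return (count - mean) / std


def uniform_bin_std(n_events: int, n_bins: int) -> float:
    """Scatter of one bin count if every bin is equally likely (binomial)."""
    p = 1.0 / n_bins
    return math.sqrt(n_events * p * (1.0 - p))


def chance_some_bin_exceeds(z: float, n_bins: int) -> float:
    """Chance that at least one bin lies more than z standard deviations
    from the mean (either side), assuming independent normal bins."""
    p_single = math.erfc(z / math.sqrt(2.0))
    return 1.0 - (1.0 - p_single) ** n_bins


z = z_score(PEAK_COUNT, MEAN_PER_BIN, STD_PER_BIN)
print(f"z-score of the peak: {z:.2f}")
print(f"expected scatter per bin: {uniform_bin_std(N_QUAKES, N_BINS):.1f}")
print(f"chance of such a bin among 24: {chance_some_bin_exceeds(z, N_BINS):.2f}")
```

<p>Under these assumptions the expected scatter is about 52 earthquakes per bin, and the chance of seeing at least one bin this far from the mean is well over half, which is consistent with the peak being noise.</p>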
<h3><strong>Position of the Moon vs. Earthquake Time</strong></h3>
<p>Since it is the moon, not the sun, that is primarily responsible for sweeping the ocean tides around the Earth, perhaps I was looking at the wrong body. (The sun does raise a smaller, secondary ocean tide, but its magnitude pales in comparison to the moon&#8217;s effect.) The time of day has nothing to do with the moon&#8217;s position, so if there turned out to be a correlation you&#8217;d need to consult moon charts every day!</p>
<p>I redid my previous analysis, grouping earthquakes by position relative to the moon. The following histogram was the result:</p>
<p><a href="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/12/RA-lag-for-moon-smaller.png"><img class="aligncenter size-full wp-image-440" title="RA lag for moon" src="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/12/RA-lag-for-moon-smaller.png" alt="" width="500" height="341" /></a></p>
<p>As is easy to see by eye, the deviation of the data from the mean is even smaller this time (+/- 53). In fact, no points fell outside two standard deviations, and the p-value was 0.33. Again, not statistically significant!</p>
<h3><strong>Conclusion</strong></h3>
<p>In short, there&#8217;s no particular hour to be extra wary of earthquakes. Unfortunately, I&#8217;ll just have to settle for the USGS&#8217;s &#8220;sometime in the next 30 years&#8221;.</p>
<h6>Data extracted from:</h6>
<ul>
<li>Earthquake data from the USGS website, which lists earthquakes worldwide by year: <a href="http://earthquake.usgs.gov/earthquakes/eqarchives/year/">http://earthquake.usgs.gov/earthquakes/eqarchives/year/</a></li>
<li>To specify exactly which earthquakes you are interested in, from which database, etc. (this is where the dataset came from): <a href="http://earthquake.usgs.gov/earthquakes/eqarchives/epic/epic_global.php">http://earthquake.usgs.gov/earthquakes/eqarchives/epic/epic_global.php</a></li>
<li>Planet positions calculated via WolframAlpha: <a href="http://www.wolframalpha.com/">http://www.wolframalpha.com/</a></li>
</ul>
<h6>References:</h6>
<div>
<ul>
<li>General information on the Earth: <a href="http://en.wikipedia.org/wiki/Earth">http://en.wikipedia.org/wiki/Earth</a></li>
</ul>
</div>
<p style="text-align: right;"><em><strong>-Lyndie Chiou</strong></em></p>]]></content:encoded>
			<wfw:commentRss>http://www.researchpipeline.com/wordpress/2011/12/08/earthquakes-vs-time-of-day/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CDC Flu App Challenge</title>
		<link>http://www.researchpipeline.com/wordpress/2011/04/29/cdc-flu-app-challenge/</link>
		<comments>http://www.researchpipeline.com/wordpress/2011/04/29/cdc-flu-app-challenge/#comments</comments>
		<pubDate>Fri, 29 Apr 2011 18:48:29 +0000</pubDate>
		<dc:creator>Lyndie Chiou</dc:creator>
				<category><![CDATA[data visualization]]></category>
		<category><![CDATA[datasets]]></category>

		<guid isPermaLink="false">http://www.researchpipeline.com/wordpress/?p=384</guid>
		<description><![CDATA[
The CDC is getting into apps. They are challenging developers to design an app to fight the flu using mashups with their datasets as well as other publicly available datasets. Cash prizes! It seems like they are looking for nice ways to present and interact with flu data that will keep people educated about the [...]]]></description>
			<content:encoded><![CDATA[
<p>The CDC is getting into apps. They are <a href="http://fluapp.challenge.gov/">challenging developers</a> to design an app to fight the flu using mashups of their datasets and other publicly available datasets. Cash prizes! It seems like they are looking for nice ways to present and interact with flu data that will keep people educated about the flu year-round. You can design apps for just about any platform, including websites and mobile phones. The deadline is May 27, 2011.</p>
<p><a href="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/04/CDC_challenge.png"><img class="size-full wp-image-385 aligncenter" title="CDC_challenge" src="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/04/CDC_challenge.png" alt="" width="399" height="330" /></a></p>
<p style="text-align: right;"><em><strong>-Lyndie Chiou</strong></em></p>]]></content:encoded>
			<wfw:commentRss>http://www.researchpipeline.com/wordpress/2011/04/29/cdc-flu-app-challenge/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Economic Recovery in Red vs Blue States</title>
		<link>http://www.researchpipeline.com/wordpress/2011/04/21/economic-recovery-in-red-vs-blue-states/</link>
		<comments>http://www.researchpipeline.com/wordpress/2011/04/21/economic-recovery-in-red-vs-blue-states/#comments</comments>
		<pubDate>Fri, 22 Apr 2011 03:12:17 +0000</pubDate>
		<dc:creator>Lyndie Chiou</dc:creator>
				<category><![CDATA[data visualization]]></category>
		<category><![CDATA[datasets]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[politics]]></category>

		<guid isPermaLink="false">http://www.researchpipeline.com/wordpress/?p=301</guid>
		<description><![CDATA[
I was chatting with my Dad recently and he brought up a debate he&#8217;d heard on the radio between a Republican and Democratic candidate. The Republican candidate said that in our present-day recession economy, Republican states were better off than Democratic states. My Dad seemed to particularly relish how the Democratic candidate scrambled to defend his [...]]]></description>
			<content:encoded><![CDATA[
<p style="text-align: left;"><a href="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/04/republican-vs-democrat.jpg"><img class="alignleft size-full wp-image-354" title="republican-vs-democrat" src="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/04/republican-vs-democrat.jpg" alt="" width="225" height="194" /></a>I was chatting with my Dad recently and he brought up a debate he&#8217;d heard on the radio between a Republican and Democratic candidate.</p>
<p style="text-align: left;">The Republican candidate said that in our present-day recession economy, Republican states were better off than Democratic states. My Dad seemed to particularly relish how the Democratic candidate scrambled to defend his party but didn&#8217;t contradict anything the Republican guy was saying.</p>
<p style="text-align: left;">Politicians are known for saying anything to win elections. Is it really true that Republicans manage the economies of their states better?</p>
<p style="text-align: left;">I found a treasure trove of state political data on <a href="http://en.wikipedia.org/wiki/Red_states_and_blue_states" target="_blank">Wikipedia</a>. I also found information on unemployment for the month of February 2011 at the website for the <a href="http://www.bls.gov/" target="_blank">US Bureau of Labor Statistics</a>. And finally, I was able to get an estimate of the budget gap per head via <a href="http://www.cbpp.org/cms/?fa=view&amp;id=711" target="_blank">The Center on Budget and Policy Priorities</a>.</p>
<p style="text-align: left;">And so I put everything together into a <a href="http://www.researchpipeline.com/mediawiki/index.php?title=Republican_and_Democratic_Economic_Data,_Feb_2011" target="_blank">spreadsheet</a> and stared at the data.</p>
<p style="text-align: left;">One assumption I&#8217;d always had (thanks to President Franklin D. Roosevelt&#8217;s New Deal) was that government spending was the best way to keep the country afloat during a recession. My data allowed me to plot unemployment vs. the state budget gap per person. Each dot in the graph below represents a state. The data is obviously very noisy, but there appeared to be a correlation between higher state budget gaps and greater unemployment. You could fit a line to this data, but the correlation was weak (only ~0.24).</p>
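<p style="text-align: left;">To make the idea of a weak correlation concrete, here is a minimal Python sketch of a Pearson correlation and a least-squares line. The data values are invented placeholders, not the figures from the spreadsheet, and the post does not say which correlation measure was used. If the ~0.24 above is Pearson&#8217;s r, then r squared is about 0.06, meaning the fitted line would account for only around 6% of the variation in unemployment.</p>

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def fit_line(xs, ys):
    """Least-squares slope and intercept for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Hypothetical states: budget gap in dollars per head, unemployment in percent.
budget_gap = [0, 150, 300, 450, 600, 900]
unemployment = [7.5, 9.0, 6.8, 10.2, 8.1, 9.4]

r = pearson_r(budget_gap, unemployment)
slope, intercept = fit_line(budget_gap, unemployment)
print(f"r = {r:.2f}, r squared = {r * r:.2f}")
print(f"fitted line: unemployment = {slope:.4f} * gap + {intercept:.2f}")
```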
<p><a href="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/04/Unemployment-vs-Budget-Gap-Govs_small.png"><img class="aligncenter size-full wp-image-332" title="Unemployment vs Budget Gap Showing Political Party of Governor" src="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/04/Unemployment-vs-Budget-Gap-Govs_small.png" alt="" width="450" height="295" /></a></p>
<p style="text-align: left;">The colors of the dots in this graph also show whether the state governor was a Republican or Democrat (if the governor&#8217;s party changed because of the November 2010 election, I went with the previous governor).</p>
<p style="text-align: left;">One might also conclude from this graph that Republican governors ran up larger state budget gaps and had higher unemployment than Democratic governors. Just for the record, the state on the extreme right is Alaska, home of Sarah Palin. The state with the highest unemployment is Nevada.</p>
<p style="text-align: left;">Not wishing to jump to conclusions, we can use another metric to categorize the &#8220;Republican-ness&#8221; of a state: the ratio of Republicans to Democrats in the upper and lower state legislatures.</p>
<p style="text-align: left;">It turned out that the <strong>lower</strong> legislatures were all majority Democratic. That was surprising! In the chart below, red dots are more heavily Republican legislatures and blue dots are more heavily Democratic. Shades of purple show the degree of mixture. You can see the dots are all blue and shades of purple.</p>
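<p style="text-align: left;">A red-to-blue shading like the one described above could be produced by blending the two colors in proportion to the Republican share of seats. This Python sketch shows one simple linear blend; the function name and the blend itself are assumptions for illustration, not the charting tool&#8217;s actual color scale.</p>

```python
def party_color(rep_fraction: float) -> str:
    """Map a Republican seat share (0.0 to 1.0) to a hex color.

    0.0 gives pure blue, 1.0 gives pure red, values between give purples.
    """
    if not 0.0 <= rep_fraction <= 1.0:
        raise ValueError("rep_fraction must be between 0 and 1")
    red = round(255 * rep_fraction)
    blue = round(255 * (1.0 - rep_fraction))
    return f"#{red:02x}00{blue:02x}"


for share in (0.0, 0.5, 1.0):
    print(share, party_color(share))
```

<p style="text-align: left;">An evenly split chamber comes out as a mid purple, which is why a chart with no red dots at all signals that no lower legislature had a Republican majority.</p>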
<p><a href="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/04/Unemployment-vs-Budget-Gap-Lower-Leg_small.png"><img class="aligncenter size-full wp-image-350" title="Unemployment vs Budget Gap in the Lower Legislature" src="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/04/Unemployment-vs-Budget-Gap-Lower-Leg_small.png" alt="" width="450" height="335" /></a></p>
<p style="text-align: left;">On the other hand, the <strong>upper</strong> legislatures in the states varied between majority Republican and majority Democratic.</p>
<p style="text-align: left;"><a href="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/04/Upper-Legislature-vs-Budget-Gap_small.png"><img class="aligncenter size-full wp-image-330" title="Percent Republicans in Upper Legislature vs Budget Gap" src="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/04/Upper-Legislature-vs-Budget-Gap_small.png" alt="" width="450" height="328" /></a>The colors of the dots in this image now reflect the ratio of Republicans vs. Democrats in the <strong>upper</strong> legislatures. I&#8217;ve added threshold lines showing the highest value of the budget gap associated with each party. Clearly, the Democratic legislatures had larger budget gaps, but only narrowly. What struck me as a stronger relationship here was that purple, or split states, actually had the highest budget gap/person.</p>
<p style="text-align: left;">In fact, if I rotate the figure and fit a Gaussian bounding the outer edges&#8230;</p>
<p><a href="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/04/Budget-Gap-vs-Republican-Percent-Upper-Legislature_small.png"><img class="aligncenter size-full wp-image-348" title="Budget Gap vs Percentage Republicans in Upper Legislature" src="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/04/Budget-Gap-vs-Republican-Percent-Upper-Legislature_small.png" alt="" width="450" height="326" /></a></p>
<p style="text-align: left;">The higher the budget gap, the more mixed the legislature. Note that the reverse was not true: a more mixed legislature did not necessarily imply that the budget gap was larger. In fact, there were several purple states very close to and even on top of the $0/head mark. Perhaps a graph of the standard deviation would also be enlightening:</p>
<p><a href="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/04/Std-Budget-Gap-vs-Republican-Percent-Upper-Legislature_small.png"><img class="aligncenter size-full wp-image-349" title="Std Budget Gap vs Percent Republicans in Upper Legislature " src="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/04/Std-Budget-Gap-vs-Republican-Percent-Upper-Legislature_small.png" alt="" width="450" height="326" /></a></p>
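<p style="text-align: left;">One way to build a standard-deviation graph like this is to group the states into bands by Republican percentage and compute the spread of the budget gap within each band. The Python sketch below does that with invented numbers; the 20-point band width and the use of the population standard deviation are assumptions, since the post does not describe how its graph was computed.</p>

```python
from collections import defaultdict
from statistics import pstdev


def std_by_bin(points, bin_width=20):
    """Standard deviation of budget gap within each Republican-share band.

    `points` is an iterable of (republican_percent, budget_gap_per_head).
    Returns a dict mapping each band's lower edge to its spread.
    """
    groups = defaultdict(list)
    for rep_pct, gap in points:
        # Put exactly 100% into the top band instead of a band of its own.
        start = min(int(rep_pct // bin_width) * bin_width, 100 - bin_width)
        groups[start].append(gap)
    return {start: pstdev(gaps) for start, gaps in sorted(groups.items())}


# Hypothetical states: (percent Republican in upper house, gap in $/head).
states = [(10, 100), (15, 300), (50, 0), (55, 1800), (100, 500)]
print(std_by_bin(states))
```

<p style="text-align: left;">A band with a large spread contains both very small and very large budget gaps, which is the pattern described for the purple states.</p>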
<p style="text-align: left;">Is it obvious that the more homogeneous the legislature, the more fiscally responsible its actions? Democrats spend more, but also tax more. Republicans tax less, but also spend less. An even mixture of the two bodies can lead to the extremes of spending less and taxing more (the $0/head Montana) or spending more and taxing less (the $1830/head Alaska).</p>
<p style="text-align: left;">It seems clear that majority Republican states are not better off than majority Democratic states. <strong>But&#8230;</strong> both the Republican and Democratic candidates could have gotten away with claiming that their states were better off than (some of) the purple states!</p>
<p style="text-align: left;">For your enjoyment, I uploaded my spreadsheet of state Republican vs. Democrat data to this website&#8217;s wiki: <a href="http://www.researchpipeline.com/mediawiki/index.php?title=Republican_and_Democratic_Economic_Data,_Feb_2011" target="_blank">Republican and Democratic Economic Data, Feb 2011</a>.</p>
<p style="text-align: left;">Footnotes:</p>
<p style="text-align: left;">Data extracted and combined from:</p>
<ul>
<li style="text-align: left;">The Bureau of Labor Statistics: <a href="http://www.bls.gov/">http://www.bls.gov/</a></li>
<li style="text-align: left;">Wikipedia&#8217;s map of red/blue states and data: <a href="http://en.wikipedia.org/wiki/Red_states_and_blue_states">http://en.wikipedia.org/wiki/Red_states_and_blue_states</a></li>
<li style="text-align: left;"><a href="http://www.cbpp.org/cms/?fa=view&amp;id=711" target="_blank">The Center on Budget and Policy Priorities</a></li>
</ul>
<p style="text-align: right;"><em><strong>-Lyndie Chiou</strong></em></p>]]></content:encoded>
			<wfw:commentRss>http://www.researchpipeline.com/wordpress/2011/04/21/economic-recovery-in-red-vs-blue-states/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Ratio of Republicans in State Upper Legislatures</title>
		<link>http://www.researchpipeline.com/wordpress/2011/04/21/ratio-of-republicans-in-state-upper-legislatures/</link>
		<comments>http://www.researchpipeline.com/wordpress/2011/04/21/ratio-of-republicans-in-state-upper-legislatures/#comments</comments>
		<pubDate>Fri, 22 Apr 2011 03:11:28 +0000</pubDate>
		<dc:creator>Lyndie Chiou</dc:creator>
				<category><![CDATA[data visualization]]></category>
		<category><![CDATA[datasets]]></category>
		<category><![CDATA[hosted data]]></category>
		<category><![CDATA[politics]]></category>

		<guid isPermaLink="false">http://www.researchpipeline.com/wordpress/?p=333</guid>
		<description><![CDATA[
Here&#8217;s an interesting relationship. The graph shows the percentage Republicans in the states Lower Legislatures versus the Upper Legislatures. It would appear that if you&#8217;re a Republican, you have the best chances of winning an election in the lower house in states where the upper house is split. You will have a hard time in [...]]]></description>
			<content:encoded><![CDATA[
<p>Here&#8217;s an interesting relationship. The graph shows the percentage of Republicans in the states&#8217; lower legislatures versus their upper legislatures.</p>
<p><a href="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/04/Lower-vs-Upper-Legislatures_small.png"><img class="aligncenter" title="Percentage Republican in Lower Legislature vs Upper Legislatures" src="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/04/Lower-vs-Upper-Legislatures_small.png" alt="" width="450" height="313" /></a></p>
<p>It would appear that if you&#8217;re a Republican, you have the best chance of winning a lower-house election in states where the upper house is split. You will have a hard time in states where the upper house is either mostly Republican or mostly Democratic.</p>
<p>Go figure&#8230;</p>
<p>The data for this spreadsheet has been uploaded to this website&#8217;s wiki: <a href="http://www.researchpipeline.com/mediawiki/index.php?title=Republican_and_Democratic_Economic_Data,_Feb_2011" target="_blank">Republican and Democratic Economic Data, Feb 2011</a></p>
<p style="text-align: right;"><em><strong>-Lyndie Chiou</strong></em></p>]]></content:encoded>
			<wfw:commentRss>http://www.researchpipeline.com/wordpress/2011/04/21/ratio-of-republicans-in-state-upper-legislatures/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Tax Day Visualization Contest</title>
		<link>http://www.researchpipeline.com/wordpress/2011/03/16/tax-day-visualization-contest/</link>
		<comments>http://www.researchpipeline.com/wordpress/2011/03/16/tax-day-visualization-contest/#comments</comments>
		<pubDate>Wed, 16 Mar 2011 22:55:09 +0000</pubDate>
		<dc:creator>Lyndie Chiou</dc:creator>
				<category><![CDATA[data visualization]]></category>
		<category><![CDATA[contests]]></category>

		<guid isPermaLink="false">http://www.researchpipeline.com/wordpress/?p=247</guid>
		<description><![CDATA[
Always a sucker for contests! There&#8217;s a tax day visualization contest on datavizchallenge.org. The challenge is to visualize how politicians are spending your tax dollars. The deadline for submission is tax day, April 18th, 2011. The website providing the details, whatwepayfor.com, has done a nice job providing the data and an interface for extracting slices [...]]]></description>
			<content:encoded><![CDATA[
<p>Always a sucker for contests!</p>
<p>There&#8217;s a tax day visualization contest on <a href="http://www.datavizchallenge.org/challenge" target="_blank">datavizchallenge.org</a>. The challenge is to visualize how politicians are spending your tax dollars. The deadline for submission is tax day, April 18th, 2011.</p>
<p>The website providing the details, whatwepayfor.com, has done a nice job of providing the data and an interface for extracting slices of data in real time via <a href="http://www.datavizchallenge.org/using-api" target="_blank">HTTP URLs</a>.</p>
<p>First prize wins $5000 and another $5000 goes to discretionary awards&#8230; Good luck!</p>
<p><a href="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/03/taxdayviz.png"><img class="size-full wp-image-279 aligncenter" title="taxdayviz" src="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/03/taxdayviz.png" alt="" width="350" height="317" /></a></p>
<p style="text-align: right;"><em><strong>-Lyndie Chiou</strong></em></p>]]></content:encoded>
			<wfw:commentRss>http://www.researchpipeline.com/wordpress/2011/03/16/tax-day-visualization-contest/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>International Women&#8217;s Day</title>
		<link>http://www.researchpipeline.com/wordpress/2011/03/08/international-womens-day/</link>
		<comments>http://www.researchpipeline.com/wordpress/2011/03/08/international-womens-day/#comments</comments>
		<pubDate>Wed, 09 Mar 2011 01:04:57 +0000</pubDate>
		<dc:creator>Lyndie Chiou</dc:creator>
				<category><![CDATA[data visualization]]></category>
		<category><![CDATA[datasets]]></category>
		<category><![CDATA[Google]]></category>

		<guid isPermaLink="false">http://www.researchpipeline.com/wordpress/?p=253</guid>
		<description><![CDATA[
Thanks to Google&#8217;s ngrams project page I have wasted my scarce spare hours looking at micro trends in literature. A couple of months ago, the Google ngrams project presented a database of all the words from Google&#8217;s extensive book collection. Making the books freely available presents copyright issues, but a database of word frequency in [...]]]></description>
			<content:encoded><![CDATA[
<p>Thanks to Google&#8217;s <a href="http://ngrams.googlelabs.com/" target="_blank">ngrams project</a> page, I have wasted my scarce spare hours looking at micro-trends in literature. A couple of months ago, the Google ngrams project presented a database of all the words from Google&#8217;s extensive book collection. Making the books freely available presents copyright issues, but a database of word frequency in a collection of books is legal. They even created a simple graphing tool so you can play with the data, or you can download the entire dataset for your own purposes. Micro-trends in literature might not sound very exciting, but once I started trying words, it became an addictive tool for trying to prove my zany cultural theories.</p>
<p>One graph seemed very appropriate for today, International Women&#8217;s Day.  I plotted the words &#8220;men&#8221;, &#8220;women&#8221; and &#8220;children&#8221; versus time. And look!</p>
<p><span style="color: #0000ff;">&#8220;men&#8221; in blue</span>, <span style="color: #ff0000;">&#8220;women&#8221; in red</span>, <span style="color: #00ff00;">&#8220;children&#8221; in green</span></p>
<p><a href="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/03/womendaygraph.png"><img class="size-medium wp-image-257 aligncenter" title="womendaygraph" src="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/03/womendaygraph-300x112.png" alt="" width="300" height="112" /></a></p>
<p>The years range from 1800 to 2008, and you can see clearly that the word &#8220;men&#8221; (the blue line) rules by a long shot up until about 1920. To be fair, &#8220;men&#8221; can be used in a generalized sense to mean both men and women, similar to the word &#8220;mankind&#8221;. Since there&#8217;s no context, I can&#8217;t tell what percentage of the occurrences actually refers to both sexes.</p>
<p>But the interesting part of the graph is the uptick in the usage of &#8220;women&#8221; starting during the era of 1960s feminism. Even more interesting, &#8220;women&#8221; overtakes &#8220;men&#8221; in the mid-1990s.</p>
<p>Shortly afterwards, &#8220;women&#8221; decreases and &#8220;men&#8221; once again rules. A decline in feminism? Or perhaps the bubble in the 1990s was due to the peak in so-called chick-lit, which has since gone out of favor. To provide a cultural reference point, <em>Bridget Jones&#8217;s Diary</em>, the epitome of chick-lit, came out in 1996.</p>
<p>&#8220;Children&#8221; seems to show a steady increase all the way from the 1800s to the present day. The slow rate of increase in the word &#8220;children&#8221; surprises me, since there&#8217;s been an explosion of children&#8217;s books since the days of Beatrix Potter. Perhaps Google has disdained uploading children&#8217;s literature into its database? I also tried the words &#8220;boy&#8221; and &#8220;girl&#8221;, and they show a lower percentage of usage than &#8220;children&#8221;:</p>
<p><span style="color: #3366ff;">&#8220;children&#8221; in blue</span>,<span style="color: #ff0000;"> &#8220;boy&#8221; in red</span>, <span style="color: #00ff00;">&#8220;girl&#8221; in green</span></p>
<p><a href="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/03/childrengraph.png"><img class="size-medium wp-image-258 aligncenter" title="childrengraph" src="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/03/childrengraph-300x111.png" alt="" width="300" height="111" /></a></p>
<p>Happy International Women&#8217;s Day.</p>
<p style="text-align: right;"><em><strong>-Lyndie Chiou</strong></em></p>]]></content:encoded>
			<wfw:commentRss>http://www.researchpipeline.com/wordpress/2011/03/08/international-womens-day/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google Brings Data Back</title>
		<link>http://www.researchpipeline.com/wordpress/2011/02/18/google-brings-data-back/</link>
		<comments>http://www.researchpipeline.com/wordpress/2011/02/18/google-brings-data-back/#comments</comments>
		<pubDate>Fri, 18 Feb 2011 09:25:14 +0000</pubDate>
		<dc:creator>Lyndie Chiou</dc:creator>
				<category><![CDATA[data hosting]]></category>
		<category><![CDATA[data visualization]]></category>
		<category><![CDATA[datasets]]></category>

		<guid isPermaLink="false">http://www.researchpipeline.com/wordpress/?p=220</guid>
		<description><![CDATA[
If you follow my blog (wink!) you&#8217;ll recall that I was surprised that Google cancelled its data hosting service, Palimpsest. Well, they&#8217;ve brought it back big time, albeit under a new moniker. The Google Public Data Explorer was announced yesterday. You can upload any dataset you like, so long as you format the data using [...]]]></description>
			<content:encoded><![CDATA[
<p>If you follow my blog (wink!) you&#8217;ll recall that I was surprised that <a href="http://www.researchpipeline.com/wordpress/2008/12/28/googles-palimpset-cancelled/" target="_blank">Google cancelled its data hosting service, Palimpsest</a>. Well, they&#8217;ve brought it back big time, albeit under a new moniker. The <a href="http://www.google.com/publicdata/home" target="_blank">Google Public Data Explorer</a> was announced yesterday. You can upload any dataset you like, so long as you format the data using DSPL (Dataset Publishing Language), which is based on XML. The hosting service is totally free.</p>
<p>They&#8217;ve also got a service called <a href="http://datawiki.googlelabs.com/" target="_blank">DataWiki</a> listed in their Google Labs section, which allows users to upload &#8220;structured&#8221; data. I&#8217;m not sure how these two services differ or whether Google really needs both. But at least duplication is far better than the situation in 2009 when they cancelled the one data-hosting service they ran.</p>
<p>In addition to the data hosting, there is a <a href="http://www.google.com/publicdata/home" target="_blank">set of tools</a> which can be used to display the data. Right now, the home page shows a graphic for lifespan vs. number of offspring by geographical region. The graphic includes a cool slider which shows the fertility bubbles jumping around as time progresses.</p>
<p><iframe width="400" height="325" frameborder="0" scrolling="no" marginwidth="0" marginheight="0" src="http://www.google.com/publicdata/explore/embed?ds=d5bncppjof8f9_&amp;ctype=b&amp;strail=false&amp;nselm=s&amp;met_x=sp_dyn_le00_in&amp;scale_x=lin&amp;ind_x=false&amp;met_y=sp_dyn_tfrt_in&amp;scale_y=lin&amp;ind_y=false&amp;met_s=sp_pop_totl&amp;scale_s=lin&amp;ind_s=false&amp;dimp_c=country:region&amp;ifdim=country&amp;pit=1199145600000&amp;hl=en&amp;dl=en_US&amp;iconSize=0.5&amp;uniSize=0.035"></iframe></p>
<p>I&#8217;m guessing this is a sign that Google is returning to its research roots and taking a step back from the profit juggernaut it has turned into.</p>
<p>A related snippet of news that I came across was the announcement by Intel that it is starting a center at Stanford devoted to visual computing. And GE recently came out with <a href="http://www.visualizing.org" target="_blank">visualizing.org</a>, a website devoted to data visualization which also incorporates other elements such as contests. I noticed they have a <a href="http://www.visualizing.org/contests/teeb-visualizing-value-nature-challenge" target="_blank">contest for visualizing eco data</a> with a $5,000 prize! Their ads have been appearing everywhere, including on this blog.</p>
<p>&nbsp;</p>
<p style="text-align: right;"><em><strong>-Lyndie Chiou</strong></em></p>]]></content:encoded>
			<wfw:commentRss>http://www.researchpipeline.com/wordpress/2011/02/18/google-brings-data-back/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Visualize word frequency</title>
		<link>http://www.researchpipeline.com/wordpress/2011/02/15/visualize-word-freqency/</link>
		<comments>http://www.researchpipeline.com/wordpress/2011/02/15/visualize-word-freqency/#comments</comments>
		<pubDate>Tue, 15 Feb 2011 09:15:43 +0000</pubDate>
		<dc:creator>Lyndie Chiou</dc:creator>
				<category><![CDATA[data visualization]]></category>

		<guid isPermaLink="false">http://www.researchpipeline.com/wordpress/?p=133</guid>
		<description><![CDATA[
I came across a visualization website that can transform a blog&#8217;s text (or the text of any URL) into a visual display. Wordle.net takes a text and churns out a graphic wherein each word is sized according to frequency. Then it arranges all the words together in a vaguely oval shape which describes how big [...]]]></description>
			<content:encoded><![CDATA[
<p style="text-align: left;">I came across a visualization website that can transform a blog&#8217;s text (or the text of any URL) into a visual display. <a href="http://Wordle.net" target="_blank">Wordle.net</a> takes a text and churns out a graphic wherein each word is sized according to frequency. Then it arranges all the words together in a vaguely oval shape which describes how big the dataset is. The squarer the picture, the greater the number of words. Here&#8217;s the wordle for this blog: <a href="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/02/wordle.png"><img class="aligncenter size-medium wp-image-211" title="wordle" src="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/02/wordle-300x194.png" alt="" width="300" height="194" /></a>&#8230;and just for fun, here&#8217;s the wordle for the first page of the wiki:</p>
<p style="text-align: center;"><a href="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/02/wordle_wiki2.png"><img class="aligncenter size-medium wp-image-214" title="wordle_wiki" src="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/02/wordle_wiki2-300x196.png" alt="" width="300" height="196" /></a></p>
<p style="text-align: right;"><em><strong>-Lyndie Chiou</strong></em></p>]]></content:encoded>
			<wfw:commentRss>http://www.researchpipeline.com/wordpress/2011/02/15/visualize-word-freqency/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

