<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Research Pipeline Blog &#187; Google</title>
	<atom:link href="http://www.researchpipeline.com/wordpress/tag/google/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.researchpipeline.com/wordpress</link>
	<description>Notes about data and analysis</description>
	<lastBuildDate>Tue, 20 Dec 2011 04:35:15 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>International Women&#8217;s Day</title>
		<link>http://www.researchpipeline.com/wordpress/2011/03/08/international-womens-day/</link>
		<comments>http://www.researchpipeline.com/wordpress/2011/03/08/international-womens-day/#comments</comments>
		<pubDate>Wed, 09 Mar 2011 01:04:57 +0000</pubDate>
		<dc:creator>Lyndie Chiou</dc:creator>
				<category><![CDATA[data visualization]]></category>
		<category><![CDATA[datasets]]></category>
		<category><![CDATA[Google]]></category>

		<guid isPermaLink="false">http://www.researchpipeline.com/wordpress/?p=253</guid>
		<description><![CDATA[<style type="text/css">
#leftcontainerBox {
float:left;
position: fixed;
top: 60%;
left: 70px;
}

#leftcontainerBox .buttons {
float:left;
clear:both;
margin:4px 4px 4px 4px;

padding-bottom:2px;
}


#bottomcontainerBox {
height: 30px;
width:50%;
padding-top:1px;
}

#bottomcontainerBox .buttons {
float:left;
height: 30px;
margin:4px 4px 4px 4px;
}

</style>
Thanks to Google&#8217;s ngrams project page I have wasted my scarce spare hours looking at micro trends in literature. A couple of months ago, the Google ngrams project presented a database of all the words from Google&#8217;s extensive book collection. Making the books freely available presents copyright issues, but a database of word frequency in [...]]]></description>
			<content:encoded><![CDATA[<style type="text/css">
#leftcontainerBox {
float:left;
position: fixed;
top: 60%;
left: 70px;
}

#leftcontainerBox .buttons {
float:left;
clear:both;
margin:4px 4px 4px 4px;

padding-bottom:2px;
}


#bottomcontainerBox {
height: 30px;
width:50%;
padding-top:1px;
}

#bottomcontainerBox .buttons {
float:left;
height: 30px;
margin:4px 4px 4px 4px;
}

</style>
<p>Thanks to Google&#8217;s<a href="http://ngrams.googlelabs.com/" target="_blank"> ngrams project</a> page I have wasted my scarce spare hours looking at micro trends in literature. A couple of months ago, the Google ngrams project presented a database of all the words from Google&#8217;s extensive book collection. Making the books freely available presents copyright issues, but a database of word frequency in a collection of books is legal. They even created a simple graphing tool so you can basically play with the data. Or you can download the entire dataset for your own purposes. Micro-trends in literature might not sound very exciting, but once I started trying words, it became an addictive tool to try to prove my zany cultural theories.</p>
<p>One graph seemed very appropriate for today, International Women&#8217;s Day.  I plotted the words &#8220;men&#8221;, &#8220;women&#8221; and &#8220;children&#8221; versus time. And look!</p>
<p><span style="color: #0000ff;">&#8220;men&#8221; in blue</span>, <span style="color: #ff0000;">&#8220;women&#8221; in red</span>, <span style="color: #00ff00;">&#8220;children&#8221; in green</span></p>
<p><a href="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/03/womendaygraph.png"><img class="size-medium wp-image-257 aligncenter" title="womendaygraph" src="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/03/womendaygraph-300x112.png" alt="" width="300" height="112" /></a></p>
<p>The years range from 1800 to 2008 and you can see clearly that the word &#8220;men&#8221; (the blue line) rules by a long shot up until about 1920. To be fair, &#8220;men&#8221; can be used in a generalized sense to mean both men and women similar to the word &#8220;mankind&#8221;. Since there&#8217;s no context I can&#8217;t distinguish what percentage of the words actually refer to both sexes.</p>
<p>But the interesting part of the graph is the uptick in the usage of &#8220;women&#8221; starting during the era of 1960s feminism. Even more interesting, &#8220;women&#8221; overtakes &#8220;men&#8221; in the mid-1990s.</p>
<p>Shortly afterwards, &#8220;women&#8221; decreases and &#8220;men&#8221; once again rule. A decline in feminism? Or perhaps the bubble in the 1990s was due to the peak in so-called chick-lit which has since gone out of favor. To provide a cultural reference point, <em>Bridget Jones Diary</em>, the epitome of chick-lit, came out in 1996.</p>
<p>&#8220;Children&#8221; seem to have a steady increase all the way from the 1800s to the present day. The slow rate of increase in the word &#8220;children&#8221; surprises me since there&#8217;s been an explosion of children&#8217;s books since the days of Beatrix Potter. Perhaps Google has disdained uploading children&#8217;s literature into its database? I also tried the words &#8220;boy&#8221; and &#8220;girl&#8221; and they show a lower percentage of usage than &#8220;children&#8221;:</p>
<p><span style="color: #3366ff;">&#8220;children&#8221; in blue</span>,<span style="color: #ff0000;"> &#8220;boy&#8221; in red</span>, <span style="color: #00ff00;">&#8220;girl&#8221; in green</span></p>
<p><a href="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/03/childrengraph.png"><img class="size-medium wp-image-258 aligncenter" title="childrengraph" src="http://www.researchpipeline.com/wordpress/wp-content/uploads/2011/03/childrengraph-300x111.png" alt="" width="300" height="111" /></a></p>
<p>Happy International Women&#8217;s Day.</p>
<p style="text-align: right;"><em><strong>-Lyndie Chiou</strong></em></p>]]></content:encoded>
			<wfw:commentRss>http://www.researchpipeline.com/wordpress/2011/03/08/international-womens-day/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Data Scraping</title>
		<link>http://www.researchpipeline.com/wordpress/2009/01/17/data-scraping/</link>
		<comments>http://www.researchpipeline.com/wordpress/2009/01/17/data-scraping/#comments</comments>
		<pubDate>Sun, 18 Jan 2009 05:53:56 +0000</pubDate>
		<dc:creator>Lyndie Chiou</dc:creator>
				<category><![CDATA[datasets]]></category>
		<category><![CDATA[data scraping]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[maps]]></category>
		<category><![CDATA[mashup]]></category>

		<guid isPermaLink="false">http://www.researchpipeline.com/wordpress/?p=36</guid>
		<description><![CDATA[<style type="text/css">
#leftcontainerBox {
float:left;
position: fixed;
top: 60%;
left: 70px;
}

#leftcontainerBox .buttons {
float:left;
clear:both;
margin:4px 4px 4px 4px;

padding-bottom:2px;
}


#bottomcontainerBox {
height: 30px;
width:50%;
padding-top:1px;
}

#bottomcontainerBox .buttons {
float:left;
height: 30px;
margin:4px 4px 4px 4px;
}

</style>
I came across a useful post on the blog ouseful.wordpress.com. The blogger, Tony Hirst, blogs about whatever he finds interesting. He figured out a way to scrape the data off of Wikipedia using the Google spreadsheet function =importHTM(”&#8221;,”table”,N). The  blogger gives detailed instructions on how to extract a table containing population data from England using [...]]]></description>
			<content:encoded><![CDATA[<style type="text/css">
#leftcontainerBox {
float:left;
position: fixed;
top: 60%;
left: 70px;
}

#leftcontainerBox .buttons {
float:left;
clear:both;
margin:4px 4px 4px 4px;

padding-bottom:2px;
}


#bottomcontainerBox {
height: 30px;
width:50%;
padding-top:1px;
}

#bottomcontainerBox .buttons {
float:left;
height: 30px;
margin:4px 4px 4px 4px;
}

</style>
<p>I came across a useful post on the blog <a href="http://ouseful.wordpress.com/2008/10/14/data-scraping-wikipedia-with-google-spreadsheets/" target="_blank">ouseful.wordpress.com</a>. The blogger, Tony Hirst, blogs about whatever he finds interesting. He figured out a way to scrape the data off of Wikipedia using the Google spreadsheet function <em>=importHTM(”&#8221;,”table”,N).</em></p>
<p>The  blogger gives detailed instructions on how to extract a table containing population data from England using the Google spreadsheet function. He then uses <a href="http://pipes.yahoo.com/pipes/" target="_blank">Yahoo! pipes</a> to geocode the data and create a Google mashup. It&#8217;s a very ingenious method of extracting limited datasets!</p>
<p>Another blogger, <a href="http://googlemapsmania.blogspot.com/2008/10/data-scaping-wikipedia.html" target="_blank">GoogleMapsMania</a>, suggested using <a href="http://www.batchgeocode.com/" target="_blank">Batchgeocode</a> to get the latitude and longitudinal data and then applying that data in the <a href="http://gmaps-samples.googlecode.com/svn/trunk/spreadsheetsmapwizard/makecustommap.htm" target="_blank">Google Spreadsheet Map Wizard</a> to map out the data</p>
<p style="text-align: right;"><em><strong>-Lyndie Chiou</strong></em></p>]]></content:encoded>
			<wfw:commentRss>http://www.researchpipeline.com/wordpress/2009/01/17/data-scraping/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google&#8217;s Palimpset cancelled</title>
		<link>http://www.researchpipeline.com/wordpress/2008/12/28/googles-palimpset-cancelled/</link>
		<comments>http://www.researchpipeline.com/wordpress/2008/12/28/googles-palimpset-cancelled/#comments</comments>
		<pubDate>Mon, 29 Dec 2008 03:04:46 +0000</pubDate>
		<dc:creator>Lyndie Chiou</dc:creator>
				<category><![CDATA[data hosting]]></category>
		<category><![CDATA[datasets]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Palimpset]]></category>
		<category><![CDATA[terabytes]]></category>

		<guid isPermaLink="false">http://www.researchpipeline.com/wordpress/?p=17</guid>
		<description><![CDATA[<style type="text/css">
#leftcontainerBox {
float:left;
position: fixed;
top: 60%;
left: 70px;
}

#leftcontainerBox .buttons {
float:left;
clear:both;
margin:4px 4px 4px 4px;

padding-bottom:2px;
}


#bottomcontainerBox {
height: 30px;
width:50%;
padding-top:1px;
}

#bottomcontainerBox .buttons {
float:left;
height: 30px;
margin:4px 4px 4px 4px;
}

</style>
I was surprised to read that Google has decided to cancel its data hosting service, nee Palimpsest. The name Palimpsest came from the book, Archimedes Palimpsest. To read more on the project which recently restored Archimedes Palimpset click here. Google was originally planning to host Terabytes of data, including astronomical and large governmental data sets. [...]]]></description>
			<content:encoded><![CDATA[<style type="text/css">
#leftcontainerBox {
float:left;
position: fixed;
top: 60%;
left: 70px;
}

#leftcontainerBox .buttons {
float:left;
clear:both;
margin:4px 4px 4px 4px;

padding-bottom:2px;
}


#bottomcontainerBox {
height: 30px;
width:50%;
padding-top:1px;
}

#bottomcontainerBox .buttons {
float:left;
height: 30px;
margin:4px 4px 4px 4px;
}

</style>
<p>I was surprised to read that Google has decided to cancel its data hosting service, nee Palimpsest. The name Palimpsest came from the book, Archimedes Palimpsest. To read more on the project which recently restored Archimedes Palimpset click <a href="http://www.archimedespalimpsest.org/" target="_blank">here</a>.</p>
<p>Google was originally planning to host Terabytes of data, including astronomical and large governmental data sets. Many bloggers had been hailing Google&#8217;s service as the start of a new era of transparency for the USA.</p>
<p>The decision to cancel Google&#8217;s Palimpset came just a week ago. The sharp downturn in the economy played a key role. Google&#8217;s stock fell from an all-time high of about $700 a year ago to around $300 today. This caused Google to sharply curtail &#8220;experimental&#8221; programs that have no guaranteed revenues.</p>
<p style="text-align: right;"><em><strong>-Lyndie Chiou</strong></em></p>]]></content:encoded>
			<wfw:commentRss>http://www.researchpipeline.com/wordpress/2008/12/28/googles-palimpset-cancelled/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

