<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>My Observations</title>
	<atom:link href="http://ruthygarcia.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://ruthygarcia.wordpress.com</link>
	<description>Just another WordPress.com site</description>
	<lastBuildDate>Thu, 17 Jan 2013 18:25:26 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='ruthygarcia.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>My Observations</title>
		<link>http://ruthygarcia.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://ruthygarcia.wordpress.com/osd.xml" title="My Observations" />
	<atom:link rel='hub' href='http://ruthygarcia.wordpress.com/?pushpress=hub'/>
		<item>
		<title>Do multilingual people build bridges across countries?</title>
		<link>http://ruthygarcia.wordpress.com/2013/01/17/do-multilingual-people-build-bridges-across-countries/</link>
		<comments>http://ruthygarcia.wordpress.com/2013/01/17/do-multilingual-people-build-bridges-across-countries/#comments</comments>
		<pubDate>Thu, 17 Jan 2013 16:44:27 +0000</pubDate>
		<dc:creator>ruthygarcia</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[Information Propagation]]></category>
		<category><![CDATA[Languages]]></category>
		<category><![CDATA[Twitter]]></category>

		<guid isPermaLink="false">http://ruthygarcia.wordpress.com/?p=42</guid>
		<description><![CDATA[Today Irene Eleta  from university of Maryland visited the lab with a seminar called   Multilingual Users of Twitter: Social Ties Across Language Borders or How a Story Could Travel the World.&#8221;  As me, she has been working a lot with Twitter &#8230; <a href="http://ruthygarcia.wordpress.com/2013/01/17/do-multilingual-people-build-bridges-across-countries/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ruthygarcia.wordpress.com&#038;blog=16564565&#038;post=42&#038;subd=ruthygarcia&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Today <a href="http://www.ieleta.com">Irene Eleta</a>  from university of Maryland visited the lab with a seminar called   Multilingual Users of Twitter: Social Ties Across Language Borders or How a Story Could Travel the World.&#8221;  As me, she has been working a lot with Twitter for her PhD thesis. She is particularly interested in exploring the role of multilingual users in different social media platforms.  Among her challenges , she aims to find solutions in</p>
<ul>
<li>classifying tweets in a certain language but quoting the name of songs/books/movies in another language.</li>
<li>Detecting automatic messages in different languages</li>
<li>Scripts of translators in arabic, jew, etc. (coding)</li>
</ul>
<p>My suggestion in the first problem is to &#8220;ignore&#8221; single tweets where two languages are detected and only consider those that have a high probability of being only from one language. The reason is that many people use many english expressions without knowing English.  Users may be classified as multilingual when only they are using names of movies in English or using expressions etc.</p>
<p>On the other hand, (maybe what I was more interested about) was about the role of multilingual people in Information propagation.  The idea is to measure the real importance of these people in the moment of special events like protests, revolutions, crisis or catastrophic events such as earthquakes.  Some of the questions to answer would be:</p>
<ul>
<li>Are previously classified multilingual people important somehow in propagating information in special events? My naive hypothesis would be &#8220;yes, they are.&#8221; Because they know other languages, multilingual people will care more about propagating information to the world so that the world can also understand what is going on&#8230;.. and in particular, the language chosen to communicate with the world will be English. In order to explore this, temporal analysis will be needed.</li>
</ul>
<p>Looking from another perspective, the detection of multilingual users and the study of their interactions can trigger the invention of new useful functionalities in many sites. For example, up to now I always have problems with language detections and spell checking using Gmail&#8230;.wouldn&#8217;t it be nice if Gmail will know the &#8220;language&#8221; that you use with your friends and automatically change the spell checker? It seems for me that up to now Gmail saves the previously used spell checker&#8230;and it bother me a lot to be switching languages all the time to avoid those annoying red lines.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ruthygarcia.wordpress.com/42/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ruthygarcia.wordpress.com/42/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ruthygarcia.wordpress.com&#038;blog=16564565&#038;post=42&#038;subd=ruthygarcia&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://ruthygarcia.wordpress.com/2013/01/17/do-multilingual-people-build-bridges-across-countries/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/21979d6110351fc23cd2528d26b89467?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">ruthygarcia</media:title>
		</media:content>
	</item>
		<item>
		<title>Goodbye Aaron Swartz</title>
		<link>http://ruthygarcia.wordpress.com/2013/01/13/goodbye-aaron-swartz/</link>
		<comments>http://ruthygarcia.wordpress.com/2013/01/13/goodbye-aaron-swartz/#comments</comments>
		<pubDate>Sun, 13 Jan 2013 19:12:33 +0000</pubDate>
		<dc:creator>ruthygarcia</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://ruthygarcia.wordpress.com/?p=29</guid>
		<description><![CDATA[Didn&#8217;t know much about Aaron Swartz until recently. He committed suicide after the fear of being in prison for almost all his life. He was a great programmer, activist, full of great  ideas about free information  and with many dreams. At age of &#8230; <a href="http://ruthygarcia.wordpress.com/2013/01/13/goodbye-aaron-swartz/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ruthygarcia.wordpress.com&#038;blog=16564565&#038;post=29&#038;subd=ruthygarcia&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Didn&#8217;t know much about Aaron Swartz until recently. He committed suicide after the fear of being in prison for almost all his life. He was a great programmer, activist, full of great  ideas about free information  and with many dreams. At age of 14 he co-authored the <a href="http://en.wikipedia.org/wiki/RSS">RSS 1.0</a> project. He seems to have been a great combination of good intentions, intelligence and fairness. He dreamed of freely sharing information, of finding a way around to the stupidity of patent laws (seriously some of them are truly ridiculous). Big loss for those who fight for fairness in this world.  There are two interesting blog posts I read so far about him, one from a someone who knew him personally  <a href="http://www.zephoria.org/thoughts/archives/2013/01/13/aaron-swartz.html">danah boyd</a>  and other<a href="http://unhandled.com/2013/01/12/the-truth-about-aaron-swartzs-crime/"> post</a> arguing in his defense and the unfairness of the 35 year sentence. His family&#8217;s official statement is <a href="http://rememberaaronsw.tumblr.com/post/40372208044/official-statement-from-the-family-and-partner-of-aaron">here</a>, they blame MIT and JSTOR for this.</p>
<p>&#8220;Punishment sometimes don&#8217;t seem to fit the crime,&#8221; and definitely it was not fair to condem him for 35 years of prison just because he wanted to share the scientific articles of MIT to the world. People do that all the time. Sometimes you can get a scientific article just by asking the author. I feel saddened when these things happen, specially if it involves smart, creative and good people. Who knows the great things he could have done for humanity&#8230;  RIP for Aaron Swartz.</p>
<p>Good bye Aaron Swartz.</p>
<p>Here some explanation about some of the irony of Intelectual Property.</p>
<span class='embed-youtube' style='text-align:center; display: block;'><iframe class='youtube-player' type='text/html' width='640' height='390' src='http://www.youtube.com/embed/Fgh2dFngFsg?version=3&#038;rel=1&#038;fs=1&#038;showsearch=0&#038;showinfo=1&#038;iv_load_policy=1&#038;wmode=transparent' frameborder='0'></iframe></span>
<p>P.S: I refer to a great <a href="http://sushiknights.org/">site </a>in Spanish for &#8220;Hacktivistas y Cultura Libre&#8221; where a friend and former Scientist of Yahoo Labs! actively participates.</p>
<p>&nbsp;</p>
<p>Update: Now, you can liberate knowledge in <a href="http://aaronsw.archiveteam.org/">JSTOR LIBERATOR</a> site! People like Aaron can leave this world but they will influence others to continue unfinished tasks.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ruthygarcia.wordpress.com/29/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ruthygarcia.wordpress.com/29/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ruthygarcia.wordpress.com&#038;blog=16564565&#038;post=29&#038;subd=ruthygarcia&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://ruthygarcia.wordpress.com/2013/01/13/goodbye-aaron-swartz/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/21979d6110351fc23cd2528d26b89467?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">ruthygarcia</media:title>
		</media:content>
	</item>
		<item>
		<title>PhD, things to keep in mind.</title>
		<link>http://ruthygarcia.wordpress.com/2013/01/13/phd-things-to-keep-in-mind/</link>
		<comments>http://ruthygarcia.wordpress.com/2013/01/13/phd-things-to-keep-in-mind/#comments</comments>
		<pubDate>Sun, 13 Jan 2013 18:52:56 +0000</pubDate>
		<dc:creator>ruthygarcia</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://ruthygarcia.wordpress.com/?p=27</guid>
		<description><![CDATA[There is a talk I heard a couple of years ago, it was Marisa Meyer (current yahoo CEO) IT commencement address at Standford.  I can now understand and fully appreciate when she said  &#8221;Find the smartest people you can and surround &#8230; <a href="http://ruthygarcia.wordpress.com/2013/01/13/phd-things-to-keep-in-mind/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ruthygarcia.wordpress.com&#038;blog=16564565&#038;post=27&#038;subd=ruthygarcia&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>There is a talk I heard a couple of years ago, it was Marisa Meyer (current yahoo CEO) IT commencement address at Standford.  I can now understand and fully appreciate when she said</p>
<blockquote><p> &#8221;Find the smartest people you can and surround yourself with them. Working with smart people means that you will be challenged to do your best. You have to strive to keep up with them and as a result they will elevate your thinking. When there are better players around you, you get better.&#8221;</p></blockquote>
<p>It is hard though to realize all the things one have to do to cope with people we admire. I have been so lucky to meet so many &#8220;great&#8221; people here. It was exactly what I asked the world when I left my country. Keeping up with them is another story. I realized that my work methodology is chaotic and that it urgently needs to be improved. I am trying to do that this year.</p>
<p>So the things that I have learned observing people who do &#8220;good research and still enjoy free time&#8221; are the following :</p>
<ol>
<li><strong> Efficiency at work: no procrastination.</strong> Establish clear goals every day during work.   This is a big problem for me. I tend to multitask too much. Although it has been said that women multitask a lot, I think this is not good for research. I know there are people have different methods to cope with work. There are people at the lab even on Sundays but I guess that if one aims to have a &#8220;life,&#8221; so no procrastination and efficiency is a must.</li>
<li><strong> Time management: </strong>Plan always the next action otherwise it will be hard to do everything you want to do&#8230;even the weekends.</li>
<li><strong>Team work: </strong>Finding a team to work. I think it is more productive and fun. If you code a lot, it would be great if you find a PhD partner who also likes coding , you can share work, discuss, motivate each other. If you plan to write two papers, one with your friend as first author and the other you as first author then even better.</li>
<li><strong>A good advisor</strong>: Advisors do not have time but comments from them are helpful. So in order to have a good feedback, you need to have results or hypothesis ready to show to your advisor. Their experience and help always are useful. I personally ask advice from people I admire.</li>
<li><strong>Love: </strong> try to love what you are doing as much as you can.  This is  hard sometimes&#8230;an idea that you believe can be great and fun can turn out to be the worst nightmare. I feel it has happened to me but oh well&#8230; love &#8220;bites&#8221; sometimes, we have to keep trying.</li>
</ol>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ruthygarcia.wordpress.com/27/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ruthygarcia.wordpress.com/27/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ruthygarcia.wordpress.com&#038;blog=16564565&#038;post=27&#038;subd=ruthygarcia&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://ruthygarcia.wordpress.com/2013/01/13/phd-things-to-keep-in-mind/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/21979d6110351fc23cd2528d26b89467?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">ruthygarcia</media:title>
		</media:content>
	</item>
		<item>
		<title>Loving your job</title>
		<link>http://ruthygarcia.wordpress.com/2011/02/02/loving-your-job/</link>
		<comments>http://ruthygarcia.wordpress.com/2011/02/02/loving-your-job/#comments</comments>
		<pubDate>Wed, 02 Feb 2011 19:35:33 +0000</pubDate>
		<dc:creator>ruthygarcia</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Observations]]></category>

		<guid isPermaLink="false">http://ruthygarcia.wordpress.com/?p=22</guid>
		<description><![CDATA[When I finished high school I was convinced that I wanted to study International Affairs. I grew up in an environment where the most common topics of discussion were politics, history and sociology. On top of that, my first boyfriend &#8230; <a href="http://ruthygarcia.wordpress.com/2011/02/02/loving-your-job/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ruthygarcia.wordpress.com&#038;blog=16564565&#038;post=22&#038;subd=ruthygarcia&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>When I finished high school I was convinced that I wanted to study  International Affairs. I grew up in an environment where the most common  topics of discussion were politics, history and sociology. On top of  that, my first boyfriend was a sociologist that would absolutely love to  talk about politics, laws, religion, etc.</p>
<p>Years later, I was lucky to do an internship at the UN for the  Ecuadorian mission in New York where I attended several conferences and   meetings including the meeting  <em>Women 2000: Gender Equality , Development and Peace</em> where a bunch of women got together to talk about the progress that was done on gender equality.</p>
<p>Ironically, it was precisely at the UN that I got disappointed about  it all. I realized with much grief that people would spend so much time  talking about how to write the paper of a meeting or how important  it  was for a country to show in that paper what their representatives have  talked&#8230;I had the impression that the majority of people there didn&#8217;t  really care about the solutions and the actions of very important  things, they cared more about the protocols, the papers and the meetings  and connections. I also felt like many of those delegates were not the  right people to be there …  I mean, it was difficult for me to  understand how the presence of certain people could actually help in  something.</p>
<p>Given that I always had a fascination for math , logic and  programming I changed my mind and chose other major: Computers. At first  it was Computer Science and then I changed to Computer Engineering when  I got back to Ecuador.</p>
<p>The truth is that protocols, procedures and writing processes are  important in almost every field. Connections as well, sometimes they are  fair, sometimes they are not.  I do believe that if you are brilliant  in something,  people will look for you,  if you are brilliant in  something you are lucky!  But if you are just someone who struggles hard  and who can make a good job after a lot of effort then connections  always help. The sad part is when you are bad at something and you still  get a good position in something you are bad at because of the  connections.</p>
<p>I also realized that in life you find  few passionate and courageous  people in their jobs …. I have the impression that everybody is tired  the majority of the time regardless the field chosen to work in. Few are  the ones who really love their jobs&#8230;</p>
<p>Do I love my choice?  I don&#8217;t know&#8230;. I want to discover it.  Can I be someone who makes a bit of a difference in this?</p>
<p>I must confess,  sometimes I do not find any meaning in what I am  doing &#8230;sometimes I think I will be doing something more productive if I  plant potatoes in the garden of my house in Ecuador. But maybe I am not  the only one thinking that. At the end, research is about discovery …  maybe I will find my passion soon in the middle of the screen among Pig  scripts. What I want to discover though&#8230;is something that could be  used to help people&#8230;in anything, but to help people.</p>
<p>What I love about this new world (Research) though is that I find  very interesting fellows &#8230;a lot of the people I  have come across have  different talents and interests&#8230;. and almost all of them share  authenticity in their personality. In my lab , beside researchers, you  find musicians, athletics, dancers and writers&#8230;</p>
<p>I love getting to know women in Tech,  despite the fact that we are  only a small percentage in this field, so far they all have made such a  great impression upon me.</p>
<p>Tomorrow is the New Chinese Year, the year of the rabbit&#8230;.please dear rabbit let it be my year of discovery.</p>
<p><a href="http://ruthygarcia.files.wordpress.com/2011/02/rabbit.jpg"><img class="aligncenter size-thumbnail wp-image-23" title="rabbit" src="http://ruthygarcia.files.wordpress.com/2011/02/rabbit.jpg?w=150&#038;h=141" alt="" width="150" height="141" /></a></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ruthygarcia.wordpress.com/22/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ruthygarcia.wordpress.com/22/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ruthygarcia.wordpress.com&#038;blog=16564565&#038;post=22&#038;subd=ruthygarcia&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://ruthygarcia.wordpress.com/2011/02/02/loving-your-job/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/21979d6110351fc23cd2528d26b89467?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">ruthygarcia</media:title>
		</media:content>

		<media:content url="http://ruthygarcia.files.wordpress.com/2011/02/rabbit.jpg?w=150" medium="image">
			<media:title type="html">rabbit</media:title>
		</media:content>
	</item>
		<item>
		<title>PIG AND HADOOP CONFIGURED!</title>
		<link>http://ruthygarcia.wordpress.com/2010/10/25/pig-and-hadoop-configured/</link>
		<comments>http://ruthygarcia.wordpress.com/2010/10/25/pig-and-hadoop-configured/#comments</comments>
		<pubDate>Mon, 25 Oct 2010 22:40:07 +0000</pubDate>
		<dc:creator>ruthygarcia</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[configuration]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[Pig]]></category>

		<guid isPermaLink="false">http://ruthygarcia.wordpress.com/?p=17</guid>
		<description><![CDATA[Experience I finally managed to configure Pig and Hadoop on my computer. I used Pig 0.7 and Hadoop 0.20.2. It took me a while to configure but finally I made it.  Hadoop and Pig are constantly getting updated so don&#8217;t &#8230; <a href="http://ruthygarcia.wordpress.com/2010/10/25/pig-and-hadoop-configured/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ruthygarcia.wordpress.com&#038;blog=16564565&#038;post=17&#038;subd=ruthygarcia&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><strong>Experience</strong></p>
<p>I finally managed to configure Pig and Hadoop on my computer. I used Pig 0.7 and Hadoop 0.20.2. It took me a while to configure but finally I made it.  Hadoop and Pig are constantly getting updated so don&#8217;t trust much on tutorials of older versions if you are not very experienced on the matter.  Nevertheless, I should mention <a title="tutorial hadoop" href="http://www.michael-noll.com/wiki/Running_Hadoop_On_Ubuntu_Linux_%28Single-Node_Cluster%29">this tutorial</a> because it helped me a great deal in understanding how to configure hadoop. The only major misunderstanding was with the configuration of the ssh,  so if you are a beginner like me, be careful to mess with ssh .</p>
<p><strong>Advices:</strong></p>
<ol>
<li>Read the apache tutorials on Pig and Hadoop but be careful with some mistakes they make on the writing</li>
<li>Use the tutorial that comes within the folder of Pig (the tutorial files they talk on the Pig tutorial are inside the Pig&#8217;s folder).</li>
<li>Get the latest stable versions</li>
</ol>
<p><em>1. Red the apache tutorials on Pig and Hadoop but be careful with some mistakes : It means that they do have some mistakes, for example on this <a href="http://pig.apache.org/docs/r0.5.0/setup.html">part of the tutorial</a> the id.pig  is:</em></p>
<pre>A = load 'passwd' using PigStorage(':');
B = foreach A generate $0 as id;
dump B;
store B into ‘id.out’;</pre>
<p>They forget to mention that you either use dump or store&#8230;you may have some errors if you use both. Second if you copied and pasted this code then you will for sure have an error instead change the last part with &#8216;id.out&#8217; (not the same as above).   I also received an error with the following mapreduce script</p>
<pre>Unix:   $ java -cp pig.jar:.:$HADOOPDIR idmapreduce</pre>
<p>It can not find the passwd file on hdfs directory and  it does not have a logout file to write the results. Instead of figuring out the problem, I went ahead and ran another mapreduce job with another command from the<a href="http://pig.apache.org/docs/r0.5.0/tutorial.html"> next section</a> of this tutorial (following the steps) and it worked!</p>
<pre>$ java -cp $PIGDIR/pig.jar:$HADOOP_CONF_DIR  org.apache.pig.Main 
script1-hadoop.pig</pre>
<p>So if this script worked fine then the previous one must have something wrong, I will test tomorrow if putting passwd on the hdfs  would eventually solve the problem.</p>
<p><em>2. Use the tutorial that comes within the folder of Pig  (the tutorial files they talk on the Pig tutorial are inside the Pig&#8217;s  folder) and 3. Get the latest stable versions</em></p>
<p>This is important because there are changes between versions. I made a stupid stupid mistake on this. I did not know that the files used for testing on the Pig&#8217;s tutorials are actually inside a folder called &#8220;tutorial&#8221; inside my Pig&#8217;s folder. So I downloaded a tutorial of a previous Pig&#8217;s version&#8230;.and of course I kept getting mistakes since I was running with a later version of Pig.  After I made the  appropriate corrections , it worked!!</p>
<p>The errors I was having were scary and hard to interpret , I got for example: &#8220;INFO executionengine.HExecutionEngine: Connecting to hadoop file system at: file:///&#8221;  and &#8220;ERROR mapReduceLayer.MapReduceLauncher: java.io.IOException: excite.log.bz2 does not exist&#8221; (posted <a href="https://issues.apache.org/jira/browse/PIG-1698">here</a>).</p>
<p>It was finally solved when I used the appropriate tutorial files. It was not easy to figure it out.</p>
<p><strong>Future work</strong></p>
<p>Well, now that I have Pig and hadoop running smoothly, I will start to make a lot of experiments. My task is to give a &#8220;score&#8221; to tweets according to a list of words with or without weights.  So for example if my tweet of 8 words is  &#8220;Samsung Launching New Android Device on November   <a href="http://on.mash.to/9wJbGC&#038;#8221" rel="nofollow">http://on.mash.to/9wJbGC&#038;#8221</a>; and my list has three words Iphone BlackBerry and Android , the total weight of this tweet will be 1/8. Things get more complicated when I have to filter content and use weights&#8230;. I will run my experiments in one large file containing a lot of tweets and THEN after having it right I will run in the cluster of yahoo&#8230;which has huge amount of data.</p>
<p><strong>Questions</strong></p>
<ol>
<li>Should we consider numbers and urls as words? I was told that urls should be considered as counting words but I am a little reluctant about it. (of course RT,  via, @&#8230; will not be considered)</li>
<li>I am afraid with regard to the languages&#8230;.how to sort that?</li>
</ol>
<p><strong>Motivation</strong></p>
<p>I will assist remotely to a class in California LA regarding Pig (introductory 2 hour course) and given by yahoo <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ruthygarcia.wordpress.com/17/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ruthygarcia.wordpress.com/17/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ruthygarcia.wordpress.com&#038;blog=16564565&#038;post=17&#038;subd=ruthygarcia&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://ruthygarcia.wordpress.com/2010/10/25/pig-and-hadoop-configured/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/21979d6110351fc23cd2528d26b89467?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">ruthygarcia</media:title>
		</media:content>
	</item>
		<item>
		<title>Going to Yahoo R +D</title>
		<link>http://ruthygarcia.wordpress.com/2010/10/19/7/</link>
		<comments>http://ruthygarcia.wordpress.com/2010/10/19/7/#comments</comments>
		<pubDate>Tue, 19 Oct 2010 19:49:15 +0000</pubDate>
		<dc:creator>ruthygarcia</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[office]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[work]]></category>

		<guid isPermaLink="false">http://ruthygarcia.wordpress.com/?p=7</guid>
		<description><![CDATA[Started my phd with a lot of work. After two weeks of getting things ready and beautifying my desk I was moved to yahoo R&#38;D where I am supposed to stay at least half of the day&#8230; but in the &#8230; <a href="http://ruthygarcia.wordpress.com/2010/10/19/7/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ruthygarcia.wordpress.com&#038;blog=16564565&#038;post=7&#038;subd=ruthygarcia&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Started my phd with a lot of work. After two weeks of getting things ready and beautifying my desk I was moved to yahoo R&amp;D where I am supposed to stay at least half of the day&#8230; but in the reality I am staying almost the whole day due to the complexity of my tasks.</p>
<p>I am motivated of course but I have to do a lot of things I have never done before&#8230;lots of learning these days.</p>
<p>What I can say is that the project I am getting into is very interesting because it will analyze &#8220;diversity&#8221; of opinions and cultural differences&#8230;At least try to catch that from what people say online.</p>
<p>Hadoop + Pig = me crazy.</p>
<p>How is that for my first post?</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ruthygarcia.wordpress.com/7/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ruthygarcia.wordpress.com/7/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ruthygarcia.wordpress.com&#038;blog=16564565&#038;post=7&#038;subd=ruthygarcia&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://ruthygarcia.wordpress.com/2010/10/19/7/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/21979d6110351fc23cd2528d26b89467?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">ruthygarcia</media:title>
		</media:content>
	</item>
		<item>
		<title>Hello world!</title>
		<link>http://ruthygarcia.wordpress.com/2010/10/10/hello-world/</link>
		<comments>http://ruthygarcia.wordpress.com/2010/10/10/hello-world/#comments</comments>
		<pubDate>Sun, 10 Oct 2010 14:06:29 +0000</pubDate>
		<dc:creator>ruthygarcia</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://ruthygarcia.wordpress.com/?p=1</guid>
		<description><![CDATA[Hello world, this is my new blog!. I have always liked to write what I do and think.  Since 11 y.o I carry a diary, the frequency has decreased a big deal and I do not update my diary on &#8230; <a href="http://ruthygarcia.wordpress.com/2010/10/10/hello-world/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ruthygarcia.wordpress.com&#038;blog=16564565&#038;post=1&#038;subd=ruthygarcia&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><em><strong>Hello world, this is my new blog!. </strong></em></p>
<p>I have always liked to write what I do and think.  Since 11 y.o I carry a diary, the frequency has decreased a big deal and I do not update my diary on a paper anymore, now everything I write is digital. I think that is one of the reasons of my very bad handwriting.</p>
<p>What does this blog differentiate from the previous one? In this blog, I will focus more on my PHD and everything that I discover along the way. In other words, no drama, no love stories, just science, code and of course some observations about life in general.</p>
<p>I will write mostly in English but I may be tempted to write in Spanish a couple of times. English is not my native language but I take it as a challenge and a way to practice my writing skills.</p>
<p style="text-align:center;"><a href="http://ruthygarcia.files.wordpress.com/2010/10/images.jpeg"><img class="alignnone size-thumbnail wp-image-15" title="hellowkity" src="http://ruthygarcia.files.wordpress.com/2010/10/images.jpeg?w=137&#038;h=150" alt="" width="137" height="150" /></a></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ruthygarcia.wordpress.com/1/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ruthygarcia.wordpress.com/1/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ruthygarcia.wordpress.com&#038;blog=16564565&#038;post=1&#038;subd=ruthygarcia&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://ruthygarcia.wordpress.com/2010/10/10/hello-world/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/21979d6110351fc23cd2528d26b89467?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">ruthygarcia</media:title>
		</media:content>

		<media:content url="http://ruthygarcia.files.wordpress.com/2010/10/images.jpeg?w=137" medium="image">
			<media:title type="html">hellowkity</media:title>
		</media:content>
	</item>
	</channel>
</rss>
