<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments for Somethink to Chew On</title>
	<atom:link href="http://www.harlan.harris.name/comments/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.harlan.harris.name</link>
	<description>the blog of Harlan Harris</description>
	<lastBuildDate>Sat, 07 Jan 2012 01:55:14 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
	<item>
		<title>Comment on ggplot and concepts &#8212; what&#8217;s right, and what&#8217;s wrong by chango</title>
		<link>http://www.harlan.harris.name/2010/03/ggplot-and-concepts-whats-right-and-whats-wrong/comment-page-1/#comment-192</link>
		<dc:creator>chango</dc:creator>
		<pubDate>Sat, 07 Jan 2012 01:55:14 +0000</pubDate>
		<guid isPermaLink="false">http://www.harlan.harris.name/?p=47#comment-192</guid>
		<description>Interesting criticisms of ggplot2. I actually came to this page from R bloggers to see if Hadley had weighed in. He&#039;s extremely open to criticism and outside ideas (there are many developers of ggplot and they are an amazing community, he&#039;s not the only person driving it). 

In fact, ggplot2 is under going major work currently (under the guise of a separate layers package), see the thread in the ggplot2-dev group. 

http://groups.google.com/group/ggplot2-dev/browse_thread/thread/6d41a475a51025d1

Here he describes the what and the why of rewriting it. He&#039;s aware of the speed issues, and the nightmare of the proto objects ( which i was expecting to be is major drawback). The appropriate use of operators for clarity is a great point (though I believe commutativity a property of the object being operated on and not the operator itself) and I too believe that qplot should be scrapped. Alas, I do not understand the need to build a separate, new jjplot... but it could be cool. There&#039;s just so much awesome work going on on ggplot2. 

It&#039;s actually why I switched to R and Hadley&#039;s other packages are equally paradigm changing and save me a ton of work. When they defuddle me, the community is strongly supportive.</description>
		<content:encoded><![CDATA[<p>Interesting criticisms of ggplot2. I actually came to this page from R bloggers to see if Hadley had weighed in. He&#8217;s extremely open to criticism and outside ideas (there are many developers of ggplot and they are an amazing community, he&#8217;s not the only person driving it). </p>
<p>In fact, ggplot2 is under going major work currently (under the guise of a separate layers package), see the thread in the ggplot2-dev group. </p>
<p><a href="http://groups.google.com/group/ggplot2-dev/browse_thread/thread/6d41a475a51025d1" rel="nofollow">http://groups.google.com/group/ggplot2-dev/browse_thread/thread/6d41a475a51025d1</a></p>
<p>Here he describes the what and the why of rewriting it. He&#8217;s aware of the speed issues, and the nightmare of the proto objects ( which i was expecting to be is major drawback). The appropriate use of operators for clarity is a great point (though I believe commutativity a property of the object being operated on and not the operator itself) and I too believe that qplot should be scrapped. Alas, I do not understand the need to build a separate, new jjplot&#8230; but it could be cool. There&#8217;s just so much awesome work going on on ggplot2. </p>
<p>It&#8217;s actually why I switched to R and Hadley&#8217;s other packages are equally paradigm changing and save me a ton of work. When they defuddle me, the community is strongly supportive.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Science, Moore&#8217;s Law, and Moneyball by Thought this was cool: 什么是数据科学(Data Science) &#171; CWYAlpha</title>
		<link>http://www.harlan.harris.name/2011/09/data-science-moores-law-and-moneyball/comment-page-1/#comment-189</link>
		<dc:creator>Thought this was cool: 什么是数据科学(Data Science) &#171; CWYAlpha</dc:creator>
		<pubDate>Mon, 28 Nov 2011 14:54:03 +0000</pubDate>
		<guid isPermaLink="false">http://www.harlan.harris.name/?p=149#comment-189</guid>
		<description>[...] 参考资料：http://www.drewconway.com/zia/?p=2378http://www.harlan.harris.name/2011/09/data-science-moores-law-and-moneyball/http://flowingdata.com/2009/06/04/rise-of-the-data-scientist/http://radar.oreilly.com/2010/06/what-is-data-science.html [...]</description>
		<content:encoded><![CDATA[<p>[...] 参考资料：http://www.drewconway.com/zia/?p=2378http://www.harlan.harris.name/2011/09/data-science-moores-law-and-moneyball/http://flowingdata.com/2009/06/04/rise-of-the-data-scientist/http://radar.oreilly.com/2010/06/what-is-data-science.html [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Science, Moore&#8217;s Law, and Moneyball by Harlan</title>
		<link>http://www.harlan.harris.name/2011/09/data-science-moores-law-and-moneyball/comment-page-1/#comment-181</link>
		<dc:creator>Harlan</dc:creator>
		<pubDate>Wed, 16 Nov 2011 03:04:49 +0000</pubDate>
		<guid isPermaLink="false">http://www.harlan.harris.name/?p=149#comment-181</guid>
		<description>Hi Chris. I think you&#039;re absolutely right. Statistical training alone is not adequate for addressing the value of many real-world questions. I think that substantial systems intuition, and the ability to simulate the outcome of business (or government, or whatever) policy changes that might be due to the results of your analysis, is critical. Probably the people who do this best are quantitative MBAs, who have the formal training to do this work and the ability to talk with non-technical business people, and OR people, who are likewise trained to simulate complex processes and estimate the real-world effects (in money, defects, whatever) of various changes. A stats PhD seems much less useful in application of data science than many other degrees.

DBAs are valuable, of course, but they don&#039;t do any of the things I talked about above.</description>
		<content:encoded><![CDATA[<p>Hi Chris. I think you&#8217;re absolutely right. Statistical training alone is not adequate for addressing the value of many real-world questions. I think that substantial systems intuition, and the ability to simulate the outcome of business (or government, or whatever) policy changes that might be due to the results of your analysis, is critical. Probably the people who do this best are quantitative MBAs, who have the formal training to do this work and the ability to talk with non-technical business people, and OR people, who are likewise trained to simulate complex processes and estimate the real-world effects (in money, defects, whatever) of various changes. A stats PhD seems much less useful in application of data science than many other degrees.</p>
<p>DBAs are valuable, of course, but they don&#8217;t do any of the things I talked about above.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Science, Moore&#8217;s Law, and Moneyball by human mathematics</title>
		<link>http://www.harlan.harris.name/2011/09/data-science-moores-law-and-moneyball/comment-page-1/#comment-180</link>
		<dc:creator>human mathematics</dc:creator>
		<pubDate>Wed, 16 Nov 2011 02:50:46 +0000</pubDate>
		<guid isPermaLink="false">http://www.harlan.harris.name/?p=149#comment-180</guid>
		<description>I&#039;d like to hear your comments on the Quora post &quot;What does one have to learn to become a data scientist?&quot;. I thought the post indicated a lack of definition to the field.

I also read something in the NYT maybe a year or two ago which also claimed that stats PhD&#039;s are/will be a valuable labour force. Again, once I see the evidence, I&#039;ll believe it...</description>
		<content:encoded><![CDATA[<p>I&#8217;d like to hear your comments on the Quora post &#8220;What does one have to learn to become a data scientist?&#8221;. I thought the post indicated a lack of definition to the field.</p>
<p>I also read something in the NYT maybe a year or two ago which also claimed that stats PhD&#8217;s are/will be a valuable labour force. Again, once I see the evidence, I&#8217;ll believe it&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Science, Moore&#8217;s Law, and Moneyball by human mathematics</title>
		<link>http://www.harlan.harris.name/2011/09/data-science-moores-law-and-moneyball/comment-page-1/#comment-179</link>
		<dc:creator>human mathematics</dc:creator>
		<pubDate>Wed, 16 Nov 2011 02:48:22 +0000</pubDate>
		<guid isPermaLink="false">http://www.harlan.harris.name/?p=149#comment-179</guid>
		<description>Meh ... I hope Hal Varian is right, but as of now it looks like NoSQL, Hadoop, and various other upgraded ETL/database stuff is where the money&#039;s at. I think statisticians face the problem that they can&#039;t communicate exactly what value they will add to the bottom line (&quot;I will explain things that you don&#039;t understand to you ... which will accomplish ... um ... well I don&#039;t know your business well enough to say&quot;). Whereas DB&#039;s are necessary to the basic functioning of a business.

I see a parallel to Quants (finance). If you are creative and can come up with strategies, great once you&#039;re in the door. But you need to be an excellent programmer to get in the door (so they know they can extract some value from you as a servant, before you become an advisor).</description>
		<content:encoded><![CDATA[<p>Meh &#8230; I hope Hal Varian is right, but as of now it looks like NoSQL, Hadoop, and various other upgraded ETL/database stuff is where the money&#8217;s at. I think statisticians face the problem that they can&#8217;t communicate exactly what value they will add to the bottom line (&#8220;I will explain things that you don&#8217;t understand to you &#8230; which will accomplish &#8230; um &#8230; well I don&#8217;t know your business well enough to say&#8221;). Whereas DB&#8217;s are necessary to the basic functioning of a business.</p>
<p>I see a parallel to Quants (finance). If you are creative and can come up with strategies, great once you&#8217;re in the door. But you need to be an excellent programmer to get in the door (so they know they can extract some value from you as a servant, before you become an advisor).</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Smartphones, MP3 players, and Bluetooth: the division of labor by Somethink to Chew On &#187; Apple TV and cross-device user-interface integration</title>
		<link>http://www.harlan.harris.name/2010/03/smartphones-mp3-players-and-bluetooth-the-division-of-labor/comment-page-1/#comment-173</link>
		<dc:creator>Somethink to Chew On &#187; Apple TV and cross-device user-interface integration</dc:creator>
		<pubDate>Sun, 06 Nov 2011 20:57:13 +0000</pubDate>
		<guid isPermaLink="false">http://www.harlan.harris.name/?p=64#comment-173</guid>
		<description>[...] into a TV that would be actually compelling, though. And in some ways it&#8217;s the same way that I earlier blogged about for MP3 players. Cross-device user-interfaces. Here&#8217;s how it might work for a [...]</description>
		<content:encoded><![CDATA[<p>[...] into a TV that would be actually compelling, though. And in some ways it&#8217;s the same way that I earlier blogged about for MP3 players. Cross-device user-interfaces. Here&#8217;s how it might work for a [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on ggplot and concepts &#8212; what&#8217;s right, and what&#8217;s wrong by LC</title>
		<link>http://www.harlan.harris.name/2010/03/ggplot-and-concepts-whats-right-and-whats-wrong/comment-page-1/#comment-170</link>
		<dc:creator>LC</dc:creator>
		<pubDate>Thu, 27 Oct 2011 00:55:59 +0000</pubDate>
		<guid isPermaLink="false">http://www.harlan.harris.name/?p=47#comment-170</guid>
		<description>Very nice post. I do think that there are many appealing concepts and decisions introduced in ggplot, but I get burned every time I learn it because it&#039;s restrictive in the types of plots you can make with its native syntax (HW deliberately makes it difficult if not impossible to make types of plots which he disapproves, and these mostly include issues regarding axis addition or customizations of limits), and also because of the (lack of) speed. And you&#039;re quite right about the &quot;abuse&quot; of the operator! I couldn&#039;t quite put my finger on it but you are spot on in that it does break your intuitive understanding of what is advertised as a superposition of layers. Having complained about all this, HW&#039;s packages (reshape, plyr) are still the main reason why I use R, but in the end I still go back to lattice graphics for the plotting. It&#039;s warty, but fast and amenable for further customization than ggplot2 (and there&#039;s even a ggplot2like() themed settings available in latticeExtra).</description>
		<content:encoded><![CDATA[<p>Very nice post. I do think that there are many appealing concepts and decisions introduced in ggplot, but I get burned every time I learn it because it&#8217;s restrictive in the types of plots you can make with its native syntax (HW deliberately makes it difficult if not impossible to make types of plots which he disapproves, and these mostly include issues regarding axis addition or customizations of limits), and also because of the (lack of) speed. And you&#8217;re quite right about the &#8220;abuse&#8221; of the operator! I couldn&#8217;t quite put my finger on it but you are spot on in that it does break your intuitive understanding of what is advertised as a superposition of layers. Having complained about all this, HW&#8217;s packages (reshape, plyr) are still the main reason why I use R, but in the end I still go back to lattice graphics for the plotting. It&#8217;s warty, but fast and amenable for further customization than ggplot2 (and there&#8217;s even a ggplot2like() themed settings available in latticeExtra).</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Science, Moore&#8217;s Law, and Moneyball by getstats &#187; Getstats &#8211; Campaigning to make Britain better with numbers and statistics</title>
		<link>http://www.harlan.harris.name/2011/09/data-science-moores-law-and-moneyball/comment-page-1/#comment-141</link>
		<dc:creator>getstats &#187; Getstats &#8211; Campaigning to make Britain better with numbers and statistics</dc:creator>
		<pubDate>Wed, 05 Oct 2011 11:37:54 +0000</pubDate>
		<guid isPermaLink="false">http://www.harlan.harris.name/?p=149#comment-141</guid>
		<description>[...] and are statisticians data scientists? These are just two of the immediate questions that &#8216;Data Science, Moore&#8217;s Law and Moneyball&#8217; a recent piece on Harlan Harris&#8217;s blog, has raised for [...]</description>
		<content:encoded><![CDATA[<p>[...] and are statisticians data scientists? These are just two of the immediate questions that &#8216;Data Science, Moore&#8217;s Law and Moneyball&#8217; a recent piece on Harlan Harris&#8217;s blog, has raised for [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Science, Moore&#8217;s Law, and Moneyball by Mic</title>
		<link>http://www.harlan.harris.name/2011/09/data-science-moores-law-and-moneyball/comment-page-1/#comment-136</link>
		<dc:creator>Mic</dc:creator>
		<pubDate>Thu, 29 Sep 2011 20:43:29 +0000</pubDate>
		<guid isPermaLink="false">http://www.harlan.harris.name/?p=149#comment-136</guid>
		<description>I liked your post!  I&#039;m a physicist by training, but I&#039;ve been working in the &quot;data science&quot; and &quot;analytics&quot; area for going on 20 years now.  It&#039;s just never been called that before...

It&#039;s really coming into play now because lots of data is available, computing power is cheap, and smart people can get answers from the data quickly using these tools (and creating their own).  The marketplace is just now ready to recognize &quot;data science&quot; on its own terms.</description>
		<content:encoded><![CDATA[<p>I liked your post!  I&#8217;m a physicist by training, but I&#8217;ve been working in the &#8220;data science&#8221; and &#8220;analytics&#8221; area for going on 20 years now.  It&#8217;s just never been called that before&#8230;</p>
<p>It&#8217;s really coming into play now because lots of data is available, computing power is cheap, and smart people can get answers from the data quickly using these tools (and creating their own).  The marketplace is just now ready to recognize &#8220;data science&#8221; on its own terms.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Science, Moore&#8217;s Law, and Moneyball by Taylor</title>
		<link>http://www.harlan.harris.name/2011/09/data-science-moores-law-and-moneyball/comment-page-1/#comment-131</link>
		<dc:creator>Taylor</dc:creator>
		<pubDate>Thu, 29 Sep 2011 14:13:33 +0000</pubDate>
		<guid isPermaLink="false">http://www.harlan.harris.name/?p=149#comment-131</guid>
		<description>Great post, Harlan. I&#039;d always been drawn to the idea that a data scientist is a career defined by an eclectic path or skill set, instead of one set of tools and procedures.  However, I&#039;d never seen that definition put forth until your talk.  Point #2 was me, but it was a result of all the other great discussion from the DC meetup.</description>
		<content:encoded><![CDATA[<p>Great post, Harlan. I&#8217;d always been drawn to the idea that a data scientist is a career defined by an eclectic path or skill set, instead of one set of tools and procedures.  However, I&#8217;d never seen that definition put forth until your talk.  Point #2 was me, but it was a result of all the other great discussion from the DC meetup.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

