<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Cycles of Doom in Batch Processing Workflows</title>
	<atom:link href="http://blog.rapleaf.com/dev/2009/12/18/cycles-of-doom-in-batch-processing-workflows/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.rapleaf.com/dev/2009/12/18/cycles-of-doom-in-batch-processing-workflows/</link>
	<description>For engineers by engineers</description>
	<lastBuildDate>Mon, 30 Aug 2010 16:14:07 -0700</lastBuildDate>
	
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: James Brown</title>
		<link>http://blog.rapleaf.com/dev/2009/12/18/cycles-of-doom-in-batch-processing-workflows/comment-page-1/#comment-1473</link>
		<dc:creator>James Brown</dc:creator>
		<pubDate>Fri, 08 Jan 2010 19:46:59 +0000</pubDate>
		<guid isPermaLink="false">http://blog.rapleaf.com/dev/?p=340#comment-1473</guid>
		<description>@Jon:  LOL at the concept that batch processing time is only CPU bound in any given system.  In this context, adding disks does not mean I am only looking to increase storage.  

I will leave your misinterpretation of my 1982 comment as a homework exercise.</description>
		<content:encoded><![CDATA[<p>@Jon:  LOL at the concept that batch processing time is only CPU bound in any given system.  In this context, adding disks does not mean I am only looking to increase storage.  </p>
<p>I will leave your misinterpretation of my 1982 comment as a homework exercise.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jon</title>
		<link>http://blog.rapleaf.com/dev/2009/12/18/cycles-of-doom-in-batch-processing-workflows/comment-page-1/#comment-1471</link>
		<dc:creator>Jon</dc:creator>
		<pubDate>Thu, 07 Jan 2010 20:45:17 +0000</pubDate>
		<guid isPermaLink="false">http://blog.rapleaf.com/dev/?p=340#comment-1471</guid>
		<description>LOL at hadoop being 1982 or the concept that more disk space reduces processing time. When you design/architect a system and find that your &quot;go to solution&quot; is throwing hardware at it, you know you have a poor design.</description>
		<content:encoded><![CDATA[<p>LOL at hadoop being 1982 or the concept that more disk space reduces processing time. When you design/architect a system and find that your &#8220;go to solution&#8221; is throwing hardware at it, you know you have a poor design.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Trøll Åndersen</title>
		<link>http://blog.rapleaf.com/dev/2009/12/18/cycles-of-doom-in-batch-processing-workflows/comment-page-1/#comment-1466</link>
		<dc:creator>Trøll Åndersen</dc:creator>
		<pubDate>Wed, 06 Jan 2010 19:24:15 +0000</pubDate>
		<guid isPermaLink="false">http://blog.rapleaf.com/dev/?p=340#comment-1466</guid>
		<description>@James Brown In Norway&#039;s summer we salt your kind and leave you out in the sun for a few weeks. We are not amused by your joke involving integers and lunar cycles.

 @ Aaron Faen, why are you paying so much for disk? That sounds like the price we paid for back when I was managing the Iralalaru project in &#039;02.</description>
		<content:encoded><![CDATA[<p>@James Brown In Norway&#8217;s summer we salt your kind and leave you out in the sun for a few weeks. We are not amused by your joke involving integers and lunar cycles.</p>
<p> @ Aaron Faen, why are you paying so much for disk? That sounds like the price we paid for back when I was managing the Iralalaru project in &#8216;02.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: bryan</title>
		<link>http://blog.rapleaf.com/dev/2009/12/18/cycles-of-doom-in-batch-processing-workflows/comment-page-1/#comment-1461</link>
		<dc:creator>bryan</dc:creator>
		<pubDate>Mon, 04 Jan 2010 16:52:11 +0000</pubDate>
		<guid isPermaLink="false">http://blog.rapleaf.com/dev/?p=340#comment-1461</guid>
		<description>@James Brown: Disks and RAM may be cheap, but the machines to house them are not. Nor, more importantly, are the cabinets, cooling, or power. Blithely scaling up (or out) is not always the solution, no matter what all the cloud folks would like us to believe.

@Aaron Anderson: In our application, we&#039;re not talking about a mainframe, so we don&#039;t have to buy enterprise disks. We have a big cluster of &quot;commodity&quot; machines that use standard SATA disks, so adding 2TB is not nearly the $200k investment. However, having an extra 2TB of storage is pretty much never the problem we need to solve.</description>
		<content:encoded><![CDATA[<p>@James Brown: Disks and RAM may be cheap, but the machines to house them are not. Nor, more importantly, are the cabinets, cooling, or power. Blithely scaling up (or out) is not always the solution, no matter what all the cloud folks would like us to believe.</p>
<p>@Aaron Anderson: In our application, we&#8217;re not talking about a mainframe, so we don&#8217;t have to buy enterprise disks. We have a big cluster of &#8220;commodity&#8221; machines that use standard SATA disks, so adding 2TB is not nearly the $200k investment. However, having an extra 2TB of storage is pretty much never the problem we need to solve.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Aaron Anderson</title>
		<link>http://blog.rapleaf.com/dev/2009/12/18/cycles-of-doom-in-batch-processing-workflows/comment-page-1/#comment-1460</link>
		<dc:creator>Aaron Anderson</dc:creator>
		<pubDate>Mon, 04 Jan 2010 02:37:04 +0000</pubDate>
		<guid isPermaLink="false">http://blog.rapleaf.com/dev/?p=340#comment-1460</guid>
		<description>Enterprise disk is NOT cheap. I really hate that people think that. Even IT managers say that &quot;disk is cheap, throw some in the mainframe... I&#039;ve got a 1.5TB drive on my desk that cost me $150 dollars. &quot; When mainframe storage for an IBM series might cost upwards of 200,000 dollars for a disk pack to hold 2TB.

This article is so 2010.</description>
		<content:encoded><![CDATA[<p>Enterprise disk is NOT cheap. I really hate that people think that. Even IT managers say that &#8220;disk is cheap, throw some in the mainframe&#8230; I&#8217;ve got a 1.5TB drive on my desk that cost me $150 dollars. &#8221; When mainframe storage for an IBM series might cost upwards of 200,000 dollars for a disk pack to hold 2TB.</p>
<p>This article is so 2010.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: James Brown</title>
		<link>http://blog.rapleaf.com/dev/2009/12/18/cycles-of-doom-in-batch-processing-workflows/comment-page-1/#comment-1455</link>
		<dc:creator>James Brown</dc:creator>
		<pubDate>Wed, 30 Dec 2009 17:56:18 +0000</pubDate>
		<guid isPermaLink="false">http://blog.rapleaf.com/dev/?p=340#comment-1455</guid>
		<description>Why should I waste my time with math when I can keep adding cheap disks and ram?

This article is so 1982.</description>
		<content:encoded><![CDATA[<p>Why should I waste my time with math when I can keep adding cheap disks and ram?</p>
<p>This article is so 1982.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: The mathematics behind Hadoop-based systems &#8212; thoughts from the red planet</title>
		<link>http://blog.rapleaf.com/dev/2009/12/18/cycles-of-doom-in-batch-processing-workflows/comment-page-1/#comment-1452</link>
		<dc:creator>The mathematics behind Hadoop-based systems &#8212; thoughts from the red planet</dc:creator>
		<pubDate>Tue, 29 Dec 2009 05:35:13 +0000</pubDate>
		<guid isPermaLink="false">http://blog.rapleaf.com/dev/?p=340#comment-1452</guid>
		<description>[...] originally wrote about this effect here. I hope you find this writeup to be clearer and more extensive. [...]</description>
		<content:encoded><![CDATA[<p>[...] originally wrote about this effect here. I hope you find this writeup to be clearer and more extensive. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: markI</title>
		<link>http://blog.rapleaf.com/dev/2009/12/18/cycles-of-doom-in-batch-processing-workflows/comment-page-1/#comment-1447</link>
		<dc:creator>markI</dc:creator>
		<pubDate>Thu, 24 Dec 2009 04:55:04 +0000</pubDate>
		<guid isPermaLink="false">http://blog.rapleaf.com/dev/?p=340#comment-1447</guid>
		<description>huuu... If you increase the number of machines and diminish the size of the blocks, theoretically you can reduce a lot of overhead, no? Then again if real time is your goal a finite state machine approach might become more scalable, but that means redo your architecture without Hadoop! How much data are we talking about? 100Tb? 1Pb?</description>
		<content:encoded><![CDATA[<p>huuu&#8230; If you increase the number of machines and diminish the size of the blocks, theoretically you can reduce a lot of overhead, no? Then again if real time is your goal a finite state machine approach might become more scalable, but that means redo your architecture without Hadoop! How much data are we talking about? 100Tb? 1Pb?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nathan Marz</title>
		<link>http://blog.rapleaf.com/dev/2009/12/18/cycles-of-doom-in-batch-processing-workflows/comment-page-1/#comment-1439</link>
		<dc:creator>Nathan Marz</dc:creator>
		<pubDate>Fri, 18 Dec 2009 21:28:47 +0000</pubDate>
		<guid isPermaLink="false">http://blog.rapleaf.com/dev/?p=340#comment-1439</guid>
		<description>@MikeD: It depends on how scalable your workflow is. If your processors contend on shared resources, then you will have limited ability to reduce that &quot;P&quot; parameter by increasing the number of processors.</description>
		<content:encoded><![CDATA[<p>@MikeD: It depends on how scalable your workflow is. If your processors contend on shared resources, then you will have limited ability to reduce that &#8220;P&#8221; parameter by increasing the number of processors.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: MikeD</title>
		<link>http://blog.rapleaf.com/dev/2009/12/18/cycles-of-doom-in-batch-processing-workflows/comment-page-1/#comment-1438</link>
		<dc:creator>MikeD</dc:creator>
		<pubDate>Fri, 18 Dec 2009 19:38:19 +0000</pubDate>
		<guid isPermaLink="false">http://blog.rapleaf.com/dev/?p=340#comment-1438</guid>
		<description>How is this model affected by elastic computing (spinning up more processing machines)? Is that not helpful since bottlenecks may be storage services?</description>
		<content:encoded><![CDATA[<p>How is this model affected by elastic computing (spinning up more processing machines)? Is that not helpful since bottlenecks may be storage services?</p>
]]></content:encoded>
	</item>
</channel>
</rss>
