HN Mirror


by technomancy 6185 days ago

> The idea of iterating with Clojure in a REPL on a small dataset to develop or refine an algorithm, then pressing a button and see how it does running on some large dataset on EC2, sounds sublime.

Unfortunately, it usually works out to spending a few hours iterating on the functions in the REPL until they work great, and then spending a week battling Hadoop to make the job actually run the way you intended. (No exaggeration here.)

Though we've been working with raw Hadoop; it sounds like Cascading makes it much less painful. We'll see how that goes.

1 comment

bradfordcross 6184 days ago

True dat. Spoken like a true clown zen master. It wouldn't be clowncomputing if we didn't have red rubber noses and funny rainbow hair.
