Great post on data processing:

“Command-line tools can be 235x faster than your Hadoop cluster”

aadrake.com/command-line-t…

/via Hacker News