tomasino shared this article last night on irc and i found it to be very useful, so i’m posting it here.
awk is an extremely underutilized tool these days, it seems. Combined with perl, you can do some pretty fantastic records processing on a single machine much faster than you can on an ELKS or Hadoop cluster.
add a little bit of sed too and yeah, arcane arts of mystical processing magic! ;P
(sed can totally be replaced by perl stuff, but still…)