When you deploy an emr cluster in your ryo private vpc, check...

dns

Rtfm and Rtfb properly next time

In EMR, the ability for machines to find and talk to each other is provided by the DNS resolution and the DNS hostname VPC settings.

Failure to do this on your vpc

will result in lots of gnashing of teeth and lots of everything your mom told you she would wash your mouth out for - aka

At 20m a pop I was quickly approaching my 10,000 hour mastery of the

  • maybe if I change the version of emr,
  • it must be a frigging bug in emr,
  • maybe its the combo of applications I chose,
  • I am going to roll my own bloody hadoop on ec2

fud loop