run scripts written in other languages (original) (raw)

It wouldn't actually be that difficult for MRJob to run scripts written in other languages if they implemented the MRJob protocol (--steps, --mapper, --reducer, and --step-num). Instead of prepending python to our command inside Hadoop streaming, we'd prepend ruby or java or (for shell scripts) nothing. We'd probably run them like:

python mrjob.job.MRJob --mr-job-script mr_perform_aweomeness.rb

or alternately:

mrjob mr_perform_awesomeness.rb

The main thing is, I'm not sure there's any demand for such a feature.

Tell you what, you write the base MRJob class in your favorite language and put it up on github, and I'll hook it up to mrjob for you. :)