run scripts written in other languages (original) (raw)
It wouldn't actually be that difficult for MRJob to run scripts written in other languages if they implemented the MRJob protocol (--steps, --mapper, --reducer, and --step-num). Instead of prepending python to our command inside Hadoop streaming, we'd prepend ruby or java or (for shell scripts) nothing. We'd probably run them like:
python mrjob.job.MRJob --mr-job-script mr_perform_aweomeness.rb
or alternately:
mrjob mr_perform_awesomeness.rb
The main thing is, I'm not sure there's any demand for such a feature.
Tell you what, you write the base MRJob class in your favorite language and put it up on github, and I'll hook it up to mrjob for you. :)