Looking at putting a class together regarding Hadooop, the distributed computing framework that revolves around MapReduce. Tons of companies are investing lots of time and money and energy into its use and it would be great if we could build some simple labs and some simple code around working with some big datasets and bending some brain around the framework and developing in the Map/Reduce programming style.
The hello world of MapReduce is a word-counting type application, wherein you map all words found in a dataset and then count them and list your findings. Again, this is MapReduce in one of its simplest forms, but with a little ingenuity, I can see these technologies being leverages for financial analysis, GIS dataset analysis and whatever else our imaginations can contrive.
I propose developing a class around building a Hadoop workflow because hey, the best way to learn something is to teach it to others.
Ideas are appreciated, drop in on the irc channel #hackrva on freenode or send a shout-out to the mailing list and let's discuss this a little further.