Hadoop demos
A. MsWordCount This project will allow the user to load up a MS word .doc file into HDFS and then use the same for computing the word count example. The challenges taken up are -
- Word document need separate record reader using Apache POI library from http://www.apache.org/dyn/closer.cgi/poi/release/bin/poi-bin-3.11-20141221.zip