Hadoop is an open-source, Java-based programming framework that supports the processing and storage of extremely large data sets in a distributed