I need you to write a MapReduce program in Java for Apache Hadoop, using the dataset I provide inside the given project, that calculates the data cube over the dimensions Trn_type, CityName_Customer and CityName_branch with the measures sum(amount) and avg(amount).
You may develop more than one MapReduce job to solve the problem. For every group-by contained in the requested cube, you should create a separate output file using MultipleOutputs. For example, the output file for GroupBy(Trn_type,CityName_branch) will be named "trntypeCitynamebranch" and its contents will look like this:
#trntype,citynamebranch,sum,avg
C,Beijing,251849454883.729,40625.8007838899
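To make the requirement concrete, here is a minimal plain-Java sketch of the cube logic, without the Hadoop classes. It assumes each record has already been parsed into (trntype, citynamecustomer, citynamebranch, amount). The class name CubeSketch, the bitmask key prefix, the sample rows and the output name "all" for the empty group-by are illustrative assumptions, not part of the project. In a real job, groupingKeys would correspond roughly to the mapper's emits, aggregate to the reducer, and outputName to the name passed to MultipleOutputs. Because an average cannot be combined from partial averages, the sketch carries (sum, count) and divides only at the end.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

class CubeSketch {
    // Dimension labels in record order (naming follows the "trntypeCitynamebranch" example).
    static final String[] DIMS = {"trntype", "citynamecustomer", "citynamebranch"};

    // Map-side idea: emit one key per subset of the dimensions (2^3 = 8 group-bys).
    // The leading bitmask says which dimensions are part of the key.
    static List<String> groupingKeys(String[] values) {
        List<String> keys = new ArrayList<>();
        for (int mask = 0; mask < (1 << values.length); mask++) {
            StringBuilder key = new StringBuilder().append(mask);
            for (int i = 0; i < values.length; i++) {
                if ((mask & (1 << i)) != 0) {
                    key.append(',').append(values[i]);
                }
            }
            keys.add(key.toString());
        }
        return keys;
    }

    // Reduce-side idea: keep {sum, count} per key; avg is sum / count at output time.
    static Map<String, double[]> aggregate(List<String[]> rows) {
        Map<String, double[]> acc = new TreeMap<>();
        for (String[] row : rows) {
            String[] dims = {row[0], row[1], row[2]};
            double amount = Double.parseDouble(row[3]);
            for (String key : groupingKeys(dims)) {
                double[] sumCount = acc.computeIfAbsent(key, k -> new double[2]);
                sumCount[0] += amount;
                sumCount[1] += 1;
            }
        }
        return acc;
    }

    // Output file name for a group-by; "all" for the empty group-by is an assumption.
    static String outputName(int mask) {
        StringBuilder name = new StringBuilder();
        for (int i = 0; i < DIMS.length; i++) {
            if ((mask & (1 << i)) == 0) {
                continue;
            }
            String d = DIMS[i];
            name.append(name.length() == 0 ? d : Character.toUpperCase(d.charAt(0)) + d.substring(1));
        }
        return name.length() == 0 ? "all" : name.toString();
    }

    public static void main(String[] args) {
        // Made-up sample rows, only for illustration.
        List<String[]> rows = Arrays.asList(
            new String[]{"C", "Beijing", "Shanghai", "100.0"},
            new String[]{"C", "Athens", "Shanghai", "300.0"},
            new String[]{"D", "Beijing", "Beijing", "50.0"});
        for (Map.Entry<String, double[]> e : aggregate(rows).entrySet()) {
            String[] parts = e.getKey().split(",", 2);
            String dims = parts.length > 1 ? parts[1] + "," : "";
            double sum = e.getValue()[0];
            double avg = sum / e.getValue()[1];
            System.out.println(outputName(Integer.parseInt(parts[0])) + ": " + dims + sum + "," + avg);
        }
    }
}
```

Emitting all eight keys from a single mapper keeps the solution to one job at the cost of eight times the shuffle volume; splitting the group-bys across several jobs is the alternative the description allows.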
----The project should be developed in the Maven project that I will provide after personal communication----
Hi, I am Amit. I have the required experience in Hadoop MapReduce and Java to write the MapReduce program you need. Please provide me with your dataset so that I can analyse it and confirm the time required. I can integrate the code into the Maven project you provide.
Looking forward to working on this.
Regards,
Amit Kumar
I have around 4 years of experience at a product-based big data company. My expertise is in Hadoop MapReduce. I have implemented many data science algorithms using MapReduce on very large datasets.
I have worked on Hadoop in 3 different projects (Twitter sentiment analysis, CDR analysis, DNA sequence analysis).
All projects were developed in Java.
I am currently not working, so I can complete it in 4 days.
Thank you