I want you to use the dataset I provide inside the given project and write a MapReduce program for Apache Hadoop, in Java, that calculates the data cube for the dimensions Trn_type, CityName_Customer and CityName_branch, with the measures sum(amount) and avg(amount).
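To make the requirement concrete: a cube over three dimensions contains 2^3 = 8 Group Bys, one per subset of the dimensions (including the empty subset, i.e. the grand total). The plain-Java sketch below only illustrates the map-side and reduce-side logic without any Hadoop classes. The class name `CubeSketch`, the key layout `mask|values`, the column order of the sample rows and the sample values themselves are all assumptions made up for illustration, not part of the provided dataset or project. Note that avg(amount) is carried as a (sum, count) pair, since averages of partial averages would be wrong if a combiner is used.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

class CubeSketch {
    // Bit i of the mask selects dimension i (assumed order).
    static final String[] DIMS = {"Trn_type", "CityName_Customer", "CityName_branch"};

    // "Map" step: emit one key per subset of the dimensions, e.g. "5|C,Beijing".
    static List<String> cubeKeys(String[] dimValues) {
        List<String> keys = new ArrayList<>();
        for (int mask = 0; mask < (1 << dimValues.length); mask++) {
            StringBuilder sb = new StringBuilder().append(mask).append('|');
            boolean first = true;
            for (int i = 0; i < dimValues.length; i++) {
                if ((mask & (1 << i)) != 0) {
                    if (!first) sb.append(',');
                    sb.append(dimValues[i]);
                    first = false;
                }
            }
            keys.add(sb.toString());
        }
        return keys;
    }

    // "Reduce" step: accumulate {sum, count} per key; avg is derived at the end.
    // Assumed row layout: Trn_type, CityName_Customer, CityName_branch, amount.
    static Map<String, double[]> aggregate(List<String[]> rows) {
        Map<String, double[]> acc = new TreeMap<>();
        for (String[] row : rows) {
            double amount = Double.parseDouble(row[3]);
            String[] dims = {row[0], row[1], row[2]};
            for (String key : cubeKeys(dims)) {
                double[] sumCount = acc.computeIfAbsent(key, k -> new double[2]);
                sumCount[0] += amount;
                sumCount[1] += 1;
            }
        }
        return acc;
    }

    static double avg(double[] sumCount) {
        return sumCount[0] / sumCount[1];
    }

    public static void main(String[] args) {
        // Made-up sample rows, not taken from the real dataset.
        List<String[]> rows = Arrays.asList(
            new String[]{"C", "Shanghai", "Beijing", "100.0"},
            new String[]{"C", "Beijing", "Beijing", "300.0"},
            new String[]{"D", "Shanghai", "Tianjin", "50.0"});
        for (Map.Entry<String, double[]> e : aggregate(rows).entrySet()) {
            System.out.println(e.getKey() + " sum=" + e.getValue()[0] + " avg=" + avg(e.getValue()));
        }
    }
}
```

In an actual Hadoop job the same idea would typically be split between a Mapper that emits the eight keys per input record and a Reducer that sums the (sum, count) pairs; whether this is done in one job or several is left open by the brief.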
To solve the problem you may develop more than one MapReduce job. For every Group By contained in the requested cube, you should create an output file using MultipleOutputs. For example, the output file for GroupBy(Trn_type, CityName_branch) will be named "trntypeCitynamebranch" and its contents will look like this:
#trntype,citynamebranch,sum,avg
C,Beijing,251849454883.729,40625.8007838899
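The naming and line format shown above could be produced by small helpers such as the ones sketched here. This is only one reading of the example: the rule "drop underscores, lowercase everything, capitalise the first letter of every dimension after the first" is inferred from the single name "trntypeCitynamebranch", and the name `all` for the empty Group By is purely an assumption, since the brief does not specify it. The class name `CubeOutputNames` and its methods are hypothetical.

```java
import java.math.BigDecimal;
import java.util.Locale;

class CubeOutputNames {
    // Flatten a dimension name: "CityName_branch" -> "citynamebranch".
    static String flat(String dim) {
        return dim.replace("_", "").toLowerCase(Locale.ROOT);
    }

    // File name for a Group By, e.g. "trntypeCitynamebranch".
    static String fileName(String... dims) {
        if (dims.length == 0) return "all"; // assumption: not specified in the brief
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < dims.length; i++) {
            String f = flat(dims[i]);
            if (i > 0) f = Character.toUpperCase(f.charAt(0)) + f.substring(1);
            sb.append(f);
        }
        return sb.toString();
    }

    // Header line, e.g. "#trntype,citynamebranch,sum,avg".
    static String header(String... dims) {
        StringBuilder sb = new StringBuilder("#");
        for (String d : dims) sb.append(flat(d)).append(',');
        return sb.append("sum,avg").toString();
    }

    // Data line; toPlainString avoids scientific notation for large sums.
    static String line(double sum, long count, String... values) {
        StringBuilder sb = new StringBuilder();
        for (String v : values) sb.append(v).append(',');
        sb.append(BigDecimal.valueOf(sum).toPlainString()).append(',');
        sb.append(BigDecimal.valueOf(sum / count).toPlainString());
        return sb.toString();
    }

    public static void main(String[] args) {
        System.out.println(fileName("Trn_type", "CityName_branch"));
        System.out.println(header("Trn_type", "CityName_branch"));
        System.out.println(line(400.0, 2, "C", "Beijing")); // made-up values
    }
}
```

In the Hadoop job itself, each such name would presumably be registered with `MultipleOutputs.addNamedOutput(...)` in the driver and used in the reducer through `MultipleOutputs.write(namedOutput, key, value)`. Named outputs in MultipleOutputs may contain only letters and digits, which the names generated here satisfy.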
----The project should be developed in the Maven project that I will provide after personal communication----
Hello,
How's it going? Regarding your project, we can build this in a short period of time. HIRE ME and we will work on it RIGHT away. :)
Cheers,
Joey Cruz