/big-data-hadoop-spark

Assignment for the UoM "Big Data" course

Primary language: Java

Big-Data

Hadoop MapReduce program (Homework 1)

The output file contains the users who viewed a file on more than one date.
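
The selection rule can be illustrated without any Hadoop dependencies. The following is only a minimal sketch, not the assignment's actual mapper/reducer: it assumes a "user" is identified by the `ip` field and that the dates are counted per (ip, filename) pair. The class name `MultiDateViewers` and the method `usersWithRepeatViews` are hypothetical.

```java
import java.util.*;

class MultiDateViewers {
    /**
     * Each record is {ip, date, filename}.
     * Returns the IPs that viewed the same file on at least two distinct dates.
     * Assumption: users are identified by IP and grouped per (ip, filename).
     */
    static Set<String> usersWithRepeatViews(List<String[]> records) {
        // Group the distinct dates under a composite "ip<TAB>filename" key,
        // similar to what a MapReduce shuffle would do with that key.
        Map<String, Set<String>> datesByUserFile = new HashMap<>();
        for (String[] r : records) {
            String key = r[0] + "\t" + r[2];
            datesByUserFile.computeIfAbsent(key, k -> new HashSet<>()).add(r[1]);
        }

        // Keep only the users whose (ip, filename) group has more than one date.
        Set<String> result = new TreeSet<>();
        for (Map.Entry<String, Set<String>> e : datesByUserFile.entrySet()) {
            if (e.getValue().size() > 1) {
                String key = e.getKey();
                result.add(key.substring(0, key.indexOf('\t')));
            }
        }
        return result;
    }
}
```

In a real MapReduce job the grouping step would be done by the framework, with the mapper emitting the composite key and the reducer counting distinct dates.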

Input CSV file structure

ip,date,time,zone,cik,accession,extention,code,size,idx,norefer,noagent,find,crawler,browser

The filename is either the extention field on its own or accession + extention.
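
The filename rule could be applied to a single input line roughly as follows. This is a sketch under a stated assumption: when the extention field starts with a dot (for example `.txt`) it is treated as a bare extension and the accession is prepended, otherwise the field is taken as the full filename. The README does not spell out this test, so it may need adjusting for other cases. The class name `LogRecord` and the method `fileName` are hypothetical; the column positions follow the header listed above.

```java
class LogRecord {
    // Column positions, taken from the header line:
    // ip,date,time,zone,cik,accession,extention,...
    static final int IP = 0;
    static final int DATE = 1;
    static final int ACCESSION = 5;
    static final int EXTENTION = 6;

    /**
     * Derives the viewed filename from one CSV line.
     * Assumption: a value starting with "." is a bare extension.
     */
    static String fileName(String csvLine) {
        // The -1 limit keeps trailing empty fields instead of dropping them.
        String[] fields = csvLine.split(",", -1);
        String accession = fields[ACCESSION];
        String extention = fields[EXTENTION];
        return extention.startsWith(".") ? accession + extention : extention;
    }
}
```

The header line itself would have to be skipped before calling `fileName`, since it contains column names rather than data.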