Challenge 1

Tools needed

Ruby (Any version will probably work, but I am on 2.3.1)

How to run

In terminal, navigate to "part1" directory
Run command "ruby count_data.rb input.txt" without quotation marks. (Replace input.txt with your desired txt file)

Scenario:

You’re given an input file. Each line consists of a timestamp (unix epoch in seconds) and a url separated by ‘|’ (pipe operator). The entries are not in any chronological order. Your task is to produce a daily summarized report on url hit count, organized daily (mm/dd/yyyy GMT) with the earliest date appearing first. For each day, you should display the number of times each url is visited in the order of highest hit count to lowest count. Your program should take in one command line argument: input file name. The output should be printed to stdout. You can assume that the cardinality (i.e. number of distinct values) of hit count values and the number of days are much smaller than the number of unique URLs. You may also assume that number of unique URLs can fit in memory, but not necessarily the entire file.

input.txt

Output

08/08/2014 GMT
www.facebook.com 2
www.google.com 2
news.ycombinator.com 1

08/09/2014 GMT
www.nba.com 3
sports.yahoo.com 2
www.cnn.com 1

08/10/2014 GMT
www.twitter.com 1

Nenry/c1

Challenge 1

Tools needed

How to run

Scenario: