sprawl

How is your favorite topic doing around the world?

As discussed, there are two parts to the problem statement:

1. Analysis of the near-real-time data stream
2. Analysis of the historical data

The common aspect of both would be the ETL operations.
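One way to picture the shared ETL aspect is a single transform function that both the historical (batch) path and the near-real-time (stream) path call. The sketch below is only an illustration under assumptions: the record fields `topic` and `country`, and the function names `extract_transform`, `analyse_historical`, and `analyse_stream`, are hypothetical and not taken from the project.

```python
from collections import Counter
from typing import Iterable, Iterator, Optional


def extract_transform(record: dict) -> Optional[dict]:
    """Shared ETL step (hypothetical): normalise one raw record, or drop it."""
    # Field names "topic" and "country" are assumptions for illustration.
    topic = str(record.get("topic", "")).strip().lower()
    country = str(record.get("country", "")).strip().upper()
    if not topic or not country:
        return None  # unusable record, skip it
    return {"topic": topic, "country": country}


def analyse_historical(records: Iterable[dict]) -> Counter:
    """Batch path: count (topic, country) pairs over a complete data set."""
    counts: Counter = Counter()
    for raw in records:
        row = extract_transform(raw)
        if row is not None:
            counts[(row["topic"], row["country"])] += 1
    return counts


def analyse_stream(records: Iterable[dict]) -> Iterator[Counter]:
    """Stream path: yield a snapshot of the running counts after each valid record."""
    counts: Counter = Counter()
    for raw in records:
        row = extract_transform(raw)
        if row is not None:
            counts[(row["topic"], row["country"])] += 1
            yield counts.copy()
```

Because both paths reuse `extract_transform`, the final snapshot from the stream path matches the batch result over the same records, which makes it easier to check one path against the other.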