/ProjectWyze

The Real-time Big Data Unframework since 2007, when the author didn't know coding.

Primary LanguageCSSOtherNOASSERTION

Project Wyze

Project Wyze is the Segway of real-time data processing with the power of a Ferrari. It lets you focus on what you want to do with the data itself, and forget about scheduling, managing the stream and handling failures. Write minimal code and enjoy instant gratification with a convenience you could only find on custom-built proprietary systems, while having freedom of customizing all you want.

Named after Principality of Wy, Project Wyze is an intuitive un-framework. Stream in the data, and you can both process the current data and look back with one call (coming soon). Design your own visualizations and playback your analytics on a timeline like a YouTube livestream (coming soon). It is also a polyglot supporting both AMQP and MQTT endpoints.

Project Wyze is designed for everyone doing Internet of Things, Remote Monitoring, Quantitative Trading, and everything involving Streaming Analytics in general.

Credits

Project Wyze is a collective effort of Jimmy He (@streamliner18) and Sasan Erfan (@sasma) at the University of Illinois at Urbana-Champaign.

Project Wyze is powered by RabbitMQ, MongoDB and Redis. The architecture draws inspirations from Apache Storm, Apache Spark, and is partially derived from Jimmy He's patent (CN 104239505 B) as well as Ricequant's Open Source RQAlpha financial backtest engine.

Also, special thanks to these friends for support and inspirations:

  • (Ethan) Luo Yicheng (@ethanluoyc) M.Eng CS, Imperial College London
  • Professor Christopher D. Schmitz (@cdschmit), ECE @ University of Illinois at Urbana-Champaign
  • Yichi Zhang B.E. Financial Engineering, Wuhan University
  • Bo Ma former Tech Coach, Baidu Inc

Plug-n-Play

Deploying Project Wyze for your day-to-day amusement is simple. Download the latest release and just run:

docker-compose up

It opens up a MQTT endpoint at 1883, a RabbitMQ console at 15672, a RabbitMQ endpoint at 5672, an output streaming endpoint at 5080, and a management console at 5081.

Under each component's folder, you may also find a utils section, where we provide useful tools that we use to test individual components, run sample workloads, and cut corners in our workflow (yes!), so that you can do so as well (You're welcome. <3)

Refer to the Deployment Guide for instructions on deploying a modified codebase or in a production environment. Trust me, that's way simpler than you think it is.

Architecture

Project Wyze is a collection of Docker containers orchestrated via docker-compose and coordinated with a central RabbitMQ broker, which doubles as a MQTT broker. It is intended and encouraged that different parts be bypassed or replaced to suit a particular use case as long as the pipeline is integral.

Wyze is a three-tier system. The Ingress Tier handles data ingestion and preprocessing, as well as demuxing MQTT messages to offload certain processing from low-power IoT appliances. The Data Tier stores and processes the data per the configurations. Processed data remains its streaming properties by residing both in a playback-able Kafka queue and a RabbitMQ queue for more real-time streaming performance. Finally, the Egress Tier handles client-side subscription of real-time streaming data and serves data back in the timeline according to playback requests. Independent of the three tiers but also a part of the system, a Management Console is provided for configuring the system and coding up processing directives in Python scripts. For detailed information about each component, refer to the development guide.

Contributing to Project Wyze

Contribution Guide is coming soon. We highly recommend reading through the guide before making a contribution, as improperly constructed commits and/or pull requests will be pitifully rejected.

We also welcome bug reports and feature suggestions from the Issues section.