SimpleStorm

Simple Storm-like distributed application implementation for Node.js.

Installation

Via npm on Node:

npm install simplestorm

Reference in your program:

var ss = require('simplestorm');

Usage

You have Spouts (message sources) and Bolts (message processors). A Spout should have a start function that receives a context:

function Spout() {    
    this.start = function(context) {
        // ...
        context.emit(msg); // you can emit a message many times
        // ....
    }
}
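As an illustration, here is a minimal sketch of a concrete spout, exercised without the library. `UrlSpout`, `collected` and `fakeContext` are hypothetical names; the only thing taken from the text above is that `start` receives a context with an `emit` method.

```javascript
// Hypothetical spout: emits one message per URL it was given.
function UrlSpout(urls) {
    this.start = function (context) {
        urls.forEach(function (url) {
            context.emit({ url: url });
        });
    };
}

// Stand-in context (an assumption, not the library's own object):
// it only records what was emitted.
var collected = [];
var fakeContext = {
    emit: function (msg) { collected.push(msg); }
};

new UrlSpout(['http://example.com/a', 'http://example.com/b']).start(fakeContext);
console.log(collected.length); // prints 2
```

Passing a stand-in context like this is also a convenient way to unit-test a spout before wiring it into a topology.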

A spout emits messages via its context, in any of its methods.

A Bolt has a process method:

function MyBolt() {
    this.process = function(msg, context) {
        // Message process
        // and emit new message(s)
        context.emit(newmsg);
    }
}
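A concrete bolt might look like the following sketch. `UpperCaseBolt`, the `text` field and `stubContext` are hypothetical and chosen only for illustration; the `process(msg, context)` shape is the one described above.

```javascript
// Hypothetical bolt: upper-cases the text of each message and skips empty ones.
function UpperCaseBolt() {
    this.process = function (msg, context) {
        if (!msg || !msg.text) {
            return; // nothing to emit for empty messages
        }
        context.emit({ text: msg.text.toUpperCase() });
    };
}

// Stand-in context (an assumption): records emitted messages.
var emitted = [];
var stubContext = { emit: function (m) { emitted.push(m); } };

var bolt = new UpperCaseBolt();
bolt.process({ text: 'hello' }, stubContext);
bolt.process({}, stubContext); // ignored, emits nothing
console.log(emitted); // [ { text: 'HELLO' } ]
```

A bolt may emit zero, one or several messages per input, so filtering and fan-out both fit the same `process` method.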

There is a topology builder:

// Objects

var spout = new Spout();
var downloader = new Downloader();
var resolver = new Resolver();
var harvester = new Harvester();

// Setting Builder

var builder = ss.createTopologyBuilder();

builder.setSpout("spout", spout);
builder.setBolt("downloader", downloader).shuffleGrouping("resolver").shuffleGrouping("spout");
builder.setBolt("resolver", resolver).shuffleGrouping("harvester");
builder.setBolt("harvester", harvester).shuffleGrouping("downloader");

var topology = builder.createTopology();
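Note that the topology above contains a cycle: spout -> downloader -> harvester -> resolver -> downloader. To make the message flow concrete, the sketch below models that wiring with a tiny in-memory router. It is a conceptual model only, not SimpleStorm's implementation: `MiniTopology`, its `add` and `run` methods, and the message fields are all hypothetical, and each component has a single instance, so no real shuffling takes place.

```javascript
// Conceptual model only: NOT SimpleStorm's implementation.
// subscriptions[source] lists the components that receive what `source` emits.
function MiniTopology() {
    var components = {};
    var subscriptions = {};
    var queue = [];

    this.add = function (name, component, sources) {
        components[name] = component;
        (sources || []).forEach(function (source) {
            (subscriptions[source] = subscriptions[source] || []).push(name);
        });
    };

    // Context handed to a component: emitting enqueues the message
    // for every subscriber of that component.
    function contextFor(name) {
        return {
            emit: function (msg) {
                (subscriptions[name] || []).forEach(function (target) {
                    queue.push({ target: target, msg: msg });
                });
            }
        };
    }

    // Start the spout, then drain the queue until no messages remain.
    this.run = function (spoutName) {
        components[spoutName].start(contextFor(spoutName));
        while (queue.length > 0) {
            var item = queue.shift();
            components[item.target].process(item.msg, contextFor(item.target));
        }
    };
}

// Hypothetical components mirroring the names used above.
var trace = [];
var mini = new MiniTopology();
mini.add('spout', {
    start: function (ctx) { ctx.emit({ url: 'a', depth: 0 }); }
});
mini.add('downloader', {
    process: function (msg, ctx) { trace.push('downloader:' + msg.url); ctx.emit(msg); }
}, ['resolver', 'spout']);
mini.add('harvester', {
    process: function (msg, ctx) {
        trace.push('harvester:' + msg.url);
        // Stop following links after one level so the cycle terminates.
        if (msg.depth < 1) ctx.emit({ url: msg.url + '/link', depth: msg.depth + 1 });
    }
}, ['downloader']);
mini.add('resolver', {
    process: function (msg, ctx) { trace.push('resolver:' + msg.url); ctx.emit(msg); }
}, ['harvester']);

mini.run('spout');
console.log(trace.join(' -> '));
```

Because the graph is cyclic, at least one component has to stop emitting at some point (here the harvester's depth check); otherwise messages would circulate forever.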

Start a topology:

topology.start();

Stop a topology (WIP):

topology.stop();

A topology worker can listen for external messages sent from other topology workers:

topology.listen(port);

If there are many topology workers, they can share their addresses using a central server. In one process, create a topology server:

var tserver = ss.createTopologyServer();
tserver.listen(port);

In each topology worker, connect to the topology server:

topology.connectToServer(port, host);

When a topology worker connects to the central server, it sends its address. The server shares that address with the rest of the topology workers. See Distributed Web Crawler with Server/Workers for a running example.
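The address-sharing behaviour described above can be modelled in memory as follows. This is a conceptual sketch only: the real server communicates over the network, and `AddressRegistry`, `makeWorker`, `register` and `onPeer` are hypothetical names, not part of the SimpleStorm API. The sketch implements only what the text states, namely that a newly connected worker's address is forwarded to the workers already connected.

```javascript
// Conceptual model only: not the library's server or wire protocol.
function AddressRegistry() {
    var workers = [];

    // `worker` is a hypothetical object: { address: { host, port }, onPeer: fn }
    this.register = function (worker) {
        // Share the newcomer's address with every worker already registered.
        workers.forEach(function (other) {
            other.onPeer(worker.address);
        });
        workers.push(worker);
    };
}

// Hypothetical worker that remembers the peer addresses it is told about.
function makeWorker(host, port) {
    var worker = {
        address: { host: host, port: port },
        peers: [],
        onPeer: function (address) { worker.peers.push(address); }
    };
    return worker;
}

var registry = new AddressRegistry();
var w1 = makeWorker('localhost', 3000);
var w2 = makeWorker('localhost', 3001);
var w3 = makeWorker('localhost', 3002);

registry.register(w1);
registry.register(w2);
registry.register(w3);

console.log(w1.peers.length, w2.peers.length, w3.peers.length); // prints 2 1 0
```

In this model the last worker to connect knows no peers yet; whether the actual server also sends existing addresses to a newcomer is not stated here.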

Development

git clone git://github.com/ajlopez/SimpleStorm.git
cd SimpleStorm
npm install
npm test

Samples

Versions

  • 0.0.1: Published.
  • 0.0.2: Published. Using SimpleQueue 0.0.2 for Distributed Web Crawler.
  • 0.0.3: Published. Major internal/external refactor. Spout.start, Bolt.process. Topology.listen. Topology Server. Task message exchange between running worker processes (in different nodes). Workers sample.
  • 0.0.4: Published. Using SimpleMessages 0.0.6

To Do

  • Implement local ack of message tree
  • Implement distributed ack
  • Use a queue that can replay a failed message
  • Adopt MultiNodes

Contribution

Feel free to file issues and submit pull requests; contributions are welcome.

If you submit a pull request, please be sure to add or update corresponding test cases, and ensure that npm test continues to pass.

(Thanks to JSON5 by aseemk. This file is based on that project's README.md.)