/carat-dataset-tools

Holds a sample project able to open the Carat dataset.

Primary LanguageScalaMIT LicenseMIT

carat-dataset-tools

Holds a sample project able to open the Carat dataset. To get access to the Carat dataset, contact carat ät cs helsinki fi. See also: http://carat.cs.helsinki.fi/

Build instructions

  1. Install maven3.
  2. `cd carat-dataset-tools; mvn clean package
  3. The above produces the file target/carat-dataset-tools-1.0.0-jar-with-dependencies.jar. You are now ready to run the example class of the project.

Run instructions

$SPARK_HOME/bin/spark-submit --class fi.helsinki.cs.nodes.carat.examples.SamplesFromJsonGz \
$PWD/target/carat-dataset-tools-1.0.0-jar-with-dependencies.jar --input /path/to/caratdata --output dummy.csv

Learn more

See the example class.

Development

In the Carat project, we use Scala IDE: http://scala-ide.org/