daltonj
Jeff Dalton. Associate Professor. Includes work on BBC Voice UoG collaboration.
University of Edinburgh
daltonj's Stars
mcartright/julien
Toolkit for Information Retrieval research
factorie/factorie
FACTORIE is a toolkit for deployable probabilistic modeling, implemented as a software library in Scala. It provides its users with a succinct language for creating relational factor graphs, estimating parameters and performing inference.
ejmeij/trec-kba
This project contains some Hadoop code for working with the TREC Knowledge Base Acceleration dataset. In particular, it provides classes to read/write topic files, read/write run files, and expose the documents in the Thrift files as Hadoop-readable objects.
CIIR/Proteus
Million Book Project
umasscs/umassthesis
Unofficial UMass thesis style files for use with LaTeX
davglass/zipcodes
Zipcode lookup node module
metzlerd/mavuno
Mavuno: A Hadoop-Based Text Mining Toolkit
aritter/twitter_nlp
Twitter NLP Tools
brendano/ark-tweet-nlp
CMU ARK Twitter Part-of-Speech Tagger
twitter/util
Wonderful reusable code from Twitter
twitter-archive/commons
Twitter common libraries for Python and the JVM (deprecated)
dwijnand/sbt-extras
A more featureful runner for sbt, the simple/scala/standard build tool
twitter-archive/twitter-text-java
A Java implementation of Twitter's text processing library
twitter-archive/ostrich
A stats collector & reporter for Scala servers (deprecated)
mesos/spark
Lightning-fast cluster computing in Java, Scala and Python.
bixo/bixo
Bixo is an open source web mining toolkit that runs as a series of Cascading pipes on top of Hadoop. By building a customized Cascading pipe assembly, you can quickly create specialized web mining applications.
daltonj/CIIRShared
Repository for the clue/synonym project
dlwh/breeze
Breeze is a library for numerical processing, machine learning, and natural language processing. Its primary focus is on being generic, clean, and powerful without sacrificing (much) efficiency. Breeze merges the ScalaNLP and Scalala projects; the merger came about because one of the original maintainers is unable to continue development. The Scalala parts are largely rewritten.
scalala/Scalala
Scalala has been superseded by dlwh/breeze. Scalala is a high performance numeric linear algebra library for Scala, with rich Matlab-like operators on vectors and matrices; a library of numerical routines; support for plotting.
stevej/scala-json
Scala JSON toolkit. Originally from the Odersky "Stairway" Book, tightened up and tests added by Twitter, Inc.