/redicorpus

Distributed, out-of-core corpus building, querying, and modeling

Primary LanguagePythonBSD 2-Clause "Simplified" LicenseBSD-2-Clause

Redicorpus -- the distributed, out of core, real-time solution for building and querying linguistic data

Build Status codecov.io Documentation Status

In development -- unstable

Description

Redicorpus builds linguistic corpora in real-time to give you temporal resultion on the order of a single day, instead of years or decades.

Its database and computing tasks are distributed in parallel, which makes it fault-tolerant and easy to scale out.

Frequently used intermediate data are computed in advance, which reduces the latency for common queries.