Shekharv's Stars
explosion/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
plotly/plotly.js
Open-source JavaScript charting library behind Plotly and Dash
plotly/plotly.py
The interactive graphing library for Python ✨ This project now includes Plotly Express!
cayleygraph/cayley
An open-source graph database
pinpoint-apm/pinpoint
APM (Application Performance Management) tool for large-scale distributed systems.
datamade/usaddress
🇺🇸 A Python library for parsing unstructured United States address strings into address components
bigdatagenomics/adam
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
datamade/parserator
🔖 A toolkit for making domain-specific probabilistic parsers
gearpump/gearpump
Lightweight real-time big data streaming engine over Akka
larsga/Duke
Duke is a fast and flexible deduplication engine written in Java
datamade/probablepeople
👪 A Python library for parsing unstructured Western names into name components.
openscoring/openscoring
REST web service for the true real-time scoring (<1 ms) of Scikit-Learn, R and Apache Spark models
implydata/plywood
A toolkit for querying and interacting with Big Data
bigdatagenomics/mango
A scalable genome browser. Apache 2 licensed.
tfmorris/Names
A comprehensive database of name variants