Pinned Repositories
airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
algorithms
Minimal examples of data structures and algorithms in Python
argilla
✨ Open-source tool for data-centric NLP. Argilla helps domain experts and data teams to build better NLP datasets in less time.
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
awesome-distributed-systems
A curated list to learn about distributed systems
mage-ai
🧙 Mage is an open-source tool for building and running data pipelines that transform your data.
metaflow
:rocket: Build and manage real-life data science projects with ease!
mlflow
Open source platform for the machine learning lifecycle
posthog
🦔 PostHog provides open-source product analytics, session recording, feature flagging and a/b testing that you can self-host.
prefect
The easiest way to coordinate your dataflow
soumojit's Repositories
soumojit/airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
soumojit/argilla
✨ Open-source tool for data-centric NLP. Argilla helps domain experts and data teams to build better NLP datasets in less time.
soumojit/Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
soumojit/mage-ai
🧙 Mage is an open-source tool for building and running data pipelines that transform your data.
soumojit/metaflow
:rocket: Build and manage real-life data science projects with ease!
soumojit/posthog
🦔 PostHog provides open-source product analytics, session recording, feature flagging and a/b testing that you can self-host.
soumojit/prefect
The easiest way to coordinate your dataflow
soumojit/beanie
Asynchronous Python ODM for MongoDB
soumojit/casbin
An authorization library that supports access control models like ACL, RBAC, ABAC in Golang: https://discord.gg/S5UjpzGZjN
soumojit/ClickHouse
ClickHouse® is a real-time analytics DBMS
soumojit/dgs-framework
GraphQL for Java with Spring Boot made easy.
soumojit/duckdb
DuckDB is an analytical in-process SQL database management system
soumojit/gluesql
GlueSQL is quite sticky. It attaches to anywhere.
soumojit/gluonts
Probabilistic time series modeling in Python
soumojit/influxdb
Scalable datastore for metrics, events, and real-time analytics
soumojit/kaldb
soumojit/lucene
Apache Lucene open-source search software
soumojit/mongo
The MongoDB Database
soumojit/OpenSearch
🔎 Open source distributed and RESTful search engine.
soumojit/pebble
RocksDB/LevelDB inspired key-value database in Go
soumojit/redis
Redis is an in-memory database that persists on disk. The data model is key-value, but many different kind of values are supported: Strings, Lists, Sets, Sorted Sets, Hashes, Streams, HyperLogLogs, Bitmaps.
soumojit/remix
Build Better Websites. Create modern, resilient user experiences with web fundamentals.
soumojit/rethinkdb
The open-source database for the realtime web.
soumojit/rocksdb
A library that provides an embeddable, persistent key-value store for fast storage.
soumojit/sonic
🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
soumojit/streamlit
Streamlit — The fastest way to build data apps in Python
soumojit/terarkdb
A RocksDB compatible KV storage engine with better performance
soumojit/the-algorithm
Source code for Twitter's Recommendation Algorithm
soumojit/tigerbeetle
The distributed financial accounting database designed for mission critical safety and performance.
soumojit/twitter-server
Twitter-Server defines a template from which services at Twitter are built