ZuoMatthew's Stars
nalgeon/redka
Redis re-implemented with SQLite
chdb-io/chdb
chDB is an embedded OLAP SQL Engine 🚀 powered by ClickHouse
dolthub/dolt
Dolt – Git for Data
zeroSteiner/rule-engine
A lightweight, optionally typed expression language with a custom grammar for matching arbitrary Python objects.
StarRocks/starrocks
StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries. Winner of InfoWorld’s 2023 BOSSIE Award for best open source software.
ploomber/ploomber
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
sfu-db/connector-x
Fastest library to load data from DB to DataFrames in Rust and Python
apache/arrow-adbc
Database connectivity API standard and libraries for Apache Arrow
zauberzeug/nicegui
Create web-based user interfaces with Python. The nice way.
Eventual-Inc/Daft
Distributed DataFrame for Python designed for the cloud, powered by Rust
valkey-io/valkey
A new project to resume development on the formerly open-source Redis project. We're calling it Valkey, since it's a twist on the key-value datastore.
cozodb/cozo
A transactional, relational-graph-vector database that uses Datalog for query. The hippocampus for AI!
myscale/MyScaleDB
An open-source, high-performance SQL vector database built on ClickHouse.
apache/iceberg-python
Apache PyIceberg
morph-kgc/morph-kgc
Powerful RDF Knowledge Graph Generation with RML Mappings
TuGraph-family/tugraph-db
TuGraph is a high performance graph database.
kuzudb/kuzu
Embeddable property graph database management system built for query speed and scalability. Implements Cypher.
hydradatabase/hydra
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
ollama/ollama
Get up and running with Llama 3, Mistral, Gemma, and other large language models.
google/lightweight_mmm
LightweightMMM 🦇 is a lightweight Bayesian Marketing Mix Modeling (MMM) library that allows users to easily train MMMs and obtain channel attribution information.
paradedb/paradedb
Postgres for Search and Analytics
pymc-devs/pymc
Bayesian Modeling and Probabilistic Programming in Python
GreptimeTeam/greptimedb
An open-source, cloud-native, distributed time-series database with PromQL/SQL/Python supported. Available on GreptimeCloud.
mlcommons/GaNDLF
A generalizable application framework for segmentation, regression, and classification using PyTorch
sktime/skpro
A unified framework for tabular probabilistic regression and probability distributions in Python
sb-ai-lab/Py-Boost
Python-based GBDT implementation on GPU. Efficient multioutput (multiclass/multilabel/multitask) training
lee-group-cmu/RFCDE
Random Forests for Conditional Density Estimation
SelfExplainML/PiML-Toolbox
PiML (Python Interpretable Machine Learning) toolbox for model development & diagnostics
SolarArbiter/solarforecastarbiter-core
Core data gathering, validation, processing, and reporting package for the Solar Forecast Arbiter
pvlib/pvlib-python
A set of documented functions for simulating the performance of photovoltaic energy systems.