alexanderdean's Stars
yoheinakajima/babyagi
iterative/dvc
🦉 Data Versioning and ML Experiments
weaviate/weaviate
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database​.
datahub-project/datahub
The Metadata Platform for your Data and AI Stack
open-metadata/OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
marqo-ai/marqo
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
memgraph/memgraph
Open-source graph database, tuned for dynamic analytics environments. Easy to adopt, scale and own.
unitycatalog/unitycatalog
Open, Multi-modal Catalog for Data & AI
Eventual-Inc/Daft
Distributed data engine for Python/SQL designed for the cloud, powered by Rust
chocolate-doom/chocolate-doom
Chocolate Doom is a Doom source port that is minimalist and historically accurate.
iterative/datachain
AI-data warehouse to enrich, transform and analyze unstructured data
restatedev/restate
Restate is the platform for building resilient applications that tolerate all infrastructure faults w/o the need for a PhD.
slatedb/slatedb
A cloud native embedded storage engine built on object storage.
apache/jena
Apache Jena
NVIDIA-Merlin/NVTabular
NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.
warpstreamlabs/bento
Fancy stream processing made operationally mundane. This repository is a fork of the original project before the license was changed.
topoteretes/cognee
Reliable LLM Memory for AI Applications and AI Agents
yoheinakajima/mindgraph
proof of concept prototype for generating and querying against an ever-expanding knowledge graph with ai
yquake2/yquake2
The Yamagi Quake II client
airbnb/chronon
Chronon is a data platform for serving for AI/ML applications.
FalkorDB/FalkorDB
A super fast Graph Database uses GraphBLAS under the hood for its sparse adjacency matrix graph representation. Our goal is to provide the best Knowledge Graph for LLM (GraphRAG).
lakehq/sail
LakeSail's computation framework with a mission to unify stream processing, batch processing, and compute-intensive (AI) workloads.
castagna/jena-examples
A collection of ready to use, small and self contained examples on how to use Apache Jena
humanlayer/humanlayer
HumanLayer enables AI agents to communicate with humans in tool-based and async workflows. Guarantee human oversight of high-stakes function calls with approval workflows across slack, email and more. Bring your LLM and Framework of choice and start giving your AI agents safe access to the world. Agentic Workflows, human in the loop, tool calling
ryrobes/rvbbit
Reactive Data Board & Visual Flow Platform
NVIDIA-Merlin/systems
Merlin Systems provides tools for combining recommendation models with other elements of production recommender systems (like feature stores, nearest neighbor search, and exploration strategies) into end-to-end recommendation pipelines that can be served with Triton Inference Server.
semanticdatalayer/SML
Open-source repository for Semantic Modeling Language (SML)
google-marketing-solutions/Tightlock
snowplow-incubator/snowplow-javascript-tracker-examples
A repository for examples applications using the Snowplow JavaScript Tracker
cashfree/killbill-kafka-consumer-plugin
This is the OSGI based plugin for Kill Bill | Open-source billing and payment platform to provide support to collect usage through Kafka Stream.