Hungsiro506's Stars
meilisearch/meilisearch
A lightning-fast search API that fits effortlessly into your apps, websites, and workflow
DataExpert-io/data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
cloudflare/pingora
A library for building fast, reliable and evolvable network services.
typesense/typesense
Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences
apache/brpc
brpc is an Industrial-grade RPC framework using C++ Language, which is often used in high performance system such as Search, Storage, Machine learning, Advertisement, Recommendation etc. "brpc" means "better RPC".
openobserve/openobserve
🚀 10x easier, 🚀 140x lower storage cost, 🚀 high performance, 🚀 petabyte scale - Elasticsearch/Splunk/Datadog alternative for 🚀 (logs, metrics, traces, RUM, Error tracking, Session replay).
apache/dolphinscheduler
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
joelparkerhenderson/architecture-decision-record
Architecture decision record (ADR) examples for software planning, IT leadership, and template documentation
bytebase/bytebase
World's most advanced database DevSecOps solution for Developer, Security, DBA and Platform Engineering teams. The GitHub/GitLab for database DevSecOps.
tigerbeetle/tigerbeetle
The financial transactions database designed for mission critical safety and performance.
mage-ai/mage-ai
🧙 Build, run, and manage data pipelines for integrating and transforming data.
growthbook/growthbook
Open Source Feature Flagging and A/B Testing Platform
jesse-ai/jesse
An advanced crypto trading bot written in Python
open-metadata/OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
treeverse/lakeFS
lakeFS - Data version control for your data lake | Git for data
argilla-io/argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
datafold/data-diff
Compare tables within or across databases
apache/paimon
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
ept/hermitage
What are the differences between the transaction isolation levels in databases? This is a suite of test cases which differentiate isolation levels.
teamhide/fastapi-boilerplate
FastAPI boilerplate for real world production
kubewharf/kubeadmiral
Multi-Cluster Kubernetes Orchestration
ananthdurai/schemata
Schema modelling framework for decentralised domain-driven ownership of data.
kbastani/order-delivery-microservice-example
This repository contains a functional example of an order delivery service similar to UberEats, DoorDash, and Instacart.
king/bravo
Utilities for processing Flink checkpoints/savepoints
makenotion/datahub-tools
simplify working with DataHub API endpoints
dilipsundarraj1/kafka-for-developers-using-schema-registry
This repository has the content to interact with Kafka using AVRO and Schema Registry.
hazelcast/big-data-benchmark
Nordstrom/elwin
An experimentation platform based on Facebook's Planout
michalklempa/flink-state-metadata
Move and alter Flink savepoint files so that Flink job can start from a relocated savepoint
DataDome/public-flink-utils
Utilities for Flink