moshangcheng's Stars
huihut/interview
📚 C/C++ 技术面试基础知识总结,包括语言、程序库、数据结构、算法、系统、网络、链接装载库等知识及面试经验、招聘、内推等信息。This repository is a summary of the basic knowledge of recruiting job seekers and beginners in the direction of C/C++ technology, including language, program library, data structure, algorithm, system, network, link loading library, interview experience, recruitment, recommendation, etc.
surrealdb/surrealdb
A scalable, distributed, collaborative, document-graph database, for the realtime web
mindsdb/mindsdb
Platform for building AI that can learn and answer questions over federated data.
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
umijs/qiankun
📦 🚀 Blazing fast, simple and complete solution for micro frontends.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.
DayBreak-u/chineseocr_lite
超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M
StarRocks/starrocks
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.
manticoresoftware/manticoresearch
Easy to use open source fast database for search | Good alternative to Elasticsearch now | Drop-in replacement for E in the ELK soon
finos/perspective
A data visualization and analytics component, especially well-suited for large and/or streaming datasets.
Netflix/metaflow
Open Source AI/ML Platform
uber/cadence
Cadence is a distributed, scalable, durable, and highly available orchestration engine to execute asynchronous long-running business logic in a scalable and resilient way.
yujiosaka/headless-chrome-crawler
Distributed crawler powered by Headless Chrome
schemaorg/schemaorg
Schema.org - schemas and supporting software
apache/incubator-seatunnel
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
apache/arrow-datafusion
Apache DataFusion SQL Query Engine
facebookincubator/AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
seladb/PcapPlusPlus
PcapPlusPlus is a multiplatform C++ library for capturing, parsing and crafting of network packets. It is designed to be efficient, powerful and easy to use. It provides C++ wrappers for the most popular packet processing engines such as libpcap, Npcap, WinPcap, DPDK, AF_XDP and PF_RING.
tinysearch/tinysearch
🔍 Tiny, full-text search engine for static websites built with Rust and Wasm
google/cpu_features
A cross platform C99 library to get cpu features at runtime.
madawei2699/awesome-seo
Google SEO Research and Web Traffic Monetization
CeresDB/ceresdb
CeresDB is a high-performance, distributed, cloud native time-series database.
apache/incubator-kvrocks
Kvrocks is a distributed key value NoSQL database that uses RocksDB as storage engine and is compatible with Redis protocol.
ChunelFeng/CGraph
【A common used C++ DAG framework】 一个通用的、无三方依赖的、跨平台的、收录于awesome-cpp的、基于流图的并行计算框架。欢迎star & fork & 交流
substrait-io/substrait
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
blaze-init/blaze
Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
apache/tez
Apache Tez
betodealmeida/shillelagh
Making it easy to query APIs via SQL
oap-project/gazelle_plugin
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
astronomer/airflow-dbt-demo
A repository of sample code to accompany our blog post on Airflow and dbt.