chncaesar's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
nomic-ai/gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
meta-llama/llama
Inference code for Llama models
gpt-engineer-org/gpt-engineer
Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
ray-project/ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
chatchat-space/Langchain-Chatchat
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain
duckdb/duckdb
DuckDB is an analytical in-process SQL database management system
chroma-core/chroma
the AI-native open-source embedding database
iperov/DeepFaceLab
DeepFaceLab is the leading software for creating deepfakes.
eosphoros-ai/DB-GPT
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
Embedding/Chinese-Word-Vectors
100+ Chinese Word Vectors 上百种预训练中文词向量
kedro-org/kedro
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
THUDM/CodeGeeX
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
unitycatalog/unitycatalog
Open, Multi-modal Catalog for Data & AI
Qihoo360/Quicksql
A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources
zhp8341/flink-streaming-platform-web
基于flink的实时流计算web平台
souffle-lang/souffle
Soufflé is a variant of Datalog for tool designers crafting analyses in Horn clauses. Soufflé synthesizes a native parallel C++ program from a logic specification.
pyspark-ai/pyspark-ai
English SDK for Apache Spark
linkedin/coral
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
allwefantasy/auto-coder
AbsaOSS/spline
Data Lineage Tracking And Visualization Solution
quxiucheng/apache-calcite-tutorial
https://blog.csdn.net/QXC1281/article/details/89070285
allwefantasy/byzer-llm
Easy, fast, and cheap pretrain,finetune, serving for everyone
AbsaOSS/spline-spark-agent
Spline agent for Apache Spark
ashkapsky/BigDatalog
allwefantasy/BYZER-RETRIEVAL
Byzer-retrieval is a distributed retrieval system which designed as a backend for LLM RAG (Retrieval Augmented Generation). The system supports both BM25 retrieval algorithm and vector retrieval algorithm.
RoundYuanYuan/spark-field-lineage
spark 字段血缘 spark field lineage
chncaesar/byzer-llm