wangkc1008's Stars
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
tspeterkim/flash-attention-minimal
Flash Attention in ~100 lines of CUDA (forward pass only)
xlang-ai/Spider2
Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
fatedier/frp
A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.
lancedb/lancedb
Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
HeKun-NVIDIA/CUDA-Programming-Guide-in-Chinese
This is a Chinese translation of the CUDA programming guide
Exploration-Lab/BookSQL
ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
huggingface/text-embeddings-inference
A blazing-fast inference solution for text embedding models
answerlink/IntelliQ
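Advanced Multi-Turn QA System with LLM and Intent Recognition. Implements multi-turn QA and NL2API using LLM-based intent recognition and parameter extraction combined with slot filling. Aims to be a best-practice reference for multi-turn QA with Function Call.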
microsoft/FLAML
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
SeldonIO/MLServer
An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
tobymao/sqlglot
Python SQL Parser and Transpiler
mlflow/mlflow
Open source platform for the machine learning lifecycle
uber/petastorm
The Petastorm library enables single-machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as TensorFlow, PyTorch, and PySpark and can be used from pure Python code.
PaddlePaddle/FastDeploy
⚡️ An easy-to-use and fast deep learning model deployment toolkit for ☁️ Cloud, 📱 Mobile, and 📹 Edge. Covers 20+ mainstream scenarios across image, video, text, and audio, and 150+ SOTA models, with end-to-end optimization and multi-platform, multi-framework support.
multimodal-art-projection/MAP-NEO
Oneflow-Inc/oneflow
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
katanaml/sparrow
Data processing with ML, LLM and Vision LLM
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
meta-llama/llama3
The official Meta Llama 3 GitHub site
xai-org/grok-1
Grok open release
thuml/Time-Series-Library
A Library for Advanced Deep Time Series Models.
texttron/tevatron
Tevatron - A flexible toolkit for neural retrieval research and development.
oobabooga/text-generation-webui
A Gradio web UI for Large Language Models.
ise-uiuc/magicoder
[ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct
lakesoul-io/LakeSoul
LakeSoul is an end-to-end, real-time, cloud-native lakehouse framework with fast data ingestion, concurrent updates, and incremental data analytics on cloud storage for both BI and AI applications.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
kwai/blaze
A blazing-fast query execution engine that speaks the Apache Spark language and has Arrow-DataFusion at its core.