dafu-wu

Make a small contribution to the world

dafu-wu's Stars

leondz/garak
LLM vulnerability scanner
Language:Python1.1k125
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Language:Python3.4k300
GitHubDaily/GitHubDaily
坚持分享 GitHub 上高质量、有趣实用的开源技术教程、开发者工具、编程网站、技术资讯。A list cool, interesting projects of GitHub.
31k3.4k
andrewyng/translation-agent
Language:Python4k447
traceloop/openllmetry
Open-source observability for your LLM application, based on OpenTelemetry
Language:Python1.5k121
triton-inference-server/backend
Common source, scripts and utilities for creating Triton backends.
Language:C++27380
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Language:Python12.2k1.2k
triton-inference-server/tensorrtllm_backend
The Triton TensorRT-LLM Backend
Language:Python59886
karpathy/llm.c
LLM training in simple, raw C/CUDA
Language:Cuda21.9k2.4k
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
10.7k710
agiresearch/AIOS
AIOS: LLM Agent Operating System
Language:Python3k357
princeton-nlp/SWE-agent
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.
Language:Python12k1.2k
protectai/modelscan
Protection against Model Serialization Attacks
Language:Python23846
databricks/dbrx
Code examples and resources for DBRX, a large language model developed by Databricks
Language:Python2.5k231
flexflow/FlexFlow
FlexFlow Serve: Low-Latency, High-Performance LLM Serving
Language:C++1.6k219
databricks/megablocks
Language:Python1.1k156
NVIDIA/deepops
Tools for building GPU clusters
Language:Shell1.2k318
xai-org/grok-1
Grok open release
Language:Python49.2k8.3k
kedacore/keda
KEDA is a Kubernetes-based Event Driven Autoscaling component. It provides event driven scale for any container running in Kubernetes
Language:Go8.1k1k
outlines-dev/outlines
Structured Text Generation
Language:Python7.2k370
nexusflowai/NexusRaven-V2
Language:Jupyter Notebook34729
rubra-ai/rubra
Open Weight, tool-calling LLMs
Language:Makefile13919
uclaml/SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
Language:Python89978
OpenNLPLab/lightning-attention
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models
Language:Python17415
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Language:Python5.4k488
yangjianxin1/Firefly
Firefly: 大模型训练工具，支持训练Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
Language:Python5.3k475
parallel75/Microsoft_AutoGen_Tutorial
微软 AutoGen 框架 Demo
Language:Python5511
sgl-project/sglang
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.
Language:Python2.8k183
HeKun-NVIDIA/CUDA-Programming-Guide-in-Chinese
This is a Chinese translation of the CUDA programming guide
1k163
ray-project/llmperf-leaderboard
40510

dafu-wu

dafu-wu's Stars

leondz/garak

InternLM/lmdeploy

GitHubDaily/GitHubDaily

andrewyng/translation-agent

traceloop/openllmetry

triton-inference-server/backend

infiniflow/ragflow

triton-inference-server/tensorrtllm_backend

karpathy/llm.c

BradyFU/Awesome-Multimodal-Large-Language-Models

agiresearch/AIOS

princeton-nlp/SWE-agent

protectai/modelscan

databricks/dbrx

flexflow/FlexFlow

databricks/megablocks

NVIDIA/deepops

xai-org/grok-1

kedacore/keda

outlines-dev/outlines

nexusflowai/NexusRaven-V2

rubra-ai/rubra

uclaml/SPIN

OpenNLPLab/lightning-attention

pytorch-labs/gpt-fast

yangjianxin1/Firefly

parallel75/Microsoft_AutoGen_Tutorial

sgl-project/sglang

HeKun-NVIDIA/CUDA-Programming-Guide-in-Chinese

ray-project/llmperf-leaderboard