sz640

sz640's Stars

unslothai/unsloth
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
Language:Python15.8k1.1k
allenai/reward-bench
RewardBench: the first evaluation tool for reward models.
Language:Python36345
hamishivi/EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
Language:Python5212
generative-ai-on-aws/generative-ai-on-aws
Generative AI on AWS
Language:Jupyter Notebook448190
zhangxjohn/LLM-Agent-Benchmark-List
A banchmark list for evaluation of large language models.
561
AlibabaResearch/DAMO-ConvAI
DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.
Language:Python1.2k185
acadTags/Awesome-medical-coding-NLP
A collection of papers on automated medical coding from free-texts
11317
SqueezeAILab/LLMCompiler
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
Language:Python1.4k103
mlfoundations/open_lm
A repository for research on medium sized language models.
Language:Python46969
microsoft/OptiGuide
Large Language Models for Supply Chain Optimization
Language:Jupyter Notebook30243
THUDM/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Language:Python2.1k147
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Language:Jupyter Notebook37.3k3.9k
aws-samples/generative-ai-amazon-bedrock-langchain-agent-example
Language:Python215383
karpathy/LLM101n
LLM101n: Let's build a Storyteller
28.7k1.6k
All-Hands-AI/OpenHands
🙌 OpenHands: Code Less, Make More
Language:Python31.5k3.6k
uptrain-ai/uptrain
UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform root cause analysis on failure cases and give insights on how to resolve them.
Language:Python2.2k192
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Language:Python5.5k502
andrewyng/translation-agent
Language:Python4.6k526
alexbatalov/fallout2-ce
Fallout 2 for modern operating systems
Language:C++1.7k119
glgh/awesome-llm-human-preference-datasets
A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.
30012
stanford-futuredata/ARES
Language:Python43748
GAIR-NLP/scaleeval
Scalable Meta-Evaluation of LLMs as Evaluators
Language:Python403
Yale-LILY/SummEval
Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper
Language:Python36142
dataelement/bisheng
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.
Language:Python8.6k1.6k
OpenBMB/ChatDev
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
Language:Shell25.2k3.2k
pyg-team/pytorch_geometric
Graph Neural Network Library for PyTorch
Language:Python21k3.6k
brokenloop/jsontopydantic
Web tool for generating Pydantic models from JSON objects
Language:TypeScript31610
ax-llm/ax
The unofficial DSPy framework. Build LLM powered Agents and "Agentic workflows" based on the Stanford DSP paper.
Language:TypeScript1k68
Scale3-Labs/langtrace
Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vectorDBs and more.. Integrate using Typescript, Python. 🚀💻📊
Language:TypeScript43334
mosaicml/llm-foundry
LLM training code for Databricks foundation models
Language:Python4k523