sz640's Stars
unslothai/unsloth
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
allenai/reward-bench
RewardBench: the first evaluation tool for reward models.
hamishivi/EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
generative-ai-on-aws/generative-ai-on-aws
Generative AI on AWS
zhangxjohn/LLM-Agent-Benchmark-List
A banchmark list for evaluation of large language models.
AlibabaResearch/DAMO-ConvAI
DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.
acadTags/Awesome-medical-coding-NLP
A collection of papers on automated medical coding from free-texts
SqueezeAILab/LLMCompiler
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
mlfoundations/open_lm
A repository for research on medium sized language models.
microsoft/OptiGuide
Large Language Models for Supply Chain Optimization
THUDM/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
aws-samples/generative-ai-amazon-bedrock-langchain-agent-example
karpathy/LLM101n
LLM101n: Let's build a Storyteller
All-Hands-AI/OpenHands
🙌 OpenHands: Code Less, Make More
uptrain-ai/uptrain
UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform root cause analysis on failure cases and give insights on how to resolve them.
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
andrewyng/translation-agent
alexbatalov/fallout2-ce
Fallout 2 for modern operating systems
glgh/awesome-llm-human-preference-datasets
A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.
stanford-futuredata/ARES
GAIR-NLP/scaleeval
Scalable Meta-Evaluation of LLMs as Evaluators
Yale-LILY/SummEval
Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper
dataelement/bisheng
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.
OpenBMB/ChatDev
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
pyg-team/pytorch_geometric
Graph Neural Network Library for PyTorch
brokenloop/jsontopydantic
Web tool for generating Pydantic models from JSON objects
ax-llm/ax
The unofficial DSPy framework. Build LLM powered Agents and "Agentic workflows" based on the Stanford DSP paper.
Scale3-Labs/langtrace
Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vectorDBs and more.. Integrate using Typescript, Python. 🚀💻📊
mosaicml/llm-foundry
LLM training code for Databricks foundation models