i-gao's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—language models
guidance-ai/guidance
A guidance language for controlling large language models.
Stability-AI/StableLM
StableLM: Stability AI Language Models
openai/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
stas00/ml-engineering
Machine Learning Engineering Open Book
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
THUDM/CogVLM
A state-of-the-art-level open visual language model | Multimodal pretrained model
google-research/arxiv-latex-cleaner
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
arcee-ai/mergekit
Tools for merging pretrained large language models.
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
Luodian/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
google/uncertainty-baselines
High-quality implementations of standard and SOTA methods on a variety of tasks.
allenai/mmc4
MultimodalC4 is a multimodal extension of C4 that interleaves millions of images with text.
ray-project/llmperf
LLMPerf is a library for validating and benchmarking LLMs
mlfoundations/datacomp
DataComp: In search of the next generation of multimodal datasets
ray-project/llmperf-leaderboard
huggingface/OBELICS
Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M documents, 115B text tokens and 353M images.
jthickstun/watermark
Code for watermarking language models
Hritikbansal/generative-robustness
Create generated datasets and train robust classifiers
hauselin/domain-quality-ratings
Comprehensive database of ratings for 11k news domains
rom1504/slurm-tracking-bot
Simple Slurm tracking bot to check usage
MadryLab/AIaaS_Supply_Chains
Dataset and overview