mshen2

mshen2's Stars

Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Language:Python166k 1.6k 2.6k44.1k
f/awesome-chatgpt-prompts
This repo includes ChatGPT prompt curation to use ChatGPT better.
Language:HTML111k 1.4k 015k
nomic-ai/gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Language:C++69.1k 638 1.8k7.6k
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language:Python53.8k 446 1315.6k
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python26.7k 223 4.4k3.9k
Stability-AI/StableLM
StableLM: Stability AI Language Models
Language:Jupyter Notebook15.8k 200 761k
sympy/sympy
A computer algebra system written in pure Python
Language:Python12.8k 291 13.3k4.4k
Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Language:Python6k 68 269518
imoneoi/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
Language:Python5.2k 50 187397
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Language:Python4.4k 49 289469
bigscience-workshop/promptsource
Toolkit for creating, sharing and using natural language prompts.
Language:Python2.6k 32 162346
FranxYao/chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
Language:Jupyter Notebook2.5k 37 34125
microsoft/promptbench
A unified evaluation framework for large language models
Language:Python2.4k 21 50178
thunlp/UltraChat
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
Language:Python2.2k 39 30113
chiphuyen/lazynlp
Library to scrape and clean web pages to create massive datasets.
Language:Python2.2k 74 11311
xlang-ai/instructor-embedding
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
Language:Python1.8k 17 109136
castorini/pyserini
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
Language:Python1.6k 18 542356
teknium1/GPTeacher
A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer
Language:Python1.6k 46 5169
gururise/AlpacaDataCleaned
Alpaca dataset from Stanford, cleaned and curated
Language:Python1.5k 27 25146
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Language:Jupyter Notebook1.4k 7 140224
sahil280114/codealpaca
Language:Python1.4k 21 19108
Xwin-LM/Xwin-LM
Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment
Language:Python1k 37 2041
allenai/dolma
Data and tools for generating and inspecting OLMo pre-training data.
Language:Python909 19 6994
PrithivirajDamodaran/Parrot_Paraphraser
A practical and feature-rich paraphrasing framework to augment human intents in text form to build robust NLU models for conversational engines. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.
Language:Python867 14 46143
GEM-benchmark/NL-Augmenter
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
Language:Python770 23 52195
kaistAI/SelFee
Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"
Language:Python219 7 619
g8a9/ferret
A python package for benchmarking interpretability techniques on Transformers.
Language:Python207 1 2316
Spico197/Humback
🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.
Language:Python128 3 98
kasnerz/tabgenie
A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.
Language:Python52 5 583
abrazinskas/sigir2022-opinion-summarization-tutorial
This repository contains materials for the SIGIR 2022 tutorial on opinion summarization.
34 4 11