mshen2's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
f/awesome-chatgpt-prompts
This repo includes ChatGPT prompt curation to use ChatGPT better.
nomic-ai/gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Stability-AI/StableLM
StableLM: Stability AI Language Models
sympy/sympy
A computer algebra system written in pure Python
Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
imoneoi/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
bigscience-workshop/promptsource
Toolkit for creating, sharing and using natural language prompts.
FranxYao/chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
microsoft/promptbench
A unified evaluation framework for large language models
thunlp/UltraChat
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
chiphuyen/lazynlp
Library to scrape and clean web pages to create massive datasets.
xlang-ai/instructor-embedding
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
castorini/pyserini
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
teknium1/GPTeacher
A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer
gururise/AlpacaDataCleaned
Alpaca dataset from Stanford, cleaned and curated
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
sahil280114/codealpaca
Xwin-LM/Xwin-LM
Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment
allenai/dolma
Data and tools for generating and inspecting OLMo pre-training data.
PrithivirajDamodaran/Parrot_Paraphraser
A practical and feature-rich paraphrasing framework to augment human intents in text form to build robust NLU models for conversational engines. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.
GEM-benchmark/NL-Augmenter
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
kaistAI/SelFee
Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"
g8a9/ferret
A python package for benchmarking interpretability techniques on Transformers.
Spico197/Humback
🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.
kasnerz/tabgenie
A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.
abrazinskas/sigir2022-opinion-summarization-tutorial
This repository contains materials for the SIGIR 2022 tutorial on opinion summarization.