xiangjjj's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
meta-llama/llama
Inference code for Llama models
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
databrickslabs/dolly
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
huggingface/trl
Train transformer language models with reinforcement learning.
FMInference/FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
mosaicml/llm-foundry
LLM training code for Databricks foundation models
microsoft/DialoGPT
Large-scale pretraining for dialogue
allenai/RL4LMs
A modular RL library to fine-tune language models to human preferences
huggingface/huggingface_hub
The official Python client for the Huggingface Hub.
fhamborg/news-please
news-please - an integrated web crawler and information extractor for news that just works
microsoft/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
anthropics/hh-rlhf
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
EleutherAI/the-pile
aws/deep-learning-containers
AWS Deep Learning Containers are pre-built Docker images that make it easier to run popular deep learning frameworks and tools on AWS.
conceptofmind/LaMDA-rlhf-pytorch
Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.
amazon-science/alexa-teacher-models
amazon-science/ReFinED
ReFinED is an efficient and accurate entity linking (EL) system.
commoncrawl/cc-crawl-statistics
Statistics of Common Crawl monthly archives mined from URL index files
cocrawler/cdx_toolkit
A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine
google-research/dialog-inpainting
neptune-ai/examples
📝 Examples of how to use Neptune for different use cases and with various MLOps tools
grill-lab/DL-Hard
Deep Learning Hard (DL-HARD) is a new annotated dataset extending TREC Deep Learning benchmark.
maximedb/full
Evaluation of open-domain dialog using Follow-Ups Log-Likelihood (FULL) https://aclanthology.org/2022.coling-1.40/
amazon-science/ccsum