luxinyu1's Stars
OpenRL-Lab/openrl
Unified Reinforcement Learning Framework
veronica320/Faithful-COT
Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".
tatsu-lab/opinions_qa
facebookresearch/LIGHT
LIGHT is a platform for text-situated dialogue research. We originally hosted LIGHT as a live game with dialogue models in a grounded setting. This repo contains all of the code to get the LIGHT game running, as well as reproducible code for the research projects along the way of getting LIGHT to where it was.
suzgunmirac/BIG-Bench-Hard
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
stanfordnlp/string2string
String-to-String Algorithms for Natural Language Processing
Mooler0410/LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
typst/typst
A new markup-based typesetting system that is powerful and easy to learn.
AIGC-Audio/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
google-research/cascades
Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference, and more.
atfortes/Awesome-LLM-Reasoning
Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought and OpenAI o1 🍓
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
camel-ai/camel
🐫 CAMEL: Finding the Scaling Law of Agents. The first and the best multi-agent framework. https://www.camel-ai.org
litanlitudan/skyagi
SkyAGI: Emerging human-behavior simulation capability in LLM
togethercomputer/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
chroma-core/chroma
the AI-native open-source embedding database
lucidrains/recurrent-memory-transformer-pytorch
Implementation of Recurrent Memory Transformer, Neurips 2022 paper, in Pytorch
anthropics/hh-rlhf
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
databrickslabs/dolly
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
Stability-AI/StableLM
StableLM: Stability AI Language Models
sunlab-osu/Understanding-CoT
PhoebusSi/Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台,我们欢迎开源爱好者发起任何有意义的pr!
lupantech/chameleon-llm
Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".
BAAI-Zlab/COIG
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
FreedomIntelligence/LLMZoo
⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡
OpenMOSS/MOSS
An open-source tool-augmented conversational language model from Fudan University
thu-coai/Safety-Prompts
Chinese safety prompts for evaluating and improving the safety of LLMs. 中文安全prompts,用于评估和提升大模型的安全性。
mingkaid/rl-prompt
Accompanying repo for the RLPrompt paper
XueFuzhao/awesome-mixture-of-experts
A collection of AWESOME things about mixture-of-experts