junwucs's Stars
rsennrich/subword-nmt
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
OpenNMT/Tokenizer
Fast and customizable text tokenization library with BPE and SentencePiece support
eole-nlp/eole
Open language modeling toolkit based on PyTorch
OpenNMT/OpenNMT-py
Open Source Neural Machine Translation and (Large) Language Models in PyTorch
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
uclaml/SPPO
The official implementation of Self-Play Preference Optimization (SPPO)
Open-Source-O1/o1_Reasoning_Patterns_Study
McGill-NLP/VinePPO
Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"
GAIR-NLP/OlympicArena
This is the official repository of the paper "OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI"
kyegomez/Lets-Verify-Step-by-Step
"Improving Mathematical Reasoning with Process Supervision" by OpenAI
GAIR-NLP/O1-Journey
O1 Replication Journey: A Strategic Progress Report – Part I
openreasoner/openr
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
state-spaces/mamba
Mamba SSM architecture
MARIO-Math-Reasoning/Super_MARIO
dingo-actual/infini-transformer
PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)
OpenLMLab/LEval
[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark
jquesnelle/yarn
YaRN: Efficient Context Window Extension of Large Language Models
princeton-nlp/LM-Science-Tutor
Linear95/SPAG
Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024
WooooDyy/LLM-Reverse-Curriculum-RL
Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" presented by Zhiheng Xi et al.
alienzhou/web-highlighter
✨ A lib with no runtime dependencies for text highlighting & persistence on any website ✨🖍️
tianyi-lab/Reflection_Tuning
[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
google-deepmind/funsearch
icip-cas/awesome-auto-alignment
Collection of papers for scalable automated alignment.
sangamesh-kodge/Verifix
[Verifix] - Post-Training Correction to Improve Label Noise Robustness with Verified Samples
bklieger-groq/g1
g1: Using Llama-3.1 70B on Groq to create o1-like reasoning chains
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
NoviScl/AI-Researcher
zhentingqi/rStar
ezelikman/quiet-star
Code for Quiet-STaR