junwucs

Xiong Jun Wu @ Ant Group, Beijing

Universe

junwucs's Stars

rsennrich/subword-nmt
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
Language:Python2.2k465
OpenNMT/Tokenizer
Fast and customizable text tokenization library with BPE and SentencePiece support
Language:C++28470
eole-nlp/eole
Open language modeling toolkit based on PyTorch
Language:Python6012
OpenNMT/OpenNMT-py
Open Source Neural Machine Translation and (Large) Language Models in PyTorch
Language:Python6.8k2.3k
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++8.6k984
uclaml/SPPO
The official implementation of Self-Play Preference Optimization (SPPO)
Language:Python49262
Open-Source-O1/o1_Reasoning_Patterns_Study
Language:Python575
McGill-NLP/VinePPO
Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"
Language:Python806
GAIR-NLP/OlympicArena
This is the official repository of the paper "OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI"
Language:JavaScript853
kyegomez/Lets-Verify-Step-by-Step
"Improving Mathematical Reasoning with Process Supervision" by OPENAI
Language:Python796
GAIR-NLP/O1-Journey
O1 Replication Journey: A Strategic Progress Report – Part I
1.3k34
openreasoner/openr
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Language:Python1k71
state-spaces/mamba
Mamba SSM architecture
Language:Python13.2k1.1k
MARIO-Math-Reasoning/Super_MARIO
Language:Python24216
dingo-actual/infini-transformer
PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)
Language:Python27923
OpenLMLab/LEval
[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark
Language:Python35714
jquesnelle/yarn
YaRN: Efficient Context Window Extension of Large Language Models
Language:Python1.3k117
princeton-nlp/LM-Science-Tutor
Language:Python362
Linear95/SPAG
Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024
Language:Python9611
WooooDyy/LLM-Reverse-Curriculum-RL
Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" presented by Zhiheng Xi et al.
Language:Python725
alienzhou/web-highlighter
✨ A no-runtime dependency lib for text highlighting & persistence on any website ✨🖍️
Language:TypeScript882144
tianyi-lab/Reflection_Tuning
[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
Language:Python33629
google-deepmind/funsearch
Language:Jupyter Notebook733130
icip-cas/awesome-auto-alignment
Collection of papers for scalable automated alignment.
727
sangamesh-kodge/Verifix
[Verifix] - Post-Training Correction to Improve Label Noise Robustness with Verified Samples
Language:Python4
bklieger-groq/g1
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
Language:Python3.9k352
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
5.1k284
NoviScl/AI-Researcher
Language:Python22521
zhentingqi/rStar
Language:Python49457
ezelikman/quiet-star
Code for Quiet-STaR
Language:Python64788

junwucs

junwucs's Stars

rsennrich/subword-nmt

OpenNMT/Tokenizer

eole-nlp/eole

OpenNMT/OpenNMT-py

NVIDIA/TensorRT-LLM

uclaml/SPPO

Open-Source-O1/o1_Reasoning_Patterns_Study

McGill-NLP/VinePPO

GAIR-NLP/OlympicArena

kyegomez/Lets-Verify-Step-by-Step

GAIR-NLP/O1-Journey

openreasoner/openr

state-spaces/mamba

MARIO-Math-Reasoning/Super_MARIO

dingo-actual/infini-transformer

OpenLMLab/LEval

jquesnelle/yarn

princeton-nlp/LM-Science-Tutor

Linear95/SPAG

WooooDyy/LLM-Reverse-Curriculum-RL

alienzhou/web-highlighter

tianyi-lab/Reflection_Tuning

google-deepmind/funsearch

icip-cas/awesome-auto-alignment

sangamesh-kodge/Verifix

bklieger-groq/g1

hijkzzz/Awesome-LLM-Strawberry

NoviScl/AI-Researcher

zhentingqi/rStar

ezelikman/quiet-star