tomyoung903

Tom Young is a research fellow at NUS working on language models.

NTU50 Nanyang Ave, 639798

tomyoung903's Stars

meta-llama/llama
Inference code for Llama models
Language:Python57.1k 525 1.1k9.6k
All-Hands-AI/OpenHands
🙌 OpenHands: Code Less, Make More
Language:Python40.5k 323 2k4.5k
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
Language:Python39k 386 1.7k4.3k
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language:Python23k 190 5242.3k
lucidrains/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Language:Python7.7k 143 48671
THUDM/GLM-130B
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
Language:Python7.7k 99 199608
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
Language:Python7.4k 39 1.2k2k
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Language:Python4.4k 26 581468
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Language:Python2.3k 19 84189
XueFuzhao/OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
Language:Python1.4k 14 874
volcengine/veScale
A PyTorch Native LLM Training Framework
Language:Python688 33 1836
declare-lab/instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
Language:Python537 13 3043
XueFuzhao/InstructionWild
456 9 641
ConvLab/ConvLab
DSTC8 Track 1 Task 1 End-to-End Multi-Domain Dialog Challenge Result:
Language:Python403 24 58110
NUS-HPC-AI-Lab/InfoBatch
Lossless Training Speed Up by Unbiased Dynamic Data Pruning
Language:Python322 7 1718
magic-research/Dataset_Quantization
[ICCV2023] Dataset Quantization
Language:Python256 7 1419
thu-coai/ccm
This project is a tensorflow implement of our work, CCM (Commonsense Conversational Model).
Language:Python219 16 1268
NUS-HPC-AI-Lab/SpeeD
SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
Language:Python161 9 85
jasonwu0731/GLMP
PyTorch code for ICLR 2019 paper: Global-to-local Memory Pointer Networks for Task-Oriented Dialogue https://arxiv.org/pdf/1901.04713
Language:Python160 15 924
wang-chen/thesis_template_ntu
Thesis Latex Template for Nanyang Technological University (NTU)
Language:TeX149 3 749
Yanqing0327/DREAM
Efficient Dataset Distillation by Representative Matching
Language:Python110 2 139
Tomiinek/MultiWOZ_Evaluation
Unified MultiWOZ evaluation scripts for the context-to-response task.
Language:Python57 5 612
mireshghallah/mixmatch
Repository for ACL 2022 paper Mix and Match: Learning-free Controllable Text Generation using Energy Language Models
Language:Python41 2 35
tomyoung903/FusedChat
FusedChat is a dialogue dataset. It contains dialogue sessions fusing task-oriented dialogues and open-domain dialogues.
Language:Python29 2 62
tomyoung903/MLM_inconsistencies
Inconsistencies in Masked Language Models
Language:Python7 1 00

tomyoung903

tomyoung903's Stars

meta-llama/llama

All-Hands-AI/OpenHands

hpcaitech/ColossalAI

hpcaitech/Open-Sora

lucidrains/PaLM-rlhf-pytorch

THUDM/GLM-130B

EleutherAI/lm-evaluation-harness

open-compass/opencompass

eric-mitchell/direct-preference-optimization

XueFuzhao/OpenMoE

volcengine/veScale

declare-lab/instruct-eval

XueFuzhao/InstructionWild

ConvLab/ConvLab

NUS-HPC-AI-Lab/InfoBatch

magic-research/Dataset_Quantization

thu-coai/ccm

NUS-HPC-AI-Lab/SpeeD

jasonwu0731/GLMP

wang-chen/thesis_template_ntu

Yanqing0327/DREAM

Tomiinek/MultiWOZ_Evaluation

mireshghallah/mixmatch

tomyoung903/FusedChat

tomyoung903/MLM_inconsistencies