tomyoung903
Tom Young is a research fellow at NUS working on language models.
NTU50 Nanyang Ave, 639798
tomyoung903's Stars
meta-llama/llama
Inference code for Llama models
All-Hands-AI/OpenHands
🙌 OpenHands: Code Less, Make More
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
lucidrains/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
THUDM/GLM-130B
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
XueFuzhao/OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
volcengine/veScale
A PyTorch Native LLM Training Framework
declare-lab/instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
XueFuzhao/InstructionWild
ConvLab/ConvLab
DSTC8 Track 1 Task 1 End-to-End Multi-Domain Dialog Challenge Result:
NUS-HPC-AI-Lab/InfoBatch
Lossless Training Speed Up by Unbiased Dynamic Data Pruning
magic-research/Dataset_Quantization
[ICCV2023] Dataset Quantization
thu-coai/ccm
This project is a tensorflow implement of our work, CCM (Commonsense Conversational Model).
NUS-HPC-AI-Lab/SpeeD
SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
jasonwu0731/GLMP
PyTorch code for ICLR 2019 paper: Global-to-local Memory Pointer Networks for Task-Oriented Dialogue https://arxiv.org/pdf/1901.04713
wang-chen/thesis_template_ntu
Thesis Latex Template for Nanyang Technological University (NTU)
Yanqing0327/DREAM
Efficient Dataset Distillation by Representative Matching
Tomiinek/MultiWOZ_Evaluation
Unified MultiWOZ evaluation scripts for the context-to-response task.
mireshghallah/mixmatch
Repository for ACL 2022 paper Mix and Match: Learning-free Controllable Text Generation using Energy Language Models
tomyoung903/FusedChat
FusedChat is a dialogue dataset. It contains dialogue sessions fusing task-oriented dialogues and open-domain dialogues.
tomyoung903/MLM_inconsistencies
Inconsistencies in Masked Language Models