zhiyuanhubj

PhD student in NUS

National Unversity of SingaporeSingapore

Pinned Repositories

AAAI-19_slide_poster
20 1 014
Generating-Chinese-Ci
This repository is about Chinese Ci(宋词) generation, and the paper has been accepted by AAAI-19
30
LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
21
Long_form_VideoQA
[EMNLP’24 Main] Encoding and Controlling Global Semantics for Long-form Video Question Answering
Language:Python120
longLLM-Extrapolation-Papers
41
LongRecipe
LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models
Language:Python60 1 54
PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Language:Python2 1 00
PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Language:Python1 1 00
ProToD
70
UoT
[NeurIPS 2024] Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models
Language:Python64 3 33

zhiyuanhubj's Repositories

zhiyuanhubj/UoT
[NeurIPS 2024] Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models
Language:Python64 3 33
zhiyuanhubj/LongRecipe
LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models
Language:Python60 1 54
zhiyuanhubj/Long_form_VideoQA
[EMNLP’24 Main] Encoding and Controlling Global Semantics for Long-form Video Question Answering
Language:Python120
zhiyuanhubj/ProToD
70
zhiyuanhubj/longLLM-Extrapolation-Papers
41
zhiyuanhubj/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
21
zhiyuanhubj/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Language:Python2 1 00
zhiyuanhubj/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Language:Python1 1 00
zhiyuanhubj/awesome-llm-powered-agent
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
0 0 00
zhiyuanhubj/BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
Language:Python0 0 00
zhiyuanhubj/Data-Processing-for-Satisfaction-Prediction
Language:Python00
zhiyuanhubj/DPAC-DialogueGAN
This repo implements GAN-based models for Dialogue Generation (DP-GAN, SeqGAN, and our own proposed DPAC-GAN)
Language:Python0 1 00
zhiyuanhubj/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
zhiyuanhubj/code_switch
Language:Python1 0
zhiyuanhubj/cs_assigment
Language:Jupyter Notebook
zhiyuanhubj/EvalAI-Starters
How to create a challenge on EvalAI?
zhiyuanhubj/gpt4free
decentralising the Ai Industry, just some language model api's...
Language:Python0 0
zhiyuanhubj/human_evaluation
1 0
zhiyuanhubj/LLaMA-Factory
Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
Language:Python0 0
zhiyuanhubj/Long-Context-Data-Engineering
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
zhiyuanhubj/longLLM-Extrapolation-Paper
1 0
zhiyuanhubj/MAgIC
This is the official implementation for the paper: Use Your INSTINCT: INSTruction optimization usIng Neural bandits Coupled with Transformers
zhiyuanhubj/multiwoz
Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)
zhiyuanhubj/pics
pics
Language:Jupyter Notebook
zhiyuanhubj/Planning_Under_Uncertainty
zhiyuanhubj/Tenant
Language:Python
zhiyuanhubj/tutorials
PyTorch tutorials.
zhiyuanhubj/UGRO-CIMK23
zhiyuanhubj/zhiyuan.github.io
2 0
zhiyuanhubj/zhiyuanhubj.github.io
My personal homepage
Language:SCSS