Pinned Repositories
AAAI-19_slide_poster
Generating-Chinese-Ci
This repository is about Chinese Ci(宋词) generation, and the paper has been accepted by AAAI-19
LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
Long_form_VideoQA
[EMNLP’24 Main] Encoding and Controlling Global Semantics for Long-form Video Question Answering
longLLM-Extrapolation-Papers
LongRecipe
LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models
PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
ProToD
UoT
[NeurIPS 2024] Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models
zhiyuanhubj's Repositories
zhiyuanhubj/UoT
[NeurIPS 2024] Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models
zhiyuanhubj/LongRecipe
LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models
zhiyuanhubj/Long_form_VideoQA
[EMNLP’24 Main] Encoding and Controlling Global Semantics for Long-form Video Question Answering
zhiyuanhubj/ProToD
zhiyuanhubj/longLLM-Extrapolation-Papers
zhiyuanhubj/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
zhiyuanhubj/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
zhiyuanhubj/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
zhiyuanhubj/awesome-llm-powered-agent
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
zhiyuanhubj/BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
zhiyuanhubj/Data-Processing-for-Satisfaction-Prediction
zhiyuanhubj/DPAC-DialogueGAN
This repo implements GAN-based models for Dialogue Generation (DP-GAN, SeqGAN, and our own proposed DPAC-GAN)
zhiyuanhubj/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
zhiyuanhubj/code_switch
zhiyuanhubj/cs_assigment
zhiyuanhubj/EvalAI-Starters
How to create a challenge on EvalAI?
zhiyuanhubj/gpt4free
decentralising the Ai Industry, just some language model api's...
zhiyuanhubj/human_evaluation
zhiyuanhubj/LLaMA-Factory
Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
zhiyuanhubj/Long-Context-Data-Engineering
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
zhiyuanhubj/longLLM-Extrapolation-Paper
zhiyuanhubj/MAgIC
This is the official implementation for the paper: Use Your INSTINCT: INSTruction optimization usIng Neural bandits Coupled with Transformers
zhiyuanhubj/multiwoz
Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)
zhiyuanhubj/pics
pics
zhiyuanhubj/Planning_Under_Uncertainty
zhiyuanhubj/Tenant
zhiyuanhubj/tutorials
PyTorch tutorials.
zhiyuanhubj/UGRO-CIMK23
zhiyuanhubj/zhiyuan.github.io
zhiyuanhubj/zhiyuanhubj.github.io
My personal homepage