Pinned Repositories
APO
Code for ACL2024 paper - Adversarial Preference Optimization (APO).
Awesome-LLM-Robotics
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
bert-intent-slot-detector
BERT-based intent and slots detector for chatbots.
BinarySentEmb
Code for ACL 2019 oral paper - Learning Compressed Sentence Representations for On-Device Text Processing.
CLUB
Code for ICML2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information
DetGP
Code for the AAAI 2020 oral paper - Dynamic Embedding on Textual Networks via a Gaussian Process.
DSP
Domain-specific preference (DSP) data and customized RM fine-tuning.
RLM
Code for the paper - Replacing Language Model for Style Transfer
SPAG
Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024
TC-estimation
Code for AISTATS 2023 paper - Estimating Total Correlation with Mutual Information Estimators
Linear95's Repositories
Linear95/CLUB
Code for ICML2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information
Linear95/bert-intent-slot-detector
BERT-based intent and slots detector for chatbots.
Linear95/SPAG
Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024
Linear95/APO
Code for ACL2024 paper - Adversarial Preference Optimization (APO).
Linear95/BinarySentEmb
Code for ACL 2019 oral paper - Learning Compressed Sentence Representations for On-Device Text Processing.
Linear95/DSP
Domain-specific preference (DSP) data and customized RM fine-tuning.
Linear95/TC-estimation
Code for AISTATS 2023 paper - Estimating Total Correlation with Mutual Information Estimators
Linear95/DetGP
Code for the AAAI 2020 oral paper - Dynamic Embedding on Textual Networks via a Gaussian Process.
Linear95/RLM
Code for the paper - Replacing Language Model for Style Transfer
Linear95/Awesome-LLM-Robotics
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
Linear95/linear95.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Linear95/ECC_classification
The implement of ECC classification
Linear95/LLM-with-RL-papers
A collection of LLM with RL papers
Linear95/alpaca-lora
Instruct-tune LLaMA on consumer hardware
Linear95/emacs-init
My emacs init file for python coding in deep learning
Linear95/Linear95
My personal repository
Linear95/Megatron-LM
Ongoing research training transformer models at scale
Linear95/awesome-auto-alignment
Linear95/Awesome-LLM-Reasoning
Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought, Instruction-Tuning and Multimodality.
Linear95/Awesome-LLM-RL
A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.
Linear95/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)