Linear95

Researcher at Tencent AI Lab

Alibaba GroupShenzhen, China

Pinned Repositories

APO
Code for ACL2024 paper - Adversarial Preference Optimization (APO).
Language:Python56 1 33
Awesome-LLM-Robotics
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
2 0 00
bert-intent-slot-detector
BERT-based intent and slots detector for chatbots.
Language:Python173 1 1125
BinarySentEmb
Code for ACL 2019 oral paper - Learning Compressed Sentence Representations for On-Device Text Processing.
Language:Python43 6 37
CLUB
Code for ICML2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information
Language:Jupyter Notebook329 6 2740
DetGP
Code for the AAAI 2020 oral paper - Dynamic Embedding on Textual Networks via a Gaussian Process.
Language:Python11 3 20
DSP
Domain-specific preference (DSP) data and customized RM fine-tuning.
Language:Python25 1 03
RLM
Code for the paper - Replacing Language Model for Style Transfer
Language:Python3 1 00
SPAG
Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024
Language:Python124 4 922
TC-estimation
Code for AISTATS 2023 paper - Estimating Total Correlation with Mutual Information Estimators
Language:Jupyter Notebook16 4 02

Linear95's Repositories

Linear95/CLUB
Code for ICML2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information
Language:Jupyter Notebook329 6 2740
Linear95/bert-intent-slot-detector
BERT-based intent and slots detector for chatbots.
Language:Python173 1 1125
Linear95/SPAG
Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024
Language:Python124 4 922
Linear95/APO
Code for ACL2024 paper - Adversarial Preference Optimization (APO).
Language:Python56 1 33
Linear95/BinarySentEmb
Code for ACL 2019 oral paper - Learning Compressed Sentence Representations for On-Device Text Processing.
Language:Python43 6 37
Linear95/DSP
Domain-specific preference (DSP) data and customized RM fine-tuning.
Language:Python25 1 03
Linear95/TC-estimation
Code for AISTATS 2023 paper - Estimating Total Correlation with Mutual Information Estimators
Language:Jupyter Notebook16 4 02
Linear95/DetGP
Code for the AAAI 2020 oral paper - Dynamic Embedding on Textual Networks via a Gaussian Process.
Language:Python11 3 20
Linear95/RLM
Code for the paper - Replacing Language Model for Style Transfer
Language:Python3 1 00
Linear95/Awesome-LLM-Robotics
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
2 0 00
Linear95/linear95.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript2 1 04
Linear95/ECC_classification
The implement of ECC classification
Language:Python1 1 00
Linear95/LLM-with-RL-papers
A collection of LLM with RL papers
1 0 0
Linear95/alpaca-lora
Instruct-tune LLaMA on consumer hardware
Language:Jupyter Notebook0 0 00
Linear95/emacs-init
My emacs init file for python coding in deep learning
Language:Emacs Lisp0 1 00
Linear95/Linear95
My personal repository
0 2 00
Linear95/Megatron-LM
Ongoing research training transformer models at scale
Language:Python0 0 00
Linear95/awesome-auto-alignment
Linear95/Awesome-LLM-Reasoning
Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought, Instruction-Tuning and Multimodality.
0 0
Linear95/Awesome-LLM-RL
A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.
0 0
Linear95/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
0 0

Linear95

Pinned Repositories

APO

Awesome-LLM-Robotics

bert-intent-slot-detector

BinarySentEmb

CLUB

DetGP

DSP

RLM

SPAG

TC-estimation

Linear95's Repositories

Linear95/CLUB

Linear95/bert-intent-slot-detector

Linear95/SPAG

Linear95/APO

Linear95/BinarySentEmb

Linear95/DSP

Linear95/TC-estimation

Linear95/DetGP

Linear95/RLM

Linear95/Awesome-LLM-Robotics

Linear95/linear95.github.io

Linear95/ECC_classification

Linear95/LLM-with-RL-papers

Linear95/alpaca-lora

Linear95/emacs-init

Linear95/Linear95

Linear95/Megatron-LM

Linear95/awesome-auto-alignment

Linear95/Awesome-LLM-Reasoning

Linear95/Awesome-LLM-RL

Linear95/awesome-RLHF