panyxy's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
lm-sys/RouteLLM
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality
paperswithcode/releasing-research-code
Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)
FeiLiu36/LLM4Opt
A Collection on Large Language Models for Optimization
Edward-Sun/DIFUSCO
Code of NeurIPS paper: arxiv.org/abs/2302.08224
henry-yeh/DeepACO
[NeurIPS 2023] DeepACO: Neural-enhanced Ant Systems for Combinatorial Optimization
martyput/MDP_book
yining043/VRP-DACT
This repo implements our paper, "Learning to Iteratively Solve Routing Problems with Dual-Aspect Collaborative Transformer", which has been accepted at NeurIPS 2021.
ahottung/NLNS
Neural Large Neighborhood Search for the Capacitated Vehicle Routing Problem
liuzuxin/DSRL
🔥 Datasets and env wrappers for offline safe reinforcement learning
r-three/phatgoose
Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"
zdhNarsil/GFlowNet-CombOpt
PyTorch implementation for our NeurIPS 2023 spotlight paper "Let the Flows Tell: Solving Graph Combinatorial Optimization Problems with GFlowNets".
Not-Diamond/awesome-ai-model-routing
A curated list of awesome approaches to AI model routing
ahottung/EAS
Efficient Active Search
Stalence/erdos_neu
Official Repo for the NeurIPS2020 paper "Erdos Goes Neural: An Unsupervised Learning Framework for Combinatorial Optimization on Graphs"
yimengmin/UTSP
code repo for paper Unsupervised Learning for Solving the Travelling Salesman Problem
DIMESTeam/DIMES
yining043/NeuOpt
This repo implements our paper, "Learning to Search Feasible and Infeasible Regions of Routing Problems with Flexible Neural k-Opt", which has been accepted at NeurIPS 2023.
marmotlab/DAN
Public version of the decentralized, attention-based mTSP code
naver/bq-nco
Thinklab-SJTU/NAR-CO-Solver
Official implementation non-autoregressive combinatorial optimizaiton solvers, covering our ICLR 2023 paper and SCIENTIA SINICA Informationis paper
alga-hopf/drl-graph-partitioning
DRL models for graph partitioning and sparse matrix ordering.
wangjksjtu/rl-perturbed-reward
Reinforcement Learning with Perturbed Reward, AAAI 2020
WindyLee0822/Process_Q_Model
official implementation of paper "Process Reward Model with Q-value Rankings"
gaocrr/ELG
Official implementation of IJCAI'24 paper "Towards Generalizable Neural Solvers for Vehicle Routing Problems via Ensemble with Transferrable Local Policy"
Graph-COM/CO_ProxyDesign
The repository for 'Unsupervised Learning for Combinatorial Optimization with Principled Proxy Design'
kaist-silab/meta-sage
[ICML 2023] Meta-SAGE: Scale Meta-Learning Scheduled Adaptation with Guided Exploration for Mitigating Scale Shift on Combinatorial Optimization
Graph-COM/Meta_CO
the official repository of the paper unsupervised learning for combinatorial optimization needs meta learning