zhangzheming33's Stars
pengxl8518/RecSys
mouna99/dien
zhougr1993/DeepInterestNetwork
i-Jayus/RecSystem-Pytorch
推荐系统论文算法实现,包括序列推荐,多任务学习,元学习等。 Recommendation system papers implementations, including sequence recommendation, multi-task learning, meta-learning, etc.
tmdt-buw/schlably
Official Schlably Repository by the Institute for TMDT
zcaicaros/L2D
Official implementation of paper "Learning to Dispatch for Job Shop Scheduling via Deep Reinforcement Learning"
Lei-Kun/FJSP-benchmarks
The public benchmark instances of flexible job shop scheduling problem
Vance0124/Token-level-Direct-Preference-Optimization
Reference implementation for Token-level Direct Preference Optimization(TDPO)
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
princeton-nlp/SimPO
SimPO: Simple Preference Optimization with a Reference-Free Reward
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
openai/lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
huggingface/course
The Hugging Face course on Transformers
huggingface/deep-rl-class
This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course.
huggingface/trl
Train transformer language models with reinforcement learning.
RLHFlow/Online-RLHF
A recipe for online RLHF.
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
songwenas12/fjsp-drl
microsoft/onnxruntime-inference-examples
Examples for using ONNX Runtime for machine learning inferencing.
inisis/brocolli
Everything in Torch Fx
DD-DuDa/TensorRT-in-Action
TensorRT-in-Action 是一个 GitHub 代码库,提供了使用 TensorRT 的代码示例,并有对应 Jupyter Notebook。
Lei-Kun/Dispatching-rules-for-FJSP
This is the official code for the baseline methods of the publised paper 'A Multi-action Deep Reinforcement Learning Framework for Flexible Job-shop Scheduling Problem'
Lei-Kun/End-to-end-DRL-for-FJSP
This is the official code of the publised paper 'A Multi-action Deep Reinforcement Learning Framework for Flexible Job-shop Scheduling Problem'
DubingXiang/light_or
light_or is a tool that help you develop Operational Research algorithms to solve combinatorial optimization problems.
InsaneLife/ChineseNLPCorpus
中文自然语言处理数据集,平时做做实验的材料。欢迎补充提交合并。
Lisennlp/TinyBert
简洁易用版TinyBert:基于Bert进行知识蒸馏的预训练语言模型
TobiasLee/Pretrained-Language-Model
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
huawei-noah/Pretrained-Language-Model
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
MineQihang/BDCI2023
CCF大数据与计算智能大赛 - 线上线下全场景生鲜超市库存履约一体化决策
jinwen-yang/cuPDLP.jl