zhangzheming33

OR&AI

zhangzheming33's Stars

pengxl8518/RecSys
Language:Python116
mouna99/dien
Language:Python1.1k401
zhougr1993/DeepInterestNetwork
Language:Python1.6k558
i-Jayus/RecSystem-Pytorch
推荐系统论文算法实现，包括序列推荐，多任务学习，元学习等。 Recommendation system papers implementations, including sequence recommendation, multi-task learning, meta-learning, etc.
Language:Python14219
tmdt-buw/schlably
Official Schlably Repository by the Institute for TMDT
Language:Python6023
zcaicaros/L2D
Official implementation of paper "Learning to Dispatch for Job Shop Scheduling via Deep Reinforcement Learning"
Language:Python28793
Lei-Kun/FJSP-benchmarks
The public benchmark instances of flexible job shop scheduling problem
567
Vance0124/Token-level-Direct-Preference-Optimization
Reference implementation for Token-level Direct Preference Optimization(TDPO)
Language:Python909
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
Language:Python6.9k497
princeton-nlp/SimPO
SimPO: Simple Preference Optimization with a Reference-Free Reward
Language:Python64939
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Language:Python2k165
openai/lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
Language:Python1.2k162
huggingface/course
The Hugging Face course on Transformers
Language:MDX2.2k708
huggingface/deep-rl-class
This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course.
Language:MDX3.8k585
huggingface/trl
Train transformer language models with reinforcement learning.
Language:Python9.3k1.2k
RLHFlow/Online-RLHF
A recipe for online RLHF.
Language:Python37843
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
Language:Python64357
songwenas12/fjsp-drl
Language:Python20155
microsoft/onnxruntime-inference-examples
Examples for using ONNX Runtime for machine learning inferencing.
Language:C++1.1k322
inisis/brocolli
Everything in Torch Fx
Language:Python33663
DD-DuDa/TensorRT-in-Action
TensorRT-in-Action 是一个 GitHub 代码库，提供了使用 TensorRT 的代码示例，并有对应 Jupyter Notebook。
Language:Jupyter Notebook102
Lei-Kun/Dispatching-rules-for-FJSP
This is the official code for the baseline methods of the publised paper 'A Multi-action Deep Reinforcement Learning Framework for Flexible Job-shop Scheduling Problem'
Language:Python7612
Lei-Kun/End-to-end-DRL-for-FJSP
This is the official code of the publised paper 'A Multi-action Deep Reinforcement Learning Framework for Flexible Job-shop Scheduling Problem'
Language:Python23554
DubingXiang/light_or
light_or is a tool that help you develop Operational Research algorithms to solve combinatorial optimization problems.
Language:C++21
InsaneLife/ChineseNLPCorpus
中文自然语言处理数据集，平时做做实验的材料。欢迎补充提交合并。
Language:Python4.2k783
Lisennlp/TinyBert
简洁易用版TinyBert：基于Bert进行知识蒸馏的预训练语言模型
Language:Python25049
TobiasLee/Pretrained-Language-Model
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
Language:Python31
huawei-noah/Pretrained-Language-Model
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
Language:Python3k628
MineQihang/BDCI2023
CCF大数据与计算智能大赛 - 线上线下全场景生鲜超市库存履约一体化决策
Language:Python152
jinwen-yang/cuPDLP.jl
Language:Julia398

zhangzheming33

zhangzheming33's Stars

pengxl8518/RecSys

mouna99/dien

zhougr1993/DeepInterestNetwork

i-Jayus/RecSystem-Pytorch

tmdt-buw/schlably

zcaicaros/L2D

Lei-Kun/FJSP-benchmarks

Vance0124/Token-level-Direct-Preference-Optimization

FlagOpen/FlagEmbedding

princeton-nlp/SimPO

eric-mitchell/direct-preference-optimization

openai/lm-human-preferences

huggingface/course

huggingface/deep-rl-class

huggingface/trl

RLHFlow/Online-RLHF

RLHFlow/RLHF-Reward-Modeling

songwenas12/fjsp-drl

microsoft/onnxruntime-inference-examples

inisis/brocolli

DD-DuDa/TensorRT-in-Action

Lei-Kun/Dispatching-rules-for-FJSP

Lei-Kun/End-to-end-DRL-for-FJSP

DubingXiang/light_or

InsaneLife/ChineseNLPCorpus

Lisennlp/TinyBert

TobiasLee/Pretrained-Language-Model

huawei-noah/Pretrained-Language-Model

MineQihang/BDCI2023

jinwen-yang/cuPDLP.jl