SwiftSquirrel's Stars
yc930401/Actor-Critic-pytorch
Actor-Critic model that plays the CartPole game
tsen159/REINFORCE-algorithm
PyTorch implementation of vanilla REINFORCE algorithm, REINFORCE with baseline and REINFORCE with GAE
ronikobrosly/causal-curve
A Python package with tools to perform causal inference using observational data when the treatment of interest is continuous.
DanielPalaio/MountainCar-v0_DeepRL
OpenAI MountainCar-v0 DeepRL-based solutions (DQN, DuelingDQN, D3QN)
CFOnHeart/ReforceLearning
Mainly uses Q-Learning, DQN, and improved DQN (Double DQN) to solve three problems in the gym framework: CartPole-v0, MountainCar-v0, and Acrobot-v1
sweetice/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
h-shahidi/cs885-rl
airalcorn2/RankNet
My (slightly modified) Keras implementation of RankNet and PyTorch implementation of LambdaRank.
liyinxiao/LambdaRankNN
LambdaRank Neural Network model using Keras.
Freemanzxp/GBDT_Simple_Tutorial
Python implementation of GBDT for regression, binary classification, and multi-class classification. The details of the algorithm flow are displayed, interpreted, and visualized to help readers better understand Gradient Boosting Decision Trees.
rbgirshick/fast-rcnn
Fast R-CNN
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine (open-source Chinese conversational large language model)
sahil280114/codealpaca
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
yizhongw/self-instruct
Aligning pretrained language models with instruction data generated by themselves.
yakami129/VirtualWife
VirtualWife is a virtual digital human project that supports Bilibili live streaming and works with OpenAI and Ollama
YUANZHUO-BNU/metahuman_overview
A curated collection of resources on digital humans
HarderThenHarder/transformers_tasks
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.
borgwang/tinynn
A lightweight deep learning library
ashishpatel26/LLM-Finetuning
LLM Finetuning with peft
philschmid/deep-learning-pytorch-huggingface
Ryota-Kawamura/LangChain-for-LLM-Application-Development
In LangChain for LLM Application Development, you will gain essential skills in expanding the use cases and capabilities of language models in application development using the LangChain framework.
onlyphantom/llm-python
Large Language Models (LLMs) tutorials & sample scripts, ft. langchain, openai, llamaindex, gpt, chromadb & pinecone
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
liguodongiot/llm-action
This project aims to share the technical principles behind large models, along with hands-on experience.
timmens/causal-forest
Implements the Causal Forest algorithm formulated in Athey and Wager (2018).
golbin/WaveNet
Yet another WaveNet implementation in PyTorch.
JellalYu/DeepAR
Implementation of DeepAR in PyTorch.
Lapis-Hong/wide_deep
Wide and Deep Learning for CTR Prediction in TensorFlow