SwiftSquirrel

SwiftSquirrel's Stars

yc930401/Actor-Critic-pytorch
Actor Critic model to play Cartpole game
Language:Python5122
tsen159/REINFORCE-algorithm
PyTorch implementation of vanilla REINFORCE algorithm, REINFORCE with baseline and REINFORCE with GAE
Language:Python2
ronikobrosly/causal-curve
A python package with tools to perform causal inference using observational data when the treatment of interest is continuous.
Language:Python27018
DanielPalaio/MountainCar-v0_DeepRL
OpenAI MountainCar-v0 DeepRL-based solutions (DQN, DuelingDQN, D3QN)
Language:Python231
CFOnHeart/ReforceLearning
主要利用QLearning,DQN,ImprovedDQN(Ddouble DQN) 解决gym框架下的三个问题CartPole-v0,MountainCar-v0,Acrobot-v1
Language:Python125
sweetice/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Language:Python3.9k841
h-shahidi/cs885-rl
Language:Jupyter Notebook43
airalcorn2/RankNet
My (slightly modified) Keras implementation of RankNet and PyTorch implementation of LambdaRank.
Language:Python24747
liyinxiao/LambdaRankNN
LambdaRank Neural Network model using Keras.
Language:Python8422
Freemanzxp/GBDT_Simple_Tutorial
python实现GBDT的回归、二分类以及多分类，将算法流程详情进行展示解读并可视化，庖丁解牛地理解GBDT。Gradient Boosting Decision Trees regression, dichotomy and multi-classification are realized based on python, and the details of algorithm flow are displayed, interpreted and visualized to help readers better understand Gradient Boosting Decision Trees
Language:Python716196
rbgirshick/fast-rcnn
Fast R-CNN
Language:Python3.3k1.6k
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）
Language:HTML7.8k753
sahil280114/codealpaca
Language:Python1.4k108
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language:Python29.4k4k
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
2.5k157
yizhongw/self-instruct
Aligning pretrained language models with instruction data generated by themselves.
Language:Python4.1k482
yakami129/VirtualWife
VirtualWife是一个虚拟数字人项目，支持B站直播，支持openai、ollama
Language:Python1.5k269
YUANZHUO-BNU/metahuman_overview
数字人资料整理
31045
HarderThenHarder/transformers_tasks
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.
Language:Jupyter Notebook2.1k377
borgwang/tinynn
A lightweight deep learning library
Language:Python36993
ashishpatel26/LLM-Finetuning
LLM Finetuning with peft
Language:Jupyter Notebook2.1k575
philschmid/deep-learning-pytorch-huggingface
Language:Jupyter Notebook630145
Ryota-Kawamura/LangChain-for-LLM-Application-Development
In LangChain for LLM Application Development, you will gain essential skills in expanding the use cases and capabilities of language models in application development using the LangChain framework.
Language:Jupyter Notebook174145
onlyphantom/llm-python
Large Language Models (LLMs) tutorials & sample scripts, ft. langchain, openai, llamaindex, gpt, chromadb & pinecone
Language:Jupyter Notebook669262
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Language:Python31.1k3.8k
liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验。
Language:HTML9.2k905
timmens/causal-forest
Implements the Causal Forest algorithm formulated in Athey and Wager (2018).
Language:Python6412
golbin/WaveNet
Yet another WaveNet implementation in PyTorch.
Language:Python11430
JellalYu/DeepAR
Implementation of DeepAR in PyTorch.
Language:Python101
Lapis-Hong/wide_deep
Wide and Deep Learning for CTR Prediction in tensorflow
Language:Python290134