Pinned Repositories
Cointegrated-Pairs-Trading
Algo trading strategy, entrance task to CMF, Quantitative Analytics program, 2021
CPP-hw
C++ homeworks @ St. Petersburg State University
Decision-Tree
Decision Tree Implementation as a part of my ML hw @ SPbU
dhc-robust-mapf
Learnable MAPF. “Distributed Heuristic Multi-Agent Path Finding with Communication” (DHC) algorithm from ICRA 2021 is implemented and benchmarked in out-of-distribution (OOD) scenarios. A new robust training loop to handle communication failures is introduced.
JB-Nucleon-Configurations
Kaggle-In-house-classification
Kaggle classification contest report (in Russian)
LeetCode-solutions
LeetCode solutions
multi-agent-pathfinding
Heuristic Search vs. Learning. "Distributed Heuristic Multi-Agent Path Finding with Communication" reproduced, trained & benchmarked with M*
ppl-kaggle-titanic
Titanic Kaggle contest
talks
My public talks are presented here
acforvs's Repositories
acforvs/multi-agent-pathfinding
Heuristic Search vs. Learning. "Distributed Heuristic Multi-Agent Path Finding with Communication" reproduced, trained & benchmarked with M*
acforvs/dhc-robust-mapf
Learnable MAPF. “Distributed Heuristic Multi-Agent Path Finding with Communication” (DHC) algorithm from ICRA 2021 is implemented and benchmarked in out-of-distribution (OOD) scenarios. A new robust training loop to handle communication failures is introduced.
acforvs/Cointegrated-Pairs-Trading
Algo trading strategy, entrance task to CMF, Quantitative Analytics program, 2021
acforvs/talks
My public talks are presented here
acforvs/ppl-kaggle-titanic
Titanic Kaggle contest
acforvs/Decision-Tree
Decision Tree Implementation as a part of my ML hw @ SPbU
acforvs/JB-Nucleon-Configurations
acforvs/Kaggle-In-house-classification
Kaggle classification contest report (in Russian)
acforvs/LeetCode-solutions
LeetCode solutions
acforvs/ppl-railway-station
Railway modelling
acforvs/ppl-text-index
Text file processing & index creation
acforvs/rhyme-bot
acforvs/tiktok
Entrance task for the "Tiktok for drivers" project, interactive map
acforvs/Gradient-Descent-Homework
Gradient Descent Homework for the ML Course @ SPbU
acforvs/DHC
Distributed Heuristic Multi-Agent Path Finding with Communication - ICRA 2021
acforvs/transformer
PyTorch implementation of the original transformer, from scratch
acforvs/open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
acforvs/optax
Optax is a gradient processing and optimization library for JAX.
acforvs/accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
acforvs/awac_iql
Offline to Online RL: AWAC & IQL PyTorch Implementation
acforvs/CodeTF
CodeTF: One-stop Transformer Library for State-of-the-art Code LLM
acforvs/deep-rl-class
This repo contain the syllabus of the Hugging Face Deep Reinforcement Learning Class.
acforvs/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
acforvs/introtodeeplearning
Lab Materials for MIT 6.S191: Introduction to Deep Learning
acforvs/starter-hugo-academic
🎓 Hugo Academic Theme 创建一个学术网站. Easily create a beautiful academic résumé or educational website using Hugo, GitHub, and Netlify.
acforvs/tau
Pipeline Parallelism for PyTorch
acforvs/tests
acforvs/text-generation-inference
Large Language Model Text Generation Inference
acforvs/TransPath
acforvs/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)