Pinned Repositories
3d_masks
3d masks using three.js and facemesh by tensorflow.js
acms
agents
An Open-source Framework for Autonomous Language Agents
AI-QMIX
Code for "AI-QMIX: Attention and Imagination for Dynamic Multi-Agent Reinforcement Learning"
awesome-llm-rl-agents
List of sources related to llms, transformers and reinforcement learning agents
awesome-ml-cybersecurity
rllib.js
Reinforcement learning library with JavaScript.
VK_NEXT_CHAT
3D Web chat, using Three.js + Peer.js + Node.js
webrtc_chat_3d_engine
There is Engine for creating web 3d chats. It is made using WebRTC (Peer.js) + WebGL (Three.js).
tokarev-i-v's Repositories
tokarev-i-v/awesome-ml-cybersecurity
tokarev-i-v/algo
tokarev-i-v/alphageometry
tokarev-i-v/ARP
Procgen Experiments of "Guide Your Agent with Adaptive Multimodal Rewards"
tokarev-i-v/backtrader_moexalgo
MOEX API AlgoPack integration with Backtrader. На данных с биржи MOEX теперь можно создавать полноценные торговые стратегии. Проводить Backtesting и делать Live торговлю через брокеров Алор, Финам и тех, у кого есть торговый терминал Quik.
tokarev-i-v/barrier-method
An expansion of the Triple-Barrier Method by Marcos López de Prado
tokarev-i-v/csle
A research platform to develop automated security policies using quantitative methods, e.g. optimal control, computational game theory, reinforcement learning, optimization, evolutionary methods, and causal inference.
tokarev-i-v/cultural-accumulation
tokarev-i-v/DeepCubeAI
Learning Discrete World Models for Heuristic Search
tokarev-i-v/DIFUSCO
Code of NeurIPS paper: arxiv.org/abs/2302.08224
tokarev-i-v/EfficientZero
Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.
tokarev-i-v/ember
Elastic Malware Benchmark for Empowering Researchers
tokarev-i-v/ergodic_rl
tokarev-i-v/garak
LLM vulnerability scanner
tokarev-i-v/grokfast
Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"
tokarev-i-v/GrokkedTransformer
Code for the paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
tokarev-i-v/LAPO
Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)
tokarev-i-v/LightZero
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
tokarev-i-v/llm.c
LLM training in simple, raw C/CUDA
tokarev-i-v/loss-of-plasticity
Demonstrations of Loss of Plasticity and Implementation of Continual Backpropagation
tokarev-i-v/MacroHFT
tokarev-i-v/metasploit-framework
Metasploit Framework
tokarev-i-v/mistral-src
Reference implementation of Mistral AI 7B v0.1 model.
tokarev-i-v/mistral_finetune_notebooks
tokarev-i-v/pipegoose
Megatron-LM 3D parallelism for 🤗 transformers model *(still work in progress)*
tokarev-i-v/secml_malware
Create adversarial attacks against machine learning Windows malware detectors
tokarev-i-v/TheArtofHPC_pdfs
All pdfs of Victor Eijkhout's Art of HPC books and courses
tokarev-i-v/tokarev-i-v.github.io
My site
tokarev-i-v/torax
TORAX: Tokamak transport simulation in JAX
tokarev-i-v/wapiti
Web vulnerability scanner written in Python3