tokarev-i-v

Lifelong learner

Pinned Repositories

3d_masks
3D masks using three.js and the facemesh model from TensorFlow.js
Language:JavaScript 1 3 10
AI-QMIX
Code for "AI-QMIX: Attention and Imagination for Dynamic Multi-Agent Reinforcement Learning"
Language:Python 2 0 01
awesome-llm-rl-agents
List of sources related to LLMs, transformers, and reinforcement learning agents
1 2 01
awesome-ml-cybersecurity
0 1 00
reasoning-lib
10
researchim
10
rllib.js
Reinforcement learning library in JavaScript.
Language:JavaScript 38 5 11
tfjs-gans
A collection of GANs made using tfjs and THREE.js
Language:JavaScript 1 1 00
VK_NEXT_CHAT
3D web chat using Three.js + Peer.js + Node.js
Language:JavaScript 1 2 00
webrtc_chat_3d_engine
An engine for creating 3D web chats, made using WebRTC (Peer.js) + WebGL (Three.js).
Language:JavaScript 2 1 00

tokarev-i-v's Repositories

tokarev-i-v/researchim
10
tokarev-i-v/awesome-ml-cybersecurity
0 1 00
tokarev-i-v/algo
0 0
tokarev-i-v/cultural-accumulation
Language:Jupyter Notebook 0 0
tokarev-i-v/dart-math
[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*
tokarev-i-v/DeepCubeAI
Learning Discrete World Models for Heuristic Search
Language:Python 0 0
tokarev-i-v/grokfast
Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"
Language:Python 0 0
tokarev-i-v/GrokkedTransformer
Code for the paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
Language:Python 0 0
tokarev-i-v/Grounding_LLMs_with_online_RL
We perform functional grounding of LLMs' knowledge in BabyAI-Text
Language:Python 0 0
tokarev-i-v/LAPO
Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)
Language:Python 0 0
tokarev-i-v/LightZero
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
Language:Python 0 0
tokarev-i-v/LLaMA-O1
Large Reasoning Models
Language:Python
tokarev-i-v/llm.c
LLM training in simple, raw C/CUDA
Language:Cuda 0 0
tokarev-i-v/loss-of-plasticity
Demonstrations of Loss of Plasticity and Implementation of Continual Backpropagation
Language:Python 0 0
tokarev-i-v/MacroHFT
Language:Python 0 0
tokarev-i-v/MathBlackBox
Language:Python
tokarev-i-v/mctslib
Language:Jupyter Notebook 0 0
tokarev-i-v/models-at-home
tokarev-i-v/open-oasis
Inference script for Oasis 500M
tokarev-i-v/openr
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Language:Python
tokarev-i-v/quiet-star
Code for Quiet-STaR
tokarev-i-v/RethinkMCTS
tokarev-i-v/rStar
tokarev-i-v/ScaleQuest
We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.
Language:Python
tokarev-i-v/search-agents
Code for the paper 🌳 Tree Search for Language Model Agents
tokarev-i-v/SelfCorrectionLanguageModelTraining
Language:Python
tokarev-i-v/Super_MARIO
tokarev-i-v/TheArtofHPC_pdfs
All pdfs of Victor Eijkhout's Art of HPC books and courses
0 0
tokarev-i-v/torax
TORAX: Tokamak transport simulation in JAX
Language:Python 0 0
tokarev-i-v/uvadlc_notebooks
Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2022/Spring 2022
Language:Jupyter Notebook 0 0