Pinned Repositories
alphazero_chess
My open-source, modular implementation of AlphaZero, MuZero, and other algorithms on chess and tic-tac-toe environments
Anti_AI
An agent that acts as a second layer of cognition against other AI. Basically a firewall for your brain
ChessGPT
ChessGPT - Bridging Policy Learning and Language Modeling
Contilearn
To make LLMs learn on the go
Eureka_vivek
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models"
Gym
HusePricePrediction
RL_code_implementations
vieveks.github.io
personal website
Vishanu
A general-purpose cyber virus and antivirus
vieveks's Repositories
vieveks/Anti_AI
An agent that acts as a second layer of cognition against other AI. Basically a firewall for your brain
vieveks/RL_code_implementations
vieveks/vieveks.github.io
personal website
vieveks/Vishanu
A general-purpose cyber virus and antivirus
vieveks/alphazero_chess
My open-source, modular implementation of AlphaZero, MuZero, and other algorithms on chess and tic-tac-toe environments
vieveks/ChessGPT
ChessGPT - Bridging Policy Learning and Language Modeling
vieveks/Contilearn
To make LLMs learn on the go
vieveks/Eureka_vivek
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models"
vieveks/Gym
vieveks/HusePricePrediction
vieveks/langchain
⚡ Building applications with LLMs through composability ⚡
vieveks/minijax
Code for different LLM architectures in JAX and Haiku
vieveks/nanoGPT-understanding-
The simplest, fastest repository for training/finetuning medium-sized GPTs.
vieveks/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
vieveks/pingu
Your personal robotic home assistant
vieveks/pytorch-alpha-zero
To try out AlphaZero training and understand the algorithm
vieveks/reasoning_agent
vieveks/reinforcement-learning_dennybritz
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
vieveks/rl_agent_trials
vieveks/Self_driving_game
vieveks/tf_agents
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
vieveks/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
vieveks/torch_rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
vieveks/tradez
trading platform
vieveks/Unlearning
Different algorithms to achieve unlearning
vieveks/vastai_temp
temporary repo