vieveks

Pinned Repositories

alphazero_chess
My opensource modular implementation of alphazero, muzero and other algos on chess and tic tac toe environments
Language:Python00
Anti_AI
Agent that acts like a second layer of cognition against other ai. Basically a firewall to your brain
Language:Jupyter Notebook10
ChessGPT
ChessGPT - Bridging Policy Learning and Language Modeling
Language:Python00
Contilearn
to make LLMs learn at the go
Language:Python00
Eureka_vivek
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models"
Language:Jupyter Notebook00
Gym
Language:Python00
HusePricePrediction
Language:Jupyter Notebook00
RL_code_implementations
Language:Jupyter Notebook10
vieveks.github.io
personal website
Language:HTML10
Vishanu
A general purpose cyber virus and anti virus
Language:Python10

vieveks's Repositories

vieveks/Anti_AI
Agent that acts like a second layer of cognition against other ai. Basically a firewall to your brain
Language:Jupyter Notebook10
vieveks/RL_code_implementations
Language:Jupyter Notebook10
vieveks/vieveks.github.io
personal website
Language:HTML10
vieveks/Vishanu
A general purpose cyber virus and anti virus
Language:Python10
vieveks/alphazero_chess
My opensource modular implementation of alphazero, muzero and other algos on chess and tic tac toe environments
Language:Python00
vieveks/ChessGPT
ChessGPT - Bridging Policy Learning and Language Modeling
Language:Python00
vieveks/Contilearn
to make LLMs learn at the go
Language:Python00
vieveks/Eureka_vivek
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models"
Language:Jupyter Notebook00
vieveks/Gym
Language:Python00
vieveks/HusePricePrediction
Language:Jupyter Notebook00
vieveks/langchain
⚡ Building applications with LLMs through composability ⚡
vieveks/minijax
codes for different llm architectures in jax and haiku
Language:Jupyter Notebook
vieveks/nanoGPT-understanding-
The simplest, fastest repository for training/finetuning medium-sized GPTs.
vieveks/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
vieveks/pingu
Your personal robotic home assistant
Language:Python1
vieveks/pytorch-alpha-zero
to try out alphazero training and understand the algorithm
Language:Python
vieveks/reasoning_agent
vieveks/reinforcement-learning_dennybritz
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
vieveks/rl_agent_trials
Language:Python
vieveks/Self_driving_game
Language:Python
vieveks/tf_agents
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
vieveks/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
Language:Python
vieveks/torch_rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
vieveks/tradez
trading platform
Language:Python
vieveks/Unlearning
Different algorithms to achieve unlearning
Language:Jupyter Notebook
vieveks/vastai_temp
temporary repo
Language:Jupyter Notebook