clam004
AI/ML engineer & physician interested in language models, reinforcement learning, medicine and neuroscience
care.coachSF bay area
Pinned Repositories
adaptive-computation-time
The notebook connects the formulas used in the paper to the code that implements those formulas by implementing a training pipeline on a small but meaningful dataset
chat-transformer
A chatbot using the Vaswani transformer as it's sequence-to-sequence module
General-Deep-Learning-NLP-Classifier
Template for multi-class classification with variable length sequences using Gated Recurrent Units
intro_continual_learning
This is a tutorial to connect the fundamental mathematics to a practical implementation addressing the continual learning problem of artificial intelligence
max_ent_irl
Maximum Entropy Inverse Reinforcement Learning - notes and tutorial for IRL using the principle of maximum entropy
minichatgpt
annotated tutorial of the huggingface TRL repo for reinforcement learning from human feedback connecting equations from PPO and GAE to the lines of code in the pytorch implementation
proximalpolicyoptimization
basic implementation of PPO reinforcement learning algorithm on lunar lander
RL-Chat-pytorch
reinforcement learning on a encoder-decoder GRU for chatbot dialogue generation
triton-ft-api
tutorial on how to deploy a scalable autoregressive causal language model transformer using nvidia triton server
unsupervised-speech-representation-learning
This is a intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that uses CPC to learn representations of sound files for the purpose of speech recognition
clam004's Repositories
clam004/intro_continual_learning
This is a tutorial to connect the fundamental mathematics to a practical implementation addressing the continual learning problem of artificial intelligence
clam004/chat-transformer
A chatbot using the Vaswani transformer as it's sequence-to-sequence module
clam004/minichatgpt
annotated tutorial of the huggingface TRL repo for reinforcement learning from human feedback connecting equations from PPO and GAE to the lines of code in the pytorch implementation
clam004/triton-ft-api
tutorial on how to deploy a scalable autoregressive causal language model transformer using nvidia triton server
clam004/KV-caching-toy-example
clam004/rlhf
fine tuning natural language generation using a reinforcement learning signal
clam004/AIRL
ADVERSARIAL INVERSE REINFORCEMENT LEARNING
clam004/argparse
clam004/async-in-out
a practical show it, explain it, modify it, guide to asynchronous input/output programming in python
clam004/azure-texting
send and receive SMS using python and Azure
clam004/azure-voting-app-redis
clam004/Basic_Algorithm_Exercises
Basic Algorithm Design Analysis Notebook for Teaching
clam004/clam004
Config files for my GitHub profile.
clam004/docker-pytorch-api
Deploying PyTorch as a RESTAPI using Docker and FastAPI with CUDA support
clam004/foundRL
foundational reinforcement learning concepts with code and corresponding equations side-by-side applied to small but meaningful example problems
clam004/gen-data
a collection of synthetic data generating examples
clam004/k8s-fast
kubernetes-fastapi
clam004/makemore
An autoregressive character-level language model for making more things
clam004/meta-learned-memory
clam004/ML_MATH
repository for nuanced machine learning mathematical concepts
clam004/notebook_tutorials
clam004/pyzmqnotes
Learning 0mq with examples and notes from articles on the web
clam004/rust-grpc-python-tonic
minimal code example of passing data between python and rust via gRPC using tonic and protobuf
clam004/rust-protobuf-pyo3-example
clam004/rust-split-example
clam004/rusty
The Rust Programming Language
clam004/seqGAN
A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)
clam004/sim-slurm-cluster-gpu
Docker local slurm cluster
clam004/together-examples
clam004/trl
Train transformer language models with reinforcement learning.