clam004

AI/ML engineer & physician interested in language models, reinforcement learning, medicine and neuroscience

care.coach · SF Bay Area

Pinned Repositories

adaptive-computation-time
The notebook connects the formulas used in the paper to the code that implements those formulas by implementing a training pipeline on a small but meaningful dataset
Language: HTML · 2 3 00
chat-transformer
A chatbot using the Vaswani transformer as its sequence-to-sequence module
Language: Jupyter Notebook · 21 3 12
General-Deep-Learning-NLP-Classifier
Template for multi-class classification with variable length sequences using Gated Recurrent Units
Language: Jupyter Notebook · 1 2 00
intro_continual_learning
This is a tutorial to connect the fundamental mathematics to a practical implementation addressing the continual learning problem of artificial intelligence
Language: Jupyter Notebook · 359 7 124
max_ent_irl
Maximum Entropy Inverse Reinforcement Learning - notes and tutorial for IRL using the principle of maximum entropy
Language: Jupyter Notebook · 3 2 00
minichatgpt
Annotated tutorial of the Hugging Face TRL repo for reinforcement learning from human feedback, connecting the PPO and GAE equations to the lines of code in the PyTorch implementation
Language: Jupyter Notebook · 18 2 01
proximalpolicyoptimization
Basic implementation of the PPO reinforcement learning algorithm on Lunar Lander
Language: Jupyter Notebook · 3 2 00
RL-Chat-pytorch
Reinforcement learning on an encoder-decoder GRU for chatbot dialogue generation
Language: Jupyter Notebook · 20 2 45
triton-ft-api
Tutorial on how to deploy a scalable autoregressive causal language model transformer using NVIDIA Triton server
Language: Python · 5 2 00
unsupervised-speech-representation-learning
This is an intuitive explanation of Representation Learning with Contrastive Predictive Coding, using code provided by jefflai108 that uses CPC to learn representations of sound files for speech recognition
Language: Jupyter Notebook · 10 3 20

clam004's Repositories

clam004/intro_continual_learning
This is a tutorial to connect the fundamental mathematics to a practical implementation addressing the continual learning problem of artificial intelligence
Language: Jupyter Notebook · 359 7 124
clam004/chat-transformer
A chatbot using the Vaswani transformer as its sequence-to-sequence module
Language: Jupyter Notebook · 21 3 12
clam004/minichatgpt
Annotated tutorial of the Hugging Face TRL repo for reinforcement learning from human feedback, connecting the PPO and GAE equations to the lines of code in the PyTorch implementation
Language: Jupyter Notebook · 18 2 01
clam004/triton-ft-api
Tutorial on how to deploy a scalable autoregressive causal language model transformer using NVIDIA Triton server
Language: Python · 5 2 00
clam004/KV-caching-toy-example
Language: Jupyter Notebook · 1 1 0
clam004/rlhf
Fine-tuning natural language generation using a reinforcement learning signal
Language: Jupyter Notebook · 1 2 0
clam004/AIRL
ADVERSARIAL INVERSE REINFORCEMENT LEARNING
Language: Jupyter Notebook · 2 0
clam004/argparse
Language: Python · 1 0
clam004/async-in-out
A practical show-it, explain-it, modify-it guide to asynchronous input/output programming in Python
Language: Jupyter Notebook · 1 0
clam004/azure-texting
Send and receive SMS using Python and Azure
1 0
clam004/azure-voting-app-redis
Language: Shell · 1 0
clam004/Basic_Algorithm_Exercises
Basic Algorithm Design Analysis Notebook for Teaching
Language: Jupyter Notebook · 2 0
clam004/clam004
Config files for my GitHub profile.
1 0
clam004/docker-pytorch-api
Deploying PyTorch as a REST API using Docker and FastAPI with CUDA support
Language: Jupyter Notebook · 2 0
clam004/foundRL
Foundational reinforcement learning concepts, with code and the corresponding equations side by side, applied to small but meaningful example problems
1 0
clam004/gen-data
a collection of synthetic data generating examples
Language: Jupyter Notebook · 1 0
clam004/k8s-fast
kubernetes-fastapi
Language: Python · 2 0
clam004/makemore
An autoregressive character-level language model for making more things
Language: Python · 0 0
clam004/meta-learned-memory
Language: Jupyter Notebook · 1 0
clam004/ML_MATH
repository for nuanced machine learning mathematical concepts
Language: Jupyter Notebook · 2 0
clam004/notebook_tutorials
Language: Jupyter Notebook · 1 0
clam004/pyzmqnotes
Learning 0mq with examples and notes from articles on the web
Language: Python · 0 0
clam004/rust-grpc-python-tonic
Minimal code example of passing data between Python and Rust via gRPC using tonic and protobuf
Language: Python · 1 0
clam004/rust-protobuf-pyo3-example
Language: Rust · 1 0
clam004/rust-split-example
Language: Rust · 1 0
clam004/rusty
The Rust Programming Language
Language: Jupyter Notebook · 1 0
clam004/seqGAN
A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)
Language: Jupyter Notebook · 0 0
clam004/sim-slurm-cluster-gpu
Docker local slurm cluster
Language: Dockerfile · 0 0
clam004/together-examples
Language: Jupyter Notebook · 1 0
clam004/trl
Train transformer language models with reinforcement learning.
Language: Python · 0 0