gordicaleksa
Flirting with LLMs. Tensor Core maximalist. If I say stupid stuff it's not me it's my prompt.
ex-DeepMind, ex-MicrosoftBelgrade, Serbia -> London -> SF? :)
Pinned Repositories
get-started-with-JAX
The purpose of this repo is to make it easy to get started with JAX, Flax, and Haiku. It contains my "Machine Learning with JAX" series of tutorials (YouTube videos and Jupyter Notebooks) as well as the content I found useful while learning about the JAX ecosystem.
llm.c
LLM training in simple, raw C/CUDA
Open-NLLB
Effort to open-source NLLB checkpoints.
pytorch-deepdream
PyTorch implementation of DeepDream algorithm (Mordvintsev et al.). Additionally I've included playground.py to help you better understand basic concepts behind the algo.
pytorch-GANs
My implementation of various GAN (generative adversarial networks) architectures like vanilla GAN (Goodfellow et al.), cGAN (Mirza et al.), DCGAN (Radford et al.), etc.
pytorch-GAT
My implementation of the original GAT paper (Veličković et al.). I've additionally included the playground.py file for visualizing the Cora dataset, GAT embeddings, an attention mechanism, and entropy histograms. I've supported both Cora (transductive) and PPI (inductive) examples!
pytorch-learn-reinforcement-learning
A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.
pytorch-neural-style-transfer
Reconstruction of the original paper on neural style transfer (Gatys et al.). I've additionally included reconstruction scripts which allow you to reconstruct only the content or the style of the image - for better understanding of how NST works.
pytorch-original-transformer
My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT pretrained models.
stable_diffusion_playground
Playing around with stable diffusion. Generated images are reproducible because I save the metadata and latent information. You can generate and then later interpolate between the images of your choice.
gordicaleksa's Repositories
gordicaleksa/pytorch-GAT
My implementation of the original GAT paper (Veličković et al.). I've additionally included the playground.py file for visualizing the Cora dataset, GAT embeddings, an attention mechanism, and entropy histograms. I've supported both Cora (transductive) and PPI (inductive) examples!
gordicaleksa/pytorch-original-transformer
My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT pretrained models.
gordicaleksa/get-started-with-JAX
The purpose of this repo is to make it easy to get started with JAX, Flax, and Haiku. It contains my "Machine Learning with JAX" series of tutorials (YouTube videos and Jupyter Notebooks) as well as the content I found useful while learning about the JAX ecosystem.
gordicaleksa/Open-NLLB
Effort to open-source NLLB checkpoints.
gordicaleksa/pytorch-neural-style-transfer
Reconstruction of the original paper on neural style transfer (Gatys et al.). I've additionally included reconstruction scripts which allow you to reconstruct only the content or the style of the image - for better understanding of how NST works.
gordicaleksa/pytorch-GANs
My implementation of various GAN (generative adversarial networks) architectures like vanilla GAN (Goodfellow et al.), cGAN (Mirza et al.), DCGAN (Radford et al.), etc.
gordicaleksa/pytorch-deepdream
PyTorch implementation of DeepDream algorithm (Mordvintsev et al.). Additionally I've included playground.py to help you better understand basic concepts behind the algo.
gordicaleksa/stable_diffusion_playground
Playing around with stable diffusion. Generated images are reproducible because I save the metadata and latent information. You can generate and then later interpolate between the images of your choice.
gordicaleksa/pytorch-learn-reinforcement-learning
A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.
gordicaleksa/serbian-llm-eval
Serbian LLM Eval.
gordicaleksa/pytorch-naive-video-neural-style-transfer
Create naive (no temporal loss) NST for videos with person segmentation. Just place your videos in data/, run and you get your stylized and segmented videos.
gordicaleksa/OpenGemini
Effort to open-source 10.5 trillion parameter Gemini model.
gordicaleksa/gordicaleksa
GitHub's new feature: repo with the same name as your GitHub name initialized with README.md will show on your landing page!
gordicaleksa/slovenian-llm-eval
Slovenian LLM Eval.
gordicaleksa/stable-diffusion
gordicaleksa/llm.c
LLM training in simple, raw C/CUDA
gordicaleksa/Open-NLLB-stopes
A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) for the Open-NLLB effort.
gordicaleksa/streamlit_playground
Simple Streamlit app.
gordicaleksa/awesomeMLSys
An ML Systems Onboarding list
gordicaleksa/metaseq
Fork that goes with my YT video.
gordicaleksa/fsdl-text-recognizer-2022
Source of the FSDL 2022 labs, which are at https://github.com/full-stack-deep-learning/fsdl-text-recognizer-2022-labs
gordicaleksa/jina
Cloud-native neural search framework for 𝙖𝙣𝙮 kind of data
gordicaleksa/airoboros
Customizable implementation of the self-instruct paper.
gordicaleksa/axolotl
axolotl
gordicaleksa/datasketch_threadsafe
gordicaleksa/gpt-2-output-dataset
Dataset of GPT-2 outputs for research in detection, biases, and more
gordicaleksa/micrograd
The Autograd Engine that implements backpropagation
gordicaleksa/mlp
The Multilayer Perceptron Language Model
gordicaleksa/ngram
The n-gram Language Model
gordicaleksa/tensor
The Tensor (or Array)