gordicaleksa

Flirting with LLMs. Tensor Core maximalist. If I say stupid stuff it's not me it's my prompt.

ex-DeepMind, ex-MicrosoftSan Francisco

Pinned Repositories

get-started-with-JAX
The purpose of this repo is to make it easy to get started with JAX, Flax, and Haiku. It contains my "Machine Learning with JAX" series of tutorials (YouTube videos and Jupyter Notebooks) as well as the content I found useful while learning about the JAX ecosystem.
Language:Jupyter Notebook751 9 0113
llm.c
LLM training in simple, raw C/CUDA
Language:Cuda5 0 00
Open-NLLB
Effort to open-source NLLB checkpoints.
Language:Python458 8 2645
pytorch-deepdream
PyTorch implementation of DeepDream algorithm (Mordvintsev et al.). Additionally I've included playground.py to help you better understand basic concepts behind the algo.
Language:Jupyter Notebook392 8 986
pytorch-GANs
My implementation of various GAN (generative adversarial networks) architectures like vanilla GAN (Goodfellow et al.), cGAN (Mirza et al.), DCGAN (Radford et al.), etc.
Language:Python382 12 158
pytorch-GAT
My implementation of the original GAT paper (Veličković et al.). I've additionally included the playground.py file for visualizing the Cora dataset, GAT embeddings, an attention mechanism, and entropy histograms. I've supported both Cora (transductive) and PPI (inductive) examples!
Language:Jupyter Notebook2.6k 47 14344
pytorch-learn-reinforcement-learning
A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.
Language:Python155 2 033
pytorch-neural-style-transfer
Reconstruction of the original paper on neural style transfer (Gatys et al.). I've additionally included reconstruction scripts which allow you to reconstruct only the content or the style of the image - for better understanding of how NST works.
Language:Python427 8 1289
pytorch-original-transformer
My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT pretrained models.
Language:Jupyter Notebook1k 30 9180
stable_diffusion_playground
Playing around with stable diffusion. Generated images are reproducible because I save the metadata and latent information. You can generate and then later interpolate between the images of your choice.
Language:Python206 6 923

gordicaleksa's Repositories

gordicaleksa/pytorch-GAT
My implementation of the original GAT paper (Veličković et al.). I've additionally included the playground.py file for visualizing the Cora dataset, GAT embeddings, an attention mechanism, and entropy histograms. I've supported both Cora (transductive) and PPI (inductive) examples!
Language:Jupyter Notebook2.6k 47 14344
gordicaleksa/pytorch-original-transformer
My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT pretrained models.
Language:Jupyter Notebook1k 30 9180
gordicaleksa/get-started-with-JAX
The purpose of this repo is to make it easy to get started with JAX, Flax, and Haiku. It contains my "Machine Learning with JAX" series of tutorials (YouTube videos and Jupyter Notebooks) as well as the content I found useful while learning about the JAX ecosystem.
Language:Jupyter Notebook751 9 0113
gordicaleksa/Open-NLLB
Effort to open-source NLLB checkpoints.
Language:Python458 8 2645
gordicaleksa/pytorch-neural-style-transfer
Reconstruction of the original paper on neural style transfer (Gatys et al.). I've additionally included reconstruction scripts which allow you to reconstruct only the content or the style of the image - for better understanding of how NST works.
Language:Python427 8 1289
gordicaleksa/pytorch-deepdream
PyTorch implementation of DeepDream algorithm (Mordvintsev et al.). Additionally I've included playground.py to help you better understand basic concepts behind the algo.
Language:Jupyter Notebook392 8 986
gordicaleksa/pytorch-GANs
My implementation of various GAN (generative adversarial networks) architectures like vanilla GAN (Goodfellow et al.), cGAN (Mirza et al.), DCGAN (Radford et al.), etc.
Language:Python382 12 158
gordicaleksa/stable_diffusion_playground
Playing around with stable diffusion. Generated images are reproducible because I save the metadata and latent information. You can generate and then later interpolate between the images of your choice.
Language:Python206 6 923
gordicaleksa/pytorch-learn-reinforcement-learning
A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.
Language:Python155 2 033
gordicaleksa/serbian-llm-eval
Serbian LLM Eval.
Language:Python96 5 17
gordicaleksa/pytorch-naive-video-neural-style-transfer
Create naive (no temporal loss) NST for videos with person segmentation. Just place your videos in data/, run and you get your stylized and segmented videos.
Language:Python81 6 29
gordicaleksa/OpenGemini
Effort to open-source 10.5 trillion parameter Gemini model.
17 2 0
gordicaleksa/gordicaleksa
GitHub's new feature: repo with the same name as your GitHub name initialized with README.md will show on your landing page!
12 3 06
gordicaleksa/slovenian-llm-eval
Slovenian LLM Eval.
Language:Python7 2 01
gordicaleksa/stable-diffusion
Language:Jupyter Notebook6 1 01
gordicaleksa/llm.c
LLM training in simple, raw C/CUDA
Language:Cuda5 0 00
gordicaleksa/Open-NLLB-stopes
A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) for the Open-NLLB effort.
Language:Python5 1 01
gordicaleksa/awesomeMLSys
An ML Systems Onboarding list
4 0 01
gordicaleksa/metaseq
Fork that goes with my YT video.
Language:Python4 2 0
gordicaleksa/streamlit_playground
Simple Streamlit app.
Language:Python4 2 0
gordicaleksa/fsdl-text-recognizer-2022
Source of the FSDL 2022 labs, which are at https://github.com/full-stack-deep-learning/fsdl-text-recognizer-2022-labs
Language:Jupyter Notebook3 1 0
gordicaleksa/jina
Cloud-native neural search framework for 𝙖𝙣𝙮 kind of data
Language:Python2 1 01
gordicaleksa/airoboros
Customizable implementation of the self-instruct paper.
Language:Python1 1 0
gordicaleksa/gpt-2-output-dataset
Dataset of GPT-2 outputs for research in detection, biases, and more
Language:Python1 1 0
gordicaleksa/axolotl
axolotl
Language:Python1 0
gordicaleksa/datasketch_threadsafe
Language:Python2 0
gordicaleksa/micrograd
The Autograd Engine that implements backpropagation
Language:Python0 0
gordicaleksa/mlp
The Multilayer Perceptron Language Model
Language:Python0 0
gordicaleksa/ngram
The n-gram Language Model
Language:C0 0
gordicaleksa/tensor
The Tensor (or Array)
Language:Python0 0