joanrod
PhD student at Mila, ÉTS and ServiceNow Research in Montreal, Canada. I work on AI and deep learning projects
ServiceNow ResearchMontreal
Pinned Repositories
awesome-tips
CLIP
Contrastive Language-Image Pretraining
CodeGen
CodeGen is an open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
ControlNet
Let us control diffusion models!
cross-modal-retrieval-with-triplet-network
Text-to-Image and Image-to-Text model retrieval
figure-diffusion
Generating figures from research papers, using textual captions from the paper.
galai
Model API for GALACTICA
ocr-vqgan
OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Perceptual loss for clear text-within-image generation. Fork from VQGAN in CompVis/taming-transformers
paper2figure-dataset
Pipeline to create Paper2Fig dataset, a dataset for text-to-image generation from research papers and figures (e.g., diagrams of architectures or methods in fields like Machine Learning or Computer Vision)
star-vector
joanrod's Repositories
joanrod/star-vector
joanrod/ocr-vqgan
OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Perceptual loss for clear text-within-image generation. Fork from VQGAN in CompVis/taming-transformers
joanrod/figure-diffusion
Generating figures from research papers, using textual captions from the paper.
joanrod/paper2figure-dataset
Pipeline to create Paper2Fig dataset, a dataset for text-to-image generation from research papers and figures (e.g., diagrams of architectures or methods in fields like Machine Learning or Computer Vision)
joanrod/awesome-tips
joanrod/galai
Model API for GALACTICA
joanrod/CLIP
Contrastive Language-Image Pretraining
joanrod/CodeGen
CodeGen is an open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
joanrod/ControlNet
Let us control diffusion models!
joanrod/cross-modal-retrieval-with-triplet-network
Text-to-Image and Image-to-Text model retrieval
joanrod/deforum-stable-diffusion
joanrod/k-diffusion
Karras et al. (2022) diffusion models for PyTorch
joanrod/M3-Project
joanrod/M5-Visual-Recognition
joanrod/tracknet
TrackNet: A Triplet metric-based method for Multi-Target Multi-Camera Vehicle Tracking
joanrod/UPF-Hand-Written-Text-Recognition
joanrod/gigagan-pytorch
Implementation of GigaGAN, new SOTA GAN out of Adobe
joanrod/joanrod
joanrod/joanrod.github.io
joanrod/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
joanrod/Megatron-LM
Ongoing research training transformer models at scale
joanrod/moviepy
Video editing with Python
joanrod/open_clip
An open source implementation of CLIP.
joanrod/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
joanrod/torch-fidelity
High-fidelity performance metrics for generative models in PyTorch
joanrod/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
joanrod/v-diffusion-pytorch
v objective diffusion inference code for PyTorch.
joanrod/vdm