Pinned Repositories
annotated_deep_learning_paper_implementations
🧠 Implementations/tutorials of deep learning papers with side-by-side notes; including transformers (original, xl, switch, feedback), optimizers(adam, radam, adabelief), gans(dcgan, cyclegan, stylegan2), reinforcement learning (ppo, dqn), capsnet, sketch-rnn, etc.
BTTR
Official implementation for ICDAR 2021 paper "Handwritten Mathematical Expression Recognition with Bidirectionally Trained Transformer"
coding-interview-university
A complete computer science study plan to become a software engineer.
coursera-deep-learning-specialization
Notes, programming assignments and quizzes from all courses within the Coursera Deep Learning specialization offered by deeplearning.ai: (i) Neural Networks and Deep Learning; (ii) Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization; (iii) Structuring Machine Learning Projects; (iv) Convolutional Neural Networks; (v) Sequence Models
cracking-the-data-science-interview
A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep
deep-homography
Implementation of Deep Homography in Keras
signature-recognition-1
A tensorflow siamese network implementation. Illustrated using singature recognition/identification.
Stock_Analysis_For_Quant
Different Types of Stock Analysis in Python, R, Matlab, Excel, Power BI
Table-detection-Mask_RCNN
Table detection has been an interesting and challenging problem in the field of document analyses. Object detection algorithms such as Faster-RCNN has been exploited so many times to detect tables in the documents. This is just another experiment but with a different architecture which is Mask_RCNN.
WordDetectorNN
Detect handwritten words (neural network based).
kapitsa2811's Repositories
kapitsa2811/CLEval
CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks
kapitsa2811/StackMix-OCR
kapitsa2811/academic
All academic activities
kapitsa2811/Autonomous-car-detection-using-YOLO-model
kapitsa2811/awesome-ocr-1
Links to awesome OCR projects
kapitsa2811/clip-retrieval
Easily compute clip embeddings and build a clip retrieval system with them
kapitsa2811/CoMat
[Neurips 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
kapitsa2811/conformer_ocr
Transformer OCR is a Optical Character Recognition tookit built for researchers working on both OCR for both Vietnamese and English. This project only focused on variants of vanilla Transformer (Conformer) and Feature Extraction (CNN-based approach).
kapitsa2811/DiffFashion
kapitsa2811/DiffMatch
Official implementation of "Diffusion Model for Dense Matching" (ICLR'24 Oral)
kapitsa2811/DreamMatcher
Official implementation of "DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization" (CVPR 2024)
kapitsa2811/glide-text2im
GLIDE: a diffusion-based text-conditional image synthesis model
kapitsa2811/GRIF-DM
Official Implementation for ECAI 2024 paper "GRIF-DM: Generation of Rich Impression Fonts using Diffusion Models"
kapitsa2811/InjectFusion_official
kapitsa2811/InST
Official implementation of the paper “Inversion-Based Style Transfer with Diffusion Models” (CVPR 2023)
kapitsa2811/kraken
OCR engine for all the languages
kapitsa2811/Magnet
Official Implementation of "Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function" [NeurIPS 2024]
kapitsa2811/MIGC
[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)
kapitsa2811/OCR-model
An easy-to-run OCR model pipeline based on CRNN and CTC loss
kapitsa2811/PEAN
PEAN: A Diffusion-Based Prior-Enhanced Attention Network for Scene Text Image Super-Resolution (ACMMM 2024)
kapitsa2811/ProSpect
Official implementation of the paper "ProSpect: Prompt Spectrum for Attribute-Aware Personalization of Diffusion Models"(SIGGRAPH Asia 2023)
kapitsa2811/ResNet-101T-for-HCR
kapitsa2811/RIVAL
[NeurIPS 2023 Spotlight] Real-World Image Variation by Aligning Diffusion Inversion Chain
kapitsa2811/ru-dalle
Generate images from texts. In Russian
kapitsa2811/StyleID
[CVPR 2024 Highlight] Style Injection in Diffusion: A Training-free Approach for Adapting Large-scale Diffusion Models for Style Transfer
kapitsa2811/TensorFlow-2.x-YOLOv3
YOLOv3 implementation in TensorFlow 2.3.1
kapitsa2811/Treasure-of-Transformers
💁 Awesome Treasure of Transformers Models for Natural Language processing contains papers, videos, blogs, official repo along with colab Notebooks. 🛫☑️
kapitsa2811/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
kapitsa2811/VQGAN-CLIP
Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.
kapitsa2811/x-clip
A concise but complete implementation of CLIP with various experimental improvements from recent papers