Pinned Repositories
ChatRex
Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding
Rex-Thinker
Rex-Thinker: Grounded Object Referring via Chain-of-Thought Reasoning
RexSeek
[ICCV2025] Referring to any person or object given a natural language description. Code base for RexSeek and HumanRef Benchmark
T-Rex
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
CodeCookbook
Cookbook for Crafting Good Code
CTPN_CRNN_ChineseOCR_PyQt5
CTPN and CRNN based Chinese OCR, developed with PyQt5
Efficient-Deep-Learning
A bag of tricks to speed up your deep learning process
Text-Recognition-on-Cross-Domain-Datasets
Improved text recognition algorithms on different text domains such as scene text, handwriting, documents, Chinese/English, and even ancient books
Union14M
[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective
mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Mountchicken's Repositories
Mountchicken/Union14M
[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective
Mountchicken/Efficient-Deep-Learning
A bag of tricks to speed up your deep learning process
Mountchicken/Text-Recognition-on-Cross-Domain-Datasets
Improved text recognition algorithms on different text domains such as scene text, handwriting, documents, Chinese/English, and even ancient books
Mountchicken/CodeCookbook
Cookbook for Crafting Good Code
Mountchicken/CTPN_CRNN_ChineseOCR_PyQt5
CTPN and CRNN based Chinese OCR, developed with PyQt5
Mountchicken/Structured_Dreambooth_LoRA
Dreambooth (LoRA) with well-organized code structure. Naive adaptation from 🤗Diffusers.
Mountchicken/ResNet18-CIFAR10
ResNet18 on CIFAR10 reaches 95.09% accuracy on the test set
Mountchicken/ImageCaptioning-Attention-PyQt5
ImageCaptioning improved with an attention mechanism. Also a PyQt5 application
Mountchicken/Two-Stream-RNN-Pytorch
Modeling Temporal Dynamics and Spatial Configurations of Actions Using Two-Stream Recurrent Neural Networks
Mountchicken/Mountchicken.github.io
Mountchicken/academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Mountchicken/Aria
Codebase for Aria - an Open Multimodal Native MoE
Mountchicken/aryclenio
Mountchicken/DeepStudio
DeepStudio
Mountchicken/detectron2
Detectron2 is FAIR's next-generation platform for object detection, segmentation and other visual recognition tasks.
Mountchicken/External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
Mountchicken/Forest-Chorus
Forest Chorus game
Mountchicken/gpt-oss
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Mountchicken/GroundingDINO
The official implementation of "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Mountchicken/Image-Captioning-pytorch
An easy attempt at image captioning with Inception_V3 as the backbone. PyTorch-based, no attention used (may update later)
Mountchicken/lab-website-template
(Pre-release) An easy-to-use, flexible website template for labs, with automatic citations, GitHub tag imports, pre-built components, and more
Mountchicken/mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Mountchicken/MMOCR_tutorials
Jupyter notebook tutorials for MMOCR
Mountchicken/MobileSAM
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
Mountchicken/Mountchicken
Mountchicken/playground
A central hub for gathering and showcasing amazing projects that extend OpenMMLab with SAM and other exciting features.
Mountchicken/transferlearning
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.
Mountchicken/VisionLLM
VisionLLM Series
Mountchicken/yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
Mountchicken/YYDZ
The YYDZ (Yi Yan Ding Zhen / One Eye Ding Zhen) dataset