jamesbondzhou

hust

Pinned Repositories

AdvancedEAST
AdvancedEAST is an algorithm used for Scene image text detect, which is primarily based on EAST, and the significant improvement was also made, which make long text predictions more accurate.
Language:Python00
CHINESE-OCR
[python3.6] 运用tf实现自然场景文字检测,keras/pytorch实现ctpn+crnn+ctc实现不定长场景文字OCR识别
Language:Python00
CVinW_Readings
A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''
00
grounded-segment-any-parts
Grounded Segment Anything: From Objects to Parts
Language:Jupyter Notebook00
Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language:Jupyter Notebook00
GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Language:Python00
InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的可商用开源多模态对话模型
Language:Python00
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python00
objectdetection_script
一些关于目标检测的脚本的改进思路代码，详细请看readme.md
Language:Python00
Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
Language:Python00

jamesbondzhou's Repositories

jamesbondzhou/AdvancedEAST
AdvancedEAST is an algorithm used for Scene image text detect, which is primarily based on EAST, and the significant improvement was also made, which make long text predictions more accurate.
Language:Python00
jamesbondzhou/CHINESE-OCR
[python3.6] 运用tf实现自然场景文字检测,keras/pytorch实现ctpn+crnn+ctc实现不定长场景文字OCR识别
Language:Python00
jamesbondzhou/CVinW_Readings
A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''
00
jamesbondzhou/grounded-segment-any-parts
Grounded Segment Anything: From Objects to Parts
Language:Jupyter Notebook00
jamesbondzhou/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language:Jupyter Notebook00
jamesbondzhou/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Language:Python00
jamesbondzhou/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的可商用开源多模态对话模型
Language:Python00
jamesbondzhou/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python00
jamesbondzhou/objectdetection_script
一些关于目标检测的脚本的改进思路代码，详细请看readme.md
Language:Python00
jamesbondzhou/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
Language:Python00
jamesbondzhou/pytorch-fcn
PyTorch Implementation of Fully Convolutional Networks. (Training code to reproduce the original result is available.)
Language:Python00
jamesbondzhou/pytorch_ctpn
This is a pytorch implementation of CTPN(Detecting Text in Natural Image with Connectionist Text Proposal Network)
Language:Python0 0 00
jamesbondzhou/TextBoxes
TextBoxes: A Fast Text Detector with a Single Deep Neural Network
Language:C++0 0 00
jamesbondzhou/TextBoxes_plusplus
TextBoxes++: A Single-Shot Oriented Scene Text Detector
Language:C++0 0 00
jamesbondzhou/yolov5
YOLOv5 汉化版，保持官方同步更新
Language:Python00
jamesbondzhou/Segment-Everything-Everywhere-All-At-Once
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
jamesbondzhou/Video-LLaVA
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

jamesbondzhou

Pinned Repositories

AdvancedEAST

CHINESE-OCR

CVinW_Readings

grounded-segment-any-parts

Grounded-Segment-Anything

GroundingDINO

InternVL

LLaVA

objectdetection_script

Otter

jamesbondzhou's Repositories

jamesbondzhou/AdvancedEAST

jamesbondzhou/CHINESE-OCR

jamesbondzhou/CVinW_Readings

jamesbondzhou/grounded-segment-any-parts

jamesbondzhou/Grounded-Segment-Anything

jamesbondzhou/GroundingDINO

jamesbondzhou/InternVL

jamesbondzhou/LLaVA

jamesbondzhou/objectdetection_script

jamesbondzhou/Otter

jamesbondzhou/pytorch-fcn

jamesbondzhou/pytorch_ctpn

jamesbondzhou/TextBoxes

jamesbondzhou/TextBoxes_plusplus

jamesbondzhou/yolov5

jamesbondzhou/Segment-Everything-Everywhere-All-At-Once

jamesbondzhou/Video-LLaVA