Pinned Repositories
3D-PointCloud
Papers and Datasets about Point Cloud.
AdvancedRAG
Advanced Retrieval-Augmented Generation (RAG) through practical notebooks, using the power of the Langchain, OpenAI GPTs ,META LLAMA3 , Agents.
Amharic_OCR
Amharic OCR based on MMOCR
arxiv-daily
AttentionOCR
Scene text recognition
Awesome-LLM-Healthcare
The paper list of the review on LLMs in medicine - "Large Language Models Illuminate a Progressive Pathway to Artificial Healthcare Assistant: A Review".
Awesome-LLM-repos
Awesome-LLM: a curated list of Large Language Model
bevfusion
BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
ETHIOPIC-Datasets
HUST-ASTD
dikubab's Repositories
dikubab/3D-PointCloud
Papers and Datasets about Point Cloud.
dikubab/Amharic_OCR
Amharic OCR based on MMOCR
dikubab/ETHIOPIC-Datasets
dikubab/HUST-ASTD
dikubab/arxiv-daily
dikubab/bevfusion
BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
dikubab/CAPE
(CVPR2023) CAPE: Camera View Position Embedding for Multi-View 3D Object Detection
dikubab/DB
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
dikubab/DeepInteraction
DeepInteraction
dikubab/ICDAR2019-ArT-Recognition-Alchemy
PKU Team Zero's code for participation in ICDAR2019 ArT Recognition track (Champion)
dikubab/InternGPT
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)
dikubab/jinchenji_MaskTextSpotter
dikubab/Korean-License-Plate-Generator-1
Generating Korean License Plates with YOLO format labels
dikubab/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
dikubab/MaskTextSpotterV3
The code of "Mask TextSpotter v3: Segmentation Proposal Network for Robust Scene Text Spotting"
dikubab/MASTER-pytorch
Code for the paper "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021)
dikubab/MASTER-TF
MASTER
dikubab/mindocr
A toolbox of OCR models, algorithms, and pipelines based on MindSpore
dikubab/MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
dikubab/OCR-models-PaddlePaddle
Recent OCR and related works on PaddlePaddle 2.0
dikubab/Ocr_Application
dikubab/Offline-Chinese-Handwriting-Text-Page-Spotter-with-Text-Kernel
Robust End-to-End Offline Chinese Handwriting Text Page Spotter with Text Kernel
dikubab/pan_pp.pytorch
Official implementations of PSENet, PAN and PAN++.
dikubab/parseq
Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)
dikubab/PICK-pytorch
Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICPR 2020)
dikubab/PytorchOCR
基于Pytorch的OCR工具库,支持常用的文字检测和识别算法
dikubab/synthtiger
Official implementation of SynthTIGER (Synthetic Text Image GEneratoR) ICDAR 2021
dikubab/TESTR
(CVPR 2022) Text Spotting Transformers
dikubab/Total-Text-Dataset
Total Text Dataset. It consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one of a kind.
dikubab/vedastr
A scene text recognition toolbox based on PyTorch