Pinned Repositories
-Revisiting-Reverse-Distillation
(CVPR 2023) Revisiting Reverse Distillation for Anomaly Detection
.vimrc
abcnet_custom_dataset
ABCNet标注格式数据集制作,将ICDAR15转为ABCNet标注格式
AE_TextSpotter
AIOZ-GDANCE
AIOZ-GDANCE: a large-scale dataset & baseline for music-driven group dance generation. (CVPR 2023)
AndroidStudyCode
关于Android的一些原理学习和代码实现
attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Attention_ocr.pytorch
This repository implements the the encoder and decoder model with attention model for OCR
LaTeX_OCR
:gem: 数学公式识别
torchOCR
garspace's Repositories
garspace/torchOCR
garspace/-Revisiting-Reverse-Distillation
(CVPR 2023) Revisiting Reverse Distillation for Anomaly Detection
garspace/.vimrc
garspace/AIOZ-GDANCE
AIOZ-GDANCE: a large-scale dataset & baseline for music-driven group dance generation. (CVPR 2023)
garspace/Attention_ocr.pytorch
This repository implements the the encoder and decoder model with attention model for OCR
garspace/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
garspace/Bailando
音乐驱动舞蹈: CVPR 2022 paper "Bailando: 3D dance generation via Actor-Critic GPT with Choreographic Memory"
garspace/ChatRoom
微信小程序 在线聊天
garspace/DeepSolo
The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Text Spotting"
garspace/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
garspace/detrex
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
garspace/dockerfile
garspace/groundingLMM
Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
garspace/LLM-Tuning
Tuning LLMs with no tears💦, sharing LLM-tools with love❤️.
garspace/MaskTextSpotterV3
The code of "Mask TextSpotter v3: Segmentation Proposal Network for Robust Scene Text Spotting"
garspace/Mini-DALLE3
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models
garspace/mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
garspace/NeteaseMusicWxMiniApp
仿网易云音乐APP的微信小程序
garspace/Orion
Orion-14B is a family of models includes a 14B foundation LLM, and a series of models: a chat model, a long context model, a quantized model, a RAG fine-tuned model, and an Agent fine-tuned model. Orion-14B 系列模型包括一个具有140亿参数的多语言基座大模型以及一系列相关的衍生模型,包括对话模型,长文本模型,量化模型,RAG微调模型,Agent微调模型等。
garspace/parseq
Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)
garspace/RRPN_plusplus
RRPN++: Guidance Towards More Accurate Scene Text Detection
garspace/RT-DETR
rtdetr
garspace/sam-hq
Segment Anything in High Quality [NeurIPS 2023]
garspace/sampleQAT
Inference of quantization aware trained networks using TensorRT
garspace/SLOPER4D
SLOPER4D: A Scene-Aware Dataset for Global 4D Human Pose Estimation in Urban Environments (CVPR2023)
garspace/TensorRT-Alpha
🔥🔥🔥TensorRT-Alpha supports YOLOv8、YOLOv7、YOLOv6、YOLOv5、YOLOv4、v3、YOLOX、YOLOR...🚀🚀🚀CUDA IS ALL YOU NEED.🍎🍎🍎It also supports end2end CUDA C acceleration and multi-batch inference.
garspace/Torch-Pruning
[CVPR-2023] Towards Any Structural Pruning; LLMs / Diffusion / YOLOv8 / CNNs / Transformers
garspace/TPSNet
garspace/trocr-chinese
transformers ocr for chinese
garspace/vedo
A python module for scientific analysis of 3D data based on VTK and Numpy