EverestRs's Stars
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones, including train, eval, inference, and export scripts, and pretrained weights -- ResNet, ResNeXt, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
opendatalab/MinerU
A high-quality, one-stop open-source data extraction tool for converting PDF to Markdown and JSON.
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
optuna/optuna
A hyperparameter optimization framework
AILab-CVC/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
huawei-noah/Efficient-AI-Backbones
Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.
DingXiaoH/RepVGG
RepVGG: Making VGG-style ConvNets Great Again
HobbitLong/RepDistiller
[ICLR 2020] Contrastive Representation Distillation (CRD), and benchmark of recent knowledge distillation methods
facebookresearch/ConvNeXt-V2
Code release for ConvNeXt V2 model
ZhugeKongan/torch-template-for-deep-learning
PyTorch implementations of a large number of classical backbone CNNs, data augmentation, loss functions, attention, visualization, and some common algorithms.
AILab-CVC/UniRepLKNet
[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
DingXiaoH/RepLKNet-pytorch
Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs (CVPR 2022)
AlibabaResearch/efficientteacher
A Supervised and Semi-Supervised Object Detection Library for YOLO Series
apple/ml-mobileone
This repository contains the official implementation of the research paper, "An Improved One millisecond Mobile Backbone".
optuna/optuna-examples
Examples for https://github.com/optuna/optuna
optuna/optuna-dashboard
Real-time Web Dashboard for Optuna.
ZhugeKongan/Attention-mechanism-implementation
Self-attention, Non-local, SE, SK, CBAM, DANet
Jiawei-Yang/Denoising-ViT
This is the official code release for our work, Denoising Vision Transformers.
opendatalab/UniMERNet
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
ZHUANGHP/Analytic-continual-learning
This repository hosts the analytic continual learning series, including Analytic Class-Incremental Learning (ACIL), Gaussian Kernel Embedded Analytic Learning (GKEAL), Dual-Stream Analytic Learning (DS-AL), etc.
ChengpengChen/RepGhost
RepGhost: A Hardware-Efficient Ghost Module via Re-parameterization
wangwen-whu/WTW-Dataset
This is an official implementation for the WTW Dataset in "Parsing Table Structures in the Wild" on table detection and table structure recognition.
MathamPollard/awesome-table-structure-recognition
A curated list of awesome table structure recognition (TSR) research, including models, papers, datasets, and code. Continuously updated.
apple/ml-autofocusformer
This is an official implementation for "AutoFocusFormer: Image Segmentation off the Grid".
xinghaochen/SLAB
[ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization"
laonafahaodange/RepRFN
Reparameterized Residual Feature Network For Lightweight Image Super-Resolution
whn09/table_structure_recognition
Table detection (TD) and table structure recognition (TSR) using YOLOv5/YOLOv8, achieving the same or even better results than Table Transformer (TATR) with smaller models.
suous/RepNeXt
RepNeXt: A Fast Multi-Scale CNN using Structural Reparameterization
liwei109/SepViT
SepViT: Separable Vision Transformer