Xlsean's Stars
Tencent/HunyuanDiT
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
google-research/scenic
Scenic: A Jax Library for Computer Vision Research and Beyond
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine (open-source Chinese conversational large language model)
megvii-research/Sobolev_INRs
[ECCV 2022] The official experimental code of "Sobolev Training for Implicit Neural Representations with Approximated Image Derivatives"
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Xlsean/detectron2
Detectron2 is FAIR's next-generation research platform for object detection and segmentation.
open-mmlab/mmdetection
OpenMMLab Detection Toolbox and Benchmark
Xlsean/PolarMask
Code for 'PolarMask: Single Shot Instance Segmentation with Polar Representation'