fuzzypants123

fuzzypants123's Stars

babysor/MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
Language:Python34k 306 8735.1k
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python29.9k 265 1k3.5k
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language:Python20.5k 163 1452k
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language:Python18.6k 293 1.3k2.4k
Delgan/loguru
Python logging made (stupidly) simple
Language:Python18.3k 138 957671
WongKinYiu/yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
Language:Python8.2k 53 3961.2k
THUDM/CodeGeeX2
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Language:Python7.3k 63 227511
facebookresearch/dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
Language:Python5.9k 67 242869
levihsu/OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Language:Python4.7k 72 171687
yformer/EfficientSAM
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Language:Jupyter Notebook1.8k 23 62140
hustvl/YOLOP
You Only Look Once for Panopitic Driving Perception.（MIR2022）
Language:Python1.8k 30 195404
FoundationVision/GLEE
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
Language:Python922 42 2672
sangyun884/HR-VITON
Official PyTorch implementation for the paper High-Resolution Virtual Try-On with Misalignment and Occlusion-Handled Conditions (ECCV 2022).
Language:Python797 15 90165
Skallwar/suckit
Suck the InTernet
Language:Rust713 10 8140
chongzhou96/EdgeSAM
Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"
Language:Jupyter Notebook701 15 2031
AnythingInAnyScene/anything_in_anyscene
Language:Python312 21 533
Executedone/Chinese-FastSpeech2
基于标贝数据继续训练，同时对原本的FastSpeech2模型做了改进，引入了韵律表征以及韵律预测模块，使中文发音更生动且富有节奏
Language:Python219 7 2035
SoccerNet/sn-gamestate
SoccerNet Game State Reconstruction: End-to-End Athlete Tracking and Identification on a Minimap (CVPR24 - CVSports workshop)
Language:Python170 13 520
microsoft/SceneLandmarkLocalization
Source code and data for papers "Improved Scene Landmark Detection for Camera Localization" (3DV 2024) and "Learning to Detect Scene Landmarks for Camera Localization" (CVPR 2024).
Language:Python153 17 917
Traffic-X/ViT-CoMer
Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions.
Language:Python143 4 147
sming256/OpenTAD
OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.
Language:Python75 3 73
ztsrxh/RoadBEV
Codes for RoadBEV: road surface reconstruction in Bird's Eye View
Language:Python707
fudan-zvg/RoadNet
[ICCV2023 Oral] RoadNetworkTRansformer & [AAAI 2024] LaneGraph2Seq
Language:Python55 9 114
MaySummerWind/DocHunt
🎉 汇聚并整理飞书等公开分享文档链接，解决没有官方全局搜索痛点，让知识持续传递。A list cool, beauty, interesting doc of feishu.
52 1 04
mengtan00/SA-BEV
This is the implementation of the paper "SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection" (ICCV 2023)
Language:Python52 3 35
ChiShengChen/ResVMamba
The official repository implement of Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning
Language:Python36 2 12
TUMFTM/GMMCalib
LiDAR-to-LiDAR Calibration
Language:Python24 1 13
CaiYingFeng/VRSO
Language:Python230
vincentqqb/PriorLane
Language:Python19 1 32
SAIC-Vision/WS-3D-Lane
[ICRA 2023] WS-3D-Lane: Weakly Supervised 3D Lane Detection with 2D Lane Labels
Language:Python114