Darius-H's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
d2l-ai/d2l-zh
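Dive into Deep Learning (《动手学深度学习》): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.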
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
open-mmlab/mmdetection
OpenMMLab Detection Toolbox and Benchmark
huggingface/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Loyalsoldier/clash-rules
🦄️ 🎃 👻 Clash Premium rule sets (RULE-SET), compatible with ClashX Pro, Clash for Windows, and other clients based on the Clash Premium core.
serengil/deepface
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
jacobgil/pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
AIGC-Audio/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
nerfstudio-project/nerfstudio
A collaboration friendly studio for NeRFs
curlconverter/curlconverter
Transpile curl commands into Python, JavaScript and 27 other languages
kuangliu/pytorch-cifar
95.47% on CIFAR10 with PyTorch
open-mmlab/mmcv
OpenMMLab Computer Vision Foundation
tonquer/picacg-qt
PicACG (哔咔漫画) comic PC client (Windows, Linux, macOS)
facebookresearch/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in PyTorch
SwinTransformer/Video-Swin-Transformer
This is an official implementation for "Video Swin Transformers".
open-mmlab/mmhuman3d
OpenMMLab 3D Human Parametric Model Toolbox and Benchmark
Text-to-Audio/Make-An-Audio
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
breezedeus/CnSTD
CnSTD: a Python 3 package based on PyTorch/MXNet for Chinese/English Scene Text Detection, Mathematical Formula Detection (MFD), and Layout Analysis
shivammehta25/Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
tijiang13/InstantAvatar
InstantAvatar: Learning Avatars from Monocular Video in 60 Seconds (CVPR 2023)
HongwenZhang/PyMAF-X
[TPAMI 2023] PyMAF-X: Towards Well-aligned Full-body Model Regression from Monocular Images
XinhaoMei/WavCaps
This repository contains metadata for the WavCaps dataset and code for downstream tasks.
pai4451/ML2021
My coursework for Machine Learning (2021 Spring) at National Taiwan University (NTU)
Zain-Jiang/Dict-TTS
bytedance/Make-An-Audio-2
A text-conditional diffusion probabilistic model capable of generating high-fidelity audio.
Make-An-Audio/Make-An-Audio.github.io
Darius-H/GLaDOS-CheckIn
GLaDOS AutoCheckIn: scheduled automatic check-in