Darius-H

Zhejiang UniversityHangzhou,Zhejiang,China

Darius-H's Stars

huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python132k 1.1k 15.7k26.3k
d2l-ai/d2l-zh
《动手学深度学习》：面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。
Language:Python61.3k 1.1k 010.9k
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language:Python53.9k 446 1315.6k
open-mmlab/mmdetection
OpenMMLab Detection Toolbox and Benchmark
Language:Python29.1k 368 8.3k9.4k
huggingface/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Language:Python19k 277 2.9k2.6k
Loyalsoldier/clash-rules
🦄️ 🎃 👻 Clash Premium 规则集(RULE-SET)，兼容 ClashX Pro、Clash for Windows 等基于 Clash Premium 内核的客户端。
18.4k 101 2661.6k
serengil/deepface
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Language:Python11.8k 144 1.1k2k
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Language:Python10.9k 183 1.9k1.8k
jacobgil/pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
Language:Python10.2k 44 4081.5k
AIGC-Audio/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Language:Python10k 132 49854
nerfstudio-project/nerfstudio
A collaboration friendly studio for NeRFs
Language:Python9.3k 117 1.6k1.2k
curlconverter/curlconverter
Transpile curl commands into Python, JavaScript and 27 other languages
Language:TypeScript7.4k 74 311914
kuangliu/pytorch-cifar
95.47% on CIFAR10 with PyTorch
Language:Python5.9k 85 1362.1k
open-mmlab/mmcv
OpenMMLab Computer Vision Foundation
Language:Python5.8k 84 1.1k1.6k
tonquer/picacg-qt
哔咔漫画, PicACG comic PC client(Windows, Linux, MacOS)
Language:Python3.5k 24 292180
facebookresearch/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
Language:Python3.4k 57 70305
lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Language:Python2.4k 60 170255
SwinTransformer/Video-Swin-Transformer
This is an official implementation for "Video Swin Transformers".
Language:Python1.4k 9 93196
open-mmlab/mmhuman3d
OpenMMLab 3D Human Parametric Model Toolbox and Benchmark
Language:Python1.2k 23 218130
Text-to-Audio/Make-An-Audio
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
Language:Python737 71 14107
breezedeus/CnSTD
CnSTD: 基于 PyTorch/MXNet 的中文/英文场景文字检测（Scene Text Detection）、数学公式检测（Mathematical Formula Detection, MFD）、篇章分析（Layout Analysis）的Python3 包
Language:Python671 14 48105
shivammehta25/Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Language:Jupyter Notebook610 15 6176
tijiang13/InstantAvatar
InstantAvatar: Learning Avatars from Monocular Video in 60 Seconds (CVPR 2023)
Language:Python353 13 7429
HongwenZhang/PyMAF-X
[TPAMI 2023] PyMAF-X: Towards Well-aligned Full-body Model Regression from Monocular Images
Language:Python207 8 3528
XinhaoMei/WavCaps
This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.
Language:Python194 5 2611
pai4451/ML2021
My coursework for Machine Learning (2021 Spring) at National Taiwan University (NTU)
Language:Jupyter Notebook149 4 337
Zain-Jiang/Dict-TTS
Language:Python131 6 710
bytedance/Make-An-Audio-2
a text-conditional diffusion probabilistic model capable of generating high fidelity audio.
Language:Python116 4 414
Make-An-Audio/Make-An-Audio.github.io
Language:HTML7 1 0
Darius-H/GLaDOS-CheckIn
GLaDOS AutoCheckIn 定时自动签到
Language:JavaScript1

Darius-H

Darius-H's Stars

huggingface/transformers

d2l-ai/d2l-zh

labmlai/annotated_deep_learning_paper_implementations

open-mmlab/mmdetection

huggingface/datasets

Loyalsoldier/clash-rules

serengil/deepface

PaddlePaddle/PaddleSpeech

jacobgil/pytorch-grad-cam

AIGC-Audio/AudioGPT

nerfstudio-project/nerfstudio

curlconverter/curlconverter

kuangliu/pytorch-cifar

open-mmlab/mmcv

tonquer/picacg-qt

facebookresearch/encodec

lucidrains/audiolm-pytorch

SwinTransformer/Video-Swin-Transformer

open-mmlab/mmhuman3d

Text-to-Audio/Make-An-Audio

breezedeus/CnSTD

shivammehta25/Matcha-TTS

tijiang13/InstantAvatar

HongwenZhang/PyMAF-X

XinhaoMei/WavCaps

pai4451/ML2021

Zain-Jiang/Dict-TTS

bytedance/Make-An-Audio-2

Make-An-Audio/Make-An-Audio.github.io

Darius-H/GLaDOS-CheckIn