Jason-Zhou-JC's Stars
tensorflow/models
Models and examples built with TensorFlow
CompVis/stable-diffusion
A latent text-to-image diffusion model
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
lllyasviel/ControlNet
Let us control diffusion models!
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
XavierXiao/Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
openai/guided-diffusion
huggingface/deep-rl-class
This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course.
hojonathanho/diffusion
Denoising Diffusion Probabilistic Models
openai/improved-diffusion
Release for Improved Denoising Diffusion Probabilistic Models
CVI-SZU/Linly
Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集
QianyanTech/Image-Downloader
Download images from Google, Bing, Baidu. 谷歌、百度、必应图片下载.
ShivamShrirao/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
qiuqiangkong/audioset_tagging_cnn
Tencent/TencentPretrain
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo
audioset/ontology
The Audio Set Ontology aims to provide a comprehensive set of categories to describe sound events.
HCIILAB/SCUT-HEAD-Dataset-Release
SCUT HEAD is a large-scale head detection dataset, including 4405 images labeld with 111251 heads.
lyakaap/NetVLAD-pytorch
PyTorch implementation of NetVLAD & Online Hardest Triplet Loss.
yeyupiaoling/AudioClassification-Pytorch
The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE and other models, as well as a variety of preprocessing methods.
DCASE-REPO/DESED_task
Domestic environment sound event detection task
turpaultn/DCASE2019_task4
Baseline of dcase 2019 task 4
CVI-SZU/UniFace
CVI-SZU/UniTSFace