httang1224

Pinned Repositories

yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
Language: Python | Stars: 51.1k | Forks: 16.4k
CSC-Unet
Language: Python | Stars: 1 | Forks: 0
IRRA
Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval (CVPR 2023)
Language: Python | Stars: 1 | Forks: 0
DDPM
PyTorch DDPM implementation
Language: Python | Stars: 1 | Forks: 0
AudioClassfication
Language: Python | Stars: 0 | Forks: 0
AudioClassification-Pytorch
A PyTorch implementation of sound classification that supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE, and other models, as well as a variety of preprocessing methods.
Language: Python | Stars: 0 | Forks: 0
AudioSignalProcessingForML
Code and slides of my YouTube series called "Audio Signal Processing for Machine Learning"
Language: Jupyter Notebook | Stars: 0 | Forks: 0
Awesome-Diffusion-Model-Based-Image-Editing-Methods
Diffusion Model-Based Image Editing: A Survey (arXiv)
Stars: 0 | Forks: 0
awesome-programming-books
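A collection of classic computer science books: e-books on Java, Scala, C/C++, algorithms, computer fundamentals, mathematics, English, and more, plus materials from the technical summits of major internet companies.
Stars: 0 | Forks: 0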
awesome-text-to-image-studies
A collection of awesome text-to-image generation studies.
Language: TeX | Stars: 0 | Forks: 0

httang1224's Repositories

httang1224/CSC-Unet
Stars: 1
httang1224/DDPM
PyTorch DDPM implementation
Stars: 1
httang1224/IRRA
Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval (CVPR 2023)
Stars: 1
httang1224/AudioClassfication
httang1224/AudioClassification-Pytorch
A PyTorch implementation of sound classification that supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE, and other models, as well as a variety of preprocessing methods.
httang1224/ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs
httang1224/deep-learning-for-image-processing
Deep learning for image processing, including classification, object detection, and more.
httang1224/DeepLearning
A deep learning code base, mainly for paper replication, in the areas of image recognition, object detection, image segmentation, self-supervision, etc. Each project can be run independently, and there are corresponding articles to explain.
httang1224/diffusion_model
An online playground for diffusion models
httang1224/dive-into-cv-pytorch
Dive into CV, PyTorch edition
httang1224/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
httang1224/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
httang1224/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
httang1224/PVT
Official implementation of PVT series
httang1224/PyTorch-Tutorial-2nd
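"PyTorch Practical Tutorial" (2nd edition). Whether you are starting from zero, applying PyTorch to CV, NLP, or LLM projects, or moving on to engineering and production deployment, it is all covered here. With the help of this book, readers should be able to master PyTorch with ease and become capable deep learning engineers.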
httang1224/pytorch_Realtime_Multi-Person_Pose_Estimation
httang1224/pytorchforaudio
Code for the "PyTorch for Audio + Music Processing" series on The Sound of AI YouTube channel.
httang1224/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
httang1224/Qwen2
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
httang1224/RepVGG
RepVGG: Making VGG-style ConvNets Great Again
httang1224/Res2Net-PretrainedModels
(ImageNet pretrained models) The official PyTorch implementation of the TPAMI paper "Res2Net: A New Multi-scale Backbone Architecture"
httang1224/RT-ODLab
YOLO Tutorial
httang1224/senet.pytorch
PyTorch implementation of SENet
httang1224/SKNet-PyTorch
Nearly Perfect & Easily Understandable PyTorch Implementation of SKNet
httang1224/stable-diffusion
A latent text-to-image diffusion model
httang1224/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
httang1224/T2T-ViT
ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
httang1224/TransUNet
This repository includes the official project of TransUNet, presented in our paper: TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation.
httang1224/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
httang1224/yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite