Pinned Repositories
ArcaneGAN
ArcaneGAN
Bert-VITS2
vits2 backbone with multilingual-bert
ChineseNER
Chinese organization-name and person-name recognition based on Bi-GRU + CRF, with support for Google BERT models
clip-as-service
Embed images and sentences into fixed-length vectors with CLIP
CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
CLIP_prefix_caption
Simple image captioning model
DECA
DECA: Detailed Expression Capture and Animation (SIGGRAPH 2021)
DeepLearning-MuLi-Notes
Notes on Mu Li's Dive into Deep Learning course
deltille
Detector of partial or occluded deltille (triangular) and rectangular checkerboards in camera images
DiffVar
russell-shu's Repositories
russell-shu/Bert-VITS2
vits2 backbone with multilingual-bert
russell-shu/clip-as-service
Embed images and sentences into fixed-length vectors with CLIP
russell-shu/DECA
DECA: Detailed Expression Capture and Animation (SIGGRAPH 2021)
russell-shu/DeepLearning-MuLi-Notes
Notes on Mu Li's Dive into Deep Learning course
russell-shu/DiffVar
russell-shu/-
Homework
russell-shu/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
russell-shu/chatgpt-on-wechat
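A chatbot built on large language models that can be connected to WeChat Official Accounts, WeCom (Enterprise WeChat) apps, Feishu, DingTalk, and other platforms. It can be configured to use GPT3.5/GPT-4o/GPT-o1/DeepSeek/Claude/ERNIE Bot (文心一言)/iFlytek Spark (讯飞星火)/Tongyi Qianwen (通义千问)/Gemini/GLM-4/Kimi/LinkAI, handles text, voice, and images, can access the operating system and the internet, and supports building customized enterprise customer-service assistants on top of a private knowledge base.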
russell-shu/CosyVoice
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
russell-shu/espnet
End-to-End Speech Processing Toolkit
russell-shu/FastThresholdClustering
FastThresholdClustering is an efficient vector clustering algorithm based on FAISS, particularly suitable for large-scale vector data clustering tasks. The algorithm features intuitive and easy-to-select hyperparameters, uses cosine similarity as its distance metric, and supports GPU acceleration.
russell-shu/FFHQ-UV
FFHQ-UV: Normalized Facial UV-Texture Dataset for 3D Face Reconstruction
russell-shu/galai
Model API for GALACTICA
russell-shu/github-actions-youtube-dl
Download YouTube videos using GitHub Actions
russell-shu/InternLM_homework
russell-shu/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
russell-shu/MeshCNN
Convolutional Neural Network for 3D meshes in PyTorch
russell-shu/multiface
Hosts the Multiface dataset, which is a multi-view dataset of multiple identities performing a sequence of facial expressions.
russell-shu/musiclm-pytorch
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in PyTorch
russell-shu/NER---AdaSeq
AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models
russell-shu/nlp-paper
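Papers in the field of natural language processing (with reading notes), reproduced models, data processing, and more (code provided in both TensorFlow and PyTorch versions)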
russell-shu/nuwa-pytorch
Implementation of NÜWA, a state-of-the-art attention network for text-to-video synthesis, in PyTorch
russell-shu/PASSL
PASSL includes self-supervised image algorithms such as SimCLR, MoCo v1/v2, BYOL, CLIP, PixPro, BEiT, and MAE, as well as basic vision algorithms such as Vision Transformer, DEiT, Swin Transformer, CvT, T2T-ViT, MLP-Mixer, XCiT, ConvNeXt, and PVTv2
russell-shu/Python-100-Days
Python - From Novice to Master in 100 Days
russell-shu/russell-shu
Config files for my GitHub profile.
russell-shu/sd-fuse_rk3399
Create bootable SD card for NanoPC-T4/NanoPi M4/NanoPi NEO4
russell-shu/stable-diffusion
A latent text-to-image diffusion model
russell-shu/stable-diffusion-webui
Stable Diffusion web UI
russell-shu/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
russell-shu/Tutorial
LLM&VLM Tutorial