Pinned Repositories
apollo_read
An open autonomous driving platform
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
audioseal
Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector
awesome_LLMs_interview_notes
LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案
BasicSR
Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, etc. Also support StyleGAN2, DFDNet.
Bert-VITS2
vits2 backbone with bert
carefree-creator
AI magics meet Infinite draw board.
CarND-Advanced-Lane-Lines
Chat-Haruhi-Suzumiya
Chat凉宫春日, 由李鲁鲁, 冷子昂等同学开发的模仿二次元对话的聊天机器人。
Chinese-Text-Classification-Pytorch
中文文本分类,TextCNN,TextRNN,FastText,TextRCNN,BiLSTM_Attention,DPCNN,Transformer,基于pytorch,开箱即用。
Comedian1926's Repositories
Comedian1926/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Comedian1926/audioseal
Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector
Comedian1926/awesome_LLMs_interview_notes
LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案
Comedian1926/BasicSR
Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, etc. Also support StyleGAN2, DFDNet.
Comedian1926/Bert-VITS2
vits2 backbone with bert
Comedian1926/carefree-creator
AI magics meet Infinite draw board.
Comedian1926/Chat-Haruhi-Suzumiya
Chat凉宫春日, 由李鲁鲁, 冷子昂等同学开发的模仿二次元对话的聊天机器人。
Comedian1926/Chinese-Text-Classification-Pytorch
中文文本分类,TextCNN,TextRNN,FastText,TextRCNN,BiLSTM_Attention,DPCNN,Transformer,基于pytorch,开箱即用。
Comedian1926/DreamSound
Code for Investigating Personalization Methods in Text to Music Generation
Comedian1926/facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
Comedian1926/facial-pose-estimation-pytorch-v2
Comedian1926/facial-pose-estimation-unreal
Comedian1926/Learned-Motion-Matching
A neural-network-based generative model for video-game characters animations
Comedian1926/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Comedian1926/FaceStudio
Put Your Face Everywhere in Seconds.
Comedian1926/generative-models
Generative Models by Stability AI
Comedian1926/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Comedian1926/modelscope-agent
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
Comedian1926/neo-ai-dlr
Neo-AI-DLR is a common runtime for machine learning models compiled by AWS SageMaker Neo, TVM, or TreeLite.
Comedian1926/nnie
重构海思sample中的NNIE模块
Comedian1926/openai-quickstart
A comprehensive guide to understanding and implementing large language models with hands-on examples using LangChain for GenAI applications.
Comedian1926/stable-diffusion-webui
Stable Diffusion web UI
Comedian1926/strongtrack
A python tool with facial landmark annotation and coefficient finder
Comedian1926/StyleGestures
Comedian1926/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Comedian1926/talking-head-anime-3-demo
Demo Programs for the "Talking Head(?) Anime from a Single Image 3: Now the Body Too" Project
Comedian1926/tango
Codes and Model of the paper "Text-to-Audio Generation using Instruction Tuned LLM and Latent Diffusion Model"
Comedian1926/TensorRT-ERNIE
Comedian1926/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Comedian1926/YOLOX
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/