Comedian1926

Pinned Repositories

apollo_read
An open autonomous driving platform
Language:C++0 1 00
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Language:Python00
audioseal
Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector
Language:Python00
awesome_LLMs_interview_notes
LLMs interview notes and answers:该仓库主要记录大模型（LLMs）算法工程师相关的面试题和参考答案
0 0 00
BasicSR
Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, etc. Also support StyleGAN2, DFDNet.
Language:Python0 1 00
Bert-VITS2
vits2 backbone with bert
Language:Python0 0 00
carefree-creator
AI magics meet Infinite draw board.
Language:Jupyter Notebook0 1 00
CarND-Advanced-Lane-Lines
Language:Shell0 1 00
Chat-Haruhi-Suzumiya
Chat凉宫春日, 由李鲁鲁, 冷子昂等同学开发的模仿二次元对话的聊天机器人。
Language:Jupyter Notebook0 0 00
Chinese-Text-Classification-Pytorch
中文文本分类，TextCNN，TextRNN，FastText，TextRCNN，BiLSTM_Attention，DPCNN，Transformer，基于pytorch，开箱即用。
Language:Python0 1 00

Comedian1926's Repositories

Comedian1926/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Language:Python00
Comedian1926/audioseal
Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector
Language:Python00
Comedian1926/awesome_LLMs_interview_notes
LLMs interview notes and answers:该仓库主要记录大模型（LLMs）算法工程师相关的面试题和参考答案
0 0 00
Comedian1926/BasicSR
Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, etc. Also support StyleGAN2, DFDNet.
Language:Python0 1 00
Comedian1926/Bert-VITS2
vits2 backbone with bert
Language:Python0 0 00
Comedian1926/carefree-creator
AI magics meet Infinite draw board.
Language:Jupyter Notebook0 1 00
Comedian1926/Chat-Haruhi-Suzumiya
Chat凉宫春日, 由李鲁鲁, 冷子昂等同学开发的模仿二次元对话的聊天机器人。
Language:Jupyter Notebook0 0 00
Comedian1926/Chinese-Text-Classification-Pytorch
中文文本分类，TextCNN，TextRNN，FastText，TextRCNN，BiLSTM_Attention，DPCNN，Transformer，基于pytorch，开箱即用。
Language:Python0 1 00
Comedian1926/DreamSound
Code for Investigating Personalization Methods in Text to Music Generation
Language:Python0 0 00
Comedian1926/facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
Language:Python0 0 00
Comedian1926/facial-pose-estimation-pytorch-v2
Language:Jupyter Notebook0 1 00
Comedian1926/facial-pose-estimation-unreal
Language:C++0 1 00
Comedian1926/Learned-Motion-Matching
A neural-network-based generative model for video-game characters animations
Language:Python0 1 00
Comedian1926/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Language:Python0 0
Comedian1926/FaceStudio
Put Your Face Everywhere in Seconds.
0 0
Comedian1926/generative-models
Generative Models by Stability AI
Language:Python0 0
Comedian1926/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Language:Python
Comedian1926/modelscope-agent
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
Language:Python0 0
Comedian1926/neo-ai-dlr
Neo-AI-DLR is a common runtime for machine learning models compiled by AWS SageMaker Neo, TVM, or TreeLite.
Language:C++1 0
Comedian1926/nnie
重构海思sample中的NNIE模块
Language:C1 0
Comedian1926/openai-quickstart
A comprehensive guide to understanding and implementing large language models with hands-on examples using LangChain for GenAI applications.
Language:Jupyter Notebook0 0
Comedian1926/stable-diffusion-webui
Stable Diffusion web UI
Language:Python1 0
Comedian1926/strongtrack
A python tool with facial landmark annotation and coefficient finder
Language:Python1 0
Comedian1926/StyleGestures
Language:Python1 0
Comedian1926/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Language:Python0 0
Comedian1926/talking-head-anime-3-demo
Demo Programs for the "Talking Head(?) Anime from a Single Image 3: Now the Body Too" Project
Language:Python1 0
Comedian1926/tango
Codes and Model of the paper "Text-to-Audio Generation using Instruction Tuned LLM and Latent Diffusion Model"
Language:Python0 0
Comedian1926/TensorRT-ERNIE
Language:Python0 0
Comedian1926/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language:Python0 0
Comedian1926/YOLOX
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
Language:Python1 0