carcloudfly

Pinned Repositories

act-plus-plus
Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN
Language: Python
alibabacloud-bailian-speech-demo
Sample Repository for the AlibabaCloud Bailian Speech SDK
Arc2Face
Arc2Face: A Foundation Model of Human Faces
Language: Python
champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Language: Python
CosyVoice
LLM-based TTS model providing full-stack inference, training, and deployment capabilities.
Language: Python
distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Language: Python
DrEureka
Official Repository for "DrEureka: Language Model Guided Sim-To-Real Transfer" (RSS 2024)
Language: Python
DynamiCrafter
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Language: Python
EDA-AI
Implementation of NeurIPS 2021 paper "On Joint Learning for Solving Placement and Routing in Chip Design" & NeurIPS 2022 paper "The Policy-gradient Placement and Generative Routing Neural Networks for Chip Design".
Language: Prolog
ffmpeg-webrtc
ffmpeg-webrtc for whip and whep protocol
Language: C

carcloudfly's Repositories

carcloudfly/act-plus-plus
Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN
Language: Python
carcloudfly/alibabacloud-bailian-speech-demo
Sample Repository for the AlibabaCloud Bailian Speech SDK
carcloudfly/Arc2Face
Arc2Face: A Foundation Model of Human Faces
Language: Python
carcloudfly/champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Language: Python
carcloudfly/CosyVoice
LLM-based TTS model providing full-stack inference, training, and deployment capabilities.
Language: Python
carcloudfly/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Language: Python
carcloudfly/DrEureka
Official Repository for "DrEureka: Language Model Guided Sim-To-Real Transfer" (RSS 2024)
Language: Python
carcloudfly/DynamiCrafter
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Language: Python
carcloudfly/EDA-AI
Implementation of NeurIPS 2021 paper "On Joint Learning for Solving Placement and Routing in Chip Design" & NeurIPS 2022 paper "The Policy-gradient Placement and Generative Routing Neural Networks for Chip Design".
Language: Prolog
carcloudfly/ffmpeg-webrtc
ffmpeg-webrtc for whip and whep protocol
Language: C
carcloudfly/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.
Language: Python
carcloudfly/LookOnceToHear
A novel human-interaction method for real-time speech extraction on headphones.
Language: Python
carcloudfly/MARS5-TTS
MARS5 speech model (TTS) from CAMB.AI
Language: Python
carcloudfly/MinerU
A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.
carcloudfly/MusePose
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
Language: Python
carcloudfly/noise-reduction
noise reduction
Language: Python
carcloudfly/yay_robot
PyTorch implementation of YAY Robot
Language: Python