Pinned Repositories
3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
agentscope
Start building LLM-empowered multi-agent applications in an easier way.
data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!
DiffSynth-Studio
Enjoy the magic of Diffusion models!
facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
FunClip
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
modelscope-agent
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
swift
ms-swift: Use PEFT or Full-parameter to finetune 250+ LLMs or 35+ MLLMs. (Qwen2, GLM4, Internlm2, Yi, Llama3, Llava, Deepseek, Baichuan2...)
ModelScope's Repositories
modelscope/facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
modelscope/agentscope
Start building LLM-empowered multi-agent applications in an easier way.
modelscope/FunClip
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
modelscope/modelscope-agent
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
modelscope/swift
ms-swift: Use PEFT or Full-parameter to finetune 250+ LLMs or 35+ MLLMs. (Qwen2, GLM4, Internlm2, Yi, Llama3, Llava, Deepseek, Baichuan2...)
modelscope/data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!
modelscope/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
modelscope/DiffSynth-Studio
Enjoy the magic of Diffusion models!
modelscope/KAN-TTS
KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
modelscope/AdaSeq
AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models
modelscope/richdreamer
Live Demo:https://modelscope.cn/studios/Damo_XR_Lab/3D_AIGC
modelscope/FunCodec
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
modelscope/scepter
SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.
modelscope/motionagent
MotionAgent is your AI assistent to convert ideas into motion pictures.
modelscope/modelscope-classroom
modelscope/normal-depth-diffusion
modelscope/lite-sora
An initiative to replicate Sora
modelscope/dash-infer
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including x86 and ARMv9.
modelscope/eval-scope
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
modelscope/kws-training-suite
modelscope/AdaDet
AdaDet: A Development Toolkit for Object Detection based on ModelScope
modelscope/modelscope-studio