Happenmass
A fashionable creator who desires to create a beautiful code world. I love exploring more effective machine learning code, and I want to achieve my own value in Git.
Happenmass's Stars
MuShibo/Micro-Wheeled_leg-Robot
The world's smallest desktop-grade two-wheeled legged robot!
LiuDingchuan/Graduate_Project
Webots simulation of a wheeled bipedal robot using model-based LQR (from an undergraduate graduation project)
lifeiteng/OmniSenseVoice
Omni SenseVoice: High-Speed Speech Recognition with word timestamps 🗣️🎯
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
ggerganov/ggml
Tensor library for machine learning
lucidrains/MIMO-pytorch
Pytorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed Modeling, from Alibaba Intelligence Group
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Audio-AGI/AudioSep
Official implementation of "Separate Anything You Describe"
Happenmass/LiveAssistPro
LiveAssistPro is an AI assistant for live streaming that uses Zhipu AI’s vision model to analyze screen content and return JSON descriptions. It detects user speech via VAD and ASR, enabling real-time interaction with an AI role-playing assistant, which adapts responses based on both screen visuals and spoken input for dynamic engagement.
gpt-omni/mini-omni
Open-source multimodal large language model that can hear and talk while thinking. Features real-time end-to-end speech input and streaming audio output for conversation.
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
catcto/CosyVoiceDocker
This repository provides a Docker image for CosyVoice
lovemefan/SenseVoice.cpp
Port of FunASR's SenseVoice model in C/C++
jingzhunxue/flow_mirror
flow mirror models from JZX AI Labs
flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
cpacker/MemGPT
Letta (fka MemGPT) is a framework for creating stateful LLM services.
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
wiseman/py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
catcto/SenseVoiceDocker
This repository provides a Docker image for SenseVoice
Happenmass/openai-batch-api-processor
OpenAIBatcher is a Python wrapper for the OpenAI Batch API, designed to streamline batch processing of large datasets. It handles file uploads, batch creation, status tracking, and result retrieval, so large volumes of API requests can be processed efficiently.
Nemo2011/bilibili-api
Common Bilibili API calls. Supports videos, bangumi (anime series), users, channels, audio, and more. Original repository: https://github.com/MoyuScript/bilibili-api
originalFactor/biliapi
Python SDK for the BiliBili API
gildor2/UEViewer
Viewer and exporter for Unreal Engine 1-4 assets (UE Viewer).
AIFSH/SenseVoice-ComfyUI
HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
amphionspace/SD-Eval
[NeurIPS 2024] SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words
atong01/conditional-flow-matching
TorchCFM: a Conditional Flow Matching library
wenet-e2e/WeTextProcessing
Text Normalization & Inverse Text Normalization
descriptinc/descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
zhulu111/ComfyUI_Bxb
SD变现宝 (SD Monetization Tool): convert ComfyUI workflows into mini programs with one click.