Pinned Repositories
-
用多层BLSTM模型同时进行中文分词和标点符号预测
16SoundsUSB
16 Synchronized Inputs USB (UAC2) Sound Card Based on XMOS xCORE-200
acoular
Library for acoustic beamforming
AESRC2020
Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech Recognition Challenge (AESRC).
DeepSpeech
A TensorFlow implementation of Baidu's DeepSpeech architecture
kaldi-gop
Computes the Goodness of Pronunciation (GOP). Bases on Kaldi.
kaldi-gstreamer-server
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
Keras-Trigger-Word
How to do Real Time Trigger Word Detection with Keras | DLology
speech-vad-demo
集成Webrtc的VAD,用于切分音频文件
speechbrain
A PyTorch-based Speech Toolkit
yh646492956's Repositories
yh646492956/acoular
Library for acoustic beamforming
yh646492956/Baichuan-13B
A 13B large language model developed by Baichuan Intelligent Technology
yh646492956/BELLE
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
yh646492956/bloomz.cpp
C++ implementation for BLOOM
yh646492956/continue
⏩ The easiest way to code with any LLM—Continue is an open-source autopilot for VS Code and JetBrains
yh646492956/DCT-Net
Official implementation of "DCT-Net: Domain-Calibrated Translation for Portrait Stylization", SIGGRAPH 2022 (TOG); Multi-style cartoonization
yh646492956/EduChat
An open-source educational chat model from ICALK, East China Normal University. 开源中英教育对话大模型。(通用基座模型,GPU部署,数据清理) 致敬: LLaMA, MOSS, BELLE, Ziya
yh646492956/emqx
The most scalable open-source MQTT broker for IoT, IIoT, and connected vehicles
yh646492956/exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
yh646492956/faiss
A library for efficient similarity search and clustering of dense vectors.
yh646492956/fast-whisper-finetuning
yh646492956/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
yh646492956/FunASR
A Fundamental End-to-End Speech Recognition Toolkit
yh646492956/gpt4-pdf-chatbot-langchain
GPT4 & LangChain Chatbot for large PDF docs
yh646492956/gpt_academic
为ChatGPT/GLM提供图形交互界面,特别优化论文阅读润色体验,模块化设计支持自定义快捷按钮&函数插件,支持代码块表格显示,Tex公式双显示,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持清华chatglm等本地模型。兼容复旦MOSS, llama, rwkv, 盘古, newbing, claude等
yh646492956/icefall
yh646492956/kws-training-suite
yh646492956/llama
Inference code for LLaMA models
yh646492956/mem0
The Memory layer for your AI apps
yh646492956/mmdetection
OpenMMLab Detection Toolbox and Benchmark
yh646492956/mosquitto
Eclipse Mosquitto - An open source MQTT broker
yh646492956/PaddleDetection
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
yh646492956/paho.mqtt.python
paho.mqtt.python
yh646492956/rerankers
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
yh646492956/Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
yh646492956/sherpa-onnx
Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift
yh646492956/Starmoon
An open source voice-enabled, compact, empathic AI hardware + software 🤖 framework for companionship, entertainment, education, pediatric care, IoT robotics applications, AI-enhanced robotics application services, research, and DIY robotics kit development using Python, NextJs, Arduino, ESP32, LLMs (GPT), STT, TTS, Emotion Analysis, AI agent
yh646492956/text2vec
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。
yh646492956/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
yh646492956/wenda
闻达:一个LLM调用平台。目标为针对特定环境的高效内容生成,同时考虑个人和中小企业的计算资源局限性,以及知识安全和私密性问题