Pinned Repositories
audino
Open source audio annotation tool for humans™
awesome-speech
this is a treasure-house of speech
chinese_text_normalization
Chinese text normalization for speech processing
ChineseGLUE
Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard
CLUENER2020
CLUENER2020 中文细粒度命名实体识别 Fine Grained Named Entity Recognition
CLUEPretrainedModels
高质量中文预训练模型集合:最先进大模型、最快小模型、相似度专门模型
CycleGAN-VC2
Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2
deep-learning-drizzle
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
speech
speech-learning
xbsdsongnan's Repositories
xbsdsongnan/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
xbsdsongnan/BXC_VideoAnalyzer_v4
C++开发的视频行为分析系统v4版本
xbsdsongnan/CodeFormer
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
xbsdsongnan/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
xbsdsongnan/facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
xbsdsongnan/FastGPT
FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without the need for extensive setup or configuration.
xbsdsongnan/flux
Official inference repo for FLUX.1 models
xbsdsongnan/FluxMusic
Text-to-Music Generation with Rectified Flow Transformers
xbsdsongnan/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
xbsdsongnan/gorse
Gorse open source recommender system engine
xbsdsongnan/gpt4free
The official gpt4free repository | various collection of powerful language models
xbsdsongnan/inpaint-web
A free and open-source inpainting & image-upscaling tool powered by webgpu and wasm on the browser。| 基于 Webgpu 技术和 wasm 技术的免费开源 inpainting & image-upscaling 工具, 纯浏览器端实现。
xbsdsongnan/IOPaint
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
xbsdsongnan/Kolors
Kolors Team
xbsdsongnan/LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
xbsdsongnan/lightfm
A Python implementation of LightFM, a hybrid recommendation algorithm.
xbsdsongnan/Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
xbsdsongnan/MaxKB
🚀 基于大语言模型和 RAG 的知识库问答系统。开箱即用、模型中立、灵活编排,支持快速嵌入到第三方业务系统。
xbsdsongnan/MoneyPrinterTurbo
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
xbsdsongnan/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
xbsdsongnan/poster-design
一款漂亮且功能强大的在线海报设计器,图片编辑器,仿稿定设计,适用于多种场景:海报生成、电商产品图、文章长图、视频/公众号封面等。A beautiful online image designer, suitable for various scenarios like generate posters, making design easier!
xbsdsongnan/PowerToys
Windows system utilities to maximize productivity
xbsdsongnan/RecAI
Bridging LLM and Recommender System.
xbsdsongnan/recommenders
Best Practices on Recommendation Systems
xbsdsongnan/text_security_audit
text security audit 安全审核-语义模型过滤 敏感内容检测系统
xbsdsongnan/upscayl
🆙 Upscayl - #1 Free and Open Source AI Image Upscaler for Linux, MacOS and Windows.
xbsdsongnan/V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
xbsdsongnan/video-subtitle-remover
基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.
xbsdsongnan/wordscheck
敏感词检测,违禁词过滤,敏感词过滤,敏感词库,一键启动,本地运行,私有化部署,1分钟接入完成,开箱即用,支持docker,支持在线api
xbsdsongnan/XRec
[EMNLP'2024] "XRec: Large Language Models for Explainable Recommendation"