Pinned Repositories
ASR-Rescoring
audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
auditok
An audio/acoustic activity detection and audio segmentation tool
awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
bark
🔊 Text-Prompted Generative Audio Model
BELLE
BELLE: Bloom-Enhanced Large Language model Engine(开源中文对话大模型-70亿参数)
DPSL-ASR
Dual-Path Style Learning for End-to-End Noise-Robust Automatic Speech Recognition (DPSL-ASR).
OpenChatKit
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.
Alex-Songs's Repositories
Alex-Songs/OpenChatKit
Alex-Songs/ASR-Rescoring
Alex-Songs/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Alex-Songs/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Alex-Songs/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
Alex-Songs/bark
🔊 Text-Prompted Generative Audio Model
Alex-Songs/BELLE
BELLE: Bloom-Enhanced Large Language model Engine(开源中文对话大模型-70亿参数)
Alex-Songs/chatllama
ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT
Alex-Songs/Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地部署 (Chinese LLaMA & Alpaca LLMs)
Alex-Songs/flops-counter.pytorch
Flops counter for convolutional networks in pytorch framework
Alex-Songs/CMSIS-DSP
CMSIS-DSP embedded compute library for Cortex-M and Cortex-A
Alex-Songs/faster-whisper
Faster Whisper transcription with CTranslate2
Alex-Songs/gpt4all
gpt4all: a chatbot trained on a massive collection of clean assistant data including code, stories and dialogue
Alex-Songs/gpt_academic
为GPT/GLM提供图形交互界面,特别优化论文阅读润色体验,模块化设计支持自定义快捷按钮&函数插件,支持代码块表格显示,Tex公式双显示,新增Python和C++项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持清华chatglm等本地模型。兼容复旦MOSS, llama, rwkv, 盘古等。
Alex-Songs/InferLLM
a lightweight LLM model inference framework
Alex-Songs/InStock
InStock股票系统,基于akshare抓取股票每日关键数据,计算股票各种指标,识别K线各种形态,内置多种选股策略,支持选股验证回测,是量化投资工具。Stock system, based on akshare, captures key daily data of stocks, calculates various stock indicators, K-line pattern recognition, has a variety of built-in stock selection strategies, and supports stock selection verification back test. quantitative investment tool.
Alex-Songs/Large-Audio-Models
Keep track of big models in audio domain, including speech, singing, music etc.
Alex-Songs/llama
Inference code for LLaMA models
Alex-Songs/LLaMA-Adapter
LLaMA-Adapter: Tuning LLaMa within One Hour and 8M Parameters
Alex-Songs/llama.cpp
Port of Facebook's LLaMA model in C/C++
Alex-Songs/LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
Alex-Songs/muavic
MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation
Alex-Songs/pits
PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor
Alex-Songs/Prompt-Engineering-Guide
:octopus: Guides, papers, lecture, and resources for prompt engineering
Alex-Songs/TensorflowASR
集成了Tensorflow 2版本的端到端语音识别模型,并且RTF(实时率)在0.1左右/Mandarin State-of-the-art Automatic Speech Recognition in Tensorflow 2
Alex-Songs/text
Text utilities, including beam search decoding, tokenizing, and more, built for use in Flashlight.
Alex-Songs/VisualGLM-6B
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
Alex-Songs/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Alex-Songs/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Alex-Songs/whisper.cpp
Port of OpenAI's Whisper model in C/C++