LingXuanYin's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
acheong08/ChatGPT
Reverse engineered ChatGPT API
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
Anjok07/ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
dair-ai/ml-visuals
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
lukas-blecher/LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
nl8590687/ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
1adrianb/face-alignment
:fire: 2D and 3D Face alignment library build using pytorch
thuml/Time-Series-Library
A Library for Advanced Deep Time Series Models.
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
huggingface/parler-tts
Inference and training library for high-quality TTS models.
MoonInTheRiver/DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
cs230-stanford/cs230-code-examples
Code examples in pyTorch and Tensorflow for CS230
google-research/timesfm
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
Nixtla/nixtla
TimeGPT-1: production ready pre-trained Time Series Foundation Model for forecasting and anomaly detection. Generative pretrained transformer for time series trained on over 100B data points. It's capable of accurately predicting various domains such as retail, electricity, finance, and IoT with just a few lines of code 🚀.
OSU-NLP-Group/HippoRAG
[NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personalized PageRank.
Womsxd/AutoMihoyoBBS
米游社自动签到,支持:崩坏二、崩坏三、原神、未定事件簿,米游币自动获取
thuml/iTransformer
Official implementation for "iTransformer: Inverted Transformers Are Effective for Time Series Forecasting" (ICLR 2024 Spotlight), https://openreview.net/forum?id=JePfAI8fah
AILab-CVC/UniRepLKNet
[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
QData/spacetimeformer
Multivariate Time Series Forecasting with efficient Transformers. Code for the paper "Long-Range Transformers for Dynamic Spatiotemporal Forecasting."
city-super/BungeeNeRF
[ECCV22] BungeeNeRF: Progressive Neural Radiance Field for Extreme Multi-scale Scene Rendering
amazon-science/earth-forecasting-transformer
Official implementation of Earthformer
openclimatefix/metnet
PyTorch Implementation of Google Research's MetNet and MetNet-2
openclimatefix/skillful_nowcasting
Implementation of DeepMind's Deep Generative Model of Radar (DGMR) https://arxiv.org/abs/2104.00954
aikunyi/FourierGNN
Official implementation of the paper "FourierGNN: Rethinking Multivariate Time Series Forecasting from a Pure Graph Perspective"
AlexandaJerry/vits-mandarin-biaobei
application of vits on mandarin tts
silverriver/MMChat
[LREC] MMChat: Multi-Modal Chat Dataset on Social Media
Frank-Wang-oss/FCSTGNN
jscslld/HMER
《人工智能原理》课程设计(基于Resnet-Transformer的手写数学表示式识别)
GoHomeToMacDonal/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)