leduckhai
def me('techmonzter'): return 'nerd'
FH Aachen, RWTH AachenHo Chi Minh City, Aachen, Toronto
leduckhai's Stars
nrl-ai/llama-assistant
AI-powered assistant to help you with your daily tasks, powered by Llama 3.2. It can recognize your voice, process natural language, and perform various actions based on your commands: summarizing text, rephasing sentences, answering questions, writing emails, and more.
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
microsoft/generative-ai-for-beginners
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
UConn-DSIS/Empowering-Time-Series-Analysis-with-LLM
Official website for "Empowering Time Series Analysis with Large Language Models: A Survey"
WenjieDu/Awesome_Imputation
Awesome Deep Learning for Time-Series Imputation, including a must-read paper list about applying neural networks to impute incomplete time series containing NaN missing values/data
thuml/AutoTimes
Official implementation for "AutoTimes: Autoregressive Time Series Forecasters via Large Language Models"
xiyuanzh/time-series-papers
An up-to-date list of time-series related papers in AI venues.
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
asappresearch/slue-toolkit
A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. Official website: https://asappresearch.github.io/slue-toolkit/
microsoft/Pengi
An Audio Language model for Audio Tasks
open-mmlab/mmpretrain
OpenMMLab Pre-training Toolbox and Benchmark
facebookresearch/libri-light
dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.
MontrealCorpusTools/Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
v-nhandt21/Vinorm
Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syllables
langmaninternet/VietnameseTextNormalizer
Thư viện chuẩn hóa văn bản Tiếng Việt
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
academicpages/academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
HKAB/whisper-finetune-vietnamese
Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM
liusongxiang/Large-Audio-Models
Keep track of big models in audio domain, including speech, singing, music etc.
alexeygrigorev/data-science-interviews
Data science interview questions and answers
aitomatic/openssa
OpenSSA: Small Specialist Agents based on Domain-Aware Neurosymbolic Agent (DANA) architecture for industrial problem-solving
Yuan-ManX/ai-audio-datasets
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.
stopwords/vietnamese-stopwords
Vietnamese stopwords
ducanhdt/openai_whisper_finetuning
wq2012/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
AtsushiSakai/PythonRobotics
Python sample codes for robotics algorithms.
nguyenvulebinh/lyric-alignment
Vietnamese song lyric alignment framework
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
wenet-e2e/speech-synthesis-paper
List of speech synthesis papers.
AI4Bharat/IndicWav2Vec
Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2