thelinhbkhn2014's Stars
karpathy/LLM101n
LLM101n: Let's build a Storyteller
PrimeIntellect-ai/OpenDiloco
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training
baochi0212/LaVy
Pioneering in Vietnamese Multimodal Large Language Model
lyuchenyang/Macaw-LLM
Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration
X-LANCE/SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
danielvarga/hunalign
Sentence aligner
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
imoneoi/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
AnswerDotAI/RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
lifeiteng/vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
Plachtaa/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
BuilderIO/gpt-crawler
Crawl a site to generate knowledge files to create your own custom GPT from a URL
VinAIResearch/PhoGPT
PhoGPT: Generative Pre-training for Vietnamese (2023)
dair-ai/ML-Papers-Explained
Explanation to key concepts in ML
UKPLab/sentence-transformers
State-of-the-Art Text Embeddings
VinAIResearch/Anti-DreamBooth
Anti-DreamBooth: Protecting users from personalized text-to-image synthesis (ICCV 2023)
VinAIResearch/WaveDiff
Official Pytorch Implementation of the paper: Wavelet Diffusion Models are fast and scalable Image Generators (CVPR'23)
VinAIResearch/XPhoneBERT
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
VinAIResearch/PhoNLP
PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
thelinhbkhn2014/Text2PhonemeSequence
enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
VinAIResearch/PhoST
A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)
thelinhbkhn2014/VnCoreNLP_Wrapper
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
HKUST-KnowComp/BMGF-RoBERTa
Source Code for IJCAI 2020 paper "On the Importance of Word and Sentence Representation Learning in Implicit Discourse Relation Classification"
VinAIResearch/PhoMT
PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation (EMNLP 2021)
rsennrich/Bleualign
Machine-Translation-based sentence alignment tool for parallel text