thelinhbkhn2014's Stars

karpathy/LLM101n
LLM101n: Let's build a Storyteller
29k stars · 1.6k forks
PrimeIntellect-ai/OpenDiloco
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training
Language:Python22617
baochi0212/LaVy
Pioneering in Vietnamese Multimodal Large Language Model
Language:Python386
lyuchenyang/Macaw-LLM
Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration
Language: Python · 1.5k stars · 122 forks
X-LANCE/SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
Language:Python51243
danielvarga/hunalign
Sentence aligner
Language:C++10838
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Language: Python · 31.7k stars · 3.9k forks
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Language: Python · 18.3k stars · 1.9k forks
imoneoi/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
Language: Python · 5.2k stars · 399 forks
AnswerDotAI/RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
Language: Python · 2.9k stars · 199 forks
lifeiteng/vall-e
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html
Language: Python · 2k stars · 319 forks
Plachtaa/VALL-E-X
An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io/vallex/
Language: Python · 7.6k stars · 756 forks
BuilderIO/gpt-crawler
Crawl a site to generate knowledge files to create your own custom GPT from a URL
Language: TypeScript · 18.6k stars · 2k forks
VinAIResearch/PhoGPT
PhoGPT: Generative Pre-training for Vietnamese (2023)
Language:Python74367
dair-ai/ML-Papers-Explained
Explanations of key concepts in ML
7.2k stars · 567 forks
UKPLab/sentence-transformers
State-of-the-Art Text Embeddings
Language: Python · 15k stars · 2.4k forks
VinAIResearch/Anti-DreamBooth
Anti-DreamBooth: Protecting users from personalized text-to-image synthesis (ICCV 2023)
Language:Python20316
VinAIResearch/WaveDiff
Official Pytorch Implementation of the paper: Wavelet Diffusion Models are fast and scalable Image Generators (CVPR'23)
Language:Python36928
VinAIResearch/XPhoneBERT
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
Language:Python29635
VinAIResearch/PhoNLP
PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)
Language:Python13518
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language: Python · 6.7k stars · 1.2k forks
thelinhbkhn2014/Text2PhonemeSequence
Language:Python3910
enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
Language: Python · 2.9k stars · 416 forks
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language: Python · 29.4k stars · 4k forks
VinAIResearch/PhoST
A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)
192
thelinhbkhn2014/VnCoreNLP_Wrapper
Language:Python255
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language: Python · 133k stars · 26.5k forks
HKUST-KnowComp/BMGF-RoBERTa
Source Code for IJCAI 2020 paper "On the Importance of Word and Sentence Representation Learning in Implicit Discourse Relation Classification"
Language:Python207
VinAIResearch/PhoMT
PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation (EMNLP 2021)
404
rsennrich/Bleualign
Machine-Translation-based sentence alignment tool for parallel text
Language:Python29881