Pinned Repositories
commonvoice-th
Kaldi recipe for training on the Thai Common Voice corpus
dataset-releases
model-releases
mt-opus
English-Thai Machine Translation with OPUS data
Thai-NNER
PyTorch implementation of the paper: Thai Nested Named Entity Recognition
thai2nmt
English-Thai Machine Translation Models
thai2transformers
Pretraining transformer-based Thai language models
vistec-ser
Speech Emotion Recognition using PyTorch sponsored by AIS and VISTEC-DEPA AIResearch Institute Thailand.
WangchanX
WangchanX Fine-tuning Pipeline
wav2vec2-large-xlsr-53-th
Fine-tune wav2vec2-large-xlsr-53 on the Thai Common Voice Corpus 7.0
VISTEC-depa AI Research Institute of Thailand's Repositories
vistec-AI/thai2transformers
Pretraining transformer-based Thai language models
vistec-AI/wav2vec2-large-xlsr-53-th
Fine-tune wav2vec2-large-xlsr-53 on the Thai Common Voice Corpus 7.0
vistec-AI/WangchanX
WangchanX Fine-tuning Pipeline
vistec-AI/Thai-NNER
PyTorch implementation of the paper: Thai Nested Named Entity Recognition
vistec-AI/dataset-releases
vistec-AI/commonvoice-th
Kaldi recipe for training on the Thai Common Voice corpus
vistec-AI/thai2nmt
English-Thai Machine Translation Models
vistec-AI/vistec-ser
Speech Emotion Recognition using PyTorch sponsored by AIS and VISTEC-DEPA AIResearch Institute Thailand.
vistec-AI/crfcut
Thai sentence segmentation with conditional random fields
vistec-AI/model-releases
vistec-AI/WangchanX-Eval
WangchanX Eval
vistec-AI/colab
A collection of Google Colab notebooks and some data.
vistec-AI/sme-depa
Help small businesses make money from their transaction data; workshop at depa
vistec-AI/WSSET
TF2 implementation of the paper: Self-supervised Deep Metric Learning for Pointsets, ICDE 2021
vistec-AI/thai_websites_crawler
Scripts for crawling the 500 most visited websites in Thailand according to Alexa for `th` and `en` parallel texts.
vistec-AI/WangchanLion
vistec-AI/thwiki-text
vistec-AI/Bilingual-Financial-NER-Model
vistec-AI/thai2nmt_preprocess
vistec-AI/ai-builders-orientation
Lesson 0 - Orientation
vistec-AI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
vistec-AI/pdf2parallel
Extract en-th parallel sentences from PDFs
vistec-AI/ai2api
Productionize NLP models trained on PyTorch by AIResearch.in.th
vistec-AI/capital_market_text_data
vistec-AI/scb_workshop
vistec-AI/SynthMIDI
A single-note classification dataset generated from MIDI files.
vistec-AI/generated_reviews_enth
Generated product reviews dataset for machine translation quality estimation, part of [scb-mt-en-th-2020](https://arxiv.org/pdf/2007.03541.pdf)
vistec-AI/mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
vistec-AI/nlp
🤗nlp – Datasets and evaluation metrics for Natural Language Processing in NumPy, Pandas, PyTorch and TensorFlow
vistec-AI/vistec-ai.github.io