Pinned Repositories
bert-as-service
Mapping a variable-length sentence to a fixed-length vector using a BERT model
dynet
DyNet: The Dynamic Neural Network Toolkit
Mantidae
A C++ Lightweight Neural Machine Translation Toolkit
news-corpus
Vietnamese corpus
OntoNotes-5.0-NER-BIO
A BIO formatted Named Entity Recognition data set extracted from the OntoNotes 5.0 release.
RelatedWorkSummarizationDataset
Dataset for the paper: Cong Duy Vu Hoang and Min-Yen Kan (2010) Towards Automated Related Work Summarization. In Proceedings of the 23rd International Conference on Computational Linguistics (COLING 2010), Beijing, China. pp. 427-435.
TEDTalksCrawler
A Crawler for TED Talks data
tensor2tensor
A library for generalized sequence to sequence models
Transformer-DyNet
An Implementation of Transformer (Attention Is All You Need) in DyNet
VNTC
A Large-scale Vietnamese News Text Classification Corpus
duyvuleo's Repositories
duyvuleo/Transformer-DyNet
An Implementation of Transformer (Attention Is All You Need) in DyNet
duyvuleo/duorat
duyvuleo/duyvuleo.github.io
Cong Duy Vu Hoang's Personal Homepage
duyvuleo/allennlp
An open-source NLP research library, built on PyTorch.
duyvuleo/allennlp-optuna
⚡️ AllenNLP plugin for adding subcommands to use Optuna, making hyperparameter optimization easy
duyvuleo/Awesome-Code-LLM
👨‍💻 An awesome, curated list of the best code LLMs for research.
duyvuleo/chat-gpt-google-extension
A browser extension to display ChatGPT response alongside search engine results
duyvuleo/datasets
🤗 Fast, efficient, open-access datasets and evaluation metrics for Natural Language Processing and more in PyTorch, TensorFlow, NumPy and Pandas
duyvuleo/DeBERTa
The implementation of DeBERTa
duyvuleo/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
duyvuleo/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
duyvuleo/gpt4all
gpt4all: a chatbot trained on a massive collection of clean assistant data including code, stories and dialogue
duyvuleo/mesh-transformer-jax
Model parallel transformers in JAX and Haiku
duyvuleo/nlpaug
Data augmentation for NLP
duyvuleo/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that provides data annotation and synthesis tools and supports training and deployment on server, mobile, embedded, and IoT devices)
duyvuleo/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
duyvuleo/pdfplumber
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
duyvuleo/Prompt-Engineering-Guide
🐙 Guide and resources for prompt engineering
duyvuleo/pynndescent
A Python nearest neighbor descent for approximate nearest neighbors
duyvuleo/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
duyvuleo/Slurm_tools
My tools for the Slurm HPC workload manager
duyvuleo/SmBop
duyvuleo/sqlglot
Python SQL Parser and Transpiler
duyvuleo/tensor2struct-public
Semantic parsers based on encoder-decoder framework
duyvuleo/test-suite-sql-eval
Semantic Evaluation for Text-to-SQL with Distilled Test Suites
duyvuleo/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.
duyvuleo/transformers-interpret
Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.
duyvuleo/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
duyvuleo/vietocr
Transformer OCR
duyvuleo/x-transformers
A simple but complete full-attention transformer with a set of promising experimental features from various papers