TFKit is a tool kit mainly for language generation.
It leverages the use of transformers on many tasks with different models in this all-in-one framework.
All you need is a little change of config.
With transformer models - BERT/ALBERT/T5/BART......
Text Generation | 📝 seq2seq language model |
Text Generation | 🖊️ causal language model |
Text Generation | 🖨️ once generation model / once generation model with ctc loss |
Text Generation | 📝 onebyone generation model |
Learn more from the document.
Simple installation from PyPI
pip install git+https://github.com/voidful/TFkit.git@refactor-dataset
input, target
tfkit-train \
--task clas \
--config xlm-roberta-base \
--train training_data.csv \
--test testing_data.csv \
--lr 4e-5 \
--maxlen 384 \
--epoch 10 \
--savedir roberta_sentiment_classificer
tfkit-eval \
--task roberta_sentiment_classificer/1.pt \
--metric clas \
--valid testing_data.csv
Multi-task training
tfkit-train \
--task clas clas \
--config xlm-roberta-base \
--train training_data_taskA.csv training_data_taskB.csv \
--test testing_data_taskA.csv testing_data_taskB.csv \
--lr 4e-5 \
--maxlen 384 \
--epoch 10 \
--savedir roberta_sentiment_classificer_multi_task
Due to time constraints, the following tasks are temporarily not supported
Classification | 🏷️ multi-class and multi-label classification |
Question Answering | 📃 extractive qa |
Question Answering | 🔘 multiple-choice qa |
Tagging | 👁️🗨️ sequence level tagging / sequence level with crf |
Self-supervise Learning | 🤿 mask language model |
- transformers models list: you can find any pretrained models here
- nlprep: download and preprocessing data in one line
- nlp2go: create demo api as quickly as possible.
Thanks for your interest.There are many ways to contribute to this project. Get started here.
Icons modify from Freepik from www.flaticon.com
Icons modify from Nikita Golubev from www.flaticon.com