/Bangla-NLP-Dataset

Bangla NLP dataset. Bangla NER,POStag, text summarization, stopword, translate, sentiment analysis, wiki articles, root word, dataset etc.

MIT LicenseMIT

Bangla-NLP-Dataset

Bangla NLP dataset. This repository contains sbnltk datasets, which were used in Bangla nlp toolkit - sbnltk . Also , Existing Datasets are being listed here!

OUR DATASET IS IN LFS MODE! SO YOU HAVE TO CLONE IT FOR GETTING DATA!

WE WILL SOON UPLOAD ALL DEEP LEARNING BASED DATASETS!

sbnltk Dataset List(DUMP & HUMAN Evaluated)(sbnltk Dataset)

  • Bangla Number List drive
  • Bangla root word List drive
  • Bangla Word List (highest to lowest occurrence) drive
  • Bangla Wiki Dump word drive
  • Bangla POStag static dataset(single word) drive
  • Bangla NER Static Dataset(single word) drive
  • Bangla Stop word list drive
  • Bangla Dump Pos tag drive
  • Bangla Dump question Classification Dataset drive
  • Bangla Dump Sentiment Analysis drive
  • Google Translation Dataset drive
  • NER Existing Dataset(Modified + adding Date entity) drive
  • News Article Dataset drive
  • POS tag converted Data drive
  • POS tag human evaluated Data drive
  • DUMP NER data (active and passive both) drive
  • DUMP NER data(active only) drive
  • Extractive Text Summarization github
  • Abstractive Text Summarization(newspaper) drive kaggle
  • News Article Classification(text Classification) drive kaggle
  • Topic Keywords classfication(keywords generator) drive kaggle

Paper

  • Text Summarization paper cite

EXISTING DATASET

I am not the owner of these following datasets. It's just a collection to find amazing peoples and their works Please give them support! Your support will encourage them to do more amazing things.

AWESOME DATASET SOURCES

NEWS ARTICLES AND DOCUMENTS

SPEECH TO TEXT / TEXT TO SPEECH

SENTIMENT ANALYSIS / SENTENCE CLASSIFICATION

BANGLA MACHINE TRANSLATION DATASET

BANGLA POSTAG DATASET

BANGLA NER DATASET

QUESTION ANSWERING DATASET

BANGLA TEXT SUMMARIZATION

BANGLA FAKE NEWS DETECTION

MISC

Motivation

Usage and Contribute