Pinned Repositories
csnli
Language identification and normalisation in code-switching data, tailored with a three-step decoding process
csnlp
Neural Stacking Dependency Parsers for Code-Switching texts
HI-EN-PTB
indic-trans
The project aims to add a state-of-the-art transliteration module for cross-transliteration among all Indian languages, including English.
indic-wx-converter
Python library for converting UTF to WX and vice-versa for Indian languages.
Kashmiri-Parsing-Pipeline
A dependency parsing pipeline for Kashmiri which includes a POS-tagger, a Chunker and an Intra-chunk Dependency Parser.
litcm
Language identification and transliteration tool for Indian-language code-mixed data.
polyglot-tokenizer
Tokenizer for the world's most spoken languages and for social media texts from Facebook, Twitter, etc.
wikiHowToImprove
wikiHowToImprove: A Resource and Analyses on Edits in Instructional Texts
irshadbhat's Repositories
irshadbhat/indic-wx-converter
Python library for converting UTF to WX and vice-versa for Indian languages.
irshadbhat/csnli
Language identification and normalisation in code-switching data, tailored with a three-step decoding process
irshadbhat/csnlp
Neural Stacking Dependency Parsers for Code-Switching texts
irshadbhat/wikiHowToImprove
wikiHowToImprove: A Resource and Analyses on Edits in Instructional Texts
irshadbhat/indic-trans
The project aims to add a state-of-the-art transliteration module for cross-transliteration among all Indian languages, including English.
irshadbhat/polyglot-tokenizer
Tokenizer for the world's most spoken languages and for social media texts from Facebook, Twitter, etc.
irshadbhat/HI-EN-PTB
irshadbhat/large-qa-datasets
A collection of large question answering datasets
irshadbhat/wikiHow_MoRR
Data and code for the EMNLP 2020 paper "Towards Modeling Revision Requirements in wikiHow Instructions".
irshadbhat/awesome-sentiment-analysis
Repository with everything necessary for sentiment analysis and related areas
irshadbhat/bert_score
BERT score for text generation
irshadbhat/corrsim
Code for the papers: Correlation Coefficients and Semantic Textual Similarity, NAACL-HLT 2019 & Correlations between Word Vector Sets, EMNLP-IJCNLP 2019.
irshadbhat/deepcut
A Thai word tokenization library using a deep neural network
irshadbhat/EMNLP2019-Split-And-Recombine
Code for the EMNLP 2019 paper "A Split-and-Recombine Approach for Follow-up Query Analysis"
irshadbhat/irshadbhat
Config files for my GitHub profile.
irshadbhat/irshadbhat.github.io
Build a Jekyll blog in minutes, without touching the command line.
irshadbhat/lilac
Curate better data for LLMs
irshadbhat/mendable-nextjs-chatbot
Next.js Starter Template for building chatbots with Mendable
irshadbhat/NER-pytorch
LSTM+CRF NER
irshadbhat/OpenNMT-py
Open-Source Neural Machine Translation in PyTorch http://opennmt.net/
irshadbhat/parser
A collection of state-of-the-art syntactic parsing models based on Biaffine Parser.
irshadbhat/pointer_summarizer
PyTorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"
irshadbhat/python-jamo
Hangul syllable decomposition and synthesis using jamo.
irshadbhat/seq2seq-summarizer
Pointer-generator reinforced seq2seq summarization in PyTorch
irshadbhat/spark
Mirror of Apache Spark
irshadbhat/thai-named-entity
Thai Named Entity List
irshadbhat/transformer-pointer-generator
An Abstractive Summarization Implementation with Transformer and Pointer-generator
irshadbhat/UD_Hindi_English-HIENCS
irshadbhat/w2c_wiki_embeddings
Word embeddings for concatenated W2C and Wikipedia data
irshadbhat/wiki_embeddings
Word embeddings from Wikipedia text