Department of Linguistics, K.M. Institute of Hindi and Linguistics

Linguistics Department @Agra

India

Pinned Repositories

bodo
This repository contains all the resources (corpora) of Bodo and tools that were developed for creating and managing these resources
hindi-politeness
indianlr.github.io
A repository listing resources and technologies for non-scheduled and endangered Indian languages; it also serves the project website
Language: HTML
kmi-linguistics.github.io
Research and Development at the Department of Linguistics in K.M. Institute of Hindi and Linguistics at Dr. Bhim Rao Ambedkar University, Agra
Language: HTML
magahi
This repository contains all the data, tools, applications and publications related to Magahi, an Indo-Aryan language
Language: Java
mscrabble
Repository for the Multilingual Scrabble Generator and Games, aimed especially at endangered languages
Language: JavaScript
propaganda
Repository of the data and models generated by Mr. Shyam Ratan as part of his MPhil dissertation titled 'Automatic Detection Of Propaganda In Hindi On Social Media'
Language: Jupyter Notebook
SpeeD-IA
Repository for different Speech Datasets and Models for Indo-Aryan languages prepared by the Department under different projects
trac-1
Repository hosting the dataset for the Shared Task on Aggression Identification at the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-1) at COLING 2018. Please visit the workshop website - https://sites.google.com/view/trac1/home - for more details
vardial2018
This repository contains the dataset used for the Indo-Aryan Language Identification Shared Task, part of the Evaluation Campaign at the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial) at COLING 2018. It has 15k sentences each in Awadhi, Bhojpuri, Braj, Magahi and Hindi

Department of Linguistics, K.M. Institute of Hindi and Linguistics's Repositories

kmi-linguistics/trac-1
Repository hosting the dataset for the Shared Task on Aggression Identification at the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-1) at COLING 2018. Please visit the workshop website - https://sites.google.com/view/trac1/home - for more details
kmi-linguistics/vardial2018
This repository contains the dataset used for the Indo-Aryan Language Identification Shared Task, part of the Evaluation Campaign at the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial) at COLING 2018. It has 15k sentences each in Awadhi, Bhojpuri, Braj, Magahi and Hindi
kmi-linguistics/magahi
This repository contains all the data, tools, applications and publications related to Magahi, an Indo-Aryan language
Language: Java
kmi-linguistics/bodo
This repository contains all the resources (corpora) of Bodo and tools that were developed for creating and managing these resources
kmi-linguistics/hindi-politeness
kmi-linguistics/mscrabble
Repository for the Multilingual Scrabble Generator and Games, aimed especially at endangered languages
Language: JavaScript
kmi-linguistics/bhojpuri
Resources and Technologies for Bhojpuri
Language: Python
kmi-linguistics/braj
Repository for all code, data and resources on Braj Bhasha that are being developed at the Institute.
Language: Python
kmi-linguistics/ComMA
kmi-linguistics/indianlr.github.io
A repository listing resources and technologies for non-scheduled and endangered Indian languages; it also serves the project website
Language: HTML
kmi-linguistics/kmi-linguistics.github.io
Research and Development at the Department of Linguistics in K.M. Institute of Hindi and Linguistics at Dr. Bhim Rao Ambedkar University, Agra
Language: HTML
kmi-linguistics/NLP
Natural Language Processing R&D @K.M. Institute of Hindi and Linguistics
kmi-linguistics/propaganda
Repository of the data and models generated by Mr. Shyam Ratan as part of his MPhil dissertation titled 'Automatic Detection Of Propaganda In Hindi On Social Media'
Language: Jupyter Notebook
kmi-linguistics/SpeeD-IA
Repository for different Speech Datasets and Models for Indo-Aryan languages prepared by the Department under different projects
kmi-linguistics/awadhi
Repository for all code, data and resources on the Awadhi language that are being developed at the Institute. Currently, it contains all the data generated as part of the M.Phil. dissertation of Mr. Abdul Basit.
kmi-linguistics/Code-mixing
kmi-linguistics/crawlers
Language: Python
kmi-linguistics/indianlr
A repository of language resources and technologies for non-scheduled and endangered Indian languages
kmi-linguistics/sigtyp2020
This repository contains code and details of the KMI-Panlingua-IITKGP system submitted to the SigTyp 2020 Shared Task on Prediction of Linguistic Features. It can be used for training and prediction on any new dataset in the same format with similar information.
kmi-linguistics/speech-aggression
Repository of data and scripts of the UGC-UKIERI Project on "Automatic Detection of Verbal Threat in Hindi and English Aggressive Speech"
Language: Shell
kmi-linguistics/taluitew
Repository for all data and resources on Taluitew, a Tibeto-Burman language of the Naga group spoken in parts of Manipur, that are being developed at the Institute. Currently, it contains all the data generated as part of the M.Phil. dissertation of Mr. Chingrimung Lungleng.
kmi-linguistics/text-aggression
This is the repository of the work carried out as part of The Aggression Project at the Microsoft Research India Summer Workshop on Artificial Social Intelligence in June 2017. The repository contains all code and datasets generated during the workshop.
Language: Python
kmi-linguistics/trac-2
Repository hosting the dataset for the Shared Task on Aggression and Misogyny Identification at the Second Workshop on Trolling, Aggression and Cyberbullying (TRAC-2) at LREC 2020. Please visit the workshop website - https://sites.google.com/view/trac2/shared-task - for more details
kmi-linguistics/western-hindi
Repository for all data and resources on Western Hindi that are being developed at the Institute. Currently, it contains all the data generated as part of the M.Phil. dissertation of Ms. Saba Parween.
