language-resources
There are 187 repositories under language-resources topic.
neologd/mecab-ipadic-neologd
Neologism dictionary based on the language resources on the Web for mecab-ipadic
RichardLitt/low-resource-languages
Resources for conservation, development, and documentation of low resource (human) languages.
telegram-zhCN/telegram-language-resources
Source strings and zh-CN translate resources of Telegram
mreichhoff/HanziGraph
A webapp to visualize relationships among Chinese characters and to see example sentences that illustrate their use. Also available for Japanese learners.
neologd/mecab-unidic-neologd
Neologism dictionary based on the language resources on the Web for mecab-unidic
kbatsuren/CogNet
CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates
motazsaad/tweets-collector
Collect tweets (tweets corpus) using Twitter API. Collection can be based on hashtags, keywords, geographical location
giellalt/lang-fao
Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Faroese language
UnitexGramLab/unitex-lingua
Unitex/GramLab Language Resources
giellalt/lang-crk
Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language
kimkim00/UIT-ViSD4SA
ViSD4SA, a Vietnamese Span Detection for Aspect-based sentiment analysis dataset
singletongue/japanese-bert
BERT models with tokenization for Japanese texts.
giellalt/lang-kal
Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Kalaallisut (Greenlandic) language
motazsaad/emotion-lexicon
Arabic - English emotion lexicon
czcorpus/wag
WaG - install your own word profile generator out of diverse data resources
blue32a/laravel-language-ja
Japanese language resources for Laravel. (Laravelの日本語リソース)
giellalt/lang-fin
Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Finnish language
giellalt/lang-kpv
Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Komi-Zyrian language
giellalt/lang-sme
Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Northern Sami language
giellalt/lang-srs
Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Tsuut'ina (Sarsi) language
CoEDL/hermes
:speech_balloon: Cross-platform application for the creation of language resources from ELAN linguistic analysis files, or from scratch.
KurdishBLARK/KTC
Kurdish Textbooks Corpus
AlexW00/tandem-gpt
A virtual tandem partner to practice new vocab/grammar with
GiellaLT-Archive/giella-shared
Shared linguistic resources, like names, digits, fst filtering and dependency parsing.
sonu-gupta/tosdr-terms-of-service-corpus
This repository contains python code to create a corpus of 12,215 terms of service documents scraped from TOSDR, intended for legal, privacy, and natural language processing research.
ufal/universal-segmentations
Build scripts for the UniSegments collection of morphologically segmented lexicons for many languages
giellalt/lang-rus
Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Russian language
giellalt/lang-smn
Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Inari Sami language
giellalt/lang-sms
Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Skolt Sami language
CoolCat467/Localization-Translation-Utility
Script for simplifying the process of translating MineOS Language (.lang) files
Dugong-Chinese/chinese-resource-app
This is a web application that will serve to be the community-driven go-to site for finding Chinese resources and learning Mandarin.
giellalt/lang-est-x-utee
Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Estonian language
giellalt/lang-khk
Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Halh Mongolian language
lukyjanek/universal-derivations
The scripts for compiling the Universal Derivations collections of harmonised word-formation resources for multiple langugaes.
martin-he543/classics-cheat-sheets
Summary grammar and modified DVLs for OCR's Classical Greek (9-1), Latin (9-1) GCSEs, from the 2016 syllabi. Used as part of educational resources in Tiffin School and the Kingston Academy.
mreichhoff/FreqBurger
Example sentences, sankey flow diagrams, and frequency graphs for language learners.