language-resources

There are 187 repositories under language-resources topic.

  • neologd/mecab-ipadic-neologd

    Neologism dictionary based on the language resources on the Web for mecab-ipadic

    Language:Shell2.8k12261291
  • RichardLitt/low-resource-languages

    Resources for conservation, development, and documentation of low resource (human) languages.

    Language:TeX4243510159
  • telegram-zhCN/telegram-language-resources

    Source strings and zh-CN translate resources of Telegram

    Language:Python12211060
  • mreichhoff/HanziGraph

    A webapp to visualize relationships among Chinese characters and to see example sentences that illustrate their use. Also available for Japanese learners.

    Language:JavaScript112495
  • neologd/mecab-unidic-neologd

    Neologism dictionary based on the language resources on the Web for mecab-unidic

    Language:Shell876311
  • kbatsuren/CogNet

    CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates

  • motazsaad/tweets-collector

    Collect tweets (tweets corpus) using Twitter API. Collection can be based on hashtags, keywords, geographical location

    Language:Python254216
  • giellalt/lang-fao

    Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Faroese language

    Language:Text1823131
  • UnitexGramLab/unitex-lingua

    Unitex/GramLab Language Resources

    Language:HTML1810126
  • giellalt/lang-crk

    Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language

    Language:Text1624341
  • kimkim00/UIT-ViSD4SA

    ViSD4SA, a Vietnamese Span Detection for Aspect-based sentiment analysis dataset

  • singletongue/japanese-bert

    BERT models with tokenization for Japanese texts.

    Language:Python14311
  • giellalt/lang-kal

    Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Kalaallisut (Greenlandic) language

    Language:Text122763
  • motazsaad/emotion-lexicon

    Arabic - English emotion lexicon

  • czcorpus/wag

    WaG - install your own word profile generator out of diverse data resources

    Language:TypeScript952202
  • blue32a/laravel-language-ja

    Japanese language resources for Laravel. (Laravelの日本語リソース)

    Language:PHP8104
  • giellalt/lang-fin

    Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Finnish language

    Language:Text82491
  • giellalt/lang-kpv

    Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Komi-Zyrian language

    Language:Text824100
  • giellalt/lang-sme

    Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Northern Sami language

    Language:Text8205311
  • giellalt/lang-srs

    Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Tsuut'ina (Sarsi) language

    Language:Text82641
  • hermes

    CoEDL/hermes

    :speech_balloon: Cross-platform application for the creation of language resources from ELAN linguistic analysis files, or from scratch.

    Language:Python7152
  • KurdishBLARK/KTC

    Kurdish Textbooks Corpus

  • AlexW00/tandem-gpt

    A virtual tandem partner to practice new vocab/grammar with

    Language:TypeScript6203
  • GiellaLT-Archive/giella-shared

    Shared linguistic resources, like names, digits, fst filtering and dependency parsing.

    Language:Rich Text Format62220
  • sonu-gupta/tosdr-terms-of-service-corpus

    This repository contains python code to create a corpus of 12,215 terms of service documents scraped from TOSDR, intended for legal, privacy, and natural language processing research.

    Language:HTML6101
  • ufal/universal-segmentations

    Build scripts for the UniSegments collection of morphologically segmented lexicons for many languages

    Language:Python690
  • giellalt/lang-rus

    Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Russian language

    Language:Text525231
  • giellalt/lang-smn

    Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Inari Sami language

    Language:Text42291
  • giellalt/lang-sms

    Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Skolt Sami language

    Language:HTML423140
  • CoolCat467/Localization-Translation-Utility

    Script for simplifying the process of translating MineOS Language (.lang) files

    Language:Python3101
  • Dugong-Chinese/chinese-resource-app

    This is a web application that will serve to be the community-driven go-to site for finding Chinese resources and learning Mandarin.

    Language:Python3212
  • giellalt/lang-est-x-utee

    Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Estonian language

    Language:Text325101
  • giellalt/lang-khk

    Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Halh Mongolian language

    Language:Text32511
  • lukyjanek/universal-derivations

    The scripts for compiling the Universal Derivations collections of harmonised word-formation resources for multiple langugaes.

    Language:Python3104
  • martin-he543/classics-cheat-sheets

    Summary grammar and modified DVLs for OCR's Classical Greek (9-1), Latin (9-1) GCSEs, from the 2016 syllabi. Used as part of educational resources in Tiffin School and the Kingston Academy.

  • mreichhoff/FreqBurger

    Example sentences, sankey flow diagrams, and frequency graphs for language learners.

    Language:JavaScript3201