andjc's Stars
unicode-org/cldr
The home of the Unicode Common Locale Data Repository
Denis2054/Transformers-for-NLP-2nd-Edition
Transformer models from BERT to GPT-4, environments from Hugging Face to OpenAI. Fine-tuning, training, and prompt engineering examples. A bonus section with ChatGPT, GPT-3.5-turbo, GPT-4, and DALL-E including jump starting GPT-4, speech-to-text, text-to-speech, text to image generation with DALL-E, Google Cloud AI,HuggingGPT, and more
ElizaLo/NLP-Natural-Language-Processing
Projects and useful articles / links
sinaahmadi/klpt
The Kurdish Language Processing Toolkit
libindic/Transliteration
Transliteration module for Indian Languages
ldo/qahirah
A more Pythonic binding for the Cairo graphics library
simoncozens/fontFeatures
Python library for manipulating OpenType font features
mhajiloo/persiantools
Jalali date and datetime with other tools
laurieburchell/open-lid-dataset
Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)
google/transliteration
Transliteration data and models
libris/librisxl
Libris XL
google-research-datasets/TF-IDF-IIF-top100-wordlists
These are lists for a variety of languages containing words that are distinctive to each language.
roozbehp/unicode-data
Temporary holding place for my suggestions for future version of Unicode data files. Report bugs to https://www.unicode.org/reporting.html. For Script Exemplars, report bugs by email.
silnrsi/font-scheherazade
Scheherazade is a general-purpose Arabic font including many characters needed for minority languages.
AI4LAM/TeachingAndLearning
A repository to organize materials from the AI4LAM Teach and Learning Working Group
encukou/czech-sort
Python tool for simple Czech alphabetization
lcnetdev/scriptshifter
ldo/qahirah_examples
Examples of usage of Qahirah Python binding for Cairo graphics
maximilianh/maxtools
Various command line tools, mostly for bioinformatics, for .tab,.bed,.maf plus various parsers
santhoshtr/wq
An experimental natural language based querying system for Wikipedia
ldo/pybidi
Python wrapper for FriBidi
raeytype/geez-handwriting-fonts
Geʾez Handwriting Fonts
ldo/python_freetype_examples
example uses of python_freetype
silnrsi/collation
Collation tools
devmarrie/ChatAfrica
Hello Motherland🤗, our go-to home of answered queries about Africa!
w3c/amlreq
Enabling the Web for languages of the Americas
moyogo/Designing-Latin-S
A closer look at the additional glyphs of Latin S character set.
PCC-Test/han-eacc-ucs
EACC mappings to Unicode
pwatters/100PointCyberCheck
100PointCyberCheck cyber assessment tool
TreeRex/han-eacc-ucs
Improved EACC to Unicode mappings