Pinned Repositories
ARBML
Implementations of many Arabic NLP and CV projects, with a real-time experience through web, command-line, and notebook interfaces.
Calliar
A dataset for online Arabic calligraphy. A collection of 2,500 annotated calligraphic styles.
CIDAR
An instruction dataset for Arabic with 10,000 instruction-output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.
klaam
Arabic speech recognition, classification and text-to-speech.
masader
The largest public catalogue of Arabic NLP and speech datasets. It lists more than 500 datasets, annotated with more than 25 attributes.
nmatheg
A simple strategy for training and fine-tuning NLP models for Arabic: specify the parameters and wait for the results. The design makes use of the different tools in our NLP pipeline.
qawafi
A platform for Arabic poetry analysis using knowledge-based and deep learning approaches.
tkseem
An Arabic tokenization library that provides many tokenization algorithms.
tnkeeh
A library for cleaning, normalizing, and segmenting Arabic text.
whisperar
ARBML's Repositories
ARBML/ARBML
Implementations of many Arabic NLP and CV projects, with a real-time experience through web, command-line, and notebook interfaces.
ARBML/klaam
Arabic speech recognition, classification and text-to-speech.
ARBML/masader
The largest public catalogue of Arabic NLP and speech datasets. It lists more than 500 datasets, annotated with more than 25 attributes.
ARBML/Calliar
A dataset for online Arabic calligraphy. A collection of 2,500 annotated calligraphic styles.
ARBML/tkseem
An Arabic tokenization library that provides many tokenization algorithms.
ARBML/tnkeeh
A library for cleaning, normalizing, and segmenting Arabic text.
ARBML/whisperar
ARBML/CIDAR
An instruction dataset for Arabic with 10,000 instruction-output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.
ARBML/qawafi
A platform for Arabic poetry analysis using knowledge-based and deep learning approaches.
ARBML/Ashaar
Arabic poetry analysis and generation.
ARBML/nmatheg
A simple strategy for training and fine-tuning NLP models for Arabic: specify the parameters and wait for the results. The design makes use of the different tools in our NLP pipeline.
ARBML/Taqyim
A Python interface for evaluating ChatGPT models.
ARBML/rasm
Arabic art using GANs.
ARBML/bayanat
Explore the content of Arabic text datasets.
ARBML/dar
A simple semi-supervised approach for creating Hugging Face dataset loading scripts and uploading them to the Hub.
ARBML/CIDAR-v2
ARBML/masader-webservice
ARBML/adawat
ARBML/CalliarGen
ARBML/evals
Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks.
ARBML/rouge_score_ar
ARBML/.github
ARBML/Bohour
Bohour, a package that abstracts Aroud, the science of Arabic poetry.
ARBML/cidar_human_eval
ARBML/arbml.github.io
ARBML/atmatah
A repository containing scripts that automate processes, for instance configuring web apps on remote machines.
ARBML/masader_bot
ARBML/masader_form
ARBML/mat-bpe
ARBML/ProceduralCalliar