quRan
An R package for the full text of the Qur'an
This package contains the complete text of the Qur'an in Arabic (with and without vowels) and in English (the Yusuf Ali and Saheeh International translations).
The text for each volume or translation was sourced from Tanzil and Al Quran Cloud.
Each volume is in a data frame with a row for each ayah, along with 18 columns of detailed metadata (including ayah number, surah number, surah name in Arabic and English, juz', hizb, and so on), ready for text analysis.
The package contains:
quran_ar
: The Qur'an in Arabic (with vowels)quran_ar_min
: The Qur'an in Arabic (without vowels)quran_en_yusufali
: The Yusuf Ali English translation of the Qur'anquran_en_sahih
: The Saheeh International English translation of the Qur'an
Surah titles in Arabic, transliterated English, and translated English are all factors to preserve the correct order.
Installation
This package is not released on CRAN yet. You can install the development version from Github:
library(remotes)
install_github("andrewheiss/quRan")
library(quRan)
Usage
For some ideas on getting started with analyzing these texts, see Text Mining with R by Julia Silge and David Robinson, or Julia Silge's blog post on sentiment analysis of Jane Austen's novels. For help within R, try ?quran_ar
or similar for getting started with the datasets.
Here's how to do some basic data manipulation with the data:
library(dplyr)
library(quRan)
quran_ar %>%
group_by(surah_title_en) %>%
summarize(total_ayat = n())
#> # A tibble: 114 x 2
#> surah_title_en total_ayat
#> <fct> <int>
#> 1 Al-Faatiha 7
#> 2 Al-Baqara 286
#> 3 Aal-i-Imraan 200
#> 4 An-Nisaa 176
#> 5 Al-Maaida 120
#> 6 Al-An'aam 165
#> 7 Al-A'raaf 206
#> 8 Al-Anfaal 75
#> 9 At-Tawba 129
#> 10 Yunus 109
#> # ... with 104 more rows
quran_ar %>%
filter(surah_title_en == "Al-Faatiha") %>%
select(surah, ayah, text)
#> # A tibble: 7 x 3
#> surah ayah text
#> <int> <int> <chr>
#> 1 1 1 بِسْمِ اللَّهِ الرَّحْمَٰنِ الرَّحِيمِ
#> 2 1 2 الْحَمْدُ لِلَّهِ رَبِّ الْعَالَمِينَ
#> 3 1 3 الرَّحْمَٰنِ الرَّحِيمِ
#> 4 1 4 مَالِكِ يَوْمِ الدِّينِ
#> 5 1 5 إِيَّاكَ نَعْبُدُ وَإِيَّاكَ نَسْتَعِينُ
#> 6 1 6 اهْدِنَا الصِّرَاطَ الْمُسْتَقِيمَ
#> 7 1 7 صِرَاطَ الَّذِينَ أَنْعَمْتَ عَلَيْهِمْ غَيْرِ الْمَغْضُوبِ عَلَيْهِمْ وَلَا الضَّالِّينَ
Code of Conduct
This project is released with a Contributor Code of Conduct. By participating in this project you agree to abide by its terms.
Credits
The text and translations are all open source or licensed under Creative Commons by Tanzil and Al Quran Cloud. Complete details are found in the LICENSE file.
Fatihah background in hex icon from seeklogo.