French Canadian complexity level corpus (FCCLC)

This repository contains the data used to train the models in our article "Quantifying French Document Complexity".

We created all of the data in this repository.

Download the Data

Our dataset is hosted in this repository as a zip archive. You can download it manually from the repository page, or with wget as follows:

wget https://github.com/GRAAL-Research/FCCLC/raw/main/FCCLC.zip
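For scripted pipelines, the same download can be done from Python. The following is a minimal sketch using only the standard library; the function names `download_archive` and `extract_archive` and the default output paths are illustrative assumptions, not part of this repository, and nothing is assumed about the files inside the archive.

```python
import urllib.request
import zipfile
from pathlib import Path

# Same URL as the wget command above.
FCCLC_URL = "https://github.com/GRAAL-Research/FCCLC/raw/main/FCCLC.zip"


def download_archive(url: str = FCCLC_URL, dest: str = "FCCLC.zip") -> Path:
    """Download the zip archive to `dest` and return its local path."""
    local_path, _headers = urllib.request.urlretrieve(url, dest)
    return Path(local_path)


def extract_archive(archive, out_dir="FCCLC"):
    """Unzip `archive` into `out_dir` and return the sorted member names."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(archive) as zf:
        zf.extractall(out)
        return sorted(zf.namelist())
```

Calling `extract_archive(download_archive())` would fetch the archive into the current directory, unpack it into a hypothetical `FCCLC/` folder, and return the names of the files it contains.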

Cite the Dataset

If you use the data provided in this repository, please cite us using the following:

@article{Primpied2022Quantifying,
	author = {Primpied, Vincent and Beauchemin, David and Khoury, Richard},
	journal = {Proceedings of the Canadian Conference on Artificial Intelligence},
	year = {2022},
	month = {may 27},
	note = {https://caiac.pubpub.org/pub/iaeeogod},
	publisher = {Canadian Artificial Intelligence Association (CAIAC)},
	title = {Quantifying {French} {Document} {Complexity}},
}

License

This dataset is released under the MIT License.

Dataset Metadata

The following table is necessary for this dataset to be indexed by search engines such as Google Dataset Search.

property         value
name             French Canadian complexity level corpus
alternateName    FCCLC
url
description      FCCLC is an annotated dataset of different French documents with their associated complexity level on a grading scale.
creator
    property     value
    name         Vincent Primpied
    sameAs       https://scholar.google.com/citations?hl=en&user=HYfBQIoAAAAJ
    name         David Beauchemin
    sameAs       https://scholar.google.com/citations?hl=fr&user=ntoPgSUAAAAJ
    name         Richard Khoury
    sameAs       https://scholar.google.com/citations?user=9MrPtC0AAAAJ&hl=en&oi=ao
provider
    property     value
    name         GRAIL
    sameAs       https://grail.ift.ulaval.ca/
license
    property     value
    name         MIT
    url
citation         https://caiac.pubpub.org/pub/iaeeogod/release/1
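Dataset search engines can also read this kind of metadata as schema.org JSON-LD. The sketch below shows how the properties in the table above might map onto such a record. The `@context` and `@type` values (`Dataset`, `Person`, `Organization`, `CreativeWork`) and the function name `build_dataset_jsonld` are assumptions added for illustration; the two `url` properties are left out because the table gives no value for them.

```python
import json


def build_dataset_jsonld() -> dict:
    """Build a schema.org-style record from the metadata table values."""
    return {
        "@context": "https://schema.org/",  # assumed vocabulary
        "@type": "Dataset",                 # assumed type
        "name": "French Canadian complexity level corpus",
        "alternateName": "FCCLC",
        "description": (
            "FCCLC is an annotated dataset of different French documents "
            "with their associated complexity level on a grading scale."
        ),
        "creator": [
            {
                "@type": "Person",
                "name": "Vincent Primpied",
                "sameAs": "https://scholar.google.com/citations?hl=en&user=HYfBQIoAAAAJ",
            },
            {
                "@type": "Person",
                "name": "David Beauchemin",
                "sameAs": "https://scholar.google.com/citations?hl=fr&user=ntoPgSUAAAAJ",
            },
            {
                "@type": "Person",
                "name": "Richard Khoury",
                "sameAs": "https://scholar.google.com/citations?user=9MrPtC0AAAAJ&hl=en&oi=ao",
            },
        ],
        "provider": {
            "@type": "Organization",
            "name": "GRAIL",
            "sameAs": "https://grail.ift.ulaval.ca/",
        },
        # The license URL is empty in the table, so only the name is given.
        "license": {"@type": "CreativeWork", "name": "MIT"},
        "citation": "https://caiac.pubpub.org/pub/iaeeogod/release/1",
    }


def to_jsonld_string(record: dict) -> str:
    """Serialize the record as indented JSON, keeping non-ASCII characters."""
    return json.dumps(record, indent=2, ensure_ascii=False)
```

The string returned by `to_jsonld_string(build_dataset_jsonld())` could be embedded in a web page inside a `script` element of type `application/ld+json`, which is the usual way to expose such metadata to crawlers.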