AtlasIA is an open-source initiative aimed at collecting the largest Moroccan Darija dataset for Darija ⇆ English translation. This platform provides a user-friendly interface for collecting and storing data, making it accessible to the public to drive advancements in the Natural Language Processing (NLP) field for Moroccan Darija.
-
Data Collection: Users can contribute to the dataset by providing Darija ⇆ English translations through the platform's intuitive interface.
-
Open Source: AtlasIA is built on open-source technologies, allowing developers to contribute, modify, and extend its functionalities.
-
Large Language Model (LLM) Development: The collected dataset serves as the foundation for developing the first Moroccan Darija Large Language Model, enabling more accurate and context-aware translations.
-
Frontend: React.js is used for building the frontend of the web application. It provides a fast, interactive, and responsive user interface.
-
Styling: Tailwind CSS is utilized for styling components, offering a utility-first approach for creating custom designs quickly.
To run the AtlasIA web application locally, follow these steps:
- Clone this repository to your local machine.
- Install the required dependencies by running
npm install
in the project directory. - Start the development server by running
npm start
. - Access the web application at
http://localhost:3000
in your web browser.
Contributions to AtlasIA are welcome! If you'd like to contribute to the project, please follow these guidelines:
- Fork the repository and create a new branch for your feature or bug fix.
- Ensure your code adheres to the project's coding style and conventions.
- Test your changes thoroughly and ensure they don't introduce any regressions.
- Submit a pull request detailing the changes you've made.