AtlasIA

AtlasIA is an open-source initiative aimed at collecting the largest Moroccan Darija dataset for Darija ⇆ English translation. This platform provides a user-friendly interface for collecting and storing data, making it accessible to the public to drive advancements in the Natural Language Processing (NLP) field for Moroccan Darija.

Features

  • Data Collection: Users can contribute to the dataset by providing Darija ⇆ English translations through the platform's intuitive interface.

  • Open Source: AtlasIA is built on open-source technologies, allowing developers to contribute, modify, and extend its functionalities.

  • Large Language Model (LLM) Development: The collected dataset serves as the foundation for developing the first Moroccan Darija Large Language Model, enabling more accurate and context-aware translations.

Technologies Used

  • Frontend: React.js is used for building the frontend of the web application. It provides a fast, interactive, and responsive user interface.

  • Styling: Tailwind CSS is utilized for styling components, offering a utility-first approach for creating custom designs quickly.

Getting Started

To run the AtlasIA web application locally, follow these steps:

  1. Clone this repository to your local machine.
  2. Install the required dependencies by running npm install in the project directory.
  3. Start the development server by running npm start.
  4. Access the web application at http://localhost:3000 in your web browser.

Contributing

Contributions to AtlasIA are welcome! If you'd like to contribute to the project, please follow these guidelines:

  • Fork the repository and create a new branch for your feature or bug fix.
  • Ensure your code adheres to the project's coding style and conventions.
  • Test your changes thoroughly and ensure they don't introduce any regressions.
  • Submit a pull request detailing the changes you've made.