The supplementary files, datasets and code repositories for our paper titled: Chatbot Model Development Using BERT for West Sumatra Halal Tourism Information.
Irmasari Hafidz, Bayu Siddhi Mukti, Qudsiyah Zahra Ilham Naseela , Ahmadhian Daffa Yudistira, I Putu Adhitya Pratama Mangku Purnama, Nurul Fajrin Ariyani, Hanim Maria Astuti, Aris Tjahyanto
Link https://journal.its.ac.id/index.php/hr/article/view/1819 Hafidz, I., Mukti, B.S., Naseela, Q.Z.I., Yudistira, A.D., Purnama, I.P.A.P.M., Ariyani, N.F., Astuti, H.M. and Tjahyanto, A., 2024. Chatbot Model Development Using BERT for West Sumatera Halal Tourism Information. Halal Research Journal, 4(2), pp.117-131.
- Python 3.8 or higher
- Jupyter Notebook
- Google Colab (optional, for running the notebook in the cloud)
-
Clone the Repository
git clone https://github.com/irhafidz/2024chatbot_halaltourism_WestSumatra.git cd 2024chatbot_halaltourism_WestSumatra
-
Set Up a Virtual Environment
python -m venv venv source venv/bin/activate # On Windows use `venv\Scripts\activate`
-
Install Dependencies
pip install -r requirements.txt
-
Download and Place the Dataset
- Ensure the
data
folder from the repository contains the necessary dataset files for training the chatbot model.
- Ensure the
-
Open Jupyter Notebook
jupyter notebook
-
Navigate to the Notebook
- Open the
chatbot.ipynb
file in your Jupyter Notebook interface.
- Open the
-
Run the Cells
- Execute the cells in the notebook sequentially to train the chatbot model and test its functionality.
-
Open the Notebook in Google Colab
-
Upload the Dataset
- Upload the dataset from the
data
folder to the Colab environment.
- Upload the dataset from the
-
Run the Cells
- Execute the cells in the notebook sequentially to train the chatbot model and test its functionality.
-
data/
: Folder containing the dataset files for training the chatbotchatlist.csv: Raw datasets that consist questions and pre-defined labels (9 labels, 125 questions for each label (1125 in total))
clean.csv: Dataset that has been cleaned and split for training and testing (80% training and 20% testing)
answer.json: Dataset containing answer choices for each label
testing.csv: Raw datasets that consist questions and pre-defined labels to be tested
-
LICENSE:
License to this project. -
README.md
: Project documentation file. -
chatbot.ipynb
: Jupyter Notebook for training and testing the chatbot model. -
requirements.txt
: List of dependencies required for the project.
We welcome contributions to this project. Please follow these steps to contribute:
- Fork the repository.
- Create a new branch.
- Make your changes.
- Submit a pull request.
This project is licensed under the MIT License. See the LICENSE file for more details.