/16-hotel-reviews

This is an open source project for the stage E of the Hamoye Data Science Internship program, cohort 2020, with real life applications in the health, engineering, demography, education and technology.

Primary LanguageJupyter Notebook

515K Hotel Reviews Data in Europe

Can you make your own trip more cozy by using data science?

Content

The csv file contains 17 fields. The description of each field is as below:

1.Hotel_Address: Address of hotel.

  1. Review_Date: Date when reviewer posted the corresponding review.

  2. Average_Score: Average Score of the hotel, calculated based on the latest comment in the last year.

  3. Hotel_Name: Name of Hotel

  4. Reviewer_Nationality: Nationality of Reviewer

  5. Negative_Review: Negative Review the reviewer gave to the hotel. If the reviewer does not give the negative review, then it should be: 'No Negative'

  6. ReviewTotalNegativeWordCounts: Total number of words in the negative review.

  7. Positive_Review: Positive Review the reviewer gave to the hotel. If the reviewer does not give the negative review, then it should be: 'No Positive'

  8. ReviewTotalPositiveWordCounts: Total number of words in the positive review.

  9. Reviewer_Score: Score the reviewer has given to the hotel, based on his/her experience

  10. TotalNumberofReviewsReviewerHasGiven: Number of Reviews the reviewers has given in the past.

  11. TotalNumberof_Reviews: Total number of valid reviews the hotel has.

  12. Tags: Tags reviewer gave the hotel.

  13. dayssincereview: Duration between the review date and scrape date.

  14. AdditionalNumberof_Scoring: There are also some guests who just made a scoring on the service rather than a review. This number indicates how many valid scores without review in there.

  15. lat: Latitude of the hotel

  16. lng: longtitude of the hotel

Here is a link to the dataset