MongoDB is a document database designed for highly accessible and scalable web applications. Its flexible schema technique is popular among agile development teams, enabling quick application launches without the hassle of setting up a traditional relational database. MongoDB's document-oriented architecture simplifies data storage for structured and unstructured data, with a format similar to JSON.
This project focuses on exploring MongoDB, a NoSQL database, its basic architecture, data modeling, querying, aggregations, and tools for tasks like Data Analytics, ETL, and Data Visualization.
The project uses a transportation dataset, including information about truck drivers, routes, cities, trucks, and historical truck schedules. The dataset is organized into several tables:
-
driver_details
driver_id
: Unique identifier for each drivername
: Name of the truck drivergender
: Gender of the truck driverage
: Age of the truck driverexperience
: Experience of the truck driver in yearsdriving_style
: Driving style of the truck driver (conservative or proactive)ratings
: Average rating of the truck driver on a scale of 1 to 10vehicle_no
: Number of the driver’s truckaverage_speed_mph
: Average speed of the truck driver in miles per hour
-
truck_details
truck_id
: Unique identification number of the trucktruck_age
: Age of the truck in yearsload_capacity_pounds
: Loading capacity of the truck in poundsmileage_mpg
: Mileage of the truck in miles per gallonfuel_type
: Fuel type of the truck
-
city_details
city_id
: Unique identification number of the citycity_name
: Name of the citylat
: Latitudelon
: Longitude
-
route_details
route_id
: Unique identifier of the routesorigin_city_id
: City identification number for the origin citydestination_city_id
: City identification number for the destinationdistance(Miles)
: Distance between the origin and destination cities in milesaverage_hours
: Average time needed to travel from the origin to the destination in hours
-
truck_schedule_data
truck_id
: Unique identifier of the truckroute_id
: Unique identifier of the routedeparture_date
: Departure DateTime of the truckestimated_arrival
: Estimated arrival DateTime of the truckdelay
: Binary variable indicating if the truck’s arrival was delayed (0 for on-time arrival, 1 for delayed arrival)
- Language:
Python
- Package:
PyMongo
- Services:
MongoDB Atlas
,MongoDB Compass