Ride-Sharing Platform Synthetic Data: Users, Drivers, Vehicles, Rides, and Ratings
Description
This dataset contains synthetic data representing a ride-sharing platform. It includes simulated information about:
- Users: Details about the customers using the ride-sharing service.
- Drivers: Information about drivers providing the rides.
- Vehicles: Data on the vehicles used for the rides.
- Rides: Records of each ride taken, including start/end locations, distance, and fare.
- Ratings: User ratings and feedback for each ride.
The data is generated to simulate a realistic ride-sharing ecosystem and is intended for analysis, modeling, and testing.
Dataset Contents
The dataset is divided into the following five CSV files:
- users.csv: Contains information about users, including user ID, name, email, phone number, registration date, age, gender, and location.
- drivers.csv: Includes details about drivers such as driver ID, name, vehicle ID, rating, total rides, and availability.
- vehicles.csv: Lists vehicle details like vehicle ID, make, model, year, and capacity.
- rides.csv: Contains ride information including ride ID, user ID, start location, end location, start/end time, distance (km), fare amount, and driver ID.
- ratings.csv: Includes ride ratings and user comments, with each rating linked to a ride and a user.
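Because the files reference each other by ID, a quick referential check is a reasonable first step after loading. The following is a minimal sketch using only the Python standard library. The column names (`driver_id`, `ride_id`, `distance_km`, `fare_amount`, and so on) are assumptions made for illustration; check the actual CSV headers before relying on them. Small inline samples stand in for the real files.

```python
import csv
import io


def read_rows(handle):
    """Parse a CSV file-like object into a list of dicts keyed by header name."""
    return list(csv.DictReader(handle))


def dangling_keys(child_rows, parent_rows, key):
    """Return the sorted values of `key` in child_rows with no match in parent_rows."""
    known = {row[key] for row in parent_rows}
    return sorted({row[key] for row in child_rows if row[key] not in known})


# Inline samples standing in for drivers.csv and rides.csv.
# Column names here are assumed, not taken from the real files.
drivers = read_rows(io.StringIO(
    "driver_id,name,vehicle_id\n"
    "D1,Ana,V1\n"
    "D2,Ben,V2\n"
))
rides = read_rows(io.StringIO(
    "ride_id,user_id,driver_id,distance_km,fare_amount\n"
    "R1,U1,D1,4.2,9.50\n"
    "R2,U2,D3,1.0,4.00\n"
))

# Rides whose driver_id does not appear in the drivers table.
missing = dangling_keys(rides, drivers, "driver_id")
print(missing)
```

With the real files, the same functions could be applied by opening each file with `open("rides.csv", newline="", encoding="utf-8")` and passing the handle to `read_rows`. For a file the size of rides.csv, iterating over `csv.DictReader` row by row instead of building a list would keep memory use down.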
Data Generation
The data was generated with Python's Faker library and randomized to produce a realistic dataset with:
- 100,000 users
- 3,000 drivers
- 3,000 vehicles
- 5,000,000 rides
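The original generation script is not included here, so the following is only an illustrative sketch of how linked tables of this shape could be produced. It uses the standard library's `random` module as a stand-in for Faker, which would typically supply the names, emails, and phone numbers. The field names, the one-vehicle-per-driver pairing, and the fare rule are all assumptions, and the counts are scaled down while keeping the same ratios as above.

```python
import random


def generate(n_users, n_drivers, n_rides, seed=0):
    """Build small linked tables; every ride points at an existing user and driver."""
    rng = random.Random(seed)  # seeded so repeated runs give the same data

    users = [{"user_id": f"U{i}", "age": rng.randint(18, 80)} for i in range(n_users)]

    # Assumption: one vehicle per driver, suggested by the equal counts above.
    vehicles = [
        {"vehicle_id": f"V{i}", "capacity": rng.choice([4, 6])}
        for i in range(n_drivers)
    ]
    drivers = [
        {"driver_id": f"D{i}", "vehicle_id": vehicles[i]["vehicle_id"]}
        for i in range(n_drivers)
    ]

    rides = []
    for i in range(n_rides):
        distance_km = round(rng.uniform(0.5, 30.0), 2)
        rides.append({
            "ride_id": f"R{i}",
            "user_id": rng.choice(users)["user_id"],
            "driver_id": rng.choice(drivers)["driver_id"],
            "distance_km": distance_km,
            # Assumed fare rule: flat base charge plus a per-km rate.
            "fare_amount": round(2.50 + 1.20 * distance_km, 2),
        })
    return users, drivers, vehicles, rides


# 100 : 3 : 5,000 keeps the 100,000 : 3,000 : 5,000,000 ratio at 1/1000 scale.
users, drivers, vehicles, rides = generate(n_users=100, n_drivers=3, n_rides=5000)
print(len(users), len(drivers), len(vehicles), len(rides))
```

Sampling `user_id` and `driver_id` from the already generated tables is what keeps the foreign keys valid; the real dataset may distribute rides across users and drivers differently.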
The dataset is suited to several kinds of analysis, such as:
- User behavior analysis
- Driver performance analysis
- Ride demand modeling
- Fare prediction
- Sentiment analysis on ride ratings
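As a starting point for fare prediction, one simple baseline is an ordinary least-squares line relating fare to distance. The sketch below implements that fit with plain Python. The sample values are made up for illustration, and whether fares in rides.csv actually depend linearly on distance is an assumption to verify against the data.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations of x, and of co-deviations of x and y.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Illustrative values only; in practice these would be the distance and
# fare columns read from rides.csv and converted to float.
distances_km = [1.0, 2.0, 4.0, 8.0]
fares = [4.5, 6.0, 9.0, 15.0]

slope, intercept = fit_line(distances_km, fares)
predicted_fare = slope * 5.0 + intercept  # estimate for a 5 km ride
print(slope, intercept, predicted_fare)
```

A baseline like this gives a reference error level against which richer models (for example, ones using time of day or location) can be compared.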
Usage
This dataset can be used for machine learning, statistical analysis, or as a test dataset for building applications related to ride-sharing platforms.
License
The dataset is synthetic and free to use for educational and research purposes.
Note: This dataset does not reflect real-world data and is generated for simulation purposes.