8 datasets found
  1. CitiBike System Data

    • kaggle.com
    Updated Jun 22, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sujan Shirol (2020). CitiBike System Data [Dataset]. https://www.kaggle.com/sujan97/citibike-system-data/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 22, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Sujan Shirol
    Description

    Context

    Citi Bike is New York City’s bike share system, and the largest in the nation. Citi Bike launched in May 2013 and has become an essential part of transportation network. They make commute fun, efficient and affordable – not to mention healthy and good for the environment.

    Where do Citi Bikers ride? When do they ride? How far do they go? Which stations are most popular? What days of the week are most rides taken on? Discovering the answers to these questions and more.

    Content

    Trip duration Start time Stop time Start station id Start station name Start station latitude Start station longitude End station id End station name End station latitude End station longitude Bikeid Birth year User type - (Customer = 24-hour pass or 3-day pass user; Subscriber = Annual Member) Gender - (Zero=unknown; 1=male; 2=female)

    Acknowledgements

    Courtesy - CitiBike System Data ( https://www.citibikenyc.com/system-data )

  2. MiBici Bike Sharing Dataset – Guadalajara, Mexico

    • kaggle.com
    Updated Jun 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alireza Salehii (2025). MiBici Bike Sharing Dataset – Guadalajara, Mexico [Dataset]. https://www.kaggle.com/datasets/alirezasalehii/mexico-mibici-withoutapi/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 7, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Alireza Salehii
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Area covered
    Mexico, Guadalajara
    Description

    🧭 Context

    The MiBici Bike Sharing Dataset presents a comprehensive view of Guadalajara’s public bicycle-sharing system from 2015 to 2019. MiBici is the official bike-sharing program in the Guadalajara Metropolitan Area (GMA) — one of Mexico's largest urban centers — designed to offer a sustainable, accessible, and low-emission mobility alternative. This dataset compiles trip-level data, station infrastructure details (including elevation), and corresponding daily weather and calendar annotations to support deep analysis of urban mobility behavior. 🔍 Motivation and Inspiration

    This dataset was compiled to enable researchers, students, and practitioners to:

    Explore how weather, elevation, holidays, and time affect bike usage

    Investigate user clustering, temporal ride patterns, and station growth

    Develop machine learning models for predicting demand, user behavior, or system bottlenecks

    Support urban planning decisions and infrastructure optimization

    Inspired by similar open datasets from systems like CitiBike (NYC), Capital Bikeshare (DC), and Divvy (Chicago), this project aims to bring comparable high-quality public transportation data to the context of Latin American cities. 📦 Data Sources

    This dataset integrates and enriches information from multiple publicly available sources:

    MiBici Open Data Portal: Trip and station information

    Mexican Government Open Data: Station metadata, calendar information

    Google/Open Elevation APIs: Elevation values for each station

    National Meteorological Data: Historical daily weather observations for Guadalajara

    All datasets were cleaned, standardized, and merged to ensure usability for academic, data science, and policy applications.

  3. Bike Sharing Service Trip Data

    • kaggle.com
    zip
    Updated Jul 1, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Arquimedes Quintero (2021). Bike Sharing Service Trip Data [Dataset]. https://www.kaggle.com/aruki7/bike-sharing-data
    Explore at:
    zip(208590220 bytes)Available download formats
    Dataset updated
    Jul 1, 2021
    Authors
    Arquimedes Quintero
    Description

    Context

    This data has been made available by Divvy Service in the City of Chicago under the following license: https://www.divvybikes.com/data-license-agreement

    Content

    The data structure has 15 variables and 4,459,802 observations. Each observation is a single bike trip and it includes the start and ending times as well as the initial and final station of the trip with geolocation coordinates. There are variables to identify the type of bike docking station and the type of user (field member_casual) which identifies the type of user who took the trip.

    Acknowledgements

    This data is part of the capstone project for Google Data Analytics Certificate (coursera) and it was provided by the Divvy Service of the city of Chicago. The source unclean data can be accessed through this site: https://divvy-tripdata.s3.amazonaws.com/index.html

    Inspiration

  4. divvy-tripdata-cleaned

    • kaggle.com
    Updated Jun 15, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    A. N. M. JUBAER (2022). divvy-tripdata-cleaned [Dataset]. http://doi.org/10.34740/kaggle/dsv/3809382
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 15, 2022
    Dataset provided by
    Kaggle
    Authors
    A. N. M. JUBAER
    License

    http://www.gnu.org/licenses/lgpl-3.0.htmlhttp://www.gnu.org/licenses/lgpl-3.0.html

    Description

    Lyft Bikes and Scooters, LLC (“Bikeshare”) operates the City of Chicago’s (“City”) Divvy bicycle sharing service. Bikeshare and the City are committed to supporting bicycling as an alternative transportation option. As part of that commitment, the City permits Bikeshare to make certain Divvy system data owned by the City (“Data”) available to the public

  5. Bike-Shairing-Assignment

    • kaggle.com
    Updated Jul 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ashutosh Srivastava (2025). Bike-Shairing-Assignment [Dataset]. https://www.kaggle.com/datasets/ashusri4/bikeshairingassignment
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 3, 2025
    Dataset provided by
    Kaggle
    Authors
    Ashutosh Srivastava
    Description

    Bike-Sharing-Assignment

    Problem Statement: A bike-sharing system is a service in which bikes are made available for shared use to individuals on a short term basis for a price or free. Many bike share systems allow people to borrow a bike from a "dock" which is usually computer-controlled wherein the user enters the payment information, and the system unlocks it. This bike can then be returned to another dock belonging to the same system.

    A US bike-sharing provider BoomBikes has recently suffered considerable dips in their revenues due to the ongoing Corona pandemic. The company is finding it very difficult to sustain in the current market scenario. So, it has decided to come up with a mindful business plan to be able to accelerate its revenue as soon as the ongoing lockdown comes to an end, and the economy restores to a healthy state.

    In such an attempt, BoomBikes aspires to understand the demand for shared bikes among the people after this ongoing quarantine situation ends across the nation due to Covid-19. They have planned this to prepare themselves to cater to the people's needs once the situation gets better all around and stand out from other service providers and make huge profits.

    They have contracted a consulting company to understand the factors on which the demand for these shared bikes depends. Specifically, they want to understand the factors affecting the demand for these shared bikes in the American market. The company wants to know:

    Which variables are significant in predicting the demand for shared bikes. How well those variables describe the bike demands Based on various meteorological surveys and people's styles, the service provider firm has gathered a large dataset on daily bike demands across the American market based on some factors.

    Business Goal: You are required to model the demand for shared bikes with the available independent variables. It will be used by the management to understand how exactly the demands vary with different features. They can accordingly manipulate the business strategy to meet the demand levels and meet the customer's expectations. Further, the model will be a good way for management to understand the demand dynamics of a new market.

  6. Divvy's Bike Share Data for Google Data Analytics

    • kaggle.com
    Updated Mar 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Akash Sarkar (2024). Divvy's Bike Share Data for Google Data Analytics [Dataset]. https://www.kaggle.com/datasets/akashsarkar15/divvys-bike-share-data-for-google-data-analytics/suggestions
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 18, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Akash Sarkar
    Description

    Lyft Bikes and Scooters, LLC (“Bikeshare”) operates the City of Chicago’s (“City”) Divvy bicycle sharing service. Bikeshare and the City are committed to supporting bicycling as an alternative transportation option. As part of that commitment, the City permits Bikeshare to make certain Divvy system data owned by the City (“Data”) available to the public, subject to the terms and conditions of this License Agreement (“Agreement”). By accessing or using any of the Data, you agree to all of the terms and conditions of this Agreement.

  7. Cyclistic_2023

    • kaggle.com
    Updated May 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Viktoriia (2024). Cyclistic_2023 [Dataset]. https://www.kaggle.com/datasets/vikitorim/cyclistic-2023
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 26, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Viktoriia
    Description

    Data License Agreement

    Lyft Bikes and Scooters, LLC (“Bikeshare”) operates the City of Chicago’s (“City”) Divvy bicycle sharing service. Bikeshare and the City are committed to supporting bicycling as an alternative transportation option. As part of that commitment, the City permits Bikeshare to make certain Divvy system data owned by the City (“Data”) available to the public, subject to the terms and conditions of this License Agreement (“Agreement”). By accessing or using any of the Data, you agree to all of the terms and conditions of this Agreement.

    License. Bikeshare hereby grants to you a non-exclusive, royalty-free, limited, perpetual license to access, reproduce, analyze, copy, modify, distribute in your product or service and use the Data for any lawful purpose (“License”). Prohibited Conduct. The License does not authorize you to do, and you will not do or assist others in doing, any of the following

    Use the Data in any unlawful manner or for any unlawful purpose;
    Host, stream, publish, distribute, sublicense, or sell the Data as a stand-alone dataset; provided, however, you may include the Data as source material, as applicable, in analyses, reports, or studies published or distributed for non-commercial purposes;
    Access the Data by means other than the interface Bikeshare provides or authorizes for that purpose;
    Circumvent any access restrictions relating to the Data;
    Use data mining or other extraction methods in connection with Bikeshare's website or the Data;
    Attempt to correlate the Data with names, addresses, or other information of customers or Members of Bikeshare; and
    State or imply that you are affiliated, approved, endorsed, or sponsored by Bikeshare.
    Use or authorize others to use, without the written permission of the applicable owners, the trademarks or trade names of Lyft Bikes and Scooters, LLC, the City of Chicago or any sponsor of the Divvy service. These marks include, but are not limited to DIVVY, and the DIVVY logo, which are owned by the City of Chicago.
    

    No Warranty. THE DATA IS PROVIDED “AS IS,” AS AVAILABLE (AT BIKESHARE’S SOLE DISCRETION) AND AT YOUR SOLE RISK. TO THE MAXIMUM EXTENT PROVIDED BY LAW BIKESHARE DISCLAIMS ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING THE IMPLIED WARRANTIES OF MERCHANTABILITY FITNESS FOR A PARTICULAR PURPOSE, AND NON-INFRINGEMENT. BIKESHARE FURTHER DISCLAIMS ANY WARRANTY THAT THE DATA WILL MEET YOUR NEEDS OR WILL BE OR CONTINUE TO BE AVAILABLE, COMPLETE, ACCURATE, TIMELY, SECURE, OR ERROR FREE.

    Limitation of Liability and Covenant Not to Sue. Bikeshare, its parent, affiliates and sponsors, and their respective directors, officers, employees, or agents will not be liable to you or anyone else for any loss or damage, including any direct, indirect, incidental, and consequential damages, whether foreseeable or not, based on any theory of liability, resulting in whole or in part from your access to or use of the Data. You will not bring any claim for damages against any of those persons or entities in any court or otherwise arising out of or relating to this Agreement, the Data, or your use of the Data. In any event, if you were to bring and prevail on such a claim, your maximum recovery is limited to $100 in the aggregate even if you or they had been advised of the possibility of liability exceeding that amount. Ownership and Provision of Data. The City of Chicago owns all right, title, and interest in the Data. Bikeshare may modify or cease providing any or all of the Data at any time, without notice, in its sole discretion. No Waiver. Nothing in this Agreement is or implies a waiver of any rights Bikeshare or the City of Chicago has in the Data or in any copyrights, patents, or trademarks owned or licensed by Bikeshare, its parent, affiliates or sponsors. The DIVVY trademarks are owned by the City of Chicago. Termination of Agreement. Bikeshare may terminate this Agreement at any time and for any reason in its sole discretion. Termination will be effective upon Bikeshare’s transmission of written notice to you at the email address you provided to Bikeshare in connection with this or by Bikeshare's announcement on its website (currently www.divvybikes.com/data that it is revoking all licenses. Sections 2–6 and 9–10 will survive termination. Contact. Questions relating to this Agreement, including requests for permission to use trademarks and trade names, should be sent to bike-data@lyft.com. Applicable Law and Forum. This Agreement is governed by the laws of the State of Illinois, without regard to conflicts of law principles. Any dispute arising under or relating to this Agreement will be brought only in a court of competent jurisdiction sitting in New York City, New York. Entire Agreement. This Agreement is the complete and exclusive agreement and understanding between Bikeshare and you with respect to its su...

  8. Google Capstone Project Portfolio

    • kaggle.com
    Updated Sep 8, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Austin Roush (2021). Google Capstone Project Portfolio [Dataset]. https://www.kaggle.com/datasets/austinroush/google-capstone-project-cyclistic-case-study/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 8, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Austin Roush
    Description

    Ask

    Business Task:

    Analyze Cyclistic historical bike trip data to identify trends that explain how annual members and casual riders differ. Transform data into actionable insights and create compelling data visualizations that explain why casual riders should purchase an annual membership. Design a new marketing strategy to convert causal riders into annual members. Use digital media to create effective marketing targeted towards casual riders realizing why it would be beneficial to become an annual member.

    Key stakeholders to be considered are Cyclistic customers, Lily Moreno, the Cyclistic marketing analytics team, as well as the Cyclistic executive team. Cyclistic customers include casual riders and members, some with disabilities that use assistive options. Only 30% of riders use Cyclistic to commute to work, while most riders use the bike-share service for leisure. Lily Moreno is the director of marketing. The marketing analytics team helps guide the marketing strategy. The executive team decides whether to approve the recommended marketing program.

    Prepare

    A description of all data sources used:

    Cyclistic bike-share historical trip data is public. It is located on the Divvy website. The .CSV files are sorted by year and month, dating back to 2013. The data is not in real-time, but it is current because it is published every month. Each file has comprehensive data on individual rider ID’s, bike type, time & date of trip, station location information, and whether each rider is a casual rider or a member.

    The Divvy website includes the following system data:

    Each trip is anonymized and includes: • Trip start day and time • Trip end day and time • Trip start station • Trip end station • Rider type (Member, Single Ride, and Day Pass) The data has been filtered to remove trips that are taken by staff as they service and inspect the system; and any trips that were below 60 seconds in length (potentially false starts or users trying to re-dock a bike to ensure it was secure).

    The Data License Agreement explains that Motivate International Inc. (“Motivate”) operates the City of Chicago’s (“City”) Divvy bike-share service. The City of Chicago is the owner of all Divvy data and makes it accessible to the public. Lyft is the operator of Divvy in Chicago. Lyft has a privacy policy that explains their commitment to respecting our personal information.

    The Divvy Data License Agreement explains the following:

    • License. Motivate hereby grants to you a non-exclusive, royalty-free, limited, perpetual license to access, reproduce, analyze, copy, modify, distribute in your product or service and use the Data for any lawful purpose (“License”).

    • No Warranty. THE DATA IS PROVIDED “AS IS,” AS AVAILABLE (AT MOTIVATE’S SOLE DISCRETION) AND AT YOUR SOLE RISK. TO THE MAXIMUM EXTENT PROVIDED BY LAW MOTIVATE DISCLAIMS ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING THE IMPLIED WARRANTIES OF MERCHANTABILITY FITNESS FOR A PARTICULAR PURPOSE, AND NON-INFRINGEMENT. MOTIVATE FURTHER DISCLAIMS ANY WARRANTY THAT THE DATA WILL MEET YOUR NEEDS OR WILL BE OR CONTINUE TO BE AVAILABLE, COMPLETE, ACCURATE, TIMELY, SECURE, OR ERROR FREE.

    In contrast to all Divvy system data being reliable, the “No Warranty” terms and conditions make it so that there is no guarantee if the data will be “AVAILABLE, COMPLETE, ACCURATE, TIMELY, SECURE, OR ERROR FREE.” The credibility of the data could potentially be negatively affected if they are not held responsible.

    Sampling bias could take place because Chicago is significantly affected by weather. There is also an influx of tourists at certain times of the year. Weather and tourism’s effect on data can be accounted for because these influences are constant.

    Divvy bike-share consistently providing accurate data is necessary to create and follow through with an effective marketing strategy. All the data is original and owned by the City of Chicago making it a credible source. Lyft is also a credible source because they have the technology to accurately collect data. Although the Data License Agreement states that it has “No Warranty,” the source of the data and the way it is managed makes it credible. Divvy bike-share data is cited using the following:

    • Divvy (https://www.divvybikes.com) • Divvy Historical Data (https://divvy-tripdata.s3.amazonaws.com/index.html) • Divvy System Data (https://www.divvybikes.com/system-data) • Divvy Data License Agreement (https://www.divvybikes.com/data-license-agreement) • Lyft’s Privacy Policy (https://www.lyft.com/privacy)

    The sources of the data confirm data credibility. The data is detailed and thorough making it effective and efficient for marketing purposes.

    Process

    Documentation of any cleaning or manipulation of data:

    1. Format Cells --> Alignment --> Shrink to Fit top row

    2. Data --> Remove Duplicates

    3. Create and calculate n...

  9. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Sujan Shirol (2020). CitiBike System Data [Dataset]. https://www.kaggle.com/sujan97/citibike-system-data/discussion
Organization logo

CitiBike System Data

Bike rental dataset from CitiBank

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 22, 2020
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Sujan Shirol
Description

Context

Citi Bike is New York City’s bike share system, and the largest in the nation. Citi Bike launched in May 2013 and has become an essential part of transportation network. They make commute fun, efficient and affordable – not to mention healthy and good for the environment.

Where do Citi Bikers ride? When do they ride? How far do they go? Which stations are most popular? What days of the week are most rides taken on? Discovering the answers to these questions and more.

Content

Trip duration Start time Stop time Start station id Start station name Start station latitude Start station longitude End station id End station name End station latitude End station longitude Bikeid Birth year User type - (Customer = 24-hour pass or 3-day pass user; Subscriber = Annual Member) Gender - (Zero=unknown; 1=male; 2=female)

Acknowledgements

Courtesy - CitiBike System Data ( https://www.citibikenyc.com/system-data )

Search
Clear search
Close search
Google apps
Main menu