paint-brush
Using Data to Predict Bike Share Demand In London Without Codeby@jackriewe
822 reads
822 reads

Using Data to Predict Bike Share Demand In London Without Code

by Jack RieweDecember 8th, 2019
Read on Terminal Reader
Read this story w/o Javascript
tldt arrow

Too Long; Didn't Read

Using machine learning to predict demand using no-code machine learning, we created the ideal day when the number of bike-share rentals was the highest in London. We used a dataset that collected data over a year-long period to identify what season the bike share rentals are the highest. Users are more likely to use other forms of transportation (Uber, train, bus) to avoid bad weather. The ideal day for bike share rental is 9.5°C on a non-holiday weekday with a wind speed of 20 km/h.

Companies Mentioned

Mention Thumbnail
Mention Thumbnail
featured image - Using Data to Predict Bike Share Demand In London Without Code
Jack Riewe HackerNoon profile picture

2019 is a strange time for transportation. People are riding weird scooters around and using their cellphones to order rides. New York City and London have begun taxing cars that enter city centers in order to cut down on drivers. Transportation alternatives have emerged such as Zipcar, Lime, Bird, and a variety of eBikes or bike shares have increased tremendously proving it’s time to get your rental fleet ready. 

While the UK is a long way from California, there are things we can learn from datasets across the world. For one, we can learn from the attributes related to the number of bike rentals how to predict demand. If you’re a Business Analyst, knowing the demand for your product can help avoid shortages or logistical nightmares and cut down maintenance and other costs while increasing usage. 

Let’s walk through how to forecast demand using no-code machine learning.

Understanding What Data Results in an Accurate Demand Forecast.

Before using a dataset for predicting demand, you want to identify a specific business problem. In this case of bike share rentals, we want to know what are the type of days a London citizen is most and least likely to rent a bike share. Knowing the problem leads to identifying what kind of data you need to collect. 

While there are many different datasets you can collect to identify relationships between usage and variables, the most obvious for cyclists is the weather. Users are more likely to be outside in warmer weather and more likely to use other forms of transportation (Uber, train, bus) to avoid bad weather. 

To make your models the most accurate possible, you want to collect the most data you can over the longest amount of time you can. In this case, it’s a good idea to identify patterns in seasonal usage. We used a dataset that collected data over a year-long period to identify what season the bike share rentals are the highest. 

Other attributes to get the most accurate ideal day for bike share rentals include:

  • Bike share rental numbers
  • Timestamp
  • Temperature
  • Humidity
  • Wind
  • Speed
  • Type of weather
  • Date (to indicate if it’s a holiday or weekend/weekday or season)

Some other attributes to collect related to demand could be the geographic location of the rentals, where the user ended the ride, dates of city events such as concerts, games, festivals, etc.

The dataset we used for this use case collected this metadata:

Using this historical data, let’s find some patterns.

Questions to Ask Your Data

Breaking away from the traditional ML process or SQL queries, we take a Natural Language Powered Data Science approach to ask this dataset questions in English. Just upload your dataset, ask questions, get results, repeat. 

With the business problem being “What are the type of days a London citizen is most and least likely to rent a bike share so we can provide the right amount of bikes to increase profit and decrease maintenance costs?”

Good questions to ask you data include:

  • What season has the highest bike share rental count?
  • What attributes are directly proportional to the bike share rental count?
  • What causes the bike share rental count to be the highest?
  • What causes the bike share rental count to be the lowest?

    The bike share rental count column is labeled “cnt” in this dataset.

    Identifying the Perfect Day to Rent a Bike in London

    We actually released these results a few days ago on Twitter as a part of Nocodevember, but let’s explore deeper. 

    For full transparency, here are the machine learning tech specs used:

    From asking the bike share dataset what attributes are proportional to the bike share count numbers, our platform created the ideal day when the count number is the highest:

    Humidity: 26.46%

    Temperature: 29.46°C/85°F

    Feels Like: 29.06°C/85°F

    Weather: Partly Cloudy

    Wind Speed: 28.46km/h

    Season: Summer

    Time: Weekday/Non-Holiday

    The days where the number of bike-share rentals was the lowest looks like this:

    Humidity: 22.49%

    Temperature: 1.63°C/34.934°F

    Feels Like: -4.51°C/23.882°F

    Weather: Snowfall

    Wind Speed: 3.71 km/h

    Season: Winter

    Time: Weekend/Holiday

    ‍Taking it a step further, we can predict the number of bike shares to be rented on any given day with this data.

    For example: At 5 pm on a Summer, non-holiday weekday that is 9.5°C, with a wind speed of 20 km/h, and broken clouds, the predicted count will be 1286.108911 bike share rentals.

    Here are the day-by-day prediction results:

    Additionally, here is a season-by-season comparison of the number of bike rentals:

    0 = Spring, 1 = Summer, 2 = Fall, 3 = Winter

    Lastly, here is a comparison in relation to the type of weather:

    1 = Clear ; mostly clear but have some values with haze/fog/patches of fog/ fog in vicinity, 2 = scattered clouds / few clouds, 3 = Broken clouds, 4 = Cloudy, 7 = Rain/ light Rain shower/ Light rain, 10 = rain with thunderstorm, 26 = snowfall, 94 = Freezing Fog

    What to Take Away

    Using this process, you now have an unbelievable superpower. You can predict what the bike count will be on any given day as long as you have the historical data. 

    With Predictive Analytics, you can now plan fleet deployment, promotions, and strategize resources based on weather. 

    Even though it’s a weird time in the history of transportation, let’s add some level of predictability to it. Before you deploy your fleet, follow this no-code ML process.