Introduction

Launched in May 2013, Citi Bike quickly became one of the most favorite means of transportation for people in New York. Nowadays, the platform utilizes more than ten thousand bikes and manages over fifty thousand rides per day adding millions of rows of data every month. Such amounts of data is interesting in terms of statistical analyses to determine behavioral patterns of the platform’s users and the context behind the data. Gender-based analyses are interesting in terms of determining the factors that may affect any differences in between the usage patterns of males and females. For this research, the idea was: do men cycle longer distances than women? To try to analyse the idea, the author developed the following null hypothesis: the average trip distance cycled by women in the month of October, 2016 is the same or less than that of men.

Data

To tackle the issue, the Citi Bike usage data for the month of October 2016 was parsed from the platform's website. The author chose October based on the premise that during that month the climate is mind tochoose cycling for commuting or traveling to a park.the following data was needed:
• Start station latitude.
• Start station longitude.
• End station latitude.
• End station longitude.
• User type.
• Gender.
• Travel distance1.
Out of the mentioned data, only travel distance needs to be calculated by using the given geographicalcoordinates (described in the methodology section). Note, that for the research only Subscribers of Citi Bikewere used. The figure below shows the distributions of calculated distances and observed frequences for menand women.