Finds, downloads, parses, and standardizes public bikeshare data into a standard pandas dataframe format

Last update: Dec 01, 2021

Related tags

Overview

opendata

Finds, downloads, parses, and standardizes public bikeshare data into a standard pandas dataframe format.

import asyncio
from opendata.sources.bikeshare.bay_wheels import trips as bay_wheels

trips_df, _ = asyncio.run(bay_wheels.async_load(trip_sample_rate=1000))

len(trips_df.index)
# 8731

trips_df.columns
# Index(['started_at', 'ended_at', 'start_station_id', 'end_station_id',
#        'start_station_name', 'end_station_name', 'rideable_type', 'ride_id',
#        'start_lat', 'start_lng', 'end_lat', 'end_lng', 'gender', 'user_type',
#        'bike_id', 'birth_year'],
#       dtype='object')

An example analysis can be found here: https://observablehq.com/@brady/bikeshare

Supports sampling and local file caching to improve performance.

Markets supported

import opendata.sources.bikeshare.bay_wheels
import opendata.sources.bikeshare.bixi
import opendata.sources.bikeshare.divvy
import opendata.sources.bikeshare.capital_bikeshare
import opendata.sources.bikeshare.citi_bike
import opendata.sources.bikeshare.cogo
import opendata.sources.bikeshare.niceride
import opendata.sources.bikeshare.bluebikes
import opendata.sources.bikeshare.metro_bike_share
import opendata.sources.bikeshare.indego

Bootstrap

Set up your environment

brew install chromedriver
brew install python3
python3 -m pip install pre-commit

pre-commit install --install-hooks
python3 -m venv venv
source venv/bin/activate
python3 -m pip install -r requirements.txt

Entering virtualenv

python3 -m venv venv
source venv/bin/activate
python3 -m pip install -r requirements.txt

Usage

Try the test export to CSV:

python3 test.py

Updating pip requirements

pip-compile

Pre-commit setup

pre-commit install --install-hooks

Finds, downloads, parses, and standardizes public bikeshare data into a standard pandas dataframe format

Related tags

Overview

opendata

Markets supported

Bootstrap

Entering virtualenv

Usage

Updating pip requirements

Pre-commit setup

Bikeshare markets to add

USA

World

Owner

Brady Law

BioMASS - A Python Framework for Modeling and Analysis of Signaling Systems

Validated, scalable, community developed variant calling, RNA-seq and small RNA analysis

AptaMat is a simple script which aims to measure differences between DNA or RNA secondary structures.

Python implementation of Principal Component Analysis

A variant of LinUCB bandit algorithm with local differential privacy guarantee

peptides.py is a pure-Python package to compute common descriptors for protein sequences

A set of procedures that can realize covid19 virus detection based on blood.

Supply a wrapper ``StockDataFrame`` based on the ``pandas.DataFrame`` with inline stock statistics/indicators support.

BasstatPL is a package for performing different tabulations and calculations for descriptive statistics.

Python Project on Pro Data Analysis Track

Using Python to scrape some basic player information from www.premierleague.com and then use Pandas to analyse said data.

Python reader for Linked Data in HDF5 files

Udacity - Data Analyst Nanodegree - Project 4 - Wrangle and Analyze Data

[CVPR2022] This repository contains code for the paper "Nested Collaborative Learning for Long-Tailed Visual Recognition", published at CVPR 2022

PyTorch implementation for NCL (Neighborhood-enrighed Contrastive Learning)

Hidden Markov Models in Python, with scikit-learn like API

Conduits - A Declarative Pipelining Tool For Pandas

A meta plugin for processing timelapse data timepoint by timepoint in napari

Scraping and analysis of leetcode-compensations page.

Business Intelligence (BI) in Python, OLAP