Place for sharing quick reports and works in progress
This repository is for quick sharing of works in progress and simple analyses. For collaborative short-term tasks, create a new folder and work off a separate branch. For longer-term projects, consider making a new repository!
- Use this link to get started in JupyterHub, set up SSH, and start committing to the repo!
If you are developing in JupyterHub, follow the JupyterHub setup docs.
Follow these steps to start contributing:

- Clone this `data-analyses` repo.
- From the repo root (`data-analyses/`), run `make install_env` (runs `uv sync` + pre-commit setup).
- In JupyterHub, select the "Pyproject Local" kernel when opening a notebook.
Note
If you run into the error `No such file or directory`, you may need to install uv by running `pip install uv`.
This repository uses uv for package management. To learn more, see the uv documentation.
Basic commands:
- `uv sync`: install missing packages, update existing ones, and remove unnecessary ones so that the environment matches the lockfile.
- `uv add <package name>`: add and install a new package in the main project.
- `uv add <package name> --dev`: add and install new packages/dependencies in the `dev` group.
- `uv add <package name> --group portfolio`: add and install new packages/dependencies in the `portfolio` group.
- `uv add <package name> --group test`: add and install new packages/dependencies used only for testing, in the `test` group.
- `uv remove <package name>`: remove and uninstall packages/dependencies from the project.
nbdime provides command-line tools for diffing and merging notebooks.
Basic commands:
- `nbdiff`: compare notebooks in a terminal-friendly way.
- `nbshow`: present a single notebook in a terminal-friendly way.
This repository uses pre-commit hooks to format code, including Black. This ensures baseline consistency in code formatting.
Pre-commit checks run before each local commit. If a check fails, the failure must be addressed before the commit can be made. Many formatting issues are fixed automatically by the pre-commit hooks themselves, so on failure, review the changes pre-commit made: it may have already fixed the problem, in which case you can simply re-add the files, re-attempt the commit, and the checks will succeed.
Installing pre-commit locally saves time dealing with formatting issues on pull requests. There is a GitHub Action that runs pre-commit on all files, not just changed ones, as part of our continuous integration.
- Analytics welcome: https://docs.calitp.org/data-infra/analytics_welcome/overview.html
- Analytics tools: https://docs.calitp.org/data-infra/analytics_tools/overview.html
The sites folder contains the YAML files that drive sites deployed to https://analysis.calitp.org/; the existing sites can be used as examples/templates for deploying additional sites. Also, the Data Services Documentation has a specific chapter dedicated to various ways to publish data.
Jupyter Book/Sphinx do not play nicely with Markdown headers written out in display()
calls. Therefore, portfolio.py uses a custom Papermill
engine to template Markdown cells directly, following Python formatted-string
syntax. For example, your Markdown cell could contain # {district_name} and
it will be templated by the underlying engine.
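A minimal sketch of the formatted-string idea, using plain `str.format` rather than portfolio.py's actual engine (the parameter name below is hypothetical):

```python
# The Markdown cell's source is treated as a Python format string and
# filled in with the notebook's Papermill parameters.
markdown_cell = "# {district_name}"           # cell as written in the notebook
params = {"district_name": "District 4"}      # hypothetical Papermill parameter

rendered = markdown_cell.format(**params)
print(rendered)  # -> "# District 4"
```

Because the cell stays a real Markdown cell (rather than HTML emitted from `display()`), Jupyter Book/Sphinx can pick the templated header up for navigation and tables of contents.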