Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Research Request - RT Predictions / Real-Time Information POC metrics #709

Closed
tiffanychu90 opened this issue Apr 7, 2023 · 3 comments
Closed
Assignees
Labels
gtfs-rt Work related to GTFS-Realtime research request Issues that serve as a request for research (summary and handoff)

Comments

@tiffanychu90
Copy link
Member

tiffanychu90 commented Apr 7, 2023

Complete the below when receiving a research request, and continue to add to this issue as you receive additional details and produce deliverables. Be sure to also add the appropriate project-level label to this issue (eg gtfs-rt, DLA).

Research Question

Single sentence description:
Ahead of RFPs/procurement going on, we need to figure out conceptually what performance metrics we might be interested in, and do a proof-of-concept with those calculations. Slack thread.

This goes into rt_predictions analytics-driven warehouse development.

Detailed description:

  • From "what we collect" to "what can we learn/measure that's useful", use this POC to capture our learnings. Draft metrics
  • Inform future full-scale implementation
  • We haven't yet touched TripUpdates tables too much, so let's just grab a workable amount. For this MVP, stick with 2 operators, and we can even pick 2 routes for each operator and get at the calculations across trips.
  • Test out the various aggregations.
  • Metrics are: update completeness, wait time, prediction inconsistency, and reliable prediction accuracy

How will this research be used?

Inform future tables to store in our warehouse that are more aggregated than the raw, but also provide enough granularity. We can't go reaching into TripUpdates regularly because it's way too big.

Data sources

  • Cal-ITP data sources:
  • RT TripUpdates
  • schedule stop_times

Deliverables:

  1. preprocessing - cutoff for trip updates before trip start
  2. update completeness - trip updates dataset
  3. update completeness - vehicle positions dataset
  4. expected wait time
  5. prediction inconsistency
  6. reliable prediction accuracy

Timeline of deliverables:

End of April

@tiffanychu90 tiffanychu90 added gtfs-rt Work related to GTFS-Realtime research request Issues that serve as a request for research (summary and handoff) analytics-warehouse-poc labels Apr 7, 2023
@tiffanychu90 tiffanychu90 self-assigned this Apr 7, 2023
@tiffanychu90
Copy link
Member Author

tiffanychu90 commented May 11, 2023

Related task to include in pre-processing step

Tweaks to be made: Slack thread

@tiffanychu90
Copy link
Member Author

Tweak how metric 3 (prediction inconsistency / jitter) is calculated by simplifying it to be dependent only on prediction. Slack thread

@tiffanychu90
Copy link
Member Author

Slack thread about the mart_ad_hoc tables and Laurie's Google Doc with notes about all the tables needed any time we answer stop_time_updates questions (set of 3 tables with individual prediction (stop_time_update), the final update (the arrival at the stop), and combo of scheduled stop times + scheduled trips.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
gtfs-rt Work related to GTFS-Realtime research request Issues that serve as a request for research (summary and handoff)
Projects
None yet
Development

No branches or pull requests

1 participant