You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The current Foothill Transit (ITP ID 112) feed's feed_info.txt has two rows with two sets of active dates (both have the same start date) and the same feed version identifier.
This causes fanout in the warehouse, because we have two copies of the same feed in gtfs_schedule_dim_feeds. Specifically, a test is failing on gtfs_schedule_fact_daily_trips because we have two copies of the same trip.
While this is clearly a bug in their feed, I am not entirely sure how to address this in the warehouse. I am inclined to put in a hacky condition in gtfs_schedule_dim_feeds or feed_info_clean to filter out one of these rows specifically because this should be rare and putting in logic that tries to programmatically address this kind of situation is unlikely to be robust.
AC:
Only one row in gtfs_schedule_dim_feeds for the Foothill Transit feed from 2022-05-03.
The unique combination of columns test on gtfs_schedule_fact_daily_trips should pass.
The text was updated successfully, but these errors were encountered:
The current Foothill Transit (ITP ID
112
) feed'sfeed_info.txt
has two rows with two sets of active dates (both have the same start date) and the same feed version identifier.This causes fanout in the warehouse, because we have two copies of the same feed in
gtfs_schedule_dim_feeds
. Specifically, a test is failing ongtfs_schedule_fact_daily_trips
because we have two copies of the same trip.While this is clearly a bug in their feed, I am not entirely sure how to address this in the warehouse. I am inclined to put in a hacky condition in
gtfs_schedule_dim_feeds
orfeed_info_clean
to filter out one of these rows specifically because this should be rare and putting in logic that tries to programmatically address this kind of situation is unlikely to be robust.AC:
gtfs_schedule_dim_feeds
for the Foothill Transit feed from2022-05-03
.gtfs_schedule_fact_daily_trips
should pass.The text was updated successfully, but these errors were encountered: