Skip to content

Data Product - stop-level metrics #4101

@tiffanychu90

Description

@tiffanychu90

User story / feature request

Create a data product that will visualize / map the stop-level metrics derived from GTFS RT trip updates.

Notes

Use the week's worth of data to figure out certain configurations for these mart tables

  • incrementalize some views
    • Currently, these tables are all views. It's not immediately usable in a data product because dt / hour or base64_url must be used in the partition elimination upstream. This error will come up if we keep everything as views: Cannot query over table 'cal-itp-data-infra-staging.external_gtfs_rt_v2.trip_updates' without a filter over column(s) 'base64_url', 'dt', 'hour' that can be used for partition elimination
    • The pattern to copy is done in #4172 for how incremental models work.
    • Specifically, configs with insert_overwrite, sometimes needing to get the grain defined to use unique_key='key'
    • Make sure joins are making use of partition or cluster columns in BQ
  • Take a look at how metrics look when they're averaged to stop grain.
    • Are there outliers
    • Do certain operators look funky
    • Gather learnings to inform filters needed within data product
      • Go through BQ job history and look at specifically the dbt operation that is taking place to merge, append, etc
      • Need to add docs related to staging profile within profiles.yml

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions