feat(airbnb010, airbnb011): SCD2 snapshot tasks for superhost evolution #129
Merged
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
…th inserts) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
…bnb010-scd2-snapshot
Adds an alternate solution seed for snap__hosts and updates the equality test to check against multiple answer keys (pass if any match). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
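The "pass if any match" check described in this commit can be illustrated with a minimal Python sketch. This is not the PR's actual dbt test; the function names and the sample rows are hypothetical, and the comparison assumes row order does not matter.

```python
import csv
import io


def load_rows(csv_text):
    """Parse CSV text into (header, sorted rows) so row order is ignored."""
    reader = csv.reader(io.StringIO(csv_text))
    header = tuple(next(reader))
    rows = sorted(tuple(row) for row in reader)
    return header, rows


def matches_any_key(actual_csv, answer_key_csvs):
    """Return True if the actual output equals at least one answer key."""
    actual = load_rows(actual_csv)
    return any(actual == load_rows(key) for key in answer_key_csvs)


# Hypothetical data: two acceptable column namings for the same rows.
key_raw = "ID,NAME\n1,Ann\n2,Bob\n"
key_aliased = "HOST_ID,HOST_NAME\n1,Ann\n2,Bob\n"
agent_output = "HOST_ID,HOST_NAME\n2,Bob\n1,Ann\n"

result = matches_any_key(agent_output, [key_raw, key_aliased])
```

Each answer key is compared in full (header and rows), so a result that mixes columns from two different keys would not pass.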
… snap_hosts seed

Three bugs fixed in interact.py:
1. Pass allowed_tools from plugin set config when creating agent
2. Use run-tests.sh pipeline instead of bare dbt test
3. Copy test_setup script to container

Simplify airbnb010 solution seeds:
- Remove alternate snap_hosts seed with renamed columns
- Keep only raw-column seed (ID/NAME) as single answer key
- Update _no-op.txt seed schema and regenerate test files

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
…sion compat

Add daily cache-busting for the dbt-fusion curl install in Docker builds so new fusion versions are picked up automatically. Also add dbt-fusion variants to airbnb010/011 task configs and update test SQL to use load_relation(ref()) for fusion compatibility.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Remove the current_superhost_tenure column from prompts, solution models, and seed CSVs for both airbnb010 and airbnb011. This column relied on now() being frozen, but agents reasonably use CURRENT_DATE instead, which made it an unfair test of snapshot logic. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Commenting out until dbt-labs/dbt-fusion#1447 is resolved. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
…d mark ready

- Remove HOST_ID/HOST_NAME aliases from snap__hosts solution SQL, use source column names (ID, NAME) directly
- Update dim_superhost_evolution and src_hosts solutions to alias at point of use instead
- Fix airbnb011 CSV header to match (HOST_ID→ID, HOST_NAME→NAME)
- Change status from dev to ready

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Summary
- `airbnb010` (hard, 1 variant): SCD2 snapshot for hosts using `timestamp` strategy, plus `dim_superhost_evolution` analytical model
- `airbnb011` (hard, 2 variants: base + hint): Same SCD2 goal, but `UPDATED_AT` is stuck at 1980-01-01; the agent must discover this and use `check` strategy instead. The hint variant nudges the agent about unreliable timestamps.
- `RAW_HOSTS` on each dbt invocation to simulate a live data source
- `test_setup` rebuilds snapshot history with frozen DuckDB timestamps for reproducible evaluation

Test Plan
- `ade run airbnb010 --agent sage` passes 5/5 tests
- `ade run airbnb011 --agent sage` passes 5/5 tests for both variants (10/10 total)

🤖 Generated with Claude Code
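Why a stuck `UPDATED_AT` defeats the `timestamp` strategy can be shown with a simplified, in-memory Python sketch of one SCD2 snapshot run. This is an illustration of the general idea only, not dbt's implementation or the task's solution; the `is_superhost` column, the `snapshot` helper, and the run dates are hypothetical.

```python
def snapshot(history, source_rows, strategy, now):
    """Apply one simplified SCD2 snapshot run; return how many versions were added.

    history: list of dicts (id, is_superhost, updated_at, valid_from, valid_to)
    source_rows: list of dicts (id, is_superhost, updated_at)
    """
    new_versions = 0
    for row in source_rows:
        # Find the currently open version for this key, if any.
        current = next(
            (h for h in history if h["id"] == row["id"] and h["valid_to"] is None),
            None,
        )
        if current is None:
            changed = True
        elif strategy == "timestamp":
            # Only a newer updated_at counts as a change.
            changed = row["updated_at"] > current["updated_at"]
        else:  # "check": compare the tracked column value itself
            changed = row["is_superhost"] != current["is_superhost"]
        if not changed:
            continue
        valid_from = row["updated_at"] if strategy == "timestamp" else now
        if current is not None:
            current["valid_to"] = valid_from  # close the previous version
        history.append({**row, "valid_from": valid_from, "valid_to": None})
        new_versions += 1
    return new_versions


STUCK = "1980-01-01"  # UPDATED_AT never advances, as in airbnb011
run1 = [{"id": 1, "is_superhost": "f", "updated_at": STUCK}]
run2 = [{"id": 1, "is_superhost": "t", "updated_at": STUCK}]

ts_history, check_history = [], []
snapshot(ts_history, run1, "timestamp", now="2024-01-01")
ts_new = snapshot(ts_history, run2, "timestamp", now="2024-01-02")
snapshot(check_history, run1, "check", now="2024-01-01")
check_new = snapshot(check_history, run2, "check", now="2024-01-02")
```

In this sketch the `timestamp` run misses the superhost change because `updated_at` never moves, while the `check` run records a second version and closes the first.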