Fix N+1 query pattern in task instance states and count endpoints by steveahnahn · Pull Request #60352 · apache/airflow

steveahnahn · 2026-01-09T20:57:51Z

Problem

The previous implementation fetched all task instances from the database and then filtered by map_index in Python. For DAGs with mapped tasks containing large map indices, this caused unnecessary database load and memory usage.

Solution

Push the map_index filter to the SQL query, allowing the database to handle filtering efficiently:

Move map_index filtering from Python to SQL in get_task_instance_states and get_task_instance_count endpoints
Add map_index parameter to _get_group_tasks helper function to filter at the database level

SameerMesiah97 · 2026-01-10T13:26:09Z

This looks good at first glance. But as the filter based on map index is a new addition (and likely not to be covered by existing tests), I think it would be worth adding a small test to lock in the expected behavior and guard against future regressions.

In particular, it would be a good idea to cover:

A mapped task within a task group that produces multiple task instances (i.e. multiple map indices for the same task ID).
The default behavior when map_index is not provided, ensuring all relevant task instances are returned (including unmapped tasks).

For (1), you could test both code paths by passing and omitting task_group_id. If feasible (and if doesn't make the test too bulky), you could cover all scenarios in a single parametrized test.

steveahnahn · 2026-01-14T22:02:06Z

Thanks for the review, the scenarios you mentioned are already covered in airflow-core/tests/unit/api_fastapi/execution_api/versions/head/test_task_instances.py:

test_get_count_mix_of_task_and_task_group_dynamic_task_mapping
test_get_task_states_mix_of_task_and_task_group_dynamic_task_mapping

The existing parametrized cases cover mapped tasks with multiple map indices, default behavior without map_index, and filtering with task_group_id. These tests will now exercise the new db-level filtering path.

I did end up adding one missed test combination, filtering by map_index without providing task_group_id

github-actions · 2026-03-01T00:27:07Z

This pull request has been automatically marked as stale because it has not had recent activity. It will be closed in 5 days if no further activity occurs. Thank you for your contributions.

steveahnahn · 2026-03-01T23:18:28Z

@SameerMesiah97 it has been some time but tagging for an approval, thanks!

fix inefficient fetch all and filter

6b0d6bb

steveahnahn requested review from amoghrajesh, ashb and kaxil as code owners January 9, 2026 20:57

boring-cyborg bot added the area:API Airflow's REST/HTTP API label Jan 9, 2026

add unittest case: map-index but no task-group

564d4ce

github-actions bot added the stale Stale PRs per the .github/workflows/stale.yml policy file label Mar 1, 2026

Merge branch 'main' into fix/n-plus-1-query-task-instances

6b0c8cd

github-actions bot removed the stale Stale PRs per the .github/workflows/stale.yml policy file label Mar 2, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix N+1 query pattern in task instance states and count endpoints#60352

Fix N+1 query pattern in task instance states and count endpoints#60352
steveahnahn wants to merge 3 commits intoapache:mainfrom
steveahnahn:fix/n-plus-1-query-task-instances

steveahnahn commented Jan 9, 2026

Uh oh!

SameerMesiah97 commented Jan 10, 2026

Uh oh!

steveahnahn commented Jan 14, 2026

Uh oh!

github-actions bot commented Mar 1, 2026

Uh oh!

steveahnahn commented Mar 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

steveahnahn commented Jan 9, 2026

Problem

Solution

Uh oh!

SameerMesiah97 commented Jan 10, 2026

Uh oh!

steveahnahn commented Jan 14, 2026

Uh oh!

github-actions bot commented Mar 1, 2026

Uh oh!

steveahnahn commented Mar 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants