Fix: correct unpacking order in play_one_step (truncated/info swapped) #216

ShindeShivam · 2025-09-06T06:42:13Z

Chapter: 18

Cells: Fixed Q-Value Targets, Double DQN, Dueling Double DQN

play_one_step() returns (next_state, reward, done, truncated, info), but the training loops unpacked it as (obs, reward, done, info, truncated), so the truncated and info values ended up in each other's variables.

Fixed by using the correct order:

obs, reward, done, truncated, info = play_one_step(env, obs, epsilon)
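To illustrate why the order matters, here is a minimal sketch that does not use the notebook's code. `play_one_step_stub` and `run_episode` are hypothetical stand-ins, and the sketch assumes the training loop stops on `done or truncated` and that `info` is an empty dict. Under those assumptions, the swapped unpacking binds the empty dict to `truncated`, which is falsy, so the truncation signal is silently ignored:

```python
def play_one_step_stub(step):
    """Hypothetical stand-in for play_one_step().

    Returns values in the order (next_state, reward, done, truncated, info).
    """
    next_state = [0.0, 0.0, 0.0, 0.0]
    reward = 1.0
    done = False
    truncated = step >= 3  # pretend a time limit is reached at step 3
    info = {}              # assumption: the environment returns an empty info dict
    return next_state, reward, done, truncated, info


def run_episode(swapped, max_steps=10):
    """Return how many steps run before the loop stops."""
    for step in range(max_steps):
        if swapped:
            # Buggy order: `truncated` receives the info dict, `info` receives the flag.
            obs, reward, done, info, truncated = play_one_step_stub(step)
        else:
            # Correct order, matching the return statement above.
            obs, reward, done, truncated, info = play_one_step_stub(step)
        if done or truncated:
            break
    return step + 1


print(run_episode(swapped=True))   # truncation is never detected
print(run_episode(swapped=False))  # loop stops when truncated becomes True
```

If `info` were a non-empty dict, the swapped version would instead treat it as truthy and end the episode after the first step, so the symptom depends on what the environment puts in `info`.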

ageron · 2025-10-13T22:47:25Z

Great catch @ShindeShivam, thanks a lot for the PR. 👍

ShindeShivam added 3 commits September 6, 2025 12:07

Fix: correct unpacking order in play_one_step (truncated/info swapped)

2b9ef13

Fix: correct play_one_step unpacking in chapter 18

8dc5bbd

also in exercise

4c66094

ageron merged commit 8391321 into ageron:main Oct 13, 2025
