RFC: virt: Replace generic VpHaltReasons with new CpuIo::fatal_error, add configurable policy #1911

smalis-msft · 2025-08-25T19:39:46Z

The intention behind this change is to provide better crash stacks and dumps in ICMs. By panicking at the point of error construction, rather than after the error has bubbled up to a higher layer, we are better able to capture stack context around the cause of the failure and to provide a meaningful stack to Watson (instead of just blaming the HaltRequest::Panic handler for all these different error sources). Essentially simulating 'unwrapping' these errors.

However the ability to keep a VM 'alive' for inspection after a failure is still useful, and we want to preserve it. Therefore this change makes this new behavior configurable.

…configurable policy

Copilot

Pull Request Overview

This RFC introduces a new fatal_error method to the CpuIo trait that allows configurable handling of fatal VM errors. Instead of using generic VpHaltReason variants, errors now trigger immediate panics (for better crash stacks) or debug breaks (for VM inspection), based on policy configuration.

Key changes:

Adds CpuIo::fatal_error method with configurable FatalErrorPolicy
Removes generic VpHaltReason variants (InvalidVmState, Hypervisor, EmulationFailure)
Updates all error handling sites to use the new fatal_error method

Reviewed Changes

Copilot reviewed 20 out of 20 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
vmm_core/vmm_core_defs/src/lib.rs	Removes `InvalidVmState` and `VpError` halt reason variants
vmm_core/virt/src/io.rs	Adds new `fatal_error` method to `CpuIo` trait
vmm_core/virt/src/generic.rs	Removes generic error halt reason variants
vmm_core/src/vmotherboard_adapter.rs	Implements `fatal_error` with configurable policy
vmm_core/src/partition_unit/vp_set.rs	Removes handling for removed halt reason variants
Multiple virt_* files	Updates error handling to use `dev.fatal_error()`
openhcl/underhill_core/src/worker.rs	Configures fatal error policy and removes panic handling

openhcl/virt_mshv_vtl/src/processor/mshv/arm64.rs

openhcl/underhill_core/src/worker.rs

smalis-msft added 3 commits August 25, 2025 12:54

virt: Replace generic VpHaltReasons with new CpuIo::fatal_error, add …

98aab26

…configurable policy

Make this an unwrap

2b35acf

wires

1d047a1

Copilot AI review requested due to automatic review settings August 25, 2025 19:39

smalis-msft requested a review from a team as a code owner August 25, 2025 19:39

Copilot AI reviewed Aug 25, 2025

View reviewed changes

openhcl/virt_mshv_vtl/src/processor/mshv/arm64.rs Show resolved Hide resolved

openhcl/underhill_core/src/worker.rs Show resolved Hide resolved

smalis-msft added 2 commits August 25, 2025 15:56

Fix arm

d06f584

Fix hvf

99ef7b2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

RFC: virt: Replace generic VpHaltReasons with new CpuIo::fatal_error, add configurable policy #1911

RFC: virt: Replace generic VpHaltReasons with new CpuIo::fatal_error, add configurable policy #1911

Uh oh!

smalis-msft commented Aug 25, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

RFC: virt: Replace generic VpHaltReasons with new CpuIo::fatal_error, add configurable policy #1911

Are you sure you want to change the base?

RFC: virt: Replace generic VpHaltReasons with new CpuIo::fatal_error, add configurable policy #1911

Uh oh!

Conversation

smalis-msft commented Aug 25, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!