Skip to content

[Question] question title #2238

@jvoids

Description

@jvoids

❓ Question

Hi guys.
In case of custom FeatureExtractor is it possible to teach some parts of it controlled but in terms of general Algorithm flow. I.e there's exact specific behavior is expected for the feature extractor to perform. So that it should not be trained as a black box along the input - extractor - actor flow but as a white box.

E.g. having environment that returns extra data within info result from a step() call (as it is not the observation it self)

obs, reward, terminated, info, done = env.step(action)

So that is it possible to organize feature_extractor some how to use specific data from the info as a target at back propagation phase.

Or should such logic be trained just separately and used exclusively at inference mode within feature_extractor

Thank you!

Checklist

Metadata

Metadata

Assignees

No one assigned

    Labels

    questionFurther information is requested

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions