🚀 Extended Bedrock Batch Orchestrator with Multi-Stage Pipeline Support and MultiModality #654
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
🚀 Extended Bedrock Batch Orchestrator with Multi-Stage Pipeline Support
Summary
This PR extends the Bedrock Batch Orchestrator to support multi-stage pipeline configurations, enabling batch inference workflows where multiple stages can be chained together. It also adds multimodal support for image processing and includes Amazon Nova model compatibility.
🎯 Key Features
Multi-Stage Pipeline Architecture
transform_stage.pyLambda function handles data flow between pipeline stagesvalidate_pipeline_config.pyto ensure pipeline configurations are valid before executionEnhanced Preprocessing & Prompt Matching
Improved Postprocessing
Notification System
Amazon Nova Model Support
📊 Changes by Component
Infrastructure (CDK)
transform_stage,send_notification,validate_pipeline_configLambda Functions
preprocess.py: +429 lines - Multimodal support, category-based routingpostprocess.py: +282 lines - Structured output extraction, enhanced parsingprocessor.py: +204 lines - Multi-stage job managementprompt_templates.py: +496 lines - New templates including Nova modelscustom_types.py: +76 lines - Enhanced type definitions for pipelinesConfiguration & Examples
clothing-analysis-full.json,clothing-analysis-test.jsonDocumentation
📈 Statistics
🔧 Configuration
Multi-stage pipelines are configured via JSON files in
pipeline-configs/:{ "stages": [ { "stage_name": "stage_1", "prompt_mode": "single", "model_id": "amazon.nova-lite-v1:0", "output_schema": {...} } ] }🎨 Use Cases Enabled
⚙️ Breaking Changes
None - fully backward compatible with existing single-stage configurations.
🧪 Testing
Includes sample configurations and test data for:
📝 Notes
cdk.json