
Conversation

@avik-pal (Collaborator) commented Oct 10, 2025

fixes #1733
fixes #1295

@avik-pal force-pushed the ap/stacked_batchdup branch 2 times, most recently from 0f51436 to 36da208 on October 10, 2025 at 20:02
@avik-pal (Collaborator, Author) commented

We should merge EnzymeAD/Enzyme-JAX#1466 and release a JLL with it before merging this.

@avik-pal requested a review from wsmoses on October 10, 2025 at 21:49
@avik-pal marked this pull request as ready for review on October 10, 2025 at 21:49
@avik-pal force-pushed the ap/stacked_batchdup branch 2 times, most recently from f437aae to d71b23f on October 11, 2025 at 17:19
@avik-pal (Collaborator, Author) commented Oct 11, 2025

SliceSimplify seems to be incorrect:

module {
  func.func @main(%arg0: tensor<2x3xf32> {enzymexla.memory_effects = []}) -> (tensor<2x3xf32>, tensor<2x3xf32>, tensor<2x3xf32>, tensor<2x3xf32>, tensor<2x3xf32>, tensor<2x3xf32>) attributes {enzymexla.memory_effects = []} {
    %cst = stablehlo.constant dense<[[1.000000e+00, 0.000000e+00, 0.000000e+00, 0.000000e+00, 0.000000e+00, 0.000000e+00], [0.000000e+00, 1.000000e+00, 0.000000e+00, 0.000000e+00, 0.000000e+00, 0.000000e+00], [0.000000e+00, 0.000000e+00, 1.000000e+00, 0.000000e+00, 0.000000e+00, 0.000000e+00], [0.000000e+00, 0.000000e+00, 0.000000e+00, 1.000000e+00, 0.000000e+00, 0.000000e+00], [0.000000e+00, 0.000000e+00, 0.000000e+00, 0.000000e+00, 1.000000e+00, 0.000000e+00], [0.000000e+00, 0.000000e+00, 0.000000e+00, 0.000000e+00, 0.000000e+00, 1.000000e+00]]> : tensor<6x6xf32>
    %0 = stablehlo.slice %cst [0:6, 0:1] : (tensor<6x6xf32>) -> tensor<6x1xf32>
    %1 = stablehlo.reshape %0 : (tensor<6x1xf32>) -> tensor<2x3xf32>
    %2 = stablehlo.slice %cst [0:6, 1:2] : (tensor<6x6xf32>) -> tensor<6x1xf32>
    %3 = stablehlo.reshape %2 : (tensor<6x1xf32>) -> tensor<2x3xf32>
    %4 = stablehlo.slice %cst [0:6, 2:3] : (tensor<6x6xf32>) -> tensor<6x1xf32>
    %5 = stablehlo.reshape %4 : (tensor<6x1xf32>) -> tensor<2x3xf32>
    %6 = stablehlo.slice %cst [0:6, 3:4] : (tensor<6x6xf32>) -> tensor<6x1xf32>
    %7 = stablehlo.reshape %6 : (tensor<6x1xf32>) -> tensor<2x3xf32>
    %8 = stablehlo.slice %cst [0:6, 4:5] : (tensor<6x6xf32>) -> tensor<6x1xf32>
    %9 = stablehlo.reshape %8 : (tensor<6x1xf32>) -> tensor<2x3xf32>
    %10 = stablehlo.slice %cst [0:6, 5:6] : (tensor<6x6xf32>) -> tensor<6x1xf32>
    %11 = stablehlo.reshape %10 : (tensor<6x1xf32>) -> tensor<2x3xf32>
    return %1, %3, %5, %7, %9, %11 : tensor<2x3xf32>, tensor<2x3xf32>, tensor<2x3xf32>, tensor<2x3xf32>, tensor<2x3xf32>, tensor<2x3xf32>
  }
}

After SliceSimplify (note that the second result has been folded to all zeros, and the remaining one-hot columns come back in reverse order):

module {
  func.func @main(%arg0: tensor<2x3xf32> {enzymexla.memory_effects = []}) -> (tensor<2x3xf32>, tensor<2x3xf32>, tensor<2x3xf32>, tensor<2x3xf32>, tensor<2x3xf32>, tensor<2x3xf32>) attributes {enzymexla.memory_effects = []} {
    %cst = stablehlo.constant dense<[[0.000000e+00], [0.000000e+00], [1.000000e+00], [0.000000e+00], [0.000000e+00], [0.000000e+00]]> : tensor<6x1xf32>
    %cst_0 = stablehlo.constant dense<[[0.000000e+00], [0.000000e+00], [0.000000e+00], [1.000000e+00], [0.000000e+00], [0.000000e+00]]> : tensor<6x1xf32>
    %cst_1 = stablehlo.constant dense<[[0.000000e+00], [0.000000e+00], [0.000000e+00], [0.000000e+00], [1.000000e+00], [0.000000e+00]]> : tensor<6x1xf32>
    %cst_2 = stablehlo.constant dense<[[0.000000e+00], [0.000000e+00], [0.000000e+00], [0.000000e+00], [0.000000e+00], [1.000000e+00]]> : tensor<6x1xf32>
    %cst_3 = stablehlo.constant dense<0.000000e+00> : tensor<6x1xf32>
    %cst_4 = stablehlo.constant dense<[[1.000000e+00], [0.000000e+00], [0.000000e+00], [0.000000e+00], [0.000000e+00], [0.000000e+00]]> : tensor<6x1xf32>
    %0 = stablehlo.reshape %cst_4 : (tensor<6x1xf32>) -> tensor<2x3xf32>
    %1 = stablehlo.reshape %cst_3 : (tensor<6x1xf32>) -> tensor<2x3xf32>
    %2 = stablehlo.reshape %cst_2 : (tensor<6x1xf32>) -> tensor<2x3xf32>
    %3 = stablehlo.reshape %cst_1 : (tensor<6x1xf32>) -> tensor<2x3xf32>
    %4 = stablehlo.reshape %cst_0 : (tensor<6x1xf32>) -> tensor<2x3xf32>
    %5 = stablehlo.reshape %cst : (tensor<6x1xf32>) -> tensor<2x3xf32>
    return %0, %1, %2, %3, %4, %5 : tensor<2x3xf32>, tensor<2x3xf32>, tensor<2x3xf32>, tensor<2x3xf32>, tensor<2x3xf32>, tensor<2x3xf32>
  }
}
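For reference (my reading of the expected behavior, not output from the pass; the value names below are illustrative): a correct fold would replace each stablehlo.slice of the identity with the matching one-hot column, in order. A sketch of the expected constants for the first two slices:

  // column 0 of the 6x6 identity: one-hot at row 0 (the buggy output gets this right)
  %c0 = stablehlo.constant dense<[[1.000000e+00], [0.000000e+00], [0.000000e+00], [0.000000e+00], [0.000000e+00], [0.000000e+00]]> : tensor<6x1xf32>
  // column 1: one-hot at row 1 (the buggy output folds this to all zeros)
  %c1 = stablehlo.constant dense<[[0.000000e+00], [1.000000e+00], [0.000000e+00], [0.000000e+00], [0.000000e+00], [0.000000e+00]]> : tensor<6x1xf32>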

EDIT: fixed with the latest push to Enzyme-JAX.

@avik-pal force-pushed the ap/stacked_batchdup branch from 7c12684 to bbadac4 on October 12, 2025 at 00:05
@avik-pal (Collaborator, Author) commented

This needs JuliaPackaging/Yggdrasil#12270 before merging.

@avik-pal force-pushed the ap/stacked_batchdup branch 2 times, most recently from bb431db to 49b55b9 on October 12, 2025 at 14:49
@avik-pal (Collaborator, Author) commented

This is good to go from my end.

@avik-pal force-pushed the ap/stacked_batchdup branch from 3fda458 to 10d888e on October 13, 2025 at 04:08


Development

Successfully merging this pull request may close these issues:

Incorrect results for nested AD
Alternate implementation of BatchDuplicated
