chore(weave): Realtime API, support collecting audio data by chance-wnb · Pull Request #6249 · wandb/weave

chance-wnb · 2026-03-03T02:09:22Z

Description

Adds audio capture and serialization support to the OpenAI Realtime API integration. The adapter now accumulates raw PCM audio chunks during streaming and converts them to WAV format when the audio call ends. A new serializeAudio method is exposed on the WeaveClient for manual audio serialization in call outputs.

Key changes:

Added pcmToWav helper function to convert 24kHz 16-bit mono PCM to WAV format
Modified audio event handler to accumulate PCM chunks per response ID
Updated closeAudioCall to serialize accumulated audio chunks and include them in call output
Added public serializeAudio method to WeaveClient for manual audio serialization
Added proper cleanup of audio chunks on disconnect and detach

Screenshot

Testing

Locally tested
Unit tests are expected in the upper stack PRs later

chance-wnb · 2026-03-03T02:09:38Z

Warning

This pull request is not mergeable via GitHub because a downstack PR is open. Once all requirements are satisfied, merge this PR as a stack on Graphite.
Learn more

This stack of pull requests is managed by Graphite. Learn more about stacking.

codecov · 2026-03-03T02:13:50Z

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

wandbot-3000 · 2026-03-03T02:14:30Z

Preview this PR with FeatureBee: https://beta.wandb.ai/?betaVersion=7cc9136314b89235fe834290c5f92c8cdf9822f2

neutralino1 · 2026-03-09T22:04:53Z

sdks/node/src/integrations/openai.realtime.agent.ts

+function pcmToWav(pcm: Buffer): Buffer {
+  const channels = 1;
+  const sampleRate = 24000;
+  const bitDepth = 16;
+  const wav = Buffer.alloc(44 + pcm.length);
+  wav.write('RIFF', 0);
+  wav.writeUInt32LE(36 + pcm.length, 4);
+  wav.write('WAVE', 8);
+  wav.write('fmt ', 12);
+  wav.writeUInt32LE(16, 16);
+  wav.writeUInt16LE(1, 20); // PCM
+  wav.writeUInt16LE(channels, 22);
+  wav.writeUInt32LE(sampleRate, 24);
+  wav.writeUInt32LE(sampleRate * channels * (bitDepth / 8), 28);
+  wav.writeUInt16LE(channels * (bitDepth / 8), 32);
+  wav.writeUInt16LE(bitDepth, 34);
+  wav.write('data', 36);
+  wav.writeUInt32LE(pcm.length, 40);
+  wav.set(pcm, 44); // Uint8Array.set — accepts ArrayLike<number>, no Buffer-copy type issues
+  return wav;
+}


Is this a costly operation? It seems that it is just setting the data in the right container, any memory copy?

As far as I know the javascript Buffer data structure is already the right tool for byte-wise operations. It is already much efficient than the classic js arrays.

wav.set(pcm, 44);

This is the memory copy part. The previous lines are trivial (constant time despite many).

Is this a costly operation

I think it is alright. I can't think of doing it any other ways. The format must be converted as far as I know.

PS: this is apparently AI generated code, I am not capable of writing such a thing myself. lol. I guess it is better than importing a 3rd party library.

Is this a costly operation

As a friendly reminder the audio stream conversion is done once per closeAudioCall event.

Let me know if you feel something is fishy and have improvement proposals. Thanks!

Don't we support the original format via Content? cc @zbirenbaum

PCM detection doesn't work properly (maybe that's changed with use of python magic) I had to convert to wav for my impl as well. Maybe we could solve this by manually setting the mimetype?

chore(weave): Realtime API, support collecting audio data

143eef3

chance-wnb mentioned this pull request Mar 3, 2026

chore(weave): initial set up of the openai agents realtime API support. #6247

Open

chance-wnb mentioned this pull request Mar 3, 2026

chore(weave): Enable capturing of tool calls. #6248

Open

chance-wnb marked this pull request as ready for review March 3, 2026 02:10

chance-wnb requested a review from a team as a code owner March 3, 2026 02:10

neutralino1 reviewed Mar 9, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore(weave): Realtime API, support collecting audio data#6249

chore(weave): Realtime API, support collecting audio data#6249
chance-wnb wants to merge 1 commit intochance/realtime_tool_callfrom
chance/realtime_audio_support

chance-wnb commented Mar 3, 2026 •

edited

Loading

Uh oh!

chance-wnb commented Mar 3, 2026

Uh oh!

codecov bot commented Mar 3, 2026

Uh oh!

wandbot-3000 bot commented Mar 3, 2026

Uh oh!

neutralino1 Mar 9, 2026

Uh oh!

chance-wnb Mar 9, 2026 •

edited

Loading

Uh oh!

chance-wnb Mar 9, 2026

Uh oh!

neutralino1 Mar 9, 2026

Uh oh!

zbirenbaum Mar 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

chance-wnb commented Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Screenshot

Testing

Uh oh!

chance-wnb commented Mar 3, 2026

Uh oh!

codecov bot commented Mar 3, 2026

Codecov Report

Uh oh!

wandbot-3000 bot commented Mar 3, 2026

Uh oh!

neutralino1 Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

chance-wnb Mar 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

chance-wnb Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

neutralino1 Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

zbirenbaum Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

chance-wnb commented Mar 3, 2026 •

edited

Loading

chance-wnb Mar 9, 2026 •

edited

Loading