
Conversation

@eginhard (Contributor) commented on Jun 23, 2025

What does this PR do?

Extend BatchEncoding.to() to also work for nested elements.

When using voice presets in Bark, the processor returns a BatchEncoding of the form

  • { "input_ids": torch.Tensor, "attention_mask": torch.Tensor, "history_prompt": BatchFeature }

Currently, only tensor elements are moved, so on CUDA the following code fails with RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu! (when checking argument for argument index in method wrapper_CUDA__index_select):

import scipy.io.wavfile
import torch

from transformers import AutoProcessor
from transformers import BarkModel

# load the model and move it to GPU if available
model = BarkModel.from_pretrained("suno/bark-small")
device = "cuda:0" if torch.cuda.is_available() else "cpu"
model = model.to(device)

sampling_rate = model.generation_config.sample_rate
processor = AutoProcessor.from_pretrained("suno/bark-small")
voice_preset = "v2/en_speaker_6"

# prepare the inputs
text_prompt = "Let's try generating speech, with Bark, a text-to-speech model"
inputs = processor(text_prompt, voice_preset=voice_preset)

# generate speech (fails here without this fix: the nested history_prompt stays on CPU)
speech_output = model.generate(**inputs.to(device))
scipy.io.wavfile.write("bark_out.wav", rate=sampling_rate, data=speech_output[0].cpu().numpy())

A workaround was to manually call inputs["history_prompt"].to(device). This PR fixes the issue by moving all nested elements that have a callable to(), as sketched below.
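
For illustration, here is a minimal sketch of the idea (not the exact code merged in this PR; the helper name move_to_device is hypothetical): any value that exposes a callable to(), such as a tensor or a nested BatchFeature, gets moved along with the top-level tensors, while everything else is passed through unchanged.

import torch

def move_to_device(data: dict, device) -> dict:
    """Return a copy of `data` with every value that has a callable .to() moved to `device`."""
    return {
        key: value.to(device) if callable(getattr(value, "to", None)) else value
        for key, value in data.items()
    }

# Example: the tensor moves to the target device, the plain string stays as-is.
batch = {"input_ids": torch.zeros(1, 4, dtype=torch.long), "voice_preset": "v2/en_speaker_6"}
batch = move_to_device(batch, "cpu")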

Fixes #34634

Who can review?

@Rocketknight1

@Rocketknight1 force-pushed the fix-batchencoding-to branch from bee69bb to 32137fd on June 23, 2025 at 14:29
@Rocketknight1 (Member) left a comment


Yes, LGTM! Thank you for the PR.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@kylesayrs (Contributor) left a comment


Neat

@Rocketknight1 (Member) commented

cc @itazap for tokenizers, feel free to merge it if you're happy

@ebezzam mentioned this pull request on Jul 17, 2025

@ebezzam (Contributor) commented on Jul 17, 2025

@Rocketknight1, @itazap any update on this PR?

I can confirm it would address #34634 and a Bark test mentioned in #39478 🙂

@Rocketknight1 (Member) commented

Good point - I took another look and I think this is safe, so I'm going to merge! If it breaks anything, anyone finding this PR can yell at me 😅

@Rocketknight1 merged commit 561a79a into huggingface:main on July 18, 2025
20 checks passed
@eginhard deleted the fix-batchencoding-to branch on July 18, 2025 at 13:17
zucchini-nlp pushed a commit to zucchini-nlp/transformers that referenced this pull request Jul 22, 2025
zaristei pushed a commit to zaristei/transformers that referenced this pull request Sep 9, 2025

Successfully merging this pull request may close these issues.

BarkProcessor voice_preset doesn't work
5 participants