FEAT Add JailbreakV_28k dataset from HF #1098

AdrGav941 · 2025-09-22T22:09:48Z

Description

This PR adds support for the JailbreakV_28k dataset to PyRIT.

Addresses #1007

Changes Made:

Added integration for JailbreakV_28k
Normalizes and associates the datasets "policy" column with harm-category
Allows for filtering on harm categories (policy values)

Files Added/Modified:

pyrit/datasets/fetch_jailbreakv_28k_dataset.py - Main implementation
pyrit/datasets/init.py - Added exports for new functions
tests/unit/datasets/test_fetch_jailbreakv_28k_dataset.py - Unit tests
tests\integration\datasets\test_fetch_datasets.py - Integration tests added

Tests and Documentation

PyTest parametrized testing for filtering and choice of text field (dataset has jailbreak and redteaming prompts)
Dataset mocking with both text fields and policy mapped to harm_category

romanlutz

Thanks for getting started on this!

The integration test for datasets is missing, but I suspect it will require a custom one as the dataset is meant to be multimodal (see other comment).

pyrit/datasets/fetch_jailbreakv_28k_dataset.py

…rom HF

romanlutz · 2025-09-26T18:17:17Z

pyrit/datasets/__init__.py

    "fetch_jbb_behaviors_dataset",
    "fetch_jbb_behaviors_by_harm_category",
    "fetch_jbb_behaviors_by_jbb_category",
+    "fetch_jailbreakv_28k_dataset",


mind keeping these alphabetical? I realize we missed out on that before but no better time to fix it than now 🙂

romanlutz · 2025-09-26T18:18:53Z

pyrit/datasets/fetch_jailbreakv_28k_dataset.py

+        The dataset license: mit
+        authors: Weidi Luo, Siyuan Ma, Xiaogeng Liu, Chaowei Xiao, Xiaoyu Guo


Suggested change

The dataset license: mit

authors: Weidi Luo, Siyuan Ma, Xiaogeng Liu, Chaowei Xiao, Xiaoyu Guo

The dataset license: MIT

Authors: Weidi Luo, Siyuan Ma, Xiaogeng Liu, Chaowei Xiao, Xiaoyu Guo

romanlutz · 2025-09-26T18:29:09Z

pyrit/datasets/fetch_jailbreakv_28k_dataset.py

+                if image_abs_path:
+                    group_id = uuid.uuid4()
+                    text_seed_prompt = SeedPrompt(
+                        value=item.get(text_field, ""),
+                        harm_categories=[policy],
+                        prompt_group_id=group_id,
+                        data_type="text",
+                        **common_metadata,  # type: ignore[arg-type]
+                    )
+                    image_seed_prompt = SeedPrompt(
+                        value=image_abs_path,
+                        harm_categories=[policy],
+                        prompt_group_id=group_id,
+                        data_type="image_path",
+                        **common_metadata,  # type: ignore[arg-type]
+                    )
+                    seed_prompts.append(text_seed_prompt)
+                    seed_prompts.append(image_seed_prompt)
+                else:
+                    missing_images += 1


Suggested change

if image_abs_path:

group_id = uuid.uuid4()

text_seed_prompt = SeedPrompt(

value=item.get(text_field, ""),

harm_categories=[policy],

prompt_group_id=group_id,

data_type="text",

**common_metadata, # type: ignore[arg-type]

)

image_seed_prompt = SeedPrompt(

value=image_abs_path,

harm_categories=[policy],

prompt_group_id=group_id,

data_type="image_path",

**common_metadata, # type: ignore[arg-type]

)

seed_prompts.append(text_seed_prompt)

seed_prompts.append(image_seed_prompt)

else:

missing_images += 1

if not image_abs_path:

missing_images += 1

continue

group_id = uuid.uuid4()

text_seed_prompt = SeedPrompt(

value=item.get(text_field, ""),

harm_categories=[policy],

prompt_group_id=group_id,

data_type="text",

**common_metadata, # type: ignore[arg-type]

)

image_seed_prompt = SeedPrompt(

value=image_abs_path,

harm_categories=[policy],

prompt_group_id=group_id,

data_type="image_path",

**common_metadata, # type: ignore[arg-type]

)

seed_prompts.append(text_seed_prompt)

seed_prompts.append(image_seed_prompt)

i.e., move the second case to first and remove indentation

romanlutz · 2025-09-26T18:30:17Z

pyrit/datasets/fetch_jailbreakv_28k_dataset.py

+    if not seed_prompts:
+        raise ValueError(
+            "JailBreakV-28K fetch produced 0 prompts. "
+            "Likely caused by all items returned after filtering having invalid image paths."
+        )


how many are currently missing? Should we have a cutoff (>0) at which point it should error out?

Currently, the vast majority are missing, making this dataset not as useful as previously expected. I have started a discussion on HF about adding the full images folder which currently is only contained a zip file held in a separate cloud drive.

romanlutz · 2025-09-26T19:58:25Z

tests/integration/datasets/test_fetch_datasets.py

+        assert sum(p.data_type == "text" for p in jailbreakv_28k.prompts) == len(jailbreakv_28k.prompts) / 2
+        assert sum(p.data_type == "image_path" for p in jailbreakv_28k.prompts) == len(jailbreakv_28k.prompts) / 2
+    except Exception as e:
+        pytest.skip(f"Integration test skipped due to: {e}")


skip? why not fail?

Good point, I am not sure why I was simply going off of what I thought was the convention based on previously merged custom integration tests (jbb dataset integration)

Adrian Gavrila added 4 commits September 22, 2025 17:00

Adding JailBreakV_28k dataset and tests

1ab7c61

Fixes for handling splits

5d297d3

pre-commit hooks

1afab68

removing unused params

2b7b81f

romanlutz reviewed Sep 23, 2025

View reviewed changes

pyrit/datasets/fetch_jailbreakv_28k_dataset.py Show resolved Hide resolved

pyrit/datasets/fetch_jailbreakv_28k_dataset.py Outdated Show resolved Hide resolved

pyrit/datasets/fetch_jailbreakv_28k_dataset.py Outdated Show resolved Hide resolved

hannahwestra25 reviewed Sep 23, 2025

View reviewed changes

pyrit/datasets/fetch_jailbreakv_28k_dataset.py Outdated Show resolved Hide resolved

Adrian Gavrila added 2 commits September 25, 2025 08:53

Adding image support handling for invalid image_path entries coming f…

8ec3baa

…rom HF

Integration tests, ValueError for empty seed_prompts, comment cleanup

115c1c2

romanlutz reviewed Sep 26, 2025

View reviewed changes

romanlutz self-assigned this Sep 28, 2025

romanlutz mentioned this pull request Oct 7, 2025

FEAT: add support for multimodal data from HarmBench #1110

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

FEAT Add JailbreakV_28k dataset from HF #1098

FEAT Add JailbreakV_28k dataset from HF #1098

AdrGav941 commented Sep 22, 2025 •

edited

Loading

Uh oh!

romanlutz left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

romanlutz Sep 26, 2025

Uh oh!

romanlutz Sep 26, 2025

Uh oh!

romanlutz Sep 26, 2025

Uh oh!

romanlutz Sep 26, 2025

Uh oh!

romanlutz Sep 26, 2025

Uh oh!

AdrGav941 Sep 29, 2025

Uh oh!

romanlutz Sep 26, 2025

Uh oh!

AdrGav941 Sep 29, 2025

Uh oh!

Uh oh!

		The dataset license: mit
		authors: Weidi Luo, Siyuan Ma, Xiaogeng Liu, Chaowei Xiao, Xiaoyu Guo

FEAT Add JailbreakV_28k dataset from HF #1098

Are you sure you want to change the base?

FEAT Add JailbreakV_28k dataset from HF #1098

Conversation

AdrGav941 commented Sep 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Tests and Documentation

Uh oh!

romanlutz left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

AdrGav941 commented Sep 22, 2025 •

edited

Loading