Background and Issue
Although you mentioned in your blog post that GPU usage reduces latency, I did not observe any GPU utilization when using this library out of the box. After examining the code, I couldn't find any logic that moves the model to the GPU.
I followed the example provided in the README section of the repository. Could you please confirm if there are additional steps required to enable GPU usage and achieve lower latency?
That said, after making the following changes to your code, I am now able to see GPU utilization.
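For reference, a minimal sketch of that kind of change, assuming the library loads its GLiNER model as a standard PyTorch module that supports .to(device); the checkpoint name below is only a placeholder, not necessarily the one this validator uses:

```python
import torch
from gliner import GLiNER

# Placeholder checkpoint; substitute whichever GLiNER model the validator loads.
model = GLiNER.from_pretrained("urchade/gliner_multi_pii-v1")

if torch.cuda.is_available():
    # Moving the model to the GPU is enough for inference to show GPU utilization.
    model = model.to("cuda")
```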
Proposed Solution
Add an optional use_gpu parameter to the GuardrailsPII validator that enables GPU acceleration when available. The implementation should (see the sketch after this list):
- Add GPU device management: Automatically detect CUDA availability and move the GLiNER model to GPU when requested
- Maintain backward compatibility: Default to CPU inference to preserve existing behavior
- Graceful fallback: Fall back to CPU if GPU is requested but not available
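A minimal sketch of how this could look, assuming the validator obtains its GLiNER model via GLiNER.from_pretrained and that the model supports PyTorch's .to(device); the function and parameter names are illustrative, not the validator's actual API:

```python
import torch
from gliner import GLiNER


def load_pii_model(model_name: str, use_gpu: bool = False) -> GLiNER:
    """Load the GLiNER model, optionally moving it to the GPU."""
    model = GLiNER.from_pretrained(model_name)

    if use_gpu:
        if torch.cuda.is_available():
            # GPU requested and available: move the model for faster inference.
            model = model.to("cuda")
        else:
            # Graceful fallback: keep CPU inference if CUDA is not available.
            print("use_gpu=True but CUDA is unavailable; falling back to CPU.")
    # Default (use_gpu=False) preserves the existing CPU-only behavior.
    return model
```

Because the parameter defaults to False, existing callers of GuardrailsPII would see no change in behavior.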