Describe the bug
Currently, Triton does not have a dtype for torch.float8_e4m3fnuz:

triton/python/triton/language/core.py, line 380 at aff4b7a:
FP_TYPES = ['fp8e4b15', 'fp8e4nv', 'fp8e4b8', 'fp8e5', 'fp8e5b16', 'fp16', 'bf16', 'fp32', 'fp64']
The datatype is supported by the backend, and a cast through a pointer's element type, e.g. some_tensor.to(target_pointer.dtype.element_ty), works. However, an explicit cast such as some_tensor.to(tl.fp8e4fnuz) is not possible.
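
For illustration, a minimal sketch of the two cases (kernel name, shapes, and launch parameters are hypothetical, and fp8 tensor support on the device side assumes a ROCm/MI300-class setup as in this report; only the pointer-derived cast is expected to compile today):

```python
import torch
import triton
import triton.language as tl


@triton.jit
def cast_copy(src_ptr, dst_ptr, n_elements, BLOCK: tl.constexpr):
    offs = tl.program_id(0) * BLOCK + tl.arange(0, BLOCK)
    mask = offs < n_elements
    x = tl.load(src_ptr + offs, mask=mask)

    # Works: the fp8 element type is taken from the destination pointer,
    # so it never has to be named at the Triton language level.
    y = x.to(dst_ptr.dtype.element_ty)

    # Does not work: there is no tl-level dtype for torch.float8_e4m3fnuz,
    # so an explicit cast cannot be spelled out:
    # y = x.to(tl.fp8e4fnuz)  # hypothetical dtype name requested in this issue

    tl.store(dst_ptr + offs, y, mask=mask)


# Hypothetical launch: dst carries the fnuz dtype, so the kernel can borrow it.
src = torch.randn(1024, device="cuda", dtype=torch.float16)
dst = torch.empty(1024, device="cuda", dtype=torch.float8_e4m3fnuz)
cast_copy[(triton.cdiv(1024, 256),)](src, dst, 1024, BLOCK=256)
```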
This is a problem in situations where the kernel cannot access the correct datatype via a PyTorch pointer (one example: vllm-project/vllm#24503 (comment)).
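
One such situation, sketched under hypothetical names: a kernel whose arguments are all fp16, but which wants an fp8 round-trip on an intermediate value, so there is no pointer whose .dtype.element_ty could supply the type (the blocked line is commented out because it cannot be written today):

```python
import triton
import triton.language as tl


@triton.jit
def fake_quant(x_ptr, out_ptr, n_elements, BLOCK: tl.constexpr):
    offs = tl.program_id(0) * BLOCK + tl.arange(0, BLOCK)
    mask = offs < n_elements
    x = tl.load(x_ptr + offs, mask=mask)

    # Desired: emulate fp8 e4m3fnuz quantization error on an intermediate
    # value.  Neither x_ptr nor out_ptr has that element type, so the
    # pointer-based workaround is unavailable and an explicit dtype is needed:
    # x = x.to(tl.fp8e4fnuz).to(tl.float16)  # not expressible without the new dtype

    tl.store(out_ptr + offs, x, mask=mask)
```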
I know this might be hard to keep compatible across platforms, but it may still be worth considering.
Environment details
Triton 3.4.0, MI300