[CuTeDSL] Lower scalar Float16/BFloat16 load through Uint16+bitcast by cheshire · Pull Request #3267 · NVIDIA/cutlass

cheshire · 2026-05-24T17:16:25Z

nvvm.load.ext rejects both bf16 and f16 result types at MLIR verification with "Unsupported FP type for ExtLoadOp", even though the underlying PTX op is just ld.b16. In cute.arch.load, route a scalar Float16/BFloat16 request through a Uint16 load + llvm.bitcast back to the requested FP type. Transparent to callers.

The same workaround handles Float16. Vector loads of f16/bf16 are not touched (they go through ir.VectorType and were not verified to hit the same issue).

Added test/python/CuTeDSL/test_arch_load.py exercising both the worked-around 16-bit FP path and the dtypes that nvvm.load.ext accepts directly (Float32 / Uint16 / Uint32 / Int32) as a regression check.

Fixes NVIDIA#3266 `nvvm.load.ext` rejects both `bf16` and `f16` result types at MLIR verification with "Unsupported FP type for ExtLoadOp", even though the underlying PTX op is just `ld.b16`. In `cute.arch.load`, route a scalar Float16/BFloat16 request through a `Uint16` load + `llvm.bitcast` back to the requested FP type. Transparent to callers. The same workaround handles Float16 — found while writing the regression test — so the patch covers both. Vector loads of f16/bf16 are not touched (they go through `ir.VectorType` and were not verified to hit the same issue). Added test/python/CuTeDSL/test_arch_load.py exercising both the worked-around 16-bit FP path and the dtypes that `nvvm.load.ext` accepts directly (Float32 / Uint16 / Uint32 / Int32) as a regression check.

cheshire · 2026-05-31T04:21:43Z

@grypp WDYT?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CuTeDSL] Lower scalar Float16/BFloat16 load through Uint16+bitcast#3267

[CuTeDSL] Lower scalar Float16/BFloat16 load through Uint16+bitcast#3267
cheshire wants to merge 1 commit into
NVIDIA:mainfrom
cheshire:fix/3266-bf16-load

cheshire commented May 24, 2026

Uh oh!

cheshire commented May 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

cheshire commented May 24, 2026

Uh oh!

cheshire commented May 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant