Fix for #3607 by ravi688 · Pull Request #3631 · KhronosGroup/glslang

ravi688 · 2024-06-23T13:51:10Z

Convert 8/16-bit int (and their composite vector) types to their corresponding 32-bit types first and then convert the resulting 32-bit type to the target 8/16-bit type.

This change emits appropriate Op{S|U}Convert instructions instead of OpCompositeExtract followed by OpCompositeConstruct for 8/16-bit integer types.

this fixes Incorrect SPIRV codegen for 8bit/16bit variables in buffers #3607
and this also fixes assertion failure in the PR Generate vector constructions more efficiently when sizes match #3628

What types of shaders are affected?

The following GLSL shader:

#version 460


#extension GL_EXT_shader_8bit_storage : require
#extension GL_EXT_shader_16bit_storage : require
#extension GL_EXT_shader_explicit_arithmetic_types_float16 : require


layout(binding = 1 ) uniform _16bit_storage
{
        i16vec4 i16v4;
};

// This is read back and checked on the CPU side to verify the converions
layout(binding = 2 ) writeonly buffer ConversionOutBuffer
{
        i8vec4 i16v4_to_i8v4;
} cob;

out vec4 fcolor;

void main()
{
        // Conversions
        {
                cob.i16v4_to_i8v4   = i8vec4(i16v4);
        }

        bool RED = true;
        bool GREEN = false;

        fcolor = vec4( (RED) ? 1.0f : 0.0f,
                                   (GREEN) ? 1.0f : 0.0f,
                                   0.0f, 1.0f);
}

Now compiles to the SPIR-V (i.e. with this patch applied):

...
    %19 = OpAccessChain %_ptr_Uniform_v4short %_ %int_0
    %20 = OpLoad %v4short %19
    %22 = OpSConvert %v4int %20
    %23 = OpSConvert %v4char %22
    %25 = OpAccessChain %_ptr_Uniform_v4char %cob %int_0
    OpStore %25 %23
...

Earlier it used to be compiled to the following SPIR-V instructions (i.e. without patch applied):

...
    %19 = OpAccessChain %_ptr_Uniform_v4short %_ %int_0
    %20 = OpLoad %v4short %19
    %22 = OpCompositeExtract %int %20 0 <-- incorrect instruction
    %23 = OpCompositeExtract %int %20 1 <-- incorrect instruction
    %24 = OpCompositeExtract %int %20 2 <-- incorrect instruction
    %25 = OpCompositeExtract %int %20 3 <-- incorrect instruction
    %26 = OpCompositeConstruct %v4int %22 %23 %24 %25
    %27 = OpSConvert %v4char %26
    %29 = OpAccessChain %_ptr_Uniform_v4char %cob %int_0
    OpStore %29 %27
...

…esponding 32-bit types first - this fixes KhronosGroup#3607 - and this also fixes assertion failure in the PR KhronosGroup#3628 - this change emits appropriate Op{S|U}Convert instructions instead of OpCompositeExtract followed by OpCompositeConstruct for 8/16-bit integer types.

ravi688 · 2024-06-23T14:05:06Z

@arcady-lunarg , could you please add the unit tests on your side again? I've ran out of my time this weekend. I see the tests are failing and I think it is expected for spv.8bit-16bit-construction.frag. You may inspect the SPIR-V with my patch applied for this shader and correct the reference SPIR-V file used for testing.

ravi688 · 2024-06-29T17:34:05Z

I think the following set of SPIR-V instructions are inefficient as what more the SPV_KHR_{16|8}bit_storage extensions allow.

...
    %19 = OpAccessChain %_ptr_Uniform_v4short %_ %int_0
    %20 = OpLoad %v4short %19
    %22 = OpSConvert %v4int %20
    %23 = OpSConvert %v4char %22
    %25 = OpAccessChain %_ptr_Uniform_v4char %cob %int_0
    OpStore %25 %23
...

Direct conversion from short to char is possible and allowed in SPV_KHR_{16|8}bit_storage extensions. However, it is not allowed in the corresponding GLSL extensions.

Any comments appreciated...

dnovillo · 2025-06-26T20:26:24Z

@ravi688 I have some time to look at this, but it seems to be failing CIs. Could you update the PR and ping me? Thanks.

dnovillo

Please add tests as well. You can use the new gtests/SpvPatternTest.cpp harness which should simplify adding tests for some combinations. At least the ones in the original issue. Thanks.

dnovillo · 2025-08-27T22:13:29Z

+                        aggregateOp = EOpConstructUint;
+                    else
+                        aggregateOp = (TOperator)(EOpConstructUVec2 + op - EOpConstructU16Vec2);
+                    newNode = intermediate.setAggregateOperator(newNode, aggregateOp, tempType, node->getLoc());


This same block is repeated 4 times. Consider extracting it into a helper function.

This was referenced Jun 23, 2024

Incorrect SPIRV codegen for 8bit/16bit variables in buffers #3607

Open

Generate vector constructions more efficiently when sizes match #3628

Merged

dnovillo self-requested a review June 26, 2025 20:26

dnovillo requested changes Aug 27, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix for #3607#3631

Fix for #3607#3631
ravi688 wants to merge 1 commit intoKhronosGroup:mainfrom
ravi688:BugFix-3607

ravi688 commented Jun 23, 2024 •

edited

Loading

Uh oh!

ravi688 commented Jun 23, 2024

Uh oh!

ravi688 commented Jun 29, 2024 •

edited

Loading

Uh oh!

dnovillo commented Jun 26, 2025

Uh oh!

dnovillo left a comment

Uh oh!

dnovillo Aug 27, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ravi688 commented Jun 23, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What types of shaders are affected?

Uh oh!

ravi688 commented Jun 23, 2024

Uh oh!

ravi688 commented Jun 29, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dnovillo commented Jun 26, 2025

Uh oh!

dnovillo left a comment

Choose a reason for hiding this comment

Uh oh!

dnovillo Aug 27, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ravi688 commented Jun 23, 2024 •

edited

Loading

ravi688 commented Jun 29, 2024 •

edited

Loading