    {
        vfloat32m2_t _s = __riscv_vle32_v_f32m2(scale_data, vlm2);
        _scale = __riscv_vcreate_v_f32m2_f32m8(_s, _s, _s, _s);
    }
All the "elempack == vlm2" checks were added to get past the "FORCE_INPUT_PACK8" unit test. It seems to work, but I wonder: in RVV, when would the dequantize layer actually receive pack8 input? Doesn't the Net module always pack elements to a single vector length?
Yes, it always uses the native packn. packn can be pack4, pack8, pack16, etc.
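To make the packing discussion concrete, here is a minimal scalar sketch, not the code from this PR. It assumes that the native packn for fp32 is the number of 32-bit lanes in one vector register (VLEN / 32), and that a per-channel scale vector with `elempack` entries is tiled across the packed data, which is roughly what replicating `_s` four times into an m8 register does. The names `native_packn_fp32` and `dequantize_packed` are hypothetical and exist only for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper: number of fp32 lanes in one vector register (LMUL = 1).
// Assumption: native packn == VLEN / 32, so VLEN=128 gives pack4, VLEN=256 gives pack8.
static int native_packn_fp32(int vlen_bits)
{
    assert(vlen_bits > 0 && vlen_bits % 32 == 0);
    return vlen_bits / 32;
}

// Scalar model of per-channel dequantize on packed data.
// scale.size() plays the role of elempack: the scale pattern repeats every
// elempack elements, like one scale register tiled across a wider register group.
static std::vector<float> dequantize_packed(const std::vector<int32_t>& in,
                                            const std::vector<float>& scale)
{
    const std::size_t elempack = scale.size();
    assert(elempack > 0 && in.size() % elempack == 0);

    std::vector<float> out(in.size());
    for (std::size_t i = 0; i < in.size(); i++)
    {
        // lane i uses the scale of channel (i % elempack) within the pack
        out[i] = static_cast<float>(in[i]) * scale[i % elempack];
    }
    return out;
}
```

Under these assumptions, an input whose elempack equals twice the native packn (the "elempack == vlm2" case) would only show up when a test forces it, since the native packing would normally produce elempack == packn.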
Codecov Report
❌ Patch coverage is
Additional details and impacted files

    @@            Coverage Diff            @@
    ##           master    #6658    +/-    ##
    =========================================
      Coverage   93.96%   93.96%
    =========================================
      Files         932      934      +2
      Lines      299059   299395    +336
    =========================================
    + Hits       280998   281321    +323
    - Misses      18061    18074     +13
Road to int8 optimization Episode 2: Dequantize Layer