Use shufflevector to do the subvector extracts. This allows a lot more load merging on AMDGPU and also on NVPTX when <2 x half> is involved. Differential Revision: https://reviews.llvm.org/D117219 |
||
|---|---|---|
| .. | ||
| AMDGPU | ||
| NVPTX | ||
| X86 | ||
| int_sideeffect.ll | ||
Use shufflevector to do the subvector extracts. This allows a lot more load merging on AMDGPU and also on NVPTX when <2 x half> is involved. Differential Revision: https://reviews.llvm.org/D117219 |
||
|---|---|---|
| .. | ||
| AMDGPU | ||
| NVPTX | ||
| X86 | ||
| int_sideeffect.ll | ||