llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	6c80267d0f	[CostModel][X86] getScalarizationOverhead - improve extraction costs for > 128-bit vectors We were using the default getScalarizationOverhead expansion for extraction costs, which adds up all the individual element extraction costs. This is fine for 128-bit vectors, but for 256/512-bit vectors each element extraction also has to account for extracting the upper 128-bit subvector extraction before it can handle the element. For scalarization costs we only need to extract each demanded subvector once. Differential Revision: https://reviews.llvm.org/D125527	2022-05-24 15:18:08 +01:00
Simon Pilgrim	a5c45c4dc1	[CostModel][X86] Auto generate gather/scatter LV costs using UTC_ARGS --filter control Also fix a sse42 -> sse4.2 typo so that we actually test costs for sse4.2	2022-05-12 17:39:06 +01:00
Roman Lebedev	2f80ea7f4f	[NFC][LV] Use different braces in debug output The analysis passes output function name encapsulated in `'` braces, but LV uses `"`. Harmonizing this may help in creating an update script for the LV costmodel test checks. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D121105	2022-03-07 19:32:37 +03:00
Roman Lebedev	8cd782487f	[X86][LoopVectorize] "Fix" `X86TTIImpl::getAddressComputationCost()` We ask `TTI.getAddressComputationCost()` about the cost of computing vector address, and then multiply it by the vector width. This doesn't make any sense, it implies that we'd do a vector GEP and then scalarize the vector of pointers, but there is no such thing in the vectorized IR, we perform scalar GEP's. This is especially bad on X86, and was effectively prohibiting any scalarized vectorization of gathers/scatters, because `X86TTIImpl::getAddressComputationCost()` says that cost of vector address computation is `10` as compared to `1` for scalar. The computed costs are similar to the ones with D111222+D111220, but we end up without masked memory intrinsics that we'd then have to expand later on, without much luck. (D111363) Differential Revision: https://reviews.llvm.org/D111460	2021-11-30 10:47:56 +03:00
Roman Lebedev	db848fbf67	[NFC][LV][X86] Improve test coverage for masked mem ops	2021-10-27 13:36:04 +03:00
Roman Lebedev	ff05e25a84	[NFC][X86][LV] Add some test coverage for [un]masked gather/scatter While we did have test coverage for the intrinsics, i don't believe there was LV-based test coverage.	2021-09-29 14:28:49 +03:00

6 Commits