llvm-project

Commit Graph

Author	SHA1	Message	Date
Jolanta Jensen	66e3589cd7	[NFC][CostModel] Added floating point frem test for SVE Differential Revision: https://reviews.llvm.org/D136241	2022-10-19 19:34:14 +00:00
David Green	de6dfbbb30	[ARM] Fix for MVE i128 vector icmp costs. We were hitting an assert as the legalied type needn't be a vector. Fixes #58364	2022-10-14 18:49:25 +01:00
Simon Pilgrim	a640aa5bfd	[CostModel][X86] Add insertelement costs into a known base vector value We were only testing inserting into undef/poison base vectors Test coverage for Issue #58261	2022-10-11 12:07:25 +01:00
Craig Topper	de0de294eb	[RISCV] Update cost of vector roundeven to match round which uses the same sequence but a different FRM value. Reviewed By: reames, eopXD Differential Revision: https://reviews.llvm.org/D134978	2022-09-30 20:01:35 -07:00
Philip Reames	02bfe2de7c	[RISCV] Adjust vector immediate store materialization cost This change updates the costs to make constant pool loads match their actual cost, and adds the broadcast special case to avoid too many regressions. We really need more information about the constants being rematerialized, but this is an incremental improvement. Differential Revision: https://reviews.llvm.org/D134746	2022-09-29 07:37:13 -07:00
eopXD	02a982829c	[RISCV] Add lowering for llvm.roundeven Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D134785	2022-09-29 06:08:14 -07:00
Florian Hahn	eba84971ae	Revert "[AARCH64][CostModel] Modified the cost of mask vector load/store" This reverts commit `1c62af3e23`. The commit causes the test below to fail. Revert for now to get the bots back to green. Failing test: lvm/test/Transforms/LoopVectorize/AArch64/masked-op-cost.ll	2022-09-28 15:35:13 +01:00
liqinweng	1c62af3e23	[AARCH64][CostModel] Modified the cost of mask vector load/store Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D134413	2022-09-28 19:40:29 +08:00
Simon Pilgrim	196f27bb56	[CostModel][X86] Add missing cost kinds for v2i64 icmp on SLM	2022-09-25 15:12:21 +01:00
Simon Pilgrim	faff990e9b	[X86] Fix Icelake VPMULLQ zmm pipes and adjust AVX512DQ v8i64 mul costs to match worse case Icelake PMULLQ throughput regressed cf SkylakeServer as its Pipe0 only Confirmed with Intel SOM, Agner and instlatx64	2022-09-25 14:18:08 +01:00
Simon Pilgrim	a6e9141505	[TTI] Add OperandValueProperties::OP_NegatedPowerOf2 enum (PR51436) The mul by constant costmodels handle power-of-2 constants, but not negated-power-of-2, despite the backends handling both. This patch adds the OperandValueProperties::OP_NegatedPowerOf2 enum and wires it for use for basic mul cost analysis and SLP handling. Fixes #50778 Differential Revision: https://reviews.llvm.org/D111968	2022-09-23 14:03:18 +01:00
Hassnaa Hamdi	181f200a1c	[NFC]: AArch64-SVE modify some comments	2022-09-23 12:07:31 +00:00
Hassnaa Hamdi	f2072e0ae0	[AArh64-SVE]: Improve cost model for div/udiv/mul 128-bit vector operations Differential Revision: https://reviews.llvm.org/D132477	2022-09-22 16:50:55 +00:00
Simon Pilgrim	e56b507447	[CostModel][X86] Add CostKinds test coverage for mul-by-constant patterns Help check to see the costs predicted for mul->shift conversions	2022-09-22 16:40:57 +01:00
Simon Pilgrim	e31482fb73	[CostModel][X86] Add gep.ll CostKind test coverage	2022-09-22 15:07:25 +01:00
Simon Pilgrim	867cd843fe	[CostModel][X86] Regenerate gep.ll test checks	2022-09-22 15:07:25 +01:00
Simon Pilgrim	e030be64d8	[CostModel][X86] Add partial CostKinds handling for funnelshifts/rotates This mainly just adds costs for the targets where we have actual funnelshift/rotate instructions (VBMI2/XOP etc.) - the cases where we expand still need addressing, although for many the default shift+or expansion, especially for uniform cases, isn't that bad. This was achieved with the 'cost-tables vs llvm-mca' script D103695	2022-09-22 11:24:11 +01:00
Simon Pilgrim	b2cd8118d0	[CostModel][X86] Add CostKinds handling for smax/smin/umax/umin instructions This was achieved with the 'cost-tables vs llvm-mca' script D103695	2022-09-22 10:19:23 +01:00
Simon Pilgrim	839ba13c3e	[CostModel][X86] Add vbmi2 costs for funnelshift/rotate intrinsics Add costs for the funnel shift instructions - fixes some discrepancies I was hitting with costs numbers from the 'cost-tables vs llvm-mca' script D103695	2022-09-21 13:48:22 +01:00
Simon Pilgrim	2a80a8623c	[CostModel][X86] Add vbmi2 test coverage for funnelshift/rotate intrinsics vbmi2 has vector funnel shift support that we should be costing correctly	2022-09-21 13:34:32 +01:00
Simon Pilgrim	46e036b5c2	[CostModel][X86] Remove out of date TODO ROTR constant and uniform-constant tests were added some time ago by `2fe1076a08`	2022-09-21 13:25:38 +01:00
Simon Pilgrim	7241b194de	[CostModel][X86] Add CostKinds test coverage for funnelshift/rotate intrinsics	2022-09-21 12:00:20 +01:00
Simon Pilgrim	7e5db16850	[CostModel][X86] Add CostKinds test coverage for min/max intrinsics	2022-09-19 20:50:25 +01:00
Simon Pilgrim	6b4d409f69	[CostModel][X86] Add CostKinds handling for CTLZ_ZERO_UNDEF/CTTZ_ZERO_UNDEF instructions This was achieved with the 'cost-tables vs llvm-mca' script D103695	2022-09-19 17:37:58 +01:00
Simon Pilgrim	135c9b2c4b	[CostModel][X86] Add CostKinds handling for vector ctlz instructions This was achieved with the 'cost-tables vs llvm-mca' script D103695	2022-09-19 16:44:09 +01:00
Simon Pilgrim	2538adde5c	[CostModel][X86] Add CostKinds handling for cttz This was achieved with the 'cost-tables vs llvm-mca' script D103695	2022-09-19 15:57:03 +01:00
Simon Pilgrim	d90a42d64c	[CostModel][X86] Add CTLZ_ZERO_UNDEF/CTTZ_ZERO_UNDEF cost handling Without LZCNT/BMI, the *_ZERO_UNDEF costs are cheaper as they can avoid the zero handling.	2022-09-19 14:06:33 +01:00
Simon Pilgrim	23cb1c42cd	[CostModel][X86] Update throughput costs for CTLZ ops This was achieved with an updated version of the 'cost-tables vs llvm-mca' script D103695 (and recent fixes to the bdver2 + alderlake models) Adding full CostKinds costs are affecting some other tests as they make assumptions about SizeLatency costs, so they need addressing first	2022-09-16 16:56:49 +01:00
Simon Pilgrim	f8fa04295f	[CostModel][X86] Add CostKinds handling for vector integer comparisons These were based off a mixture of vector integer add/sub costs and the numbers from the 'cost-tables vs llvm-mca' script from D103695 - the extra costs for different predicates are still proving tricky to implement, but I've gotten most costs to within +/1 now - the AVX512 are tricky as we still don't handle predicate results properly, so most of these were done by hand.	2022-09-16 13:03:41 +01:00
Simon Pilgrim	94620e4fc3	[CostModel][X86] Add CostKinds handling for vector shift by generic/non-uniform shift amounts These are the worst case generic vector shift costs, where nothing is known about the shift amounts - in particular this should stop us using the default sizelatency cost of 1 for so many pre-AVX2 vector shifts that can often actually expand during lowering to +20 uops, just for 128-bit vectors, resulting in some horrible inline/unroll decisions. This was achieved with an updated version of the 'cost-tables vs llvm-mca' script D103695 (I'll update the patch soon for reference)	2022-09-15 16:51:58 +01:00
Simon Pilgrim	5182839e1b	[CostModel][X86] Remove redundant SSSE3 checks from div/rem costs These all match the default SSE2 costs so use those instead	2022-09-15 15:56:11 +01:00
Simon Pilgrim	7435ae25ab	[CostModel][X86] Remove redundant BTVER2 checks from arithmetic costs These all match the default AVX/AVX1 costs so use those instead	2022-09-15 15:29:00 +01:00
Simon Pilgrim	c119828302	[CostModel][X86] Remove redundant BTVER2 checks from shift costs These all match the default AVX/AVX1 costs so use those instead	2022-09-15 15:28:59 +01:00
Simon Pilgrim	0ec028fe10	[CostModel][X86] Add CostKinds handling for vector shift by uniform/constuniform ops Vector shift by const uniform is the cheapest shift instruction we have, non-const uniform have a marginally higher cost - some targets 'splat' the amount internally to use the shift-per-element instruction, others see a higher cost for the explicit zeroing of the upper bits for the (64-bit) shift amount. This was achieved with an updated version of the 'cost-tables vs llvm-mca' script D103695 (I'll update the patch soon for reference)	2022-09-15 14:05:30 +01:00
Simon Pilgrim	854a4595b6	[CostModel][X86] getArithmeticInstrCost - move GLM/SLM custom costs AFTER constant shift -> multiply canonicalization Corrects the shift by constant costs to better account for them being converted to multiples for lowering - which demonstrates that we should probably be trying harder NOT to convert these to multiplies for some CPUs (v4i32 in particular).	2022-09-14 11:46:26 +01:00
Simon Pilgrim	40ab7875f8	[CostModel][X86] Fix throughput costs for AVX512BW v32i16 shifts Fixes regression from `a931dbfbd3`	2022-09-14 11:18:23 +01:00
jacquesguan	ecf327f154	[RISCV] Add cost model for vector insert/extract element. This patch adds cost model for vector insert/extract element instructions. In RVV, we could use vector scalar move instruction to insert or extract the first element, and use vslide to move it. But for mask vector or i64 vector in i32 target, we need special instructions to make it. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D133007	2022-09-14 11:10:18 +08:00
Simon Pilgrim	ac89b03934	[CostModel][X86] Add CostKinds test coverage for bitreverse intrinsics	2022-09-13 11:23:03 +01:00
jacquesguan	1f7b94e0ec	[RISCV][test] Add test for the cost model of vector insert/extract element. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D133005	2022-09-13 16:55:58 +08:00
jacquesguan	b98b4fae75	[RISCV] Add cost model for compare and select instructions. This patch adds cost model for vector compare and select instructions. For vector FP compare instruction, it only add the comparisions supported natively. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D132296	2022-09-13 14:44:46 +08:00
Simon Pilgrim	20ad05f9b4	[CostModel][X86] Add CostKinds handling for abs ops This was achieved with an updated version of the 'cost-tables vs llvm-mca' script D103695	2022-09-12 16:34:37 +01:00
Simon Pilgrim	e4f4542517	[CostModel][X86] Add CostKinds test coverage for abs intrinsics	2022-09-12 15:36:41 +01:00
Simon Pilgrim	bd0109f392	[CostModel][X86] Move AVX512/AVX2 uniform shift costs into the generic uniform cost tables They shouldn't be happening after XOP shift costs - AVX2 shift supports takes preference over XOP for everything but vXi8 shifts - the improvement is pretty limited as it only affects bdver4 targets but it does help clean up a fraction of the messy shift cost logic....	2022-09-12 12:08:42 +01:00
Simon Pilgrim	a931dbfbd3	[CostModel][X86] Merge AVX512BW vXi8/vXi16 shifts into default AVX512BW cost table We only need to handle the uniform cases early	2022-09-10 18:18:42 +01:00
Simon Pilgrim	10edf88458	[CostModel][X86] Update CTPOP costs With the bdver2 model updates, many of the AVX1 costs were far too high - it also helped expose some costs mismatches for Atom/Silvermont	2022-09-10 17:57:20 +01:00
Mingming Liu	8aa800614b	[AArch64][CostModel] Detects that {extract,insert}-element at lane 0 has the same cost as the other lane for vector instructions in the IR. Currently, {extract,insert}-element has zero cost at lane 0 [1]. However, there is a cost (by fmov instruction [2], or ext/ins instruction) to move values from SIMD registers to GPR registers, when the element is used explicitly as integers. See https://godbolt.org/z/faPE1nTn8, when fmov is generated for d* register -> x* register conversion. Implementation-wise, add a private method `AArch64TTIImpl::getVectorInstrCostHelper` as a helper function. This way, instruction-based method could share the core logic (e.g., returning zero cost if type is legalized to scalar). [1] `2cf320d41e/llvm/lib/Target/AArch64/AArch64TargetTransformInfo.cpp (L1853)` [2] `2cf320d41e/llvm/lib/Target/AArch64/AArch64InstrInfo.td (L8150-L8157)` Differential Revision: https://reviews.llvm.org/D128302	2022-09-09 09:47:30 -07:00
Simon Pilgrim	55b78e28d8	[CostModel][X86] Add missing i8 throughput cost	2022-09-09 10:58:51 +01:00
liqinweng	9b4e75ee76	[RISCV][COST] Add cost model for mask vector select instruction when its condition is a scalar type Reviewed By: jacquesguan Differential Revision: https://reviews.llvm.org/D132992	2022-09-08 18:55:49 +08:00
Simon Pilgrim	e74102a963	[CostModel][X86] Merge getTypeBasedIntrinsicInstrCost into getIntrinsicInstrCost For the few non type based intrinsic cases we can just check for !isTypeBasedOnly() to access the args directly. I don't think we have a need to keep getTypeBasedIntrinsicInstrCost in BasicTTIImpl.h any more and can do a similar merge there as well - but it's a messier refactor and will take a while.	2022-09-07 12:04:09 +01:00
Simon Pilgrim	648e182d92	[CostModel][X86] getIntrinsicInstrCost - convert to CostKindTblEntry Begin the refactoring to use CostKindTblEntry and return real latency/codesize/sizelatency costs instead of reusing the throughput numbers This should allow us to merge getTypeBasedIntrinsicInstrCost into getIntrinsicInstrCost and remove all remaining references	2022-09-06 22:05:32 +01:00

1 2 3 4 5 ...

1346 Commits