llvm-project

Commit Graph

Author	SHA1	Message	Date
Krzysztof Parzyszek	e5d9ab08c3	[Hexagon] Fix insertion point for pointer difference calculation HVC::calculatePointerDifference inserts temporary instructions for simplification, and calculation of known bits. These instructions were inserted at the end of a basic block (after the terminator), which caused BB->getTerminator() to return nullptr. This, in turn, caused a crash when a PHI instruction was examined in computeKnownBits.	2022-10-19 14:23:39 -07:00
Krzysztof Parzyszek	6a8cfe9a72	[Hexagon] Use shifts by scalar for funnel shifts by scalar HVX has vector shifts by a scalar register. Use those in the expansions of funnel shifts where profitable.	2022-10-18 09:49:17 -07:00
Krzysztof Parzyszek	9fde8e907b	[Hexagon] Fix MULHS lowering for HVX v60 The carry bit from an intermediate addition was not properly propagated. For example mulhs(7fffffff, 7fffffff) was evaluated as 3ffeffff, while the correct result is 3fffffff.	2022-10-18 07:54:38 -07:00
Krzysztof Parzyszek	fb063ea2ea	[Hexagon] Clean up leftover instructions in HvxIdioms Quick and dirty fix, because this is causing one builder to fail.	2022-10-14 16:45:03 -07:00
Krzysztof Parzyszek	6cb2a02a38	[Hexagon] Report if changes were made in HvxIdioms pass This should fix ``` Pass modifies its input and doesn't report it: Hexagon Vector Combine Pass modifies its input and doesn't report it UNREACHABLE executed at [...hecks-debian/llvm-project/llvm/lib/IR/LegacyPassManager.cpp:1436! ```	2022-10-14 15:46:33 -07:00
Krzysztof Parzyszek	361a27c155	[Hexagon] Recognize idioms for fixed-point vector multiplication Recognize Q.15*Q.15 and Q.31*Q.31, with and without rounding.	2022-10-14 15:22:25 -07:00
Krzysztof Parzyszek	b465a98316	[Hexagon] Fix isTypeForHVX for vector predicates HexagonSubtarget::isTypeForHVX would stop breaking the type up when it reached 64 bits in width. HVX vector predicates can be shorter than that, for example <32 x i1> would have a bitwidth of 32, and it's still a valid HVX type.	2022-10-14 14:38:41 -07:00
Krzysztof Parzyszek	705e77abed	[Hexagon] Lower funnel shifts for HVX HVX v62+ has bidirectional shifts, which do not mask the shift amount to the bit width. Instead, the shift amount is sign-extended from the log(BW) bit value, and a negative value causes a shift in the other direction. For the shift amount being -log(BW), this reversed shift will shift all bits out, inserting 0s or sign bits depending on the type and direction.	2022-10-14 14:13:18 -07:00
Krzysztof Parzyszek	e8375e3042	[Hexagon] Use IRBuilderBase in function parameters This will allow using builders with different folders.	2022-10-14 12:10:59 -07:00
Krzysztof Parzyszek	7f4ce3f1eb	[Hexagon] Introduce PS_vsplat[ir][bhw] pseudo instructions HVX v60 only has splats that take a 32-bit word as input, while v62+ has splats that take 8- or 16-bit value. This makes writing output patterns that need to use a splat annoying, because the entire output pattern needs to be replicated for various versions of HVX. To avoid this, the patterns will always use the pseudos, and then the pseudos will be handled using a post-ISel hook.	2022-10-14 12:03:13 -07:00
Krzysztof Parzyszek	79632163db	[Hexagon] Switch vunpackub->op->vpackeb pattern to vzb/vshuffeb V6_vzb and V6_vshuffeb can use any 2 resources in a packet, while V6_vunpackub/V6_vpackeb both need a shift resource. Also, add patterns for shifting vectors of i8.	2022-10-12 15:31:28 -07:00
Krzysztof Parzyszek	dca7e451ee	[Hexagon] Handle packing of even/odd 32-bit words This is a workaround until perfect shuffle generation is improved.	2022-10-12 13:00:14 -07:00
Krzysztof Parzyszek	2d8d2bec70	[Hexagon] Implement TLI::isExtractSubvectorCheap hook	2022-10-12 12:48:56 -07:00
Krzysztof Parzyszek	cb6804104f	[Hexagon] Remove unused function, NFC	2022-10-11 08:05:22 -07:00
Krzysztof Parzyszek	09d84e0ad8	[Hexagon] Implement helper to get intrinsic for instruction opcode There are intrinsics for most scalar instructions and almost all HVX instructions. What's somewhat painful is that there are two intrinsics for each HVX instruction: one for 64- and one for 128-byte mode. Instead of checking the current codegen settings every time, this function would simply return the right intrinsic.	2022-10-07 15:56:06 -07:00
Krzysztof Parzyszek	d184045d36	[Hexagon] Formatting changes, NFC	2022-10-07 09:13:51 -07:00
Krzysztof Parzyszek	e492cdc358	[Hexagon] Add couple of helper functions in HexagonVectorCombine 1. `length(value/type)`: return the number of elements in the vector input, 2. `getHvxTy(elem_type)`: return the HVX vector type with the element type provided. These will help write things more succinctly.	2022-10-07 09:10:08 -07:00
Krzysztof Parzyszek	06019b8e55	[Hexagon] Add default parameter to HexagonVectorCombine::getIntTy, NFC	2022-10-07 08:52:19 -07:00
Krzysztof Parzyszek	d376b2667a	[Hexagon] Make HexagonSubtarget::isHVXVectorType take EVT instead of MVT EVT can be created for any Type, and so this function can now be used to check if given Type, as-is, is an HVX type (as opposed to a type that may be subject to legalization to an HVX type).	2022-10-07 08:42:39 -07:00
Krzysztof Parzyszek	2216d8f6b8	[Hexagon] Replace llvm::Optional with std::optional, NFC	2022-10-07 08:23:39 -07:00
Krzysztof Parzyszek	473210ae90	[Hexagon] Constify member reference, NFC	2022-10-07 08:23:39 -07:00
Simon Pilgrim	759bedade5	Fix MSVC "not all control paths return a value" warning. NFCI.	2022-09-28 10:56:37 +01:00
Krzysztof Parzyszek	7da2b91887	[Hexagon] Unify getSizeOfs in HexagonVectorCombine, NFC	2022-09-27 10:51:52 -07:00
Krzysztof Parzyszek	9c9e877b7e	[Hexagon] Move function to a different class, NFC "Sector" is a concept from AlignVectors, so the check for it should be there.	2022-09-27 10:32:52 -07:00
Krzysztof Parzyszek	dfaf7a2846	[Hexagon] Avoid some unnecessary sign-extend instructions Simplify (sext_inreg (extractu ...)) -> (extract ...) where appropriate.	2022-09-26 12:30:18 -07:00
Krzysztof Parzyszek	d6c0a5be7f	[Hexagon] Make sure we can still shift scalar vectors by non-splats	2022-09-26 11:25:06 -07:00
Fangrui Song	8805e5d1b7	[Hexagon] Fix -Wunused-variable in non-assertion builds after `f6e7ad5604`	2022-09-21 14:14:45 -07:00
Krzysztof Parzyszek	f6e7ad5604	[Hexagon] Revamp type legalization of ext/trunc/sat in HVX Resizing operations (e.g. sign extension) in DAG can go from any width to any other width, e.g. i8 -> i32. If the input and the result differ by a factor larger than 2, the operation cannot be legal in HVX, since the only two legal vector sizes in HVX are a single vector and a pair of vectors. To simplify the legalization, such operations are expanded into steps that only double/halve the type size, so that each such step can be fully legalized on its own. The complication is that DAG will automatically fold these steps back into one, e.g. sext(sext) -> sext. To prevent that, new HexagonISD nodes are introduced: TL_EXTEND and TL_TRUNCATE. Once legalized, these nodes are replaced with the original opcodes. The type legalization is now common to aext/sext/zext/trunc and Hexagon-specific ssat/usat nodes.	2022-09-21 11:25:27 -07:00
Simon Pilgrim	0015edeefd	Fix MSVC "result of 32-bit shift implicitly converted to 64 bits" warning. NFCI.	2022-09-20 14:24:07 +01:00
Simon Pilgrim	70582bc4d3	Fix MSVC "result of 32-bit shift implicitly converted to 64 bits" warnings. NFCI.	2022-09-20 10:35:32 +01:00
Haojian Wu	eec19987c0	Fix one more unused warning in release build, NFC	2022-09-19 20:56:39 +02:00
Haojian Wu	20822e2d42	Fix an unused warning in release build, NFC	2022-09-19 20:45:51 +02:00
Krzysztof Parzyszek	94a71361d6	[Hexagon] Implement [SU]INT_TO_FP and FP_TO_[SU]INT for HVX	2022-09-19 11:11:20 -07:00
Krzysztof Parzyszek	ec51e38062	[Hexagon] Add HVX patterns for ISD::ABS	2022-09-19 10:12:15 -07:00
Krzysztof Parzyszek	3eee45cdc8	[Hexagon] Rework SplitHvxPairOp to be a general vector splitting utility Enable creating an idiom: V -> opJoin(SplitVectorOp(V))	2022-09-19 09:42:13 -07:00
Krzysztof Parzyszek	e5844462f6	[Hexagon] Use proper output chain when widening HVX loads	2022-09-19 09:04:13 -07:00
Sergei Barannikov	c6acb4eb0f	[SDAG] Add `getCALLSEQ_END` overload taking `uint64_t`s All in-tree targets pass pointer-sized ConstantSDNodes to the method. This overload reduced amount of boilerplate code a bit. This also makes getCALLSEQ_END consistent with getCALLSEQ_START, which already takes uint64_ts.	2022-09-15 14:02:12 -04:00
Joe Loser	5e96cea1db	[llvm] Use std::size instead of llvm::array_lengthof LLVM contains a helpful function for getting the size of a C-style array: `llvm::array_lengthof`. This is useful prior to C++17, but not as helpful for C++17 or later: `std::size` already has support for C-style arrays. Change call sites to use `std::size` instead. Differential Revision: https://reviews.llvm.org/D133429	2022-09-08 09:01:53 -06:00
Krzysztof Parzyszek	3c817574c2	[Hexagon] Handle shifts of short vectors of i8	2022-09-08 07:52:16 -07:00
Krzysztof Parzyszek	c37acb6426	[Hexagon] Move vectorization checks from subtarget to TTI	2022-09-07 14:47:24 -07:00
Kazu Hirata	2833760c57	[Target] Qualify auto in range-based for loops (NFC)	2022-08-28 17:35:09 -07:00
Simon Pilgrim	f9de13232f	[X86] Promote i8/i16 CTTZ (BSF) instructions and remove speculation branch This patch adds a Type operand to the TLI isCheapToSpeculateCttz/isCheapToSpeculateCtlz callbacks, allowing targets to decide whether branches should occur on a type-by-type/legality basis. For X86, this patch proposes to allow CTTZ speculation for i8/i16 types that will lower to promoted i32 BSF instructions by masking the operand above the msb (we already do something similar for i8/i16 TZCNT). This required a minor tweak to CTTZ lowering - if the src operand is known never zero (i.e. due to the promotion masking) we can remove the CMOV zero src handling. Although BSF isn't very fast, most CPUs from the last 20 years don't do that bad a job with it, although there are some annoying passthrough EFLAGS dependencies. Additionally, now that we emit 'REP BSF' in most cases, we are tending towards assuming this will most likely be executed as a TZCNT instruction on any semi-modern CPU. Differential Revision: https://reviews.llvm.org/D132520	2022-08-24 17:28:18 +01:00
Philip Reames	c9608d57b8	[TTI] Plumb through OperandValueInfo in getMemoryOpCost [NFC] This has the effect of exposing the power-of-two property for use in memory op costing, but no target actually uses it yet. The main point of this change is simple consistency with the recently changed getArithmeticInstrCost, and to remove the last (interface) use of OperandValueKind.	2022-08-23 07:55:42 -07:00
Philip Reames	104fa367ee	[TTI] Use OperandValueInfo in getArithmeticInstrCost implementation [NFC] This change completes the process of replacing OperandValueKind and OperandValueProperties which were previously passed independently in this API with a single container class which contains both. This is the change which motivated the whole sequence which preceded it. In an original spike version of this change, I'd noticed a nasty bug: I'd changed the signature without changing names, and as a result, we silently passed additional information through a callsite which previously dropped the power-of-two fact. This might be harmless in most cases, but at least a couple clearly depended for correctness on not passing that property through. I did my best to split off prior changes which reduced the scope of this one, and which made it possible to use compiler assistance. For instance, every parameter which changes type in this change also changes name. This was intentional to make sure that every call site possibly affected must show up in the diff. This let me audit each one closely.	2022-08-22 15:16:39 -07:00
Simon Pilgrim	5263155d5b	[CostModel] Add CostKind argument to getShuffleCost Defaults to TCK_RecipThroughput - as most explicit calls were assuming TCK_RecipThroughput (vectorizers) or was just doing a before-vs-after comparison (vectorcombiner). Calls via getInstructionCost were just dropping the CostKind, so again there should be no change at this time (as getShuffleCost and its expansions don't use CostKind yet) - but it will make it easier for us to better account for size/latency shuffle costs in inline/unroll passes in the future. Differential Revision: https://reviews.llvm.org/D132287	2022-08-21 10:54:51 +01:00
Alexey Bataev	d53e245951	[COST][NFC]Introduce OperandValueKind in getMemoryOpCost, NFC. Added OperandValueKind OpdInfo parameter to getMemoryOpCost functions to better estimate cost with immediate values. Part of D126885.	2022-08-19 07:33:00 -07:00
Krzysztof Parzyszek	252cea037b	[Hexagon] Add defaulted operator= to classes with defaulted copy ctor This avoids deprecation warning: ``` warning: definition of implicit copy assignment operator for 'AddrInfo' is deprecated because it has a user-declared copy constructor [-Wdeprecated-copy] ``` This fixes https://github.com/llvm/llvm-project/issues/57229	2022-08-18 14:00:41 -07:00
Simon Pilgrim	fdec50182d	[CostModel] Replace getUserCost with getInstructionCost * Replace getUserCost with getInstructionCost, covering all cost kinds. * Remove getInstructionLatency, it's not implemented by any backends, and we should fold the functionality into getUserCost (now getInstructionCost) to make it easier for targets to handle the cost kinds with their existing cost callbacks. Original Patch by @samparker (Sam Parker) Differential Revision: https://reviews.llvm.org/D79483	2022-08-18 11:55:23 +01:00
Daniil Fukalov	7ed3d81333	[NFCI] Move cost estimation from TargetLowering to TargetTransformInfo. TargetLowering had two last InstructionCost related `getTypeLegalizationCost()` and `getScalingFactorCost()` members, but all other costs are processed in TTI. E.g. it is not comfortable to use other TTI members in these two functions overridden in a target. Minor refactoring: `getTypeLegalizationCost()` now doesn't need DataLayout parameter - it was always passed from TTI. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D117723	2022-08-18 00:38:55 +03:00
Eli Friedman	cfd2c5ce58	Untangle the mess which is MachineBasicBlock::hasAddressTaken(). There are two different senses in which a block can be "address-taken". There can be a BlockAddress involved, which means we need to map the IR-level value to some specific block of machine code. Or there can be constructs inside a function which involve using the address of a basic block to implement certain kinds of control flow. Mixing these together causes a problem: if target-specific passes are marking random blocks "address-taken", if we have a BlockAddress, we can't actually tell which MachineBasicBlock corresponds to the BlockAddress. So split this into two separate bits: one for BlockAddress, and one for the machine-specific bits. Discovered while trying to sort out related stuff on D102817. Differential Revision: https://reviews.llvm.org/D124697	2022-08-16 16:15:44 -07:00
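
The arithmetic quoted in commit 9fde8e907b (MULHS lowering for HVX v60) can be checked with a scalar reference. The sketch below is not the HVX lowering from that commit; it only computes the high 32 bits of a signed 32x32-bit product through a 64-bit intermediate. The name `mulhs_ref` is hypothetical, and the code assumes that `>>` on a negative signed value is an arithmetic shift (guaranteed from C++20, and the behaviour of mainstream compilers before that).

```cpp
#include <cstdint>

// Hypothetical scalar reference for MULHS: the high 32 bits of the
// signed 64-bit product of two 32-bit operands.
constexpr int32_t mulhs_ref(int32_t a, int32_t b) {
  // Widen both operands so the full 64-bit product is representable.
  const int64_t wide = static_cast<int64_t>(a) * static_cast<int64_t>(b);
  // Keep the upper half; assumes an arithmetic right shift for negatives.
  return static_cast<int32_t>(wide >> 32);
}

// (2^31 - 1)^2 = 0x3FFFFFFF00000001, so the high word is 0x3fffffff,
// the value the commit message gives as correct.
static_assert(mulhs_ref(0x7fffffff, 0x7fffffff) == 0x3fffffff,
              "high word of 0x7fffffff squared");
```

Under these assumptions, `mulhs_ref(0x7fffffff, 0x7fffffff)` yields `0x3fffffff` rather than the `0x3ffeffff` reported before the fix.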

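Commits 705e77abed and 6a8cfe9a72 concern lowering of funnel shifts. For readers unfamiliar with the operation, here is a minimal scalar sketch of the generic funnel-shift semantics used by LLVM's `llvm.fshl` and `llvm.fshr` intrinsics, where the shift amount is taken modulo the bit width. It does not model the HVX bidirectional shifts described in the commit message, and the names `fshl32` and `fshr32` are hypothetical.

```cpp
#include <cstdint>

// Funnel shift left: concatenate hi:lo into a 64-bit value, shift left by
// (amt mod 32), and return the upper 32 bits.
constexpr uint32_t fshl32(uint32_t hi, uint32_t lo, uint32_t amt) {
  // The zero case is handled separately because shifting a 32-bit value
  // by 32 is undefined behaviour in C++.
  return (amt % 32 == 0) ? hi
                         : (hi << (amt % 32)) | (lo >> (32 - amt % 32));
}

// Funnel shift right: concatenate hi:lo into a 64-bit value, shift right by
// (amt mod 32), and return the lower 32 bits.
constexpr uint32_t fshr32(uint32_t hi, uint32_t lo, uint32_t amt) {
  return (amt % 32 == 0) ? lo
                         : (hi << (32 - amt % 32)) | (lo >> (amt % 32));
}
```

Passing the same value as both operands gives a rotate, for example `fshl32(x, x, 4)` rotates `x` left by four bits.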
2924 Commits