llvm-project

Commit Graph

Author	SHA1	Message	Date
Eli Friedman	315a0c79f3	Simplify code for calling a function where CanLowerReturn fails, fixing a small bug in the process. llvm-svn: 157446	2012-05-25 00:09:29 +00:00
Craig Topper	9520719b9b	Mark some static arrays as const. llvm-svn: 157377	2012-05-24 06:35:32 +00:00
Owen Anderson	f2118ea826	Fix use of an unitialized value in the LegalizeOps expansion for ISD::SUB. No in-tree targets exercise this path. Patch by Micah Villmow. llvm-svn: 157215	2012-05-21 22:39:20 +00:00
Chad Rosier	5d1f5d2be3	Typo. llvm-svn: 157195	2012-05-21 17:13:41 +00:00
Peter Collingbourne	8eb05fd093	When legalising shifts, do not pre-build a list of operands which may be RAUW'd by the recursive call to LegalizeOps; instead, retrieve the other operands when calling UpdateNodeOperands. Fixes PR12889. llvm-svn: 157162	2012-05-20 18:36:15 +00:00
Jakob Stoklund Olesen	1f1c6add10	Properly constrain register classes for sub-registers. Not all GR64 registers have sub_8bit sub-registers. llvm-svn: 157150	2012-05-20 06:38:37 +00:00
Stepan Dyatkovskiy	b638ee0ed3	Recommited reworked r156804: SelectionDAGBuilder::Clusterify : main functinality was replaced with CRSBuilder::optimize, so big part of Clusterify's code was reduced. llvm-svn: 157046	2012-05-18 08:32:28 +00:00
Stepan Dyatkovskiy	96d0c925e9	SelectionDAGBuilder: CaseBlock, CaseRanges and CaseCmp changed representation of Low and High from signed to unsigned. Since unsigned ints usually simpler, faster and allows to reduce some extra signed bit checks needed before <,>,<=,>= comparisons. llvm-svn: 156985	2012-05-17 08:56:30 +00:00
Duncan Sands	49080cd9a1	Fix a thinko in DisintegrateMERGE_VALUES. Patch by Xiaoyi Guo. llvm-svn: 156909	2012-05-16 07:57:18 +00:00
Stepan Dyatkovskiy	e01e9863c5	Rejected r156804 due to buildbots failures. llvm-svn: 156808	2012-05-15 06:50:18 +00:00
Stepan Dyatkovskiy	d450d3fa12	SelectionDAGBuilder::Clusterify : main functinality was replaced with CRSBuilder::optimize, so big part of Clusterify's code was reduced. llvm-svn: 156804	2012-05-15 05:09:41 +00:00
Dan Gohman	164fe18cfe	Rename @llvm.debugger to @llvm.debugtrap. llvm-svn: 156774	2012-05-14 18:58:10 +00:00
Chad Rosier	a33015d4e0	Revert 156658. llvm-svn: 156662	2012-05-11 23:21:01 +00:00
Chad Rosier	e40f5d3ee0	[fast-isel] Fast-isel doesn't use the expect intrinsic. llvm-svn: 156658	2012-05-11 23:10:58 +00:00
Dan Gohman	dfab443ae8	Define a new intrinsic, @llvm.debugger. It will be similar to __builtin_trap(), but it generates int3 on x86 instead of ud2. llvm-svn: 156593	2012-05-11 00:19:32 +00:00
Jim Grosbach	92f6adc8be	DAGCombiner should not change the type of an extract_vector index. When a combine twiddles an extract_vector, care should be take to preserve the type of the index operand. No luck extracting a reasonable testcase, unfortunately. rdar://11391009 llvm-svn: 156419	2012-05-08 20:56:07 +00:00
Jakob Stoklund Olesen	3c52f0281f	Add an MF argument to TRI::getPointerRegClass() and TII::getRegClass(). The getPointerRegClass() hook can return register classes that depend on the calling convention of the current function (ptr_rc_tailcall). So far, we have been able to infer the calling convention from the subtarget alone, but as we add support for multiple calling conventions per target, that no longer works. Patch by Yiannis Tsiouris! llvm-svn: 156328	2012-05-07 22:10:26 +00:00
Owen Anderson	ab63d84252	Teach DAG combine to fold x-x to 0.0 when unsafe FP math is enabled. llvm-svn: 156324	2012-05-07 20:51:25 +00:00
Benjamin Kramer	e31f31e5c0	Add a new target hook "predictableSelectIsExpensive". This will be used to determine whether it's profitable to turn a select into a branch when the branch is likely to be predicted. Currently enabled for everything but Atom on X86 and Cortex-A9 devices on ARM. I'm not entirely happy with the name of this flag, suggestions welcome ;) llvm-svn: 156233	2012-05-05 12:49:14 +00:00
Jakob Stoklund Olesen	e326ed33a8	Make sure findRepresentativeClass picks the widest super-register. We want the representative register class to contain the largest super-registers available. This makes the function less sensitive to the register class numbering. llvm-svn: 156220	2012-05-04 22:53:28 +00:00
Jakob Stoklund Olesen	75fbe90839	Use SuperRegClassIterator for findRepresentativeClass(). The masks returned by SuperRegClassIterator are computed automatically by TableGen. This is better than depending on the manually specified SuperRegClasses. llvm-svn: 156147	2012-05-04 02:19:22 +00:00
Andrew Trick	32aea358e1	Added TargetRegisterInfo::getAllocatableClass. The ensures that virtual registers always belong to an allocatable class. If your target attempts to create a vreg for an operand that has no allocatable register subclass, you will crash quickly. This ensures that targets define register classes as intended. llvm-svn: 156046	2012-05-03 01:14:37 +00:00
Owen Anderson	41b0665b5b	Teach DAGCombine the same multiply-by-1.0 folding trick when doing FMAs, just like it now knows for FMULs. llvm-svn: 156029	2012-05-02 22:17:40 +00:00
Owen Anderson	b5f167c660	Teach DAG combine that multiplication by 1.0 can always be constant folded. llvm-svn: 156023	2012-05-02 21:32:35 +00:00
Jakub Staszak	cd2353402d	Use dyn_cast instead of checking opcode and cast. llvm-svn: 155957	2012-05-01 23:06:00 +00:00
Bill Wendling	b6b50c6638	Strip the pointer casts off of allocas so that the selection DAG can find them. PR10799 llvm-svn: 155954	2012-05-01 22:50:45 +00:00
Jakub Staszak	cec09b2594	Add some constantness. No functionality change. llvm-svn: 155859	2012-04-30 23:41:30 +00:00
Andrew Trick	833f04962a	Reapply 155668: Fix the SD scheduler to avoid gluing the same node twice. This time, also fix the caller of AddGlue to properly handle incomplete chains. AddGlue had failure modes, but shamefully hid them from its caller. It's luck ran out. Fixes rdar://11314175: BuildSchedUnits assert. llvm-svn: 155749	2012-04-28 01:03:23 +00:00
Andrew Trick	7a773ec053	Temporarily revert r155668: Fix the SD scheduler to avoid gluing. This definitely caused regression with ARM -mno-thumb. llvm-svn: 155743	2012-04-27 22:55:59 +00:00
Andrew Trick	03fa574af5	Fix the SD scheduler to avoid gluing the same node twice. DAGCombine strangeness may result in multiple loads from the same offset. They both may try to glue themselves to another load. We could insist that the redundant loads glue themselves to each other, but the beter fix is to bail out from bad gluing at the time we detect it. Fixes rdar://11314175: BuildSchedUnits assert. llvm-svn: 155668	2012-04-26 21:48:25 +00:00
Elena Demikhovsky	8d7e56c409	ZERO_EXTEND/SIGN_EXTEND/TRUNCATE optimization for AVX2 llvm-svn: 155309	2012-04-22 09:39:03 +00:00
Nadav Rotem	31caa27bf5	Teach getVectorTypeBreakdown about promotion of vectors in addition to widening of vectors. llvm-svn: 155296	2012-04-21 20:08:32 +00:00
Jakob Stoklund Olesen	d114da6004	Fix PR12599. The X86 target is editing the selection DAG while isel is selecting nodes following a topological ordering. When the DAG hacking triggers CSE, nodes can be deleted and bad things happen. llvm-svn: 155257	2012-04-20 23:36:09 +00:00
Jakob Stoklund Olesen	e3a891cf08	Make ISelPosition a local variable. Now that multiple DAGUpdateListeners can be active at the same time, ISelPosition can become a local variable in DoInstructionSelection. We simply register an ISelUpdater with CurDAG while ISelPosition exists. llvm-svn: 155249	2012-04-20 22:08:50 +00:00
Jakob Stoklund Olesen	beb9469d5c	Register DAGUpdateListeners with SelectionDAG. Instead of passing listener pointers to RAUW, let SelectionDAG itself keep a linked list of interested listeners. This makes it possible to have multiple listeners active at once, like RAUWUpdateListener was already doing. It also makes it possible to register listeners up the call stack without controlling all RAUW calls below. DAGUpdateListener uses an RAII pattern to add itself to the SelectionDAG list of active listeners. llvm-svn: 155248	2012-04-20 22:08:46 +00:00
Joel Jones	828531f798	Fixes a problem in instruction selection with testing whether or not the transformation: (X op C1) ^ C2 --> (X op C1) & ~C2 iff (C1&C2) == C2 should be done. This change has been tested: Using a debug+asserts build: on the specific test case that brought this bug to light make check-all lnt nt using this clang to build a release version of clang Using the release+asserts clang-with-clang build: on the specific test case that brought this bug to light make check-all lnt nt Checking in because Evan wants it checked in. Test case forthcoming after scrubbing. llvm-svn: 154955	2012-04-17 22:23:10 +00:00
Hal Finkel	e0cf6397fd	Remove dead SD nodes after the combining pass. Fixes PR12201. llvm-svn: 154786	2012-04-16 03:33:22 +00:00
Nadav Rotem	02ef0c3524	When emulating vselect using OR/AND/XOR make sure to bitcast the result back to the original type. llvm-svn: 154764	2012-04-15 15:08:09 +00:00
Nadav Rotem	9d376b6578	Reapply 154397. Original message: Fix a dagcombine optimization which assumes that the vsetcc result type is always of the same size as the compared values. This is ture for SSE/AVX/NEON but not for all targets. llvm-svn: 154490	2012-04-11 08:26:11 +00:00
Craig Topper	692d584910	Fix an overly indented line. Remove an 'else' after an 'if' that returns. llvm-svn: 154479	2012-04-11 04:55:51 +00:00
Craig Topper	bc680061e8	Inline implVisitAluOverflow by introducing a nested switch to convert the intrinsic to an nodetype. llvm-svn: 154478	2012-04-11 04:34:11 +00:00
Craig Topper	3ef01cdb2e	Optimize code a bit by calling push_back only once in some loops. Reduces compiled code size a bit. llvm-svn: 154473	2012-04-11 03:06:35 +00:00
Owen Anderson	6f1ee1634d	Move the constant-folding support for FP_ROUND in SelectionDAG from the one-operand version of getNode() to the two-operand version, since it became a two-operand node at sound point. Zap a testcase that this allows us to completely fold away. llvm-svn: 154447	2012-04-10 22:46:53 +00:00
Duncan Sands	4f53074cca	Add a comment noting that the fdiv -> fmul conversion won't generate multiplication by a denormal, and some tests checking that. llvm-svn: 154431	2012-04-10 20:35:27 +00:00
Eric Christopher	e9abba71fe	To ensure that we have more accurate line information for a block don't elide the branch instruction if it's the only one in the block, otherwise it's ok. PR9796 and rdar://11215207 llvm-svn: 154417	2012-04-10 18:18:10 +00:00
Owen Anderson	3efc8f22bd	Revert r154397, which was causing make check failures on the buildbots. llvm-svn: 154414	2012-04-10 18:02:12 +00:00
Nadav Rotem	065564d85a	Fix a dagcombine optimization which assumes that the vsetcc result type is always of the same size as the compared values. This is ture for SSE/AVX/NEON but not for all targets. llvm-svn: 154397	2012-04-10 14:58:31 +00:00
Anton Korobeynikov	4d1220de34	Transform div to mul with reciprocal only when fp imm is legal. This fixes PR12516 and uncovers one weird problem in legalize (workarounded) llvm-svn: 154394	2012-04-10 13:22:49 +00:00
Evan Cheng	136861d994	Make the code slightly more palatable. llvm-svn: 154378	2012-04-10 03:15:18 +00:00
Evan Cheng	f8bad08001	Fix a long standing tail call optimization bug. When a libcall is emitted legalizer always use the DAG entry node. This is wrong when the libcall is emitted as a tail call since it effectively folds the return node. If the return node's input chain is not the entry (i.e. call, load, or store) use that as the tail call input chain. PR12419 rdar://9770785 rdar://11195178 llvm-svn: 154370	2012-04-10 01:51:00 +00:00
Rafael Espindola	1d9672bdce	Don't try to zExt just to check if an integer constant is zero, it might not fit in a i64. llvm-svn: 154364	2012-04-10 00:16:22 +00:00
Akira Hatanaka	8483a6c47d	Have TargetLowering::getPICJumpTableRelocBase return a node that points to the GOT if jump table uses 64-bit gp-relative relocation. llvm-svn: 154341	2012-04-09 20:32:12 +00:00
Rafael Espindola	8f62b3248e	Pattern match a setcc of boolean value with 0 as a truncate. llvm-svn: 154322	2012-04-09 16:06:03 +00:00
Craig Topper	9c3da316ec	Remove unnecessary type check when combining and/or/xor of swizzles. Move some checks to allow better early out. llvm-svn: 154309	2012-04-09 07:19:09 +00:00
Craig Topper	e5893f64e8	Remove unnecessary 'else' on an 'if' that always returns llvm-svn: 154308	2012-04-09 05:59:53 +00:00
Craig Topper	e3ad4834ae	Optimize code slightly. No functionality change. llvm-svn: 154307	2012-04-09 05:55:33 +00:00
Craig Topper	5894fe430a	Replace some explicit checks with asserts for conditions that should never happen. llvm-svn: 154305	2012-04-09 05:16:56 +00:00
Craig Topper	6148fe65e8	Optimize code a bit. No functional change intended. llvm-svn: 154299	2012-04-08 23:15:04 +00:00
Benjamin Kramer	bb6ff08766	Silence sign-compare warning. llvm-svn: 154297	2012-04-08 19:04:45 +00:00
Duncan Sands	2f1dc3814b	Only have codegen turn fdiv by a constant into fmul by the reciprocal when -ffast-math, i.e. don't just always do it if the reciprocal can be formed exactly. There is already an IR level transform that does that, and it does it more carefully. llvm-svn: 154296	2012-04-08 18:08:12 +00:00
Craig Topper	c8e2d91a58	Simplify code that tries to do vector extracts for shuffles when the mask width and the input vector widths don't match. No need to check the min and max are in range before calculating the start index. The range check after having the start index is sufficient. Also no need to check for an extract from the beginning differently. llvm-svn: 154295	2012-04-08 17:53:33 +00:00
Chandler Carruth	16f0ebcbb5	Move the TLSModel information into the TargetMachine rather than hiding in TargetLowering. There was already a FIXME about this location being odd. The interface is simplified as a consequence. This will also make it easier to change TLS models when compiling with PIE. llvm-svn: 154292	2012-04-08 17:20:55 +00:00
Craig Topper	d024cef233	Turn avx2 vinserti128 intrinsic calls into INSERT_SUBVECTOR DAG nodes and remove patterns for selecting the intrinsic. Similar was already done for avx1. llvm-svn: 154272	2012-04-07 22:32:29 +00:00
Craig Topper	e09d1c5c48	Remove 'else' after 'if' that ends in return. llvm-svn: 154267	2012-04-07 21:23:41 +00:00
Nadav Rotem	71d07ae5cb	1. Remove the part of r153848 which optimizes shuffle-of-shuffle into a new shuffle node because it could introduce new shuffle nodes that were not supported efficiently by the target. 2. Add a more restrictive shuffle-of-shuffle optimization for cases where the second shuffle reverses the transformation of the first shuffle. llvm-svn: 154266	2012-04-07 21:19:08 +00:00
Duncan Sands	5f8397a934	Convert floating point division by a constant into multiplication by the reciprocal if converting to the reciprocal is exact. Do it even if inexact if -ffast-math. This substantially speeds up ac.f90 from the polyhedron benchmarks. llvm-svn: 154265	2012-04-07 20:04:00 +00:00
Jakob Stoklund Olesen	37492eac8c	Don't break the IV update in TLI::SimplifySetCC(). LSR always tries to make the ICmp in the loop latch use the incremented induction variable. This allows the induction variable to be kept in a single register. When the induction variable limit is equal to the stride, SimplifySetCC() would break LSR's hard work by transforming: (icmp (add iv, stride), stride) --> (cmp iv, 0) This forced us to use lea for the IC update, preventing the simpler incl+cmp. <rdar://problem/7643606> <rdar://problem/11184260> llvm-svn: 154119	2012-04-05 20:30:20 +00:00
Owen Anderson	a6eebf6013	Treat f16 the same as f80/f128 for the purposes of generating constants during instruction selection. llvm-svn: 154113	2012-04-05 18:50:32 +00:00
Pete Cooper	8a3dc0ed8c	f16 FREM can now be legalized by promoting to f32 llvm-svn: 154039	2012-04-04 19:36:31 +00:00
Rafael Espindola	ba0a6cabb8	Always compute all the bits in ComputeMaskedBits. This allows us to keep passing reduced masks to SimplifyDemandedBits, but know about all the bits if SimplifyDemandedBits fails. This allows instcombine to simplify cases like the one in the included testcase. llvm-svn: 154011	2012-04-04 12:51:34 +00:00
Craig Topper	4c7d995029	Remove default case from switch that was already covering all cases. llvm-svn: 153996	2012-04-04 04:42:42 +00:00
Pete Cooper	e7bff68a5e	Removed useless switch for default case when switch was covering all the enum values llvm-svn: 153984	2012-04-04 00:53:04 +00:00
Pete Cooper	9511ec86f9	Add VSELECT to LegalizeVectorTypes::ScalariseVectorResult. Previously it would crash if it encountered a 1 element VSELECT. Solution is slightly more complicated than just creating a SELET as we have to mask or sign extend the vector condition if it had different boolean contents from the scalar condition. Fixes <rdar://problem/11178095> llvm-svn: 153976	2012-04-03 22:57:55 +00:00
Chad Rosier	2a02fe1bb2	Fix an issue in SimplifySetCC() specific to vector comparisons. When folding X == X we need to check getBooleanContents() to determine if the result is a vector of ones or a vector of negative ones. I tried creating a test case, but the problem seems to only be exposed on a much older version of clang (around r144500). rdar://10923049 llvm-svn: 153966	2012-04-03 20:11:24 +00:00
Owen Anderson	98f2c0c384	Add predicates for checking whether targets have free FNEG and FABS operations, and prevent the DAGCombiner from turning them into bitwise operations if they do. llvm-svn: 153901	2012-04-02 22:10:29 +00:00
Nadav Rotem	702f080767	Optimizing swizzles of complex shuffles may generate additional complex shuffles. Do not try to optimize swizzles of shuffles if the source shuffle has more than a single user, except when the source shuffle is also a swizzle. llvm-svn: 153864	2012-04-02 07:11:12 +00:00
Nadav Rotem	b078350872	This commit contains a few changes that had to go in together. 1. Simplify xor/and/or (bitcast(A), bitcast(B)) -> bitcast(op (A,B)) (and also scalar_to_vector). 2. Xor/and/or are indifferent to the swizzle operation (shuffle of one src). Simplify xor/and/or (shuff(A), shuff(B)) -> shuff(op (A, B)) 3. Optimize swizzles of shuffles: shuff(shuff(x, y), undef) -> shuff(x, y). 4. Fix an X86ISelLowering optimization which was very bitcast-sensitive. Code which was previously compiled to this: movd (%rsi), %xmm0 movdqa .LCPI0_0(%rip), %xmm2 pshufb %xmm2, %xmm0 movd (%rdi), %xmm1 pshufb %xmm2, %xmm1 pxor %xmm0, %xmm1 pshufb .LCPI0_1(%rip), %xmm1 movd %xmm1, (%rdi) ret Now compiles to this: movl (%rsi), %eax xorl %eax, (%rdi) ret llvm-svn: 153848	2012-04-01 19:31:22 +00:00
Rafael Espindola	80c540e656	Teach CodeGen's version of computeMaskedBits to understand the range metadata. This is the CodeGen equivalent of r153747. I tested that there is not noticeable performance difference with any combination of -O0/-O2 /-g when compiling gcc as a single compilation unit. llvm-svn: 153817	2012-03-31 18:14:00 +00:00
Bill Wendling	9f829f1cc4	If we have a VLA that has a "use" in a metadata node that's then used here but it has no other uses, then we have a problem. E.g., int foo (const int x) { char a[x]; return 0; } If we assign 'a' a vreg and fast isel later on has to use the selection DAG isel, it will want to copy the value to the vreg. However, there are no uses, which goes counter to what selection DAG isel expects. <rdar://problem/11134152> llvm-svn: 153705	2012-03-30 00:02:55 +00:00
Eric Christopher	24a6298512	More debug output. llvm-svn: 153571	2012-03-28 07:34:36 +00:00
Chris Lattner	1cc25e8a40	fix what looks like a real logic bug, found by PVS-Studio (part of PR12357) llvm-svn: 153513	2012-03-27 16:27:21 +00:00
Eric Christopher	c1e2dcdb8a	Add a debug statement. llvm-svn: 153428	2012-03-26 06:10:32 +00:00
Hal Finkel	71c2ba3d2e	Add the ability to promote legal integer VAARGs. This is required for the PPC64 SVR4 ABI. llvm-svn: 153372	2012-03-24 03:53:52 +00:00
Evan Cheng	8ab58a21a5	Source order scheduler should not preschedule nodes with multiple uses. rdar://11096639 llvm-svn: 153270	2012-03-22 19:31:17 +00:00
Evan Cheng	79f03e915d	Assign node orders to target intrinsics which do not produce results. rdar://11096639 llvm-svn: 153269	2012-03-22 19:29:09 +00:00
Chad Rosier	6a63a74113	[fast-isel] Fold "urem x, pow2" -> "and x, pow2-1". This should fix the 271% execution-time regression for nsieve-bits on the ARMv7 -O0 -g nightly tester. This may also improve compile-time on architectures that would otherwise generate a libcall for urem (e.g., ARM) or fall back to the DAG selector. rdar://10810716 llvm-svn: 153230	2012-03-22 00:21:17 +00:00
Jim Grosbach	e13adc38d0	Checking a build_vector for an all-ones value. Type legalization can zero-extend the elements of the build_vector node, so, for example, we may have an <8 x i8> with i32 elements of value 255. That should return 'true' for the vector being all ones. llvm-svn: 153203	2012-03-21 17:48:04 +00:00
Craig Topper	aaeae98936	When combining (vextract shuffle (load ), <1,u,u,u>), 0) -> (load ), add users of the final load to the worklist too. Needed by changes I'm preparing to make to X86 backend. llvm-svn: 153078	2012-03-20 05:28:39 +00:00
Eric Christopher	60e01c560a	Do everything up to generating code to try to get a register for a variable. The previous code would break the debug info changing code invariant. This will regress debug info for arguments where we elide the alloca created. Fixes rdar://11066468 llvm-svn: 153074	2012-03-20 01:07:58 +00:00
Eric Christopher	997aaa9237	Untabify. llvm-svn: 153073	2012-03-20 01:07:56 +00:00
Eric Christopher	e5e54c87fa	Add another debugging statement here. llvm-svn: 153072	2012-03-20 01:07:53 +00:00
Eric Christopher	1a06cc9ae6	Use lookUpRegForValue here instead of duplicating the code. llvm-svn: 153071	2012-03-20 01:07:47 +00:00
Pete Cooper	e69be6df4f	f16 FDIV can now be legalized by promoting to f32 llvm-svn: 153064	2012-03-19 23:38:12 +00:00
Duncan Sands	3fb2fc6edb	Fix DAG combine which creates illegal vector shuffles. Patch by Heikki Kultala. llvm-svn: 153035	2012-03-19 15:35:44 +00:00
NAKAMURA Takumi	a7e57ace28	Revert r152613 (and r152614), "Inline the d'tor and add an anchor instead." for workaround of g++-4.4's miscompilation. It caused MSP430DAGToDAGISel::SelectIndexedBinOp() to be miscompiled. When two ReplaceUses()'s are expanded as inline, vtable in base class is stored to latter (ISelUpdater)ISU. llvm-svn: 152877	2012-03-16 00:01:55 +00:00
Eric Christopher	3390a6e5e3	We actually handle AllocaInst via getRegForValue below just fine. Part of rdar://8905263 llvm-svn: 152845	2012-03-15 21:33:47 +00:00
Eric Christopher	142820ba8d	Add some debugging output into fast isel as well. llvm-svn: 152844	2012-03-15 21:33:44 +00:00
Eric Christopher	be7a1016fc	Add another debug statement. llvm-svn: 152843	2012-03-15 21:33:41 +00:00
Nadav Rotem	6fd1d32c63	When optimizing certain BUILD_VECTOR nodes into other BUILD_VECTOR nodes, add the new node into the work list because there is a potential for further optimizations. llvm-svn: 152784	2012-03-15 08:49:06 +00:00
Bill Wendling	df170db2f6	Add a xform to the DAG combiner. Transform: (fsub x, (fadd x, y)) -> (fneg y) and (fsub x, (fadd y, x)) -> (fneg y) if 'unsafe math' is specified. <rdar://problem/7540295> llvm-svn: 152777	2012-03-15 05:12:00 +00:00
Bill Wendling	618d57310a	Insert the debugging instructions in one fell-swoop so that it doesn't call the expensive "getFirstTerminator" call. This reduces the time of compilation in PR12258 from >10 minutes to < 10 seconds. llvm-svn: 152704	2012-03-14 07:14:25 +00:00
Evan Cheng	d5f8e5766c	Fortify r152675 a bit. Although I'm not able to come up with a test case that would trigger the truncation case. llvm-svn: 152678	2012-03-13 22:16:11 +00:00
Evan Cheng	7bf83096df	DAG combine incorrectly optimize (i32 vextract (v4i16 load $addr), c) to (i16 load $addr+csizeof(i16)) and replace uses of (i32 vextract) with the i16 load. It should issue an extload instead: (i32 extload $addr+csizeof(i16)). rdar://11035895 llvm-svn: 152675	2012-03-13 22:00:52 +00:00
Bill Wendling	ac499ab244	Add a return type. llvm-svn: 152614	2012-03-13 05:52:28 +00:00
Bill Wendling	8adb10c8a9	Inline the d'tor and add an anchor instead. llvm-svn: 152613	2012-03-13 05:51:56 +00:00
Bill Wendling	508a3e5185	Refactor the SelectionDAG's 'dump' methods into their own .cpp file. No functionality change. llvm-svn: 152611	2012-03-13 05:47:27 +00:00
Stepan Dyatkovskiy	97b02fc1b3	llvm::SwitchInst Renamed methods caseBegin, caseEnd and caseDefault with case_begin, case_end, and case_default. Added some notes relative to case iterators. llvm-svn: 152532	2012-03-11 06:09:17 +00:00
Benjamin Kramer	e1e549d617	Give dagcombiner's worklist some inline capacity. llvm-svn: 152454	2012-03-10 00:23:58 +00:00
Craig Topper	5a4bcc749a	Use uint16_t to store instruction implicit uses and defs. Reduces static data. llvm-svn: 152301	2012-03-08 08:22:45 +00:00
Stepan Dyatkovskiy	5b648afb4d	Taken into account Duncan's comments for r149481 dated by 2nd Feb 2012: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20120130/136146.html Implemented CaseIterator and it solves almost all described issues: we don't need to mix operand/case/successor indexing anymore. Base iterator class is implemented as a template since it may be initialized either from "const SwitchInst" or from "SwitchInst". ConstCaseIt is just a read-only iterator. CaseIt is read-write iterator; it allows to change case successor and case value. Usage of iterator allows totally remove resolveXXXX methods. All indexing convertions done automatically inside the iterator's getters. Main way of iterator usage looks like this: SwitchInst SI = ... // intialize it somehow for (SwitchInst::CaseIt i = SI->caseBegin(), e = SI->caseEnd(); i != e; ++i) { BasicBlock BB = i.getCaseSuccessor(); ConstantInt *V = i.getCaseValue(); // Do something. } If you want to convert case number to TerminatorInst successor index, just use getSuccessorIndex iterator's method. If you want initialize iterator from TerminatorInst successor index, use CaseIt::fromSuccessorIndex(...) method. There are also related changes in llvm-clients: klee and clang. llvm-svn: 152297	2012-03-08 07:06:20 +00:00
Andrew Trick	52226d409b	misched preparation: rename core scheduler methods for consistency. We had half the API with one convention, half with another. Now was a good time to clean it up. llvm-svn: 152255	2012-03-07 23:00:49 +00:00
Andrew Trick	60cf03e772	misched preparation: clarify ScheduleDAG and ScheduleDAGInstrs roles. ScheduleDAG is responsible for the DAG: SUnits and SDeps. It provides target hooks for latency computation. ScheduleDAGInstrs extends ScheduleDAG and defines the current scheduling region in terms of MachineInstr iterators. It has access to the target's scheduling itinerary data. ScheduleDAGInstrs provides the logic for building the ScheduleDAG for the sequence of MachineInstrs in the current region. Target's can implement highly custom schedulers by extending this class. ScheduleDAGPostRATDList provides the driver and diagnostics for current postRA scheduling. It maintains a current Sequence of scheduled machine instructions and logic for splicing them into the block. During scheduling, it uses the ScheduleHazardRecognizer provided by the target. Specific changes: - Removed driver code from ScheduleDAG. clearDAG is the only interface needed. - Added enterRegion/exitRegion hooks to ScheduleDAGInstrs to delimit the scope of each scheduling region and associated DAG. They should be used to setup and cleanup any region-specific state in addition to the DAG itself. This is necessary because we reuse the same ScheduleDAG object for the entire function. The target may extend these hooks to do things at regions boundaries, like bundle terminators. The hooks are called even if we decide not to schedule the region. So all instructions in a block are "covered" by these calls. - Added ScheduleDAGInstrs::begin()/end() public API. - Moved Sequence into the driver layer, which is specific to the scheduling algorithm. llvm-svn: 152208	2012-03-07 05:21:52 +00:00
Andrew Trick	e932bb77b5	misched preparation: modularize schedule emission. ScheduleDAG has nothing to do with how the instructions are scheduled. llvm-svn: 152206	2012-03-07 05:21:44 +00:00
Andrew Trick	edee68ce1b	misched preparation: modularize schedule printing. ScheduleDAG will not refer to the scheduled instruction sequence. llvm-svn: 152205	2012-03-07 05:21:40 +00:00
Andrew Trick	46a58664f7	misched preparation: modularize schedule verification. ScheduleDAG will not refer to the scheduled instruction sequence. llvm-svn: 152204	2012-03-07 05:21:36 +00:00
Andrew Trick	7c6c41a56a	whitespace llvm-svn: 152203	2012-03-07 05:21:32 +00:00
Andrew Trick	1b2324d0e8	Cleanup in preparation for misched: Move DAG visualization logic. Soon, ScheduleDAG will not refer to the BB. llvm-svn: 152177	2012-03-07 00:18:22 +00:00
Andrew Trick	5297d8df99	whitespace llvm-svn: 152175	2012-03-07 00:18:15 +00:00
Andrew Trick	0c84efe8dd	Cleanup: DAG building is specific to either SD or MI scheduling. Not part of the target interface. llvm-svn: 152174	2012-03-07 00:18:12 +00:00
Evan Cheng	80893ce5f5	Extend r148086 to check for [r +/- reg] address mode. This fixes queens performance regression (due to increased register pressure from overly aggressive pre-inc formation). llvm-svn: 152162	2012-03-06 23:33:32 +00:00
Owen Anderson	2ee7c4dfc5	Make it possible for a target to mark FSUB as Expand. This requires providing a default expansion (FADD+FNEG), and teaching DAGCombine not to form FSUBs post-legalize if they are not legal. llvm-svn: 152079	2012-03-06 00:29:31 +00:00
Bill Wendling	7cf6db7e3c	Fix warnings about adding a bool to a string. Patch by Sean Silva! llvm-svn: 152042	2012-03-05 19:29:36 +00:00
Craig Topper	1d32658877	Use uint16_t to store register overlaps to reduce static data. llvm-svn: 152001	2012-03-04 10:43:23 +00:00
James Molloy	f6298e9281	Fix a codegen fault in which log2 or exp2 could be dead-code eliminated even though they could have sideeffects. Only allow log2/exp2 to be converted to an intrinsic if they are declared "readnone". llvm-svn: 151807	2012-03-01 14:32:18 +00:00
Benjamin Kramer	d05a0c6c42	LegalizeIntegerTypes: Reorder operations in the "big shift by small amount" optimization, making the lives of later passes easier. llvm-svn: 151722	2012-02-29 13:27:00 +00:00
Evan Cheng	65f9d19c4f	Re-commit r151623 with fix. Only issue special no-return calls if it's a direct call. llvm-svn: 151645	2012-02-28 18:51:51 +00:00
Benjamin Kramer	f2e160c665	Fix off-by one in comment. llvm-svn: 151644	2012-02-28 18:37:06 +00:00
Benjamin Kramer	0c281a7deb	LegalizeIntegerTypes: Reenable the large shift with small amount optimization. To avoid problems with zero shifts when getting the bits that move between words we use a trick: first shift the by amount-1, then do another shift by one. When amount is 0 (and size 32) we first shift by 31, then by one, instead of by 32. Also fix a latent bug that emitted the low and high words in the wrong order when shifting right. Fixes PR12113. llvm-svn: 151637	2012-02-28 17:58:00 +00:00
Daniel Dunbar	ee7b899343	Revert r151623 "Some ARM implementaions, e.g. A-series, does return stack prediction. ...", it is breaking the Clang build during the Compiler-RT part. llvm-svn: 151630	2012-02-28 15:36:07 +00:00
Nadav Rotem	1d666099be	Code cleanup following CR by Duncan. llvm-svn: 151627	2012-02-28 14:13:19 +00:00
Nadav Rotem	875e463b19	Fix a bug in the code that builds SDNodes from vector GEPs. When the GEP index is a vector of pointers, the code that calculated the size of the element started from the vector type, and not the contained pointer type. As a result, instead of looking at the data element pointed by the vector, this code used the size of the vector. This works for 32bit members (on 32bit systems), but not for other types. Added code to peel the vector type and added a test. llvm-svn: 151626	2012-02-28 11:54:05 +00:00
Evan Cheng	87c7b09d8d	Some ARM implementaions, e.g. A-series, does return stack prediction. That is, the processor keeps a return addresses stack (RAS) which stores the address and the instruction execution state of the instruction after a function-call type branch instruction. Calling a "noreturn" function with normal call instructions (e.g. bl) can corrupt RAS and causes 100% return misprediction so LLVM should use a unconditional branch instead. i.e. mov lr, pc b _foo The "mov lr, pc" is issued in order to get proper backtrace. rdar://8979299 llvm-svn: 151623	2012-02-28 06:42:03 +00:00
Hal Finkel	b9a3d61894	Don't crash when a glue node contains an internal CopyToReg This is necessary to support the existing ppc lowering code for indirect calls. Fixes PR12071. llvm-svn: 151373	2012-02-24 17:53:59 +00:00
Benjamin Kramer	6fe3e3d335	SDAGBuilder: Remove register sets that were never read and prune dead code surrounding it. llvm-svn: 151364	2012-02-24 14:01:17 +00:00
Pete Cooper	682c76b7d4	Turn avx insert intrinsic calls into INSERT_SUBVECTOR DAG nodes and remove duplicate patterns for selecting the intrinsics llvm-svn: 151342	2012-02-24 03:51:49 +00:00
Eric Christopher	da97054114	If the Address of a variable is an argument then treat the entire variable declaration as an argument because we want that address anyhow for our debug information. This seems to fix rdar://9965111, at least we have more debug information than before and from reading the assembly it appears to be the correct location. llvm-svn: 151335	2012-02-24 01:59:08 +00:00
Eric Christopher	219d51d649	Tabs, formatting and long lines oh my! llvm-svn: 151334	2012-02-24 01:59:01 +00:00
Bill Wendling	38b31619f6	Allow an integer to be converted into an MMX type when it's used in an inline asm. <rdar://problem/10106006> llvm-svn: 151303	2012-02-23 23:25:25 +00:00
Eric Christopher	18c6be7132	More newline cleanups. llvm-svn: 151235	2012-02-23 03:39:43 +00:00
Eric Christopher	5c45205b79	Add some handy-dandy newlines. llvm-svn: 151234	2012-02-23 03:39:39 +00:00
Michael J. Spencer	8b98bf2d6b	Properly emit _fltused with FastISel. Refactor to share code with SDAG. Patch by Joe Groff! llvm-svn: 151183	2012-02-22 19:06:13 +00:00
Craig Topper	760b134ffa	Make all pointers to TargetRegisterClass const since they are all pointers to static data that should not be modified. llvm-svn: 151134	2012-02-22 05:59:10 +00:00
James Molloy	862fe49c55	Teach the DAGCombiner that certain loadext nodes followed by ANDs can be converted to zeroexts. llvm-svn: 150957	2012-02-20 12:02:38 +00:00
Eric Christopher	81e2bf2b77	Ignore the lifetime intrinsics in fast-isel. llvm-svn: 150848	2012-02-17 23:03:39 +00:00
James Molloy	920ae8c642	Remove extraneous #include and spelling mistake introduced in r150669. llvm-svn: 150670	2012-02-16 09:48:07 +00:00
James Molloy	67b6b11b52	Modify the algorithm when traversing the DAGCombiner's worklist to be O(log N) for all operations. This fixes a horrible worst case with lots of nodes where 99% of the time was being spent in std::remove. llvm-svn: 150669	2012-02-16 09:17:04 +00:00
Pete Cooper	4dd0963d56	Added hook to let targets custom lower splitting of illegal vectors llvm-svn: 150550	2012-02-15 00:55:31 +00:00
Nadav Rotem	29984ba033	Fix PR12000. Some vector operations may use scalar operands with types that are greater than the vector element type. For example BUILD_VECTOR of type <1 x i1> with a constant i8 operand. This patch fixes the assertion. llvm-svn: 150477	2012-02-14 13:06:32 +00:00
Lang Hames	29d6ed6416	Rename getExceptionAddressRegister() to getExceptionPointerRegister() for consistency with setExceptionPointerRegister(...). llvm-svn: 150460	2012-02-14 04:45:49 +00:00
Bill Wendling	05d6f2ff1e	Don't reserve the R0 and R1 registers here. We don't use these registers, and marking them as "live-in" into a BB ruins some invariants that the back-end tries to maintain. llvm-svn: 150437	2012-02-13 23:47:16 +00:00

1 2 3 4 5 ...

5558 Commits