llvm-project

Commit Graph

Author	SHA1	Message	Date
Diego Novillo	0354a9f67b	SamplePGO - Fix PR 25482 - Do not rely on llvm.dbg.cu for discriminators The discriminators pass relied on the presence of llvm.dbg.cu to decide whether to add discriminators, but this fails in the case where debug info is only enabled partially when -fprofile-sample-use is active. The reason llvm.dbg.cu is not present in these cases is to prevent codegen from emitting debug info (as it is only used for the sample profile pass). This changes the discriminators pass to also emit discriminators even when debug info is not being emitted. llvm-svn: 252763	2015-11-11 17:54:37 +00:00
Hemant Kulkarni	c6638c7561	[Symbolizer]: Add -pretty-print option Differential Revision: http://reviews.llvm.org/D13671 llvm-svn: 252760	2015-11-11 17:47:54 +00:00
Sanjay Patel	f740129198	[MIPS] add overrides for isCheapToSpeculateCttz() and isCheapToSpeculateCtlz() MIPS32 has instructions for efficient count-leading/trailing-zeros, so this should be considered a cheap operation (and therefore fair game for speculation) for any MIPS32 implementation. The net result of allowing this speculation for the regression tests in this patch is that we get this code: ctlz: jr $ra clz $2, $4 cttz: addiu $1, $4, -1 not $2, $4 and $1, $2, $1 clz $1, $1 addiu $2, $zero, 32 jr $ra subu $2, $2, $1 Instead of: ctlz: beqz $4, $BB0_2 addiu $2, $zero, 32 clz $2, $4 $BB0_2: jr $ra nop cttz: beqz $4, $BB1_2 addiu $2, $zero, 32 addiu $1, $4, -1 not $2, $4 and $1, $2, $1 clz $1, $1 addiu $2, $zero, 32 subu $2, $2, $1 $BB1_2: jr $ra nop See D14469 for the larger motivation. Differential Revision: http://reviews.llvm.org/D14500 llvm-svn: 252755	2015-11-11 17:24:56 +00:00
Diego Novillo	0767ae5896	Properly fix unused variable in disable-assert builds. I missed the side-effects of ParseBFI in my previous attempt (r252748). Thanks dblaikie for the suggestion of adding a void use of the unused variable instead. llvm-svn: 252751	2015-11-11 16:39:22 +00:00
Diego Novillo	29f88a2460	Remove unused variable in disable-assert builds. NFC. llvm-svn: 252748	2015-11-11 16:14:52 +00:00
Douglas Katzman	a14039764b	Visibly fail if attempting to encode register AH,BH,CH,DH in a REX-prefixed instruction. Differential Revision: http://reviews.llvm.org/D13316 Fixes PR25003 llvm-svn: 252743	2015-11-11 15:51:16 +00:00
James Molloy	ce12c92f66	[ARM] Combine BFIs together If we have a chain of BFIs, we may be able to combine several together into one merged BFI. We can do this if the "from" bits from one BFI OR'd with the "from" bits from the other BFI form a contiguous range, and the same with the "to" bits. llvm-svn: 252740	2015-11-11 15:40:40 +00:00
Charlie Turner	d82c9389e7	[SLP] Enable -slp-vectorize-hor by default. Measurements primarily on AArch64 have shown this feature does not significantly effect compile-time. The are no significant perf changes in LNT, but for AArch64 at least, there are wins in third party benchmarks. As discussed on llvm-dev, we're going to try turning this on by default and see how other targets react to the change. llvm-svn: 252733	2015-11-11 15:03:46 +00:00
Aaron Ballman	470b5f1a79	Silencing a signed vs unsigned type mismatch warning. llvm-svn: 252732	2015-11-11 14:57:28 +00:00
Aaron Ballman	107bb0d193	Silencing nine warnings for "enumeral and non-enumeral type in conditional expression"; NFC. llvm-svn: 252728	2015-11-11 13:44:06 +00:00
Michael Kuperstein	12982a816c	[X86] Replace LEAs with INC/DEC when profitable If possible and profitable, replace lea %reg, 1(%reg) and lea %reg, -1(%reg) with inc %reg and dec %reg respectively. Patch by: anton.nadolsky@intel.com Differential Revision: http://reviews.llvm.org/D14059 llvm-svn: 252722	2015-11-11 11:44:31 +00:00
Yury Gribov	d7731988ef	[ASan] Enable optional ASan recovery. Differential Revision: http://reviews.llvm.org/D14242 llvm-svn: 252719	2015-11-11 10:36:49 +00:00
Craig Topper	b24a58e28f	[X86] Fix feature flags on some MMX register instructions that really were introduced with SSE or SSE2. llvm-svn: 252709	2015-11-11 07:29:25 +00:00
Craig Topper	700a1a23d7	[X86] Remove redundant MMX isel patterns. llvm-svn: 252708	2015-11-11 07:29:22 +00:00
Dan Gohman	754cd11d90	[WebAssembly] Support non-legal argument and return types. llvm-svn: 252687	2015-11-11 01:33:02 +00:00
Ahmed Bougacha	4a85643907	[MC] Use LShr for constant evaluation of ">>" on non-arm64 darwin. Follow-up to r235963: this matches other assemblers and is less unexpected (e.g. PR23227). llvm-svn: 252681	2015-11-11 00:51:36 +00:00
Matthias Braun	2c98d0f477	MachineInstr: addRegisterDefReadUndef() => setRegisterDefReadUndef() This way we can not only add but also remove read undef flags. llvm-svn: 252678	2015-11-11 00:41:58 +00:00
Matt Arsenault	8246d4aead	AMDGPU: Print more fields in comments llvm-svn: 252677	2015-11-11 00:27:46 +00:00
Sanjoy Das	dc26df4abe	[ValueTracking] Remove untested / unreachable code, NFC Right now isTruePredicate is only ever called with Pred == ICMP_SLE or ICMP_ULE, and the ICMP_SLT and ICMP_ULT cases are dead. This change removes the untested dead code so that the function is not misleading. llvm-svn: 252676	2015-11-11 00:16:41 +00:00
Matt Arsenault	61cb6fa848	AMDGPU: Remove dead code llvm-svn: 252675	2015-11-11 00:01:36 +00:00
Matt Arsenault	6690d7de39	AMDGPU: Set isAllocatable = 0 on VS_32/VS_64 llvm-svn: 252674	2015-11-11 00:01:32 +00:00
Sanjoy Das	925681053d	[ValueTracking] Teach isImpliedCondition a new bitwise trick Summary: This change teaches isImpliedCondition to prove things like (A \| 15) < L ==> (A \| 14) < L if the low 4 bits of A are known to be zero. Depends on D14391 Reviewers: majnemer, reames, hfinkel Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14392 llvm-svn: 252673	2015-11-10 23:56:20 +00:00
Sanjoy Das	af1400f84b	[ValueTracking] Use m_APInt instead of m_ConstantInt, NFC This change would add functionality if isImpliedCondition worked on vector types; but since it bail out on vector predicates this change is an NFC. llvm-svn: 252672	2015-11-10 23:56:15 +00:00
Matthias Braun	4353b30542	TableGen: Emit LaneMask for register classes without subregisters as ~0u This makes it slightly easier to handle classes with and without subregister uniformly. llvm-svn: 252671	2015-11-10 23:23:05 +00:00
Reid Kleckner	7f84a939ed	[WinEH] Insert the MBB for EH_RESTORE after the catchret Inserting it before the target block could be bad, we might already have a fallthrough edge to it. llvm-svn: 252670	2015-11-10 23:22:20 +00:00
Kostya Serebryany	b7e286bed7	[libFuzzer] add UninstrumentedTest.cpp (missing from a previous commit) llvm-svn: 252658	2015-11-10 22:02:56 +00:00
Dan Gohman	16d314d300	[WebAssembly] Remove special cases for things that are no longer special. NFC. llvm-svn: 252656	2015-11-10 21:48:21 +00:00
Bill Schmidt	3c44c6f189	Add PPCMIPeephole.cpp to CMakeLists.txt llvm-svn: 252654	2015-11-10 21:43:45 +00:00
Dan Gohman	b84ae9bb38	[WebAssembly] Support for floating point min and max. llvm-svn: 252653	2015-11-10 21:40:21 +00:00
Bill Schmidt	34af5e1c76	[PowerPC] Add an MI SSA peephole pass. This patch adds a pass for doing PowerPC peephole optimizations at the MI level while the code is still in SSA form. This allows for easy modifications to the instructions while depending on a subsequent pass of DCE. Both passes are very fast due to the characteristics of SSA. At this time, the only peepholes added are for cleaning up various redundancies involving the XXPERMDI instruction. However, I would expect this will be a useful place to add more peepholes for inefficiencies generated during instruction selection. The pass is placed after VSX swap optimization, as it is best to let that pass remove unnecessary swaps before performing any remaining clean-ups. The utility of these clean-ups are demonstrated by changes to four existing test cases, all of which now have tighter expected code generation. I've also added Eric Schweiz's bugpoint-reduced test from PR25157, for which we now generate tight code. One other test started failing for me, and I've fixed it (test/Transforms/PlaceSafepoints/finite-loops.ll) as well; this is not related to my changes, and I'm not sure why it works before and not after. The problem is that the CHECK-NOT: of "statepoint" from test1 fails because of the "statepoint" in test2, and so forth. Adding a CHECK-LABEL in between keeps the different occurrences of that string properly scoped. llvm-svn: 252651	2015-11-10 21:38:26 +00:00
Teresa Johnson	2d5fb8cac4	Ensure ModuleLinker materializes complete comdat groups Summary: The module linker lazy links some "discardable if unused" global values (e.g. linkonce), materializing and linking them only if they are referenced in the module. If a comdat group contains a linkonce member that is not referenced, however, it would not be materialized and linked, leading to an incomplete comdat group. If there are other object files not part of the same LTO link that also define and use that comdat group, the linker may select the incomplete group leading to link time unsats. To solve this, whenever a global value body is linked, make sure we materialize any other members of the same comdat group that are not yet materialized. This ensures they are in the lazy link list and get linked as well. Added new test and adjusted old test to remove parts that didn't make sense with fix. Reviewers: rafael Subscribers: dexonsmith, davidxl, llvm-commits Differential Revision: http://reviews.llvm.org/D14516 llvm-svn: 252647	2015-11-10 21:09:06 +00:00
Sanjoy Das	bd1c1bfbd2	[IR] Make {Call,Invoke}::cloneImpl aware of operand bundles This was an omission in the patch that landed initial support for operand bundles. So far we haven't hit this, but we will once the inliner is able to inline calls to functions that contain calls with operand bundles. llvm-svn: 252645	2015-11-10 20:13:21 +00:00
Sanjoy Das	b9ca6dcc6b	[OperandBundles] Identify operand bundles with both their names and IDs No code uses this functionality yet. This change just exposes information / structure that was already present. llvm-svn: 252644	2015-11-10 20:13:15 +00:00
Sanjay Patel	33ec5dbe35	less indent; NFCI llvm-svn: 252643	2015-11-10 20:09:02 +00:00
Sanjay Patel	af1b48bfdc	[ARM] add overrides for isCheapToSpeculateCttz() and isCheapToSpeculateCtlz() ARM V6T2 has instructions for efficient count-leading/trailing-zeros, so this should be considered a cheap operation (and therefore fair game for speculation) for any ARM V6T2 implementation. The net result of allowing this speculation for the regression tests in this patch is that we get this code: ctlz: clz r0, r0 bx lr cttz: rbit r0, r0 clz r0, r0 bx lr Instead of: ctlz: cmp r0, #0 moveq r0, #32 clzne r0, r0 bx lr cttz: cmp r0, #0 moveq r0, #32 rbitne r0, r0 clzne r0, r0 bx lr This will help solve a general speculation/despeculation problem noted in PR24818: https://llvm.org/bugs/show_bug.cgi?id=24818 Differential Revision: http://reviews.llvm.org/D14469 llvm-svn: 252639	2015-11-10 19:24:31 +00:00
Matt Arsenault	aa118e299c	LegalizeDAG: Implement promote for scalar_to_vector This allows avoiding the default Expand behavior which introduces stack usage. Bitcast the scalar and replace the missing elements with undef. This is covered by existing tests and used by a future commit which makes 64-bit vectors legal types on AMDGPU. llvm-svn: 252632	2015-11-10 18:48:11 +00:00
Matt Arsenault	a46aa641f2	LegalizeDAG: Implement promote for insert_vector_elt This is covered by existing tests and used by a future commit which makes 64-bit vectors legal types on AMDGPU. llvm-svn: 252631	2015-11-10 18:48:08 +00:00
Matt Arsenault	0b7958a59b	LegalizeDAG: Implement promote for extract_vector_elt This is for AMDGPU to implement v2i64 extract as extract of half of a v4i32. This is covered by existing tests and used by a future commit which makes 64-bit vectors legal types on AMDGPU. llvm-svn: 252630	2015-11-10 18:48:04 +00:00
Philip Reames	2d858747df	[ValueTracking] Recognize that and(x, add (x, -1)) clears the low bit This is a cleaned up version of a patch by John Regehr with permission. Originally found via the souper tool. If we add an odd number to x, then bitwise-and the result with x, we know that the low bit of the result must be zero. Either it was zero in x originally, or the add cleared it in the temporary value. As a result, one of the two values anded together must have the bit cleared. Differential Revision: http://reviews.llvm.org/D14315 llvm-svn: 252629	2015-11-10 18:46:14 +00:00
Teresa Johnson	dfbebc37da	[ThinLTO] Update comment per change in WeakAny handling (NFC) llvm-svn: 252627	2015-11-10 18:26:31 +00:00
Teresa Johnson	3cd8161c9b	[ThinLTO] WeakAny fixes/cleanup Ensure WeakAny variables are imported as ExternalWeak declarations. To handle WeakAny more consistently and fix this issue: 1) Update helper doImportAsDefinition to properly flag WeakAny variables and aliases as not importing defintions. Update callers of doImportAsDefinition to remove now redundant checks for WeakAny aliases, or ignore aliases, as appropriate. 2) Add any !doImportAsDefinition GVs to DoNotLinkFromSource set during linking of the GV prototype, where we usually add GVs to the DoNotLinkFromSource set for other reasons. Remove now unnecessary adding of WeakAny aliases to DoNotLinkFromSource set from copyGlobalAliasProto. Remove now unnecessary guard against linking non-imported function bodies from ModuleLinker::run. llvm-svn: 252626	2015-11-10 18:20:11 +00:00
Sanjay Patel	241c31fb64	[AArch64] add overrides for isCheapToSpeculateCttz() and isCheapToSpeculateCtlz() AArch64 has instructions for efficient count-leading/trailing-zeros, so this should be considered a cheap operation (and therefore fair game for speculation) for any AArch64 implementation. The net result of allowing this speculation for the regression tests in this patch is that we get this code: ctlz: clz w0, w0 ret cttz: rbit w8, w0 clz w0, w8 ret Instead of: ctlz: cbz w0, .LBB0_2 clz w0, w0 ret .LBB0_2: orr w0, wzr, #0x20 ret cttz: cbz w0, .LBB1_2 rbit w8, w0 clz w0, w8 ret .LBB1_2: orr w0, wzr, #0x20 ret See D14469 for the larger motivation. Differential Revision: http://reviews.llvm.org/D14505 llvm-svn: 252625	2015-11-10 18:11:37 +00:00
Renato Golin	0e77d72b0a	Revert "Strip metadata when speculatively hoisting instructions" This reverts commit r252604, as it broke all ARM and AArch64 buildbots, as well as some x86, et al. llvm-svn: 252623	2015-11-10 18:01:16 +00:00
Michael Kuperstein	a01a5ee72f	[X86] Do not try to custom-lower sitofp/fptosi in soft-float mode Differential Revision: http://reviews.llvm.org/D14495 llvm-svn: 252621	2015-11-10 17:37:49 +00:00
Xinliang David Li	6021b75a1f	Fix asan warning (NFC) llvm-svn: 252617	2015-11-10 17:11:33 +00:00
Sanjay Patel	766589efdc	add 'MustReduceDepth' as an objective/cost-metric for the MachineCombiner This is one of the problems noted in PR25016: https://llvm.org/bugs/show_bug.cgi?id=25016 and: http://lists.llvm.org/pipermail/llvm-dev/2015-October/090998.html The spilling problem is independent and not addressed by this patch. The MachineCombiner was doing reassociations that don't improve or even worsen the critical path. This is caused by inclusion of the "slack" factor when calculating the critical path of the original code sequence. If we don't add that, then we have a more conservative cost comparison of the old code sequence vs. a new sequence. The more liberal calculation must be preserved, however, for the AArch64 MULADD patterns because benchmark regressions were observed without that. The two failing test cases now have identical asm that does what we want: a + b + c + d ---> (a + b) + (c + d) Differential Revision: http://reviews.llvm.org/D13417 llvm-svn: 252616	2015-11-10 16:48:53 +00:00
James Molloy	9d55f19cfa	Reapply "[ARM] Combine CMOV into BFI where possible" Added fixes for stage2 failures: CMOV is not commutable; commuting the operands results in the condition being flipped! d'oh! Original commit message: If we have a CMOV, OR and AND combination such as: if (x & CN) y \|= CM; And: * CN is a single bit; * All bits covered by CM are known zero in y; Then we can convert this to a sequence of BFI instructions. This will always be a win if CM is a single bit, will always be no worse than the TST & OR sequence if CM is two bits, and for thumb will be no worse if CM is three bits (due to the extra IT instruction). llvm-svn: 252606	2015-11-10 14:22:05 +00:00
Igor Laevsky	01c3692a10	Strip metadata when speculatively hoisting instructions This is fix for PR24059. When we are hoisting instruction above some condition it may turn out that metadata on this instruction was control dependant on the condition. This metadata becomes invalid and we need to drop it. This patch should cover most obvious places of speculative execution (which I have found by greping isSafeToSpeculativelyExecute). I think there are more cases but at least this change covers the severe ones. Differential Revision: http://reviews.llvm.org/D14398 llvm-svn: 252604	2015-11-10 14:10:31 +00:00
Tilmann Scheller	990a8d88c8	[PowerPC] Remove redundant code. The local variable Hi is never being read. Issue identified by the Clang static analyzer. llvm-svn: 252600	2015-11-10 12:29:37 +00:00
Oliver Stannard	d414c99b9c	[AArch64] Fix halfword load merging for big-endian targets For big-endian targets, when we merge two halfword loads into a word load, the order of the halfwords in the loaded value is reversed compared to little-endian, so the load-store optimiser needs to swap the destination registers. This does not affect merging of two word loads, as we use ldp, which treats the memory as two separate 32-bit words. llvm-svn: 252597	2015-11-10 11:04:18 +00:00
Hans Wennborg	21ce8ecb09	Inliner: Do zero-cost inlines even if above a negative threshold (PR24851) Differential Revision: http://reviews.llvm.org/D14499 llvm-svn: 252595	2015-11-10 09:47:48 +00:00
Igor Breger	b6b27af46a	AVX512 : Implemented encoding and DAG lowering for VMOVHPS/PD and VMOVLPS/PD instructions. Differential Revision: http://reviews.llvm.org/D14492 llvm-svn: 252592	2015-11-10 07:09:07 +00:00
David Blaikie	578a31fe0a	Remove another variable unused in -Asserts build llvm-svn: 252582	2015-11-10 04:10:04 +00:00
David Blaikie	e35168f008	Remove some unused variables to clean up the -Werror build llvm-svn: 252580	2015-11-10 03:16:28 +00:00
Colin LeMahieu	3c7ecf9af1	[Hexagon] Adding instruction aliases and tests. llvm-svn: 252579	2015-11-10 01:58:26 +00:00
Andy Ayers	809cbe9ea0	Support for emitting inline stack probes For CoreCLR on Windows, stack probes must be emitted as inline sequences that probe successive stack pages between the current stack limit and the desired new stack pointer location. This implements support for the inline expansion on x64. For in-body alloca probes, expansion is done during instruction lowering. For prolog probes, a stub call is initially emitted during prolog creation, and expanded after epilog generation, to avoid complications that arise when introducing new machine basic blocks during prolog and epilog creation. Added a new test case, modified an existing one to exclude non-x64 coreclr (for now). Add test case Fix tests llvm-svn: 252578	2015-11-10 01:50:49 +00:00
Colin LeMahieu	13cc3ab785	[Hexagon] Fixing compound register printing and reenabling more tests. llvm-svn: 252574	2015-11-10 00:51:56 +00:00
Tim Northover	339c83e27f	AArch64: add experimental support for address tagging. AArch64 has the ability to use the top 8-bits of an "address" for extra information, with the memory subsystem automatically masking them off for loads and stores. When that's happening, we can sometimes skip masks on memory operations in the compiler. However, this requires the host OS and support stack to preserve those bits so it can't be enabled everywhere. In principle iOS 8.0 and above do take the required precautions and but we'll put it under a flag for now. llvm-svn: 252573	2015-11-10 00:44:23 +00:00
Kevin Enderby	dc0dbe1f69	Fix llvm-nm(1) printing of llvm-bitcode files for -format darwin to match darwin’s nm(1). Also a small fix to match printing of Mach-O objects with -format posix. llvm-svn: 252567	2015-11-10 00:31:08 +00:00
Derek Schuff	ffa143ce81	[WebAssembly] Support 'unreachable' expression Lower LLVM's 'unreachable' terminator to ISD::TRAP, and lower ISD::TRAP to wasm's 'unreachable' expression. WebAssembly type-checks expressions, but a noreturn function with a return type that doesn't match the context will cause a check failure. So we lower LLVM 'unreachable' to ISD::TRAP and then lower that to WebAssembly's 'unreachable' expression, which typechecks in any context and causes a trap if executed. Differential Revision: http://reviews.llvm.org/D14515 llvm-svn: 252566	2015-11-10 00:30:57 +00:00
Matt Arsenault	6d87f28afd	Remove unnecessary call to getAllocatableRegClass I'm not sure what the point of this was. I'm not sure why you would ever define an instruction that produces an unallocatable register class. No tests fail with this removed, and it seems like it should be a verifier error to define such an instruction. This was problematic for AMDGPU because it would make bad decisions by arbitrarily changing the register class when unsetting isAllocatable for VS_32/VS_64, which is currently set as a workaround to this problem. AMDGPU uses the VS_32/VS_64 register classes to represent operands which can use either VGPRs or SGPRs. When isAllocatable is unset for these, this would need to pick either the SGPR or VGPR class and insert either a copy we don't want, or an illegal copy we would need to deal with later. A semi-arbitrary register class ordering decision is made in tablegen, which resulted in always picking a VGPR class because it happens to have more registers than the SGPR register class. We really just want to use whatever register class the original register had. llvm-svn: 252565	2015-11-10 00:30:14 +00:00
Xinliang David Li	ee4158957b	[PGO] Make indexed value profile data more compact - Make indexed value profile data more compact by peeling out the per-site value count field into its own smaller sized array. - Introduced formal data structure definitions to specify value profile data layout in indexed format. Previously the layout of the data is only assumed in the client code (scattered in three different places : size computation, EmitData, and ReadData - The new data structure serves as a central place for layout documentation. - Add interfaces to force BE output for value profile data (testing purpose) - Add byte swap unit tests Differential Revision: http://reviews.llvm.org/D14401 llvm-svn: 252563	2015-11-10 00:24:45 +00:00
Colin LeMahieu	b7a5f9fc29	[Hexagon] Fixing store instructions and reenabling a few more tests. llvm-svn: 252561	2015-11-10 00:22:00 +00:00
Akira Hatanaka	3bfc3e2d2a	[ARM] Handle t2ADDri in ARMAsmPrinter::EmitUnwindingInstruction. This fixes a bug in ARMAsmPrinter::EmitUnwindingInstruction where llvm_unreachable was reached because t2ADDri wasn't handled. Test case provided by Tim Northover. rdar://problem/23270609 http://reviews.llvm.org/D14518 llvm-svn: 252557	2015-11-10 00:10:41 +00:00
Colin LeMahieu	8ab7e8e1b5	[Hexagon] Fixing load instruction parsing and reenabling tests. llvm-svn: 252555	2015-11-10 00:02:27 +00:00
Matthias Braun	7e624d5f11	MachineVerifier: Streamline live interval related error reporting Simply perform additional report_context() calls after a report() instead of adding more and more overloaded variations of report(). Also improve several instances where information was output in an ad-hoc way probably because no matching report() overload was available. llvm-svn: 252552	2015-11-09 23:59:33 +00:00
Matthias Braun	716b43306b	MachineVerifier: Add missing linebreak MachineInstr::print() with SkipOppers==true does not produce a linebreak, so we have to do that in MachineVerifier::report(). llvm-svn: 252551	2015-11-09 23:59:29 +00:00
Matthias Braun	45718db0a1	MachineVerifier: MI::print has no TargetMachine overload The code was passing a target machine pointer which degraded to a true operand to SkipOppers. llvm-svn: 252550	2015-11-09 23:59:25 +00:00
Matthias Braun	42b4b63056	MachineVerifier: print list of live intervals if available llvm-svn: 252549	2015-11-09 23:59:23 +00:00
Reid Kleckner	420f0542cc	[WinEH] Remove isBarrier from instructions that do not return Fixes machine verification failures with David's latest EH change. llvm-svn: 252541	2015-11-09 23:34:42 +00:00
Sanjay Patel	533c10c651	add a SelectionDAG method to check if no common bits are set in two nodes; NFCI This was suggested in: http://reviews.llvm.org/D13956 and is a follow-on to: http://reviews.llvm.org/rL252515 http://reviews.llvm.org/rL252519 This lets us remove logically equivalent/duplicated code from DAGCombiner and X86ISelDAGToDAG. A corresponding function for IR instructions already exists in ValueTracking. llvm-svn: 252539	2015-11-09 23:31:38 +00:00
Davide Italiano	bfd3082e85	[TargetLibraryInfo] Add support for fls, flsl, flsll. This is a prerequisite for further optimisations of these functions, which will be commited as a separate patch. Differential Revision: http://reviews.llvm.org/D14219 llvm-svn: 252535	2015-11-09 23:23:20 +00:00
Kostya Serebryany	5eab74e9bc	[libFuzzer] make libFuzzer link if there is no sanitizer coverage instrumentation (it will fail at start-up time) llvm-svn: 252533	2015-11-09 23:17:45 +00:00
Reid Kleckner	40aa9c6d00	Combine ifdefs around dl_iterate_phdr in Unix/Signals.inc This avoids the need to have two dummy implementations of findModulesAndOffsets. llvm-svn: 252531	2015-11-09 23:10:29 +00:00
David Majnemer	2652b75700	[WinEH] Don't emit CATCHRET from visitCatchPad Instead, emit a CATCHPAD node which will get selected to a target specific sequence. llvm-svn: 252528	2015-11-09 23:07:48 +00:00
Sanjay Patel	32538d6811	[x86] try harder to match bitwise 'or' into an LEA The motivation for this patch starts with the epic fail example in PR18007: https://llvm.org/bugs/show_bug.cgi?id=18007 ...unfortunately, this patch makes no difference for that case, but it solves some simpler cases. We'll get there some day. :) The current 'or' matching code was using computeKnownBits() via isBaseWithConstantOffset() -> MaskedValueIsZero(), but that's an unnecessarily limited use. We can do more by copying the logic in ValueTracking's haveNoCommonBitsSet(), so we can treat the 'or' as if it was an 'add'. There's a TODO comment here because we should lift the bit-checking logic into a helper function, so it's not duplicated in DAGCombiner. An example of the better LEA matching: leal (%rdi,%rdi), %eax andl $1, %esi orl %esi, %eax Becomes: andl $1, %esi leal (%rsi,%rdi,2), %eax Differential Revision: http://reviews.llvm.org/D13956 llvm-svn: 252515	2015-11-09 21:16:49 +00:00
Colin LeMahieu	9d851f0435	[Hexagon] Separating statement to match what clang-format would do. llvm-svn: 252513	2015-11-09 21:06:28 +00:00
Reid Kleckner	64b003f05d	[WinEH] Tweak funclet prologue/epilogue insertion to pass verifier For some reason we'd never run MachineVerifier on WinEH code, and you explicitly have to ask for it with llc. I added it to a few test cases to get some coverage. Fixes PR25461. llvm-svn: 252512	2015-11-09 21:04:00 +00:00
Andrew Kaylor	fdd48fa1e1	[WinEH] Re-committing r252249 (Clone funclets with multiple parents) with additional fixes for determinism problems Differential Revision: http://reviews.llvm.org/D14454 llvm-svn: 252508	2015-11-09 19:59:02 +00:00
Reid Kleckner	390191dacc	[Hexagon] Fix -Wmicrosoft-enum-value warning with explicit enum type llvm-svn: 252505	2015-11-09 19:44:38 +00:00
Sanjay Patel	776e59b0fe	don't repeat function names in comments; NFC llvm-svn: 252502	2015-11-09 19:18:26 +00:00
Mike Aizatsky	662b4fd325	Moving FileManager::removeDotPaths to llvm::sys::path::remove_dots Differential Revision: http://reviews.llvm.org/D14393 llvm-svn: 252499	2015-11-09 18:56:31 +00:00
Adhemerval Zanella	35891fe6aa	[sanitizer] Use same shadow offset for ASAN on aarch64 This patch makes ASAN for aarch64 use the same shadow offset for all currently supported VMAs (39 and 42 bits). The shadow offset is the same for 39-bit (36). Similar to ppc64 port, aarch64 transformation also requires to use an add instead of 'or' for 42-bit VMA. llvm-svn: 252495	2015-11-09 18:03:48 +00:00
Dehao Chen	3656e3064b	Add discriminators for call instructions that are from the same line and same basic block. Summary: Call instructions that are from the same line and same basic block needs to have separate discriminators to distinguish between different callsites. Reviewers: davidxl, dnovillo, dblaikie Subscribers: dblaikie, probinson, llvm-commits Differential Revision: http://reviews.llvm.org/D14464 llvm-svn: 252492	2015-11-09 17:30:38 +00:00
Chad Rosier	19dc92dc8d	Simplify. NFC. llvm-svn: 252491	2015-11-09 16:56:06 +00:00
Oliver Stannard	c1103398f2	GlobalOpt should maintain externally_initialized when splitting aggregates When GlobalOpt splits an internal, global variable with an aggregate type, it should propagate the externally_initialized flag to the newly created globals. This makes the pass safe for our downstream use of this flag, while still allowing some useful optimisations (such as removing dead parts of the split aggregate) to be performed. Differential Revision: http://reviews.llvm.org/D13382 llvm-svn: 252490	2015-11-09 16:47:16 +00:00
James Molloy	45f67d52d0	[LoopVectorize] Address post-commit feedback on r250032 Implemented as many of Michael's suggestions as were possible: * clang-format the added code while it is still fresh. * tried to change Value* to Instruction* in many places in computeMinimumValueSizes - unfortunately there are several places where Constants need to be handled so this wasn't possible. * Reduce the pass list on loop-vectorization-factors.ll. * Fix a bug where we were querying MinBWs for I->getOperand(0) but using MinBWs[I]. llvm-svn: 252469	2015-11-09 14:32:05 +00:00
Silviu Baranga	2910a4f6b1	Allow LLE/LD and the loop versioning infrastructure to use SCEV predicates Summary: LAA currently generates a set of SCEV predicates that must be checked by users. In the case of Loop Distribute/Loop Load Elimination, no such predicates could have been emitted, since we don't allow stride versioning. However, in the future there could be SCEV predicates that will need to be checked. This change adds support for SCEV predicate versioning in the Loop Distribute, Loop Load Eliminate and the loop versioning infrastructure. Reviewers: anemet Subscribers: mssimpso, sanjoy, llvm-commits Differential Revision: http://reviews.llvm.org/D14240 llvm-svn: 252467	2015-11-09 13:26:09 +00:00
Charlie Turner	90dafb1b6d	[AArch64] Add UABDL patterns for log2 shuffle. Summary: This matches the sum-of-absdiff patterns emitted by the vectoriser using log2 shuffles. Relies on D14207 to be able to match the `extract_subvector(..., 0)` Reviewers: t.p.northover, jmolloy Subscribers: aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D14208 llvm-svn: 252465	2015-11-09 13:10:52 +00:00
Charlie Turner	7b7b06f737	[AArch64] Handle extract_subvector(..., 0) in ISel. Summary: Lowering this pattern early to an `EXTRACT_SUBREG` was making it impossible to match larger patterns in tblgen that use `extract_subvector(..., 0)` as part of the their input pattern. It seems like there will exist somewhere a better way of specifying this pattern over all relevant register value types, but I didn't manage to find it. Reviewers: t.p.northover, jmolloy Subscribers: aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D14207 llvm-svn: 252464	2015-11-09 12:45:11 +00:00
Renato Golin	6d435f12f0	[EABI] Add LLVM support for -meabi flag "GCC requires the freestanding environment provide memcpy, memmove, memset and memcmp": https://gcc.gnu.org/onlinedocs/gcc-5.2.0/gcc/Standards.html Hence in GNUEABI targets LLVM should not convert 'memops' to their equivalent '__aeabi_memops'. This convertion violates GCC contract. The -meabi flag controls whether or not LLVM will modify 'memops' in GNUEABI targets. Without -meabi: use the triple default EABI. With -meabi=default: use the triple default EABI. With -meabi=gnu: use 'memops'. With -meabi=4 or -meabi=5: use '__aeabi_memops'. With -meabi set to an unknown value: same as -meabi=default. Patch by Vinicius Tinti. llvm-svn: 252462	2015-11-09 12:40:30 +00:00
Renato Golin	1d8a2c952f	Revert "[ARM] Combine CMOV into BFI where possible" This reverts commit r252057, as it broke ARM self-hosting buildbots, probably due to a code-gen fault. llvm-svn: 252460	2015-11-09 12:19:10 +00:00
Oliver Stannard	563585789c	[CodeGen] Always promote f16 if not legal We don't currently have any runtime library functions for operations on f16 values (other than conversions to and from f32 and f64), so we should always promote it to f32, even if that is not a legal type. In that case, the f32 values would be softened to f32 library calls. SoftenFloatRes_FP_EXTEND now needs to check the promoted operand's type, as it may ne a no-op or require a different library call. getCopyFromParts and getCopyToParts now need to cope with a floating-point value stored in a larger integer part, as is the case for any target that needs to store an f16 value in a 32-bit integer register. Differential Revision: http://reviews.llvm.org/D12856 llvm-svn: 252459	2015-11-09 11:03:18 +00:00
Colin LeMahieu	9ea507edc7	[Hexagon] Adding override to methods. llvm-svn: 252453	2015-11-09 07:10:24 +00:00
Colin LeMahieu	775d7ad677	[Hexagon] Fixing warnings. llvm-svn: 252448	2015-11-09 05:47:56 +00:00
Colin LeMahieu	a1adb51e6b	[Hexagon] Removing extra gen line. llvm-svn: 252447	2015-11-09 05:31:39 +00:00
Colin LeMahieu	892f54f408	[Hexagon] Maybe the makefile? llvm-svn: 252446	2015-11-09 05:16:08 +00:00
Colin LeMahieu	d5537bf219	[Hexagon] Adding LLVMBuild.txt reference to HexagonAsmParser. llvm-svn: 252444	2015-11-09 04:31:02 +00:00
Colin LeMahieu	7cd0892729	[Hexagon] Enabling ASM parsing on Hexagon backend and adding instruction parsing tests. General updating of the code emission. llvm-svn: 252443	2015-11-09 04:07:48 +00:00
Mehdi Amini	3383ccc400	Add a method to the BitcodeReader to parse only the identification block Summary: Mimic parseTriple(); and exposes it to LTOModule.cpp Reviewers: dexonsmith, rafael Subscribers: llvm-commits From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 252442	2015-11-09 02:46:41 +00:00
Colin LeMahieu	8a0453e23a	[AsmParser] Backends can parameterize ASM tokenization. llvm-svn: 252439	2015-11-09 00:31:07 +00:00
Colin LeMahieu	7820dff228	[AsmParser] Provide target direct access to mnemonic token. Allow assignment parsing to be hooked by target. Allow target to specify if identifier is a label. Differential Revision: http://reviews.llvm.org/D14255 llvm-svn: 252435	2015-11-09 00:15:45 +00:00
Xinliang David Li	441959d296	[PGO] Instr func name var creation code refactoring Move the code from cfe to LLMV and become shared interfaces. There is no functional change. llvm-svn: 252433	2015-11-09 00:01:22 +00:00
Colin LeMahieu	a4c85d4c96	[AsmParser] Allow tokens to be put back in to the token stream. Differential Revision: http://reviews.llvm.org/D14252 llvm-svn: 252432	2015-11-08 23:48:23 +00:00
Maksim Panchenko	87ef57148a	[RuntimeDyld] Add support for R_X86_64_PC8 relocation. llvm-svn: 252423	2015-11-08 19:34:17 +00:00
NAKAMURA Takumi	02d97aa74e	Appease hosts without HAVE_BACKTRACE nor ENABLE_BACKTRACES. llvm/lib/Support/Signals.cpp:66:13: warning: unused function 'printSymbolizedStackTrace' [-Wunused-function] llvm/lib/Support/Signals.cpp:52:13: warning: function 'findModulesAndOffsets' has internal linkage but is not defined [-Wundefined-internal] llvm-svn: 252418	2015-11-08 09:45:06 +00:00
Hal Finkel	f046f72efa	[PowerPC] Fix LoopPreIncPrep not to depend on SCEV constant simplifications Under most circumstances, if SCEV can simplify X-Y to a constant, then it can also simplify Y-X to a constant. However, there is no guarantee that this is always true, and concensus is not to consider that a correctness bug in SCEV (although it is undesirable). PPCLoopPreIncPrep gathers pointers used to access memory (via loads, stores and prefetches) into buckets, where in each bucket the relative pointer offsets are constant. We used to keep each bucket as a multimap, where SCEV's subtraction operation was used to define the ordering predicate. Instead, use a fixed SCEV base expression for each bucket, record the constant offsets from that base expression, and adjust it later, if desirable, once all pointers have been collected. Doing it this way should be more compile-time efficient than the previous scheme (in addition to making the implementation less sensitive to SCEV simplification quirks). Fixes PR25170. llvm-svn: 252417	2015-11-08 08:04:40 +00:00
David Majnemer	b222184223	[LoopStrengthReduce] Don't bother fixing up PHIs from EH Pad preds We cannot really insert fixup code into a PHI's predecessor. This fixes PR25445. llvm-svn: 252416	2015-11-08 05:04:07 +00:00
David Majnemer	e35244cf63	[WinEH] Update PHIs of CATCHRET successors The TailDuplication machine pass ran across a malformed CFG: a PHI node referred it's predecessor's predecessor instead of it's predecessor. This occurred because we split the edge in X86ISelLowering when we processed the CATCHRET but forgot to do something about the PHI nodes. This fixes PR25444. llvm-svn: 252413	2015-11-08 02:36:00 +00:00
Yaron Keren	9ffee46d45	Erase unused FunctionDIs variables after r252219. llvm-svn: 252401	2015-11-07 10:21:25 +00:00
Akira Hatanaka	97cb397132	[Bitcode] Add enums for call instruction markers and flags. NFC. This commit adds enums in LLVMBitCodes.h to improve readability and maintainability. This is a follow-up to r252368 which was discussed here: http://reviews.llvm.org/D12923 llvm-svn: 252395	2015-11-07 02:48:49 +00:00
Nico Weber	00406472e8	Try to fix build more -- like r252392 but for WebAssembly. llvm-svn: 252394	2015-11-07 02:47:31 +00:00
Sanjoy Das	76dd243f99	Unbreak the build My code clashed with some ilist iterator changes upstream. Fix by adding an explicit "&*" coercion. llvm-svn: 252392	2015-11-07 02:26:53 +00:00
Sanjoy Das	ea1df7fe9f	[FunctionAttrs] Add comment and clarify assertion message; NFC llvm-svn: 252389	2015-11-07 01:56:07 +00:00
Sanjoy Das	54c3ca694a	[OperandBundles] Rename accessor, NFC Rename getOperandBundle to getOperandBundleAt since that's more obvious. llvm-svn: 252388	2015-11-07 01:56:04 +00:00
Sanjoy Das	71fe81fd25	[FunctionAttrs] Add handling for operand bundles Summary: Teach the FunctionAttrs to do the right thing for IR with operand bundles. Reviewers: reames, chandlerc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14408 llvm-svn: 252387	2015-11-07 01:56:00 +00:00
Sanjoy Das	436e2397f8	[FunctionAttrs] Fix an iterator wraparound bug Summary: This change fixes an iterator wraparound bug in `determinePointerReadAttrs`. Ideally, ++'ing off the `end()` of an iplist should result in a failed assert, but currently iplist seems to silently wrap to the head of the list on `end()++`. This is why the bad behavior is difficult to demonstrate. Reviewers: chandlerc, reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14350 llvm-svn: 252386	2015-11-07 01:55:53 +00:00
Joseph Tremoulet	f748c8937e	[WinEH] Update exception pointer registers Summary: The CLR's personality routine passes these in rdx/edx, not rax/eax. Make getExceptionPointerRegister a virtual method parameterized by personality function to allow making this distinction. Similarly make getExceptionSelectorRegister a virtual method parameterized by personality function, for symmetry. Reviewers: pgavlin, majnemer, rnk Subscribers: jyknight, dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D14344 llvm-svn: 252383	2015-11-07 01:11:31 +00:00
David Majnemer	eafa28a0d9	[InstCombine] Teach FoldPHIArgZextsIntoPHI about EHPads FoldPHIArgZextsIntoPHI cannot insert an instruction after the PHI if there is an EHPad in the BB. Doing so would result in an instruction inserted after a terminator. llvm-svn: 252377	2015-11-07 00:52:53 +00:00
Duncan P. N. Exon Smith	83c4b68720	ADT: Remove last implicit ilist iterator conversions, NFC Some implicit ilist iterator conversions have crept back into Analysis, Transforms, Hexagon, and llvm-stress. This removes them. I'll commit a patch immediately after this to disallow them (in a separate patch so that it's easy to revert if necessary). llvm-svn: 252371	2015-11-07 00:01:16 +00:00
David Majnemer	27f2447fb3	[InstCombine] Don't insert an instruction after a terminator We tried to insert a cast of a phi in a block whose terminator is an EHPad. This is invalid. Do not attempt the transform in these circumstances. llvm-svn: 252370	2015-11-06 23:59:23 +00:00
Akira Hatanaka	5cfcce12eb	Add 'notail' marker for call instructions. This marker prevents optimization passes from adding 'tail' or 'musttail' markers to a call. Is is used to prevent tail call optimization from being performed on the call. rdar://problem/22667622 Differential Revision: http://reviews.llvm.org/D12923 llvm-svn: 252368	2015-11-06 23:55:38 +00:00
Pawel Bylica	6e680b2be7	Revert r252366: [Support] Use GetTempDir to get the temporary dir path on Windows. llvm-svn: 252367	2015-11-06 23:44:23 +00:00
Pawel Bylica	b43221439c	[Support] Use GetTempDir to get the temporary dir path on Windows. Summary: In general GetTempDir follows the same logic as the replaced code: checks env variables TMP, TEMP, USERPROFILE in order. However, it also perform other checks like making separators native (\), making the path absolute, etc. This change fixes FileSystemTest.CreateDir unittest that had been failing when run from Unix-like shell on Windows (Unix-like path separator (/) used in env variables). Reviewers: chapuni, rafael, aaron.ballman Subscribers: rafael, llvm-commits Differential Revision: http://reviews.llvm.org/D14231 llvm-svn: 252366	2015-11-06 23:21:49 +00:00
Ahmed Bougacha	cf49b523a0	[AArch64][FastISel] Don't even try to select vector icmps. We used to try to constant-fold them to i32 immediates. Given that fast-isel doesn't otherwise support vNi1, when selecting the result users, we'd fallback to SDAG anyway. However, if the users were in another block, we'd insert broken cross-class copies (GPR32 to FPR64). Give up, let SDAG agree with itself on a vNi1 legalization strategy. llvm-svn: 252364	2015-11-06 23:16:53 +00:00
Ahmed Bougacha	b49eb3ab4b	[X86] Fold (trunc (i32 (zextload i16))) into vbroadcast. When matching non-LSB-extracting truncating broadcasts, we now insert the necessary SRL. If the scalar resulted from a load, the SRL will be folded into it, creating a narrower, offset, load. However, i16 loads aren't Desirable, so we get i16->i32 zextloads. We already catch i16 aextloads; catch these as well. llvm-svn: 252363	2015-11-06 23:16:48 +00:00
Ahmed Bougacha	05a0514b12	[X86] SRL non-LSB extracts when folding to truncating broadcasts. Now that we recognize this, we can support it instead of bailing out. That is, we can fold: (v8i16 (shufflevector (v8i16 (bitcast (v4i32 (build_vector X, Y, ...)))), <1,1,...,1>)) into: (v8i16 (vbroadcast (i16 (trunc (srl Y, 16))))) llvm-svn: 252362	2015-11-06 23:16:43 +00:00
Ahmed Bougacha	68614a36d1	[X86] Don't fold non-LSB extracts into truncating broadcasts. We used to incorrectly assume that the offset we're extracting from was a multiple of the element size. So, we'd fold: (v8i16 (shufflevector (v8i16 (bitcast (v4i32 (build_vector X, Y, ...)))), <1,1,...,1>)) into: (v8i16 (vbroadcast (i16 (trunc Y)))) whereas we should have extracted the higher bits from X. Instead, bail out if the assumption doesn't hold. llvm-svn: 252361	2015-11-06 23:16:38 +00:00
Tom Stellard	05691a678e	DAGCombiner: Check shouldReduceLoadWidth before combining (and (load), x) -> extload Reviewers: resistor, arsenm Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13805 llvm-svn: 252349	2015-11-06 21:58:37 +00:00
David Majnemer	7204cff0a1	[InstCombine] Don't RAUW tokens with undef Let SimplifyCFG remove unreachable BBs which define token instructions. llvm-svn: 252343	2015-11-06 21:26:32 +00:00
Davide Italiano	d9f87b4642	[SimplifyLibCalls] Don't hardcode the function name. llvm-svn: 252342	2015-11-06 21:05:07 +00:00
Quentin Colombet	9a8efc08d3	[ShrinkWrapping] Teach shrink-wrapping how to analyze RegMask. Previously we were conservatively assuming that RegMask operands clobber callee saved registers. llvm-svn: 252341	2015-11-06 21:00:13 +00:00
Matthias Braun	9198c671e8	MachineScheduler: Add regpressure information to debug dump llvm-svn: 252340	2015-11-06 20:59:02 +00:00
Tom Stellard	41b7e63040	AMDGPU/SI: Refactor VOP[12C] tablegen definitions Summary: Pass the VOPProfile object all the through to *_m multiclasses. This will allow us to do more simplifications in the future. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D13437 llvm-svn: 252339	2015-11-06 20:56:18 +00:00
Mehdi Amini	b0e3192a48	Fix SLPVectorizer commutativity reordering The SLPVectorizer had a very crude way of trying to benefit from associativity: it tried to optimize for splat/broadcast or in order to have the same operator on the same side. This is benefitial to the cost model and allows more vectorization to occur. This patch improve the logic and make the detection optimal (locally, we don't look at the full tree but only at the immediate children). Should fix https://llvm.org/bugs/show_bug.cgi?id=25247 Reviewers: mzolotukhin Differential Revision: http://reviews.llvm.org/D13996 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 252337	2015-11-06 20:17:51 +00:00
Andrew Kaylor	4731bea3e5	Improved the operands commute transformation for X86-FMA3 instructions. All 3 operands of FMA3 instructions are commutable now. Patch by Slava Klochkov Reviewers: Quentin Colombet(qcolombet), Ahmed Bougacha(ab). Differential Revision: http://reviews.llvm.org/D13269 llvm-svn: 252335	2015-11-06 19:47:25 +00:00
Dan Gohman	4b96d8d1ff	[WebAssembly] Make expression-stack pushing explicit Modelling of the expression stack is evolving. This patch takes another step by making pushes explicit. Differential Revision: http://reviews.llvm.org/D14338 llvm-svn: 252334	2015-11-06 19:45:01 +00:00
Sanjoy Das	55ea67cea7	[ValueTracking] Add parameters to isImpliedCondition; NFC Summary: This change makes the `isImpliedCondition` interface similar to the rest of the functions in ValueTracking (in that it takes a DataLayout, AssumptionCache etc.). This is an NFC, intended to make a later diff less noisy. Depends on D14369 Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14391 llvm-svn: 252333	2015-11-06 19:01:08 +00:00
Sanjoy Das	c01b4d2b28	[ValueTracking] De-pessimize isImpliedCondition around unsigned compares Summary: Currently `isImpliedCondition` will optimize "I +_nuw C < L ==> I < L" only if C is positive. This is an unnecessary restriction -- the implication holds even if `C` is negative. Reviewers: reames, majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14369 llvm-svn: 252332	2015-11-06 19:01:03 +00:00
Sanjoy Das	9349dcc74a	[ValueTracking] Add a framework for encoding implication rules Summary: This change adds a framework for adding more smarts to `isImpliedCondition` around inequalities. Informally, `isImpliedCondition` will now try to prove "A < B ==> C < D" by proving "C <= A && B <= D", since then it follows "C <= A < B <= D". While this change is in principle NFC, I could not think of a way to not handle cases like "i +_nsw 1 < L ==> i < L +_nsw 1" (that ValueTracking did not handle before) while keeping the change understandable. I've added tests for these cases. Reviewers: reames, majnemer, hfinkel Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14368 llvm-svn: 252331	2015-11-06 19:00:57 +00:00
Matt Arsenault	f59e538937	AMDGPU: Cleanup includes llvm-svn: 252328	2015-11-06 18:23:00 +00:00
Matt Arsenault	0c90e9501e	AMDGPU: Create emergency stack slots during frame lowering Test has a bogus verifier error which will be fixed by later commits. llvm-svn: 252327	2015-11-06 18:17:45 +00:00
Matt Arsenault	08f14de244	AMDGPU: Remove unused scratch resource operands The SGPR spill pseudos don't actually use them. llvm-svn: 252324	2015-11-06 18:07:53 +00:00
Matt Arsenault	3931948bb6	AMDGPU: Add pass to detect used kernel features Mark kernels that use certain features that require user SGPRs to support with kernel attributes. We need to know before instruction selection begins because it impacts the kernel calling convention lowering. For now this only detects the workitem intrinsics. llvm-svn: 252323	2015-11-06 18:01:57 +00:00
Matt Arsenault	4dc7a5a5c6	AMDGPU: Fix hardcoded alignment of spill. Instead of forcing 4 alignment when spilled, set register class alignments. llvm-svn: 252322	2015-11-06 17:54:47 +00:00
Matt Arsenault	623e6fd466	AMDGPU: Hack for VS_32 register pressure For some reason VS_32 ends up factoring into the pressure heuristics even though we should never see a virtual register with this class. When SGPRs are reserved for register spilling, this for some reason triggers reg-crit scheduling. Setting isAllocatable = 0 may help with this since that seems to remove it from the default implementation's generated table. llvm-svn: 252321	2015-11-06 17:54:43 +00:00
Teresa Johnson	1063293a89	Restore "Move metadata linking after lazy global materialization/linking." Summary: This reverts commit r251965. Restore "Move metadata linking after lazy global materialization/linking." This restores commit r251926, with fixes for the LTO bootstrapping bot failure. The bot failure was caused by references from debug metadata to otherwise unreferenced globals. Previously, this caused the lazy linking to link in their defs, which is unnecessary. With this patch, because lazy linking is complete when we encounter the metadata reference, the materializer created a declaration. For definitions such as aliases and comdats, it is illegal to have a declaration. Furthermore, metadata linking should not change code generation. Therefore, when linking of global value bodies is complete, the materializer will simply return nullptr as the new reference for the linked metadata. This change required fixing a different test to ensure there was a real reference to a linkonce global that was only being reference from metadata. Note that the new changes to the only-needed-named-metadata.ll test illustrate an issue with llvm-link -only-needed handling of comdat groups, whereby it may result in an incomplete comdat group. I note this in the test comments, but the issue is orthogonal to this patch (it can be reproduced without any metadata at head). Reviewers: dexonsmith, rafael, tra Subscribers: tobiasvk, joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D14447 llvm-svn: 252320	2015-11-06 17:50:53 +00:00
Teresa Johnson	189b252652	Restore "Move metadata linking after lazy global materialization/linking." This reverts commit r251965. llvm-svn: 252319	2015-11-06 17:50:48 +00:00
Reid Kleckner	b8fd162fc5	[WinEH] Mark funclet entries and exits as clobbering all registers Summary: In this implementation, LiveIntervalAnalysis invents a few register masks on basic block boundaries that preserve no registers. The nice thing about this is that it prevents the prologue inserter from thinking it needs to spill all XMM CSRs, because it doesn't see any explicit physreg defs in the MI. Reviewers: MatzeB, qcolombet, JosephTremoulet, majnemer Subscribers: MatzeB, llvm-commits Differential Revision: http://reviews.llvm.org/D14407 llvm-svn: 252318	2015-11-06 17:06:38 +00:00
Chad Rosier	43f9b48975	[LIR] Simplify code by making DataLayout globally accessible. NFC. llvm-svn: 252317	2015-11-06 16:33:57 +00:00
Jun Bum Lim	22fe15ee86	[AArch64]Enable the narrow ld promotion only on profitable microarchitectures The benefit from converting narrow loads into a wider load (r251438) could be micro-architecturally dependent, as it assumes that a single load with two bitfield extracts is cheaper than two narrow loads. Currently, this conversion is enabled only in cortex-a57 on which performance benefits were verified. llvm-svn: 252316	2015-11-06 16:27:47 +00:00
Rafael Espindola	889d7bb4cb	Bring r252305 back with a test fix. We now create the .eh_frame section early, just like every other special section. This means that the special flags are visible in code that explicitly asks for ".eh_frame". llvm-svn: 252313	2015-11-06 15:30:45 +00:00
Rafael Espindola	1aa4d1c56f	Revert "Simplify the creation of .eh_frame/.debug_frame sections." This reverts commit r252305. Investigating a test failure. llvm-svn: 252306	2015-11-06 14:51:09 +00:00
Rafael Espindola	e69bcd7ef8	Simplify the creation of .eh_frame/.debug_frame sections. llvm-svn: 252305	2015-11-06 14:47:44 +00:00
Rafael Espindola	5b2131cd32	git clang-format and fix variable names. NFC. llvm-svn: 252304	2015-11-06 14:12:17 +00:00
Rafael Espindola	b20b70687a	Use SHT_X86_64_UNWIND on every OS. That is the ABI required type. Linkers still check the section name, so everything should still work. llvm-svn: 252300	2015-11-06 13:35:35 +00:00
Rafael Espindola	97588e1564	Pass SectionStart directly to the one function that uses it. llvm-svn: 252299	2015-11-06 13:14:59 +00:00
Daniel Sanders	5762a4f9d1	[mips][ias] Range check uimm4 operands and fixed a bug this revealed. Summary: The bug was that the sldi instructions have immediate widths dependant on their element size. So sldi.d has a 1-bit immediate and sldi.b has a 4-bit immediate. All of these were using 4-bit immediates previously. Reviewers: vkalintiris Subscribers: llvm-commits, atanasyan, dsanders Differential Revision: http://reviews.llvm.org/D14018 llvm-svn: 252297	2015-11-06 12:41:43 +00:00
Daniel Sanders	38ce0f629c	[mips][ias] Range check uimm3 operands. Summary: Reviewers: vkalintiris Subscribers: atanasyan, dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D14016 llvm-svn: 252296	2015-11-06 12:31:27 +00:00
Daniel Sanders	ea4f653d18	[mips][ias] Range check uimm2 operands and fix a bug this revealed. Summary: The bug was that the MIPS32R6/MIPS64R6/microMIPS32R6 versions of LSA and DLSA (unlike the MSA version) failed to account for the off-by-one encoding of the immediate. The range is actually 1..4 rather than 0..3. Reviewers: vkalintiris Subscribers: atanasyan, dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D14015 llvm-svn: 252295	2015-11-06 12:22:31 +00:00
Daniel Sanders	52da7af4d2	[mips][ias] Range check uimmz operands. Reviewers: vkalintiris Subscribers: dsanders, atanasyan, llvm-commits Differential Revision: http://reviews.llvm.org/D14013 llvm-svn: 252294	2015-11-06 12:11:03 +00:00
Vasileios Kalintiris	b04672cade	[mips] Define patterns for the atomic_{load,store}_{8,16,32,64} nodes. Summary: Without these patterns we would generate a complete LL/SC sequence. This would be problematic for memory regions marked as WRITE-only or READ-only, as the instructions LL/SC would read/write to the protected memory regions correspondingly. Reviewers: dsanders Subscribers: llvm-commits, dsanders Differential Revision: http://reviews.llvm.org/D14397 llvm-svn: 252293	2015-11-06 12:07:20 +00:00
Tom Stellard	1e1b05db24	AMDGPU/SI: Emit HSA kernels with symbol type STT_AMDGPU_HSA_KERNEL Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D13804 llvm-svn: 252291	2015-11-06 11:45:14 +00:00
James Molloy	e6f87ca812	Add a new attribute: norecurse This attribute allows the compiler to assume that the function never recurses into itself, either directly or indirectly (transitively). This can be used among other things to demote global variables to locals. llvm-svn: 252282	2015-11-06 10:32:53 +00:00
NAKAMURA Takumi	9947cacebf	Revert r252249 (and r252255, r252258), "[WinEH] Clone funclets with multiple parents" It behaved flaky due to iterating pointer key values on std::set and std::map. llvm-svn: 252279	2015-11-06 10:07:33 +00:00
Xinliang David Li	6aa216c21c	Code style fix (caused by wrongly default clang-format style) (NFC) llvm-svn: 252276	2015-11-06 07:54:21 +00:00
Rafael Espindola	46be435228	Simplify the alignment handling in FDE emission. llvm-svn: 252271	2015-11-06 03:02:51 +00:00
Rafael Espindola	472954fa63	Delete dead store. NFC. llvm-svn: 252270	2015-11-06 02:44:22 +00:00
Reid Kleckner	e535c1f856	Range-for some LiveIntervals code under review llvm-svn: 252267	2015-11-06 02:01:02 +00:00
Reid Kleckner	51460c139e	[WinEH] Split EH_RESTORE out of CATCHRET for 32-bit EH This adds the EH_RESTORE x86 pseudo instr, which is responsible for restoring the stack pointers: EBP and ESP, and ESI if stack realignment is involved. We only need this on 32-bit x86, because on x64 the runtime restores CSRs for us. Previously we had to keep the CATCHRET instruction around during SEH so that we could convince X86FrameLowering to restore our frame pointers. Now we can split these instructions earlier. This was confusing, because we had a return instruction which wasn't really a return and was ultimately going to be removed by X86FrameLowering. This change also simplifies X86FrameLowering, which really shouldn't be building new MBBs. No observable functional change currently, but with the new register mask stuff in D14407, CATCHRET will become a register allocator barrier, and our existing tests rely on us having reasonable register allocation around SEH. llvm-svn: 252266	2015-11-06 01:49:05 +00:00
Rafael Espindola	339464228d	Use a range loop. llvm-svn: 252260	2015-11-06 01:25:56 +00:00
Andrew Kaylor	f477585a2b	Fix build warnings llvm-svn: 252255	2015-11-06 01:08:35 +00:00
Andrew Kaylor	29cd576554	[WinEH] Clone funclets with multiple parents Windows EH funclets need to always return to a single parent funclet. However, it is possible for earlier optimizations to combine funclets (probably based on one funclet having an unreachable terminator) in such a way that this condition is violated. These changes add code to the WinEHPrepare pass to detect situations where a funclet has multiple parents and clone such funclets, fixing up the unwind and catch return edges so that each copy of the funclet returns to the correct parent funclet. Differential Revision: http://reviews.llvm.org/D13274?id=39098 llvm-svn: 252249	2015-11-06 00:20:50 +00:00
Rafael Espindola	6efa6fb4d7	Pass the streamer to the constructor instead of every other method. NFC. llvm-svn: 252246	2015-11-06 00:05:57 +00:00
Rafael Espindola	a1d960ef54	Simplify the constructor. NFC. llvm-svn: 252243	2015-11-05 23:55:51 +00:00
Rafael Espindola	68c2165fd1	git-clang-format an area I am about to change. llvm-svn: 252241	2015-11-05 23:54:18 +00:00
Rafael Espindola	626788c093	Small simplification by moving early continue earlier. llvm-svn: 252237	2015-11-05 23:47:20 +00:00
Sanjoy Das	c1a2977fb2	Re-apply r251050 with a for PR25421 The bug: I missed adding break statements in the switch / case. Original commit message: [SCEV] Teach SCEV some axioms about non-wrapping arithmetic Summary: - A s< (A + C)<nsw> if C > 0 - A s<= (A + C)<nsw> if C >= 0 - (A + C)<nsw> s< A if C < 0 - (A + C)<nsw> s<= A if C <= 0 Right now `C` needs to be a constant, but we can later generalize it to be a non-constant if needed. Reviewers: atrick, hfinkel, reames, nlewycky Subscribers: sanjoy, llvm-commits Differential Revision: http://reviews.llvm.org/D13686 llvm-svn: 252236	2015-11-05 23:45:38 +00:00
Richard Trieu	f8978e1a74	Revert r251050 to fix miscompile when running Clang -O1 See bug for details: https://llvm.org/bugs/show_bug.cgi?id=25421 Some comparisons were incorrectly replaced with a constant value. llvm-svn: 252231	2015-11-05 23:20:36 +00:00
Peter Collingbourne	d4bff30370	DI: Reverse direction of subprogram -> function edge. Previously, subprograms contained a metadata reference to the function they described. Because most clients need to get or set a subprogram for a given function rather than the other way around, this created unneeded inefficiency. For example, many passes needed to call the function llvm::makeSubprogramMap() to build a mapping from functions to subprograms, and the IR linker needed to fix up function references in a way that caused quadratic complexity in the IR linking phase of LTO. This change reverses the direction of the edge by storing the subprogram as function-level metadata and removing DISubprogram's function field. Since this is an IR change, a bitcode upgrade has been provided. Fixes PR23367. An upgrade script for textual IR for out-of-tree clients is attached to the PR. Differential Revision: http://reviews.llvm.org/D14265 llvm-svn: 252219	2015-11-05 22:03:56 +00:00
Tim Northover	775aaeb765	Remove windows line endings introduced by r252177. NFC. llvm-svn: 252217	2015-11-05 21:54:58 +00:00
Alexey Samsonov	55fda1be94	[ASan] Disable instrumentation for inalloca variables. inalloca variables were not treated as static allocas, therefore didn't participate in regular stack instrumentation. We don't want them to participate in dynamic alloca instrumentation as well. llvm-svn: 252213	2015-11-05 21:18:41 +00:00
Alexander Kornienko	db73c2f54c	Refactor: Simplify boolean conditional return statements in lib/llvm/ExecutionEngine/Orc Patch by Richard Thomson! Differential revision: http://reviews.llvm.org/D9973 llvm-svn: 252212	2015-11-05 21:18:09 +00:00
Reid Kleckner	6ddae31045	[WinEH] Fix funclet prologues with stack realignment We already had a test for this for 32-bit SEH catchpads, but those don't actually create funclets. We had a bug that only appeared in funclet prologues, where we would establish EBP and ESI as our FP and BP, and then downstream prologue code would overwrite them. While I was at it, I fixed Win64+funclets+stackrealign. This issue doesn't come up as often there due to the ABI requring 16 byte stack alignment, but now we can rest easy that AVX and WinEH will work well together =P. llvm-svn: 252210	2015-11-05 21:09:49 +00:00
Alexander Kornienko	484e48e3a3	Refactor: Simplify boolean conditional return statements in llvm/lib/Analysis Patch by Richard Thomson! Differential revision: http://reviews.llvm.org/D9967 llvm-svn: 252209	2015-11-05 21:07:12 +00:00
Dan Gohman	b9ce5a8b6c	[WebAssembly] Fix copypasta. Noticed by dschff in http://reviews.llvm.org/rL252203 llvm-svn: 252208	2015-11-05 20:59:49 +00:00
Dan Gohman	da7f428a4a	[WebAssembly] Rename Immediate instructions to Const. This more closely reflects the naming convention in the spec. llvm-svn: 252204	2015-11-05 20:44:29 +00:00
Dan Gohman	af29bd4fd4	[WebAssembly] Add AsmString strings for most instructions. Mangling type information into MachineInstr opcode names was a temporary measure, and it's starting to get hairy. At the same time, the MC instruction printer wants to use AsmString strings for printing. This patch takes the first step, starting the process of adding AsmStrings for instructions. llvm-svn: 252203	2015-11-05 20:42:30 +00:00
Dan Gohman	d7ffb919c1	[WebAssembly] Update wasm builtin functions to match spec changes. The page_size operator has been removed from the spec, and the resize_memory operator has been changed to grow_memory. llvm-svn: 252202	2015-11-05 20:16:59 +00:00
Sanjay Patel	387e66e79f	replace MachineCombinerPattern namespace and enum with enum class; NFCI Also, remove an enum hack where enum values were used as indexes into an array. We may want to make this a real class to allow pattern-based queries/customization (D13417). llvm-svn: 252196	2015-11-05 19:34:57 +00:00
Dan Gohman	e9361d58ff	[WebAssembly] Add WebAssemblyMCInstLower.cpp. This isn't used yet; it's just a start towards eventually using MC to do instruction printing, and eventually binary encoding. llvm-svn: 252194	2015-11-05 19:28:16 +00:00
Kevin Enderby	7a96942a6a	Reapply r250906 with many suggested updates from Rafael Espindola. The needed lld matching changes to be submitted immediately next, but this revision will cause lld failures with this alone which is expected. This removes the eating of the error in Archive::Child::getSize() when the characters in the size field in the archive header for the member is not a number. To do this we have all of the needed methods return ErrorOr to push them up until we get out of lib. Then the tools and can handle the error in whatever way is appropriate for that tool. So the solution is to plumb all the ErrorOr stuff through everything that touches archives. This include its iterators as one can create an Archive object but the first or any other Child object may fail to be created due to a bad size field in its header. Thanks to Lang Hames on the changes making child_iterator contain an ErrorOr<Child> instead of a Child and the needed changes to ErrorOr.h to add operator overloading for * and -> . We don’t want to use llvm_unreachable() as it calls abort() and is produces a “crash” and using report_fatal_error() to move the error checking will cause the program to stop, neither of which are really correct in library code. There are still some uses of these that should be cleaned up in this library code for other than the size field. The test cases use archives with text files so one can see the non-digit character, in this case a ‘%’, in the size field. These changes will require corresponding changes to the lld project. That will be committed immediately after this change. But this revision will cause lld failures with this alone which is expected. llvm-svn: 252192	2015-11-05 19:24:56 +00:00
Davide Italiano	a345877ce8	[SimplifyLibCalls] Use hasFloatVersion(). NFCI. llvm-svn: 252186	2015-11-05 19:18:23 +00:00
Oleg Ranevskyy	057c5a6b2b	[DebugInfo] Fix ARM/AArch64 prologue_end position. Related to D11268. Summary: This review is related to another review request http://reviews.llvm.org/D11268, does the same and merely fixes a couple of issues with it. D11268 is quite old and has merge conflicts against the current trunk. This request - rebases D11268 onto the new trunk; - resolves the merge conflicts; - fixes the prologue_end tests, which do not pass due to the subprogram definitions not marked as distinct. Reviewers: echristo, rengolin, kubabrecka Subscribers: aemerson, rengolin, jyknight, dsanders, llvm-commits, asl Differential Revision: http://reviews.llvm.org/D14338 llvm-svn: 252177	2015-11-05 17:50:17 +00:00
Petar Jovanovic	99fba3c141	Add cfi instr for CFA calculation when movpc is expanded to call and pop This fixes the issue of wrong CFA calculation in the following case: 0x08048400 <+0>: push %ebx 0x08048401 <+1>: sub $0x8,%esp 0x08048404 <+4>: call 0x8048409 <test+9> 0x08048409 <+9>: pop %eax 0x0804840a <+10>: add $0x1bf7,%eax 0x08048410 <+16>: mov %eax,%ebx 0x08048412 <+18>: call 0x80483f0 <bar> 0x08048417 <+23>: add $0x8,%esp 0x0804841a <+26>: pop %ebx 0x0804841b <+27>: ret The highlighted instructions are a product of movpc instruction. The call instruction changes the stack pointer, and pop instruction restores its value. However, the rule for computing CFA is not updated and is wrong on the pop instruction. So, e.g. backtrace in gdb does not work when on the pop instruction. This adds cfi instructions for both call and pop instructions. cfi_adjust_cfa_offset** instruction is used with the appropriate offset for setting the rules to calculate CFA correctly. Patch by Violeta Vukobrat. Differential Revision: http://reviews.llvm.org/D14021 llvm-svn: 252176	2015-11-05 17:19:59 +00:00
Derek Schuff	8a76b04a63	[WebAssembly] Rename ior operator to or to match the spec Summary: The spec uses "or" for inclusive-or and "xor" for exclusive-or Reviewers: sunfish Subscribers: jfb, llvm-commits, dschuff Differential Revision: http://reviews.llvm.org/D14362 llvm-svn: 252174	2015-11-05 17:08:11 +00:00
James Molloy	bef6e43107	[ARM] Compute known bits for ARMISD::CMOV We can conservatively know that CMOV's known bits are the intersection of known bits for each of its operands. This helps PerformCMOVToBFICombine find more opportunities. I tried hard to create a testcase for this and failed - we have to sufficiently confuse DAG.computeKnownBits which can see through all the cheap tricks I tried to narrow my larger testcase down :( This code is actually exercised in CodeGen/ARM/bfi.ll, there's just no functional difference because DAG.computeKnownBits gets the right answer in that case. llvm-svn: 252168	2015-11-05 15:21:58 +00:00
Aaron Ballman	3c44b42e70	Fix a signed/unsigned mismatch warning; NFC. llvm-svn: 252164	2015-11-05 14:22:56 +00:00
Asaf Badouh	f99c054ebc	revert rev. 252153 due to build failure on ubuntu [X86][AVX512] add comi with Sae llvm-svn: 252154	2015-11-05 08:55:54 +00:00
Asaf Badouh	7fdabf0a35	[X86][AVX512] add comi with Sae add builtin_ia32_vcomisd and builtin_ia32_vcomisd Differential Revision: http://reviews.llvm.org/D14331 llvm-svn: 252153	2015-11-05 08:45:06 +00:00
James Molloy	9e959ac397	[SimplifyCFG] Tweak heuristic for merging conditional stores We were correctly skipping dbginfo intrinsics and terminators, but the initial bailout wasn't, causing it to bail out on almost any block. llvm-svn: 252152	2015-11-05 08:40:19 +00:00
Asaf Badouh	a8209d92cc	[X86][AVX512] small bugfix in VPBROADCASTM VPBROADCASTMW2D and VPBROADCASTMB2Q Differential Revision: http://reviews.llvm.org/D14335 llvm-svn: 252151	2015-11-05 08:08:21 +00:00
Saleem Abdulrasool	01556dede1	RuntimeDyld: fix -Wtype-limits Adjust the casted type. By casting to the same size rather than just the signed-ness, we were asserting tautological statements. NFC. llvm-svn: 252150	2015-11-05 06:24:09 +00:00
Mehdi Amini	afd135197b	Fix LoopAccessAnalysis when potentially nullptr check are involved Summary: GetUnderlyingObjects() can return "null" among its list of objects, we don't want to deduce that two pointers can point to the same memory in this case, so filter it out. Reviewers: anemet Subscribers: dexonsmith, llvm-commits From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 252149	2015-11-05 05:49:43 +00:00
Matt Arsenault	5b22dfa65d	AMDGPU: Also track whether SGPRs were spilled llvm-svn: 252145	2015-11-05 05:27:10 +00:00
Matt Arsenault	d41c0dbff0	AMDGPU: Print number user SGPRs This doesn't quite match how SC prints it, which doesn't put it in a comment. llvm-svn: 252144	2015-11-05 05:27:07 +00:00
Matt Arsenault	68802d3177	AMDGPU: Disallow s[102:103] on VI in assembler llvm-svn: 252142	2015-11-05 03:11:27 +00:00
Sanjoy Das	98bfe26bf8	[FunctionAttrs] Remove a loop, NFC refactor Summary: Remove the loop over the uses of the CallSite in ArgumentUsesTracker. Since we have the `Use *` for actual argument operand, we can just use pointer subtraction. The time complexity remains the same though (except for a vararg argument) -- `std::advance` is O(UseIndex) for the ArgumentList iterator. The real motivation is to make a later change adding support for operand bundles simpler. Reviewers: reames, chandlerc, nlewycky Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14363 llvm-svn: 252141	2015-11-05 03:04:40 +00:00
Matt Arsenault	a40450cba2	AMDGPU: Fix assert when legalizing atomic operands The operand layout is slightly different for the atomic opcodes from the usual MUBUF loads and stores. This should only fix it on SI/CI. VI is still broken because it still emits the addr64 replacement. llvm-svn: 252140	2015-11-05 02:46:56 +00:00
Matt Arsenault	bed42a7320	AMDGPU: Make addr64 atomic operand order consistent vaddr comes before srsrc in every other MUBUF instruction, and is the order it is printed. llvm-svn: 252139	2015-11-05 02:46:53 +00:00
Mehdi Amini	7ae928ed8c	Fix OSX build after r252118 (missing parameter for findModulesAndOffsets()) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 252137	2015-11-05 02:29:57 +00:00
Mehdi Amini	766d05b012	Remove empty lines From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 252136	2015-11-05 02:29:53 +00:00
Joseph Tremoulet	6afccf6120	[WinEH] Fix establisher param reg in CLR funclets Summary: The CLR's personality routine passes the pointer to the establisher frame in RCX, not RDX. Reviewers: pgavlin, majnemer, rnk Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14343 llvm-svn: 252135	2015-11-05 02:20:07 +00:00
Sanjoy Das	776e4a7da7	[IR] Add bounds checking to dataOperandHasImpliedAttr This is similar to the bounds check added to paramHasAttr in r252073. llvm-svn: 252130	2015-11-05 01:53:26 +00:00
Kostya Serebryany	b8d0da1386	[libFuzzer] print a bit fewer lines llvm-svn: 252123	2015-11-05 01:19:42 +00:00
Rafael Espindola	e61a902371	Go back to producing relocations for out of range symbols. This brings back the behavior from before r252090 for out of range symbols. Should bring some arm bots back. llvm-svn: 252119	2015-11-05 01:10:15 +00:00
Reid Kleckner	ba5757da64	[Windows] Symbolize with llvm-symbolizer instead of dbghelp in a self-host Summary: llvm-symbolizer understands both PDBs and DWARF, so it is more likely to succeed at symbolization. If llvm-symbolizer is unavailable, we will fall back to dbghelp. This also makes our crash traces more similar between Windows and Linux. Reviewers: Bigcheese, zturner, chapuni Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D12884 llvm-svn: 252118	2015-11-05 01:07:54 +00:00
Matt Arsenault	6c2e200d38	AMDGPU: Fix typo llvm-svn: 252116	2015-11-05 01:03:08 +00:00
Xinliang David Li	192c748027	[PGO] Use template file to define runtime structures With this change, instrumentation code and reader/write code related to profile data structs are kept strictly in-sync. THis will be extended to cfe and compile-rt references as well. Differential Revision: http://reviews.llvm.org/D13843 llvm-svn: 252113	2015-11-05 00:47:26 +00:00
Mehdi Amini	ba19c6eed8	Fix Abbrev emission in WriteIdentificationBlock This Abbrev was not emitted and basically unused, just leacking there. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 252110	2015-11-05 00:25:03 +00:00
Rafael Espindola	b23f57832a	Fix pr24832. It is pretty simple now that the yak is shaved. llvm-svn: 252105	2015-11-05 00:10:08 +00:00
Rafael Espindola	7ae65d87cf	Simplify now that emitValueToOffset always returns false. llvm-svn: 252102	2015-11-04 23:59:18 +00:00
Rafael Espindola	04d39260d6	Simplify .org processing and make it a bit more powerful. We now always create the fragment, which lets us handle things like .org after a .align. llvm-svn: 252101	2015-11-04 23:50:29 +00:00
Davide Italiano	51507d2ad8	[SimplifyLibCalls] New transformation: tan(atan(x)) -> x This is enabled only under -ffast-math. So, instead of emitting: 4007b0: 50 push %rax 4007b1: e8 8a fd ff ff callq 400540 <atanf@plt> 4007b6: 58 pop %rax 4007b7: e9 94 fd ff ff jmpq 400550 <tanf@plt> 4007bc: 0f 1f 40 00 nopl 0x0(%rax) for: float mytan(float x) { return tanf(atanf(x)); } we emit a single retq. Differential Revision: http://reviews.llvm.org/D14302 llvm-svn: 252098	2015-11-04 23:36:56 +00:00
Kostya Serebryany	e692621a9d	[libFuzzer] when choosing the next unit to mutate, give some preference to the most recent units (they are more likely to be interesting) llvm-svn: 252097	2015-11-04 23:22:25 +00:00
Sanjoy Das	ea34382dfa	[CaptureTracking] Support operand bundles conservatively Summary: Earlier CaptureTracking would assume all "interesting" operands to a call or invoke were its arguments. With operand bundles this is no longer true. Note: an earlier change got `doesNotCapture` working correctly with operand bundles. This change uses DSE to test the changes to CaptureTracking. DSE is a vehicle for testing only, and is not directly involved in this change. Reviewers: reames, majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14306 llvm-svn: 252095	2015-11-04 23:21:06 +00:00
Rafael Espindola	49b8548903	Slightly saner handling of thumb branches. The generic infrastructure already did a lot of work to decide if the fixup value is know or not. It doesn't make sense to reimplement a very basic case: same fragment. llvm-svn: 252090	2015-11-04 23:00:39 +00:00
Quentin Colombet	421723cdd8	[x86] Teach the shrink-wrapping hooks to do the proper thing with Win64. Win64 has some strict requirements for the epilogue. As a result, we disable shrink-wrapping for Win64 unless the block that gets the epilogue is already an exit block. Fixes PR24193. llvm-svn: 252088	2015-11-04 22:37:28 +00:00
Eugene Zelenko	ffec81ca00	Fix some Clang-tidy modernize warnings, other minor fixes. Fixed warnings are: modernize-use-override, modernize-use-nullptr and modernize-redundant-void-arg. Differential revision: http://reviews.llvm.org/D14312 llvm-svn: 252087	2015-11-04 22:32:32 +00:00
Justin Bogner	c2b98f03db	PM: Rephrase PrintLoopPass as a wrapper around a new-style pass. NFC Splits PrintLoopPass into a new-style pass and a PrintLoopPassWrapper, much like we already do for PrintFunctionPass and PrintModulePass. llvm-svn: 252085	2015-11-04 22:24:08 +00:00
Cong Hou	23a3bf0147	Add new interfaces to MBB for manipulating successors with probabilities instead of weights. NFC. This is part-1 of the patch that replaces all edge weights in MBB by probabilities, which only adds new interfaces. No functional changes. Differential revision: http://reviews.llvm.org/D13908 llvm-svn: 252083	2015-11-04 21:37:58 +00:00
Simon Pilgrim	f669d381f9	Warning fix. llvm-svn: 252078	2015-11-04 21:27:22 +00:00
Sanjoy Das	a4bae3bb21	[IR] Add a `data_operand` abstraction Summary: Data operands of a call or invoke consist of the call arguments, and the bundle operands associated with the `call` (or `invoke`) instruction. The motivation for this change is that we'd like to be able to query "argument attributes" like `readonly` and `nocapture` for bundle operands naturally. This change also provides a conservative "implementation" for these attributes for any bundle operand, and an extension point for future work. Reviewers: chandlerc, majnemer, reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14305 llvm-svn: 252077	2015-11-04 21:05:24 +00:00
Simon Pilgrim	7e6606f4f1	[X86][SSE] Add general memory folding for (V)INSERTPS instruction This patch improves the memory folding of the inserted float element for the (V)INSERTPS instruction. The existing implementation occurs in the DAGCombiner and relies on the narrowing of a whole vector load into a scalar load (and then converted into a vector) to (hopefully) allow folding to occur later on. Not only has this proven problematic for debug builds, it also prevents other memory folds (notably stack reloads) from happening. This patch removes the old implementation and moves the folding code to the X86 foldMemoryOperand handler. A new private 'special case' function - foldMemoryOperandCustom - has been added to deal with memory folding of instructions that can't just use the lookup tables - (V)INSERTPS is the first of several that could be done. It also tweaks the memory operand folding code with an additional pointer offset that allows existing memory addresses to be modified, in this case to convert the vector address to the explicit address of the scalar element that will be inserted. Unlike the previous implementation we now set the insertion source index to zero, although this is ignored for the (V)INSERTPSrm version, anything that relied on shuffle decodes (such as unfolding of insertps loads) was incorrectly calculating the source address - I've added a test for this at insertps-unfold-load-bug.ll Differential Revision: http://reviews.llvm.org/D13988 llvm-svn: 252074	2015-11-04 20:48:09 +00:00
Sanjoy Das	b11b440f8e	[IR] Add bounds checking to paramHasAttr Summary: This is intended to make a later change simpler. Note: adding this bounds checking required fixing `X86FastISel`. As far I can tell I've preserved original behavior but a careful review will be appreciated. Reviewers: reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14304 llvm-svn: 252073	2015-11-04 20:33:45 +00:00
Andrew Kaylor	e41a8c4182	Created new X86 FMA3 opcodes (FMA_Int) that are used now for lowering of scalar FMA intrinsics. Patch by Slava Klochkov The key difference between FMA and FMA_Int opcodes is that FMA_Int opcodes are handled more conservatively. It is illegal to commute the 1st operand of FMA*_Int instructions as the upper bits of scalar FMA intrinsic result must be taken from the 1st operand, but such commute transformation would change those upper bits and invalidate the intrinsic's result. Reviewers: Quentin Colombet, Elena Demikhovsky Differential Revision: http://reviews.llvm.org/D13710 llvm-svn: 252060	2015-11-04 18:10:41 +00:00
James Molloy	e7d679cf4c	[ARM] Combine CMOV into BFI where possible If we have a CMOV, OR and AND combination such as: if (x & CN) y \|= CM; And: * CN is a single bit; * All bits covered by CM are known zero in y; Then we can convert this to a sequence of BFI instructions. This will always be a win if CM is a single bit, will always be no worse than the TST & OR sequence if CM is two bits, and for thumb will be no worse if CM is three bits (due to the extra IT instruction). llvm-svn: 252057	2015-11-04 16:55:07 +00:00
Teresa Johnson	f1b0a6e37c	[ThinLTO] Always set linkage type to external when converting alias When converting an alias to a non-alias when the aliasee is not imported, ensure that the linkage type is set to external so that it is a valid linkage type. Added a test case that exposed this issue. llvm-svn: 252054	2015-11-04 16:01:16 +00:00
James Molloy	4de84ddec9	[SimplifyCFG] Merge conditional stores We can often end up with conditional stores that cannot be speculated. They can come from fairly simple, idiomatic code: if (c & flag1) a = x; if (c & flag2) a = y; ... There is no dominating or post-dominating store to a, so it is not legal to move the store unconditionally to the end of the sequence and cache the intermediate result in a register, as we would like to. It is, however, legal to merge the stores together and do the store once: tmp = undef; if (c & flag1) tmp = x; if (c & flag2) tmp = y; if (c & flag1 \|\| c & flag2) *a = tmp; The real power in this optimization is that it allows arbitrary length ladders such as these to be completely and trivially if-converted. The typical code I'd expect this to trigger on often uses binary-AND with constants as the condition (as in the above example), which means the ending condition can simply be truncated into a single binary-AND too: 'if (c & (flag1\|flag2))'. As in the general case there are bitwise operators here, the ladder can often be optimized further too. This optimization involves potentially increasing register pressure. Even in the simplest case, the lifetime of the first predicate is extended. This can be elided in some cases such as using binary-AND on constants, but not in the general case. Threading 'tmp' through all branches can also increase register pressure. The optimization as in this patch is enabled by default but kept in a very conservative mode. It will only optimize if it thinks the resultant code should be if-convertable, and additionally if it can thread 'tmp' through at least one existing PHI, so it will only ever in the worst case create one more PHI and extend the lifetime of a predicate. This doesn't trigger much in LNT, unfortunately, but it does trigger in a big way in a third party test suite. llvm-svn: 252051	2015-11-04 15:28:04 +00:00
Filipe Cabecinhas	a2b0ac40cf	Error out when faced with value names containing '\0' Bug found with afl-fuzz. llvm-svn: 252048	2015-11-04 14:53:36 +00:00
Michael Kuperstein	a3b79dd783	[ELF] elfiamcu triple should imply e_machine == EM_IAMCU Differential Revision: http://reviews.llvm.org/D14109 llvm-svn: 252043	2015-11-04 11:21:50 +00:00
Michael Kuperstein	b34de72269	[X86] DAGCombine should not introduce FILD in soft-float mode The x86 "sitofp i64 to double" dag combine, in 32-bit mode, lowers sitofp directly to X86ISD::FILD (or FILD_FLAG). This should not be done in soft-float mode. llvm-svn: 252042	2015-11-04 11:17:53 +00:00
Philip Reames	aeefae0cc5	[LVI] Update a comment to clarify what's actually happening and why llvm-svn: 252033	2015-11-04 01:47:04 +00:00
Philip Reames	814fb60130	[CVP] Fold return values if possible In my previous change to CVP (251606), I made CVP much more aggressive about trying to constant fold comparisons. This patch is a reversal in direction. Rather than being agressive about every compare, we restore the non-block local restriction for most, and then try hard for compares feeding returns. The motivation for this is two fold: * The more I thought about it, the less comfortable I got with the possible compile time impact of the other approach. There have been no reported issues, but after talking to a couple of folks, I've come to the conclusion the time probably isn't justified. * It turns out we need to know the context to leverage the full power of LVI. In particular, asking about something at the end of it's block (the use of a compare in a return) will frequently get more precise results than something in the middle of a block. This is an implementation detail, but it's also hard to get around since mid-block queries have to reason about possible throwing instructions and don't get to use most of LVI's block focused infrastructure. This will become particular important when combined with http://reviews.llvm.org/D14263. Differential Revision: http://reviews.llvm.org/D14271 llvm-svn: 252032	2015-11-04 01:43:54 +00:00
Igor Laevsky	35fe692025	[StatepointLowering] Remove distinction between call and invoke safepoints There is no point in having invoke safepoints handled differently than the call safepoints. All relevant decisions could be made by looking at whether or not gc.result and gc.relocate lay in a same basic block. This change will allow to lower call safepoints with relocates and results in a different basic blocks. See test case for example. Differential Revision: http://reviews.llvm.org/D14158 llvm-svn: 252028	2015-11-04 01:16:10 +00:00
Alexey Samsonov	5365a01dc7	[LLVMSymbolize] Reduce indentation by using helper function. NFC. llvm-svn: 252022	2015-11-04 00:30:26 +00:00
Alexey Samsonov	884adda0fb	[LLVMSymbolize] Properly propagate object parsing errors from the library. llvm-svn: 252021	2015-11-04 00:30:24 +00:00
Adam Nemet	7c94c9bf07	Fix unused variable warning from r252017 llvm-svn: 252019	2015-11-04 00:10:33 +00:00
Adam Nemet	e54a4fa95d	LLE 6/6: Add LoopLoadElimination pass Summary: The goal of this pass is to perform store-to-load forwarding across the backedge of a loop. E.g.: for (i) A[i + 1] = A[i] + B[i] => T = A[0] for (i) T = T + B[i] A[i + 1] = T The pass relies on loop dependence analysis via LoopAccessAnalisys to find opportunities of loop-carried dependences with a distance of one between a store and a load. Since it's using LoopAccessAnalysis, it was easy to also add support for versioning away may-aliasing intervening stores that would otherwise prevent this transformation. This optimization is also performed by Load-PRE in GVN without the option of multi-versioning. As was discussed with Daniel Berlin in http://reviews.llvm.org/D9548, this is inferior to a more loop-aware solution applied here. Hopefully, we will be able to remove some complexity from GVN/MemorySSA as a consequence. In the long run, we may want to extend this pass (or create a new one if there is little overlap) to also eliminate loop-indepedent redundant loads and store that require versioning due to may-aliasing intervening stores/loads. I have some motivating cases for store elimination. My plan right now is to wait for MemorySSA to come online first rather than using memdep for this. The main motiviation for this pass is the 456.hmmer loop in SPECint2006 where after distributing the original loop and vectorizing the top part, we are left with the critical path exposed in the bottom loop. Being able to promote the memory dependence into a register depedence (even though the HW does perform store-to-load fowarding as well) results in a major gain (~20%). This gain also transfers over to x86: it's around 8-10%. Right now the pass is off by default and can be enabled with -enable-loop-load-elim. On the LNT testsuite, there are two performance changes (negative number -> improvement): 1. -28% in Polybench/linear-algebra/solvers/dynprog: the length of the critical paths is reduced 2. +2% in Polybench/stencils/adi: Unfortunately, I couldn't reproduce this outside of LNT The pass is scheduled after the loop vectorizer (which is after loop distribution). The rational is to try to reuse LAA state, rather than recomputing it. The order between LV and LLE is not critical because normally LV does not touch scalar st->ld forwarding cases where vectorizing would inhibit the CPU's st->ld forwarding to kick in. LoopLoadElimination requires LAA to provide the full set of dependences (including forward dependences). LAA is known to omit loop-independent dependences in certain situations. The big comment before removeDependencesFromMultipleStores explains why this should not occur for the cases that we're interested in. Reviewers: dberlin, hfinkel Subscribers: junbuml, dberlin, mssimpso, rengolin, sanjoy, llvm-commits Differential Revision: http://reviews.llvm.org/D13259 llvm-svn: 252017	2015-11-03 23:50:08 +00:00
Adam Nemet	397f5829c7	[LAA] LLE 5/6: Add predicate functions Dependence::isForward/isBackward, NFC Summary: Will be used by the LoopLoadElimination pass. Reviewers: hfinkel Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13258 llvm-svn: 252016	2015-11-03 23:50:03 +00:00

... 3 4 5 6 7 ...

84609 Commits