llvm-project

Commit Graph

Author	SHA1	Message	Date
Chandler Carruth	082c183f06	[PM] Teach SCEV to invalidate itself when its dependencies become invalid. This fixes use-after-free bugs that will arise with any interesting use of SCEV. I've added a dedicated test that works diligently to trigger these kinds of bugs in the new pass manager and also checks for them explicitly as well as triggering ASan failures when things go squirrelly. llvm-svn: 291426	2017-01-09 07:44:34 +00:00
Daniel Berlin	b755aea8eb	NewGVN: Fix PR 31573, a failure to verify memory congruency due to not excluding ourselves when checking if any equivalent stores exist. llvm-svn: 291421	2017-01-09 05:34:29 +00:00
Craig Topper	96ab6fd2eb	[AVX-512] Change another pattern that was using BLENDM to use masked moves. A future patch will convert it back to BLENDM if it's beneficial to register allocation. llvm-svn: 291419	2017-01-09 04:19:34 +00:00
Craig Topper	6393afce97	[AVX-512] Add patterns to use a zero masked VPTERNLOG instruction for vselects of all ones and all zeros. Previously we emitted a VPTERNLOG and a separate masked move. llvm-svn: 291415	2017-01-09 02:44:34 +00:00
Piotr Padlewski	09ad678bc4	[MemDep] NFC walk invariant.group graph only down Summary: By using stripPointerCasts we can get to the root value and then walk down the bitcast graph Reviewers: reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28181 llvm-svn: 291405	2017-01-08 22:26:06 +00:00
Craig Topper	f51ba1e3da	[AVX-512] If avx512dq is available use vpmovm2d/vpmovm2q instead of vselect of zeroes/ones when handling sign extends of i1 without VLX. llvm-svn: 291402	2017-01-08 21:32:30 +00:00
Craig Topper	0930a523cc	[X86] Add avx512bw and avx512dq command lines to the vector compare results test. This is preparation for improving a case with avx512dq. llvm-svn: 291401	2017-01-08 21:32:26 +00:00
Sanjay Patel	bf51c8a975	[x86] fix usage of stale operands when lowering select I noticed this problem as part of the ongoing attempt to canonicalize min/max ops in IR. The debug output shows nodes like this: t4: i32 = xor t2, Constant:i32<-1> t21: i8 = setcc t4, Constant:i32<0>, setlt:ch t14: i32 = select t21, t4, Constant:i32<-1> And because the select is holding onto the t4 (xor) node while EmitTest creates a new x86-specific xor node, the lowering results in: t4: i32 = xor t2, Constant:i32<-1> t25: i32,i32 = X86ISD::XOR t2, Constant:i32<-1> t28: i32,glue = X86ISD::CMOV Constant:i32<-1>, t4, Constant:i8<15>, t25:1 Differential Revision: https://reviews.llvm.org/D28374 llvm-svn: 291392	2017-01-08 15:53:40 +00:00
Simon Pilgrim	9c58950eeb	[CostModel][X86] Fixed vXi8 uniform shift costs. The 'fast' costs should only work for shifts by uniform constants (uniform non-constant are lowered using the slow default implementation). Logical shifts were not taking into account that we must mask the psrlw result, so the costs needed to be doubled. Added missing AVX2/AVX512BW costs as well. llvm-svn: 291391	2017-01-08 14:14:36 +00:00
Simon Pilgrim	1fa5487c05	[CostModel][X86] Moved legal uniform shift costs earlier. XOP was prematurely matching, doubling the cost of ashr/lshr uniform shifts. llvm-svn: 291390	2017-01-08 13:12:03 +00:00
Dylan McKay	8fa6d8db9c	[AVR] Implement TargetLowering::getRegisterByName This allows the use of the 'read_register' intrinsics used by clang's named register globals features. llvm-svn: 291375	2017-01-07 23:39:47 +00:00
Simon Pilgrim	9681c407b4	[CostModel][X86] Update SSE41/AVX1 vXi32 SHL costs SSE41 provides pmulld which allows the simpler pslld/paddd/cvttps2dq/pmulld pattern than SSE2's use of pmuludq. llvm-svn: 291372	2017-01-07 22:27:43 +00:00
Craig Topper	a74e3088df	[AVX-512] Remove patterns from the other VBLENDM instructions. They are all redundant with masked move instructions. We should probably teach the two address instruction pass to turn masked moves into BLENDM when it's beneficial to the register allocator. llvm-svn: 291371	2017-01-07 22:20:34 +00:00
Craig Topper	2d02d4926b	[X86] Regenerate a test to remove tab characters. llvm-svn: 291370	2017-01-07 22:20:28 +00:00
Craig Topper	da84ff3ed4	[AVX-512] Add masked forms of the alternate MOVDDUP patterns. I'm not too sure how to get isel to select even all of the unmasked forms, but at least we have a consistent set now. llvm-svn: 291368	2017-01-07 22:20:23 +00:00
Simon Pilgrim	a470296367	[CostModel][X86] Fix AVX2 v16i16 shift 'splat' costs. llvm-svn: 291366	2017-01-07 22:08:09 +00:00
Simon Pilgrim	82e3e05fe2	[CostModel][X86] Match 256-bit vector shift 'splat' costs for AVX2 and above We were matching against general vector shift costs before the uniform splat costs llvm-svn: 291365	2017-01-07 21:47:10 +00:00
Simon Pilgrim	935beac173	[X86][AVX2] Regenerate arithmetic tests Fixed missing checks for tests that used a '-' in the name, which was messing with update_llc_test_checks.py llvm-svn: 291363	2017-01-07 20:38:36 +00:00
Mehdi Amini	d5549f3dac	[ThinLTO] Fix assertions on lazy-loading of Metadata TBAA attachments Summary: The issue happens with: %0 = ....., !tbaa !0 %1 = ....., !tbaa !1 With !0 that references !1. In this case when loading !0 we generate a temporary for the operand !1. We now flush it immediately and trigger the load of !1 before moving on. If we don't we get the temporary when attaching to %1. This is usually not an issue except that we eagerly try to update TBAA MDNodes, which is obviously not possible if we only have a temporary. Differential Revision: https://reviews.llvm.org/D28423 llvm-svn: 291362	2017-01-07 20:24:23 +00:00
Hal Finkel	ec85fc5eac	[llvm-opt-report] Fix context-sensitive lines where nothing happened Don't print a line multiple times, each for different inlining contexts, if nothing happened in any context. This prevents situations like this: [[ > main: 65 | if ((i * ni + j) % 20 == 0) fprintf > print_array: 65 | if ((i * ni + j) % 20 == 0) fprintf ]] which could happen if different optimizations were missed in different inlining contexts. llvm-svn: 291361	2017-01-07 20:21:17 +00:00
Matt Arsenault	a7d2194168	SimplifyLibCalls: Remove incorrect optimization of fabs fabs(x * x) is not generally safe to assume x is positive if x is a NaN. This is also less general than it could be, so this will be replaced with a transformation on the intrinsic. llvm-svn: 291359	2017-01-07 19:55:12 +00:00
Simon Pilgrim	a4109d6433	[CostModel][AVX512BW] Add v32i16 vector shift costs for avx512bw targets. llvm-svn: 291354	2017-01-07 17:54:10 +00:00
Daniel Berlin	32f8d560dd	NewGVN: Make sure we properly lookup operand leaders while creating congruence classes for stores, and then keep them up to date. Add testcases. llvm-svn: 291351	2017-01-07 16:55:14 +00:00
Simon Pilgrim	a1b8e2c725	[X86][AVX512] Use lowerShuffleAsRepeatedMaskAndLanePermute for non-VBMI v64i8 shuffles (PR31470) llvm-svn: 291347	2017-01-07 15:37:50 +00:00
Dan Gohman	0e2ceb8121	[WebAssembly] Don't abort on code with UB. Gracefully leave code that performs function-pointer bitcasts implying non-trivial pointer conversions alone, rather than aborting, since it's just undefined behavior. llvm-svn: 291326	2017-01-07 01:50:01 +00:00
Dan Gohman	1b637458f6	[WebAssembly] Add a pass to create wrappers for function bitcasts. WebAssembly requires caller and callee signatures to match exactly. In LLVM, there are a variety of circumstances where signatures may be mismatched in practice, and one can bitcast a function address to another type to call it as that type. This patch adds a pass which replaces bitcasted function addresses with wrappers to replace the bitcasts. This doesn't catch everything, but it does match many common cases. llvm-svn: 291315	2017-01-07 00:34:54 +00:00
Daniel Berlin	d92e7f9f74	NewGVN: Fix PR 31501. Summary: LLVM's non-standard notion of phi nodes means we can't both try to substitute for undef in phi nodes and use phi nodes as leaders all the time. This changes NewGVN to use the same semantics as SimplifyPHINode to decide which phi nodes are equivalent. Reviewers: davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28312 llvm-svn: 291308	2017-01-07 00:01:42 +00:00
Teresa Johnson	9006d52651	[ThinLTO] Handle conflicting local names gracefully Summary: r285871 introduced an assert that was overly aggressive in the case of a same-named local in different same-named files (in different directories), where the source name and therefore the GUID ended up the same because the files were compiled in their own directory without any leading path. Change the handling in the promotion logic to get the summary for the version in that module. This also exposed an issue where we are not always importing the right copy, which is a performance not correctness issue (because the renaming is based on the module hash which must be different, see the bug report for details). I will fix that as a follow-on. Fixes PR31561. Reviewers: mehdi_amini Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28411 llvm-svn: 291304	2017-01-06 23:38:41 +00:00
David Majnemer	63da0c238b	[InstSimplify] Optimize away udivs in the presence of range metadata We know that udiv %V, C can be optimized away to 0 if %V is ult C. llvm-svn: 291296	2017-01-06 22:58:02 +00:00
Kuba Mracek	5a2f078ee2	Follow-up for r291289: Fix failing global_metadata_darwin.ll test llvm-svn: 291292	2017-01-06 22:22:22 +00:00
Michal Gorny	d1b954884c	[llvm-config] Print --system-libs only when static linking Modify the --system-libs option in llvm-config to print system libs only when using static linking. The system libraries are irrelevant when linking to a shared library since the library has appropriate library dependencies embedded. Modify the --system-libs test appropriately to force static linking, and disable it if static libs are not available (i.e. BUILD_SHARED_LIBS is enabled). Differential Revision: https://reviews.llvm.org/D27805 llvm-svn: 291285	2017-01-06 21:33:54 +00:00
Michal Gorny	9283f5b200	[cmake] Canonicalize CMake booleans to 0/1 for lit interop Canonicalize all CMake booleans to 0/1 before passing them to lit, to ensure that the Python side handles all of them consistently and correctly. 0/1 is a safe choice of values that trigger the same boolean interpretation in CMake, Python and C++. Furthermore, using them without quotes improves the chance Python will explicitly fail when an incorrect value (such as ON/OFF, TRUE/FALSE, YES/NO) is accidentally passed, rather than silently misinterpreting the value. This replaces a lot of different logics spread around lit site files, attempting to partially reproduce the boolean logic used in CMake and usually silently failing when an uncommon value was used instead. In fact, some of them were never working correctly since different values were assigned in CMake and checked in Python. The alternative solution could be to create a common parser for CMake booleans in lit and use it consistently throughout the site files. However, it does not seem like the best idea to create redundant implementation of the same logic and have to follow upstream if it ever is extended to handle more values. Differential Revision: https://reviews.llvm.org/D28294 llvm-svn: 291284	2017-01-06 21:33:48 +00:00
Michal Gorny	82eb45a6f8	[test] Remove unused 'test_examples' config var Remove config.test_examples from lit.site.cfg and the relevant ENABLE_EXAMPLES definition from CMake. It is not used anywhere. Differential Revision: https://reviews.llvm.org/D28283 llvm-svn: 291283	2017-01-06 21:33:39 +00:00
David Majnemer	8c0e62f507	[InstSimplify] Optimize away urems in the presence of range metadata We know that urem %V, C can be optimized away to %V if %V is ult C. llvm-svn: 291282	2017-01-06 21:23:51 +00:00
Mehdi Amini	27d224fbbb	Fix LoopLoadElimination to keep original alignment on the initial hoisted store This is fixing a bug where Loop Vectorization is widening a load but with a lower alignment. Hoisting the load without propagating the alignment will allow inst-combine to later deduce a higher alignment than what the pointer actually has. Differential Revision: https://reviews.llvm.org/D28408 llvm-svn: 291281	2017-01-06 21:06:51 +00:00
Jan Vesely	06200bd7bc	AMDGPU/R600: Don't use REGISTER_{LOAD,STORE} ISD nodes This will make transition to SCRATCH_MEMORY easier Differential Revision: https://reviews.llvm.org/D24746 llvm-svn: 291279	2017-01-06 21:00:46 +00:00
Simon Pilgrim	08519d7b02	[X86][SSE] Standardized triples in vector shift tests Made no sense for them to be different and caused useless diffs in assembly remarks. llvm-svn: 291274	2017-01-06 19:56:57 +00:00
Simon Pilgrim	9cbcc5ff0b	[CostModel][X86] Add AVX512 and 512-bit vector shift cost tests. llvm-svn: 291269	2017-01-06 19:41:26 +00:00
Matthias Braun	258b847c4f	AArch64CollectLOH: Rewrite as block-local analysis. Re-apply r288561: This time with a fix where the ADDs that are part of a 3 instruction LOH would not invalidate the "LastAdrp" state. This fixes http://llvm.org/PR31361 Previously this pass was using up to 5% compile time in some cases which is a bit much for what it is doing. The pass featured a full blown data-flow analysis which in the default configuration was restricted to a single block. This rewrites the pass under the assumption that we only ever work on a single block. This is done in a single pass maintaining a state machine per general purpose register to catch LOH patterns. Differential Revision: https://reviews.llvm.org/D27329 This reverts commit 9e6cedb0a4f14364d6511597a9160305e7d34493. llvm-svn: 291266	2017-01-06 19:22:01 +00:00
Sanjay Patel	2715d92389	[InstCombine] add a vector version of a test added in r291262; NFC llvm-svn: 291265	2017-01-06 19:14:05 +00:00
Sanjay Patel	8d4aa10960	[InstCombine] move and add tests for icmp + shl nsw; NFC As discussed here: http://lists.llvm.org/pipermail/llvm-dev/2017-January/108749.html ...we should be able to better optimize this pattern. llvm-svn: 291262	2017-01-06 18:57:54 +00:00
Wolfgang Pieb	c17a279eda	[DWARF] Null out the debug locs of (loop invariant) instructions hoisted by LICM in order to avoid jumpy line tables. Calls are left alone because they may be inlined. Differential Revision: https://reviews.llvm.org/D28390 llvm-svn: 291258	2017-01-06 18:38:57 +00:00
Chad Rosier	e177185e79	[AArch64] Reduce vector insert/extract cost for Falkor. Differential Revision: https://reviews.llvm.org/D28403 llvm-svn: 291254	2017-01-06 18:03:26 +00:00
Konstantin Zhuravlyov	67a6d5401a	[AMDGPU] Do not emit .AMDGPU.config section for amdhsa Differential Revision: https://reviews.llvm.org/D27732 llvm-svn: 291245	2017-01-06 17:02:10 +00:00
Simon Pilgrim	9122793b15	[X86][AVX] Regenerate shuffle 128-bit tests. The EVEX -> VEX fix means that AVX/AVX512 code is more likely the same now. llvm-svn: 291242	2017-01-06 15:56:52 +00:00
Simon Pilgrim	10cc5d555f	[X86][AVX] Regenerate tzcnt tests. The EVEX -> VEX fix means that AVX/AVX512 code is more likely the same now. llvm-svn: 291241	2017-01-06 15:54:23 +00:00
Filipe Cabecinhas	4647b74b51	[ASan] Make ASan instrument variable-masked loads and stores Summary: Previously we only supported constant-masked loads and stores. Reviewers: kcc, RKSimon, pgousseau, gbedwell, vitalybuka Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28370 llvm-svn: 291238	2017-01-06 15:24:51 +00:00
Simon Pilgrim	d8333372bc	[CostModel][X86] Fix 512-bit SDIV/UDIV 'big' costs. Set the costs on the lowest target that supports the type. llvm-svn: 291229	2017-01-06 11:12:53 +00:00
Simon Pilgrim	441d1d35d2	[CostModel][X86] Add SDIV/UDIV cost tests for a wider range of targets Added a test demonstrating bug in AVX512 division costs llvm-svn: 291228	2017-01-06 11:02:40 +00:00
Daniel Jasper	965d802ec7	Move test input to directory called Inputs. It is a common convention that our internal test runner depends upon. llvm-svn: 291227	2017-01-06 10:22:15 +00:00
Craig Topper	e86fb932ea	[AVX-512] Add EXTRACT_SUBVECTOR support to combineBitcastForMaskedOp. llvm-svn: 291214	2017-01-06 05:18:48 +00:00
Craig Topper	8cbac879db	[AVX-512] Add more masked vector extract test cases with and without a bitcast between the select. The ones with the bitcast need additional work to fold the mask operation properly. This will be fixed in a future commit. llvm-svn: 291213	2017-01-06 05:18:44 +00:00
Peter Collingbourne	81271b7bd2	LowerTypeTests: Split the pass in two: a resolution phase and a lowering phase. This change separates how type identifiers are resolved from how intrinsic calls are lowered. All information required to lower an intrinsic call is stored in a new TypeIdLowering data structure. The idea is that this data structure can either be initialized using the module itself during regular LTO, or using the module summary in ThinLTO backends. Differential Revision: https://reviews.llvm.org/D28341 llvm-svn: 291205	2017-01-06 02:22:47 +00:00
David Majnemer	eaba06cffa	[SelectionDAG] Correctly transform range metadata to AssertZExt We used the logBase2 of the high instead of the ceilLogBase2 resulting in the wrong result for certain values. For example, it resulted in an i1 AssertZExt when the exclusive portion of the range was 3. llvm-svn: 291196	2017-01-06 00:11:46 +00:00
Geoff Berry	d46b6e8096	[AArch64] Fold some filled/spilled subreg COPYs Summary: Extend AArch64 foldMemoryOperandImpl() to handle folding spills of subreg COPYs with read-undef defs like: %vreg0:sub_32<def,read-undef> = COPY %WZR; GPR64:%vreg0 by widening the spilled physical source reg and generating: STRXui %XZR <fi#0> as well as folding fills of similar COPYs like: %vreg0:sub_32<def,read-undef> = COPY %vreg1; GPR64:%vreg0, GPR32:%vreg1 by generating: %vreg0:sub_32<def,read-undef> = LDRWui <fi#0> Reviewers: MatzeB, qcolombet Subscribers: aemerson, rengolin, mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D27425 llvm-svn: 291180	2017-01-05 21:51:42 +00:00
Teresa Johnson	6c475a7595	ThinLTO: add early "dead-stripping" on the Index Summary: Using the linker-supplied list of "preserved" symbols, we can compute the list of "dead" symbols, i.e. the ones that are not reachable from a "preserved" symbol transitively on the reference graph. Right now we are using this information to mark these functions as non-eligible for import. The impact is twofold: - Reduction of compile time: we don't import these functions anywhere or import the functions these symbols are calling. - The limited number of import/export leads to better internalization. Patch originally by Mehdi Amini. Reviewers: mehdi_amini, pcc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D23488 llvm-svn: 291177	2017-01-05 21:34:18 +00:00
Joerg Sonnenberger	83963995c6	PR 31534: When emitting both DWARF unwind tables and debug information, do not use .cfi_sections. This requires checking if any non-declaration function in the module needs an unwind table. llvm-svn: 291172	2017-01-05 20:55:28 +00:00
Michael Kuperstein	c9acad12e9	[LICM] Allow promotion of some stores that are not guaranteed to execute. Promotion is always legal when a store within the loop is guaranteed to execute. However, this is not a necessary condition - for promotion to be memory model semantics-preserving, it is enough to have a store that dominates every exit block. This is because if the store dominates every exit block, the fact the exit block was executed implies the original store was executed as well. Differential Revision: https://reviews.llvm.org/D28147 llvm-svn: 291171	2017-01-05 20:42:06 +00:00
Matthias Braun	1172332203	CodeGen: Assert that liveness is up to date when reading block live-ins. Add an assert that checks whether liveins are up to date before they are used. - Do not print liveins into .mir files anymore in situations where they are out of date anyway. - The assert in the RegisterScavenger is superseded by the new one in livein_begin(). - Skip parts of the liveness updating logic in IfConversion.cpp when liveness isn't tracked anymore (just enough to avoid hitting the new assert()). Differential Revision: https://reviews.llvm.org/D27562 llvm-svn: 291169	2017-01-05 20:01:19 +00:00
Sanjay Patel	686527c1e0	[x86] add test to show bug in select lowering; NFC llvm-svn: 291151	2017-01-05 18:35:44 +00:00
Simon Pilgrim	b01e844241	[CostModel][X86] Include the cost of 256-bit upper subvector extract/insertion in AVX1 v4i64 MUL Matches other MUL/ADD/SUB 256-bit case on AVX1 llvm-svn: 291149	2017-01-05 18:20:25 +00:00
Chad Rosier	e20a3a4831	[AArch64][CostModel] Add coverage for bswap intrinsics. llvm-svn: 291140	2017-01-05 16:55:32 +00:00
Zvi Rackover	b10f7de3b5	[X86] Add test cases that cover pr31551. NFC. llvm-svn: 291127	2017-01-05 16:48:28 +00:00
Simon Pilgrim	bca02f9e20	[CostModel][X86] Add support for broadcast shuffle costs Currently only for broadcasts with input and output of the same width. Differential Revision: https://reviews.llvm.org/D27811 llvm-svn: 291122	2017-01-05 15:56:08 +00:00
Zvi Rackover	4b7d724d62	[X86] Optimize vector shifts with variable but uniform shift amounts Summary: For instructions such as PSLLW/PSLLD/PSLLQ a variable shift amount may be passed in an XMM register. The lower 64-bits of the register are evaluated to determine the shift amount. This patch improves the construction of the vector containing the shift amount. Reviewers: craig.topper, delena, RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28353 llvm-svn: 291120	2017-01-05 15:11:43 +00:00
Chad Rosier	3ccd1dffff	[AArch64] Remove mcpu option as this test is not target specific. NFC. llvm-svn: 291117	2017-01-05 15:05:03 +00:00
Tony Jiang	3a2f00b024	[PowerPC] Implement missing ISA 2.06 instructions. Instructions: fctidu[.], fctiwu[.], ftdiv, ftsqrt are not implemented. Implement them and add corresponding test cases in this patch. llvm-svn: 291116	2017-01-05 15:00:45 +00:00
Chad Rosier	e1dc73d9a7	[AArch64] Remove unused arguments from tests. NFC. llvm-svn: 291112	2017-01-05 14:48:53 +00:00
Teresa Johnson	519465b993	[ThinLTO] Subsume all importing checks into a single flag Summary: This adds a new summary flag NotEligibleToImport that subsumes several existing flags (NoRename, HasInlineAsmMaybeReferencingInternal and IsNotViableToInline). It also subsumes the checking of references on the summary that was being done during the thin link by eligibleForImport() for each candidate. It is much more efficient to do that checking once during the per-module summary build and record it in the summary. Reviewers: mehdi_amini Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28169 llvm-svn: 291108	2017-01-05 14:32:16 +00:00
Mohammed Agabaria	23599ba794	Currently isLikelyComplexAddressComputation tries to figure out if the given stride seems to be 'complex' and need some extra cost for address computation handling. This code seems to be target dependent which may not be the same for all targets. Passed the decision whether the given stride is complex or not to the target by sending stride information via SCEV to getAddressComputationCost instead of 'IsComplex'. Specifically at X86 targets we don't see any significant address computation cost in case of the strided access in general. Differential Revision: https://reviews.llvm.org/D27518 llvm-svn: 291106	2017-01-05 14:03:41 +00:00
Kristof Beyls	a983e7c4a4	[GlobalISel] Add support for address-taken basic blocks To make this work, pointers from the MachineBasicBlock to the LLVM-IR-level basic blocks need to be initialized, as the AsmPrinter uses this link to be able to print out labels for the basic blocks that are address-taken. Most of the changes in this commit are about adapting existing tests to include the basic block name that is now printed out in the MIR format, now that the name becomes available as the link to the LLVM-IR basic block is initialized. The relevant test change for the functionality added in this patch are the added "(address-taken)" strings in test/CodeGen/AArch64/GlobalISel/arm64-irtranslator.ll. Differential Revision: https://reviews.llvm.org/D28123 llvm-svn: 291105	2017-01-05 13:27:52 +00:00
Kristof Beyls	eced071e88	[GlobalISel] Add support for switch statements This commit does this using a trivial chain of conditional branches. In the future, we probably want to reuse the optimized switch lowering used in SelectionDAG. Differential Revision: https://reviews.llvm.org/D28176 llvm-svn: 291099	2017-01-05 11:28:51 +00:00
Kristof Beyls	2252440b81	[GlobalISel] Fix AArch64 ICMP instruction selection Differential Revision: https://reviews.llvm.org/D28175 llvm-svn: 291097	2017-01-05 10:16:08 +00:00
Elena Demikhovsky	143cbc425b	AVX-512: Optimized pattern for truncate with unsigned saturation. DAG patterns optimization: truncate + unsigned saturation supported by VPMOVUS* instructions in AVX-512. Differential revision: https://reviews.llvm.org/D28216 llvm-svn: 291092	2017-01-05 08:21:09 +00:00
Saleem Abdulrasool	9b9e86b4bd	test: remove unnecessary triple argument This test is entirely target agnostic. Avoid the triple to repair the build bots. llvm-svn: 291088	2017-01-05 06:30:12 +00:00
Saleem Abdulrasool	6252bd8eac	MC: support passing search paths to the IAS This is needed to support inclusion in inline assembly via the `.include` directive. llvm-svn: 291085	2017-01-05 05:56:39 +00:00
Peter Collingbourne	b2ce2b6805	IR: Module summary representation for type identifiers; summary test scaffolding for lowertypetests. Set up basic YAML I/O support for module summaries, plumb the summary into the pass and add a few command line flags to test YAML I/O support. Bitcode support to come separately, as will the code in LowerTypeTests that actually uses the summary. Also add a couple of tests that pass by virtue of the pass doing nothing with the summary (which happens to be the correct thing to do for those tests). Differential Revision: https://reviews.llvm.org/D28041 llvm-svn: 291069	2017-01-05 03:39:00 +00:00
Craig Topper	eea52429cd	[AVX-512] Update vextract64x4 intrinsic upgrade test cases to use a legal immediate so they test the instruction selection correctly. llvm-svn: 291061	2017-01-05 01:34:55 +00:00
Mehdi Amini	87ea8c60a6	Mark test that is testing statistics output as requiring Assertions We only enable statistic in an assert build by default. llvm-svn: 291044	2017-01-05 01:08:01 +00:00
Sanjay Patel	95faecb766	[InstSimplify] add tests to show missing select simplifications; NFC llvm-svn: 291043	2017-01-05 00:40:52 +00:00
Wolfgang Pieb	ce13e716c5	[DWARF] Null out the debug locs of load instructions that have been moved by GVN performing partial redundancy elimination (PRE). Not doing so can cause jumpy line tables and confusing (though correct) source attributions. Differential Revision: https://reviews.llvm.org/D27857 llvm-svn: 291037	2017-01-04 23:58:26 +00:00
Mehdi Amini	19ef4fad91	Use lazy-loading of Metadata in MetadataLoader when importing is enabled (NFC) Summary: This is a relatively simple scheme: we use the index emitted in the bitcode to avoid loading all the global metadata. Instead we load the index with their position in the bitcode so that we can load each of them individually. Materializing the global metadata block in this condition only triggers loading the named metadata, and the ones referenced from there (transitively). When materializing a function, metadata from the global block are loaded lazily as they are referenced. Two main current limitations are: 1) Global values other than functions are not materialized on demand, so we need to eagerly load METADATA_GLOBAL_DECL_ATTACHMENT records (and their transitive dependencies). 2) When we load a single metadata, we don't recurse on the operands, instead we use a placeholder or a temporary metadata. Unfortunately temporary nodes are very expensive. This is why we don't have it always enabled and only for importing. These two limitations can be lifted in a subsequent improvement if needed. With this change, the total link time of opt with ThinLTO and Debug Info enabled is going down from 282s to 224s (~20%). Reviewers: pcc, tejohnson, dexonsmith Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28113 llvm-svn: 291027	2017-01-04 22:54:33 +00:00
Matt Arsenault	3bdd75d01e	InstCombine: Fold cos(-x) -> cos(x) Also cos(fabs(x)) -> cos(x) llvm-svn: 291022	2017-01-04 22:49:03 +00:00
Tim Shen	5480eb8445	[Legalizer] Fix fp-to-uint to fp-to-sint promotion assertion. Summary: When promoting fp-to-uint16 to fp-to-sint32, the result is actually zero extended. For example, given double 65534.0, without legalization: fp-to-uint16: 65534.0 -> 0xfffe With the legalization: fp-to-sint32: 65534.0 -> 0x0000fffe Without this patch, legalization wrongly emits a signed extend assertion, which is consumed by a later icmp instruction, and causes a miscompile. Note that the floating point value must be in [0, 65535), otherwise the behavior is undefined. This patch reverts r279223 behavior and adds more tests and documentation. In PR29041's context, James Molloy mentioned that: We don't need to mask because conversion from float->uint8_t is undefined if the integer part of the float value is not representable in uint8_t. Therefore we can assume this doesn't happen! which is totally true and good, because fptoui is documented clearly to have undefined behavior when overflow/underflow happens. We should take advantage of this behavior so that we can save unnecessary mask instructions. Reviewers: jmolloy, nadav, echristo, kbarton Subscribers: mehdi_amini, nemanjai, llvm-commits Differential Revision: https://reviews.llvm.org/D28284 llvm-svn: 291015	2017-01-04 22:11:42 +00:00
Evgeny Stupachenko	c88697dc16	The patch fixes (base, index, offset) match. Summary: Instead of matching: (a + i) + 1 -> (a + i, undef, 1) Now it matches: (a + i) + 1 -> (a, i, 1) Reviewers: rengolin Differential Revision: http://reviews.llvm.org/D26367 From: Evgeny Stupachenko <evstupac@gmail.com> llvm-svn: 291012	2017-01-04 21:43:39 +00:00
Chad Rosier	63687e40bc	[AArch64] Update the feature set for Qualcomm's Falkor CPU. llvm-svn: 291010	2017-01-04 21:26:23 +00:00
Michael Kuperstein	f381f35977	Add positive test for sqrt "partial inlining". NFC. llvm-svn: 291009	2017-01-04 21:24:56 +00:00
Nirav Dave	0f9d111f97	[AArch64] Fix over-eager early-exit in load-store combiner Fix early-exit analysis for memory operation pairing when operations are not emitted in ascending order. Reviewers: mcrosier, t.p.northover Subscribers: aemerson, rengolin, llvm-commits Differential Revision: https://reviews.llvm.org/D28251 llvm-svn: 291008	2017-01-04 21:21:46 +00:00
Michael Kuperstein	020af9c258	Remove accidentally target-dependent test and pacify bots. llvm-svn: 291004	2017-01-04 21:08:53 +00:00
Hal Finkel	b2f951d87a	[PowerPC] Fix logic dealing with nop after calls (and tail-call eligibility) This change aims to unify and correct our logic for when we need to allow for the possibility of the linker adding a TOC restoration instruction after a call. This comes up in two contexts: 1. When determining tail-call eligibility. If we make a tail call (i.e. directly branch to a function) then there is no place for the linker to add a TOC restoration. 2. When determining when we need to add a nop instruction after a call. Likewise, if there is a possibility that the linker might need to add a TOC restoration after a call, then we need to put a nop after the call (the bl instruction). First problem: We were using similar, but different, logic to decide (1) and (2). This is just wrong. Both the resideInSameModule function (used when determining tail-call eligibility) and the isLocalCall function (used when deciding if the post-call nop is needed) were supposed to be determining the same underlying fact (i.e. might a TOC restoration be needed after the call). The same logic should be used in both places. Second problem: The logic in both places was wrong. We only know that two functions will share the same TOC when both functions come from the same section of the same object. Otherwise the linker might cause the functions to use different TOC base addresses (unless the multi-TOC linker option is disabled, in which case only shared-library boundaries are relevant). There are a number of factors that can cause functions to be placed in different sections or come from different objects (-ffunction-sections, explicitly-specified section names, COMDAT, weak linkage, etc.). All of these need to be checked. The existing logic only checked properties of the callee, but the properties of the caller must also be checked (for example, calling from a function in a COMDAT section means calling between sections). There was a conceptual error in the resideInSameModule function in that it allowed tail calls to functions with weak linkage and protected/hidden visibility. While protected/hidden visibility does prevent the function implementation from being replaced at runtime (via interposition), it does not prevent the linker from using an alternate implementation at link time (i.e. using some strong definition to replace the provided weak one during linking). If this happens, then we're still potentially looking at a required TOC restoration upon return. Otherwise, in general, the post-call nop is needed wherever ELF interposition needs to be supported. We don't currently support ELF interposition at the IR level (see http://lists.llvm.org/pipermail/llvm-dev/2016-November/107625.html for more information), and I don't think we should try to make it appear to work in the backend in spite of that fact. Unfortunately, because of the way that the ABI works, we need to generate code as if we supported interposition whenever the linker might insert stubs for the purpose of supporting it. Differential Revision: https://reviews.llvm.org/D27231 llvm-svn: 291003	2017-01-04 21:05:13 +00:00
Michael Kuperstein	fc74da13a9	Add positive test for sqrt "partial inlining". NFC. llvm-svn: 291001	2017-01-04 20:48:30 +00:00
Tobias Grosser	9d88b858c8	Add missing CHECK: line to test case added in 29097 Without this CHECK line, we may not detect incorrectly detected additional regions at the end of the region tree. llvm-svn: 290994	2017-01-04 19:35:38 +00:00
Jan Vesely	d48445d513	AMDGPU/SI: Implement sendmsghalt intrinsic v2: expose using amdgcn prefix Differential Revision: https://reviews.llvm.org/D23511 llvm-svn: 290977	2017-01-04 18:06:55 +00:00
Tobias Grosser	8ab80ba3a2	RegionInfo: add new test case This test case has been reduced from test/Analysis/RegionInfo/mix_1.ll and provides us with a minimal example of a test case which caused problems while working on an improved version of the RegionInfo analysis. We upstream this test case, as it certainly can be helpful in future debugging and optimization tests. Test case reduced by Pratik Bhatu <cs12b1010@iith.ac.in> llvm-svn: 290974	2017-01-04 17:50:15 +00:00
Robert Lougher	5bf0416f45	Reapply "[SimplifyCFG] In sinkLastInstruction correctly set debugloc of common inst" This reapplies r289828 (reverted in r289833 as it broke the address sanitizer). The debugloc is now only set when the instruction is not a call, as this causes the verifier to assert (the inliner requires an inlinable callsite to have a debug loc if the caller and callee have debug info). Original commit message: Simplify CFG will try to sink the last instruction in a series of basic blocks, creating a "common" instruction in the successor block (sinkLastInstruction). When it does this, the debug location of the single instruction should be the merged debug locations of the commoned instructions. Original review: https://reviews.llvm.org/D27590 llvm-svn: 290973	2017-01-04 17:40:32 +00:00
Simon Pilgrim	6cfb5caf05	Revert r290970 [SLPVectorizer] Regenerate test. The check script will use var names before they are declared, which filecheck doesn't like. llvm-svn: 290971	2017-01-04 16:12:07 +00:00
Simon Pilgrim	4629b46bba	[SLPVectorizer] Regenerate test. Missed var name llvm-svn: 290970	2017-01-04 16:01:55 +00:00
Simon Pilgrim	1d5b0377af	Regenerate test. llvm-svn: 290969	2017-01-04 15:52:41 +00:00
Asiri Rathnayake	9670051657	Fix x86 gold tests on non-x86 targets. These tests are missing a target triple and the -m elf_x86_64 gold option, which makes them fail on non-x86 targets. Differential revision: https://reviews.llvm.org/D28285 Reviewers: tejohnson llvm-svn: 290965	2017-01-04 14:43:51 +00:00
Teresa Johnson	0fca905cb3	[ThinLTO] Rework llvm-link to use the FunctionImporter Summary: Change llvm-link to use the FunctionImporter handling, instead of manually invoking the Linker. We still need to load the module in llvm-link to do the desired testing for invalid import requests (weak functions), and to get the GUID (in case the function is local). Also change the drop-debug-info test to use llvm-link so that importing is forced (in order to test debug info handling) and independent of import logic changes. Reviewers: mehdi_amini Subscribers: mgorny, llvm-commits, aprantl Differential Revision: https://reviews.llvm.org/D28277 llvm-svn: 290964	2017-01-04 14:27:31 +00:00
Davide Italiano	db00939403	[SPARC] Fix test so that it checks the correct label. Before it wasn't checking anything. llvm-svn: 290963	2017-01-04 14:01:58 +00:00
Simon Pilgrim	bb895f3e9c	[CostModel][X86] Updated vXi8 and vXi16 Reverse/Alternate shuffle costs Actual codegen is much better than the extract+insert patterns that was assumed. llvm-svn: 290962	2017-01-04 14:01:33 +00:00
Davide Italiano	039368e2d2	[MC/COFF] Fix a test to actually check the relocation. Inspired by r290953 + grep -R 'CHCEK'. llvm-svn: 290958	2017-01-04 13:12:00 +00:00
Florian Hahn	5815f6c53c	[framelowering] Skip dbg values when getting next/previous instruction. Summary: In mergeSPUpdates, debug values need to be ignored when getting the previous element, otherwise debug data could have an impact on codegen. In eliminateCallFramePseudoInstr, debug values after the erased element could have an impact on codegen and should be skipped. Closes PR31319 (https://llvm.org/bugs/show_bug.cgi?id=31319) Reviewers: aprantl, MatzeB, mkuper Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D27688 llvm-svn: 290955	2017-01-04 12:08:35 +00:00
Chandler Carruth	96809ae7ea	[Inliner] Fix a test where I typo'ed 'CHECK' as 'CHCEK' when converting to FileCheck. Fortunately, it passes. =] Spotted in review by Bob Wilson! llvm-svn: 290953	2017-01-04 11:15:01 +00:00
Simon Pilgrim	c76ea4b638	[X86] Attempt to pre-truncate arithmetic operations if useful In some cases its more efficient to combine TRUNC( BINOP( X, Y ) ) --> BINOP( TRUNC( X ), TRUNC( Y ) ) if the binop is legal for the truncated types. This is true for vector integer multiplication (especially vXi64), as well as ADD/AND/XOR/OR in cases where we only need to truncate one of the inputs at runtime (e.g. a duplicated input or an one use constant we can fold). Further work could be done here - scalar cases (especially i64) could often benefit (if we avoid partial registers etc.), other opcodes, and better analysis of when truncating the inputs reduces costs. I have considered implementing this for all targets within the DAGCombiner but wasn't sure we could devise a suitable cost model system that would give us the range we need. Differential Revision: https://reviews.llvm.org/D28219 llvm-svn: 290947	2017-01-04 08:05:42 +00:00
Craig Topper	d0aa53b9ae	[AVX-512] Add support for detecting 512-bit shuffles that contain a 128-bit subvector insertion from the lowest subvector of one of the sources. These are best handled with a vinsert32x4 or vinsert64x2 instruction. llvm-svn: 290946	2017-01-04 07:32:03 +00:00
Craig Topper	a3b9a4edd5	[AVX-512] Add more test cases for shuffles that should be handled with subvector insert instructions. llvm-svn: 290945	2017-01-04 07:31:59 +00:00
Craig Topper	9e065c5b5c	[AVX-512] Fix a typo in a couple case names to match their behavior. llvm-svn: 290944	2017-01-04 07:31:57 +00:00
Craig Topper	42e8e33ccd	[AVX-512] Add avx512dq to the vector-shuffle-512-v16.ll test command lines in preparation for a future change that needs these features. llvm-svn: 290943	2017-01-04 07:31:54 +00:00
David Majnemer	b5e365c970	[InstCombine] Add a test for r290733 llvm-svn: 290929	2017-01-04 02:21:37 +00:00
David Majnemer	cb892e9066	[InstCombine] Move casts around shift operations It is possible to perform a left shift before zero extending if the shift would only shift out zeros. llvm-svn: 290928	2017-01-04 02:21:34 +00:00
David Majnemer	022d2a563b	[InstCombine] Combine adds across a zext We can perform the following: (add (zext (add nuw X, C1)), C2) -> (zext (add nuw X, C1+C2)) This is only possible if C2 is negative and C2 is greater than or equal to negative C1. llvm-svn: 290927	2017-01-04 02:21:31 +00:00
Matt Arsenault	56ff4839ae	InstCombine: Fold fabs on select of constants llvm-svn: 290913	2017-01-03 22:40:34 +00:00
Sanjay Patel	ada846aff0	[InstCombine] tighten checks for tests of assume -> metadata transform; NFC llvm-svn: 290903	2017-01-03 19:32:11 +00:00
Simon Pilgrim	1145989a71	[X86][SSE] Add extra truncated arithmetic tests for D28219 llvm-svn: 290902	2017-01-03 19:18:07 +00:00
Adrian Prantl	36daf63b2b	Add llvm-bcanalyzer support for new metadata node types. Also sort the existing list by value. llvm-svn: 290901	2017-01-03 19:17:49 +00:00
Xin Tong	883dd1b6c4	Enable disabled loopidiom test. Apparently we handle it now Summary: Enable disabled loopidiom test. Apparently we handle it now. Maybe due to improvements to AA. Reviewers: atrick, danielcdh, hfinkel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28171 llvm-svn: 290900	2017-01-03 19:08:05 +00:00
Craig Topper	9496e3f916	[AVX-512] Teach shuffle lowering to use vinsert instructions for shuffles corresponding to 256-bit subvector inserts. llvm-svn: 290870	2017-01-03 07:00:40 +00:00
Craig Topper	fa875a1d3d	[AVX-512] Teach EVEX to VEX conversion pass to handle VINSERT and VEXTRACT instructions. llvm-svn: 290869	2017-01-03 05:46:18 +00:00
Craig Topper	15d116ab41	[AVX-512] Re-generate tests that were updated for r290663 without using update_llc_test_checks.py so duplicate check lines weren't merged. llvm-svn: 290868	2017-01-03 05:46:10 +00:00
Craig Topper	c849172105	[AVX-512] Add support for pushing bitcasts through INSERT_SUBVEC in order to select a masked operation. llvm-svn: 290865	2017-01-03 05:46:02 +00:00
Craig Topper	0cda8bbf74	[AVX-512] Remove vinsert intrinsics and autoupgrade to native shufflevectors. There are some codegen problems here that I'll try to fix in future commits. llvm-svn: 290864	2017-01-03 05:45:57 +00:00
Craig Topper	4d47c6ae57	[AVX-512] Remove vextract intrinsics and autoupgrade to native shufflevectors. This unfortunately generates some really terrible code without VLX support due to v2i1 and v4i1 not being legal. Hopefully we can improve that in future patches. llvm-svn: 290863	2017-01-03 05:45:46 +00:00
Matt Arsenault	b264c94963	InstCombine: Add fma with constant transforms DAGCombine already does these. llvm-svn: 290860	2017-01-03 04:32:35 +00:00
Matt Arsenault	1cc294c85d	InstCombine: Add fma + fabs/fneg transforms fma (fneg x), (fneg y), z -> fma x, y, z fma (fabs x), (fabs x), z -> fma x, x, z llvm-svn: 290859	2017-01-03 04:32:31 +00:00
Xin Tong	2940231ff0	Make sure total loop body weight is preserved in loop peeling Summary: Regardless how the loop body weight is distributed, we should preserve total loop body weight. i.e. we should have same weight reaching the body of the loop or its duplicates in peeled and unpeeled case. Reviewers: mkuper, davidxl, anemet Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28179 llvm-svn: 290833	2017-01-02 20:27:23 +00:00
Daniel Berlin	aa0ec1e992	NewGVN: Add a test case for equivalent phis. llvm-svn: 290830	2017-01-02 19:55:13 +00:00
Daniel Berlin	43a5f998df	NewGVN: Add forgotten testcase for PR 31483 llvm-svn: 290829	2017-01-02 19:49:20 +00:00
Michal Gorny	89b6f16b3e	[cmake] Add LLVM_ENABLE_DIA_SDK option, and expose it in LLVMConfig Add an explicit LLVM_ENABLE_DIA_SDK option to control building support for DIA SDK-based debugging. Control its value to match whether DIA SDK support was found and expose it in LLVMConfig (alike LLVM_ENABLE_ZLIB). Its value is needed for LLDB to determine whether to run tests requiring DIA support. Currently it is obtained from llvm/Config/config.h; however, this file is not available for standalone builds. Following this change, LLDB will be modified to use the value from LLVMConfig. Differential Revision: https://reviews.llvm.org/D26255 llvm-svn: 290818	2017-01-02 18:19:35 +00:00
Joerg Sonnenberger	7b83732a40	Emit .cfi_sections before the first .cfi_startproc GNU as rejects input where .cfi_sections is used after .cfi_startproc, if the new section differs from the old. Adjust our output to always emit .cfi_sections before the first .cfi_startproc to minimize necessary code. Differential Revision: https://reviews.llvm.org/D28011 llvm-svn: 290817	2017-01-02 18:05:27 +00:00
Elena Demikhovsky	d96200d60a	Fixed shuffle-reverse cost on AVX-512. (This change was approved in https://reviews.llvm.org/D28118, but Simon asked to submit it separately). llvm-svn: 290812	2017-01-02 11:44:10 +00:00
Elena Demikhovsky	21706cbd24	AVX-512 Loop Vectorizer: Cost calculation for interleave load/store patterns. X86 target does not provide any target specific cost calculation for interleave patterns. It uses the common target-independent calculation, which gives very high numbers. As a result, the scalar version is chosen in many cases. The situation on AVX-512 is even worse, since we have 3-src shuffles that significantly reduce the cost. In this patch I calculate the cost on AVX-512. This allows comparing the interleave pattern with gather/scatter and choosing the better solution (PR31426). * Shuffle-broadcast cost will be changed in Simon's upcoming patch. Differential Revision: https://reviews.llvm.org/D28118 llvm-svn: 290810	2017-01-02 10:37:52 +00:00
Keno Fischer	f7d84ee6ff	Reapply "[CodeGen] Fix invalid DWARF info on Win64" This reapplies rL289013 (reverted in rL289014) with the fixes identified in D21731. Should hopefully pass the buildbots this time. llvm-svn: 290809	2017-01-02 03:00:19 +00:00
Sanjay Patel	0e3ae439cf	[InstCombine] add explanatory comment to test; NFC The test was added at r290797, and a patch to enable the transform is proposed in D28204. llvm-svn: 290798	2017-01-01 18:20:49 +00:00
Sanjay Patel	07537c2b6e	[InstCombine] add test to show potential nonnull attribute propagation; NFC This will change with the current draft of: https://reviews.llvm.org/D28204 llvm-svn: 290797	2017-01-01 17:18:00 +00:00
Sanjay Patel	5865d12e9f	[ValueTracking] add tests for known-nonnull-at; NFC llvm-svn: 290790	2016-12-31 19:23:26 +00:00
Dylan McKay	97cf837b46	[AVR] Optimize 16-bit ANDs with '1' Summary: Fixes PR 31345 Reviewers: dylanmckay Subscribers: fhahn, llvm-commits Differential Revision: https://reviews.llvm.org/D28186 llvm-svn: 290778	2016-12-31 01:07:14 +00:00
Craig Topper	d00db69227	[InstCombine][AVX-512] Teach InstCombine that llvm.x86.avx512.vcomi.sd and llvm.x86.avx512.vcomi.ss don't use the upper elements of their input. This was already done for the SSE/SSE2 version of the intrinsics. llvm-svn: 290776	2016-12-31 00:45:06 +00:00
Simon Pilgrim	c5fde8d748	[X86][AVX512DQ] Add truncated math tests for AVX512DQ. llvm-svn: 290772	2016-12-30 22:43:41 +00:00
Simon Pilgrim	85af973506	[X86][SSE] Fix truncated math test names. Inconsistent naming convention and wrong name for some input/output types. llvm-svn: 290771	2016-12-30 22:40:32 +00:00
Simon Pilgrim	712374169d	[X86][AVX512] Regenerate test - missing shuffle comments llvm-svn: 290770	2016-12-30 22:31:33 +00:00
Piotr Padlewski	da36215017	[MemDep] Handle gep with zeros for invariant.group Summary: gep 0, 0 is equivalent to bitcast. LLVM canonicalizes it to getelementptr because SROA can then handle it. A simple case like void g(A &a) { z(a); if (glob) a.foo(); } void testG() { A a; g(a); } was not devirtualized with -fstrict-vtable-pointers because of the lack of handling for gep 0 in Memory Dependence Analysis. Reviewers: dberlin, nlewycky, chandlerc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28126 llvm-svn: 290763	2016-12-30 18:45:07 +00:00
Mehdi Amini	e2770c0b80	Fix test change in r290736: restore index generation I remove one extra line, but because annoyingly llvm-lit does not clean the output directory before running the test, it didn't fail locally (the file was present from a previous run). llvm-svn: 290740	2016-12-30 01:15:50 +00:00
Michael Kuperstein	76e06c8858	[LICM] When promoting scalars, allow inserting stores to thread-local allocas. This is similar to the allocfn case - if an alloca is not captured, then it's necessarily thread-local. Differential Revision: https://reviews.llvm.org/D28170 llvm-svn: 290738	2016-12-30 01:03:17 +00:00
Dehao Chen	cc76344ef5	Use continuous boosting factor for complete unroll. Summary: The current loop complete unroll algorithm checks if complete unrolling will reduce the runtime by a certain percentage. If yes, it will apply a fixed boosting factor to the threshold (by discounting cost). The problem with this approach is that the threshold changes abruptly. This patch makes the boosting factor a function of runtime reduction percentage, capped by a fixed threshold. In this way, the threshold changes continuously. The patch also simplifies the code by removing one parameter in UP. The patch only affects code-gen of two speccpu2006 benchmarks: 445.gobmk binary size decreases 0.08%, no performance change. 464.h264ref binary size increases 0.24%, no performance change. Reviewers: mzolotukhin, chandlerc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D26989 llvm-svn: 290737	2016-12-30 00:50:28 +00:00
Mehdi Amini	30a9b6bb4e	Replace test from using llvm-lto to use llvm-link (NFC) Some incoming changes in ThinLTO will break this test. Instead of relying on the heuristic to import, we force the importing to happen with llvm-link. llvm-svn: 290736	2016-12-30 00:45:26 +00:00
Dylan McKay	453d042969	[AVR] Optimize 16-bit ORs with '0' Summary: Fixes PR 31344 Authored by Anmol P. Paralkar Reviewers: dylanmckay Subscribers: fhahn, llvm-commits Differential Revision: https://reviews.llvm.org/D28121 llvm-svn: 290732	2016-12-30 00:21:56 +00:00
Daniel Berlin	e0bd37e78f	NewGVN: Fix PR 31491 by ensuring that we touch the right instructions. Change to one based numbering so we can assert we don't cause the same bug again. llvm-svn: 290724	2016-12-29 22:15:12 +00:00
Reid Kleckner	cd46c1df80	Revert "[COFF] Use 32-bit jump table entries in .rdata for Win64" This reverts commit r290694. It broke sanitizer tests on Win64. I'll probably bring this back, but the jump tables will just live in .text like they do for MSVC. llvm-svn: 290714	2016-12-29 17:07:10 +00:00
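
The InstCombine entry for r290927 above states a fold, (add (zext (add nuw X, C1)), C2) -> (zext (add nuw X, C1+C2)), together with a side condition: C2 is negative and C2 >= -C1. The following is a minimal sketch, not LLVM code, that brute-forces the claim on small bit widths. The function names and the 4-bit/8-bit default widths are hypothetical choices made for illustration, and inputs that violate nuw on the original narrow add are treated as vacuously fine because the original expression is already poison there.

```python
def zext_add_fold_holds(x, c1, c2, narrow_bits=8, wide_bits=16):
    """Compare both sides of the fold for one concrete (x, c1, c2).

    x and c1 are unsigned values of the narrow type; c2 is a signed
    constant of the wide type. Hypothetical helper, not an LLVM API.
    """
    narrow_mask = (1 << narrow_bits) - 1
    wide_mask = (1 << wide_bits) - 1

    # 'add nuw X, C1' must not wrap in the narrow type; if it does,
    # the original is poison and any replacement is acceptable.
    if x + c1 > narrow_mask:
        return True

    # Before: zero-extend the narrow sum, then add C2 in the wide type.
    before = ((x + c1) + c2) & wide_mask
    # After: add the combined constant in the narrow type, then zero-extend.
    after = (x + (c1 + c2)) & narrow_mask
    return before == after


def find_counterexample(narrow_bits=4, wide_bits=8):
    """Search all inputs allowed by the stated side condition."""
    limit = 1 << narrow_bits
    for c1 in range(limit):
        for c2 in range(-c1, 0):  # C2 negative and C2 >= -C1
            for x in range(limit):
                if not zext_add_fold_holds(x, c1, c2, narrow_bits, wide_bits):
                    return (x, c1, c2)
    return None
```

Under these assumptions the search finds no counterexample within the side condition, while a constant outside it, such as x=0, c1=1, c2=-2, makes the two sides differ, which is consistent with the restriction stated in the commit message.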

41919 Commits