llvm-project

Commit Graph

Author	SHA1	Message	Date
Matt Arsenault	51818c14b3	AMDGPU: Constant fold when immediate is materialized In future commits these patterns will appear after moveToVALU changes. llvm-svn: 291615	2017-01-10 23:32:04 +00:00
Matt Arsenault	fdb78f8bae	InstCombine: fdiv -x, -y -> fdiv x, y llvm-svn: 291611	2017-01-10 23:08:54 +00:00
Kyle Butt	df27aa8c89	CodeGen: Allow small copyable blocks to "break" the CFG. When choosing the best successor for a block, ordinarily we would have preferred a block that preserves the CFG unless there is a strong probability the other direction. For small blocks that can be duplicated we now skip that requirement as well. Differential revision: https://reviews.llvm.org/D27742 llvm-svn: 291609	2017-01-10 23:04:30 +00:00
Douglas Yung	ee787a7665	Make the test accept different OpCode values since it doesn't really care about the value. Differential Revision: https://reviews.llvm.org/D28487 llvm-svn: 291605	2017-01-10 22:10:22 +00:00
Matt Arsenault	0b382a7cb8	DAG: Avoid OOB when legalizing vector indexing If a vector index is out of bounds, the result is supposed to be undefined but is not undefined behavior. Change the legalization for indexing the vector on the stack so that an out of bounds index does not create an out of bounds memory access. llvm-svn: 291604	2017-01-10 22:02:30 +00:00
Derek Schuff	7acb42a41a	[WebAssembly] Only RAUW a constant once in FixFunctionBitcasts When we collect 2 uses of a function in FindUses and then RAUW when we visit the first, we end up visiting the wrapper (because the second was RAUW'd). We still want to use RAUW instead of just Use->set() because it has special handling for Constants, so this patch just ensures that only one use of each constant is added to the work list. Differential Revision: https://reviews.llvm.org/D28504 llvm-svn: 291603	2017-01-10 21:59:53 +00:00
Victor Leschuk	59d0b92a2a	Correct object file for implicit const test llvm-svn: 291601	2017-01-10 21:30:42 +00:00
Victor Leschuk	cbddae74f5	DebugInfo: support for DW_FORM_implicit_const Support for DW_FORM_implicit_const DWARFv5 feature. When this form is used attribute value goes to .debug_abbrev section (as SLEB). As this form would break any debug tool which doesn't support DWARFv5 it is guarded by dwarf version check. Attempt to use this form with dwarf version <= 4 is considered a fatal error. Differential Revision: https://reviews.llvm.org/D28456 llvm-svn: 291599	2017-01-10 21:18:26 +00:00
Michal Gorny	6911324ed4	[llvm-config] Canonicalize CMake booleans to 0/1 Following the similar change to lit configuration, ensure that all CMake booleans are canonicalized to 0/1 when being passed to llvm-config. This fixes the incorrect interpretation of values when the user passes a value other than ON/OFF, and simplifies the code by removing unnecessary string matching. Furthermore, the code for --has-rtti and --has-global-isel has been modified to print consistent values independently of the boolean passed by the user to CMake. Sadly, the code already implicitly used different values for the two (YES/NO for --has-rtti, ON/OFF for --has-global-isel). Include tests for all booleans and multi-value options in llvm-config. Differential Revision: https://reviews.llvm.org/D28366 llvm-svn: 291593	2017-01-10 19:55:51 +00:00
Michael Kuperstein	ee31cbe35f	[LV] Don't panic when encountering the IV of an outer loop. Bail out instead of asserting when we encounter this situation, which can actually happen. The reason the test uses the new PM is that the "bad" phi, incidentally, gets cleaned up by LoopSimplify. But LICM can create this kind of phi and preserve loop simplify form, so the cleanup has no chance to run. This fixes PR31190. We may want to solve this in a less conservative manner, since this phi is actually uniform within the inner loop (or we may want LICM to output a cleaner promotion to begin with). Differential Revision: https://reviews.llvm.org/D28490 llvm-svn: 291589	2017-01-10 19:32:30 +00:00
Rong Xu	ef1adad938	[PGO] Turn off comdat renaming in IR PGO by default Summary: In IR PGO we append the function hash to comdat functions to avoid the potential hash mismatch. This turns out not to be legal in some cases: if the comdat function is address-taken and used in comparison, renaming changes the semantics. This patch turns off comdat renaming by default. To alleviate the hash mismatch issue, we now rename the profile variable for comdat functions. The profile allows multiple versions of profiles with different hash values to coexist. The inlined copy will always have the correct profile counter. The out-of-line copy might not have the correct count, but we will not have the bogus mismatch warning. Reviewers: davidxl Subscribers: llvm-commits, xur Differential Revision: https://reviews.llvm.org/D28416 llvm-svn: 291588	2017-01-10 19:30:20 +00:00
Matt Arsenault	8871683d60	AMDGPU: Add tests for HasMultipleConditionRegisters This was enabled without many specific tests or the comment. llvm-svn: 291586	2017-01-10 19:08:15 +00:00
Simon Pilgrim	b6d4fa6551	[CostModel][X86] Add AVX512VL vector shift cost tests. llvm-svn: 291585	2017-01-10 19:04:12 +00:00
Michael Zuckerman	bcd03e7f3b	[X86][AVX512] Improving shuffle lowering by using AVX-512 EXPAND* instructions This patch fixes PR31351: https://llvm.org/bugs/show_bug.cgi?id=31351 1. This patch adds a new type of shuffle lowering. 2. We can use the expand instruction when the shuffle pattern is as follows: { 0*a[0]0*a[1]...0*a[n], n >= 0, where the a[] elements are in ascending order }. Reviewers: 1. igorb 2. guyblank 3. craig.topper 4. RKSimon Differential Revision: https://reviews.llvm.org/D28352 llvm-svn: 291584	2017-01-10 18:57:17 +00:00
Davide Italiano	f8711f093e	[SimplifyLibCalls] Propagate fast math flags while optimizing pow(). llvm-svn: 291577	2017-01-10 18:02:05 +00:00
Chad Rosier	3daffbf6a8	[AArch64] Add support for lowering bitreverse to the rbit instruction. Differential Revision: https://reviews.llvm.org/D28379 llvm-svn: 291575	2017-01-10 17:20:33 +00:00
Simon Dardis	548a53f5ee	[mips] Fix Mips MSA intrinsics The usage of some MIPS MSA intrinsics that took immediates could crash LLVM during lowering. This patch addresses that behaviour. Crucially, this patch also makes the use of intrinsics with out-of-range immediates produce an internal error. The ld,st intrinsics would trigger an assertion failure for MIPS64 as their lowering would attempt to add an i32 offset to an i64 pointer. Reviewers: vkalintiris, slthakur Differential Revision: https://reviews.llvm.org/D25438 llvm-svn: 291571	2017-01-10 16:40:57 +00:00
Simon Dardis	0e9e237310	[mips] Honour -mno-odd-spreg for vector splat (again) Previously the lowering of FILL_FW would use the MSA128W register class when performing a vector splat. Instead it should be honouring -mno-odd-spreg and only use the even registers when performing a splat from word to vector register. Logical follow-on from r230235. This fixes PR/31369. A previous commit was missing the test case and had another differential in it. Reviewers: slthakur Differential Revision: https://reviews.llvm.org/D28373 llvm-svn: 291566	2017-01-10 15:53:10 +00:00
Eugene Leviant	8e32aebe80	RuntimeDyldELF: implement R_AARCH64_PREL64 reloc Differential revision: https://reviews.llvm.org/D28122 llvm-svn: 291558	2017-01-10 11:05:30 +00:00
Chris Bieneman	e2796fd3fd	[ObjectYAML] Missed one mixup in the debug_line test llvm-svn: 291547	2017-01-10 06:24:24 +00:00
Chris Bieneman	1b7200d2cf	[ObjectYAML] Support for DWARF line tables One more try... relanding r291541 with a fix to properly gate MaxOpsPerInst on DWARF version. Description from r291541: This patch re-lands r291470, which failed on Linux bots. The issue (I believe) was undefined behavior because the size of llvm::dwarf::LineNumberOps was not explicitly specified or consistently respected. The updated patch adds an explicit underlying type to the enum and preserves the size more correctly. Original description: This patch adds support for the DWARF debug_lines section. The line table state machine opcodes are preserved, so this can be used to test the state machine evaluation directly. llvm-svn: 291546	2017-01-10 06:22:49 +00:00
Craig Topper	d55b83128b	AMD family 17h (znver1) enablement Summary: This patch enables the following 1. AMD family 17h architecture using "znver1" tune flag (-march, -mcpu). 2. ISAs that are enabled for "znver1" architecture. 3. Checks ADX isa from cpuid to identify "znver1" flag when -march=native is used. 4. ISAs FMA4, XOP are disabled as they are dropped from amdfam17. 5. For the time being, it uses the btver2 scheduler model. 6. Test file is updated to check this flag. This item is linked to clang review item https://reviews.llvm.org/D28018 Patch by Ganesh Gopalasubramanian Reviewers: RKSimon, craig.topper Subscribers: vprasad, RKSimon, ashutosh.nema, llvm-commits Differential Revision: https://reviews.llvm.org/D28017 llvm-svn: 291543	2017-01-10 06:01:16 +00:00
Chris Bieneman	e6663d376e	Revert "[ObjectYAML] Support for DWARF line tables" This reverts commit r291541. Still failing on a bot: http://bb.pgr.jp/builders/cmake-llvm-x86_64-linux/builds/47224/steps/test_llvm/logs/stdio llvm-svn: 291542	2017-01-10 05:31:23 +00:00
Chris Bieneman	07ab0aa5d6	[ObjectYAML] Support for DWARF line tables This patch re-lands r291470, which failed on Linux bots. The issue (I believe) was undefined behavior because the size of llvm::dwarf::LineNumberOps was not explicitly specified or consistently respected. The updated patch adds an explicit underlying type to the enum and preserves the size more correctly. Original description: This patch adds support for the DWARF debug_lines section. The line table state machine opcodes are preserved, so this can be used to test the state machine evaluation directly. llvm-svn: 291541	2017-01-10 05:25:24 +00:00
Dean Michael Berris	4ebc79bb05	[XRay] Use regular expression for finding symbols Un-break the test in Windows. Follow-up on D24376. llvm-svn: 291538	2017-01-10 04:32:34 +00:00
Serge Pavlov	0668cd2c95	[StructurizeCfg] Update dominator info. In some cases StructurizeCfg updates the root node but the dominator info remains unchanged, which causes a crash when expensive checks are enabled. To cope with this problem, a new method was added to DominatorTreeBase that allows adding new root nodes; it is called in StructurizeCfg to keep the dominator tree in sync. This change fixes PR27488. Differential Revision: https://reviews.llvm.org/D28114 llvm-svn: 291530	2017-01-10 02:50:47 +00:00
Dean Michael Berris	f8f909f848	[XRay] Implement `llvm-xray convert` -- trace file conversion This is the second part of a multi-part change to define additional subcommands to the `llvm-xray` tool. This change defines a conversion subcommand to take XRay log files, and turns them from one format to another (binary or YAML). This currently only supports the first version of the log file format, defined in the compiler-rt runtime. Depends on D21987. Reviewers: dblaikie, echristo Subscribers: mehdi_amini, dberris, beanz, llvm-commits Differential Revision: https://reviews.llvm.org/D24376 llvm-svn: 291529	2017-01-10 02:38:11 +00:00
James Y Knight	5b30b67cd1	Commit a test for match-full-lines. I unfortunately neglected to add it in r260540, but it has been sitting in my working dir ever since. D'oh. Modified to work with r290069, which made the CHECK patterns themselves whitespace-sensitive as well, and remove the test added then, as this tests both strict and non-strict modes. llvm-svn: 291499	2017-01-09 23:11:25 +00:00
Simon Pilgrim	fa32894730	[X86][AVX512VL] Added AVX512VL to 128/256 bit vector shift tests llvm-svn: 291488	2017-01-09 22:13:51 +00:00
Davide Italiano	472684eaf5	[SimplifyLibCalls] pow(x, -0.5) -> 1.0 / sqrt(x). Differential Revision: https://reviews.llvm.org/D28479 llvm-svn: 291486	2017-01-09 21:55:23 +00:00
Matthias Braun	ba7d95d425	PeepholeOptimizer: Do not replace SubregToReg(bitcast like) While we can usually replace bitcast like instructions (MachineInstr::isBitcast()) with a COPY this is not legal if any of the users uses SUBREG_TO_REG to assert the upper bits of the result are zero. Differential Revision: https://reviews.llvm.org/D28474 llvm-svn: 291483	2017-01-09 21:38:17 +00:00
Matthias Braun	c612891cc5	Drive by typo fix llvm-svn: 291482	2017-01-09 21:38:14 +00:00
Michael Kuperstein	1559e8863e	Revert r291092 because it introduces a crash. See PR31589 for details. llvm-svn: 291478	2017-01-09 21:04:46 +00:00
Vyacheslav Klochkov	d497d36083	X86-specific path: Implemented the fusing of MUL+ADDSUB to FMADDSUB. Differential Revision: https://reviews.llvm.org/D28087 llvm-svn: 291473	2017-01-09 20:26:17 +00:00
Sanjay Patel	8f4910e26a	[InstCombine] add test to show missed fold using llvm.assume; NFC llvm-svn: 291472	2017-01-09 20:18:30 +00:00
Chris Bieneman	e62e684fdd	Revert "[ObjectYAML] Support for DWARF line tables" This reverts commit r291470 due to failing bots: http://bb.pgr.jp/builders/cmake-llvm-x86_64-linux/builds/47209/steps/test_llvm/logs/stdio llvm-svn: 291471	2017-01-09 20:04:55 +00:00
Chris Bieneman	0396f99184	[ObjectYAML] Support for DWARF line tables This patch adds support for the DWARF debug_lines section. The line table state machine opcodes are preserved, so this can be used to test the state machine evaluation directly. llvm-svn: 291470	2017-01-09 20:01:37 +00:00
Sanjay Patel	eaa143c98c	[InstCombine] regenerate checks; NFC llvm-svn: 291469	2017-01-09 19:43:26 +00:00
Sanjay Patel	baac743254	[ValueTracking] regenerate checks; NFC llvm-svn: 291468	2017-01-09 19:31:20 +00:00
Sanjay Patel	87495eb8ef	[InstCombine] regenerate checks; NFC llvm-svn: 291464	2017-01-09 19:18:46 +00:00
Sanjay Patel	ced8fdd42a	[InstCombine] remove unnecessary attribute comments from test files; NFC llvm-svn: 291463	2017-01-09 19:13:38 +00:00
Matthew Simpson	cf796478e9	[LV] Fix-up external IV users after updating dominator tree This patch delays the fix-up step for external induction variable users until after the dominator tree has been properly updated. This should fix PR30742. The SCEVExpander in InductionDescriptor::transform can generate code in the wrong location if the dominator tree is not up-to-date. We should work towards keeping the dominator tree up-to-date throughout the transformation. Reference: https://llvm.org/bugs/show_bug.cgi?id=30742 Differential Revision: https://reviews.llvm.org/D28168 llvm-svn: 291462	2017-01-09 19:05:29 +00:00
Matt Arsenault	6dca542b4a	AMDGPU: Add Assert[SZ]Ext during argument load creation For i16 zeroext arguments when i16 was a legal type, the known bits information from the truncate was lost. Insert a zeroext so the known bits optimizations work with the 32-bit loads. Fixes code quality regressions vs. SI in min.ll test. llvm-svn: 291461	2017-01-09 18:52:39 +00:00
Xin Tong	c13a8e84d1	Intrinsic::Bitreverse is safe to speculate Summary: Intrinsic::Bitreverse is safe to speculate Reviewers: hfinkel, mkuper, arsenm, jmolloy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28471 llvm-svn: 291456	2017-01-09 17:57:08 +00:00
Simon Pilgrim	0f23b2ba1a	[X86][AVX512] Enable v16i8/v32i8 vector shifts to use an extend+shift+truncate pattern. Use the existing AVX2 v8i16 vector shift lowering for v16i8 (extending to v16i32) on AVX512 targets and v32i8 (extending to v32i16) on AVX512BW targets. Cost model updates to follow. llvm-svn: 291451	2017-01-09 17:20:03 +00:00
Simon Pilgrim	d990cd371b	[X86][AVX512DQ] Enable v16i16 vector shifts to use an extend+shift+truncate pattern. Use the existing AVX2 v8i16 vector shift lowering for v16i16 on AVX512 targets (AVX512BW will already have lowered with vpsravw). Cost model updates to follow. llvm-svn: 291445	2017-01-09 15:15:45 +00:00
Simon Pilgrim	f8538572ab	[X86][AVX512DQ] Added AVX512DQ to 128/256 bit vector shift tests llvm-svn: 291444	2017-01-09 14:36:09 +00:00
Bjorn Pettersson	b14afd452d	[SelectionDAG] Fix in legalization of UMAX/SMAX/UMIN/SMIN. Solves PR31486. Summary: Originally i64 = umax t8, Constant:i64<4> was expanded into i32,i32 = umax Constant:i32<0>, Constant:i32<0> i32,i32 = umax t7, Constant:i32<4> Now instead the two produced umax:es return i32 instead of i32, i32. Thanks to Jan Vesely for help with the test case. Patch by mikael.holmen at ericsson.com Reviewers: bogner, jvesely, tstellarAMD, arsenm Subscribers: test, wdng, RKSimon, arsenm, nhaehnle, llvm-commits Differential Revision: https://reviews.llvm.org/D28135 llvm-svn: 291441	2017-01-09 12:03:50 +00:00
Eugene Leviant	74f27dc91b	RuntimeDyldELF: add missing test cases for AArch64 llvm-svn: 291438	2017-01-09 11:47:33 +00:00
Eugene Leviant	be2d68f774	RuntimeDyldELF: don't create thunk if not needed This patch doesn't create thunk for branch operation when following conditions are met: - Architecture is AArch64 - Relocation target is in the same object file - Relocation target is close enough to be encoded in immediate offset In such case we branch directly to the target instead of branching to thunk Differential revision: https://reviews.llvm.org/D28108 llvm-svn: 291431	2017-01-09 09:56:31 +00:00
Chandler Carruth	082c183f06	[PM] Teach SCEV to invalidate itself when its dependencies become invalid. This fixes use-after-free bugs that will arise with any interesting use of SCEV. I've added a dedicated test that works diligently to trigger these kinds of bugs in the new pass manager and also checks for them explicitly as well as triggering ASan failures when things go squirrelly. llvm-svn: 291426	2017-01-09 07:44:34 +00:00
Daniel Berlin	b755aea8eb	NewGVN: Fix PR 31573, a failure to verify memory congruency due to not excluding ourselves when checking if any equivalent stores exist. llvm-svn: 291421	2017-01-09 05:34:29 +00:00
Craig Topper	96ab6fd2eb	[AVX-512] Change another pattern that was using BLENDM to use masked moves. A future patch will convert it back to BLENDM if it's beneficial to register allocation. llvm-svn: 291419	2017-01-09 04:19:34 +00:00
Craig Topper	6393afce97	[AVX-512] Add patterns to use a zero masked VPTERNLOG instruction for vselects of all ones and all zeros. Previously we emitted a VPTERNLOG and a separate masked move. llvm-svn: 291415	2017-01-09 02:44:34 +00:00
Piotr Padlewski	09ad678bc4	[MemDep] NFC walk invariant.group graph only down Summary: By using stripPointerCasts we can get to the root value and then walk down the bitcast graph Reviewers: reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28181 llvm-svn: 291405	2017-01-08 22:26:06 +00:00
Craig Topper	f51ba1e3da	[AVX-512] If avx512dq is available use vpmovm2d/vpmovm2q instead of vselect of zeroes/ones when handling sign extends of i1 without VLX. llvm-svn: 291402	2017-01-08 21:32:30 +00:00
Craig Topper	0930a523cc	[X86] Add avx512bw and avx512dq command lines to the vector compare results test. This is preparation for improving a case with avx512dq. llvm-svn: 291401	2017-01-08 21:32:26 +00:00
Sanjay Patel	bf51c8a975	[x86] fix usage of stale operands when lowering select I noticed this problem as part of the ongoing attempt to canonicalize min/max ops in IR. The debug output shows nodes like this: t4: i32 = xor t2, Constant:i32<-1> t21: i8 = setcc t4, Constant:i32<0>, setlt:ch t14: i32 = select t21, t4, Constant:i32<-1> And because the select is holding onto the t4 (xor) node while EmitTest creates a new x86-specific xor node, the lowering results in: t4: i32 = xor t2, Constant:i32<-1> t25: i32,i32 = X86ISD::XOR t2, Constant:i32<-1> t28: i32,glue = X86ISD::CMOV Constant:i32<-1>, t4, Constant:i8<15>, t25:1 Differential Revision: https://reviews.llvm.org/D28374 llvm-svn: 291392	2017-01-08 15:53:40 +00:00
Simon Pilgrim	9c58950eeb	[CostModel][X86] Fixed vXi8 uniform shift costs. The 'fast' costs should only work for shifts by uniform constants (uniform non-constant are lowered using the slow default implementation). Logical shifts were not taking into account that we must mask the psrlw result, so the costs needed to be doubled. Added missing AVX2/AVX512BW costs as well. llvm-svn: 291391	2017-01-08 14:14:36 +00:00
Simon Pilgrim	1fa5487c05	[CostModel][X86] Moved legal uniform shift costs earlier. XOP was prematurely matching, doubling the cost of ashr/lshr uniform shifts. llvm-svn: 291390	2017-01-08 13:12:03 +00:00
Dylan McKay	8fa6d8db9c	[AVR] Implement TargetLowering::getRegisterByName This allows the use of the 'read_register' intrinsics used by clang's named register globals features. llvm-svn: 291375	2017-01-07 23:39:47 +00:00
Simon Pilgrim	9681c407b4	[CostModel][X86] Update SSE41/AVX1 vXi32 SHL costs SSE41 provides pmulld which allows the simpler pslld/paddd/cvttps2dq/pmulld pattern than SSE2's use of pmuludq. llvm-svn: 291372	2017-01-07 22:27:43 +00:00
Craig Topper	a74e3088df	[AVX-512] Remove patterns from the other VBLENDM instructions. They are all redundant with masked move instructions. We should probably teach the two address instruction pass to turn masked moves into BLENDM when it's beneficial to the register allocator. llvm-svn: 291371	2017-01-07 22:20:34 +00:00
Craig Topper	2d02d4926b	[X86] Regenerate a test to remove tab characters. llvm-svn: 291370	2017-01-07 22:20:28 +00:00
Craig Topper	da84ff3ed4	[AVX-512] Add masked forms of the alternate MOVDDUP patterns. I'm not too sure how to get isel to select even all of the unmasked forms, but at least we have a consistent set now. llvm-svn: 291368	2017-01-07 22:20:23 +00:00
Simon Pilgrim	a470296367	[CostModel][X86] Fix AVX2 v16i16 shift 'splat' costs. llvm-svn: 291366	2017-01-07 22:08:09 +00:00
Simon Pilgrim	82e3e05fe2	[CostModel][X86] Match 256-bit vector shift 'splat' costs for AVX2 and above We were matching against general vector shift costs before the uniform splat costs llvm-svn: 291365	2017-01-07 21:47:10 +00:00
Simon Pilgrim	935beac173	[X86][AVX2] Regenerate arithmetic tests Fixed missing checks for tests that used a '-' in the name, which was messing with update_llc_test_checks.py llvm-svn: 291363	2017-01-07 20:38:36 +00:00
Mehdi Amini	d5549f3dac	[ThinLTO] Fix assertions on lazy-loading of Metadata TBAA attachments Summary: The issue happens with: %0 = ....., !tbaa !0 %1 = ....., !tbaa !1 With !0 that references !1. In this case when loading !0 we generate a temporary for the operand !1. We now flush it immediately and trigger the load of !1 before moving on. If we don't, we get the temporary when attaching to %1. This is usually not an issue except that we eagerly try to update TBAA MDNodes, which is obviously not possible if we only have a temporary. Differential Revision: https://reviews.llvm.org/D28423 llvm-svn: 291362	2017-01-07 20:24:23 +00:00
Hal Finkel	ec85fc5eac	[llvm-opt-report] Fix context-sensitive lines where nothing happened Don't print a line multiple times, each for different inlining contexts, if nothing happened in any context. This prevents situations like this: [[ > main: 65 | if ((i * ni + j) % 20 == 0) fprintf > print_array: 65 | if ((i * ni + j) % 20 == 0) fprintf ]] which could happen if different optimizations were missed in different inlining contexts. llvm-svn: 291361	2017-01-07 20:21:17 +00:00
Matt Arsenault	a7d2194168	SimplifyLibCalls: Remove incorrect optimization of fabs fabs(x * x) is not generally safe to assume x is positive if x is a NaN. This is also less general than it could be, so this will be replaced with a transformation on the intrinsic. llvm-svn: 291359	2017-01-07 19:55:12 +00:00
Simon Pilgrim	a4109d6433	[CostModel][AVX512BW] Add v32i16 vector shift costs for avx512bw targets. llvm-svn: 291354	2017-01-07 17:54:10 +00:00
Daniel Berlin	32f8d560dd	NewGVN: Make sure we properly lookup operand leaders while creating congruence classes for stores, and then keep them up to date. Add testcases. llvm-svn: 291351	2017-01-07 16:55:14 +00:00
Simon Pilgrim	a1b8e2c725	[X86][AVX512] Use lowerShuffleAsRepeatedMaskAndLanePermute for non-VBMI v64i8 shuffles (PR31470) llvm-svn: 291347	2017-01-07 15:37:50 +00:00
Dan Gohman	0e2ceb8121	[WebAssembly] Don't abort on code with UB. Gracefully leave code that performs function-pointer bitcasts implying non-trivial pointer conversions alone, rather than aborting, since it's just undefined behavior. llvm-svn: 291326	2017-01-07 01:50:01 +00:00
Dan Gohman	1b637458f6	[WebAssembly] Add a pass to create wrappers for function bitcasts. WebAssembly requires caller and callee signatures to match exactly. In LLVM, there are a variety of circumstances where signatures may be mismatched in practice, and one can bitcast a function address to another type to call it as that type. This patch adds a pass which replaces bitcasted function addresses with wrappers to replace the bitcasts. This doesn't catch everything, but it does match many common cases. llvm-svn: 291315	2017-01-07 00:34:54 +00:00
Daniel Berlin	d92e7f9f74	NewGVN: Fix PR 31501. Summary: LLVM's non-standard notion of phi nodes means we can't both try to substitute for undef in phi nodes and use phi nodes as leaders all the time. This changes NewGVN to use the same semantics as SimplifyPHINode to decide which phi nodes are equivalent. Reviewers: davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28312 llvm-svn: 291308	2017-01-07 00:01:42 +00:00
Teresa Johnson	9006d52651	[ThinLTO] Handle conflicting local names gracefully Summary: r285871 introduced an assert that was overly aggressive in the case of a same-named local in different same-named files (in different directories), where the source name and therefore the GUID ended up the same because the files were compiled in their own directory without any leading path. Change the handling in the promotion logic to get the summary for the version in that module. This also exposed an issue where we are not always importing the right copy, which is a performance not correctness issue (because the renaming is based on the module hash which must be different, see the bug report for details). I will fix that as a follow-on. Fixes PR31561. Reviewers: mehdi_amini Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28411 llvm-svn: 291304	2017-01-06 23:38:41 +00:00
David Majnemer	63da0c238b	[InstSimplify] Optimize away udivs in the presence of range metadata We know that udiv %V, C can be optimized away to 0 if %V is ult C. llvm-svn: 291296	2017-01-06 22:58:02 +00:00
Kuba Mracek	5a2f078ee2	Follow-up for r291289: Fix failing global_metadata_darwin.ll test llvm-svn: 291292	2017-01-06 22:22:22 +00:00
Michal Gorny	d1b954884c	[llvm-config] Print --system-libs only when static linking Modify the --system-libs option in llvm-config to print system libs only when using static linking. The system libraries are irrelevant when linking to a shared library since the library has appropriate library dependencies embedded. Modify the --system-libs test appropriately to force static linking, and disable it if static libs are not available (i.e. BUILD_SHARED_LIBS is enabled). Differential Revision: https://reviews.llvm.org/D27805 llvm-svn: 291285	2017-01-06 21:33:54 +00:00
Michal Gorny	9283f5b200	[cmake] Canonicalize CMake booleans to 0/1 for lit interop Canonicalize all CMake booleans to 0/1 before passing them to lit, to ensure that the Python side handles all of them consistently and correctly. 0/1 is a safe choice of values that trigger the same boolean interpretation in CMake, Python and C++. Furthermore, using them without quotes improves the chance Python will explicitly fail when an incorrect value (such as ON/OFF, TRUE/FALSE, YES/NO) is accidentally passed, rather than silently misinterpreting the value. This replaces a lot of different logics spread around lit site files, attempting to partially reproduce the boolean logic used in CMake and usually silently failing when an uncommon value was used instead. In fact, some of them were never working correctly since different values were assigned in CMake and checked in Python. The alternative solution could be to create a common parser for CMake booleans in lit and use it consistently throughout the site files. However, it does not seem like the best idea to create redundant implementation of the same logic and have to follow upstream if it ever is extended to handle more values. Differential Revision: https://reviews.llvm.org/D28294 llvm-svn: 291284	2017-01-06 21:33:48 +00:00
Michal Gorny	82eb45a6f8	[test] Remove unused 'test_examples' config var Remove config.test_examples from lit.site.cfg and the relevant ENABLE_EXAMPLES definition from CMake. It is not used anywhere. Differential Revision: https://reviews.llvm.org/D28283 llvm-svn: 291283	2017-01-06 21:33:39 +00:00
David Majnemer	8c0e62f507	[InstSimplify] Optimize away urems in the presence of range metadata We know that urem %V, C can be optimized away to %V if %V is ult C. llvm-svn: 291282	2017-01-06 21:23:51 +00:00
Mehdi Amini	27d224fbbb	Fix LoopLoadElimination to keep original alignment on the initial hoisted store This is fixing a bug where Loop Vectorization is widening a load but with a lower alignment. Hoisting the load without propagating the alignment will allow inst-combine to later deduce a higher alignment than what the pointer actually is. Differential Revision: https://reviews.llvm.org/D28408 llvm-svn: 291281	2017-01-06 21:06:51 +00:00
Jan Vesely	06200bd7bc	AMDGPU/R600: Don't use REGISTER_{LOAD,STORE} ISD nodes This will make transition to SCRATCH_MEMORY easier Differential Revision: https://reviews.llvm.org/D24746 llvm-svn: 291279	2017-01-06 21:00:46 +00:00
Simon Pilgrim	08519d7b02	[X86][SSE] Standardized triples in vector shift tests Made no sense for them to be different and caused useless diffs in assembly remarks. llvm-svn: 291274	2017-01-06 19:56:57 +00:00
Simon Pilgrim	9cbcc5ff0b	[CostModel][X86] Add AVX512 and 512-bit vector shift cost tests. llvm-svn: 291269	2017-01-06 19:41:26 +00:00
Matthias Braun	258b847c4f	AArch64CollectLOH: Rewrite as block-local analysis. Re-apply r288561: This time with a fix where the ADDs that are part of a 3 instruction LOH would not invalidate the "LastAdrp" state. This fixes http://llvm.org/PR31361 Previously this pass was using up to 5% compile time in some cases which is a bit much for what it is doing. The pass featured a full blown data-flow analysis which in the default configuration was restricted to a single block. This rewrites the pass under the assumption that we only ever work on a single block. This is done in a single pass maintaining a state machine per general purpose register to catch LOH patterns. Differential Revision: https://reviews.llvm.org/D27329 This reverts commit 9e6cedb0a4f14364d6511597a9160305e7d34493. llvm-svn: 291266	2017-01-06 19:22:01 +00:00
Sanjay Patel	2715d92389	[InstCombine] add a vector version of a test added in r291262; NFC llvm-svn: 291265	2017-01-06 19:14:05 +00:00
Sanjay Patel	8d4aa10960	[InstCombine] move and add tests for icmp + shl nsw; NFC As discussed here: http://lists.llvm.org/pipermail/llvm-dev/2017-January/108749.html ...we should be able to better optimize this pattern. llvm-svn: 291262	2017-01-06 18:57:54 +00:00
Wolfgang Pieb	c17a279eda	[DWARF] Null out the debug locs of (loop invariant) instructions hoisted by LICM in order to avoid jumpy line tables. Calls are left alone because they may be inlined. Differential Revision: https://reviews.llvm.org/D28390 llvm-svn: 291258	2017-01-06 18:38:57 +00:00
Chad Rosier	e177185e79	[AArch64] Reduce vector insert/extract cost for Falkor. Differential Revision: https://reviews.llvm.org/D28403 llvm-svn: 291254	2017-01-06 18:03:26 +00:00
Konstantin Zhuravlyov	67a6d5401a	[AMDGPU] Do not emit .AMDGPU.config section for amdhsa Differential Revision: https://reviews.llvm.org/D27732 llvm-svn: 291245	2017-01-06 17:02:10 +00:00
Simon Pilgrim	9122793b15	[X86][AVX] Regenerate shuffle 128-bit tests. The EVEX -> VEX fix means that AVX/AVX512 code is more likely the same now. llvm-svn: 291242	2017-01-06 15:56:52 +00:00
Simon Pilgrim	10cc5d555f	[X86][AVX] Regenerate tzcnt tests. The EVEX -> VEX fix means that AVX/AVX512 code is more likely the same now. llvm-svn: 291241	2017-01-06 15:54:23 +00:00
Filipe Cabecinhas	4647b74b51	[ASan] Make ASan instrument variable-masked loads and stores Summary: Previously we only supported constant-masked loads and stores. Reviewers: kcc, RKSimon, pgousseau, gbedwell, vitalybuka Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28370 llvm-svn: 291238	2017-01-06 15:24:51 +00:00
Simon Pilgrim	d8333372bc	[CostModel][X86] Fix 512-bit SDIV/UDIV 'big' costs. Set the costs on the lowest target that supports the type. llvm-svn: 291229	2017-01-06 11:12:53 +00:00
Simon Pilgrim	441d1d35d2	[CostModel][X86] Add SDIV/UDIV cost tests for a wider range of targets Added a test demonstrating bug in AVX512 division costs llvm-svn: 291228	2017-01-06 11:02:40 +00:00
Daniel Jasper	965d802ec7	Move test input to directory called Inputs. It is a common convention that our internal test runner depends upon. llvm-svn: 291227	2017-01-06 10:22:15 +00:00

41919 Commits
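
Several of the commits listed above describe small arithmetic folds: udiv and urem of a value known to be unsigned-less-than the divisor (r291296, r291282), fdiv -x, -y -> fdiv x, y (r291611), and pow(x, -0.5) -> 1.0 / sqrt(x) (r291486). The following is a minimal Python sketch that checks the underlying identities on sample values. It is not LLVM code; the function names are hypothetical and chosen only for this illustration, and the pow/sqrt check compares values up to rounding rather than bit-for-bit.

```python
import math


def check_unsigned_folds(limit: int = 64) -> bool:
    """For every 0 <= v < c, check that v / c is 0 and v % c is v.

    This mirrors the udiv/urem folds, where range information
    is assumed to prove that v is unsigned-less-than c.
    """
    for c in range(1, limit):
        for v in range(c):
            if v // c != 0 or v % c != v:
                return False
    return True


def check_fdiv_negation(samples) -> bool:
    """Check that (-x) / (-y) equals x / y for each (x, y) pair, y != 0."""
    return all((-x) / (-y) == x / y for x, y in samples)


def check_pow_rsqrt(samples, rel_tol: float = 1e-12) -> bool:
    """Check that pow(x, -0.5) matches 1 / sqrt(x) up to rounding, x > 0."""
    return all(
        math.isclose(math.pow(x, -0.5), 1.0 / math.sqrt(x), rel_tol=rel_tol)
        for x in samples
    )
```

The checks are exhaustive only over the small integer range and the sample floats supplied by the caller, so they illustrate the identities rather than prove them.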