llvm-project

Commit Graph

Author	SHA1	Message	Date
Tim Northover	f520eff782	AArch64: use ldxp/stxp pair to implement 128-bit atomic loads. The ARM ARM is clear that 128-bit loads are only guaranteed to have been atomic if there has been a corresponding successful stxp. It's less clear for AArch32, so I'm leaving that alone for now. llvm-svn: 254524	2015-12-02 18:12:57 +00:00
Tom Stellard	e3b5aeaf83	AMDGPU/SI: Don't emit group segment global variables Summary: Only global or readonly segment variables should appear in object files. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15111 llvm-svn: 254519	2015-12-02 17:00:42 +00:00
David Majnemer	942003acc6	Do (A == C1 \|\| A == C2) -> (A & ~(C1 ^ C2)) == C1 rather than (A == C1 \|\| A == C2) -> (A \| (C1 ^ C2)) == C2 when C1 ^ C2 is a power of 2. Differential Revision: http://reviews.llvm.org/D14223 Patch by Amaury SECHET! llvm-svn: 254518	2015-12-02 16:15:07 +00:00
Rafael Espindola	9b04181d81	Add an interesting case we already get right. llvm-svn: 254514	2015-12-02 15:02:43 +00:00
Christof Douma	8b5dc2c94e	[AArch64]: Add support for Cortex-A35 Adds support for the new Cortex-A35 ARMv8-A core. llvm-svn: 254503	2015-12-02 11:53:44 +00:00
Nemanja Ivanovic	74e31bc929	Patch to fix a crash in the PowerPC back end due to ISD::ROTL and ISD::ROTR not being expanded. Test case included. llvm-svn: 254501	2015-12-02 10:36:24 +00:00
Hrvoje Varga	672b0f5582	[mips][microMIPS] Implement PREPEND, RADDU.W.QB, RDDSP, REPL.PH, REPL.QB, REPLV.PH, REPLV.QB and MTHLIP instructions Differential Revision: http://reviews.llvm.org/D14527 llvm-svn: 254496	2015-12-02 09:31:24 +00:00
Simon Pilgrim	3fc3454a0c	[X86][FMA] Optimize FNEG(FMUL) Patterns On FMA targets, we can avoid having to load a constant to negate a float/double multiply by instead using a FNMSUB (-(X*Y)-0) Fix for PR24366 Differential Revision: http://reviews.llvm.org/D14909 llvm-svn: 254495	2015-12-02 09:07:55 +00:00
Elena Demikhovsky	a1a40cce9f	AVX-512: Updated cost of FP/SINT/UINT conversion operations I checked and updated the cost of AVX-512 conversion operations. Added cost of conversion operations in DQ mode. Conversion of illegal types that requires vector split is not calculated right now (like for other X86 targets). Differential Revision: http://reviews.llvm.org/D15074 llvm-svn: 254494	2015-12-02 08:59:47 +00:00
Asaf Badouh	2489f350c0	[X86][AVX512] add comi with Sae add builtin_ia32_vcomisd and builtin_ia32_vcomisd Differential Revision: http://reviews.llvm.org/D14331 llvm-svn: 254493	2015-12-02 08:17:51 +00:00
David Blaikie	b073cb9be2	[llvm-dwp] Emit a rather fictional debug_cu_index This is very rudimentary support for debug_cu_index, but it is enough to allow llvm-dwarfdump to find the offsets for contributions and correctly dump debug_info. It will need to actually find the real signature of the unit and build the real hash table with the right number of buckets, as per the DWP specification. It will also need to be expanded to cover the tu_index as well. llvm-svn: 254489	2015-12-02 06:21:34 +00:00
Quentin Colombet	f1e91c8bf1	[X86] Make sure the prologue does not clobber EFLAGS when it lives accross it. This is a superset of the fix done in r254448. This fixes PR25607. llvm-svn: 254478	2015-12-02 01:22:54 +00:00
Tim Northover	f3be9d5c0b	AArch64: fix 128-bit shifts We mustn't introduce a shift of exactly 64-bits for any inputs, since that's an UNDEF value (and worse, it's not what you want with the natural Arch64 implementation). The generated code is pretty horrific, but I couldn't come up with an obviously better alternative (if the amount is constant EXTR could help). Turns out 128-bit shifts are just nasty. rdar://22491037 llvm-svn: 254475	2015-12-02 00:33:54 +00:00
Matt Arsenault	592d068198	AMDGPU: Error on addrspacecasts that aren't actually implemented llvm-svn: 254469	2015-12-01 23:04:05 +00:00
Matt Arsenault	f9bfeafd00	AMDGPU: Implement isNoopAddrSpaceCast llvm-svn: 254468	2015-12-01 23:04:00 +00:00
Matt Arsenault	3b15967008	AMDGPU: Disallow flat_scr in SI assembler llvm-svn: 254459	2015-12-01 20:31:08 +00:00
Quentin Colombet	9cb01aa30a	[X86] Make sure the prologue does not clobber EFLAGS when it lives accross it. This fixes PR25629. llvm-svn: 254448	2015-12-01 19:49:31 +00:00
Artyom Skrobov	5d1f2524a0	Fix Thumb1 epilogue generation Summary: This had been broken for a very long time, but nobody noticed until D14357 enabled shrink-wrapping by default. Reviewers: jroelofs, qcolombet Subscribers: tyomitch, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D14986 llvm-svn: 254444	2015-12-01 19:25:11 +00:00
David Blaikie	bb94e440d5	[llvm-dwp] Deduplicate strings in the debug_str.dwo section Also, ensure that references to those strings in debug_str_offsets.dwo correctly refer to the deduplicated strings. llvm-svn: 254441	2015-12-01 19:17:58 +00:00
Weiming Zhao	56ab51870c	[AArch64] Fix a corner case in BitFeild select Summary: When not useful bits, BitWidth becomes 0 and APInt will not be happy. See https://llvm.org/bugs/show_bug.cgi?id=25571 We can just mark the operand as IMPLICIT_DEF is none bits of it is used. Reviewers: t.p.northover, jmolloy Subscribers: gberry, jmolloy, mgrang, aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D14803 llvm-svn: 254440	2015-12-01 19:17:49 +00:00
Matt Arsenault	e830f5427b	AMDGPU: Report extractelement as free in cost model The cost for scalarized operations is computed as N * (scalar operation cost + 1 extractelement + 1 insertelement). This partially fixes inflating the cost of scalarized operations since every operation is scalarized and free. I don't think we want any cost asociated with scalarization, but for now insertelement is still counted. I'm not sure if we should pretend that insertelement is also free, or add a way to compute a custom scalarization cost. llvm-svn: 254438	2015-12-01 19:08:39 +00:00
David Blaikie	98ad82a6a1	[llvm-dwp] Correctly update debug_str_offsets.dwo when linking dwo files This doesn't deduplicate strings in the debug_str section, nor does it properly wire up the index so that debug_info can /find/ these strings, but it does correct the str_offsets specifically. Follow up patches to address those related/next issues. llvm-svn: 254431	2015-12-01 18:07:07 +00:00
Rafael Espindola	b318fcbd8b	Simplify test. NFC. llvm-svn: 254419	2015-12-01 15:46:46 +00:00
Rafael Espindola	baa3bf8f76	Bring r254336 back: The difference is that now we don't error on out-of-comdat access to internal global values. We copy them instead. This seems to match the expectation of COFF linkers (see pr25686). Original message: Start deciding earlier what to link. A traditional linker is roughly split in symbol resolution and "copying stuff". The two tasks are badly mixed in lib/Linker. This starts splitting them apart. With this patch there are no direct call to linkGlobalValueBody or linkGlobalValueProto. Everything is linked via WapValue. This also includes a few fixes: * A GV goes undefined if the comdat is dropped (comdat11.ll). * We error if an internal GV goes undefined (comdat13.ll). * We don't link an unused comdat. The first two match the behavior of an ELF linker. The second one is equivalent to running globaldce on the input. llvm-svn: 254418	2015-12-01 15:19:48 +00:00
Elena Demikhovsky	aa1f17ea95	AVX-512: regenerated test for avx512 arithmetics, NFC llvm-svn: 254410	2015-12-01 12:35:03 +00:00
Elena Demikhovsky	0781d7b2b4	Fixed a failure in cost calculation for vector GEP Cost calculation for vector GEP failed with due to invalid cast to GEP index operand. The bug is fixed, added a test. http://reviews.llvm.org/D14976 llvm-svn: 254408	2015-12-01 12:08:36 +00:00
Hrvoje Varga	e51b0e13f3	[mips][microMIPS] Implement RECIP.fmt, RINT.fmt, ROUND.L.fmt, ROUND.W.fmt, SEL.fmt, SELEQZ.fmt, SELNEQZ.fmt and CLASS.fmt Differential Revision: http://reviews.llvm.org/D13885 llvm-svn: 254405	2015-12-01 11:59:21 +00:00
Yury Gribov	d7dbb66eb8	Introduce new @llvm.get.dynamic.area.offset.i{32, 64} intrinsics. The @llvm.get.dynamic.area.offset.* intrinsic family is used to get the offset from native stack pointer to the address of the most recent dynamic alloca on the caller's stack. These intrinsics are intendend for use in combination with @llvm.stacksave and @llvm.restore to get a pointer to the most recent dynamic alloca. This is useful, for example, for AddressSanitizer's stack unpoisoning routines. Patch by Max Ostapenko. Differential Revision: http://reviews.llvm.org/D14983 llvm-svn: 254404	2015-12-01 11:40:55 +00:00
Oliver Stannard	a34e47066e	[AArch64] Add ARMv8.2-A Statistical Profiling Extension The Statistical Profiling Extension is an optional extension to ARMv8.2-A. Since it is an optional extension, I have added the FeatureSPE subtarget feature to control it. The assembler-visible parts of this extension are the new "psb csync" instruction, which is equivalent to "hint #17", and a number of system registers. Differential Revision: http://reviews.llvm.org/D15021 llvm-svn: 254401	2015-12-01 10:48:51 +00:00
Oliver Stannard	4667071574	[ARM] Add ARMv8.2-A to TargetParser Add ARMv8.2-A to TargetParser, so that it can be used by the clang command-line options and the .arch directive. Most testing of this will be done in clang, checking that the command-line options that this enables work. Differential Revision: http://reviews.llvm.org/D15037 llvm-svn: 254400	2015-12-01 10:33:56 +00:00
NAKAMURA Takumi	54d90f46c5	llvm/test/DebugInfo/X86/safestack-byval.ll: Give an explicit triple for now. It crashes for targeting *-win32. Also revert r254375 and r254361. llvm-svn: 254397	2015-12-01 10:07:41 +00:00
NAKAMURA Takumi	8bd0f0b141	Move llvm/test/DebugInfo/Generic/safestack-byval.ll to X86. It depends on x86-64. llvm-svn: 254396	2015-12-01 10:07:37 +00:00
Cong Hou	d97c100dc4	Replace all weight-based interfaces in MBB with probability-based interfaces, and update all uses of old interfaces. (This is the second attempt to submit this patch. The first caused two assertion failures and was reverted. See https://llvm.org/bugs/show_bug.cgi?id=25687) The patch in http://reviews.llvm.org/D13745 is broken into four parts: 1. New interfaces without functional changes (http://reviews.llvm.org/D13908). 2. Use new interfaces in SelectionDAG, while in other passes treat probabilities as weights (http://reviews.llvm.org/D14361). 3. Use new interfaces in all other passes. 4. Remove old interfaces. This patch is 3+4 above. In this patch, MBB won't provide weight-based interfaces any more, which are totally replaced by probability-based ones. The interface addSuccessor() is redesigned so that the default probability is unknown. We allow unknown probabilities but don't allow using it together with known probabilities in successor list. That is to say, we either have a list of successors with all known probabilities, or all unknown probabilities. In the latter case, we assume each successor has 1/N probability where N is the number of successors. An assertion checks if the user is attempting to add a successor with the disallowed mixed use as stated above. This can help us catch many misuses. All uses of weight-based interfaces are now updated to use probability-based ones. Differential revision: http://reviews.llvm.org/D14973 llvm-svn: 254377	2015-12-01 05:29:22 +00:00
Colin LeMahieu	309fb1877e	[Hexagon] Disabling failing safestack test llvm-svn: 254375	2015-12-01 04:56:25 +00:00
Hans Wennborg	1dbaf67537	Revert r254348: "Replace all weight-based interfaces in MBB with probability-based interfaces, and update all uses of old interfaces." and the follow-up r254356: "Fix a bug in MachineBlockPlacement that may cause assertion failure during BranchProbability construction." Asserts were firing in Chromium builds. See PR25687. llvm-svn: 254366	2015-12-01 03:49:42 +00:00
NAKAMURA Takumi	09eff05c0b	llvm/test/DebugInfo/Generic/safestack-byval.ll is using tls. llvm-svn: 254361	2015-12-01 01:15:03 +00:00
NAKAMURA Takumi	23183f3bba	check-llvm: Introduce the new feature "tls". llvm-svn: 254360	2015-12-01 01:14:58 +00:00
David Blaikie	32aa0495e8	[llvm-dwp] Add missing dependency from llvm tests on the llvm-dwp tool llvm-svn: 254357	2015-12-01 00:57:05 +00:00
David Blaikie	242b948817	[llvm-dwp] Initial partial prototype This just concatenates the common DWP sections without doing any of the fancy DWP things like: 1) update str_offsets 2) deduplicating strings 3) merging/creating cu/tu_index Patches for these will follow shortly. (also not sure about target triple/object file type for this tool - do I really need a whole triple just to write an object file that contains purely static/hardcoded bytes in each section? & I guess I should just pick it based on the first input, maybe, rather than hardcoding for now - but we only produce .dwo on ELF platforms with objcopy for now anyway) llvm-svn: 254355	2015-12-01 00:48:39 +00:00
Evgeniy Stepanov	42f3b12274	[safestack] Protect byval function arguments. Detect unsafe byval function arguments and move them to the unsafe stack. llvm-svn: 254353	2015-12-01 00:40:05 +00:00
Evgeniy Stepanov	fd07995363	Extend debug info for function parameters in SDAG. SDAG currently can emit debug location for function parameters when an llvm.dbg.declare points to either a function argument SSA temp, or to an AllocaInst. This change extends this logic by adding a fallback case when neither of the above is true. This is required for SafeStack, which may copy the contents of a byval function argument into something that is not an alloca, and then describe the target as the new location of the said argument. llvm-svn: 254352	2015-12-01 00:34:30 +00:00
Evgeniy Stepanov	a4ac3f4bdf	[safestack] Fix handling of array allocas. The current code does not take alloca array size into account and, as a result, considers any access past the first array element to be unsafe. llvm-svn: 254350	2015-12-01 00:06:13 +00:00
Cong Hou	fa1917c673	Replace all weight-based interfaces in MBB with probability-based interfaces, and update all uses of old interfaces. The patch in http://reviews.llvm.org/D13745 is broken into four parts: 1. New interfaces without functional changes (http://reviews.llvm.org/D13908). 2. Use new interfaces in SelectionDAG, while in other passes treat probabilities as weights (http://reviews.llvm.org/D14361). 3. Use new interfaces in all other passes. 4. Remove old interfaces. This patch is 3+4 above. In this patch, MBB won't provide weight-based interfaces any more, which are totally replaced by probability-based ones. The interface addSuccessor() is redesigned so that the default probability is unknown. We allow unknown probabilities but don't allow using it together with known probabilities in successor list. That is to say, we either have a list of successors with all known probabilities, or all unknown probabilities. In the latter case, we assume each successor has 1/N probability where N is the number of successors. An assertion checks if the user is attempting to add a successor with the disallowed mixed use as stated above. This can help us catch many misuses. All uses of weight-based interfaces are now updated to use probability-based ones. Differential revision: http://reviews.llvm.org/D14973 llvm-svn: 254348	2015-12-01 00:02:51 +00:00
Rafael Espindola	e9841a6bb5	This reverts commit r254336 and r254344. They broke a bot and I am debugging why. llvm-svn: 254347	2015-11-30 23:54:19 +00:00
Rafael Espindola	a891957002	Disable a consistency check. Trying to figure out why it fails on a bot but passes locally. llvm-svn: 254344	2015-11-30 23:05:25 +00:00
Sanjay Patel	8b1fb3daba	[InstCombine] add tests to show potential vector IR shuffle transforms llvm-svn: 254342	2015-11-30 22:39:36 +00:00
Simon Pilgrim	db26b3ddfa	[X86][FMA4] Prefer FMA4 to FMA We currently output FMA instructions on targets which support both FMA4 + FMA (i.e. later Bulldozer CPUS bdver2/bdver3/bdver4). This patch flips this so FMA4 is preferred; this is for several reasons: 1 - FMA4 is non-destructive reducing the need for mov instructions. 2 - Its more straighforward to commute and fold inputs (although the recent work on FMA has reduced this difference). 3 - All supported targets have FMA4 performance equal or better to FMA - Piledriver (bdver2) in particular has half the throughput when executing FMA instructions. Its looks like no future AMD processor lines will support FMA4 after the Bulldozer series so we're not causing problems for later CPUs. Differential Revision: http://reviews.llvm.org/D14997 llvm-svn: 254339	2015-11-30 22:22:06 +00:00
Rafael Espindola	c109200c53	Start deciding earlier what to link. A traditional linker is roughly split in symbol resolution and "copying stuff". The two tasks are badly mixed in lib/Linker. This starts splitting them apart. With this patch there are no direct call to linkGlobalValueBody or linkGlobalValueProto. Everything is linked via WapValue. This also includes a few fixes: * A GV goes undefined if the comdat is dropped (comdat11.ll). * We error if an internal GV goes undefined (comdat13.ll). * We don't link an unused comdat. The first two match the behavior of an ELF linker. The second one is equivalent to running globaldce on the input. llvm-svn: 254336	2015-11-30 22:01:43 +00:00
Paul Robinson	a2550a6da3	Have 'optnone' respect the -fast-isel=false option. This is primarily useful for debugging optnone v. ISel issues. Differential Revision: http://reviews.llvm.org/D14792 llvm-svn: 254335	2015-11-30 21:56:16 +00:00
Cong Hou	eb9c7056f0	[X86] Update test/CodeGen/X86/avg.ll with the help of update_llc_test_checks.py. NFC. llvm-svn: 254334	2015-11-30 21:46:08 +00:00
Matt Arsenault	26f8f3db39	AMDGPU: Rework how private buffer passed for HSA If we know we have stack objects, we reserve the registers that the private buffer resource and wave offset are passed and use them directly. If not, reserve the last 5 SGPRs just in case we need to spill. After register allocation, try to pick the next available registers instead of the last SGPRs, and then insert copies from the inputs to the reserved registers in the progloue. This also only selectively enables all of the input registers which are really required instead of always enabling them. llvm-svn: 254331	2015-11-30 21:16:03 +00:00
Matt Arsenault	0e3d38937e	AMDGPU: Remove SIPrepareScratchRegs It does not work because of emergency stack slots. This pass was supposed to eliminate dummy registers for the spill instructions, but the register scavenger can introduce more during PrologEpilogInserter, so some would end up left behind if they were needed. The potential for spilling the scratch resource descriptor and offset register makes doing something like this overly complicated. Reserve registers to use for the resource descriptor and use them directly in eliminateFrameIndex. Also removes creating another scratch resource descriptor when directly selecting scratch MUBUF instructions. The choice of which registers are reserved is temporary. For now it attempts to pick the next available registers after the user and system SGPRs. llvm-svn: 254329	2015-11-30 21:15:53 +00:00
Matt Arsenault	ff6da2fe89	AMDGPU: Use assert zext for workgroup sizes llvm-svn: 254328	2015-11-30 21:15:45 +00:00
Quentin Colombet	cdad10f333	[ARM] For old thumb ISA like v4t, we cannot use PC directly in pop. Fix the epilogue emission to account for that. llvm-svn: 254325	2015-11-30 20:37:58 +00:00
Reid Kleckner	8a71273d89	Avoid writing to source directory of tests llvm-svn: 254324	2015-11-30 20:36:23 +00:00
Davide Italiano	9c26161b2e	[SimplifyLibCalls] Remove useless bits of this tests. llvm-svn: 254318	2015-11-30 19:38:35 +00:00
Davide Italiano	1aeed6a955	[SimplifyLibCalls] Transform log(exp2(y)) to y*log(2) under fast-math. llvm-svn: 254317	2015-11-30 19:36:35 +00:00
David Majnemer	bf4119faf6	[X86] Add RIP to GR64_TCW64 The MachineVerifier wants to check that the register operands of an instruction belong to the instruction's register class. RIP-relative control flow instructions violated this by referencing RIP. While this was fixed for SysV, it was never fixed for Win64. llvm-svn: 254315	2015-11-30 19:04:19 +00:00
Kit Barton	f4ce2f3a9e	Enable shrink wrapping for PPC64 Re-enable shrink wrapping for PPC64 Little Endian. One minor modification to PPCFrameLowering::findScratchRegister was necessary to handle fall-thru blocks (blocks with no terminator) correctly. Tested with all LLVM test, clang tests, and the self-hosting build, with no problems found. PHabricator: http://reviews.llvm.org/D14778 llvm-svn: 254314	2015-11-30 18:59:41 +00:00
Rafael Espindola	c98b20b0d6	Fix another llvm.ctors merging bug. We were not looking past casts to see if an element should be included or not. llvm-svn: 254313	2015-11-30 18:54:24 +00:00
Matt Arsenault	ea03cf2fa1	AMDGPU: Don't reserve SCRATCH_PTR input register This hasn't been doing anything since using relocations was added. llvm-svn: 254304	2015-11-30 15:46:47 +00:00
Hrvoje Varga	c03957f049	[mips][microMIPS] Implement LBUX, LHX, LWX, MAQ_S[A].W.PHL, MAQ_S[A].W.PHR, MFHI, MFLO, MTHI and MTLO instructions Differential Revision: http://reviews.llvm.org/D14436 llvm-svn: 254297	2015-11-30 12:58:39 +00:00
Zoran Jovanovic	a887b36167	[mips][microMIPS] Fix issue with offset operand of BALC and BC instructions Value of offset operand for microMIPS BALC and BC instructions is currently shifted 2 bits, but it should be 1 bit. Differential Revision: http://reviews.llvm.org/D14770 llvm-svn: 254296	2015-11-30 12:56:18 +00:00
Igor Breger	ea7932cfb7	AVX512: regenerate avx512bw intrincics tests results. Differential Revision: http://reviews.llvm.org/D15069 llvm-svn: 254295	2015-11-30 10:40:52 +00:00
Daniel Sanders	d32db286a0	[mips][ias] Removed MSA instructions from base architecture valid-xfail.s's. valid-xfail.s is for instructions that should be valid in the given ISA but incorrectly fail. MSA instructions are correct to fail since MSA is not enabled. llvm-svn: 254293	2015-11-30 09:52:00 +00:00
Zlatko Buljan	56f3b0e410	[mips][microMIPS] Implement PRECR.QB.PH, PRECR_SRA[_R].PH.W, PRECRQ.PH.W, PRECRQ.QB.PH, PRECRQU_S.QB.PH and PRECRQ_RS.PH.W instructions Differential Revision: http://reviews.llvm.org/D14605 llvm-svn: 254291	2015-11-30 08:37:38 +00:00
Craig Topper	ecae476e4c	[X86] int_x86_avx2_permps and X86ISD::VPERMV should take an integer vector for its shuffle indices. llvm-svn: 254269	2015-11-29 22:53:22 +00:00
Davide Italiano	0b14f29285	[SimplifyLibCalls] Don't crash if the function doesn't have a name. llvm-svn: 254265	2015-11-29 21:58:56 +00:00
Davide Italiano	b8b7133c94	[SimplifyLibCalls] Tranform log(pow(x, y)) -> ylog(x). This one is enabled only under -ffast-math. There are cases where the difference between the value computed and the correct value is huge even for ffast-math, e.g. as Steven pointed out: x = -1, y = -4 log(pow(-1), 4) = 0 4log(-1) = NaN I checked what GCC does and apparently they do the same optimization (which result in the dramatic difference). Future work might try to make this (slightly) less worse. Differential Revision: http://reviews.llvm.org/D14400 llvm-svn: 254263	2015-11-29 20:58:04 +00:00
Simon Pilgrim	88aa627c0b	[X86][SSE] Added support for lowering to ADDSUBPS/ADDSUBPD with commuted inputs We could already recognise shuffle(FSUB, FADD) -> ADDSUB, this allow us to recognise shuffle(FADD, FSUB) -> ADDSUB by commuting the shuffle mask prior to matching. llvm-svn: 254259	2015-11-29 16:41:04 +00:00
Rafael Espindola	3f85d24df4	Add a passing test. When a comdat is discarded, any globals defined in it become undefined. llvm-svn: 254258	2015-11-29 15:52:12 +00:00
Rafael Espindola	c73fdcd1b7	Don't depend on the order the IR is copied. llvm-svn: 254257	2015-11-29 15:22:49 +00:00
Rafael Espindola	94247d0860	Don't depend on the order the IR is copied. llvm-svn: 254256	2015-11-29 15:08:39 +00:00
Rafael Espindola	290409ef5d	Make this test less strict. We just want to test what is copied, no the order. llvm-svn: 254255	2015-11-29 14:53:06 +00:00
Igor Breger	e293e83f5d	AVX512:Implemented encoding for the vmovq.s instruction. Differential Revision: http://reviews.llvm.org/D14810 llvm-svn: 254248	2015-11-29 07:41:26 +00:00
Rafael Espindola	c945c8d22e	Correctly handle llvm.global_ctors merging. We were not handling the case where an entry must be dropped and the destination module has no llvm.global_ctors. llvm-svn: 254241	2015-11-29 03:29:42 +00:00
Rafael Espindola	9f30fac4d8	Fix a crash when writing merged bitcode. Playing with mutateType in here was making getValueType and getType incompatible. llvm-svn: 254240	2015-11-29 03:21:30 +00:00
Simon Pilgrim	4c5ab52a54	[X86][AVX] Regenerate ADDSUB tests Tidied up triple and regenerate tests using update_llc_test_checks.py llvm-svn: 254237	2015-11-28 19:20:49 +00:00
Renato Golin	5dbc8a5283	Revert "[ARM] Generate ABI_optimization_goals build attribute, as described in the ARM ARM." This reverts commit r254201 and r254202, as it broke test-suite, self-hosting and sanitizer tests on ARM buildbots. llvm-svn: 254234	2015-11-28 17:23:46 +00:00
Simon Pilgrim	d9bb73b236	[X86][FMA] Added 512-bit tests to match 128/256-bit tests coverage As discussed on D14909 llvm-svn: 254233	2015-11-28 16:04:24 +00:00
Simon Pilgrim	82f663d755	[X86][FMA] More thorough FMA tests Added FMADD/FMSUB/FNMADD/FNMSUB tests for all types Added load folding tests for 512-bit vectors NOTE: Many of the AVX512 FMA instructions don't yet commute/fold correctly As discussed on D14909 llvm-svn: 254232	2015-11-28 14:28:44 +00:00
Simon Pilgrim	29412ee45f	[X86][AVX2] Tidied up PBROADCAST tests Tidied up triple and regenerate tests using update_llc_test_checks.py llvm-svn: 254231	2015-11-28 14:15:40 +00:00
NAKAMURA Takumi	50024613e7	llvm/test/CodeGen/SystemZ/alloca-04.ll REQUIRES asserts due to -debug-pass. llvm-svn: 254230	2015-11-28 13:05:49 +00:00
Jonas Paulsson	f12b925bb1	[Stack realignment] Handling of aligned allocas. This patch implements dynamic realignment of stack objects for targets with a non-realigned stack pointer. Behaviour in FunctionLoweringInfo is changed so that for a target that has StackRealignable set to false, over-aligned static allocas are considered to be variable-sized objects and are handled with DYNAMIC_STACKALLOC nodes. It would be good to group aligned allocas into a single big alloca as an optimization, but this is yet todo. SystemZ benefits from this, due to its stack frame layout. New tests SystemZ/alloca-03.ll for aligned allocas, and SystemZ/alloca-04.ll for "no-realign-stack" attribute on functions. Review and help from Ulrich Weigand and Hal Finkel. llvm-svn: 254227	2015-11-28 11:02:32 +00:00
Rafael Espindola	5aafbac081	Pass .ll directly to llvm-link. llvm-svn: 254214	2015-11-27 23:47:15 +00:00
Rafael Espindola	57e61231ad	Pass .ll directly to llvm-link llvm-svn: 254213	2015-11-27 23:21:45 +00:00
Diego Novillo	84f06cc835	SamplePGO - Add initial support for inliner annotations. This adds two thresholds to the sample profiler to affect inlining decisions: the concept of global hotness and coldness. Functions that have accumulated more than a certain fraction of samples at runtime, are annotated with the InlineHint attribute. Conversely, functions that accumulate less than a certain fraction of samples, are annotated with the Cold attribute. This is very similar to the hints emitted by Clang when using instrumentation profiles. Notice that this is a very blunt instrument. A function may have globally collected a significant fraction of samples, but that does not necessarily mean that every callsite for that function is hot. Ideally, we would annotate each callsite with the samples collected at that callsite. This way, the inliner can incorporate all these weights into its cost model. Once the inliner offers this functionality, we can change the hints emitted here to a more precise per-callsite annotation. For now, this is providing some measure of speedups with our internal benchmarks. I've observed speedups of up to 23% (though the geo mean is about 3%). I expect these numbers to improve as the inliner gets better annotations. llvm-svn: 254212	2015-11-27 23:14:51 +00:00
Rafael Espindola	138f895655	Modernize the test a bit Remove out of date comment. Pass .ll files to llvm-link. llvm-svn: 254210	2015-11-27 23:13:17 +00:00
Artyom Skrobov	b955b90509	[ARM] Generate ABI_optimization_goals build attribute, as described in the ARM ARM. Summary: Since this build attribute corresponds to a whole module, and different functions in a module may differ in the optimizations enabled for them, this attribute is emitted after all functions, and only in the case that the optimization goals for all functions match. Reviewers: logan, hans Subscribers: aemerson, rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D14934 llvm-svn: 254201	2015-11-27 15:30:51 +00:00
Oliver Stannard	b25914e03f	[AArch64] Add ARMv8.2-A FP16 scalar instructions ARMv8.2-A adds 16-bit floating point versions of all existing VFP floating-point instructions. This is an optional extension, so all of these instructions require the FeatureFullFP16 subtarget feature. Most of these instructions are the same as the 32- and 64-bit versions, but with the type field (bits 23-22) set to 0b11. Previously the top bit of the size field was always 0, so the instruction classes only provided a 1-bit size field, which I have widened to 2 bits. Differential Revision: http://reviews.llvm.org/D15014 llvm-svn: 254198	2015-11-27 13:04:48 +00:00
Adhemerval Zanella	d93c0c4dc4	[sanitizer] [dfsan] Unify aarch64 mapping This patch changes the DFSan instrumentation for aarch64 to instead of using fixes application mask defined by SANITIZER_AARCH64_VMA to read the application shadow mask value from compiler-rt. The value is initialized based on runtime VAM detection. Along with this patch a compiler-rt one will also be added to export the shadow mask variable. llvm-svn: 254196	2015-11-27 12:42:39 +00:00
Andrew Wilkins	522eb9c57d	test: bail early if tool_path is None tool_path will be None for llvm-go if Go cannot be found llvm-svn: 254190	2015-11-27 05:07:26 +00:00
Andrew Wilkins	572fe6e95e	test: check if go_executable is set llvm-svn: 254189	2015-11-27 04:51:13 +00:00
Andrew Wilkins	caa3b51ad2	Use $GO_EXECUTABLE in Go-based lit tests Summary: When running tests, pass the GO_EXECUTABLE CMake cache variable to llvm-go. The "go" binary may not be in $PATH, or may be different to the one passed to CMake. Reviewers: pcc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14041 llvm-svn: 254187	2015-11-27 04:44:51 +00:00
Rafael Espindola	8e8183b8bd	Test both input file orders. llvm-svn: 254186	2015-11-27 03:50:34 +00:00
Rafael Espindola	60b57863a0	Add missing file. llvm-svn: 254185	2015-11-27 03:47:29 +00:00
Rafael Espindola	1d3465f641	Make the test a bit more interesting. It now covers a regular function replacing an available_externally one. llvm-svn: 254184	2015-11-27 02:07:37 +00:00
Peter Collingbourne	8359a6a83e	MC: Simplify handling of temporary symbols in COFF writer. The COFF object writer was previously adding unnecessary symbols to its temporary data structures and cleaning them up later. This made the code harder to understand and caused a bug (aliases classed as temporary symbols would cause an assertion failure). A much simpler way of handling such symbols is to ask the layout for their section-relative position when needed. Tested with a bootstrap on Windows and by building Chrome. Differential Revision: http://reviews.llvm.org/D14975 llvm-svn: 254183	2015-11-26 23:29:27 +00:00
Simon Pilgrim	1d881ae225	[X86][FMA] Begun adding AVX512 FMA tests As discussed on D14909 llvm-svn: 254180	2015-11-26 20:53:28 +00:00
Charlie Turner	54336a5a4e	[LoopVectorize] Use MapVector rather than DenseMap for MinBWs. The order in which instructions are truncated in truncateToMinimalBitwidths effects code generation. Switch to a map with a determinisic order, since the iteration order over a DenseMap is not defined. This code is not hot, so the difference in container performance isn't interesting. Many thanks to David Blaikie for making me aware of MapVector! Fixes PR25490. Differential Revision: http://reviews.llvm.org/D14981 llvm-svn: 254179	2015-11-26 20:39:51 +00:00

1 2 3 4 5 ...

33297 Commits