llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	6164297f46	[X86] Fix weird identation. NFC llvm-svn: 254487	2015-12-02 05:24:38 +00:00
Quentin Colombet	bbdebefff6	[X86] Fix a think-o when checking if the eflags needs to be preserved. llvm-svn: 254480	2015-12-02 02:07:00 +00:00
Quentin Colombet	f1e91c8bf1	[X86] Make sure the prologue does not clobber EFLAGS when it lives accross it. This is a superset of the fix done in r254448. This fixes PR25607. llvm-svn: 254478	2015-12-02 01:22:54 +00:00
Tim Northover	f3be9d5c0b	AArch64: fix 128-bit shifts We mustn't introduce a shift of exactly 64-bits for any inputs, since that's an UNDEF value (and worse, it's not what you want with the natural Arch64 implementation). The generated code is pretty horrific, but I couldn't come up with an obviously better alternative (if the amount is constant EXTR could help). Turns out 128-bit shifts are just nasty. rdar://22491037 llvm-svn: 254475	2015-12-02 00:33:54 +00:00
Matt Arsenault	592d068198	AMDGPU: Error on addrspacecasts that aren't actually implemented llvm-svn: 254469	2015-12-01 23:04:05 +00:00
Matt Arsenault	f9bfeafd00	AMDGPU: Implement isNoopAddrSpaceCast llvm-svn: 254468	2015-12-01 23:04:00 +00:00
Matthias Braun	b258d794dd	ARM: Change ArchCheck field to uint64_t The values in this field are compared against getAvailableFeatures() which returns an uint64_t. This was causing problems in an internal branch. llvm-svn: 254462	2015-12-01 21:48:52 +00:00
Matt Arsenault	3b15967008	AMDGPU: Disallow flat_scr in SI assembler llvm-svn: 254459	2015-12-01 20:31:08 +00:00
Matt Arsenault	856d1928a8	AMDGPU: Optimize VOP2 operand legalization Don't use commuteInstruction, and don't commute if doing so will not improve legality. Skip the more complex checks for literal operands and constant bus restrictions, which are not a concern for VOP2 instructions because src1 does not accept SGPRs or constants and few implicitly read vcc. This gets called quite a few times and the attempts at commuting are a significant fraction of the time spent in SIFixSGPRCopies, so it's somewhat worthwhile to optimize. With this patch and others leading up to it, this reduces the compile time of SIFixSGPRCopies on some of the LuxMark 2 kernels from ~8ms to ~5ms on my system. llvm-svn: 254452	2015-12-01 19:57:17 +00:00
Quentin Colombet	9cb01aa30a	[X86] Make sure the prologue does not clobber EFLAGS when it lives accross it. This fixes PR25629. llvm-svn: 254448	2015-12-01 19:49:31 +00:00
Artyom Skrobov	5d1f2524a0	Fix Thumb1 epilogue generation Summary: This had been broken for a very long time, but nobody noticed until D14357 enabled shrink-wrapping by default. Reviewers: jroelofs, qcolombet Subscribers: tyomitch, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D14986 llvm-svn: 254444	2015-12-01 19:25:11 +00:00
Weiming Zhao	56ab51870c	[AArch64] Fix a corner case in BitFeild select Summary: When not useful bits, BitWidth becomes 0 and APInt will not be happy. See https://llvm.org/bugs/show_bug.cgi?id=25571 We can just mark the operand as IMPLICIT_DEF is none bits of it is used. Reviewers: t.p.northover, jmolloy Subscribers: gberry, jmolloy, mgrang, aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D14803 llvm-svn: 254440	2015-12-01 19:17:49 +00:00
Matt Arsenault	e830f5427b	AMDGPU: Report extractelement as free in cost model The cost for scalarized operations is computed as N * (scalar operation cost + 1 extractelement + 1 insertelement). This partially fixes inflating the cost of scalarized operations since every operation is scalarized and free. I don't think we want any cost asociated with scalarization, but for now insertelement is still counted. I'm not sure if we should pretend that insertelement is also free, or add a way to compute a custom scalarization cost. llvm-svn: 254438	2015-12-01 19:08:39 +00:00
Tom Stellard	38b7cbe3e0	AMDGPU/SI: Remove REGISTER_STORE/REGISTER_LOAD code which is now dead Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15050 llvm-svn: 254427	2015-12-01 17:45:22 +00:00
Tom Stellard	ff63c25753	AMDGPU: Use the default strings for data emission directives Summary: This makes the assembly output look nicer and there is no reason to have custom strings for these. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D14671 llvm-svn: 254426	2015-12-01 17:45:17 +00:00
Sanjay Patel	60216f6943	[x86] add a convenience method to check for FMA capability; NFCI llvm-svn: 254425	2015-12-01 17:27:55 +00:00
Elena Demikhovsky	0d0692d854	AVX-512: fixed asm string of vsqrtss (vvsqrtss was generated before) llvm-svn: 254411	2015-12-01 12:43:46 +00:00
Hrvoje Varga	e51b0e13f3	[mips][microMIPS] Implement RECIP.fmt, RINT.fmt, ROUND.L.fmt, ROUND.W.fmt, SEL.fmt, SELEQZ.fmt, SELNEQZ.fmt and CLASS.fmt Differential Revision: http://reviews.llvm.org/D13885 llvm-svn: 254405	2015-12-01 11:59:21 +00:00
Yury Gribov	d7dbb66eb8	Introduce new @llvm.get.dynamic.area.offset.i{32, 64} intrinsics. The @llvm.get.dynamic.area.offset.* intrinsic family is used to get the offset from native stack pointer to the address of the most recent dynamic alloca on the caller's stack. These intrinsics are intendend for use in combination with @llvm.stacksave and @llvm.restore to get a pointer to the most recent dynamic alloca. This is useful, for example, for AddressSanitizer's stack unpoisoning routines. Patch by Max Ostapenko. Differential Revision: http://reviews.llvm.org/D14983 llvm-svn: 254404	2015-12-01 11:40:55 +00:00
Oliver Stannard	a34e47066e	[AArch64] Add ARMv8.2-A Statistical Profiling Extension The Statistical Profiling Extension is an optional extension to ARMv8.2-A. Since it is an optional extension, I have added the FeatureSPE subtarget feature to control it. The assembler-visible parts of this extension are the new "psb csync" instruction, which is equivalent to "hint #17", and a number of system registers. Differential Revision: http://reviews.llvm.org/D15021 llvm-svn: 254401	2015-12-01 10:48:51 +00:00
Oliver Stannard	4667071574	[ARM] Add ARMv8.2-A to TargetParser Add ARMv8.2-A to TargetParser, so that it can be used by the clang command-line options and the .arch directive. Most testing of this will be done in clang, checking that the command-line options that this enables work. Differential Revision: http://reviews.llvm.org/D15037 llvm-svn: 254400	2015-12-01 10:33:56 +00:00
Oliver Stannard	8addbf4350	[ARM] Add subtarget features for ARMv8.2-A This adds subtarget features for ARMv8.2-A, which builds on (and requires the features from) ARMv8.1-A. Most assembler-visible features of ARMv8.2-A are system instructions, and are all required parts of the architecture, so just depend on the HasV8_2aOps subtarget feature. There is also one large, optional feature, which adds 16-bit floating point versions of all existing floating-point instructions (VFP and SIMD), this is represented by the FeatureFullFP16 subtarget feature. Differential Revision: http://reviews.llvm.org/D15036 llvm-svn: 254399	2015-12-01 10:23:06 +00:00
Craig Topper	c458c7c6c9	[X86] Fix patterns for memory forms of FP FSUBR and FDIVR. They need to have memory on the left hand side of the fsub/fdiv operations in their patterns. Not sure how to test this. I noticed by inspection in the isel tables where the same pattern tried to produce DIV and DIVR or SUB and SUBR. llvm-svn: 254388	2015-12-01 06:13:16 +00:00
Craig Topper	271f9ded44	[X86] Use range-based for loops. NFC llvm-svn: 254387	2015-12-01 06:13:15 +00:00
Craig Topper	ba894c3c0d	[X86] Use array_lengthof instead of calculating manually. Also change index types to size_t to match. llvm-svn: 254386	2015-12-01 06:13:13 +00:00
Craig Topper	ddc76f2bed	[Hexagon] Use std::begin() and std::end() instead of doing the same manually. NFC llvm-svn: 254385	2015-12-01 06:13:10 +00:00
Craig Topper	d824f5f0d9	[Hexagon] Use array_lengthof and const correct and type correct the array and array size. NFC llvm-svn: 254384	2015-12-01 06:13:08 +00:00
Craig Topper	6261e1b94d	Use array_lengthof instead of manually calculating it. NFC llvm-svn: 254383	2015-12-01 06:13:06 +00:00
Craig Topper	3da000c07f	[Hexagon] Use ArrayRef to avoid needing to calculate an array size. Interestingly the original code may have had a bug because it was passing the byte size of a uint16_t array instead of the number of entries. llvm-svn: 254382	2015-12-01 06:13:04 +00:00
Craig Topper	8072081b63	[ARM] Use range-based for loops to avoid the need for calculating an array size that I would have otherwise cconverted to array_lengthof. NFC llvm-svn: 254381	2015-12-01 06:13:01 +00:00
Cong Hou	d97c100dc4	Replace all weight-based interfaces in MBB with probability-based interfaces, and update all uses of old interfaces. (This is the second attempt to submit this patch. The first caused two assertion failures and was reverted. See https://llvm.org/bugs/show_bug.cgi?id=25687) The patch in http://reviews.llvm.org/D13745 is broken into four parts: 1. New interfaces without functional changes (http://reviews.llvm.org/D13908). 2. Use new interfaces in SelectionDAG, while in other passes treat probabilities as weights (http://reviews.llvm.org/D14361). 3. Use new interfaces in all other passes. 4. Remove old interfaces. This patch is 3+4 above. In this patch, MBB won't provide weight-based interfaces any more, which are totally replaced by probability-based ones. The interface addSuccessor() is redesigned so that the default probability is unknown. We allow unknown probabilities but don't allow using it together with known probabilities in successor list. That is to say, we either have a list of successors with all known probabilities, or all unknown probabilities. In the latter case, we assume each successor has 1/N probability where N is the number of successors. An assertion checks if the user is attempting to add a successor with the disallowed mixed use as stated above. This can help us catch many misuses. All uses of weight-based interfaces are now updated to use probability-based ones. Differential revision: http://reviews.llvm.org/D14973 llvm-svn: 254377	2015-12-01 05:29:22 +00:00
Hans Wennborg	1dbaf67537	Revert r254348: "Replace all weight-based interfaces in MBB with probability-based interfaces, and update all uses of old interfaces." and the follow-up r254356: "Fix a bug in MachineBlockPlacement that may cause assertion failure during BranchProbability construction." Asserts were firing in Chromium builds. See PR25687. llvm-svn: 254366	2015-12-01 03:49:42 +00:00
Matt Arsenault	456fdfcdc2	Squelch unused variable warning in SIRegisterInfo.cpp. Patch by Justin Lebar llvm-svn: 254362	2015-12-01 02:14:33 +00:00
Cong Hou	fa1917c673	Replace all weight-based interfaces in MBB with probability-based interfaces, and update all uses of old interfaces. The patch in http://reviews.llvm.org/D13745 is broken into four parts: 1. New interfaces without functional changes (http://reviews.llvm.org/D13908). 2. Use new interfaces in SelectionDAG, while in other passes treat probabilities as weights (http://reviews.llvm.org/D14361). 3. Use new interfaces in all other passes. 4. Remove old interfaces. This patch is 3+4 above. In this patch, MBB won't provide weight-based interfaces any more, which are totally replaced by probability-based ones. The interface addSuccessor() is redesigned so that the default probability is unknown. We allow unknown probabilities but don't allow using it together with known probabilities in successor list. That is to say, we either have a list of successors with all known probabilities, or all unknown probabilities. In the latter case, we assume each successor has 1/N probability where N is the number of successors. An assertion checks if the user is attempting to add a successor with the disallowed mixed use as stated above. This can help us catch many misuses. All uses of weight-based interfaces are now updated to use probability-based ones. Differential revision: http://reviews.llvm.org/D14973 llvm-svn: 254348	2015-12-01 00:02:51 +00:00
Simon Pilgrim	db26b3ddfa	[X86][FMA4] Prefer FMA4 to FMA We currently output FMA instructions on targets which support both FMA4 + FMA (i.e. later Bulldozer CPUS bdver2/bdver3/bdver4). This patch flips this so FMA4 is preferred; this is for several reasons: 1 - FMA4 is non-destructive reducing the need for mov instructions. 2 - Its more straighforward to commute and fold inputs (although the recent work on FMA has reduced this difference). 3 - All supported targets have FMA4 performance equal or better to FMA - Piledriver (bdver2) in particular has half the throughput when executing FMA instructions. Its looks like no future AMD processor lines will support FMA4 after the Bulldozer series so we're not causing problems for later CPUs. Differential Revision: http://reviews.llvm.org/D14997 llvm-svn: 254339	2015-11-30 22:22:06 +00:00
Matt Arsenault	ada6cf1b22	AMDGPU: Fix unused function llvm-svn: 254333	2015-11-30 21:32:10 +00:00
Matt Arsenault	41003af292	AMDGPU: Error if too many user SGPRs used llvm-svn: 254332	2015-11-30 21:16:07 +00:00
Matt Arsenault	26f8f3db39	AMDGPU: Rework how private buffer passed for HSA If we know we have stack objects, we reserve the registers that the private buffer resource and wave offset are passed and use them directly. If not, reserve the last 5 SGPRs just in case we need to spill. After register allocation, try to pick the next available registers instead of the last SGPRs, and then insert copies from the inputs to the reserved registers in the progloue. This also only selectively enables all of the input registers which are really required instead of always enabling them. llvm-svn: 254331	2015-11-30 21:16:03 +00:00
Matt Arsenault	ac234b604d	AMDGPU: Rename enums to be consistent with HSA code object terminology llvm-svn: 254330	2015-11-30 21:15:57 +00:00
Matt Arsenault	0e3d38937e	AMDGPU: Remove SIPrepareScratchRegs It does not work because of emergency stack slots. This pass was supposed to eliminate dummy registers for the spill instructions, but the register scavenger can introduce more during PrologEpilogInserter, so some would end up left behind if they were needed. The potential for spilling the scratch resource descriptor and offset register makes doing something like this overly complicated. Reserve registers to use for the resource descriptor and use them directly in eliminateFrameIndex. Also removes creating another scratch resource descriptor when directly selecting scratch MUBUF instructions. The choice of which registers are reserved is temporary. For now it attempts to pick the next available registers after the user and system SGPRs. llvm-svn: 254329	2015-11-30 21:15:53 +00:00
Matt Arsenault	ff6da2fe89	AMDGPU: Use assert zext for workgroup sizes llvm-svn: 254328	2015-11-30 21:15:45 +00:00
Quentin Colombet	cdad10f333	[ARM] For old thumb ISA like v4t, we cannot use PC directly in pop. Fix the epilogue emission to account for that. llvm-svn: 254325	2015-11-30 20:37:58 +00:00
David Majnemer	bf4119faf6	[X86] Add RIP to GR64_TCW64 The MachineVerifier wants to check that the register operands of an instruction belong to the instruction's register class. RIP-relative control flow instructions violated this by referencing RIP. While this was fixed for SysV, it was never fixed for Win64. llvm-svn: 254315	2015-11-30 19:04:19 +00:00
Kit Barton	f4ce2f3a9e	Enable shrink wrapping for PPC64 Re-enable shrink wrapping for PPC64 Little Endian. One minor modification to PPCFrameLowering::findScratchRegister was necessary to handle fall-thru blocks (blocks with no terminator) correctly. Tested with all LLVM test, clang tests, and the self-hosting build, with no problems found. PHabricator: http://reviews.llvm.org/D14778 llvm-svn: 254314	2015-11-30 18:59:41 +00:00
Dan Gohman	96029f7880	[WebAssembly] Fix a few minor compiler warnings. NFC. llvm-svn: 254311	2015-11-30 18:42:08 +00:00
Sanjay Patel	239be1fb0d	fix formatting; NFC llvm-svn: 254310	2015-11-30 17:52:02 +00:00
Colin LeMahieu	e6241798c9	[Hexagon] NFC Reordering headers. llvm-svn: 254307	2015-11-30 17:32:34 +00:00
Matt Arsenault	ea03cf2fa1	AMDGPU: Don't reserve SCRATCH_PTR input register This hasn't been doing anything since using relocations was added. llvm-svn: 254304	2015-11-30 15:46:47 +00:00
Aaron Ballman	33c95f08b0	Silencing a 32-bit to 64-bit implicit conversion warning; NFC. llvm-svn: 254302	2015-11-30 14:52:33 +00:00
Hrvoje Varga	c03957f049	[mips][microMIPS] Implement LBUX, LHX, LWX, MAQ_S[A].W.PHL, MAQ_S[A].W.PHR, MFHI, MFLO, MTHI and MTLO instructions Differential Revision: http://reviews.llvm.org/D14436 llvm-svn: 254297	2015-11-30 12:58:39 +00:00
Zoran Jovanovic	a887b36167	[mips][microMIPS] Fix issue with offset operand of BALC and BC instructions Value of offset operand for microMIPS BALC and BC instructions is currently shifted 2 bits, but it should be 1 bit. Differential Revision: http://reviews.llvm.org/D14770 llvm-svn: 254296	2015-11-30 12:56:18 +00:00
Zlatko Buljan	56f3b0e410	[mips][microMIPS] Implement PRECR.QB.PH, PRECR_SRA[_R].PH.W, PRECRQ.PH.W, PRECRQ.QB.PH, PRECRQU_S.QB.PH and PRECRQ_RS.PH.W instructions Differential Revision: http://reviews.llvm.org/D14605 llvm-svn: 254291	2015-11-30 08:37:38 +00:00
Craig Topper	27e2912fa8	Revert r254279 "[X86] Use ArrayRef. NFC". It seems to have upset an MSVC build bot. llvm-svn: 254280	2015-11-30 02:28:19 +00:00
Craig Topper	b84f39865f	[X86] Use ArrayRef. NFC llvm-svn: 254279	2015-11-30 02:08:05 +00:00
Craig Topper	aad5f11e5f	[AVX512] The vpermi2 instructions require an integer vector for the index vector. This is reflected correctly in the intrinsics, but was not refelected in the isel patterns. For the floating point types, this requires adding a bitcast to the index vector when its passed through to the output. llvm-svn: 254277	2015-11-30 00:13:24 +00:00
Craig Topper	fbde7aa13a	[X86] Remove duplicate entries from intrinsics tables and add asserts to verify there are no others. llvm-svn: 254274	2015-11-29 23:18:32 +00:00
Dan Gohman	9551a44d0c	[WebAssembly] Delete an obsolete TODO comment. llvm-svn: 254272	2015-11-29 23:09:41 +00:00
Dan Gohman	174b2d83ee	[WebAssembly] Set several MCInstrDesc flags. llvm-svn: 254271	2015-11-29 22:59:19 +00:00
Craig Topper	ecae476e4c	[X86] int_x86_avx2_permps and X86ISD::VPERMV should take an integer vector for its shuffle indices. llvm-svn: 254269	2015-11-29 22:53:22 +00:00
Dan Gohman	5237b3991d	[WebAssembly] Delete unused functions. NFC. llvm-svn: 254268	2015-11-29 22:48:57 +00:00
Dan Gohman	7a6b9825ce	[WebAssembly] Minor clang-format and selected clang-tidy cleanups. NFC. llvm-svn: 254267	2015-11-29 22:32:02 +00:00
Simon Pilgrim	88aa627c0b	[X86][SSE] Added support for lowering to ADDSUBPS/ADDSUBPD with commuted inputs We could already recognise shuffle(FSUB, FADD) -> ADDSUB, this allow us to recognise shuffle(FADD, FSUB) -> ADDSUB by commuting the shuffle mask prior to matching. llvm-svn: 254259	2015-11-29 16:41:04 +00:00
Igor Breger	e293e83f5d	AVX512:Implemented encoding for the vmovq.s instruction. Differential Revision: http://reviews.llvm.org/D14810 llvm-svn: 254248	2015-11-29 07:41:26 +00:00
Renato Golin	5dbc8a5283	Revert "[ARM] Generate ABI_optimization_goals build attribute, as described in the ARM ARM." This reverts commit r254201 and r254202, as it broke test-suite, self-hosting and sanitizer tests on ARM buildbots. llvm-svn: 254234	2015-11-28 17:23:46 +00:00
Jonas Paulsson	f12b925bb1	[Stack realignment] Handling of aligned allocas. This patch implements dynamic realignment of stack objects for targets with a non-realigned stack pointer. Behaviour in FunctionLoweringInfo is changed so that for a target that has StackRealignable set to false, over-aligned static allocas are considered to be variable-sized objects and are handled with DYNAMIC_STACKALLOC nodes. It would be good to group aligned allocas into a single big alloca as an optimization, but this is yet todo. SystemZ benefits from this, due to its stack frame layout. New tests SystemZ/alloca-03.ll for aligned allocas, and SystemZ/alloca-04.ll for "no-realign-stack" attribute on functions. Review and help from Ulrich Weigand and Hal Finkel. llvm-svn: 254227	2015-11-28 11:02:32 +00:00
Artyom Skrobov	f01a59f9fb	Follow-up fix for r254201 llvm-svn: 254202	2015-11-27 16:20:34 +00:00
Artyom Skrobov	b955b90509	[ARM] Generate ABI_optimization_goals build attribute, as described in the ARM ARM. Summary: Since this build attribute corresponds to a whole module, and different functions in a module may differ in the optimizations enabled for them, this attribute is emitted after all functions, and only in the case that the optimization goals for all functions match. Reviewers: logan, hans Subscribers: aemerson, rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D14934 llvm-svn: 254201	2015-11-27 15:30:51 +00:00
Oliver Stannard	b25914e03f	[AArch64] Add ARMv8.2-A FP16 scalar instructions ARMv8.2-A adds 16-bit floating point versions of all existing VFP floating-point instructions. This is an optional extension, so all of these instructions require the FeatureFullFP16 subtarget feature. Most of these instructions are the same as the 32- and 64-bit versions, but with the type field (bits 23-22) set to 0b11. Previously the top bit of the size field was always 0, so the instruction classes only provided a 1-bit size field, which I have widened to 2 bits. Differential Revision: http://reviews.llvm.org/D15014 llvm-svn: 254198	2015-11-27 13:04:48 +00:00
Craig Topper	e38c57a4b8	[X86] Pair a NoVLX with HasAVX512 to match the others and remove a unique predicate check in the isel tables. NFC llvm-svn: 254191	2015-11-27 05:44:02 +00:00
Craig Topper	a47576f297	[X86] Now that X86VPermt2 is used in all the avx512_perm_t_sizes just hardcode it into the patterns instead of passing as an argument. NFC llvm-svn: 254177	2015-11-26 20:21:29 +00:00
Craig Topper	05858f52fe	[X86] Merge X86VPermt2Fp and X86VPermt2Int back together by weakening them just enough. The SDTCisSameSizeAs introduced in r254138 helps here. llvm-svn: 254176	2015-11-26 20:02:01 +00:00
Craig Topper	0009656335	[X86] Split ISD node for Vfpclass and Vfpclasss so that we can write strong type constraints for each that don't cause ambiguous isel. llvm-svn: 254172	2015-11-26 19:41:34 +00:00
Craig Topper	ff2f14731a	[X86] Revert part of r254167 to recover bots. llvm-svn: 254169	2015-11-26 19:13:05 +00:00
Krzysztof Parzyszek	08ff8883fd	[Hexagon] Lowering of V60/HVX vector types llvm-svn: 254168	2015-11-26 18:38:27 +00:00
Craig Topper	9d1deb4b72	[X86] Strengthen more type constraints to reduce isel table size. llvm-svn: 254167	2015-11-26 18:31:19 +00:00
Krzysztof Parzyszek	4eb6d4d1f2	[Hexagon] Hexagon V60 HVX intrinsic defintions Author: Ron Lieberman <ronl@codeaurora.org> llvm-svn: 254165	2015-11-26 16:54:33 +00:00
Daniel Sanders	daa4b6fbd9	[mips][ias] Range check uimm5 operands and fix several bugs this revealed. Summary: The bugs were: * append, prepend, and balign were not tested * balign takes a uimm2 not a uimm5. * drotr32 was correctly implemented with a uimm5 but the tests expected '52' to be valid. * li/la were implemented with a uimm5 instead of simm32. simm32 isn't completely correct either but I'll fix that when I get to simm32. A notable omission are some of the shift instructions. Several of these have been implemented using a single uimm6 instruction (rather than two uimm5 instructions and a CodeGen-only uimm6 pseudo). These will be updated in the uimm6 patch. Reviewers: vkalintiris Subscribers: llvm-commits, dsanders Differential Revision: http://reviews.llvm.org/D14712 llvm-svn: 254164	2015-11-26 16:35:41 +00:00
Oliver Stannard	64c167db7a	[AArch64] Add ARMv8.2-A new AT instruction variants ARMv8.2-A adds new variants of the "at" (address translate) system instruction, which take the PSTATE.PAN bit (added in ARMv8.1-A). These are a required part of ARMv8.2-A, so no additional subtarget features are required. Differential Revision: http://reviews.llvm.org/D15018 llvm-svn: 254159	2015-11-26 15:34:44 +00:00
Martell Malone	d12292480a	ARM: address WOA unsigned division overflow crash Building on r253865 the crash is not limited to signed overflows. Disable custom handling of unsigned 32-bit and 64-bit integer divide. Add test cases for both 32-bit and 64-bit unsigned integer overflow. llvm-svn: 254158	2015-11-26 15:34:03 +00:00
Oliver Stannard	911ea20f07	[AArch64] Add ARMv8.2-A UAO PSTATE bit ARMv8.2-A adds a new PSTATE bit, PSTATE.UAO, which allows the LDTR/STTR instructions to behave the same as LDR/STR with respect to execute-only pages at higher privilege levels. New variants of the MSR/MRS instructions are added to allow reading and writing this bit. It is a required part of ARMv8.2-A, so no additional subtarget features are required. Differential Revision: http://reviews.llvm.org/D15020 llvm-svn: 254157	2015-11-26 15:32:30 +00:00
Oliver Stannard	1a81cc9f43	[AArch64] Add ARMv8.2-A persistent memory instruction ARMv8.2-A adds the "dc cvap" instruction, which is a system instruction that cleans caches to the point of persistence (for systems that have persistent memory). It is a required part of ARMv8.2-A, so no additional subtarget features are required. Differential Revision: http://reviews.llvm.org/D15016 llvm-svn: 254156	2015-11-26 15:28:47 +00:00
Oliver Stannard	48b43741d0	[AArch64] Add ARMv8.2-A ID_A64MMFR2_EL1 register ARMv8.2-A adds a new ID register, ID_A64MMFR2_EL1, which behaves in the same way as ID_A64MMFR0_EL1 and ID_A64MMFR1_EL1. It is a required part of ARMv8.2-A, so no additional subtarget features are required. Differential Revision: http://reviews.llvm.org/D15017 llvm-svn: 254155	2015-11-26 15:26:10 +00:00
Oliver Stannard	7cc0c4e675	[AArch64] Add subtarget features for ARMv8.2-A This adds subtarget features for ARMv8.2-A, which builds on (and requires the features from) ARMv8.1-A. Most assembler-visible features of ARMv8.2-A are system instructions, and are all required parts of the architecture, so just depend on the HasV8_2aOps subtarget feature. There is also one large, optional feature, which adds 16-bit floating point versions of all existing floating-point instructions (VFP and SIMD), this is represented by the FeatureFullFP16 subtarget feature. Differential Revision: http://reviews.llvm.org/D15013 llvm-svn: 254154	2015-11-26 15:23:32 +00:00
Craig Topper	a3ac738725	[X86] Strengthen more type constraints to reduce isel table size. llvm-svn: 254142	2015-11-26 07:58:20 +00:00
Vyacheslav Klochkov	ed865dfcc5	X86-FMA3: Improved/enabled the memory folding optimization for scalar loads generated for _mm_losd_s{s,d}() intrinsics and used in scalar FMAs generated for FMA intrinsics _mm_f{madd,msub,nmadd,nmsub}_s{s,d}(). Reviewer: David Kreitzer Differential Revision: http://reviews.llvm.org/D14762 llvm-svn: 254140	2015-11-26 07:45:30 +00:00
Craig Topper	4c175cdc8e	[X86] Strengthen the type constraints on X86psadbw and X86dbpsadbw to reduce some of the type checks in the isel matching tables. llvm-svn: 254139	2015-11-26 07:02:21 +00:00
Krzysztof Parzyszek	195dc8d0db	[Hexagon] HVX vector register classes and more isel patterns llvm-svn: 254132	2015-11-26 04:33:11 +00:00
Tom Stellard	48f29f21ee	AMDGPU: Add llvm.amdgcn.dispatch.ptr intrinsic Summary: This returns a pointer to the dispatch packet, which can be used to load information about the kernel dispach. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D14898 llvm-svn: 254116	2015-11-26 00:43:29 +00:00
Dan Gohman	a774d719a0	[WebAssembly] Fix inline asm support for i64 operands. llvm-svn: 254106	2015-11-25 22:28:50 +00:00
Dan Gohman	d9b4218831	[WebAssembly] Fold setne and seteq comparisons into selects. llvm-svn: 254104	2015-11-25 22:13:48 +00:00
Krzysztof Parzyszek	70a134d29f	[Hexagon] Treat transfers of FP immediates are pseudo instructions This is a temporary fix to address ICE on 2005-10-21-longlonggtu.ll. The proper fix will be to use A2_tfrsi, but it will need more work to teach all users of A2_tfrsi to also expect a floating-point operand. llvm-svn: 254099	2015-11-25 21:40:03 +00:00
Dan Gohman	5941bde03c	[WebAssembly] Add some comments. NFC. llvm-svn: 254096	2015-11-25 21:32:06 +00:00
Marek Olsak	7ed6b2f414	AMDGPU/SI: select S_ABS_I32 when possible (v2) v2: added more tests, moved the SALU->VALU conversion to a separate function It looks like it's not possible to get subregisters in the S_ABS lowering code, and I don't feel like guessing without testing what the correct code would look like. llvm-svn: 254095	2015-11-25 21:22:45 +00:00
Dan Gohman	80e34e0a18	[WebAssembly] Fix WebAssembly register numbering for registers added late. If virtual registers are created late, mappings to WebAssembly registers need to be added explicitly. This patch adds a function to do so and teaches WebAssemblyPeephole to use it. This fixes an out-of-bounds access on the WARegs vector. llvm-svn: 254094	2015-11-25 21:13:02 +00:00
Matt Arsenault	49affb8462	AMDGPU: Check feature attributes in SIMachineFunctionInfo llvm-svn: 254091	2015-11-25 20:55:12 +00:00
Krzysztof Parzyszek	207c13f254	Add hexagonv55 and hexagonv60 as recognized CPUs, make v60 the default llvm-svn: 254089	2015-11-25 20:30:59 +00:00
Matt Arsenault	61001bbc03	AMDGPU: Make v2i64/v2f64 legal types. They can be loaded and stored, so count them as legal. This is mostly to fix a number of common cases for load/store merging. llvm-svn: 254086	2015-11-25 19:58:34 +00:00
Artyom Skrobov	314ee04268	Expose isXxxConstant() functions from SelectionDAGNodes.h (NFC) Summary: Many target lowerings copy-paste the code to test SDValues for known constants. This code can instead be shared in SelectionDAG.cpp, and reused in the targets. Reviewers: MatzeB, andreadb, tstellarAMD Subscribers: arsenm, jyknight, llvm-commits Differential Revision: http://reviews.llvm.org/D14945 llvm-svn: 254085	2015-11-25 19:41:11 +00:00
Dan Gohman	fb3e0594e4	[WebAssembly] Use a physical register to describe ARGUMENT liveness. Instead of trying to move ARGUMENT instructions back up to the top after they've been scheduled or sunk down, use a fake physical register to create a liveness constraint that prevents ARGUMENT instructions from moving down in the first place. This is still not entirely ideal, however it is more robust than letting them move and moving them back. llvm-svn: 254084	2015-11-25 19:36:19 +00:00
Dan Gohman	9c54d3b4c6	[WebAssembly] Clean up several FIXME comments. llvm-svn: 254079	2015-11-25 18:13:18 +00:00

1 2 3 4 5 ...

35251 Commits