llvm-project

Commit Graph

Author	SHA1	Message	Date
Sriraman Tallam	7da9b445ea	Differential Revision: http://reviews.llvm.org/D19733 llvm-svn: 268106	2016-04-29 21:19:16 +00:00
Matt Arsenault	dc4ebad6d4	AMDGPU: Add kernarg.segment.ptr intrinsic llvm-svn: 268105	2016-04-29 21:16:52 +00:00
Chad Rosier	cd62bf5821	[InstCombine] Determine the result of a select based on a dominating condition. Differential Revision: http://reviews.llvm.org/D19550 llvm-svn: 268104	2016-04-29 21:12:31 +00:00
Sanjay Patel	9190b4add8	[InstCombine] clean up; NFC llvm-svn: 268099	2016-04-29 20:54:56 +00:00
Matt Arsenault	cf2744f1c8	AMDGPU/SI: Move post regalloc run of SIShrinkInstructions Move to addPreEmitPass. This is so it runs after post-RA scheduling so we can merge s_nops emitted by the scheduler and hazard recognizer. llvm-svn: 268095	2016-04-29 20:23:42 +00:00
Matt Arsenault	ab2232cf73	DAGCombiner: Reduce truncated shl width llvm-svn: 268094	2016-04-29 19:53:16 +00:00
Easwaran Raman	dc7071226b	Move coverage related code into a separate library. Differential Revision: http://reviews.llvm.org/D19333 llvm-svn: 268089	2016-04-29 18:53:05 +00:00
Kostya Serebryany	2fe9304d62	[libFuzzer] enable detect_leaks=1, add proper docs llvm-svn: 268088	2016-04-29 18:49:55 +00:00
George Burgess IV	1b1fef30d0	[MemorySSA] Fix bugs in walker; refactor unittests a bit. This patch fixes two somewhat related bugs in MemorySSA's caching walker. These bugs were found because D19695 brought up the problem that we'd have defs cached to themselves, which is incorrect. The bugs this fixes are: - We would sometimes skip the nearest clobber of a MemoryAccess, because we would query our cache for a given potential clobber before checking if the potential clobber is the clobber we're looking for. The cache entry for the potential clobber would point to the nearest clobber of the potential clobber, so if that was a cache hit, we'd ignore the potential clobber entirely. - There are times (sometimes in DFS, sometimes in the getClobbering... functions) where we would insert cache entries that say a def clobbers itself. There's a bit of common code between the fixes for the bugs, so they aren't split out into multiple commits. This patch also adds a few unit tests, and refactors existing tests a bit to reduce the duplication of setup code. llvm-svn: 268087	2016-04-29 18:42:55 +00:00
David Majnemer	d2a074b1f4	[ValueTracking] matchSelectPattern needs to be more careful around FP matchSelectPattern attempts to see through casts which mask min/max patterns from being more obvious. Under certain circumstances, it would misidentify a sequence of instructions as a min/max because it assumed that folding casts would preserve the result. This is not the case for floating point <-> integer casts. This fixes PR27575. llvm-svn: 268086	2016-04-29 18:40:34 +00:00
Zachary Turner	9213ba5304	Fix crash in PDB when loading corrupt file. There are probably hundreds of crashers we can find by fuzzing more. For now we do the simplest possible validation of the block size. Later, more complicated validations can verify that other fields of the super block such as directory size, number of blocks, agree with the size of the file etc. llvm-svn: 268084	2016-04-29 18:09:19 +00:00
Simon Pilgrim	464f1f3bea	Use SelectionDAG::getTargetConstant* helper functions. NFC. Instead of SelectionDAG::getConstant directly to make it more obvious that we're creating target constants. llvm-svn: 268074	2016-04-29 17:42:45 +00:00
Zachary Turner	2f09b5091c	Put PDB parsing code into a pdb namespace. llvm-svn: 268072	2016-04-29 17:28:47 +00:00
Zachary Turner	6ba65deeb9	Refactor the PDB Stream reading interface. The motivation for this change is that PDB has the notion of streams and substreams. Substreams often consist of variable length structures that are convenient to be able to treat as guaranteed, contiguous byte arrays, whereas the streams they are contained in are not necessarily so, as a single stream could be spread across many discontiguous blocks. So, when processing data from a substream, we want to be able to assume that we have a contiguous byte array so that we can cast pointers to variable length arrays and such. This leads to the question of how to be able to read the same data structure from either a stream or a substream using the same interface, which is where this patch comes in. We separate out the stream's read state from the underlying representation, and introduce a `StreamReader` class. Then we change the name of `PDBStream` to `MappedBlockStream`, and introduce a second kind of stream called a `ByteStream` which is simply a sequence of contiguous bytes. Finally, we update all of the std::vectors in `PDBDbiStream` to use `ByteStream` instead as a proof of concept. llvm-svn: 268071	2016-04-29 17:22:58 +00:00
Dehao Chen	21aefaec97	Do not read callee name when matching IR to profile as it is not used. Summary: Callee name is not used to identify a callsite now, so do not read it during annotation. Reviewers: davidxl, dnovillo Subscribers: dnovillo, danielcdh, llvm-commits Differential Revision: http://reviews.llvm.org/D19704 llvm-svn: 268069	2016-04-29 17:19:10 +00:00
Geoff Berry	b92cd5293e	[BasicAA] Treat llvm.assume as not accessing memory in getModRefBehavior(Function) Reviewers: dberlin, chandlerc, hfinkel, reames, sanjoy Subscribers: mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D19730 llvm-svn: 268068	2016-04-29 17:18:28 +00:00
Haicheng Wu	e749ce53d4	[MBP] Split placement and alignment into two functions. NFC. Cut and Paste. llvm-svn: 268067	2016-04-29 17:06:44 +00:00
Artem Tamazov	38e496b175	Fixed/Recommitted r267733 "[AMDGPU][llvm-mc] Add support of TTMP quads. Rework M0 exclusion for SMRD." Previously reverted by r267752. r267733 review: Differential Revision: http://reviews.llvm.org/D19342 llvm-svn: 268066	2016-04-29 17:04:50 +00:00
Guozhi Wei	fa3e04298b	[PPC] Enable shuffling of VSX vectors This patch fixes PR27078 by enabling shuffling of vectors if VSX is available. llvm-svn: 268064	2016-04-29 17:00:54 +00:00
Filipe Cabecinhas	7894938a45	Add operator- to Path's reverse_iterator. Needed for D19666 Reviewers: rafael, craig.topper, bogner Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D19724 llvm-svn: 268062	2016-04-29 16:48:07 +00:00
Sanjay Patel	d5b0e54b49	[InstCombine] add helper function for ICmp with constant canonicalization; NFCI As suggested in http://reviews.llvm.org/D17859 , we should enhance this to support vectors. llvm-svn: 268059	2016-04-29 16:22:25 +00:00
Daniel Sanders	7225cd52e7	[mips][ias] Move createCpRestoreMemOp to MipsTargetStreamer. NFC. Summary: This removes the temporary call to isIntegratedAssemblerRequired() which was added recently. It's effect is now acheived directly in the MipsTargetStreamer hierarchy. Reviewers: sdardis Subscribers: dsanders, sdardis, llvm-commits Differential Revision: http://reviews.llvm.org/D19715 llvm-svn: 268058	2016-04-29 16:16:49 +00:00
Krzysztof Parzyszek	173fc57b54	Fix NDEBUG build: variables used only in debug code causing compile error llvm-svn: 268057	2016-04-29 16:14:00 +00:00
Amjad Aboud	293ee8bba1	Recommitted r264280 "Supporting all entities declared in lexical scope in LLVM debug info." After fixing PR26942 in r267004. llvm-svn: 268054	2016-04-29 16:07:55 +00:00
Simon Dardis	d8bceb9d3a	[mips][FastISel] A store is not a load. Correct trivial error. One of the failing tests from PR/27458. Reviewers: dsanders, vkalintiris, mcrosier Differential Review: http://reviews.llvm.org/D19726 llvm-svn: 268053	2016-04-29 16:07:47 +00:00
Simon Dardis	7383bfd8bd	[PATCH] [mips] Fix forbidden slot hazard handling MipsHazardSchedule has to determine what the next physical machine instruction is to decide whether to insert a nop. In case where a branch with a forbidden slot appears at the end of a basic block, first real instruction of the next physical basic block was determined using getFirstNonDebugInstr(). Unfortunately this only considers DBG_VALUEs and not other transient opcodes such as EHLABEL. As EHLABEL passes the SafeInForbiddenSlot predicate and the instruction after the EHLABEL can be a CTI, we observed test failures in the LNT testsuite. Reviewers: dsanders Differential Review: http://reviews.llvm.org/D19051 llvm-svn: 268052	2016-04-29 16:04:18 +00:00
Krzysztof Parzyszek	f5cbac93eb	[Hexagon] Optimize addressing modes for load/store Patch by Jyotsna Verma. llvm-svn: 268051	2016-04-29 15:49:13 +00:00
Filipe Cabecinhas	0da9937517	Unify XDEBUG and EXPENSIVE_CHECKS (into the latter), and add an option to the cmake build to enable them. Summary: Historically, we had a switch in the Makefiles for turning on "expensive checks". This has never been ported to the cmake build, but the (dead-ish) code is still around. This will also make it easier to turn it on in buildbots. Reviewers: chandlerc Subscribers: jyknight, mzolotukhin, RKSimon, gberry, llvm-commits Differential Revision: http://reviews.llvm.org/D19723 llvm-svn: 268050	2016-04-29 15:22:48 +00:00
Tom Stellard	92b24f324b	AMDGPU/SI: Add offset field to ds_permute/ds_bpermute instructions Summary: These instructions can add an immediate offset to the address, like other ds instructions. Reviewers: arsenm Subscribers: arsenm, scchan Differential Revision: http://reviews.llvm.org/D19233 llvm-svn: 268043	2016-04-29 14:34:26 +00:00
Daniel Sanders	fba875f902	[mips][ias] Split expandMemInst between MipsAsmParser and MipsTargetStreamer. Almost NFC. Summary: The portion in MipsAsmParser is responsible for figuring out which expansion to use, while the portion in MipsTargetStreamer is responsible for emitting it. This allows us to remove the call to isIntegratedAssemblerRequired() which is currently ensuring the effect of .cprestore only occurs when writing objects. The small functional change is that the memory offsets are now correctly printed as signed values. Reviewers: sdardis Subscribers: dsanders, sdardis, llvm-commits Differential Revision: http://reviews.llvm.org/D19714 llvm-svn: 268042	2016-04-29 13:43:45 +00:00
Daniel Sanders	a736b37a25	[mips][ias] Moved most instruction emission helpers to MipsTargetStreamer. NFC. Summary: * Moved all the emit() helpers to MipsTargetStreamer. Moved createNop() to MipsTargetStreamer as emitNop() and emitEmptyDelaySlot(). This instruction has been split to distinguish between the 'nop' instruction and the nop used in delay slots which is sometimes a different nop to the 'nop' instruction (e.g. for short delay slots on microMIPS). * Moved createAddu() to MipsTargetStreamer as emitAddu(). * Moved createAppropriateDSLL() to MipsTargetStreamer as emitDSLL(). Reviewers: sdardis Subscribers: dsanders, sdardis, llvm-commits Differential Revision: http://reviews.llvm.org/D19712 llvm-svn: 268041	2016-04-29 13:33:12 +00:00
Daniel Sanders	9db710a171	[mips][ias] Make section sizes a multiple of the alignment. Reviewers: sdardis Subscribers: dsanders, llvm-commits, sdardis Differential Revision: http://reviews.llvm.org/D19008 llvm-svn: 268036	2016-04-29 12:44:07 +00:00
Nikolay Haustov	4f672a34ed	AMDGPU/SI: Assembler: Unify parsing/printing of operands. Summary: The goal is for each operand type to have its own parse function and at the same time share common code for tracking state as different instruction types share operand types (e.g. glc/glc_flat, etc). Introduce parseAMDGPUOperand which can parse any optional operand. DPP and Clamp/OMod have custom handling for now. Sam also suggested to have class hierarchy for operand types instead of table. This can be done in separate change. Remove parseVOP3OptionalOps, parseDS*OptionalOps, parseFlatOptionalOps, parseMubufOptionalOps, parseDPPOptionalOps. Reduce number of definitions of AsmOperand's and MatchClasses' by using common base class. Rename AsmMatcher/InstPrinter methods accordingly. Print immediate type when printing parsed immediate operand. Use 'off' if offset/index register is unused instead of skipping it to make it more readable (also agreed with SP3). Update tests. Reviewers: tstellarAMD, SamWot, artem.tamazov Subscribers: qcolombet, arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D19584 llvm-svn: 268015	2016-04-29 09:02:30 +00:00
Zlatko Buljan	531809d340	[mips][microMIPS] Fix offsets for LLE, LWE, SBE, SCE and SHE instructions Differential Revision: http://reviews.llvm.org/D18645 llvm-svn: 268012	2016-04-29 08:36:54 +00:00
David Majnemer	fadc6db036	[GlobalOpt] Propagate operand bundles We neglected to transfer operand bundles for some transforms. These were found via inspection, I'll try to come up with some test cases. llvm-svn: 268011	2016-04-29 08:07:22 +00:00
David Majnemer	231a68cc22	[InstCombine] Propagate operand bundles We neglected to transfer operand bundles for some transforms. These were found via inspection, I'll try to come up with some test cases. llvm-svn: 268010	2016-04-29 08:07:20 +00:00
David Majnemer	1a5799fe3e	[DeadArgumentElimination] Propagate operand bundles to promoted call sites We neglected to transfer operand bundles when performing argument promotion. llvm-svn: 268008	2016-04-29 07:22:36 +00:00
Adam Nemet	88ec491830	[LoopDist] Also emit optimization remark on success (-Rpass=) The option -Rpass=loop-distribute now reports the loops that were distributed. llvm-svn: 268006	2016-04-29 07:10:46 +00:00
Adam Nemet	4338d6769e	[LoopDist] Pass 'Function' to main class. NFC Next patch will add another use for 'Function' inside the class. llvm-svn: 268005	2016-04-29 07:10:39 +00:00
David Majnemer	13d5526392	[SLPVectorizer] Add operand bundles to vectorized functions SLPVectorizing a call site should result in further propagation of its bundles. llvm-svn: 268004	2016-04-29 07:09:51 +00:00
David Majnemer	50ddc0e1b6	[LoopVectorize] Add operand bundles to vectorized functions Also, do not crash when calculating a cost model for loop-invariant token values. llvm-svn: 268003	2016-04-29 07:09:48 +00:00
Matt Arsenault	7d1b6c81af	AMDGPU: Stop reporting an addressing mode for unknown addrspace This was being treated the same as private, which has an immediate offset. For unknown, it probably means it's for a computation not actually being used for accessing memory, so it should not have a nontrivial addressing mode. llvm-svn: 268002	2016-04-29 06:25:10 +00:00
Matt Arsenault	790eb1c490	DivergenceAnalysis: Fix crash with unreachable blocks Unreachable blocks may not be in the dominator tree, so don't crash on them. llvm-svn: 268001	2016-04-29 06:17:47 +00:00
David Majnemer	cd24bb1d3a	[ArgumentPromotion] Propagate operand bundles to promoted call sites We neglected to transfer operand bundles when performing argument promotion. This fixes PR27568. llvm-svn: 267986	2016-04-29 04:56:12 +00:00
Craig Topper	b805723294	[X86] Remove unnecessary header file containing a small class. It was only included in one place. Just define the class directly in the cpp file. NFC llvm-svn: 267985	2016-04-29 04:22:28 +00:00
Craig Topper	e7c1cd18d3	[X86] Include X86MCTargetDesc.h directly in X86Disassembler.cpp instead of duplicating parts of it. NFC llvm-svn: 267984	2016-04-29 04:22:26 +00:00
Michael Zolotukhin	1816d03b7d	[PR25281] Remove AAResultsWrapper from preserved analyses of loop vectorizer. We don't preserve AAResults, because, for one, we don't preserve SCEV-AA. That fixes PR25281. llvm-svn: 267980	2016-04-29 03:31:25 +00:00
Matthias Braun	f3619b8212	RegisterPressure: Fix default lanemask for missing regunit intervals In case of missing live intervals for a physical registers getLanesWithProperty() would report 0 which was not a safe default in all situations. Add a parameter to pass in a safe default. No testcase because in-tree targets do not skip computing register unit live intervals. Also cleanup the getXXX() functions to not perform the RequireLiveIntervals checks anymore so we do not even need to return safe defaults. llvm-svn: 267977	2016-04-29 02:44:54 +00:00
Matthias Braun	5e4ac856d6	RegisterPressure: Cannot produce dead (subregister) defs anymore With the DetectDeadLanes pass in place we cannot run into situations anymore where defs suddenly become dead. Also add a missing check so we do not try to add an undef flag to a physreg (found by visual inspection, no failing test). llvm-svn: 267976	2016-04-29 02:44:48 +00:00
Ivan Krasin	8dafa2da8e	Fix build by casting to the proper int type. Reviewers: eugenis Differential Revision: http://reviews.llvm.org/D19706 llvm-svn: 267974	2016-04-29 02:09:57 +00:00
Hal Finkel	1b66f7e3c8	[LoopVectorize] Keep hints from original loop on the vector loop We need to keep loop hints from the original loop on the new vector loop. Failure to do this meant that, for example: void foo(int *b) { #pragma clang loop unroll(disable) for (int i = 0; i < 16; ++i) b[i] = 1; } this loop would be unrolled. Why? Because we'd vectorize it, thus dropping the hints that unrolling should be disabled, and then we'd unroll it. llvm-svn: 267970	2016-04-29 01:27:40 +00:00
Evgeniy Stepanov	35f3e5e4e7	[msan] Handle vector compare x86 intrinsics. This handles SSE and SSE2 cmp_* and comiXX_* intrinsics. llvm-svn: 267966	2016-04-29 01:19:52 +00:00
David Majnemer	ca9ac4721d	[llvm-pdbdump] Try to appease the ASan bot We didn't check that the file was large enough to hold a super block. llvm-svn: 267965	2016-04-29 01:00:17 +00:00
Craig Topper	184310d6a9	[X86] Use nested switches to vary the operand to helper functions that were previously called in multiple cases. This seems to help the inliner reduce code. NFC llvm-svn: 267964	2016-04-29 00:51:30 +00:00
David Majnemer	1573b242ae	[llvm-pdbdump] Restore error messages, handle bad block sizes We lost the ability to report errors, bring it back. Also, correctly validate the block size. llvm-svn: 267955	2016-04-28 23:47:27 +00:00
Matthias Braun	f84547c6e0	LiveIntervalAnalysis: Remove LiveVariables requirement This requirement was a huge hack to keep LiveVariables alive because it was optionally used by TwoAddressInstructionPass and PHIElimination. However we have AnalysisUsage::addUsedIfAvailable() which we can use in those passes. This re-applies r260806 with LiveVariables manually added to PowerPC to hopefully not break the stage 2 bots this time. llvm-svn: 267954	2016-04-28 23:42:51 +00:00
David Majnemer	5baa2bc2e1	[llvm-pdbdump] Correctly read data larger than a block A bug was introduced when the code was refactored which resulted in a bad memory access. This fixes PR27565. llvm-svn: 267953	2016-04-28 23:24:23 +00:00
Adam Nemet	0ba164bbcb	[LoopDist] Emit optimization remarks (-Rpass) I closely followed the precedents set by the vectorizer: With -Rpass-missed, the loop is reported with further details pointing to -Rpass--analysis. * -Rpass-analysis reports the details why distribution has failed. * Regardless of -Rpass*, when distribution fails for a loop where distribution was forced with the pragma, a warning is produced according to -Wpass-failed. In this case the analysis info is also printed even without -Rpass-analysis. llvm-svn: 267952	2016-04-28 23:08:32 +00:00
Adam Nemet	adeccf7658	[LoopDist] Improve debug messages The next patch will start using these for -Rpass-analysis so they won't be internal-only anymore. Move the 'Skipping; ' prefix that some of the message are using into the 'fail' function. We don't want to include this prefix in the -Rpass-analysis report. llvm-svn: 267951	2016-04-28 23:08:30 +00:00
Adam Nemet	7f38e1199a	[LoopDist] Add helper to print debug message when distribution fails. NFC This will form the basis to emit optimization remarks (-Rpass*). llvm-svn: 267950	2016-04-28 23:08:27 +00:00
Hal Finkel	50316d95a9	[Inliner] Preserve llvm.mem.parallel_loop_access metadata When inlining a call site with llvm.mem.parallel_loop_access metadata, this metadata needs to be propagated to all cloned memory-accessing instructions. Otherwise, inlining parts of the loop body will invalidate the annotation. With this functionality, we now vectorize the following as expected: void Body(int res, int c, int d, int p, int i) { res[i] = (p[i] == 0) ? res[i] : res[i] + d[i]; } void Test(int res, int c, int d, int p, int n) { int i; #pragma clang loop vectorize(assume_safety) for (i = 0; i < 1600; i++) { Body(res, c, d, p, i); } } llvm-svn: 267949	2016-04-28 23:00:04 +00:00
Dehao Chen	1b54fce319	Read discriminators correctly from object file. Summary: This is the follow-up patch for http://reviews.llvm.org/D19436 * Update the discriminator reading algorithm to match the assignment algorithm. * Add test to cover the new algorithm. Reviewers: dnovillo, echristo, dblaikie Subscribers: danielcdh, dblaikie, echristo, llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D19522 llvm-svn: 267945	2016-04-28 22:09:37 +00:00
Marcin Koscielnicki	3a592df3e4	[CodeGen] Remove extra ';' Squashes a -Wpedantic warning. llvm-svn: 267944	2016-04-28 21:49:46 +00:00
Marcin Koscielnicki	7b32957852	[PowerPC] Fix the EH_SjLj_Setup pseudo. This instruction is just a control flow marker - it should not actually exist in the object file. Unfortunately, nothing catches it before it gets to AsmPrinter. If integrated assembler is used, it's considered to be a normal 4-byte instruction, and emitted as an all-0 word, crashing the program. With external assembler, a comment is emitted. Fixed by setting Size to 0 and handling it in MCCodeEmitter - this means the comment will still be emitted if integrated assembler is not used. This broke an ASan test, which has been disabled for a long time as a result (see the discussion on D19657). We can reenable it once this lands. llvm-svn: 267943	2016-04-28 21:24:37 +00:00
Krzysztof Parzyszek	bf90d5a3b3	[RDF] Recognize tail calls in graph creation llvm-svn: 267939	2016-04-28 20:40:08 +00:00
Amaury Sechet	5575d079a5	Fix warning in PDB code. NFC llvm-svn: 267938	2016-04-28 20:39:39 +00:00
Matthias Braun	e9631f166e	LiveIntervalAnalysis: No need to deal with dead subregister defs anymore. The DetectDeadLaneMask already ensures that we have no dead subregister definitions making the special handling in LiveIntervalAnalysis unnecessary. This reverts most of r248335. llvm-svn: 267937	2016-04-28 20:35:26 +00:00
Krzysztof Parzyszek	c5a4e26410	[RDF] Improve handling of inline-asm - Keep implicit defs from inline-asm instructions. - Treat register references from inline-asm as fixed. llvm-svn: 267936	2016-04-28 20:33:33 +00:00
Zachary Turner	897067e3f1	Add parentheses to silence -Wparentheses warnings. llvm-svn: 267934	2016-04-28 20:26:30 +00:00
Krzysztof Parzyszek	55874cf02b	[RDF] Add option to keep dead phi nodes in DFG Dead phi nodes are needed for code motion (such as copy propagation), where a new use would be placed in a location that would be dominated by a dead phi. Such a transformation is not legal for copy propagation, and the existence of the phi would prevent it, but if the phi is not there, it may appear to be valid. llvm-svn: 267932	2016-04-28 20:17:06 +00:00
Zachary Turner	84c3a8ba3d	Read the rest of the DBI substreams, and parse source info. We now read out the rest of the substreams from the DBI streams. One of these substreams, the FileInfo substream, contains information about which source files contribute to each module (aka compiland). This patch additionally parses out the file information from that substream, and dumps it in llvm-pdbdump. Differential Revision: http://reviews.llvm.org/D19634 Reviewed by: ruiu llvm-svn: 267928	2016-04-28 20:05:18 +00:00
Kit Barton	7a1a9e01ad	This reverts commit r265505. Revert "[Power9] Implement add-pc, multiply-add, modulo, extend-sign-shift, random number, set bool, and dfp test significance". This patch has caused a functional regression in SPEC2k6 namd, and a performance regression in mesa-pipe. llvm-svn: 267927	2016-04-28 20:00:42 +00:00
Krzysztof Parzyszek	e5fcce2d2b	[Hexagon] Add instruction aliases for vector unsigned compare-equal Unsigned compare-equal instructions are mapped to signed compare-equal. llvm-svn: 267925	2016-04-28 19:49:18 +00:00
Matt Arsenault	1c4d0efe56	AMDGPU: Emit error if too much LDS is used llvm-svn: 267922	2016-04-28 19:37:35 +00:00
Yaron Keren	3189622ae5	Remove doInitialization() and doFinalization() member declarations without definitions. Visual C++ 2015 flags this in the IDE. llvm-svn: 267919	2016-04-28 19:21:30 +00:00
Krzysztof Parzyszek	7ea9a529aa	Reset the TopRPTracker's position in ScheduleDAGMILive::initQueues ScheduleDAGMI::initQueues changes the RegionBegin to the first non-debug instruction. Since it does not track register pressure, it does not affect any RP trackers. ScheduleDAGMILive inherits initQueues from ScheduleDAGMI, and it does reset the TopTPTracker in its schedule method. Any derived, target-specific scheduler will need to do it as well, but the TopRPTracker is only exposed as a "const" object to derived classes. Without the ability to modify the tracker directly, this leaves a derived scheduler with a potential of having the TopRPTracker out-of-sync with the CurrentTop. The symptom of the problem: void llvm::ScheduleDAGMILive::scheduleMI(llvm::SUnit *, bool): Assertion `TopRPTracker.getPos() == CurrentTop && "out of sync"' failed. Differential Revision: http://reviews.llvm.org/D19438 llvm-svn: 267918	2016-04-28 19:17:44 +00:00
Matt Arsenault	c5fce69031	AMDGPU: Fix mishandling array allocations when promoting alloca The canonical form for allocas is a single allocation of the array type. In case we see a non-canonical array alloca, make sure we aren't replacing this with an array N times smaller. llvm-svn: 267916	2016-04-28 18:38:48 +00:00
Sriraman Tallam	46d47b8ce2	Add "PIE Level" metadata to module flags. http://reviews.llvm.org/D19671 llvm-svn: 267911	2016-04-28 18:15:44 +00:00
Eugene Zelenko	5354a8aa4d	Fix some Clang-tidy modernize and Include What You Use warnings. Differential revision: http://reviews.llvm.org/D19673 llvm-svn: 267910	2016-04-28 18:04:41 +00:00
Rong Xu	62d5e473ce	[PGO] Fix incorrect Twine usage in emitting optimization remarks. Should not store Twine objects to local variables. This is fixed the test failures with r267815 in VS2015 X64 build. llvm-svn: 267908	2016-04-28 17:49:56 +00:00
Rong Xu	08afb05491	Minor format change and fixing typos in the comments. NFC. llvm-svn: 267905	2016-04-28 17:31:22 +00:00
Krzysztof Parzyszek	0e7d2d339d	[Hexagon] Define certain aliases for vector instructions Specifically: Vd = #0 -> Vd = vxor(Vd, Vd) Vdd = #0 -> Vdd.w = vsub(Vdd.w, Vdd.w) Vdd = Vss -> Vdd = vcombine(Vss.H, Vss.L) llvm-svn: 267901	2016-04-28 16:43:16 +00:00
Simon Dardis	a2d8cc3db9	[mips][atomics] Fix partword atomic binary operation implementation Currently Mips::emitAtomicBinaryPartword() does not properly respect the width of pointers. For MIPS64 this causes the memory address that the ll/sc sequence uses to be truncated. At runtime this causes a segmentation fault. This can be fixed by applying similar changes as r266204, so that a full 64bit pointer is loaded. Reviewers: dsanders Differential Review: http://reviews.llvm.org/D19651 llvm-svn: 267900	2016-04-28 16:26:43 +00:00
Arch D. Robison	0e61034018	[SLPVectorizer] Extend SLP Vectorizer to deal with aggregates. The refactoring portion part was done as r267748. http://reviews.llvm.org/D14185 llvm-svn: 267899	2016-04-28 16:11:45 +00:00
Chad Rosier	712b7d7630	[GVN] Minor code cleanup. NFC. Differential Revision: http://reviews.llvm.org/D18828 Patch by Aditya Kumar! llvm-svn: 267898	2016-04-28 16:00:15 +00:00
Krzysztof Parzyszek	e737b86f8c	[Hexagon] Handle double-vector registers as new-value producers Patch by Colin LeMahieu. llvm-svn: 267897	2016-04-28 15:54:48 +00:00
Adrian Prantl	e5447574c8	Debug Info: Restore the pre-r240853 behavior for DWARF2 bitfields. The DWARF2 specification of DW_AT_bit_offset is ambiguous for little-endian machines, but by restoring to the old behavior we match what debuggers expect and what other popular compilers generate. llvm-svn: 267896	2016-04-28 15:37:52 +00:00
Adrian Prantl	f393d313ec	Debug info: Support DWARF4 bitfields via DW_AT_data_bit_offset. The DWARF2 specification of DW_AT_bit_offset was written from the perspective of a big-endian machine with unclear semantics for other systems. DWARF4 deprecated DW_AT_bit_offset and introduced a new attribute DW_AT_data_bit_offset that simply counts the number of bits from the beginning of the containing entity regardless of endianness. After this patch LLVM emits DW_AT_bit_offset for DWARF 2 or 3 and DW_AT_data_bit_offset when DWARF 4 or later is requested. llvm-svn: 267895	2016-04-28 15:37:48 +00:00
Geoff Berry	5ae272c2c1	[EarlyCSE] Change LoadValue field Value Data to Instruction Inst. NFC. Made in preparation for adding MemorySSA support to EarlyCSE. llvm-svn: 267893	2016-04-28 15:22:37 +00:00
Krzysztof Parzyszek	efd72857a3	[RDF] Handle undefined registers in RDF copy propagation When updating the graph, make sure that new uses without reaching defs are handled correctly. llvm-svn: 267891	2016-04-28 15:09:19 +00:00
Geoff Berry	354fac2a69	[EarlyCSE] Sort includes. NFC. Reviewers: mcrosier Subscribers: mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D19617 llvm-svn: 267890	2016-04-28 14:59:27 +00:00
Yaron Keren	8300995548	Rangify for loops, NFC. llvm-svn: 267889	2016-04-28 14:49:44 +00:00
Chad Rosier	567556aa9c	[Inliner] Formatting. NFC. Patch by Aditya Kumar! Differential Revision: http://reviews.llvm.org/D19047 llvm-svn: 267888	2016-04-28 14:47:23 +00:00
Ahmed Bougacha	17482a5696	[InstCombine] Remove trailing whitespace. NFC. r267873. llvm-svn: 267887	2016-04-28 14:36:07 +00:00
Simon Pilgrim	bd4a3be7d2	[InstCombine][SSE] Add MOVMSK support to SimplifyDemandedUseBits The MOVMSK instructions copies a vector elements' sign bits to the low bits of a scalar register and zeros the high bits. This patch adds MOVMSK support to SimplifyDemandedUseBits so that its aware that the upper bits are known to be zero. It also removes the call to MOVMSK if none of the lower bits are actually required and just returns zero. Differential Revision: http://reviews.llvm.org/D19614 llvm-svn: 267873	2016-04-28 12:22:53 +00:00
Craig Topper	477649a4c0	[X86] Remove unused operand from a function and all its callers. NFC llvm-svn: 267854	2016-04-28 05:58:46 +00:00
Craig Topper	33772c5375	[CodeGen] Default CTTZ_ZERO_UNDEF/CTLZ_ZERO_UNDEF to Expand in TargetLoweringBase. This is what the majority of the targets want and removes a bunch of code. Set it to Legal explicitly in the few cases where that's the desired behavior. llvm-svn: 267853	2016-04-28 03:34:31 +00:00
Matthias Braun	fbe85ae12e	CodeGen: Add DetectDeadLanes pass. The DetectDeadLanes pass performs a dataflow analysis of used/defined subregister lanes across COPY instructions and instructions that will get lowered to copies. It detects dead definitions and uses reading undefined values which are obscured by COPY and subregister usage. These dead definitions cause trouble in the register coalescer which cannot deal with definitions suddenly becoming dead after coalescing COPY instructions. For now the pass only adds dead and undef flags to machine operands. It should be possible to extend it in the future to remove the dead instructions and redo the analysis for the affected virtual registers. Differential Revision: http://reviews.llvm.org/D18427 llvm-svn: 267851	2016-04-28 03:07:16 +00:00
Matthias Braun	c9e759acff	LiveIntervalAnalysis: Fix handleMove() using wrong value numbers handleMove() was incorrectly swapping two value numbers. This was missed before because the problem only occured when moving subregister definitions and needed -verify-machineinstrs to be detected. I cannot add a testcase as long as I cannot reapply r260905/r260806. llvm-svn: 267840	2016-04-28 02:11:49 +00:00
Craig Topper	3b4842b56f	[AArch64] Expand CTTZ for all vector types. llvm-svn: 267837	2016-04-28 01:58:21 +00:00
Chaoren Lin	49317f2d90	Use llvm:Twine instead of std::to_string. std::to_string is not available from the Android NDK. Reviewers: lhames, ovyalov, chandlerc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D19638 llvm-svn: 267829	2016-04-28 00:49:37 +00:00
Bryan Chan	893110ecaf	[SystemZ] Support Swift Calling Convention Summary: Port rL265480, rL264754, rL265997 and rL266252 to SystemZ, in order to enable the Swift port on the architecture. SwiftSelf and SwiftError are assigned to R10 and R9, respectively, which are normally callee-saved registers. For more information, see: RFC: Implementing the Swift calling convention in LLVM and Clang https://groups.google.com/forum/#!topic/llvm-dev/epDd2w93kZ0 Reviewers: kbarton, manmanren, rjmccall, uweigand Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D19414 llvm-svn: 267823	2016-04-28 00:17:23 +00:00
Peter Collingbourne	edf8432480	LTO: Don't bother trying to mangle unnamed globals, as they can't be preserved with MustPreserveSymbols. Summary: Should fix sanitizer-windows bot. Reviewers: joker.eph Subscribers: llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D19635 llvm-svn: 267820	2016-04-27 23:48:11 +00:00
Zachary Turner	1822af542f	Parse module information from DBI stream. This gets more data out of the DBI strema of the PDB. In particular it extracts the metadata for the list of modules (compilands) that this PDB contains info about, and adds support for dumping these fields to llvm-pdbdump. Differential Revision: http://reviews.llvm.org/D19570 Reviewed By: ruiu llvm-svn: 267818	2016-04-27 23:41:42 +00:00
Quentin Colombet	12b69919a2	[ImplicitNullChecks] Properly update the live-in of the block of the memory operation. We basically replace: HoistBB: cond_br NullBB, NotNullBB NullBB: ... NotNullBB: <reg> = load into HoistBB <reg> = load_faulting_op NullBB uncond_br NotNullBB NullBB: ... NotNullBB: ## <reg> is now live-in of NotNullBB ... This partially fixes the machine verifier error for test/CodeGen/X86/implicit-null-check.ll, but it still fails because of the implicit CFG structure. llvm-svn: 267817	2016-04-27 23:26:40 +00:00
Rong Xu	6e34c490ff	[PGO] Promote indirect calls to conditional direct calls with value-profile This patch implements the transformation that promotes indirect calls to conditional direct calls when the indirect-call value profile meta-data is available. Differential Revision: http://reviews.llvm.org/D17864 llvm-svn: 267815	2016-04-27 23:20:27 +00:00
Sanjay Patel	facf45a82f	[SimplifyCFG] propagate branch metadata when creating select There's no existing test for this path, and I don't know how to expose it in a regression test, but I'm assuming there's some reason this path exists. llvm-svn: 267813	2016-04-27 23:14:12 +00:00
Lang Hames	f88174dd80	[RuntimeDyld] Propagate another dropped error in RuntimeDyldELF. This should fix the PPC64 bots. llvm-svn: 267810	2016-04-27 22:54:03 +00:00
Mitch Bodart	e60465ddf7	[X86] Enable the post-RA-scheduler for clang's default 32-bit cpu. For compilations with no explicit cpu specified, this exhibits nice gains on Silvermont, with neutral performance on big cores. Differential Revision: http://reviews.llvm.org/D19138 llvm-svn: 267809	2016-04-27 22:52:35 +00:00
Quentin Colombet	bf200688de	[X86][FastISel] Make sure we use the right register class when we select stores. llvm-svn: 267806	2016-04-27 22:33:42 +00:00
Colin LeMahieu	a3782da3e3	[Hexagon] Merging nops in to previous packet rather than always creating a new one. llvm-svn: 267798	2016-04-27 21:37:44 +00:00
Quentin Colombet	d6dbec4c6f	[X86] Fix the lowering of TLS calls. The callseq_end node must be glued with the TLS calls, otherwise, the generic code will miss the uses of the returned value and will mark it dead. Moreover, TLSCall 64-bit pseudo must not set an implicit-use on RDI, the pseudo uses the symbol address at this point not RDI and the lowering will do the right thing. llvm-svn: 267797	2016-04-27 21:37:37 +00:00
Colin LeMahieu	485d905510	[MCAssembler] Allow backend to finalize layout post-relaxation. Differential revision: http://reviews.llvm.org/D19429 llvm-svn: 267796	2016-04-27 21:26:13 +00:00
Rong Xu	af5aebaa32	[PGO] Prohibit address recording if the function is both internal and COMDAT Differential Revision: http://reviews.llvm.org/D19515 llvm-svn: 267792	2016-04-27 21:17:30 +00:00
Matt Arsenault	0547b016b1	AMDGPU: Account for globals in AMDGPUPromoteAlloca pass Patch by Bas Nieuwenhuizen llvm-svn: 267791	2016-04-27 21:05:08 +00:00
Lang Hames	bc38ea9596	[RuntimeDyld] Add missing include - <string> is requried for std::to_string. This should fix the compile error that showed up in build: http://lab.llvm.org:8011/builders/lldb-x86_64-ubuntu-14.04-buildserver/builds/6754/ llvm-svn: 267790	2016-04-27 20:54:49 +00:00
Lang Hames	09a74c46ec	[RuntimeDyld] Propagate Errors from findPPC64TOCSection. llvm-svn: 267789	2016-04-27 20:51:58 +00:00
Ahmed Bougacha	65572afea8	[ARM] Set AddPristinesAndCSRs to expandCMP_SWAP LivePhysRegs. We run after PEI. Found via inspection; no obvious testcase. Follow-up to r266679. llvm-svn: 267781	2016-04-27 20:33:07 +00:00
Ahmed Bougacha	5a3bf6a4a9	[AArch64] Set AddPristinesAndCSRs to expandCMP_SWAP LivePhysRegs. We run after PEI. Found via inspection; no obvious testcase. Follow-up to r266339. llvm-svn: 267780	2016-04-27 20:33:05 +00:00
Ahmed Bougacha	9e71425f54	[AArch64] Set correct successors in CMPXCHG pseudo expansion. transferSuccessors() would LoadCmpBB a successor of DoneBB, whereas it should be a successor of the original MBB. Follow-up to r266339. Unfortunately, it's tricky to catch this in the verifier. llvm-svn: 267779	2016-04-27 20:33:02 +00:00
Ahmed Bougacha	b4af107239	[ARM] Set correct successors in CMPXCHG pseudo expansion. transferSuccessors() would LoadCmpBB a successor of DoneBB, whereas it should be a successor of the original MBB. The testcase changes are caused by Thumb2SizeReduction, which was previously confused by the broken CFG. Follow-up to r266679. Unfortunately, it's tricky to catch this in the verifier. llvm-svn: 267778	2016-04-27 20:32:54 +00:00
Lang Hames	8959531c51	[RuntimeDyld] Plumb Error/Expected through the internals of RuntimeDyld. Also replaces a number of calls to report_fatal_error with Error returns. The plumbing will make it easier to return errors originating in libObject. Replacing report_fatal_errors with Error returns will give JIT clients the opportunity to recover gracefully when the JIT is unable to produce/relocate code, as well as providing meaningful error messages that can be used to file bug reports. llvm-svn: 267776	2016-04-27 20:24:48 +00:00
Than McIntosh	a541320908	Fix build failure under NDEBUG. llvm-svn: 267774	2016-04-27 20:07:02 +00:00
Kevin B. Smith	c378a99ba5	[X86]: Quit promoting 16 bit loads to 32 bit. Differential Revision: http://reviews.llvm.org/D19592 llvm-svn: 267773	2016-04-27 19:58:03 +00:00
Kostya Serebryany	0e0bcc4bdb	[libFuzzer] disable leak detection if we have tried it for 1000 times w/o finding a leak [part 2] llvm-svn: 267771	2016-04-27 19:52:56 +00:00
Kostya Serebryany	7018a1aaa4	[libFuzzer] disable leak detection if we have tried it for 1000 times w/o finding a leak llvm-svn: 267770	2016-04-27 19:52:34 +00:00
Andrew Kaylor	289bd5f684	Add optimization bisect opt-in calls for PowerPC passes Differential Revision: http://reviews.llvm.org/D19554 llvm-svn: 267769	2016-04-27 19:39:32 +00:00
David Majnemer	0c80e2eac6	[CodeGenPrepare] Don't sink a cast past its user The sink cast machinery is supposed to sink casts as close to their user as possible. However, an EH pad is the first instruction in it's basic block. Don't sink if the user is an EH pad. This fixes PR27536. llvm-svn: 267767	2016-04-27 19:36:38 +00:00
Than McIntosh	1b60168576	Refactor debugging code, NFC. Summary: Refactor debugging routines to reduce code duplication. Remove a couple of #include's that were not needed. Don't require MachineDominator as a prereq for this pass (not needed). These changes split off from http://reviews.llvm.org/D18827. Reviewers: wmi, gbiv, qcolombet Subscribers: llvm-commits, davidxl, jevinskie Differential Revision: http://reviews.llvm.org/D18992 llvm-svn: 267766	2016-04-27 19:26:25 +00:00
Justin Lebar	7cdbce5946	[NVPTX] Run NVVMReflect at the beginning of IR passes. Summary: Currently the NVVMReflect pass is run at the beginning of our backend passes. But really, it should be run as early as possible, as it's simply resolving an "if" statement in code. So copy it into TargetMachine::addEarlyAsPossiblePasses. We still run it at the beginning of the backend passes, since it's needed for correctness when lowering to nvptx. (Specifically, NVVMReflect changes each call to the __nvvm_reflect function or llvm.nvvm.reflect intrinsic into an integer constant, based on the pass's configuration. Clearly we miss many optimization opportunities if we perform this transformation at the beginning of codegen.) Reviewers: rnk Subscribers: tra, llvm-commits, jholewinski Differential Revision: http://reviews.llvm.org/D18616 llvm-svn: 267765	2016-04-27 19:13:37 +00:00
Ahmed Bougacha	ace97c1f7d	[LIR] Set attributes on memset_pattern16. "inferattrs" will deduce the attribute, but it will be too late for many optimizations. Set it ourselves when creating the call. Differential Revision: http://reviews.llvm.org/D17598 llvm-svn: 267762	2016-04-27 19:04:50 +00:00
Ahmed Bougacha	7f97193dd7	[LIR] Reuse variable. NFCI. llvm-svn: 267761	2016-04-27 19:04:46 +00:00
Ahmed Bougacha	44c19876c7	[InferAttrs] Mark memset_pattern16 params nocapture. Differential Revision: http://reviews.llvm.org/D19471 llvm-svn: 267760	2016-04-27 19:04:43 +00:00
Ahmed Bougacha	b0624a2cb4	[TLI] Unify LibFunc attribute inference. NFCI. Now the pass is just a tiny wrapper around the util. This lets us reuse the logic elsewhere (done here for BuildLibCalls) instead of duplicating it. The next step is to have something like getOrInsertLibFunc that also sets the attributes. Differential Revision: http://reviews.llvm.org/D19470 llvm-svn: 267759	2016-04-27 19:04:40 +00:00
Ahmed Bougacha	d765a82b54	[TLI] Unify LibFunc signature checking. NFCI. I tried to be as close as possible to the strongest check that existed before; cleaning these up properly is left for future work. Differential Revision: http://reviews.llvm.org/D19469 llvm-svn: 267758	2016-04-27 19:04:35 +00:00
Ahmed Bougacha	220c4010bf	[TLI] Fix indentation. NFC. llvm-svn: 267757	2016-04-27 19:04:29 +00:00
Sjoerd Meijer	41beee6575	Clean up to avoid compiler warnings for casting away const qualifiers. Differential Revision: http://reviews.llvm.org/D19598 llvm-svn: 267753	2016-04-27 18:35:02 +00:00
Chad Rosier	03e1647d19	Revert "[AMDGPU][llvm-mc] Add support of TTMP quads. Rework M0 exclusion for SMRD." This reverts commit r267733 due to a -Werror,-Wunused-function error. llvm-svn: 267752	2016-04-27 18:29:11 +00:00
Matthew Simpson	622b95be7b	[LV] Reallow positive-stride interleaved load groups with gaps We previously disallowed interleaved load groups that may cause us to speculatively access memory out-of-bounds (r261331). We did this by ensuring each load group had an access corresponding to the first and last member. Instead of bailing out for these interleaved groups, this patch enables us to peel off the last vector iteration, ensuring that we execute at least one iteration of the scalar remainder loop. This solution was proposed in the review of the previous patch. Differential Revision: http://reviews.llvm.org/D19487 llvm-svn: 267751	2016-04-27 18:21:36 +00:00
Arch D. Robison	aca7c412b4	[SLPVectorizer] Refactor where MinVecRegSize and MaxVecRegSize live. This is the first of two commits for extending SLP Vectorizer to deal with aggregates. This commit merely refactors existing logic. http://reviews.llvm.org/D14185 llvm-svn: 267748	2016-04-27 17:46:25 +00:00
Gerolf Hoflehner	50426191d7	[DAGCombiner] Follow coding convention for function name (NFC) llvm-svn: 267745	2016-04-27 17:27:16 +00:00
Marcin Koscielnicki	7efdca5622	[Mips] Add support for llvm.thread.pointer intrinsic. This will be used to implement __builtin_thread_pointer in clang. Differential Revision: http://reviews.llvm.org/D19569 llvm-svn: 267743	2016-04-27 17:21:49 +00:00
Reid Kleckner	7f0ae15e9d	Silence a -Wdangling-else llvm-svn: 267737	2016-04-27 16:46:33 +00:00
Matthew Simpson	47bd3994b7	Add parentheses to silence buildbot warning llvm-svn: 267734	2016-04-27 16:25:04 +00:00
Artem Tamazov	3896f8f83d	[AMDGPU][llvm-mc] Add support of TTMP quads. Rework M0 exclusion for SMRD. Added support of TTMP quads. Reworked M0 exclusion machinery for SMRD and similar instructions to enable usage of TTMP registers in those instructions as destinations. Tests added. Differential Revision: http://reviews.llvm.org/D19342 llvm-svn: 267733	2016-04-27 16:20:23 +00:00
Reid Kleckner	0336cc05e7	[PDB] Fix function names for private symbols in PDBs Summary: llvm-symbolizer wants to get linkage names of functions for historical reasons. Linkage names are only recorded in the PDB for public symbols, and the linkage name is apparently stored separately in some "public symbol" record. We had a workaround in PDBContext which would look for such symbols when the user requested linkage names. However, when given an address that was truly in a private function and public funciton, we would accidentally find nearby public symbols and return those function names. The fix is to look for both function symbols and public symbols and only prefer the public symbol name if the addresses of the symbols agree. Fixes PR27492 Reviewers: zturner Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D19571 llvm-svn: 267732	2016-04-27 16:10:29 +00:00
Nicolai Haehnle	f66bdb5ea8	AMDGPU/SI: Add llvm.amdgcn.s.waitcnt.all intrinsic Summary: So it appears that to guarantee some of the ordering requirements of a GLSL memoryBarrier() executed in the shader, we need to emit an s_waitcnt. (We can't use an s_barrier, because memoryBarrier() may appear anywhere in the shader, in particular it may appear in non-uniform control flow.) Reviewers: arsenm, mareko, tstellarAMD Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D19203 llvm-svn: 267729	2016-04-27 15:46:01 +00:00
Matthew Simpson	e5dfb08fcb	[TTI] Add hook for vector extract with extension This change adds a new hook for estimating the cost of vector extracts followed by zero- and sign-extensions. The motivating example for this change is the SMOV and UMOV instructions on AArch64. These instructions move data from vector to general purpose registers while performing the corresponding extension (sign-extend for SMOV and zero-extend for UMOV) at the same time. For these operations, TargetTransformInfo can assume the extensions are free and only report the cost of the vector extract. The SLP vectorizer has been updated to make use of the new hook. Differential Revision: http://reviews.llvm.org/D18523 llvm-svn: 267725	2016-04-27 15:20:21 +00:00
Artem Tamazov	5cd55b1784	[AMDGPU][llvm-mc] s_getreg/setreg* - Support symbolic names of hardware registers. Possibility to specify code of hardware register kept. Disassemble to symbolic name, if name is known. Tests updated/added. Differential Revision: http://reviews.llvm.org/D19335 llvm-svn: 267724	2016-04-27 15:17:03 +00:00
Nico Weber	e69b9548b8	Revert r267649, it caused PR27539. llvm-svn: 267723	2016-04-27 15:16:54 +00:00

1 2 3 4 5 ...

89803 Commits