llvm-project

Commit Graph

Author	SHA1	Message	Date
Duncan P. N. Exon Smith	3269215401	CodeGen: Use a single SlotTracker in MachineFunction::print() Expose enough of the IR-level `SlotTracker` so that `MachineFunction::print()` can use a single one for printing `BasicBlock`s. Next step would be to lift this through a few more APIs so that we can make other print methods faster. Fixes PR23865, changing the runtime of `llc -print-machineinstrs` from many minutes (killed after 3 minutes, but it wasn't very close) to 13 seconds for a 502185 line dump. llvm-svn: 240842	2015-06-26 22:04:20 +00:00
Adrian Prantl	06b298e4b6	Debug Info: Clarify the documentation for bitfields emission. llvm-svn: 240835	2015-06-26 21:27:30 +00:00
Pete Cooper	485d1146db	Convert a bunch of loops to foreach. NFC. This uses the new SDNode::op_values() iterator range committed in r240805. llvm-svn: 240822	2015-06-26 19:37:02 +00:00
Pete Cooper	af61ac71e2	Wrap assert loops in #ifndef NDEBUG The body of the loops here only contained asserts. This triggered an unused variable warning on release builds and -Werror on the bots. llvm-svn: 240819	2015-06-26 19:23:20 +00:00
Pete Cooper	9271ccc345	Convert a bunch of loops to foreach. NFC. This uses the new SDNode::op_values() iterator range committed in r240805. llvm-svn: 240817	2015-06-26 19:18:49 +00:00
Pete Cooper	8fc121dfc4	Convert a bunch of loops to foreach. NFC. This uses the new SDNode::op_values() iterator range committed in r240805. llvm-svn: 240815	2015-06-26 19:08:33 +00:00
Matt Arsenault	572c29afc9	Show invariant loads in MMO dumping llvm-svn: 240813	2015-06-26 19:00:11 +00:00
Pete Cooper	8c0a710995	Convert a bunch of loops to foreach. NFC. This uses the new SDNode::op_values() iterator range committed in r240805. llvm-svn: 240809	2015-06-26 18:41:54 +00:00
Alex Lorenz	ec6b26b955	Fix unused variable from r240792. The variable 'I' wasn't used when assertions were disabled. This commit ensures that 'I' is used outside of an assert. llvm-svn: 240797	2015-06-26 17:07:27 +00:00
Benjamin Kramer	1dcd8b09b4	[DAGCombine] Fix demanded bits computation for exact shifts. Fixes a miscompilation of MultiSource/Benchmarks/MallocBench/gs llvm-svn: 240796	2015-06-26 16:59:31 +00:00
Alex Lorenz	33f0aef32f	MIR Serialization: Serialize machine basic block operands. This commit serializes machine basic block operands. The machine basic block operands use the following syntax: %bb.<id>[.<name>] This commit also modifies the YAML representation for the machine basic blocks - a new, required field 'id' is added to the MBB YAML mapping. The id is used to resolve the MBB references to the actual MBBs. And while the name of the MBB can be included in a MBB reference, this name isn't used to resolve MBB references - as it's possible that multiple MBBs will reference the same BB and thus they will have the same name. If the name is specified, the parser will verify that it is equal to the name of the MBB with the specified id. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10608 llvm-svn: 240792	2015-06-26 16:46:11 +00:00
Benjamin Kramer	c2ae767377	[DAGCombiner] Preserve the exact bit when simplifying SRA to SRL. Allows more aggressive folding of ashr/shl pairs. llvm-svn: 240788	2015-06-26 14:51:49 +00:00
Benjamin Kramer	07e70b4fa4	[DAGCombine] fold (X >>?,exact C1) << C2 --> X << (C2-C1) Instcombine also does this but many opportunities only become visible after GEPs are lowered. llvm-svn: 240787	2015-06-26 14:51:36 +00:00
Hao Liu	b41c0b44af	[InterleavedAccess] Fix failures "undefined type 'llvm::raw_ostream'" on windows. llvm-svn: 240760	2015-06-26 04:38:21 +00:00
Hao Liu	1c1e0c9e71	[InterleavedAccess] Add a pass InterleavedAccess to identify interleaved memory accesses and transform into target specific intrinsics. E.g. An interleaved load (Factor = 2): %wide.vec = load <8 x i32>, <8 x i32>* %ptr %v0 = shuffle <8 x i32> %wide.vec, <8 x i32> undef, <0, 2, 4, 6> %v1 = shuffle <8 x i32> %wide.vec, <8 x i32> undef, <1, 3, 5, 7> It can be transformed into a ld2 intrinsic in AArch64 backend or a vld2 intrinsic in ARM backend. E.g. An interleaved store (Factor = 3): %i.vec = shuffle <8 x i32> %v0, <8 x i32> %v1, <0, 4, 8, 1, 5, 9, 2, 6, 10, 3, 7, 11> store <12 x i32> %i.vec, <12 x i32>* %ptr It can be transformed into a st3 intrinsic in AArch64 backend or a vst3 intrinsic in ARM backend. Differential Revision: http://reviews.llvm.org/D10533 llvm-svn: 240751	2015-06-26 02:10:27 +00:00
Duncan P. N. Exon Smith	827200c822	AsmPrinter: Use an intrusively linked list for DIE::Children Replace the `std::vector<>` for `DIE::Children` with an intrusively linked list. This is a strict memory improvement: it requires no auxiliary storage, and reduces `sizeof(DIE)` by one pointer. It also factors out the DIE-related malloc traffic. This drops llc memory usage from 735 MB down to 718 MB, or ~2.3%. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 240736	2015-06-25 23:52:10 +00:00
Duncan P. N. Exon Smith	4fb1f9cda6	AsmPrinter: Convert DIE::Values to a linked list Change `DIE::Values` to a singly linked list, where each node is allocated on a `BumpPtrAllocator`. In order to support `push_back()`, the list is circular, and points at the tail element instead of the head. I abstracted the core list logic out to `IntrusiveBackList` so that it can be reused for `DIE::Children`, which also cares about `push_back()`. This drops llc memory usage from 799 MB down to 735 MB, about 8%. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 240733	2015-06-25 23:46:41 +00:00
Matt Arsenault	f735cab986	DAGCombiner: Use pop_back_val() llvm-svn: 240709	2015-06-25 22:15:05 +00:00
Sanjay Patel	e4aedb55d6	fix typos; NFC llvm-svn: 240699	2015-06-25 21:11:08 +00:00
Matt Arsenault	c244dcb804	DAGCombiner: Remove redundant check MemIntrinsicSDNode is already a subclass of MemSDNode, so the MemSDNode check is sufficient. llvm-svn: 240672	2015-06-25 18:47:02 +00:00
Bruno Cardoso Lopes	edb876d52c	[AsmPrinter] Fix crash in handleIndirectSymViaGOTPCRel Check for symbols in MCValue before using them. Bail out early in case they are null. This fixes PR23779. Differential Revision: http://reviews.llvm.org/D10712 rdar://problem/21532830 llvm-svn: 240649	2015-06-25 15:17:23 +00:00
Akira Hatanaka	14348aa2c5	[If Converter] Convert recursion to iteration. This commit makes changes to IfConverter::AnalyzeBlock to use iteration instead of recursion. Previously, this function would get called recursively a large number of times and eventually segfault when a function with the following CFG was compiled: BB0: if (condition0) goto BB1 goto BB2 BB1: goto BB2 BB2: if (condition1) goto BB3 goto BB4 BB3: ... (repeat until BB7488) rdar://problem/21386145 Differential Revision: http://reviews.llvm.org/D10587 llvm-svn: 240589	2015-06-24 20:34:35 +00:00
Alex Lorenz	54565cf02b	MIR Serialization: Serialize simple MachineRegisterInfo attributes. This commit serializes the 3 scalar boolean attributes from the MachineRegisterInfo class: IsSSA, TracksRegLiveness, and TracksSubRegLiveness. These attributes are serialized as part of the machine function YAML mapping. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10618 llvm-svn: 240579	2015-06-24 19:56:10 +00:00
Duncan P. N. Exon Smith	9dbb5013b7	AsmPrinter: Cleanup DIEValue::EmitValue() API, NFC Stop taking a `dwarf::Form` in `DIEValue::EmitValue()` and `DIEValue::SizeOf()`, since they're always passed `DIEValue::getForm()` anyway. This is just left over from when `DIEValue` didn't know its own form. llvm-svn: 240566	2015-06-24 18:48:11 +00:00
Alex Lorenz	12b554e6a7	MIR Serialization: Serialize the null register operands. This commit serializes the null register machine operands. It uses the '_' keyword to represent them, but the parser also allows the '%noreg' named register syntax. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10580 llvm-svn: 240558	2015-06-24 17:34:58 +00:00
Daniel Sanders	110bf6da75	Eliminate additional redundant copies of Triple objects. NFC. Subscribers: rafael, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D10654 llvm-svn: 240540	2015-06-24 13:25:57 +00:00
Pawel Bylica	cc35812877	Fix instruction scheduling live register tracking Summary: This patch fixes PR23405 (https://llvm.org/bugs/show_bug.cgi?id=23405). During a node unscheduling an entry in LiveRegGens can be replaced with a new value. That corrupts the live reg tracking and LiveReg* structure is not cleared as should be during unscheduling. Problematic condition that enforces Gen replacement is `I->getSUnit()->getHeight() < LiveRegGens[I->getReg()]->getHeight()`. This condition should be checked only if LiveRegGen was set in current node unscheduling. Test Plan: Regression test included. Reviewers: hfinkel, atrick Reviewed By: atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9993 llvm-svn: 240538	2015-06-24 12:49:42 +00:00
NAKAMURA Takumi	c267b5f5aa	MILexer.cpp: Try to fix a warning. [-Wsign-compare] llvm-svn: 240525	2015-06-24 06:40:09 +00:00
Alex Lorenz	240fc1e0aa	MIR Serialization: Serialize immediate machine operands. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10573 llvm-svn: 240481	2015-06-23 23:42:28 +00:00
Alex Lorenz	51af160f4c	MIR Parser: Use correct source locations for machine instruction diagnostics. This commit translates the source locations for MIParser diagnostics from the locations in the machine instruction string to the locations in the MIR file. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10574 llvm-svn: 240474	2015-06-23 22:39:23 +00:00
Sanjoy Das	3f1bc3b2bb	Revert "[FaultMaps] Move FaultMapParser to Object/" This reverts commit r240364 (git c49542e5bb186). The issue r240364 was trying to fix was fixed independently in r240362. llvm-svn: 240448	2015-06-23 20:09:03 +00:00
Alex Lorenz	f3db51de5e	MIR Serialization: Serialize physical register machine operands. This commit introduces functionality that's used to serialize machine operands. Only the physical register operands are serialized by this commit. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10525 llvm-svn: 240425	2015-06-23 16:35:26 +00:00
Benjamin Kramer	8c57cfd51b	[BranchFolding] Document why replacing HashMachineInstr with hash_code doesn't work llvm-svn: 240415	2015-06-23 14:47:36 +00:00
Benjamin Kramer	6b568964ba	[MachineBasicBlock] Add getFirstNonDebugInstr to complement getLastNonDebugInstr Use it in CodeGen where applicable. No functionality change intended. llvm-svn: 240414	2015-06-23 14:47:29 +00:00
Benjamin Kramer	9c956b33d7	[MachineBasicBlock] Use the const_cast(this) trick to reduce duplication NFC. llvm-svn: 240413	2015-06-23 14:47:18 +00:00
Rafael Espindola	c233f74e6e	Simplify the Mangler interface now that DataLayout is mandatory. We only need to pass in a DataLayout when mangling a raw string, not when constructing the mangler. llvm-svn: 240405	2015-06-23 13:59:29 +00:00
Rafael Espindola	ce4c2bc1d6	Use MCSymbols for FastISel. The summary is that it moves the mangling earlier and replaces a few calls to .addExternalSymbol with addSym. I originally wanted to replace all the uses of addExternalSymbol with addSym, but noticed it was a lot of work and doesn't need to be done all at once. llvm-svn: 240395	2015-06-23 12:21:54 +00:00
Alexander Kornienko	f00654e31b	Revert r240137 (Fixed/added namespace ending comments using clang-tidy. NFC) Apparently, the style needs to be agreed upon first. llvm-svn: 240390	2015-06-23 09:49:53 +00:00
Sanjoy Das	9d95716c15	[FaultMaps] Move FaultMapParser to Object/ Summary: That way llvm-objdump can rely on it without adding an extra dependency on CodeGen. This change duplicates the FaultKind enum and the code that serializes it to a string. I could not figure out a way to get around this without adding a new dependency to Object Reviewers: rafael, ab Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10619 llvm-svn: 240364	2015-06-23 01:05:26 +00:00
Sanjay Patel	e79b43a01f	[x86] generalize reassociation optimization in machine combiner to 2 instructions Currently ( D10321, http://reviews.llvm.org/rL239486 ), we can use the machine combiner pass to reassociate the following sequence to reduce the critical path: A = ? op ? B = A op X C = B op Y --> A = ? op ? B = X op Y C = A op B 'op' is currently limited to x86 AVX scalar FP adds (with fast-math on), but in theory, it could be any associative math/logic op (see TODO in code comment). This patch generalizes the pattern match to ignore the instruction that defines 'A'. So instead of a sequence of 3 adds, we now only need to find 2 dependent adds and decide if it's worth reassociating them. This generalization has a compile-time cost because we can now match more instruction sequences and we rely more heavily on the machine combiner to discard sequences where reassociation doesn't improve the critical path. For example, in the new test case: A = M div N B = A add X C = B add Y We'll match 2 reassociation patterns, but this transform doesn't reduce the critical path: A = M div N B = A add Y C = B add X We need the combiner to reject that pattern but select this: A = M div N B = X add Y C = B add A Differential Revision: http://reviews.llvm.org/D10460 llvm-svn: 240361	2015-06-23 00:39:40 +00:00
Pawel Bylica	e6fd8c4232	Revert r240291: causes problems in self-hosted builds. llvm-svn: 240343	2015-06-22 21:54:07 +00:00
Alex Lorenz	91370c5d62	MIR Serialization: Introduce a lexer for machine instructions. This commit adds a function that tokenizes the string containing the machine instruction. This commit also adds a struct called 'MIToken' which is used to represent the lexer's tokens. Reviewers: Sean Silva Differential Revision: http://reviews.llvm.org/D10521 llvm-svn: 240323	2015-06-22 20:37:46 +00:00
Sanjoy Das	cee60be640	Fix MSVC build. I had some unnecessary `typename`s left in after addressing review. This compiled successfully with clang++ but MSVC reported an error. Fix the build error by removing the redundant `typename`s. llvm-svn: 240307	2015-06-22 18:20:10 +00:00
Sanjoy Das	6f567a4b79	[FaultMaps] Add a parser for the __llvm__faultmaps section. Summary: The parser is exercised by llvm-objdump using -print-fault-maps. As is probably obvious, the code itself was "heavily inspired" by http://reviews.llvm.org/D10434. Reviewers: reames, atrick, JosephTremoulet Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10491 llvm-svn: 240304	2015-06-22 18:03:02 +00:00
Rafael Espindola	36b718fc74	Avoid a Symbol -> Name -> Symbol conversion. Before this we were producing a TargetExternalSymbol from a MCSymbol. That meant extracting the symbol name and fetching the symbol again down the pipeline. This patch adds a DAG.getMCSymbol that lets the MCSymbol pass unchanged on the DAG. Doing so removes the need for MO_NOPREFIX and fixes the root cause of pr23900, allowing r240130 to be committed again. llvm-svn: 240300	2015-06-22 17:46:53 +00:00
Alex Lorenz	8e0a1b4857	MIR Serialization: Serialize machine instruction names. This commit implements initial machine instruction serialization. It serializes machine instruction names. The instructions are represented using a YAML sequence of string literals and are a part of machine basic block YAML mapping. This commit introduces a class called 'MIParser' which will be used to parse the machine instructions and operands. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10481 llvm-svn: 240295	2015-06-22 17:02:30 +00:00
Pawel Bylica	06407c0320	Fix shl folding in DAG combiner. Summary: The code responsible for shl folding in the DAGCombiner was assuming incorrectly that all constants are less than 64 bits. This patch simply changes the way values are compared. Test Plan: A regression test included. Reviewers: andreadb Reviewed By: andreadb Subscribers: andreadb, test, llvm-commits Differential Revision: http://reviews.llvm.org/D10602 llvm-svn: 240291	2015-06-22 15:58:11 +00:00
Chandler Carruth	c3f49eb451	[PM/AA] Hoist the AliasResult enum out of the AliasAnalysis class. This will allow classes to implement the AA interface without deriving from the class or referencing an internal enum of some other class as their return types. Also, to a pretty fundamental extent, concepts such as 'NoAlias', 'MayAlias', and 'MustAlias' are first class concepts in LLVM and we aren't saving anything by scoping them heavily. My mild preference would have been to use a scoped enum, but that feature is essentially completely broken AFAICT. I'm extremely disappointed. For example, we cannot through any reasonable[1] means construct an enum class (or analog) which has scoped names but converts to a boolean in order to test for the possibility of aliasing. [1]: Richard Smith came up with a "solution", but it requires class templates, and lots of boilerplate setting up the enumeration multiple times. Something like Boost.PP could potentially bundle this up, but even that would be quite painful and it doesn't seem realistically worth it. The enum class solution would probably work without the need for a bool conversion. Differential Revision: http://reviews.llvm.org/D10495 llvm-svn: 240255	2015-06-22 02:16:51 +00:00
Duncan P. N. Exon Smith	3a73d9e067	AsmPrinter: Don't emit empty .debug_loc entries If we don't know how to represent a .debug_loc entry, skip the entry entirely rather than emitting an empty one. Similarly, if a .debug_loc list has no entries, don't create the list. We still want to create the variables, just in an optimized-out form that doesn't have a DW_AT_location. llvm-svn: 240244	2015-06-21 16:54:56 +00:00
Duncan P. N. Exon Smith	e6cc531b1a	AsmPrinter: Rewrite initialization of DbgVariable, NFC There are three types of `DbgVariable`: - alloca variables, created based on the MMI table, - register variables, created based on DBG_VALUE instructions, and - optimized-out variables. This commit reconfigures `DbgVariable` to make it easier to tell which kind we have, and make initialization a little clearer. For MMI/alloca variables, `FrameIndex.size()` must always equal `Expr.size()`, and there shouldn't be an `MInsn`. For register variables (with a `MInsn`), `FrameIndex` must be empty, and `Expr` should have 0 or 1 element depending on whether it has a complex expression (registers with multiple locations use `DebugLocListIndex`). Optimized-out variables shouldn't have any of these fields. Moreover, this separates DBG_VALUE initialization until after the variable is created, simplifying logic in a future commit that changes `collectVariableInfo()` to stop creating empty .debug_loc entries/lists. llvm-svn: 240243	2015-06-21 16:50:43 +00:00
Hans Wennborg	6ed81cbcdb	Switch lowering: add heuristic for filling leaf nodes in the weight-balanced binary search tree Sparse switches with profile info are lowered as weight-balanced BSTs. For example, if the node weights are {1,1,1,1,1,1000}, the right-most node would end up in a tree by itself, bringing it closer to the top. However, a leaf in this BST can contain up to 3 cases, and having a single case in a leaf node as in the example means the tree might become unnecessarily high. This patch adds a heauristic to the pivot selection algorithm that moves more cases into leaf nodes unless that would lower their rank. It still doesn't yield the optimal tree in every case, but I believe it's conservatibely correct. llvm-svn: 240224	2015-06-20 17:14:07 +00:00
NAKAMURA Takumi	34d3376afc	Reformat. llvm-svn: 240213	2015-06-20 06:22:04 +00:00
NAKAMURA Takumi	3746abba00	Revert r240040, "[BranchFolding] Replace custom MachineInstr with MachineInstrExpressionTrait" It caused different emission between stage2 and stage3. Investigating. llvm-svn: 240212	2015-06-20 06:21:48 +00:00
Sanjoy Das	d200893741	[Statepoint] Remove unnecessary argument from Statepoint::getRelocates NFC. llvm-svn: 240198	2015-06-20 00:01:03 +00:00
Sanjay Patel	cfe0393b82	name change: hasPattern() -> getMachineCombinerPatterns() ; NFC This was suggested as part of D10460, but it's independent of any functional change. llvm-svn: 240192	2015-06-19 23:21:42 +00:00
Alex Lorenz	00302df3fe	MIR Parser: report an error when a basic block isn't found. This commit reports an error when the MIR parser can't find a basic block with the machine basic block's name. llvm-svn: 240174	2015-06-19 20:12:03 +00:00
Alex Lorenz	4f093bf1ce	MIR Serialization: Serialize the list of machine basic blocks with simple attributes. This commit implements the initial serialization of machine basic blocks in a machine function. Only the simple, scalar MBB attributes are serialized. The reference to LLVM IR's basic block is preserved when that basic block has a name. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10465 llvm-svn: 240145	2015-06-19 17:43:07 +00:00
Alexander Kornienko	70bc5f1398	Fixed/added namespace ending comments using clang-tidy. NFC The patch is generated using this command: tools/clang/tools/extra/clang-tidy/tool/run-clang-tidy.py -fix \ -checks=-,llvm-namespace-comment -header-filter='llvm/.\|clang/.*' \ llvm/lib/ Thanks to Eugene Kosov for the original patch! llvm-svn: 240137	2015-06-19 15:57:42 +00:00
Eric Christopher	572e03a396	Fix "the the" in comments. llvm-svn: 240112	2015-06-19 01:53:21 +00:00
Yi Jiang	e0b3499db7	Avoid redundant select node in early if-conversion pass llvm-svn: 240072	2015-06-18 22:34:09 +00:00
Hans Wennborg	67d492a544	Switch lowering: enable whole-switch jump tables at -O0. To same compile time, the analysis to find dense case-clusters in switches is not done at -O0. However, when the whole switch is dense enough, it is easy to turn it into a jump table, resulting in much faster code with no extra effort. llvm-svn: 240071	2015-06-18 22:22:30 +00:00
Benjamin Kramer	8985b32e76	[BranchFolding] Replace custom MachineInstr with MachineInstrExpressionTrait While the hash functions are subtly different it shouldn't have an impact. Instructions are checked with isIdenticalTo later. llvm-svn: 240040	2015-06-18 20:00:03 +00:00
David Majnemer	46c852e438	[CodeGen] Don't emit a random reference to the personality function This should fix issues we've been seeing with Darwin. llvm-svn: 240036	2015-06-18 18:31:46 +00:00
Sanjay Patel	8730ef78f8	fix typo; NFC llvm-svn: 240022	2015-06-18 15:53:33 +00:00
Benjamin Kramer	c6e8bfc41d	[AsmPrinter] Make isRepeatedByteSequence smarter about odd integer types - zext the value to alloc size first, then check if the value repeats with zero padding included. If so we can still emit a .space - Do the checking with APInt.isSplat(8), which handles non-pow2 types - Also handle large constants (bit width > 64) - In a ConstantArray all elements have the same type, so it's sufficient to check the first constant recursively and then just compare if all following constants are the same by pointer compare llvm-svn: 239977	2015-06-17 23:55:17 +00:00
Sanjay Patel	a3f423b4fc	remove unnecessary casts; NFC llvm-svn: 239942	2015-06-17 20:54:46 +00:00
David Majnemer	7fddeccb8b	Move the personality function from LandingPadInst to Function The personality routine currently lives in the LandingPadInst. This isn't desirable because: - All LandingPadInsts in the same function must have the same personality routine. This means that each LandingPadInst beyond the first has an operand which produces no additional information. - There is ongoing work to introduce EH IR constructs other than LandingPadInst. Moving the personality routine off of any one particular Instruction and onto the parent function seems a lot better than have N different places a personality function can sneak onto an exceptional function. Differential Revision: http://reviews.llvm.org/D10429 llvm-svn: 239940	2015-06-17 20:52:32 +00:00
Ahmed Bougacha	f32991461f	[CodeGenPrepare] Generalize inserted set from truncs to any inst. It's been used before to avoid infinite loops caused by separate CGP optimizations undoing one another. We found one more such issue caused by r238054. To avoid it, generalize the "InsertedTruncs" set to any inst, and use it to avoid touching those again. llvm-svn: 239938	2015-06-17 20:44:32 +00:00
Sanjay Patel	dcaa53791c	fix typos in comments; NFC llvm-svn: 239916	2015-06-17 16:34:48 +00:00
Chandler Carruth	ac80dc7532	[PM/AA] Remove the Location typedef from the AliasAnalysis class now that it is its own entity in the form of MemoryLocation, and update all the callers. This is an entirely mechanical change. References to "Location" within AA subclases become "MemoryLocation", and elsewhere "AliasAnalysis::Location" becomes "MemoryLocation". Hope that helps out-of-tree folks update. llvm-svn: 239885	2015-06-17 07:18:54 +00:00
Rafael Espindola	857546e7e0	Rename and improve emitSectionOffset. Different object formats represent references from dwarf in different ways. ELF uses a relocation to the referenced point (except for .dwo) and COFF/MachO use the offset of the referenced point inside its section. This patch renames emitSectionOffset because * It doesn't produce an offset on ELF. * It changes behavior depending on how DWARF is represented, so adding dwarf to its name is probably a good thing. The patch also adds an option to force the use of offsets.That avoids funny looking code like if (!UseOffsets) Asm->emitSectionOffset.... It was correct, but read as if the ! was inverted. llvm-svn: 239866	2015-06-16 23:22:02 +00:00
Sanjay Patel	0fcc53f6d6	rename variables; NFC ...because I see 'StoreBW' and read it as 'store bandwidth' llvm-svn: 239850	2015-06-16 20:47:19 +00:00
Sanjay Patel	bb385ed454	extract some code into a helper function for MergeConsecutiveStores(); NFCI llvm-svn: 239847	2015-06-16 20:05:00 +00:00
Matthias Braun	ca4e842127	VirtRegMap: Add undef flag when reading undefined subregisters. While completely undefined registers are easy to catch and get their <undef> flag early in ProcessImplicitDefs/RegisterCoalescer reading from a partially defined register where just the subreg happens to be undefined is harder to catch so we only add the undef flag in the virtual register rewriting step. No testcase as I cannot reproduce the problem on any of the in-tree targets at the moment. This fixes rdar://21387089 Differential Revision: http://reviews.llvm.org/D10470 llvm-svn: 239838	2015-06-16 18:22:28 +00:00
Matthias Braun	f63c807809	TargetRegisterInfo: Make the concept of imprecise lane masks explicit LaneMasks as given by getSubRegIndexLaneMask() have a limited number of of bits, so for targets with more than 31 disjunct subregister there may be cases where: getSubReg(Reg,A) does not overlap getSubReg(Reg,B) but we still have (getSubRegIndexLaneMask(A) & getSubRegIndexLaneMask(B)) != 0. I had hoped to keep this an implementation detail of the tablegen but as my next commit shows we can avoid unnecessary imp-defs operands if we know that the lane masks in use are precise. This is in preparation to http://reviews.llvm.org/D10470. llvm-svn: 239837	2015-06-16 18:22:26 +00:00
Alex Lorenz	5ef16b8a7c	MIR Parser: Report an error when a machine function doesn't have a corresponding function. This commit reports an error when a machine function from a MIR file that contains LLVM IR can't find a function with the same name in the loaded LLVM IR module. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10468 llvm-svn: 239831	2015-06-16 17:06:29 +00:00
Sanjay Patel	f134048b1d	propagate IR-level fast-math-flags to DAG nodes, disabled by default This is an updated version of the patch that was checked in at: http://reviews.llvm.org/rL237046 but subsequently reverted because it exposed a bug in the DAG Combiner: http://reviews.llvm.org/D9893 This time, there's an enablement flag ("EnableFMFInDAG") around the code in SelectionDAGBuilder where we copy the set of FP optimization flags from IR instructions to DAG nodes. So, in theory, there should be no functional change from this patch as-is, but it will allow testing with the added functionality to proceed via "-enable-fmf-dag" passed to llc. This patch adds the minimum plumbing necessary to use IR-level fast-math-flags (FMF) in the backend without actually using them for anything yet. This is a follow-on to: http://reviews.llvm.org/rL235997 Differential Revision: http://reviews.llvm.org/D10403 llvm-svn: 239828	2015-06-16 16:25:43 +00:00
Matt Arsenault	ed891b5561	Revert "Revert "Fix merges of non-zero vector stores"" Reapply r239539. Don't assume the collected number of stores is the same vector size. Just take the first N stores to fill the vector. llvm-svn: 239825	2015-06-16 15:51:48 +00:00
Daniel Sanders	335487ad87	Replace string GNU Triples with llvm::Triple in TargetMachine::getTargetTriple(). NFC. Summary: This continues the patch series to eliminate StringRef forms of GNU triples from the internals of LLVM that began in r239036. Reviewers: rengolin Reviewed By: rengolin Subscribers: llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D10381 llvm-svn: 239815	2015-06-16 13:15:50 +00:00
Arnaud A. de Grandmaison	c8a694fd27	[MachineSink] Address post-commit review comments The successors cache is now a local variable, making it more visible that it is only valid for the MBB being processed. llvm-svn: 239807	2015-06-16 08:57:21 +00:00
Alex Lorenz	5b5f97537f	MIR Serialization: Print and parse simple machine function attributes. This commit serializes the simple, scalar attributes from the 'MachineFunction' class. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10449 llvm-svn: 239790	2015-06-16 00:10:47 +00:00
Alex Lorenz	345c1449c8	MIR Serialization: move the MIR printer out of the MIR printing pass. This commit decouples the MIR printer and the MIR printing pass so that it will be possible to move the MIR printer into a separate machine IR library later on. Reviewers: Duncan P. N. Exon Smith llvm-svn: 239788	2015-06-15 23:52:35 +00:00
Adrian Prantl	8ff53b3cda	Debug Info IR: Switch DIObjCProperty to use DITypeRef. This is a prerequisite for turning on ODR type uniquing for ObjC++. rdar://problem/21377883 llvm-svn: 239780	2015-06-15 23:18:03 +00:00
Alex Lorenz	8e7a58d7cc	MIR Serialization: Create dummy functions when the MIR file doesn't have LLVM IR. This commit creates a dummy LLVM IR function with one basic block and an unreachable instruction for each parsed machine function when the MIR file doesn't have LLVM IR. This change is required as the machine function analysis pass creates machine functions only for the functions that are defined in the current LLVM module. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10135 llvm-svn: 239778	2015-06-15 23:07:38 +00:00
Alex Lorenz	fe2aa97bab	MIR Serialization: Report an error when machine functions have the same name. This commit reports an error when the MIR parser encounters a machine function with the name that is the same as the name of a different machine function. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10130 llvm-svn: 239774	2015-06-15 22:23:23 +00:00
Peter Collingbourne	82437bf7a5	Protection against stack-based memory corruption errors using SafeStack This patch adds the safe stack instrumentation pass to LLVM, which separates the program stack into a safe stack, which stores return addresses, register spills, and local variables that are statically verified to be accessed in a safe way, and the unsafe stack, which stores everything else. Such separation makes it much harder for an attacker to corrupt objects on the safe stack, including function pointers stored in spilled registers and return addresses. You can find more information about the safe stack, as well as other parts of or control-flow hijack protection technique in our OSDI paper on code-pointer integrity (http://dslab.epfl.ch/pubs/cpi.pdf) and our project website (http://levee.epfl.ch). The overhead of our implementation of the safe stack is very close to zero (0.01% on the Phoronix benchmarks). This is lower than the overhead of stack cookies, which are supported by LLVM and are commonly used today, yet the security guarantees of the safe stack are strictly stronger than stack cookies. In some cases, the safe stack improves performance due to better cache locality. Our current implementation of the safe stack is stable and robust, we used it to recompile multiple projects on Linux including Chromium, and we also recompiled the entire FreeBSD user-space system and more than 100 packages. We ran unit tests on the FreeBSD system and many of the packages and observed no errors caused by the safe stack. The safe stack is also fully binary compatible with non-instrumented code and can be applied to parts of a program selectively. This patch is our implementation of the safe stack on top of LLVM. The patches make the following changes: - Add the safestack function attribute, similar to the ssp, sspstrong and sspreq attributes. - Add the SafeStack instrumentation pass that applies the safe stack to all functions that have the safestack attribute. This pass moves all unsafe local variables to the unsafe stack with a separate stack pointer, whereas all safe variables remain on the regular stack that is managed by LLVM as usual. - Invoke the pass as the last stage before code generation (at the same time the existing cookie-based stack protector pass is invoked). - Add unit tests for the safe stack. Original patch by Volodymyr Kuznetsov and others at the Dependable Systems Lab at EPFL; updates and upstreaming by myself. Differential Revision: http://reviews.llvm.org/D6094 llvm-svn: 239761	2015-06-15 21:07:11 +00:00
Alex Lorenz	735c47ec3e	MIR Serialization: Connect the machine function analysis pass to the MIR parser. This commit connects the machine function analysis pass (which creates machine functions) to the MIR parser, which will initialize the machine functions with the state from the MIR file and reconstruct the machine IR. This commit introduces a new interface called 'MachineFunctionInitializer', which can be used to provide custom initialization for the machine functions. This commit also introduces a new diagnostic class called 'DiagnosticInfoMIRParser' which is used for MIR parsing errors. This commit modifies the default diagnostic handling in LLVMContext - now the the diagnostics are printed directly into llvm::errs() so that the MIR parsing errors can be printed with colours. Reviewers: Justin Bogner Differential Revision: http://reviews.llvm.org/D9928 llvm-svn: 239753	2015-06-15 20:30:22 +00:00
Sanjoy Das	baeb678a91	Unbreak the build from r239740. Do not re-use an enum name as a field name. Some bots don't like this. llvm-svn: 239746	2015-06-15 19:29:44 +00:00
Sanjoy Das	69fad0799e	[CodeGen] Add a pass to fold null checks into nearby memory operations. Summary: This change adds an "ImplicitNullChecks" target dependent pass. This pass folds null checks into memory operation using the FAULTING_LOAD pseudo-op introduced in previous patches. Depends on D10197 Depends on D10199 Depends on D10200 Reviewers: reames, rnk, pgavlin, JosephTremoulet, atrick Reviewed By: atrick Subscribers: ab, JosephTremoulet, llvm-commits Differential Revision: http://reviews.llvm.org/D10201 llvm-svn: 239743	2015-06-15 18:44:27 +00:00
Sanjoy Das	b666ea369c	[TargetInstrInfo] Rename getLdStBaseRegImmOfs and implement for x86. Summary: TargetInstrInfo::getLdStBaseRegImmOfs to TargetInstrInfo::getMemOpBaseRegImmOfs and implement for x86. The implementation only handles a few easy cases now and will be made more sophisticated in the future. This is NFCI: the only user of `getLdStBaseRegImmOfs` (now `getmemOpBaseRegImmOfs`) is `LoadClusterMotion` and `LoadClusterMotion` is disabled for x86. Reviewers: reames, ab, MatzeB, atrick Reviewed By: MatzeB, atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10199 llvm-svn: 239741	2015-06-15 18:44:14 +00:00
Sanjoy Das	c63244daa1	[CodeGen] Introduce a FAULTING_LOAD_OP pseudo-op. Summary: This instruction encodes a loading operation that may fault, and a label to branch to if the load page-faults. The locations of potentially faulting loads and their "handler" destinations are recorded in a FaultMap section, meant to be consumed by LLVM's clients. Nothing generates FAULTING_LOAD_OP instructions yet, but they will be used in a future change. The documentation (FaultMaps.rst) needs improvement and I will update this diff with a more expanded version shortly. Depends on D10196 Reviewers: rnk, reames, AndyAyers, ab, atrick, pgavlin Reviewed By: atrick, pgavlin Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10197 llvm-svn: 239740	2015-06-15 18:44:08 +00:00
Arnaud A. de Grandmaison	d8673edc2d	[MachineSink] Improve runtime performance. NFC. This patch fixes a compilation time issue, when MachineSink faces PHIs with a huge number of operands. This can happen for example in goto table based interpreters, where some basic blocks can have several of those PHIs, each one with several hundreds operands. MachineSink was spending a significant time re-building and re-sorting the list of successors of the current MachineBasicBlock. The computing and sorting of the current MachineBasicBlock successors is now cached. llvm-svn: 239720	2015-06-15 09:09:06 +00:00
NAKAMURA Takumi	a6a250a211	AsmPrinter.cpp: Avoid crashes for targeting like "arm-mingw32". CurrentFnSym might not be <MCSymbolELF> here. llvm-svn: 239692	2015-06-14 00:23:40 +00:00
NAKAMURA Takumi	bf6ad02906	Reformat. llvm-svn: 239691	2015-06-14 00:23:33 +00:00
Simon Pilgrim	d3f6427446	[DAGCombiner] Added BSWAP(BSWAP(x)) -> x combine pattern. llvm-svn: 239682	2015-06-13 16:25:12 +00:00
Sanjay Patel	5714998484	hoist loop-invariant; NFCI llvm-svn: 239681	2015-06-13 15:33:15 +00:00
Sanjay Patel	41044f8859	remove function names from comments and clean up; NFC llvm-svn: 239680	2015-06-13 15:32:45 +00:00
Simon Pilgrim	2c35e7a264	[SelectionDAG] Added assertions + UNDEF handling for BSWAP node creation. llvm-svn: 239679	2015-06-13 15:23:58 +00:00
Sanjay Patel	85924e5bf3	remove unnecessary casts; NFCI llvm-svn: 239678	2015-06-13 15:06:33 +00:00
Simon Pilgrim	011381d48b	[DAGCombiner] Added BSWAP vector constant folding support. llvm-svn: 239675	2015-06-13 14:08:15 +00:00
Simon Pilgrim	096cccd01a	Stripped trailing whitespace. NFC. llvm-svn: 239674	2015-06-13 12:57:36 +00:00
Matthias Braun	39a2afc941	Rename TargetSubtargetInfo::enablePostMachineScheduler() to enablePostRAScheduler() r213101 changed the behaviour of this method to not only affect the PostMachineScheduler scheduler but also the PostRAScheduler scheduler, renaming should make this fact clear. Also document that the preferred way is to specify this in the scheduling model instead of overriding this method. Differential Revision: http://reviews.llvm.org/D10427 llvm-svn: 239659	2015-06-13 03:42:16 +00:00
Matthias Braun	88e213159a	MachineLICM: Use TargetSchedModel instead of just itineraries This will use Itinieraries if available, but will also work if just a MCSchedModel is available. Differential Revision: http://reviews.llvm.org/D10428 llvm-svn: 239658	2015-06-13 03:42:11 +00:00
Reid Kleckner	81d1cc00b7	[WinEH] Put finally pointers in the handler scope table field We were putting them in the filter field, which is correct for 64-bit but wrong for 32-bit. Also switch the order of scope table entry emission so outermost entries are emitted first, and fix an obvious state assignment bug. llvm-svn: 239574	2015-06-11 23:37:18 +00:00
Reid Kleckner	a9d6253572	[WinEH] Create an llvm.x86.seh.exceptioninfo intrinsic This intrinsic is like framerecover plus a load. It recovers the EH registration stack allocation from the parent frame and loads the exception information field out of it, giving back a pointer to an EXCEPTION_POINTERS struct. It's designed for clang to use in SEH filter expressions instead of accessing the EXCEPTION_POINTERS parameter that is available on x64. This required a minor change to MC to allow defining a label variable to another absolute framerecover label variable. llvm-svn: 239567	2015-06-11 22:32:23 +00:00
Daniel Sanders	3e5de88dac	Replace string GNU Triples with llvm::Triple in TargetMachine. NFC. Summary: For the moment, TargetMachine::getTargetTriple() still returns a StringRef. This continues the patch series to eliminate StringRef forms of GNU triples from the internals of LLVM that began in r239036. Reviewers: rengolin Reviewed By: rengolin Subscribers: ted, llvm-commits, rengolin, jholewinski Differential Revision: http://reviews.llvm.org/D10362 llvm-svn: 239554	2015-06-11 19:41:26 +00:00
Ahmed Bougacha	c88bf54366	[CodeGen] ArrayRef'ize cond/pred in various TII APIs. NFC. llvm-svn: 239553	2015-06-11 19:30:37 +00:00
Rafael Espindola	7c6e6e49cc	Generalize emitAbsoluteSymbolDiff. This makes emitAbsoluteSymbolDiff always succeed and moves logic from the asm printer to it. The object one now also works on ELF. If two symbols are in the same fragment, we will never move them apart. llvm-svn: 239552	2015-06-11 18:58:08 +00:00
Reid Kleckner	2691c59e97	Revert "Fix merges of non-zero vector stores" This reverts commit r239539. It was causing SDAG assertions while building freetype. llvm-svn: 239543	2015-06-11 17:25:24 +00:00
Matt Arsenault	e23a063dc3	Fix merges of non-zero vector stores Now actually stores the non-zero constant instead of 0. I somehow forgot to include this part of r238108. The test change was just an independent instruction order swap, so just add another check line to satisfy CHECK-NEXT. llvm-svn: 239539	2015-06-11 16:03:52 +00:00
Sanjay Patel	8b2150efdb	remove function names from comments; NFC llvm-svn: 239532	2015-06-11 14:26:49 +00:00
Arnaud A. de Grandmaison	af37ad19a9	[LiveVariables] Improve isLiveOut runtime performances. NFC. On large goto table based interpreters, where phi nodes can have (very) large fan-ins, isLiveOut exhibited poor performances: about 40% of the full codegen time was spent in PHIElim, sorting MachineBasicBlock addresses. This patch improve the performances for such cases, and does not show compile time regressions on the LNT, at bootstrap (llvm+clang+lldb) or any other benchmarks we have in-house. llvm-svn: 239510	2015-06-11 07:50:21 +00:00
Arnaud A. de Grandmaison	2e8ffa3b44	[PHIElim] Use ranges and const-ify, NFC. llvm-svn: 239508	2015-06-11 07:45:05 +00:00
Pete Cooper	7cbe58d3c5	Remove MachineModuleInfo::UsedFunctions as it has no users. It hasn't been used since r130964. This also removes MachineModuleInfo::isUsedFunction and MachineModuleInfo::AnalyzeModule, both of which were only there to support UsedFunctions. llvm-svn: 239501	2015-06-11 01:04:56 +00:00
Sanjay Patel	ccb8d5cc57	punctuation policing; NFC llvm-svn: 239484	2015-06-10 19:52:58 +00:00
Reid Kleckner	c87a6faba1	[WinEH] _except_handlerN uses 0 instead of 1 to indicate catch-all Our usage of 1 was a holdover from __C_specific_handler. llvm-svn: 239482	2015-06-10 18:14:07 +00:00
Sanjay Patel	a32fadd14a	fix typo in comment; NFC llvm-svn: 239478	2015-06-10 17:08:12 +00:00
Igor Laevsky	346ff628f7	[StatepointLowering] Reuse stack slots across basic blocks During statepoint lowering we can sometimes avoid spilling of the value if we know that it was already spilled for previous statepoint. We were doing this by checking if incoming statepoint value was lowered into load from stack slot. This was working only in boundaries of one basic block. But instead of looking at the lowered node we can look directly at the llvm-ir value and if it was gc.relocate (or some simple modification of it) look up stack slot for it's derived pointer and reuse stack slot from it. This allows us to look across basic block boundaries. Differential Revision: http://reviews.llvm.org/D10251 llvm-svn: 239472	2015-06-10 12:31:53 +00:00
Reid Kleckner	ca6ef66e4c	Remove safeseh debug print and remove extra braces llvm-svn: 239449	2015-06-10 01:13:44 +00:00
Reid Kleckner	2bc93ca846	[WinEH] Emit .safeseh directives for all 32-bit exception handlers Use a "safeseh" string attribute to do this. You would think we chould just accumulate the set of personalities like we do on dwarf, but this fails to account for the LSDA-loading thunks we use for __CxxFrameHandler3. Each of those needs to make it into .sxdata as well. The string attribute seemed like the most straightforward approach. llvm-svn: 239448	2015-06-10 01:02:30 +00:00
Reid Kleckner	7912d9b899	Fix -Wsign-compare warning in WinException.cpp llvm-svn: 239445	2015-06-10 00:04:53 +00:00
Tobias Edler von Koch	d5289d9724	[RegisterScavenger] Fix handling of predicated instructions Summary: The RegisterScavenger explicitly ignores <kill> flags on operands of predicated instructions and therefore assumes that such registers remain live. When it then scavenges such a register, it inserts a spill of this (killed) register. This is invalid code and gets flagged up by the verifier. Nowadays kill flags are set correctly on predicated instructions. This patch makes the Scavenger respect them. The bug has so far only been triggered by an internal pass, so I don't have a test case unfortunately. Fixes PR23119. Reviewers: hfinkel, tobiasvk_caf Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9039 llvm-svn: 239439	2015-06-09 22:10:58 +00:00
Reid Kleckner	f12c030f48	[WinEH] Add 32-bit SEH state table emission prototype This gets all the handler info through to the asm printer and we can look at the .xdata tables now. I've convinced one small catch-all test case to work, but other than that, it would be a stretch to say this is functional. The state numbering algorithm avoids doing any scope reconstruction as we do for C++ to simplify the implementation. llvm-svn: 239433	2015-06-09 21:42:19 +00:00
David Blaikie	0ebe35b278	Revert "[DWARF] Fix a few corner cases in expression emission" This reverts commit r239380 due to apparently GDB regressions: http://lab.llvm.org:8011/builders/clang-x86_64-ubuntu-gdb-75/builds/22562 llvm-svn: 239420	2015-06-09 18:01:51 +00:00
Keno Fischer	e34147ce2f	[DWARF] Fix a few corner cases in expression emission Summary: I noticed an object file with `DW_OP_reg4 DW_OP_breg4 0` as a DWARF expression, which I traced to a missing break (and `++I`) in this code snippet. While I was at it, I also added support for a few other corner cases along the same lines that I could think of. Test Plan: Hand-crafted test case to exercises these cases is included. Reviewers: echristo, dblaikie, aprantl Reviewed By: aprantl Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10302 llvm-svn: 239380	2015-06-09 01:53:59 +00:00
Matt Arsenault	705eb8f6b1	Implement computeKnownBits for min/max nodes llvm-svn: 239378	2015-06-09 00:52:41 +00:00
Matt Arsenault	8b643559d4	MC: Add target hook to control symbol quoting llvm-svn: 239370	2015-06-09 00:31:39 +00:00
Keno Fischer	e70b31fc1b	[InstrInfo] Refactor foldOperandImpl to thread through InsertPt. NFC Summary: This was a longstanding FIXME and is a necessary precursor to cases where foldOperandImpl may have to create more than one instruction (e.g. to constrain a register class). This is the split out NFC changes from D6262. Reviewers: pete, ributzka, uweigand, mcrosier Reviewed By: mcrosier Subscribers: mcrosier, ted, llvm-commits Differential Revision: http://reviews.llvm.org/D10174 llvm-svn: 239336	2015-06-08 20:09:58 +00:00
Akira Hatanaka	4a61619ff5	[ARM] Pass a callback to FunctionPass constructors to enable skipping execution on a per-function basis. Previously some of the passes were conditionally added to ARM's pass pipeline based on the target machine's subtarget. This patch makes changes to add those passes unconditionally and execute them conditonally based on the predicate functor passed to the pass constructors. This enables running different sets of passes for different functions in the module. rdar://problem/20542263 Differential Revision: http://reviews.llvm.org/D8717 llvm-svn: 239325	2015-06-08 18:50:43 +00:00
Oliver Stannard	8379e298b3	Fix assertion failure in global-merge with unused ConstantExpr The global-merge pass was crashing because it assumes that all ConstantExprs (reached via the global variables that they use) have at least one user. I haven't worked out a way to test this, as an unused ConstantExpr cannot be represented by serialised IR, and global-merge can only be run in llc, which does not run any passes which can make a ConstantExpr dead. This (reduced to the point of silliness) C code triggers this bug when compiled for arm-none-eabi at -O1: static a = 7; static volatile b[10] = {&a}; c; main() { c = 0; for (; c < 10;) printf(b[c]); } Differential Revision: http://reviews.llvm.org/D10314 llvm-svn: 239308	2015-06-08 16:55:31 +00:00
Simon Pilgrim	4791f6d89b	[DAGCombiner] Added CTLZ vector constant folding support. llvm-svn: 239305	2015-06-08 16:19:00 +00:00
Simon Pilgrim	c789e1d57b	[DAGCombiner] Added CTTZ vector constant folding support. llvm-svn: 239293	2015-06-08 09:57:09 +00:00
Simon Pilgrim	68cd237f57	[DAGCombiner] Added CTPOP vector constant folding support. Added tests to the existing SSE/AVX test files. llvm-svn: 239252	2015-06-07 15:37:14 +00:00
Akira Hatanaka	c100c56a20	Move the code in TargetPassConfig::addPass that inserts machine printer pass to the overloaded version of addPass which takes Pass. This change enables inserting the machine printer pass when the overloaded version of addPass that takes Pass is called to add a pass, instead of the one which takes AnalysisID. I need this to prevent make-check tests from failing when I commit another patch later. llvm-svn: 239192	2015-06-05 21:58:14 +00:00
Fiona Glaser	666e352440	DAGCombiner: don't duplicate (fmul x, c) in visitFNEG if fneg is free For targets with a free fneg, this fold is always a net loss if it ends up duplicating the multiply, so definitely avoid it. This might be true for some targets without a free fneg too, but I'll leave that for future investigation. llvm-svn: 239167	2015-06-05 17:52:34 +00:00
Andrea Di Biagio	eb33134ce7	Simplify code; NFC. Also, moved test cases from CodeGen/X86/fold-buildvector-bug.ll into CodeGen/X86/buildvec-insertvec.ll and regenerated CHECK lines using update_llc_test_checks.py. llvm-svn: 239142	2015-06-05 10:29:55 +00:00
Swaroop Sridhar	70d18df18f	Statepoint: Fix handling of Far Immediate calls gc.statepoint intrinsics with a far immediate call target were lowered incorrectly as pc-rel32 calls. This change fixes the problem, and generates an indirect call via a scratch register. For example: Intrinsic: %safepoint_token = call i32 (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () inttoptr (i64 140727162896504 to void ()), i32 0, i32 0, i32 0, i32 0) Old Incorrect Lowering: callq 140727162896504 New Correct Lowering: movabsq $140727162896504, %rax callq %rax In lowerCallFromStatepoint(), the callee-target was modified and represented as a "TargetConstant" node, rather than a "Constant" node. Undoing this modification enabled LowerCall() to generate the correct CALL instruction. llvm-svn: 239114	2015-06-04 23:03:21 +00:00
Benjamin Kramer	ff0fb6936b	[SDAG switch lowering] Fix switch case -> or merging for 0 and INT_MIN The big/small ordering here is based on signed values so SmallValue will be INT_MIN and BigValue 0. This shouldn't be a problem but the code assumed that BigValue always had more bits set than SmallValue. We used to just miss the transformation, but a recent refactoring of mine turned this into an assertion failure. llvm-svn: 239105	2015-06-04 22:05:51 +00:00
Sergey Dmitrouk	3160d02b5b	Erase constant dbgloc on reuse in PHI node Basic block selection involves checking successor BBs for PHI nodes that depend on the current BB. In case such BBs are found, the value being selected is a constant and such constant already exists in current BB, it's value is reused. This might lead to wrong locations in some situations, especially if same constant value ends up being materialized twice in two different ways, which discards that sharing and leaves us with wrong debug location in the successor BB. In code this involves the following sequence of calls: SelectionDAGBuilder::HandlePHINodesInSuccessorBlocks -> SelectionDAGBuilder::CopyValueToVirtualRegister -> SelectionDAGBuilder::getNonRegisterValue llvm-svn: 239089	2015-06-04 20:48:40 +00:00
Ahmed Bougacha	8207641251	[GlobalMerge] Take into account minsize on Global users' parents. Now that we can look at users, we can trivially do this: when we would have otherwise disabled GlobalMerge (currently -O<3), we can just run it for minsize functions, as it's usually a codesize win. Differential Revision: http://reviews.llvm.org/D10054 llvm-svn: 239087	2015-06-04 20:39:23 +00:00
Andrea Di Biagio	9ac8a6b13d	[DAGCombiner] Fix wrong folding of a build_vector into a blend with zero. Method 'visitBUILD_VECTOR' in the DAGCombiner knows how to combine a build_vector of a bunch of extract_vector_elt nodes and constant zero nodes into a shuffle blend with a zero vector. However, method 'visitBUILD_VECTOR' forgot that a floating point build_vector may contain negative zero as well as positive zero. Example: define <2 x double> @example(<2 x double> %A) { entry: %0 = extractelement <2 x double> %A, i32 0 %1 = insertelement <2 x double> undef, double %0, i32 0 %2 = insertelement <2 x double> %1, double -0.0, i32 1 ret <2 x double> %2 } Before this patch, llc (with -mattr=+sse4.1) wrongly generated movq %xmm0, %xmm0 # xmm0 = xmm0[0],zero So, the sign bit of the negative zero was effectively lost. This patch fixes the problem by adding explicit checks for positive zero. With this patch, llc produces the following code for the example above: movhpd .LCPI0_0(%rip), %xmm0 where .LCPI0_0 referes to a 'double -0'. llvm-svn: 239070	2015-06-04 19:15:01 +00:00
Benjamin Kramer	185579bf0c	[SDag switch lowering] Simplify code a bit. No functional change intended. llvm-svn: 239056	2015-06-04 17:07:59 +00:00
Matt Arsenault	f72b49bc17	CodeGenPrepare: Provide address space to isLegalAddressingMode Use -1 as the address space if it can't be determined. llvm-svn: 239052	2015-06-04 16:17:38 +00:00
Matt Arsenault	ca519dc28b	Pass address space to isLegalAddressingMode in DAGCombiner No test because I don't know of a target that makes use of address spaces and indexed load / store. llvm-svn: 239051	2015-06-04 16:17:34 +00:00
Hans Wennborg	d922915685	Switch lowering: fix assert in buildBitTests (PR23738) When checking (High - Low + 1).sle(BitWidth), BitWidth would be truncated to the size of the left-hand side. In the case of this PR, the left-hand side was i4, so BitWidth=64 got truncated to 0 and the assert failed. llvm-svn: 239048	2015-06-04 15:55:00 +00:00
James Molloy	37593732a4	Don't create a MIN/MAX node if the underlying compare has more than one use. If the compare in a select pattern has another use then it can't be removed, so we'd just be creating repeated code if we created a min/max node. Spotted by Matt Arsenault! llvm-svn: 239037	2015-06-04 13:48:23 +00:00
Sanjoy Das	513aadecac	[SelectionDAG] Fix PR23603. Summary: LLVM's MI level notion of invariant_load is different from LLVM's IR level notion of invariant_load with respect to dereferenceability. The IR notion of invariant_load only guarantees that all non-faulting invariant loads result in the same value. The MI notion of invariant load guarantees that the load can be legally moved to any location within its containing function. The MI notion of invariant_load is stronger than the IR notion of invariant_load -- an MI invariant_load is an IR invariant_load + a guarantee that the location being loaded from is dereferenceable throughout the function's lifetime. Reviewers: hfinkel, reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10075 llvm-svn: 238881	2015-06-02 22:33:30 +00:00
Rafael Espindola	a869576008	Create a MCSymbolELF. This create a MCSymbolELF class and moves SymbolSize since only ELF needs a size expression. This reduces the size of MCSymbol from 56 to 48 bytes. llvm-svn: 238801	2015-06-02 00:25:12 +00:00
Matthias Braun	c1e029e93d	LiveRangeEdit: Fix liveranges not shrinking on subrange kill. If a dead instruction we may not only have a last-use in the main live range but also in a subregister range if subregisters are tracked. We need to partially rebuild live ranges in both cases. The testcase only broke when subregister liveness was enabled. I commited it in the current form because there is currently no flag to enable/disable subregister liveness. This fixes PR23720. llvm-svn: 238785	2015-06-01 21:26:26 +00:00
Owen Anderson	55313d21dc	Disable MachineSink on convergent operations, similar to how IR Sink is restricted. No test because no in-tree target currently has convergent MachineInstr's. llvm-svn: 238763	2015-06-01 17:26:30 +00:00
Matt Arsenault	bd7d80a4a6	Add address space argument to isLegalAddressingMode This is important because of different addressing modes depending on the address space for GPU targets. This only adds the argument, and does not update any of the uses to provide the correct address space. llvm-svn: 238723	2015-06-01 05:31:59 +00:00
Chandler Carruth	502b23a7a9	[sdag] Add the helper I most want to the DAG -- building a bitcast around a value using its existing SDLoc. Start using this in just one function to save omg lines of code. llvm-svn: 238638	2015-05-30 04:14:10 +00:00
Jim Grosbach	13760bd152	MC: Clean up MCExpr naming. NFC. llvm-svn: 238634	2015-05-30 01:25:56 +00:00
Fiona Glaser	b82e33106b	SelectionDAG: fix logic for promoting shift types r238503 fixed the problem of too-small shift types by promoting them during legalization, but the correct solution is to promote only the operands that actually demand promotion. This fixes a crash on an out-of-tree target caused by trying to promote an operand that can't be promoted. llvm-svn: 238632	2015-05-29 23:37:22 +00:00
Benjamin Kramer	f5e2fc474d	Replace push_back(Constructor(foo)) with emplace_back(foo) for non-trivial types If the type isn't trivially moveable emplace can skip a potentially expensive move. It also saves a couple of characters. Call sites were found with the ASTMatcher + some semi-automated cleanup. memberCallExpr( argumentCountIs(1), callee(methodDecl(hasName("push_back"))), on(hasType(recordDecl(has(namedDecl(hasName("emplace_back")))))), hasArgument(0, bindTemporaryExpr( hasType(recordDecl(hasNonTrivialDestructor())), has(constructExpr()))), unless(isInTemplateInstantiation())) No functional change intended. llvm-svn: 238602	2015-05-29 19:43:39 +00:00
Matthias Braun	165d467125	MachineCopyPropagation: Remove the copies instead of using KILL instructions. For some history here see the commit messages of r199797 and r169060. The original intent was to fix cases like: %EAX<def> = COPY %ECX<kill>, %RAX<imp-def> %RCX<def> = COPY %RAX<kill> where simply removing the copies would have RCX undefined as in terms of machine operands only the ECX part of it is defined. The machine verifier would complain about this so 169060 changed such COPY instructions into KILL instructions so some super-register imp-defs would be preserved. In r199797 it was finally decided to always do this regardless of super-register defs. But this is wrong, consider: R1 = COPY R0 ... R0 = COPY R1 getting changed to: R1 = KILL R0 ... R0 = KILL R1 It now looks like R0 dies at the first KILL and won't be alive until the second KILL, while in reality R0 is alive and must not change in this part of the program. As this only happens after register allocation there is not much code still performing liveness queries so the issue was not noticed. In fact I didn't manage to create a testcase for this, without unrelated changes I am working on at the moment. The fix is simple: As of r223896 the MachineVerifier allows reads from partially defined registers, so the whole transforming COPY->KILL thing is not necessary anymore. This patch also changes a similar (but more benign case as the def and src are the same register) case in the VirtRegRewriter. Differential Revision: http://reviews.llvm.org/D10117 llvm-svn: 238588	2015-05-29 18:19:25 +00:00
Alex Lorenz	09b832cac5	MIR Serialization: use correct line and column numbers for LLVM IR errors. This commit translates the line and column numbers for LLVM IR errors from the numbers in the YAML block scalar to the numbers in the MIR file so that the MIRParser users can report LLVM IR errors with the correct line and column numbers. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10108 llvm-svn: 238576	2015-05-29 17:05:41 +00:00
Reid Kleckner	1d3d4adbb9	[WinEH] Emit EH tables for __CxxFrameHandler3 on 32-bit x86 Small (really small!) C++ exception handling examples work on 32-bit x86 now. This change disables the use of .seh_* directives in WinException when CFI is not in use. It also uses absolute symbol references in the tables instead of imagerel32 relocations. Also fixes a cache invalidation bug in MMI personality classification. llvm-svn: 238575	2015-05-29 17:00:57 +00:00
Matthias Braun	27a6cfd823	This should have been a reference llvm-svn: 238540	2015-05-29 02:59:59 +00:00
Matthias Braun	e41e146c16	CodeGen: Use mop_iterator instead of MIOperands/ConstMIOperands MIOperands/ConstMIOperands are classes iterating over the MachineOperand of a MachineInstr, however MachineInstr::mop_iterator does the same thing. I assume these two iterators exist to have a uniform interface to iterate over the operands of a machine instruction bundle and a single machine instruction. However in practice I find it more confusing to have 2 different iterator classes, so this patch transforms (nearly all) the code to use mop_iterators. The only exception being MIOperands::anlayzePhysReg() and MIOperands::analyzeVirtReg() still needing an equivalent, I leave that as an exercise for the next patch. Differential Revision: http://reviews.llvm.org/D9932 This version is slightly modified from the proposed revision in that it introduces MachineInstr::getOperandNo to avoid the extra counting variable in the few loops that previously used MIOperands::getOperandNo. llvm-svn: 238539	2015-05-29 02:56:46 +00:00
Matthias Braun	111f5d88fb	MachineFrameInfo: Simplify pristine register calculation. About pristine regsiters: Pristine registers "hold a value that is useless to the current function, but that must be preserved - they are callee saved registers that have not been saved." This concept saves compile time as it frees the prologue/epilogue inserter from adding every such register to every basic blocks live-in list. However the current code in getPristineRegs is formulated in a complicated way: Inside the function prologue and epilogue all callee saves are considered pristine, while in the rest of the code only the non-saved ones are considered pristine. This requires logic to differentiate between prologue/epilogue and the rest and in the presence of shrink-wrapping this even becomes complicated/expensive. It's also unnecessary because the prologue epilogue inserters already mark callee-save registers that are saved/restores properly in the respective blocks in the prologue/epilogue (see updateLiveness() in PrologueEpilogueInserter.cpp). So only declaring non-saved/restored callee saved registers as pristine just works. Differential Revision: http://reviews.llvm.org/D10101 llvm-svn: 238524	2015-05-28 23:20:35 +00:00
Reid Kleckner	60b640bb80	Rename Win64Exception.(cpp\|h) to WinException.(cpp\|h) This is in preparation for reusing this for 32-bit x86 EH table emission. Also updates the type name for consistency. NFC llvm-svn: 238521	2015-05-28 22:47:01 +00:00
Alex Lorenz	78d7831b0f	MIR Serialization: print and parse machine function names. This commit introduces a serializable structure called 'llvm::yaml::MachineFunction' that stores the machine function's name. This structure will mirror the machine function's state in the future. This commit prints machine functions as YAML documents containing a YAML mapping that stores the state of a machine function. This commit also parses the YAML documents that contain the machine functions. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D9841 llvm-svn: 238519	2015-05-28 22:41:12 +00:00
Quentin Colombet	75afbfd4a1	[MachineCopyPropagation] Fix a bug with undef handling when the value is actualy alive. Test case will follow. llvm-svn: 238518	2015-05-28 22:38:40 +00:00
Reid Kleckner	fe4d491bd9	[WinEH] Start inserting state number stores for C++ EH This moves all the state numbering code for C++ EH to WinEHPrepare so that we can call it from the X86 state numbering IR pass that runs before isel. Now we just call the same state numbering machinery and insert a bunch of stores. It also populates MachineModuleInfo with information about the current function. llvm-svn: 238514	2015-05-28 22:00:24 +00:00
David Majnemer	22d2b02706	[SelectionDAG] Scalar shift amounts may require legalization The shift amount may be too small to cope with promoted left hand side, make sure to promote it as well. This fixes PR23664. llvm-svn: 238503	2015-05-28 21:29:59 +00:00
Duncan P. N. Exon Smith	8d3197f657	AsmPrinter: Stop exposing underlying DIE children list, NFC Update `DIE` API to hide the implementation of `DIE::Children` so we can swap it out. llvm-svn: 238468	2015-05-28 19:56:34 +00:00
Duncan P. N. Exon Smith	b04fb5ed25	AsmPrinter: Rename begin_values() => values_begin(), NFC llvm-svn: 238456	2015-05-28 18:55:38 +00:00
Benjamin Kramer	5188a2af72	[AsmPrinter] Destroy allocated DIEAbbrevs on teardown. DIEAbbrev contains a SmallVector that can leak for overly large abbrevs. They used to be owned by the DIE, but after the recent refactoring DWARFFile allocates its own abbrevs. Leak found by asan. llvm-svn: 238418	2015-05-28 12:55:43 +00:00
Duncan P. N. Exon Smith	a68b880d69	AsmPrinter: Avoid a warning in NDEBUG, NFC Should fix the -Werror release build: http://lab.llvm.org:8011/builders/lld-x86_64-darwin13/builds/11113 llvm-svn: 238375	2015-05-27 23:02:36 +00:00
Duncan P. N. Exon Smith	6289892c20	AsmPrinter: Return added DIE from DIE::addChild() Change `DIE::addChild()` to return a reference to the just-added node, and update consumers to use it directly. An upcoming commit will abstract away (and eventually change) the underlying storage of `DIE::Children`. llvm-svn: 238372	2015-05-27 22:59:03 +00:00
Fiona Glaser	ca706e54a9	RegisterPressure: fix debug prints in case of physical registers llvm-svn: 238371	2015-05-27 22:51:47 +00:00
Duncan P. N. Exon Smith	88a8fc5448	AsmPrinter: Stop exposing underlying DIEValue list, NFC Change the `DIE` API to hide the implementation of the list of `DIEValue`s. llvm-svn: 238369	2015-05-27 22:44:06 +00:00
Duncan P. N. Exon Smith	f3a6a67ffd	AsmPrinter: Remove DIEHash::AttrEntry, NFC Remove "the most boring struct ever" (thanks to review by Eric). llvm-svn: 238366	2015-05-27 22:36:37 +00:00
Duncan P. N. Exon Smith	815a6eb55d	AsmPrinter: Store abbreviation data directly in DIE and DIEValue Stop storing a `DIEAbbrev` in `DIE`, since the data fits neatly inside the `DIEValue` list. Besides being a cleaner data structure (avoiding the parallel arrays), this gives us more freedom to rearrange the `DIEValue` list. This fixes the temporary memory regression from 845 MB up to 879 MB, and drops it further to 829 MB for a net memory decrease of around 1.9% (incremental decrease around 5.7%). (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 238364	2015-05-27 22:31:41 +00:00
Duncan P. N. Exon Smith	e7e1d0c706	Reapply "AsmPrinter: Change DIEValue to be stored by value" This reverts commit r238350, effectively reapplying r238349 after fixing (all?) the problems, all somehow related to how I was using `AlignedArrayCharUnion<>` inside `DIEValue`: - MSVC can only handle `sizeof()` on types, not values. Change the assert. - GCC doesn't know the `is_trivially_copyable` type trait. Instead of asserting it, add destructors. - Call placement new even when constructing POD (i.e., the pointers). - Instead of copying the char buffer, copy the casted classes. I've left in a couple of `static_assert`s that I think both MSVC and GCC know how to handle. If the bots disagree with me, I'll remove them. - Check that the constructed type is either standard layout or a pointer. This protects against a programming error: we really want the "small" `DIEValue`s to be small and simple, so don't accidentally change them not to be. - Similarly, check that the size of the buffer is no bigger than a `uint64_t` or a pointer. (I thought checking against `sizeof(uint64_t)` would be good enough, but Chandler suggested that pointers might sometimes be bigger than that in the context of sanitizers.) I've also committed r238359 in the meantime, which introduces a DIEValue.def to simplify dispatching between the various types (thanks to a review comment by David Blaikie). Without that, this commit would be almost unintelligible. Here's the original commit message: -- Change `DIEValue` to be stored/passed/etc. by value, instead of reference. It's now a discriminated union, with a `Val` field storing the actual type. The classes that used to inherit from `DIEValue` no longer do. There are two categories of these: - Small values fit in a single pointer and are stored by value. - Large values require auxiliary storage, and are stored by reference. The only non-mechanical change is to tools/dsymutil/DwarfLinker.cpp. It was relying on `DIEInteger`s being passed around by reference, so I replaced that assumption with a `PatchLocation` type that stores a safe reference to where the `DIEInteger` lives instead. This commit causes a temporary regression in memory usage, since I've left merging `DIEAbbrevData` into `DIEValue` for a follow-up commit. I measured an increase from 845 MB to 879 MB, around 3.9%. The follow-up drops it lower than the starting point, and I've only recently brought the memory this low anyway, so I'm committing these changes separately to keep them incremental. (I also considered swapping the commits, but the other one first would cause a lot more code churn.) (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) -- llvm-svn: 238362	2015-05-27 22:14:58 +00:00
Duncan P. N. Exon Smith	ff18927c58	AsmPrinter: Introduce DIEValue.def, NFC Use a .def macro file to iterate through the various subclasses of `DIEValue`. llvm-svn: 238359	2015-05-27 21:15:43 +00:00
Duncan P. N. Exon Smith	583bc03829	Revert "AsmPrinter: Change DIEValue to be stored by value" This reverts commit r238349, since it caused some errors on bots: - std::is_trivially_copyable isn't available until GCC 5.0. - It was complaining about strict aliasing with my use of ArrayCharUnion. llvm-svn: 238350	2015-05-27 19:30:27 +00:00
Duncan P. N. Exon Smith	7735b48a8b	AsmPrinter: Change DIEValue to be stored by value Change `DIEValue` to be stored/passed/etc. by value, instead of reference. It's now a discriminated union, with a `Val` field storing the actual type. The classes that used to inherit from `DIEValue` no longer do. There are two categories of these: - Small values fit in a single pointer and are stored by value. - Large values require auxiliary storage, and are stored by reference. The only non-mechanical change is to tools/dsymutil/DwarfLinker.cpp. It was relying on `DIEInteger`s being passed around by reference, so I replaced that assumption with a `PatchLocation` type that stores a safe reference to where the `DIEInteger` lives instead. This commit causes a temporary regression in memory usage, since I've left merging `DIEAbbrevData` into `DIEValue` for a follow-up commit. I measured an increase from 845 MB to 879 MB, around 3.9%. The follow-up drops it lower than the starting point, and I've only recently brought the memory this low anyway, so I'm committing these changes separately to keep them incremental. (I also considered swapping the commits, but the other one first would cause a lot more code churn.) (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 238349	2015-05-27 19:22:50 +00:00
Alex Lorenz	2bdb4e1063	Resubmit r237954 (MIR Serialization: print and parse LLVM IR using MIR format). This commit a 3rd attempt at comitting the initial MIR serialization patch. The first commit (r237708) was reverted in 237730. Then the second commit (r237954) was reverted in r238007, as the MIR library under CodeGen caused a circular dependency where the CodeGen library depended on MIR and MIR library depended on CodeGen. This commit has fixed the dependencies between CodeGen and MIR by reorganizing the MIR serialization code - the code that prints out MIR has been moved to CodeGen, and the MIR library has been renamed to MIRParser. Now the CodeGen library doesn't depend on the MIRParser library, thus the circular dependency no longer exists. --Original Commit Message-- MIR Serialization: print and parse LLVM IR using MIR format. This commit is the initial commit for the MIR serialization project. It creates a new library under CodeGen called 'MIR'. This new library adds a new machine function pass that prints out the LLVM IR using the MIR format. This pass is then added as a last pass when a 'stop-after' option is used in llc. The new library adds the initial functionality for parsing of MIR files as well. This commit also extends the llc tool so that it can recognize and parse MIR input files. Reviewers: Duncan P. N. Exon Smith, Matthias Braun, Philip Reames Differential Revision: http://reviews.llvm.org/D9616 llvm-svn: 238341	2015-05-27 18:02:19 +00:00
Jan Vesely	86f2fda623	SelectionDAG: Don't do libcall on div/rem if divrem is custom v2: TargetLoweringBase:: -> TargetLowering:: Use Ops array v3: Explicitly use value 0 for ?DIV Remove redundant newline Differential revision: http://reviews.llvm.org/D7803 reviewer: ab llvm-svn: 238336	2015-05-27 16:54:09 +00:00
Rafael Espindola	f4a1365387	Use operator<< instead of print in a few more places. llvm-svn: 238315	2015-05-27 13:05:42 +00:00
Quentin Colombet	8083588a7e	[ShrinkWrap] Add a target hook to check whether or not the target can handle a given basic block as prologue or epilogue. Related to <rdar://problem/20821487> llvm-svn: 238292	2015-05-27 06:25:48 +00:00
Matthias Braun	07a07ba41c	MachineBasicBlock: Cleanup computeRegisterLiveness() - Clean documentation comment - Change the API to accept an iterator so you can actually pass MachineBasicBlock::end() now. - Add more "const". llvm-svn: 238288	2015-05-27 05:12:39 +00:00
Akira Hatanaka	e36505c7f5	Remove NoFramePointerElim and NoFramePointerElimOverride from TargetOptions and remove ExecutionEngine's dependence on CodeGen. NFC. This is a follow-up to r238080. Differential Revision: http://reviews.llvm.org/D9830 llvm-svn: 238244	2015-05-26 20:17:20 +00:00
Adrian Prantl	6f8c1b6be6	Use "auto &" in range-based for-loop and remove the extra braces. llvm-svn: 238243	2015-05-26 20:06:51 +00:00
Adrian Prantl	757073191a	Fix a use-after-free in a DEBUG output. llvm-svn: 238242	2015-05-26 20:06:48 +00:00
Matt Arsenault	f05b02351f	CodeGenPrepare: Don't match addressing modes through addrspacecast This was resulting in the addrspacecast being removed and incorrectly replaced with a ptrtoint when sinking. llvm-svn: 238217	2015-05-26 16:59:43 +00:00
Elena Demikhovsky	1c1391ba24	Added promotion to EXTRACT_SUBVECTOR operand. I encountered with this case in one of KNL tests for i1 vectors. v16i1 = EXTRACT_SUBVECTOR v32i1, x llvm-svn: 238130	2015-05-25 11:33:13 +00:00
NAKAMURA Takumi	5582a6a4a5	Reformat. llvm-svn: 238126	2015-05-25 01:43:34 +00:00
NAKAMURA Takumi	fb3bd7127a	Prune CRLFs. llvm-svn: 238125	2015-05-25 01:43:23 +00:00
Duncan P. N. Exon Smith	882a2b5a7d	AsmPrinter: Avoid creating symbols in DwarfStringPool Stop creating symbols we don't need in `DwarfStringPool`. The consumers only call `DwarfStringPoolEntryRef::getSymbol()` when DWARF is relocatable, so this just stops creating the unused symbols when it's not. This drops memory usage from 851 MB to 845 MB, around 0.7%. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 238122	2015-05-24 16:58:59 +00:00
Duncan P. N. Exon Smith	9d50e82fb2	AsmPrinter: Prune an include, NFC llvm-svn: 238121	2015-05-24 16:54:59 +00:00
Duncan P. N. Exon Smith	e344705ade	AsmPrinter: Remove dead code, NFC llvm-svn: 238120	2015-05-24 16:51:29 +00:00
Duncan P. N. Exon Smith	1e0d94e7bb	AsmPrinter: Avoid EmitLabelDifference() in DwarfAccelTable Mint a new function, `AsmPrinter::emitDwarfStringOffset()`, which takes a `DwarfStringPoolEntryRef`. When DWARF is relocatable across sections, this defers to `emitSectionOffset()` and emits the `MCSymbol`; otherwise, just emit the offset directly, without using any intermediate symbols. `EmitLabelDifference()` is already optimized to emit absolute label differences cheaply when possible, so there aren't any major memory savings here (853 MB down to 851 MB, or 0.2%). However, it prepares for making the `MCSymbol`s in the `DwarfStringPool` optional. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 238119	2015-05-24 16:48:54 +00:00
Duncan P. N. Exon Smith	f4599942fb	AsmPrinter: Use DwarfStringPoolEntry in DwarfAccelTable, NFC This is just an API change, but it prepares to stop using `EmitLabelDifference()` when possible. llvm-svn: 238118	2015-05-24 16:44:32 +00:00
Duncan P. N. Exon Smith	f73bcf4020	AsmPrinter: Make DIEString small Expose the `DwarfStringPool` entry in a header, and store a pointer to it directly in `DIEString`. Instead of choosing at creation time how to emit it, use the `dwarf::Form` to determine that at emission time. Besides avoiding the other `DIEValue`, this shaves two pointers off of `DIEString`; the data is now a single pointer. This is a nice cleanup on its own -- and drops memory usage from 861 MB down to 853 MB, around 0.9% -- but it's also preparation for passing `DIEValue`s by value. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 238117	2015-05-24 16:40:47 +00:00
Duncan P. N. Exon Smith	03b7a1cf93	AsmPrinter: Extract DwarfStringPoolEntry from DwarfStringPool, NFC Extract out `DwarfStringPoolEntry` and `DwarfStringPoolRef` from `DwarfStringPool` so that downstream users can start using `DwarfStringPool::getEntry()` directly. This will allow users to delay the decision between emitting a symbol or an offset until later. llvm-svn: 238116	2015-05-24 16:33:33 +00:00
Duncan P. N. Exon Smith	1a65e4ade4	AsmPrinter: Emit the DwarfStringPool offset directly when possible Change `DwarfStringPool` to calculate byte offsets on-the-fly, and update `DwarfUnit::getLocalString()` to use a `DIEInteger` instead of a `DIEDelta` when Dwarf doesn't use relocations (i.e., Mach-O). This eliminates another call to `EmitLabelDifference()`, and drops memory usage from 865 MB down to 861 MB, around 0.5%. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 238114	2015-05-24 16:14:59 +00:00
Duncan P. N. Exon Smith	8c6499fa6d	AsmPrinter: Refactor DwarfStringPool::getEntry(), NFC Move `DwarfStringPool`'s `getEntry()` to the header (and make it a member function) in preparation for calculating symbol offsets on-the-fly. llvm-svn: 238112	2015-05-24 16:06:08 +00:00
Matt Arsenault	65ad1602b0	Add target hook to allow merging stores of nonzero constants On GPU targets, materializing constants is cheap and stores are expensive, so only doing this for zero vectors was silly. Most of the new testcases aren't optimally merged, and are for later improvements. llvm-svn: 238108	2015-05-24 00:51:27 +00:00
Aaron Ballman	c681c3d890	Silencing a spurious -Wreturn-type warning; NFC. llvm-svn: 238099	2015-05-23 14:46:49 +00:00
Duncan P. N. Exon Smith	68b3f30778	AsmPrinter: Remove the vtable-entry from DIEValue Remove all virtual functions from `DIEValue`, dropping the vtable pointer from its layout. Instead, create "impl" functions on the subclasses, and use the `DIEValue::Type` to implement the dynamic dispatch. This is necessary -- obviously not sufficient -- for passing `DIEValue`s around by value. However, this change stands on its own: we make tons of these. I measured a drop in memory usage from 888 MB down to 860 MB, or around 3.2%. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 238084	2015-05-23 01:45:07 +00:00
Duncan P. N. Exon Smith	d5aa33525c	CodeGen: Remove redundant DIETypeSignature::dump(), NFC We already have this in `DIEValue`; no reason to shadow it. llvm-svn: 238082	2015-05-23 01:26:26 +00:00
Akira Hatanaka	ddf76aa36f	Stop resetting NoFramePointerElim in TargetMachine::resetTargetOptions. This is part of the work to remove TargetMachine::resetTargetOptions. In this patch, instead of updating global variable NoFramePointerElim in resetTargetOptions, its use in DisableFramePointerElim is replaced with a call to TargetFrameLowering::noFramePointerElim. This function determines on a per-function basis if frame pointer elimination should be disabled. There is no change in functionality except that cl:opt option "disable-fp-elim" can now override function attribute "no-frame-pointer-elim". llvm-svn: 238080	2015-05-23 01:14:08 +00:00
Akira Hatanaka	bd881834c5	Simplify and rename function overrideFunctionAttributes. NFC. This is in preparation to making changes needed to stop resetting NoFramePointerElim in resetTargetOptions. llvm-svn: 238079	2015-05-23 01:12:26 +00:00
Ahmed Bougacha	236f9040d0	[AArch64][CGP] Sink zext feeding stxr/stlxr into the same block. The usual CodeGenPrepare trickery, on a target-specific intrinsic. Without this, the expansion of atomics will usually have the zext be hoisted out of the loop, defeating the various patterns we have to catch this precise case. Differential Revision: http://reviews.llvm.org/D9930 llvm-svn: 238054	2015-05-22 21:37:17 +00:00
Puyan Lotfi	bb457b973d	Compile time improvements to VirtRegRewriter. This change to VirtRegRewriter::addMBBLiveIns adds live-in registers for each MachineBasicBlock's LiveIns set without isLiveIn checks as they are being added because doing so is expensive. After all live-in registers are added, the LiveIn vectors are sorted and uniqued. llvm-svn: 238008	2015-05-22 08:11:26 +00:00
NAKAMURA Takumi	263b27997d	Revert r237954, "Resubmit r237708 (MIR Serialization: print and parse LLVM IR using MIR format)." It brought cyclic dependencies between LLVMCodeGen and LLVMMIR. llvm-svn: 238007	2015-05-22 07:17:07 +00:00
Duncan P. N. Exon Smith	0c54197d31	SDAG: Give SDDbgValues their own allocator (and reset it) Previously `SDDbgValue`s used the general allocator that lives for all of `SelectionDAG`. Instead, give them their own allocator, and reset it whenever `SDDbgInfo::clear()` is called, plugging a spiritual leak. This drops `SelectionDAGBuilder::visitIntrinsicCall()` off of my heap profile (was at around 2% of `llc` for codegen of `-flto -g`). Thanks to Pete Cooper for spotting the problem and suggesting the fix. llvm-svn: 237998	2015-05-22 05:45:19 +00:00
Duncan P. N. Exon Smith	1f0c1c4f47	SDAG: Cleanup initialization of SDDbgValue, NFC Cleanup how `SDDbgValue` is initialized, and rearrange the fields to save two pointers in the struct layout. No real functionality change though (and I doubt the memory savings would show up in a profile). llvm-svn: 237997	2015-05-22 05:35:53 +00:00
Quentin Colombet	7b73bfa67a	[InlineSpiller] Fix rematerialization for bundles. Prior to this patch, we could update the operand of another MI in the same bundle. Longer version: Before InlineSpiller rematerializes a vreg, it iterates over operands of each MI in a bundle, collecting all (MI, OpNo) pairs that reference that vreg. Then if it does rematerialize, it goes through the pair list and replaces the operands with the new (rematerialized) vreg. The problem is, it tries to replace all of these operands in the main MI ! This works fine for single MIs. However, if we are processing a bundle of MIs and the list contains multiple pairs - the rematerialization will either crash trying to access a non-existing operand of the main MI, or silently corrupt one of the existing ones. It will also ignore other MIs in the bundle. The obvious fix is to use the MI pointers saved in collected (MI, OpNo) pairs. This must have been the original intent of the pair list but somehow these pointers got lost. Patch by Dmitri Shtilman <dshtilman@icloud.com>! Differential revision: http://reviews.llvm.org/D9904 <rdar://problem/21002163> llvm-svn: 237964	2015-05-21 21:41:55 +00:00
Sanjay Patel	f911484051	fix typo in comment; NFC llvm-svn: 237962	2015-05-21 21:29:13 +00:00
Alex Lorenz	c37baf82a9	Resubmit r237708 (MIR Serialization: print and parse LLVM IR using MIR format). This commit is a 2nd attempt at committing the initial MIR serialization patch. The first commit (r237708) made the incremental buildbots unstable and was reverted in r237730. The original commit didn't add a terminating null character to the LLVM IR source which was passed to LLParser, and this sometimes caused the test 'llvmIR.mir' to fail with a parsing error because the LLVM IR source didn't have a null character immediately after the end and thus LLLexer encountered some garbage characters that ultimately caused the error. This commit also includes the other test fixes I committed in r237712 (llc path fix) and r237723 (remove target triple) which also got reverted in r237730. --Original Commit Message-- MIR Serialization: print and parse LLVM IR using MIR format. This commit is the initial commit for the MIR serialization project. It creates a new library under CodeGen called 'MIR'. This new library adds a new machine function pass that prints out the LLVM IR using the MIR format. This pass is then added as a last pass when a 'stop-after' option is used in llc. The new library adds the initial functionality for parsing of MIR files as well. This commit also extends the llc tool so that it can recognize and parse MIR input files. Reviewers: Duncan P. N. Exon Smith, Matthias Braun, Philip Reames Differential Revision: http://reviews.llvm.org/D9616 llvm-svn: 237954	2015-05-21 20:54:45 +00:00
Rafael Espindola	0709a7bd1a	Move alignment from MCSectionData to MCSection. This starts merging MCSection and MCSectionData. There are a few issues with the current split between MCSection and MCSectionData. * It optimizes the the not as important case. We want the production of .o files to be really fast, but the split puts the information used for .o emission in a separate data structure. * The ELF/COFF/MachO hierarchy is not represented in MCSectionData, leading to some ad-hoc ways to represent the various flags. * It makes it harder to remember where each item is. The attached patch starts merging the two by moving the alignment from MCSectionData to MCSection. Most of the patch is actually just dropping 'const', since MCSectionData is mutable, but MCSection was not. llvm-svn: 237936	2015-05-21 19:20:38 +00:00
Sanjay Patel	f69f4e42ce	use range-based for-loops; NFCI llvm-svn: 237918	2015-05-21 17:43:26 +00:00
Sanjay Patel	99b3aa3505	use range-based for-loops; NFCI llvm-svn: 237917	2015-05-21 17:22:45 +00:00
Sanjay Patel	f8c028c0b0	use range-based for-loop llvm-svn: 237914	2015-05-21 17:04:17 +00:00
Sanjay Patel	490aca92be	use range-based for-loop; NFCI llvm-svn: 237908	2015-05-21 16:00:50 +00:00
Manuel Klimek	b00d42c10c	std::sort must be called with a strict weak ordering. Found by a debug enabled stl. llvm-svn: 237906	2015-05-21 15:38:25 +00:00
Simon Pilgrim	e054199354	[X86][SSE] Improve support for 128-bit vector sign extension This patch improves support for sign extension of the lower lanes of vectors of integers by making use of the SSE41 pmovsx* sign extension instructions where possible, and optimizing the sign extension by shifts on pre-SSE41 targets (avoiding the use of i64 arithmetic shifts which require scalarization). It converts SIGN_EXTEND nodes to SIGN_EXTEND_VECTOR_INREG where necessary, that more closely matches the pmovsx* instruction than the default approach of using SIGN_EXTEND_INREG which splits the operation (into an ANY_EXTEND lowered to a shuffle followed by shifts) making instruction matching difficult during lowering. Necessary support for SIGN_EXTEND_VECTOR_INREG has been added to the DAGCombiner. Differential Revision: http://reviews.llvm.org/D9848 llvm-svn: 237885	2015-05-21 10:05:03 +00:00
Duncan P. N. Exon Smith	0b73d71abb	AsmPrinter: Compute absolute label difference directly Create a low-overhead path for `EmitLabelDifference()` that emits a emits an absolute number when (1) the output is an object stream and (2) the two symbols are in the same data fragment. This drops memory usage on Mach-O from 975 MB down to 919 MB (5.8%). The only call is when `!doesDwarfUseRelocationsAcrossSections()` -- i.e., on Mach-O -- since otherwise an absolute offset from the start of the section needs a relocation. (`EmitLabelDifference()` is cheaper on ELF anyway, since it creates 1 fewer temp symbol, and it gets called far less often. It's not clear to me if this is even a bottleneck there.) (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 237876	2015-05-21 02:41:23 +00:00
Andrew Kaylor	cafb89df1e	Fix build error llvm-svn: 237859	2015-05-20 23:58:44 +00:00
Andrew Kaylor	69fc4418ab	Fix build warning llvm-svn: 237855	2015-05-20 23:28:03 +00:00
Andrew Kaylor	a6c5b9682e	[WinEH] C++ EH state numbering fixes Differential Revision: http://reviews.llvm.org/D9787 llvm-svn: 237854	2015-05-20 23:22:24 +00:00
Pete Cooper	a05c082866	Don't generate comments in the DebugLocStream unless required. NFC. The ByteStreamer here wasn't taking account of whether the asm streamer was text based and verbose. Only with that combination should we emit comments. This change makes sure that we only actually convert a Twine to a string using Twine::str() if we need the comment. This saves about 10000 small allocations on a test case involving the verify-use_list-order bitcode going through llc with debug info. Note, this is NFC as the comments would ultimately never be emitted unless required. Reviewed by Duncan Exon Smith and David Blaikie. llvm-svn: 237851	2015-05-20 22:51:27 +00:00
Pete Cooper	477300d333	Revert "Add bool to DebugLocDwarfExpression to control emitting comments." This reverts commit 0037b6bcbc874aa1b93d7ce3ad8dba3753ee2d9d (r237827). David Blaikie suggested some alternatives to this which are better. Reverting to apply a better solution later. llvm-svn: 237849	2015-05-20 22:37:48 +00:00
Pete Cooper	35522001fa	Add bool to DebugLocDwarfExpression to control emitting comments. DebugLocDwarfExpression::EmitOp was creating temporary strings by concatenating Twine's. When emitting to object files, these comments are thrown away. This commit adds a boolean to the constructor of the DwarfExpression to control whether it will actually emit any comments. This prevents it from even generating the temporary comments which would have been thrown away anyway. llvm-svn: 237827	2015-05-20 19:50:03 +00:00
Matthias Braun	56a781495a	DAGCombiner: Continue combining if FoldConstantArithmetic() fails. DAG.FoldConstantArithmetic() can fail even though both operands are Constants if OpaqueConstants are involved. Continue trying other combine possibilities in tis case. Differential Revision: http://reviews.llvm.org/D6946 Somewhat related to PR21801 / rdar://19211454 llvm-svn: 237822	2015-05-20 18:54:02 +00:00
Pawel Bylica	8011da9628	Fix icmp lowering Summary: During icmp lowering it can happen that a constant value can be larger than expected (see the code around the change). APInt::getMinSignedBits() must be checked again as the shift before can change the constant sign to positive. I'm not sure it is the best fix possible though. Test Plan: Regression test included. Reviewers: resistor, chandlerc, spatel, hfinkel Reviewed By: hfinkel Subscribers: hfinkel, llvm-commits Differential Revision: http://reviews.llvm.org/D9147 llvm-svn: 237812	2015-05-20 17:21:09 +00:00
Pete Cooper	9e1d335697	Change Function::getIntrinsicID() to return an Intrinsic::ID. NFC. Now that Intrinsic::ID is a typed enum, we can forward declare it and so return it from this method. This updates all users which were either using an unsigned to store it, or had a now unnecessary cast. llvm-svn: 237810	2015-05-20 17:16:39 +00:00
Daniel Sanders	69c6008e49	Revert r237789 - [mips] The naming convention for private labels is ABI dependant. It works, but I've noticed that I missed several callers of createMCAsmInfo() and many don't have a TargetMachine to provide. llvm-svn: 237792	2015-05-20 14:18:59 +00:00
Daniel Sanders	b718eca643	[mips] The naming convention for private labels is ABI dependant. Summary: For N32/N64, private labels begin with '.L' but for O32 they begin with '$'. MCAsmInfo now has an initializer function which can be used to provide information from the TargetMachine to control the assembly syntax. Reviewers: vkalintiris Reviewed By: vkalintiris Subscribers: jfb, sandeep, llvm-commits, rafael Differential Revision: http://reviews.llvm.org/D9821 llvm-svn: 237789	2015-05-20 13:16:42 +00:00
Igor Laevsky	423bc9ec4c	[StatepointLowering] Support of the gc.relocates for invoke statepoints. This change implements support for lowering of the gc.relocates tied to the invoke statepoint. This is acomplished by storing frame indices of the lowered values in "StatepointRelocatedValues" map inside FunctionLoweringInfo instead of storing them in per-basic block structure StatepointLowering. After this change StatepointLowering is used only during "LowerStatepoint" call and it is not necessary to store it as a field in SelectionDAGBuilder anymore. Differential Revision: http://reviews.llvm.org/D7798 llvm-svn: 237786	2015-05-20 11:37:25 +00:00
Swaroop Sridhar	665bc9c936	Add a GCStrategy for CoreCLR This change adds a new GC strategy for supporting the CoreCLR runtime. This strategy is currently identical to Statepoint-example GC, but is necessary for several upcoming changes specific to CoreCLR, such as: 1. Base-pointers not explicitly reported for interior pointers 2. Different format for stack-map encoding 3. Location of Safe-point polls: polls are only needed before loop-back edges and before tail-calls (not needed at function-entry) 4. Runtime specific handshake between calls to managed/unmanaged functions. llvm-svn: 237753	2015-05-20 01:07:23 +00:00
Philip Reames	7738dd68cf	Remove a stale comment The todo was implemented a while ago; I just forgot to remove the comment. llvm-svn: 237736	2015-05-19 22:26:33 +00:00
Alex Lorenz	de1970fe66	Revert r237708 (MIR serialization) - incremental buildbots became unstable. The incremental buildbots entered a pass-fail cycle where during the fail cycle one of the tests from this commit fails for an unknown reason. I have reverted this commit and will investigate the cause of this problem. llvm-svn: 237730	2015-05-19 21:41:28 +00:00
Matthias Braun	07066cca20	MachineInstr: Remove unused parameter. llvm-svn: 237726	2015-05-19 21:22:20 +00:00
Sanjay Patel	03abbb48a4	use 'auto *' for pointers; clearer usage, no deep copying llvm-svn: 237719	2015-05-19 20:10:16 +00:00
Sanjay Patel	ad11415962	tidy up 1. remove duplicate local variable 2. add local variable with name to match comment 3. remove useless comment llvm-svn: 237715	2015-05-19 19:10:57 +00:00
Sanjay Patel	64a6da947a	use range-based for-loop llvm-svn: 237711	2015-05-19 18:24:33 +00:00
Alex Lorenz	c5e0d4d146	MIR Serialization: print and parse LLVM IR using MIR format. This commit is the initial commit for the MIR serialization project. It creates a new library under CodeGen called 'MIR'. This new library adds a new machine function pass that prints out the LLVM IR using the MIR format. This pass is then added as a last pass when a 'stop-after' option is used in llc. The new library adds the initial functionality for parsing of MIR files as well. This commit also extends the llc tool so that it can recognize and parse MIR input files. Reviewers: Duncan P. N. Exon Smith, Matthias Braun, Philip Reames Differential Revision: http://reviews.llvm.org/D9616 llvm-svn: 237708	2015-05-19 18:17:39 +00:00
Matthias Braun	7e10e53f14	RegisterCoalescer: Improve a comment. Explain the relation of the example to the variables in the code, explain what bad behaviour the code avoids in this case. llvm-svn: 237706	2015-05-19 17:52:32 +00:00
Sanjay Patel	3c9e370ec0	use range-based for loop llvm-svn: 237705	2015-05-19 17:49:14 +00:00
Matthias Braun	20683efd47	SelectionDAG: Cleanup and simplify FoldConstantArithmetic This cleans up the FoldConstantArithmetic code by factoring out the case of two ConstantSDNodes into an own function. This avoids unnecessary complexity for many callers who already have ConstantSDNode arguments. This also avoids an intermeidate SmallVector datastructure and a loop over that datastructure. llvm-svn: 237651	2015-05-19 01:40:21 +00:00
Matthias Braun	887fdfb759	DAGCombiner: Factor common pattern into isOneConstant() function. NFC llvm-svn: 237645	2015-05-19 00:25:21 +00:00
Matthias Braun	033121981d	DAGCombiner: Factor common pattern into isAllOnesConstant() function. NFC llvm-svn: 237644	2015-05-19 00:25:20 +00:00
Matthias Braun	0542b5d1db	DAGCombiner: Use isNullConstant() where possible llvm-svn: 237643	2015-05-19 00:25:17 +00:00
Matthias Braun	c545234772	Revert accidental change in r237633 llvm-svn: 237635	2015-05-18 23:18:13 +00:00
Matthias Braun	1505efb0bb	DAGCombiner: Factor common pattern into isNullConstant() function. NFC llvm-svn: 237633	2015-05-18 23:07:27 +00:00
David Blaikie	ff6409d096	Simplify IRBuilder::CreateCall* by using ArrayRef+initializer_list/braced init only llvm-svn: 237624	2015-05-18 22:13:54 +00:00
Matthias Braun	fa3872e7ad	MachineInstr: Change return value of getOpcode() to unsigned. This was previously returning int. However there are no negative opcode numbers and more importantly this was needlessly different from MCInstrDesc::getOpcode() (which even is the value returned here) and SDValue::getOpcode()/SDNode::getOpcode(). llvm-svn: 237611	2015-05-18 20:27:55 +00:00
Jim Grosbach	6f482000e9	MC: Clean up method names in MCContext. The naming was a mish-mash of old and new style. Update to be consistent with the new. NFC. llvm-svn: 237594	2015-05-18 18:43:14 +00:00
Hal Finkel	44b81ee40b	Preserve the order of READ_REGISTER and WRITE_REGISTER At the present time, we don't have a way to represent general dependency relationships, so everything is represented using memory dependency. In order to preserve the data dependency of a READ_REGISTER on WRITE_REGISTER, we need to model WRITE_REGISTER as writing (which we had been doing) and model READ_REGISTER as reading (which we had not been doing). Fix this, and also the way that the chain operands were generated at the SDAG level. Patch by Nicholas Paul Johnson, thanks! Test case by me. llvm-svn: 237584	2015-05-18 16:42:10 +00:00
Oliver Stannard	6cb23465e0	Revert r237579, as it broke windows buildbots llvm-svn: 237583	2015-05-18 16:39:16 +00:00
Oliver Stannard	0c553afe6a	[LLVM - ARM/AArch64] Add ACLE special register intrinsics This patch implements LLVM support for the ACLE special register intrinsics in section 10.1, __arm_{w,r}sr{,p,64}. This patch is intended to lower the read/write_register instrinsics, used to implement the special register intrinsics in the clang patch for special register intrinsics (see http://reviews.llvm.org/D9697), to ARM specific instructions MRC,MCR,MSR etc. to allow reading an writing of coprocessor registers in AArch32 and AArch64. This is done by inspecting the register string passed to the intrinsic and then lowering to the appropriate instruction. Patch by Luke Cheeseman. Differential Revision: http://reviews.llvm.org/D9699 llvm-svn: 237579	2015-05-18 16:23:33 +00:00
Hal Finkel	a60e633fdd	[DAGCombine] Be more pedantic about use iteration in CombineToPreIndexedLoadStore In CombineToPreIndexedLoadStore, when the offset is a constant, we have code that looks for other uses of the pointer which are constant offset computations so that they can be rewritten in terms of the updated pointer so that we don't need to keep a copy of the base pointer to compute these constant offsets. Unfortunately, when it iterated over the uses, it did so by SDNodes, and so we could confuse ourselves if the base pointer was produced by a node that had multiple results (because we would not immediately exclude uses of the other node results). This was reported as PR22755. Unfortunately, we don't have a test case (and I've also been unable to produce one thus far), but at least the mistake is clear. The right way to fix this problem is to make use of the information contained in the use iterators to filter out any uses of other results of the node producing the base pointer. This should be mostly NFC, but should also fix PR22755 (for which, unfortunately, we have no in-tree test case). llvm-svn: 237576	2015-05-18 15:46:02 +00:00
Andrew Trick	569dc65a60	MachineScheduler debug output clarity. llvm-svn: 237545	2015-05-17 23:40:31 +00:00
Andrew Trick	e02d5da8a7	RegisterPressureTracker: reword stale comments. llvm-svn: 237544	2015-05-17 23:40:27 +00:00
Benjamin Kramer	a48e0656b6	[WinEH] Push unique_ptr through the Action interface. This was the source of many leaks in the past, this should fix them once and for all. llvm-svn: 237524	2015-05-16 15:40:03 +00:00
Craig Topper	9a9d58a238	Correct indentation. NFC llvm-svn: 237512	2015-05-16 05:42:08 +00:00
Matthias Braun	352b89c460	MachineSink: Collect registers before clearing their killflags. Currently whenever we sink any instruction, we do clearKillFlags for every use of every use operand for that instruction, apparently there are a lot of duplication, therefore compile time penalties. This patch collect all the interested registers first, do clearKillFlags for it all together at once at the end, so we only need to do clearKillFlags once for one register, duplication is avoided. Patch by Lawrence Hu! Differential Revision: http://reviews.llvm.org/D9719 llvm-svn: 237510	2015-05-16 03:11:07 +00:00
James Molloy	7307cd57c5	[SDAGBuilder] Make the AArch64 builder happier. I intended this loop to only unwrap SplitVector actions, but it was more broad than that, such as unwrapping WidenVector actions, which makes operations seem legal when they're not. llvm-svn: 237457	2015-05-15 17:41:29 +00:00
James Molloy	7e9776b559	Add SDNodes for umin, umax, smin and smax. This adds new SDNodes for signed/unsigned min/max. These nodes are built from select/icmp pairs matched at SDAGBuilder stage. This patch adds the nodes, as well as legalization support and sets them to be "expand" for all targets. NFC for now; this will be tested when I switch AArch64 to using these new nodes. llvm-svn: 237423	2015-05-15 09:03:15 +00:00
Akira Hatanaka	ff86773f51	Stop resetting SanitizeAddress in TargetMachine::resetTargetOptions. NFC. Instead of doing that, create a temporary copy of MCTargetOptions and reset its SanitizeAddress field based on the function's attribute every time an InlineAsm instruction is emitted in AsmPrinter::EmitInlineAsm. This is part of the work to remove TargetMachine::resetTargetOptions (the FIXME added to TargetMachine.cpp in r236009 explains why this function has to be removed). Differential Revision: http://reviews.llvm.org/D9570 llvm-svn: 237412	2015-05-15 00:20:44 +00:00
Matthias Braun	7a247f709b	Turn effective assert(0) into llvm_unreachable llvm-svn: 237379	2015-05-14 18:33:29 +00:00
Matthias Braun	42e1e66e55	TargetSchedule: factor out common code; NFC llvm-svn: 237376	2015-05-14 18:01:13 +00:00
Matthias Braun	bff3a7eb3d	Remove MCInstrItineraries includes in parts that don't use them anymore llvm-svn: 237375	2015-05-14 18:01:11 +00:00
Ahmed Bougacha	6402ad27c0	[CodeGen] Use standard -not gnueabi- naming for f16 libcalls on Darwin. Other targets probably should as well. Since r237161, compiler-rt has both, but I don't see why anything other than gnueabi would use a gnueabi naming scheme. llvm-svn: 237324	2015-05-14 01:00:51 +00:00
Nick Lewycky	37a175007b	Revert r237046. See the testcase on the thread where r237046 was committed. llvm-svn: 237317	2015-05-13 23:41:47 +00:00
Sergey Dmitrouk	46c4f02848	[DebugInfo] Debug locations for constant SD nodes Several updates for [DebugInfo] Add debug locations to constant SD nodes (r235989). Includes: * re-enabling the change (disabled recently); * missing change for FP constants; * resetting debug location of constant node if it's used more than at one place to prevent emission of wrong locations in case of coalesced constants; * a couple of additional tests. Now all look ups in CSEMap are wrapped by additional method. Comment in D9084 suggests that debug locations aren't useful for "target constants", so there might be one more change related to this API (namely, dropping debug locations for getTarget*Constant methods). Differential Revision: http://reviews.llvm.org/D9604 llvm-svn: 237237	2015-05-13 08:58:03 +00:00
Sanjoy Das	a1d39ba940	[Statepoints] Support for "patchable" statepoints. Summary: This change adds two new parameters to the statepoint intrinsic, `i64 id` and `i32 num_patch_bytes`. `id` gets propagated to the ID field in the generated StackMap section. If the `num_patch_bytes` is non-zero then the statepoint is lowered to `num_patch_bytes` bytes of nops instead of a call (the spill and reload code remains unchanged). A non-zero `num_patch_bytes` is useful in situations where a language runtime requires complete control over how a call is lowered. This change brings statepoints one step closer to patchpoints. With some additional work (that is not part of this patch) it should be possible to get rid of `TargetOpcode::STATEPOINT` altogether. PlaceSafepoints generates `statepoint` wrappers with `id` set to `0xABCDEF00` (the old default value for the ID reported in the stackmap) and `num_patch_bytes` set to `0`. This can be made more sophisticated later. Reviewers: reames, pgavlin, swaroop.sridhar, AndyAyers Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9546 llvm-svn: 237214	2015-05-12 23:52:24 +00:00
Saleem Abdulrasool	ee13fbe848	CodeGen: ignore DEBUG_VALUE nodes in KILL tagging DEBUG_VALUE nodes do not take part in code generation. Ignore them when performing KILL updates. Addresses PR23486. llvm-svn: 237211	2015-05-12 23:36:18 +00:00
Pat Gavlin	08d7027cc1	[Statepoints] Clean up statepoint argument accessors. Differential Revision: http://reviews.llvm.org/D9622 llvm-svn: 237191	2015-05-12 21:33:48 +00:00
Pete Cooper	833f34d837	Convert PHI getIncomingValue() to foreach over incoming_values(). NFC. We already had a method to iterate over all the incoming values of a PHI. This just changes all eligible code to use it. Ineligible code included anything which cared about the index, or was also trying to get the i'th incoming BB. llvm-svn: 237169	2015-05-12 20:05:31 +00:00
Pat Gavlin	c7dc6d6ee7	[Statepoints] Split the calling convention and statepoint flags operand to STATEPOINT into two separate operands. Differential Revision: http://reviews.llvm.org/D9623 llvm-svn: 237166	2015-05-12 19:50:19 +00:00
Igor Laevsky	87ef5eaf46	Reverse ordering of base and derived pointer during safepoint lowering. According to the documentation in StackMap section for the safepoint we should have: "The first Location in each pair describes the base pointer for the object. The second is the derived pointer actually being relocated." But before this change we emitted them in reverse order - derived pointer first, base pointer second. llvm-svn: 237126	2015-05-12 13:12:14 +00:00
Eric Christopher	824f42f209	Migrate existing backends that care about software floating point to use the information in the module rather than TargetOptions. We've had and clang has used the use-soft-float attribute for some time now so have the backends set a subtarget feature based on a particular function now that subtargets are created based on functions and function attributes. For the one middle end soft float check go ahead and create an overloadable TargetLowering::useSoftFloat function that just checks the TargetSubtargetInfo in all cases. Also remove the command line option that hard codes whether or not soft-float is set by using the attribute for all of the target specific test cases - for the generic just go ahead and add the attribute in the one case that showed up. llvm-svn: 237079	2015-05-12 01:26:05 +00:00
Andrew Kaylor	0ddaf2bfb9	Fixing memory leak llvm-svn: 237072	2015-05-12 00:13:51 +00:00
Sanjoy Das	3d705e37c3	Refactoring gc_relocate related code in CodeGenPrepare.cpp Summary: The original code inserted new instructions by following a Create->Remove->ReInsert flow. This patch removes the unnecessary Remove->ReInsert part by setting up the InsertPoint correctly at the very beginning. This change does not introduce any functionality change. Patch by Chen Li! Reviewers: reames, AndyAyers, sanjoy Reviewed By: sanjoy Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9687 llvm-svn: 237070	2015-05-11 23:47:30 +00:00
Andrew Kaylor	cc14f387e8	[WinEH] Handle nested landing pads that return directly to the parent function. Differential Revision: http://reviews.llvm.org/D9684 llvm-svn: 237063	2015-05-11 23:06:02 +00:00
Sanjay Patel	5b202966f5	propagate IR-level fast-math-flags to DAG nodes; 2nd try; NFC This is a less ambitious version of: http://reviews.llvm.org/rL236546 because that was reverted in: http://reviews.llvm.org/rL236600 because it caused memory corruption that wasn't related to FMF but was actually due to making nodes with 2 operands derive from a plain SDNode rather than a BinarySDNode. This patch adds the minimum plumbing necessary to use IR-level fast-math-flags (FMF) in the backend without actually using them for anything yet. This is a follow-on to: http://reviews.llvm.org/rL235997 ...which split the existing nsw / nuw / exact flags and FMF into their own struct. llvm-svn: 237046	2015-05-11 21:07:09 +00:00
Andrew Kaylor	ce6f907e2f	Fixing build warnings llvm-svn: 237042	2015-05-11 20:45:11 +00:00
Andrew Kaylor	762a6bea1f	[WinEH] Update exception numbering to give handlers their own base state. Differential Revision: http://reviews.llvm.org/D9512 llvm-svn: 237014	2015-05-11 19:41:19 +00:00
Sanjoy Das	89c5491a72	[RewriteStatepointsForGC] Fix a bug on creating gc_relocate for pointer to vector of pointers Summary: In RewriteStatepointsForGC pass, we create a gc_relocate intrinsic for each relocated pointer, and the gc_relocate has the same type with the pointer. During the creation of gc_relocate intrinsic, llvm requires to mangle its type. However, llvm does not support mangling of all possible types. RewriteStatepointsForGC will hit an assertion failure when it tries to create a gc_relocate for pointer to vector of pointers because mangling for vector of pointers is not supported. This patch changes the way RewriteStatepointsForGC pass creates gc_relocate. For each relocated pointer, we erase the type of pointers and create an unified gc_relocate of type i8 addrspace(1)*. Then a bitcast is inserted to convert the gc_relocate to the correct type. In this way, gc_relocate does not need to deal with different types of pointers and the unsupported type mangling is no longer a problem. This change would also ease further merge when LLVM erases types of pointers and introduces an unified pointer type. Some minor changes are also introduced to gc_relocate related part in InstCombineCalls, CodeGenPrepare, and Verifier accordingly. Patch by Chen Li! Reviewers: reames, AndyAyers, sanjoy Reviewed By: sanjoy Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9592 llvm-svn: 237009	2015-05-11 18:49:34 +00:00
Matthias Braun	5391754288	LiveRangeCalc: Improve error messages on malformed IR llvm-svn: 237008	2015-05-11 18:47:47 +00:00
Simon Pilgrim	e09584ca95	[SelectionDAG] Fixed constant folding issue when legalised types are smaller then the folded type. Found when testing with llvm-stress on i686 targets. llvm-svn: 236954	2015-05-10 14:14:51 +00:00
James Y Knight	fca02be3c1	Fix MergeConsecutiveStore for non-byte-sized memory accesses. The bug showed up as a compile-time assertion failure: Assertion `NumBits >= MIN_INT_BITS && "bitwidth too small"' failed when building msan tests on x86-64. Prior to r236850, this bug was masked due to a bogus alignment check, which also accidentally rejected non-byte-sized accesses. Afterwards, an invalid ElementSizeBytes == 0 got further into the function, and triggered the assertion failure. It would probably be a good idea to allow it to handle merging stores of unusual widths as well, but for now, to un-break it, I'm just making the minimal fix. Differential Revision: http://reviews.llvm.org/D9626 llvm-svn: 236927	2015-05-09 03:13:37 +00:00
Tom Stellard	f01af29f01	MachineCSE: Add a target query for the LookAheadLimit heurisitic This is used to determine whether or not to CSE physical register defs. Differential Revision: http://reviews.llvm.org/D9472 llvm-svn: 236923	2015-05-09 00:56:07 +00:00
Pete Cooper	d54fb89901	[Fast-ISel] Don't mark the first use of a remat constant as killed. When emitting something like 'add x, 1000' if we remat the 1000 then we should be able to mark the vreg containing 1000 as killed. Given that we go bottom up in fast-isel, a later use of 1000 will be higher up in the BB and won't kill it, or be impacted by the lower kill. However, rematerialised constant expressions aren't generated bottom up. The local value save area grows downwards. This means that if you remat 2 constant expressions which both use 1000 then the first will kill it, then the second, which is lower in the BB will read a killed register. This is the case in the attached test where the 2 GEPs both need to generate 'add x, 6680' for the constant offset. Note that this commit only makes kill flag generation conservative. There's nothing else obviously wrong with the local value save area growing downwards, and in fact it needs to for handling arbitrarily complex constant expressions. However, it would be nice if there was a solution which would let us generate more accurate kill flags, or just kill flags completely. llvm-svn: 236922	2015-05-09 00:51:03 +00:00
Arnold Schwaighofer	f54b73d681	ScheduleDAGInstrs: In functions with tail calls PseudoSourceValues are not non-aliasing distinct objects The code that builds the dependence graph assumes that two PseudoSourceValues don't alias. In a tail calling function two FixedStackObjects might refer to the same location. Worse 'immutable' fixed stack objects like function arguments are not immutable and will be clobbered. Change this so that a load from a FixedStackObject is not invariant in a tail calling function and don't return a PseudoSourceValue for an instruction in tail calling functions when building the dependence graph so that we handle function arguments conservatively. Fix for PR23459. rdar://20740035 llvm-svn: 236916	2015-05-08 23:52:00 +00:00
Hans Wennborg	ae0254dabc	Switch lowering: cluster adjacent fall-through cases even at -O0 It's cheap to do, and codegen is much faster if cases can be merged into clusters. llvm-svn: 236905	2015-05-08 21:23:39 +00:00
Pete Cooper	e4bb07ecff	[Fast-ISel] Clear kill flags on registers replaced by updateValueMap. When selecting an extract instruction, we don't actually generate code but instead work out which register we are reading, and rewrite uses of the extract def to the source register. This is done via updateValueMap,. However, its possible that the source register we are rewriting to to also have uses. If those uses are after a kill of the value we are rewriting from then we have uses after a kill and the verifier fails. This code checks for the case where the to register is also used, and if so it clears all kill on the from register. This is conservative, but better that always clearing kills on the from register. llvm-svn: 236897	2015-05-08 20:46:54 +00:00
Pat Gavlin	cc0431d1c0	Extend the statepoint intrinsic to allow statepoints to be marked as transitions from GC-aware code to code that is not GC-aware. This changes the shape of the statepoint intrinsic from: @llvm.experimental.gc.statepoint(anyptr target, i32 # call args, i32 unused, ...call args, i32 # deopt args, ...deopt args, ...gc args) to: @llvm.experimental.gc.statepoint(anyptr target, i32 # call args, i32 flags, ...call args, i32 # transition args, ...transition args, i32 # deopt args, ...deopt args, ...gc args) This extension offers the backend the opportunity to insert (somewhat) arbitrary code to manage the transition from GC-aware code to code that is not GC-aware and back. In order to support the injection of transition code, this extension wraps the STATEPOINT ISD node generated by the usual lowering lowering with two additional nodes: GC_TRANSITION_START and GC_TRANSITION_END. The transition arguments that were passed passed to the intrinsic (if any) are lowered and provided as operands to these nodes and may be used by the backend during code generation. Eventually, the lowering of the GC_TRANSITION_{START,END} nodes should be informed by the GC strategy in use for the function containing the intrinsic call; for now, these nodes are instead replaced with no-ops. Differential Revision: http://reviews.llvm.org/D9501 llvm-svn: 236888	2015-05-08 18:07:42 +00:00
Pete Cooper	85b1c48b20	Clear kill flags on all used registers when sinking instructions. The test here was sinking the AND here to a lower BB: %vreg7<def> = ANDWri %vreg8, 0; GPR32common:%vreg7,%vreg8 TBNZW %vreg8<kill>, 0, <BB#1>; GPR32common:%vreg8 which meant that vreg8 was read after it was killed. This commit changes the code from clearing kill flags on the AND to clearing flags on all registers used by the AND. llvm-svn: 236886	2015-05-08 17:54:32 +00:00
Pete Cooper	ff5064a188	80 cols fix since i'm looking at this function anyway. NFC llvm-svn: 236885	2015-05-08 17:54:29 +00:00
James Y Knight	284e7b3d6c	Fix alignment checks in MergeConsecutiveStores. 1) check whether the alignment of the memory is sufficient for the merged store or load to be efficient. Not doing so can result in some ridiculously poor code generation, if merging creates a vector operation which must be aligned but isn't. 2) DON'T check that the alignment of each load/store is equal. If you're merging 2 4-byte stores, the first might have 8-byte alignment, but the second certainly will have 4-byte alignment. We do want to allow those to be merged. llvm-svn: 236850	2015-05-08 13:47:01 +00:00
Igor Laevsky	9d3932bf96	Fix coding standart based on post submit comments. Differential Revision: http://reviews.llvm.org/D7760 llvm-svn: 236849	2015-05-08 13:17:22 +00:00
Pete Cooper	ba593ad3f3	Clear kill flags in tail duplication. If we duplicate an instruction then we must also clear kill flags on any uses we rewrite. Otherwise we might be killing a register which was used in other BBs. For example, here the entry BB ended up with these instructions, the ADD having been tail duplicated. %vreg24<def> = t2ADDri %vreg10<kill>, 1, pred:14, pred:%noreg, opt:%noreg; GPRnopc:%vreg24 rGPR:%vreg10 %vreg22<def> = COPY %vreg10; GPR:%vreg22 rGPR:%vreg10 The copy here is inserted after the add and so needs vreg10 to be live. llvm-svn: 236782	2015-05-07 21:48:26 +00:00
Hans Wennborg	44faaa7aa4	Switch lowering: handle zero-weight branch probabilities After r236617, branch probabilities are no longer guaranteed to be >= 1. This patch makes the swich lowering code handle that correctly, without bumping the branch weights by 1 which might cause overflow and skews the probabilities. Covered by @zero_weight_tree in test/CodeGen/X86/switch.ll. llvm-svn: 236739	2015-05-07 15:47:15 +00:00
Pete Cooper	27483915e8	Handle dead defs in the if converter. We had code such as this: r2 = ... t2Bcc label1: ldr ... r2 label2; return r2<dead, def> The if converter was transforming this to r2<def> = ... return [pred] r2<dead,def> ldr <r2, kill> return which fails the machine verifier because the ldr now reads from a dead def. The fix here detects dead defs in stepForward and passes them back to the caller in the clobbers list. The caller then clears the dead flag from the def is the value is live. llvm-svn: 236660	2015-05-06 22:51:04 +00:00
Quentin Colombet	0ddd315db0	[RegisterCoalescer] Make sure each live-range has only one component, as demanded by the machine verifier. After shrinking a live-range to its uses, it is possible to create several smaller live-ranges. When this happens, shrinkToUses returns true and we need to split the different components into their own live-ranges. The problem does not reproduce on any in-tree target but Jonas Paulsson <jonas.paulsson@ericsson.com>, who reported the problem, checked that this patch fixes the issue. llvm-svn: 236658	2015-05-06 22:41:50 +00:00
Pete Cooper	54085cdc7b	Fix incorrect kill flags in fastisel. If called twice in the same BB on the same constant, FastISel::fastEmit_ri_ was marking the materialized vreg as killed on each use, instead of only the last use. Change this to only mark the last use as killed by making earlier uses check if the vreg is already used elsewhere. llvm-svn: 236650	2015-05-06 22:09:29 +00:00
Duncan P. N. Exon Smith	c177fec93f	MC: Skip names of temporary symbols in object streamer Don't create names for temporary symbols when using an object streamer. The names never make it to the output anyway. From the starting point of r236629, my heap profile says this drops peak memory usage from 1100 MB to 1058 MB for CodeGen of `verify-uselistorder`, a savings of almost 4% on peak memory, and removes `StringMap<bool, BumpPtrAllocator...>` from the profile entirely. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 236642	2015-05-06 21:34:34 +00:00
Tim Northover	e4310fe946	CodeGen: move over-zealous assert into actual if statement. It's quite possible to encounter an insertvalue instruction that's more deeply nested than the value we're looking for, but when that happens we really mustn't compare beyond the end of the index array. Since I couldn't see any guarantees about what comparisons std::equal makes, we probably need to directly check the size beforehand. In practice, I suspect most std::equal implementations would probably bail early, which would be OK. But just in case... rdar://20834485 llvm-svn: 236635	2015-05-06 20:07:38 +00:00
Duncan P. N. Exon Smith	653c1099b4	DwarfDebug: Emit number of bytes in .debug_loc entry directly Emit the number of bytes in a `.debug_loc` entry directly. The old code created temp labels (expensive), emitted the difference between them, and then emitted one on each side of the relevant bytes. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc` (the optimized version of ld64's `-save-temps` when linking the `verify-uselistorder` executable in an LTO bootstrap). I've hacked `MCContext::Allocate()` to just call `malloc()` instead of using the `BumpPtrAllocator` so that the heap profile is easier to read. As far as peak memory is concerned, `MCContext::Allocate()` is equivalent to a leak, since it only gets freed at process teardown. In my heap profile, this patch drops memory usage of `DwarfDebug::emitDebugLoc()` from 132.56 MB (11.4%) down to 29.86 MB (2.7%) at peak memory. Some of that must be noise from `SmallVector` (or other) allocations -- peak memory only dropped from 1160 MB down to 1100 MB -- but this nevertheless shaves 5% off the top.) llvm-svn: 236629	2015-05-06 19:11:20 +00:00
Reid Kleckner	d1b38c4b0b	[WinEH] Improve fatal error message about failed demotion llvm-svn: 236626	2015-05-06 18:45:24 +00:00
Sanjoy Das	6c0fe24bd1	[SelectionDAG] Delete SelectionDAGBuilder::removeValue. NFC. SelectionDAGBuilder::removeValue is dead now, after rL236563. llvm-svn: 236618	2015-05-06 18:02:10 +00:00
Diego Novillo	14f94de1ee	Allow 0-weight branches in BranchProbabilityInfo. Summary: When computing branch weights in BPI, we used to disallow branches with weight 0. This is a minor nuisance, because a branch with weight 0 is different to "don't have information". In the context of instrumentation, it may mean "never executed", in the context of sampling, it means "never or seldom executed". In allowing 0 weight branches, I ran into issues with the switch expansion code in selection DAG. It is currently hardwired to not handle branches with weight 0. To maintain the current behaviour, I changed it to use 1 when it finds 0, but perhaps the algorithm needs changes to tolerate branches with weight zero. Reviewers: hansw Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9533 llvm-svn: 236617	2015-05-06 17:55:11 +00:00
Matt Arsenault	633dba4f41	Add ChangeTo* to MachineOperand for symbols llvm-svn: 236612	2015-05-06 17:05:54 +00:00
NAKAMURA Takumi	e452998b4b	Reformat. llvm-svn: 236601	2015-05-06 14:03:22 +00:00
NAKAMURA Takumi	d7c0be9c42	Revert r236546, "propagate IR-level fast-math-flags to DAG nodes (NFC)" It caused undefined behavior. llvm-svn: 236600	2015-05-06 14:03:12 +00:00
Pawel Bylica	9f1fb9d1ef	SelectionDAG: Handle out-of-bounds index in extract vector element Summary: This patch correctly handles undef case of EXTRACT_VECTOR_ELT node where the element index is constant and not less than vector size. Test Plan: CodeGen for X86 test included. Also one incorrect regression test fixed. Reviewers: qcolombet, chandlerc, hfinkel Reviewed By: hfinkel Subscribers: hfinkel, llvm-commits Differential Revision: http://reviews.llvm.org/D9250 llvm-svn: 236584	2015-05-06 10:19:14 +00:00
Sanjoy Das	4bfb472072	[Statepoint] Clean up StatepointLowering: symbolic constants. For accessors in the `Statepoint` class, use symbolic constants for offsets into the argument vector instead of literals. This makes the code intent clearer and simpler to change. llvm-svn: 236566	2015-05-06 02:36:31 +00:00
Sanjoy Das	499d703f52	[Statepoint] Clean up Statepoint.h: accessor names. Use getFoo() as accessors consistently and some other naming changes. llvm-svn: 236564	2015-05-06 02:36:26 +00:00
Sanjoy Das	c6bf3e9f12	[StatepointLowering] Don't create temporary instructions. NFCI. Summary: Instead of creating a temporary call instruction and lowering that, use SelectionDAGBuilder::lowerCallOperands. Reviewers: reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9480 llvm-svn: 236563	2015-05-06 02:36:20 +00:00
Ahmed Bougacha	ed363c5dcb	[WinEH] Reset WinEHPrepare::SEHExceptionCodeSlot when we're done. This caused a use-after-free on test/CodeGen/X86/win32-eh.ll No functional change intended. llvm-svn: 236561	2015-05-06 01:28:58 +00:00
Sanjoy Das	1194d1e799	[SelectionDAG] Make an argument optional in RFV::getCopyToRegs. NFC. Summary: We default the value argument to nullptr. The only use of the value is in diagnosePossiblyInvalidConstraint and that seems to be resilient to it being nullptr. Reviewers: atrick, reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9479 llvm-svn: 236555	2015-05-05 23:06:57 +00:00
Sanjoy Das	3936a97f11	[SelectionDAG] Move RegsForValue into SelectionDAGBuilder.h. NFC. Summary: The exported class will be used in later change, in StatepointLowering.cpp. It is still internal to SelectionDAG (not exported via include/). Reviewers: reames, atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9478 llvm-svn: 236554	2015-05-05 23:06:54 +00:00
Sanjoy Das	84153c450a	[SelectionDAG] Pass explicit type to lowerCallOperands. NFC. Summary: Currently this does not change anything, but change will be used in a later change to StatepointLowering.cpp Reviewers: reames, atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9477 llvm-svn: 236553	2015-05-05 23:06:52 +00:00
Sanjoy Das	3fb91c0a0d	[StatepointLowering] Rename variable, NFC. Rename LoweredArgs to LoweredMetaArgs to clarify intent. llvm-svn: 236552	2015-05-05 23:06:49 +00:00
Pete Cooper	ce9ad757c7	Fix IfConverter to handle regmask machine operands. Note, this is a recommit of r236515 after fixing an error in r236514. The buildbot ran fast enough that it picked up r236514 prior to r236515 and threw an error. r236515 itself ran 'make check' without errors. Original commit message follows: A regmask (typically seen on a call) clobbers the set of registers it lists. The IfConverter, in UpdatePredRedefs, was handling register defs, but not regmasks. These are slightly different to a def in that we need to add both an implicit use and def to appease the machine verifier. Otherwise, uses after the if converted call could think they are reading an undefined register. Reviewed by Matthias Braun and Quentin Colombet. llvm-svn: 236550	2015-05-05 22:09:41 +00:00
Sanjay Patel	801caff64d	propagate IR-level fast-math-flags to DAG nodes (NFC) This patch adds the minimum plumbing necessary to use IR-level fast-math-flags (FMF) in the backend without actually using them for anything yet. This is a follow-on to: http://reviews.llvm.org/rL235997 ...which split the existing nsw / nuw / exact flags and FMF into their own struct. There are 2 structural changes here: 1. The main diff is that we're preparing to extend the optimization flags to affect more than just binary SDNodes. Eg, IR intrinsics ( https://llvm.org/bugs/show_bug.cgi?id=21290 ) or non-binop nodes that don't even exist in IR such as FMA, FNEG, etc. 2. The other change is that we're actually copying the FP fast-math-flags from the IR instructions to SDNodes. Differential Revision: http://reviews.llvm.org/D8900 llvm-svn: 236546	2015-05-05 21:40:38 +00:00
Pete Cooper	7605e37a63	Refactor UpdatePredRedefs and StepForward to avoid duplication. NFC Note, this is a reapplication of r236515 with a fix to not assert on non-register operands, but instead only handle them until the subsequent commit. Original commit message follows. The code was basically the same here already. Just added an out parameter for a vector of seen defs so that UpdatePredRedefs can call StepForward first, then do its own post processing on the seen defs. Will be used in the next commit to also handle regmasks. llvm-svn: 236538	2015-05-05 20:14:22 +00:00
Ulrich Weigand	9958c489bb	[DAGCombiner] Account for getVectorIdxTy() when narrowing vector load This patch makes ReplaceExtractVectorEltOfLoadWithNarrowedLoad convert the element number from getVectorIdxTy() to PtrTy before doing pointer arithmetic on it. This is needed on z, where element numbers are i32 but pointers are i64. Original patch by Richard Sandiford. llvm-svn: 236530	2015-05-05 19:34:10 +00:00
Ulrich Weigand	af2c618e2b	[DAGCombiner] Fix ReplaceExtractVectorEltOfLoadWithNarrowedLoad for BE For little-endian, the function would convert (extract_vector_elt (load X), Y) to X + Ysizeof(elt). For big-endian it would instead use X + sizeof(vec) - Ysizeof(elt). The big-endian case wasn't right since vector index order always follows memory/array order, even for big-endian. (Note that the current handling has to be wrong for Y==0 since it would access beyond the end of the vector.) Original patch by Richard Sandiford. llvm-svn: 236529	2015-05-05 19:33:37 +00:00
Ulrich Weigand	2693c0a491	[LegalizeVectorTypes] Allow single loads and stores for more short vectors When lowering a load or store for TypeWidenVector, the type legalizer would use a single load or store if the associated integer type was legal. E.g. it would load a v4i8 as an i32 if i32 was legal. This patch extends that behavior to promoted integers as well as legal ones. If the integer type for the full vector width is TypePromoteInteger, the element type is going to be TypePromoteInteger too, and it's still better to use a single promoting load or truncating store rather than N individual promoting loads or truncating stores. E.g. if you have a v2i8 on a target where i16 is promoted to i32, it's better to load the v2i8 as an i16 rather than load both i8s individually. Original patch by Richard Sandiford. llvm-svn: 236528	2015-05-05 19:32:57 +00:00
Pete Cooper	336d90b61b	Revert "Refactor UpdatePredRedefs and StepForward to avoid duplication. NFC" This reverts commit 963cdbccf6e5578822836fd9b2ebece0ba9a60b7 (ie r236514) This is to get the bots green while i investigate. llvm-svn: 236518	2015-05-05 18:49:08 +00:00
Pete Cooper	05b84d4168	Revert "Fix IfConverter to handle regmask machine operands." This reverts commit b27413cbfd78d959c18e713bfa271fb69e6b3303 (ie r236515). This is to get the bots green while i investigate the failures. llvm-svn: 236517	2015-05-05 18:49:05 +00:00
Pete Cooper	6ebc207703	Fix IfConverter to handle regmask machine operands. A regmask (typically seen on a call) clobbers the set of registers it lists. The IfConverter, in UpdatePredRedefs, was handling register defs, but not regmasks. These are slightly different to a def in that we need to add both an implicit use and def to appease the machine verifier. Otherwise, uses after the if converted call could think they are reading an undefined register. Reviewed by Matthias Braun and Quentin Colombet. llvm-svn: 236515	2015-05-05 18:31:36 +00:00
Pete Cooper	bbd1c727d1	Refactor UpdatePredRedefs and StepForward to avoid duplication. NFC The code was basically the same here already. Just added an out parameter for a vector of seen defs so that UpdatePredRedefs can call StepForward first, then do its own post processing on the seen defs. Will be used in the next commit to also handle regmasks. llvm-svn: 236514	2015-05-05 18:31:31 +00:00
Reid Kleckner	0738a9c02e	Re-land "[WinEH] Add an EH registration and state insertion pass for 32-bit x86" This reverts commit r236360. This change exposed a bug in WinEHPrepare by opting win32 code into EH preparation. We already knew that WinEHPrepare has bugs, and is the status quo for x64, so I don't think that's a reason to hold off on this change. I disabled exceptions in the sanitizer tests in r236505 and an earlier revision. llvm-svn: 236508	2015-05-05 17:44:16 +00:00
Quentin Colombet	61b305edfd	[ShrinkWrap] Add (a simplified version) of shrink-wrapping. This patch introduces a new pass that computes the safe point to insert the prologue and epilogue of the function. The interest is to find safe points that are cheaper than the entry and exits blocks. As an example and to avoid regressions to be introduce, this patch also implements the required bits to enable the shrink-wrapping pass for AArch64. Context Currently we insert the prologue and epilogue of the method/function in the entry and exits blocks. Although this is correct, we can do a better job when those are not immediately required and insert them at less frequently executed places. The job of the shrink-wrapping pass is to identify such places. Motivating example Let us consider the following function that perform a call only in one branch of a if: define i32 @f(i32 %a, i32 %b) { %tmp = alloca i32, align 4 %tmp2 = icmp slt i32 %a, %b br i1 %tmp2, label %true, label %false true: store i32 %a, i32* %tmp, align 4 %tmp4 = call i32 @doSomething(i32 0, i32* %tmp) br label %false false: %tmp.0 = phi i32 [ %tmp4, %true ], [ %a, %0 ] ret i32 %tmp.0 } On AArch64 this code generates (removing the cfi directives to ease readabilities): _f: ; @f ; BB#0: stp x29, x30, [sp, #-16]! mov x29, sp sub sp, sp, #16 ; =16 cmp w0, w1 b.ge LBB0_2 ; BB#1: ; %true stur w0, [x29, #-4] sub x1, x29, #4 ; =4 mov w0, wzr bl _doSomething LBB0_2: ; %false mov sp, x29 ldp x29, x30, [sp], #16 ret With shrink-wrapping we could generate: _f: ; @f ; BB#0: cmp w0, w1 b.ge LBB0_2 ; BB#1: ; %true stp x29, x30, [sp, #-16]! mov x29, sp sub sp, sp, #16 ; =16 stur w0, [x29, #-4] sub x1, x29, #4 ; =4 mov w0, wzr bl _doSomething add sp, x29, #16 ; =16 ldp x29, x30, [sp], #16 LBB0_2: ; %false ret Therefore, we would pay the overhead of setting up/destroying the frame only if we actually do the call. Proposed Solution This patch introduces a new machine pass that perform the shrink-wrapping analysis (See the comments at the beginning of ShrinkWrap.cpp for more details). It then stores the safe save and restore point into the MachineFrameInfo attached to the MachineFunction. This information is then used by the PrologEpilogInserter (PEI) to place the related code at the right place. This pass runs right before the PEI. Unlike the original paper of Chow from PLDI’88, this implementation of shrink-wrapping does not use expensive data-flow analysis and does not need hack to properly avoid frequently executed point. Instead, it relies on dominance and loop properties. The pass is off by default and each target can opt-in by setting the EnableShrinkWrap boolean to true in their derived class of TargetPassConfig. This setting can also be overwritten on the command line by using -enable-shrink-wrap. Before you try out the pass for your target, make sure you properly fix your emitProlog/emitEpilog/adjustForXXX method to cope with basic blocks that are not necessarily the entry block. Design Decisions 1. ShrinkWrap is its own pass right now. It could frankly be merged into PEI but for debugging and clarity I thought it was best to have its own file. 2. Right now, we only support one save point and one restore point. At some point we can expand this to several save point and restore point, the impacted component would then be: - The pass itself: New algorithm needed. - MachineFrameInfo: Hold a list or set of Save/Restore point instead of one pointer. - PEI: Should loop over the save point and restore point. Anyhow, at least for this first iteration, I do not believe this is interesting to support the complex cases. We should revisit that when we motivating examples. Differential Revision: http://reviews.llvm.org/D9210 <rdar://problem/3201744> llvm-svn: 236507	2015-05-05 17:38:16 +00:00
Tim Northover	851ff69b42	CodeGen: match up correct insertvalue indices when assessing tail calls. When deciding whether a value comes from the aggregate or inserted value of an insertvalue instruction, we compare the indices against those of the location we're interested in. One of the lists needs reversing because the input data is backwards (so that modifications take place at the end of the SmallVector), but we were reversing both before leading to incorrect results. Should fix PR23408 llvm-svn: 236457	2015-05-04 20:41:51 +00:00
Pete Cooper	300069a019	ScheduleDAGInstrs should toggle kill flags on bundled instrs. ScheduleDAGInstrs wasn't setting or clearing the kill flags on instructions inside bundles. This led to code such as this %R3<def> = t2ANDrr %R0 BUNDLE %ITSTATE<imp-def,dead>, %R0<imp-use,kill> t2IT 1, 24, %ITSTATE<imp-def> R6<def,tied6> = t2ORRrr %R0<kill>, ... being transformed to BUNDLE %ITSTATE<imp-def,dead>, %R0<imp-use> t2IT 1, 24, %ITSTATE<imp-def> R6<def,tied6> = t2ORRrr %R0<kill>, ... %R3<def> = t2ANDrr %R0<kill> where the kill flag was removed from the BUNDLE instruction, but not the t2ORRrr inside it. The verifier then thought that R0 was undefined when read by the AND. This change make the toggleKillFlags method also check for bundles and toggle flags on bundled instructions. Setting the kill flag is special cased as we only want to set the kill flag on the last instruction in the bundle. llvm-svn: 236428	2015-05-04 16:52:06 +00:00
Elena Demikhovsky	1b60ed7069	Masked gather and scatter intrinsics - enabled codegen for KNL. llvm-svn: 236394	2015-05-03 07:12:25 +00:00
Simon Pilgrim	017ca19384	[DAGCombiner] Enabled vector float/double -> int constant folding llvm-svn: 236387	2015-05-02 13:04:07 +00:00
David Blaikie	72d03efa6d	DebugInfo: Use low_pc relative debug_ranges under fission when the CU has a low_pc Seems we were setting the base address on the wrong DwarfCompileUnit object so it wasn't being used when generating the ranges. llvm-svn: 236377	2015-05-02 02:31:49 +00:00
Jim Grosbach	bfe3a9c318	Fix spelling. llvm-svn: 236367	2015-05-02 00:44:07 +00:00
Reid Kleckner	83d89fa546	Revert "[WinEH] Add an EH registration and state insertion pass for 32-bit x86" This reverts commit r236359. Things are still broken despite testing. :( llvm-svn: 236360	2015-05-01 22:50:14 +00:00
Reid Kleckner	51476acd77	Re-land "[WinEH] Add an EH registration and state insertion pass for 32-bit x86" This reverts commit r236340. llvm-svn: 236359	2015-05-01 22:40:25 +00:00
Reid Kleckner	2747d3d55a	Revert "[WinEH] Add an EH registration and state insertion pass for 32-bit x86" This reverts commit r236339, it breaks the win32 clang-cl self-host. llvm-svn: 236340	2015-05-01 20:14:04 +00:00
Reid Kleckner	4856fc61b4	[WinEH] Add an EH registration and state insertion pass for 32-bit x86 This pass is responsible for constructing the EH registration object that gets linked into fs:00, which is all it does in this change. In the future, it will also insert stores to update the EH state number. I considered keeping this functionality in WinEHPrepare, but it's pretty separable and X86 specific. It has conceptually very little to do with the task of WinEHPrepare, which is currently outlining. WinEHPrepare is also in theory useful on ARM, but this logic is pretty x86 specific. Reviewers: andrew.w.kaylor, majnemer Differential Revision: http://reviews.llvm.org/D9422 llvm-svn: 236339	2015-05-01 20:04:54 +00:00
Simon Pilgrim	9fb06bca67	[SelectionDAG] Unary vector constant folding integer legality fixes This patch fixes issues with vector constant folding not correctly handling scalar input operands if they require implicit truncation - this was tested with llvm-stress as recommended by Patrik H Hagglund. The patch ensures that integer input scalars from a build vector are correctly truncated before folding, and that constant integer scalar results are promoted to a legal type before inclusion in the new folded build vector. I have added another crash test case and also a test for UINT_TO_FP / SINT_TO_FP using an non-truncated scalar input, which was failing before this patch. Differential Revision: http://reviews.llvm.org/D9282 llvm-svn: 236308	2015-05-01 08:20:04 +00:00
Matt Arsenault	59d2ca1cba	Fix typo llvm-svn: 236283	2015-04-30 23:20:56 +00:00
Pete Cooper	451755d370	Commute the internal flag on MachineOperands. When commuting a thumb instruction in the size reduction pass, thumb instructions are represented as a bundle and so some operands may be marked as internal. The internal flag has to move with the operand when commuting. This test is sensitive to register allocation so can't specifically check that this error was happening, but so long as it continues to pass with -verify then hopefully its still ok. rdar://problem/20752113 llvm-svn: 236282	2015-04-30 23:14:14 +00:00
Andrea Di Biagio	c84b5bdd69	Fix for PR23103. Correctly propagate the 'IsUndef' flag to the register operands of a commuted instruction. Revision 220239 exposed a latent bug in method 'TargetInstrInfo::commuteInstruction'. When commuting the operands of a machine instruction, method 'commuteInstruction' didn't correctly propagate the 'IsUndef' flag to the register operands of the new (commuted) instruction. Before this patch, the following instruction: %vreg4<def> = VADDSDrr %vreg14, %vreg5<undef>; FR64:%vreg4,%vreg14,%vreg5 was wrongly converted by method 'commuteInstruction' into: %vreg4<def> = VADDSDrr %vreg5, %vreg14<undef>; FR64:%vreg4,%vreg5,%vreg14 The correct instruction should have been: %vreg4<def> = VADDSDrr %vreg5<undef>, %vreg14; FR64:%vreg4,%vreg5,%vreg14 This patch fixes the problem in method 'TargetInstrInfo::commuteInstruction'. When swapping the operands of a machine instruction, we now make sure that 'IsUndef' flags are correctly set. Added test case 'pr23103.ll'. Differential Revision: http://reviews.llvm.org/D9406 llvm-svn: 236258	2015-04-30 21:03:29 +00:00
Matt Arsenault	ee5c2ab734	MachineVerifier: Don't crash if MachineOperand has no parent If you somehow added a MachineOperand to an instruction that did not have the parent set, the verifier would crash since it attempts to use the operand's parent. llvm-svn: 236249	2015-04-30 19:35:41 +00:00
Pete Cooper	4d8d2ec3eb	Don't rewrite jumps to empty BBs to landing pads. In the test case here, the 'unreachable' BB was removed by BranchFolding because its empty. It then rewrote the jump from 'entry' to jump to its fallthrough, which was a landing pad. This results in 'entry' jumping to 2 different landing pads, which fails the machine verifier. rdar://problem/20750162 llvm-svn: 236248	2015-04-30 18:58:23 +00:00
Reid Kleckner	582786b6cc	Add a note about permitting default member initializers Use them in WinEHPrepare so that we can spot any toolchain bugs that come up. llvm-svn: 236244	2015-04-30 18:17:12 +00:00
Jan Vesely	808fff585b	Reinstate revisions r234755, r234759, r234760 changes: Don't apply on hexagon and NVPTX since they no longer claim to support UADDO/USUBO Add location to getConstant Drop comment about the ops being turned into expand llvm-svn: 236240	2015-04-30 17:15:56 +00:00
Daniel Jasper	0366cd23ac	Inline local variable to silence unused warning. llvm-svn: 236212	2015-04-30 08:51:13 +00:00
Elena Demikhovsky	e1eda8a9e6	Masked gather and scatter - added DAGCombine visitors and AVX-512 instruction selection patterns. All other patches, including tests will follow. http://reviews.llvm.org/D7665 llvm-svn: 236211	2015-04-30 08:38:48 +00:00
Owen Anderson	d8a029c81b	Semantically revert r236031, which is not a good idea for in-order targets. At the least it should be guarded by some kind of target hook. It also introduced catastrophic compile time and code quality regressions on some out of tree targets (test case still being reduced/sanitized). Sanjay agreed with reverting this patch until these issues can be resolved. llvm-svn: 236199	2015-04-30 04:06:32 +00:00
Hans Wennborg	4b828d35fd	Switch lowering: use profile info to build weight-balanced binary search trees This will cause hot nodes to appear closer to the root. The literature says building the tree like this makes it a near-optimal (in terms of search time given key frequencies) binary search tree. In LLVM's case, we can do up to 3 comparisons in each leaf node, so it might be better to opt for lower tree height in some cases; that's something to look into in the future. Differential Revision: http://reviews.llvm.org/D9318 llvm-svn: 236192	2015-04-30 00:57:37 +00:00
Reid Kleckner	bcda1cd45a	[WinEH] Start EH preparation for 32-bit x86, it uses no arguments 32-bit x86 MSVC-style exceptions are functionaly similar to 64-bit, but they take no arguments. Instead, they implicitly use the value of EBP passed in by the caller as a pointer to the parent's frame. In LLVM, we can represent this as llvm.frameaddress(1), and feed that into all of our calls to llvm.framerecover. The next steps are: - Add an alloca to the fs:00 linked list of handlers - Add something like llvm.sjlj.lsda or generalize it to store in the alloca - Move state number calculation to WinEHPrepare, arrange for FunctionLoweringInfo to call it - Use the state numbers to insert explicit loads and stores in the IR llvm-svn: 236172	2015-04-29 22:49:54 +00:00
Sanjay Patel	04b0e92766	generalize binop reassociation; NFC Move the fold introduced in r236031: http://reviews.llvm.org/rL236031 to its own helper function, so we can use it for other binops. This is a preliminary step before partially solving: https://llvm.org/bugs/show_bug.cgi?id=21768 https://llvm.org/bugs/show_bug.cgi?id=23116 llvm-svn: 236171	2015-04-29 22:30:02 +00:00
Pat Gavlin	022c5acad8	Run StatepointLowering.{cpp,h} through clang-format. llvm-svn: 236166	2015-04-29 21:52:45 +00:00
David Blaikie	f64246be72	[opaque pointer type] Pass GlobalAlias the actual pointer type rather than decomposing it into pointee type + address space Many of the callers already have the pointer type anyway, and for the couple of callers that don't it's pretty easy to call PointerType::get on the pointee type and address space. This avoids LLParser from using PointerType::getElementType when parsing GlobalAliases from IR. llvm-svn: 236160	2015-04-29 21:22:39 +00:00
Sanjay Patel	caf5180ff7	tidy up; NFC llvm-svn: 236156	2015-04-29 21:01:41 +00:00
Sanjay Patel	ee6678119d	too much space again; NFC llvm-svn: 236150	2015-04-29 20:38:02 +00:00
Sanjay Patel	435efaadff	too much space; NFC llvm-svn: 236147	2015-04-29 20:32:57 +00:00
Andrew Kaylor	a33f159056	[WinEH] Fix minor bug in begincatch block splitting llvm-svn: 236129	2015-04-29 17:21:26 +00:00
Duncan P. N. Exon Smith	a9308c49ef	IR: Give 'DI' prefix to debug info metadata Finish off PR23080 by renaming the debug info IR constructs from `MD` to `DI`. The last of the `DIDescriptor` classes were deleted in r235356, and the last of the related typedefs removed in r235413, so this has all baked for about a week. Note: If you have out-of-tree code (like a frontend), I recommend that you get everything compiling and tests passing with the previous commit before updating to this one. It'll be easier to keep track of what code is using the `DIDescriptor` hierarchy and what you've already updated, and I think you're extremely unlikely to insert bugs. YMMV of course. Back to this commit: I did this using the rename-md-di-nodes.sh upgrade script I've attached to PR23080 (both code and testcases) and filtered through clang-format-diff.py. I edited the tests for test/Assembler/invalid-generic-debug-node-*.ll by hand since the columns were off-by-three. It should work on your out-of-tree testcases (and code, if you've followed the advice in the previous paragraph). Some of the tests are in badly named files now (e.g., test/Assembler/invalid-mdcompositetype-missing-tag.ll should be 'dicompositetype'); I'll come back and move the files in a follow-up commit. llvm-svn: 236120	2015-04-29 16:38:44 +00:00
Jan Vesely	7539548738	CodeGen: Default overflow operations to expand so we don't have to assume targets are lying Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: ab Differential Revision: http://reviews.llvm.org/D9265 llvm-svn: 236119	2015-04-29 16:30:46 +00:00
Elena Demikhovsky	ac969012ef	Fixed masked gather/scatter switch-case llvm-svn: 236092	2015-04-29 08:38:53 +00:00
Elena Demikhovsky	744fe0de33	fixed comments, blanks, nullptr; NFC llvm-svn: 236086	2015-04-29 06:49:50 +00:00
Matthias Braun	5295793bca	RegisterCoalescer: hide terminal rule option by default llvm-svn: 236062	2015-04-28 23:55:11 +00:00
Andrew Kaylor	91307434f4	Style updates llvm-svn: 236048	2015-04-28 22:01:51 +00:00
Andrew Kaylor	046f7b42f2	[WinEH] Split blocks at calls to llvm.eh.begincatch Differential Revision: http://reviews.llvm.org/D9311 llvm-svn: 236046	2015-04-28 21:54:14 +00:00
Sanjay Patel	2fbc4e5c49	transform fadd chains to increase parallelism This is a compromise: with this simple patch, we should always handle a chain of exactly 3 operations optimally, but we're not generating the optimal balanced binary tree for a longer sequence. In general, this transform will reduce the dependency chain for a sequence of instructions using N operands from a worst case N-1 dependent operations to N/2 dependent operations. The optimal balanced binary tree would reduce the chain to log2(N). The trade-off for not dealing with longer sequences is: (1) we have less complexity in the compiler, (2) we avoid unknown compile-time blowup calculating a balanced tree, and (3) we don't need to worry about the increased register pressure required to parallelize longer sequences. It also seems unlikely that we would ever encounter really long strings of dependent ops like that in the wild, but I'm not sure how to verify that speculation. FWIW, I see no perf difference for test-suite running on btver2 (x86-64) with -ffast-math and this patch. We can extend this patch to cover other associative operations such as fmul, fmax, fmin, integer add, integer mul. This is a partial fix for: https://llvm.org/bugs/show_bug.cgi?id=17305 and if extended: https://llvm.org/bugs/show_bug.cgi?id=21768 https://llvm.org/bugs/show_bug.cgi?id=23116 The issue also came up in: http://reviews.llvm.org/D8941 Differential Revision: http://reviews.llvm.org/D9232 llvm-svn: 236031	2015-04-28 21:03:22 +00:00
Sanjay Patel	ba55804ea3	move IR-level optimization flags into their own struct This is a preliminary step to using the IR-level floating-point fast-math-flags in the SDAG (D8900). In this patch, we introduce the optimization flags as their own struct. As noted in the TODO comment, we should eventually share this data between the IR passes and the backend. We also switch the existing nsw / nuw / exact bit functionality of the BinaryWithFlagsSDNode class to use the new struct. The tradeoff is that instead of using the free but limited space of SDNode's SubclassData, we add a data member to the subclass. This means we don't have to repeat all of the get/set methods per flag, but we're potentially adding size to all nodes of this subclassi type. In practice on 64-bit systems (measured on Linux and MacOS X), there is no size difference between an SDNode and BinaryWithFlagsSDNode after this change: they're both 80 bytes. This means that we had at least one free byte to play with due to struct alignment. Differential Revision: http://reviews.llvm.org/D9325 llvm-svn: 235997	2015-04-28 16:39:12 +00:00
Sergey Dmitrouk	842a51bad8	Reapply r235977 "[DebugInfo] Add debug locations to constant SD nodes" [DebugInfo] Add debug locations to constant SD nodes This adds debug location to constant nodes of Selection DAG and updates all places that create constants to pass debug locations (see PR13269). Can't guarantee that all locations are correct, but in a lot of cases choice is obvious, so most of them should be. At least all tests pass. Tests for these changes do not cover everything, instead just check it for SDNodes, ARM and AArch64 where it's easy to get incorrect locations on constants. This is not complete fix as FastISel contains workaround for wrong debug locations, which drops locations from instructions on processing constants, but there isn't currently a way to use debug locations from constants there as llvm::Constant doesn't cache it (yet). Although this is a bit different issue, not directly related to these changes. Differential Revision: http://reviews.llvm.org/D9084 llvm-svn: 235989	2015-04-28 14:05:47 +00:00
Daniel Jasper	48e93f7181	Revert "[DebugInfo] Add debug locations to constant SD nodes" This breaks a test: http://bb.pgr.jp/builders/cmake-llvm-x86_64-linux/builds/23870 llvm-svn: 235987	2015-04-28 13:38:35 +00:00
Sergey Dmitrouk	adb4c69d5c	[DebugInfo] Add debug locations to constant SD nodes This adds debug location to constant nodes of Selection DAG and updates all places that create constants to pass debug locations (see PR13269). Can't guarantee that all locations are correct, but in a lot of cases choice is obvious, so most of them should be. At least all tests pass. Tests for these changes do not cover everything, instead just check it for SDNodes, ARM and AArch64 where it's easy to get incorrect locations on constants. This is not complete fix as FastISel contains workaround for wrong debug locations, which drops locations from instructions on processing constants, but there isn't currently a way to use debug locations from constants there as llvm::Constant doesn't cache it (yet). Although this is a bit different issue, not directly related to these changes. Differential Revision: http://reviews.llvm.org/D9084 llvm-svn: 235977	2015-04-28 11:56:37 +00:00
Elena Demikhovsky	584ce378ab	Masked gather and scatter: Added code for SelectionDAG. All other patches, including tests will follow. http://reviews.llvm.org/D7665 llvm-svn: 235970	2015-04-28 07:57:37 +00:00
Hans Wennborg	7bf4d4eee0	Switch lowering: use uint32_t for weights everywhere I previously thought switch clusters would need to use uint64_t in case the weights of multiple cases overflowed a 32-bit int. It turns out that the weights on a terminator instruction are capped to allow for being added together, so using a uint32_t should be safe. llvm-svn: 235945	2015-04-27 23:52:19 +00:00
Hans Wennborg	67c03759e4	Switch lowering: Take branch weight into account when ordering for fall-through Previously, the code would try to put a fall-through case last, even if that meant moving a case with much higher branch weight further down the chain. Ordering by branch weight is most important, putting a fall-through block last is secondary. llvm-svn: 235942	2015-04-27 23:35:22 +00:00
Hans Wennborg	ba6d2568f9	Switch lowering: order bit tests by branch weight. llvm-svn: 235912	2015-04-27 20:21:17 +00:00
Philip Reames	20c24f1da5	Make the message associated with a fatal error slightly more helpful Looking into 23095, my best guess is that the CodeGen library itself isn't getting linked and initialized properly. To make this slightly more obvious to consumers of LLVM, emit a different error message if we can tell that the registry is empty vs you've simply happened to name a collector which hasn't been registered. llvm-svn: 235824	2015-04-26 22:00:34 +00:00
Andrew Kaylor	8c384bbb35	Fix build error from accidental change llvm-svn: 235792	2015-04-24 23:34:46 +00:00
Andrew Kaylor	8c79411203	[WinEH] Find correct cloned entry block for outlined handler functions. llvm-svn: 235791	2015-04-24 23:27:32 +00:00
Andrew Kaylor	5dacfd8b8a	[WinEH] Find correct cloned entry block for outlined handler functions. llvm-svn: 235789	2015-04-24 23:10:38 +00:00
Quentin Colombet	8229145961	[DAGCombiner] Fix the type used in canFoldInAddressingMode to account for the right scaling. In the function canFoldInAddressingMode, VT is computed as the type of the destination/source of a LOAD/STORE operations, instead of the memory type of the operation. On targets with a scaling factor on the offset of the LOAD/STORE operations, the function may return false for actually valid cases. This may then prevent the selection of profitable pre or post indexed load/store operations, and instead select pre or post indexed load/store for unprofitable cases. Patch by Francois de Ferriere <francois.de-ferriere@st.com>! Differential Revision: http://reviews.llvm.org/D9146 llvm-svn: 235780	2015-04-24 21:28:00 +00:00
Kaelyn Takata	5e5524bc25	Remove an unused variable to prevent -Werror build failures. llvm-svn: 235773	2015-04-24 21:02:18 +00:00
Reid Kleckner	cfbfe6f29c	[SEH] Implement GetExceptionCode in __except blocks This introduces an intrinsic called llvm.eh.exceptioncode. It is lowered by copying the EAX value live into whatever basic block it is called from. Obviously, this only works if you insert it late during codegen, because otherwise mid-level passes might reschedule it. llvm-svn: 235768	2015-04-24 20:25:05 +00:00
Lang Hames	9ff69c8f4d	[AsmPrinter] Make AsmPrinter's OutStreamer member a unique_ptr. AsmPrinter owns the OutStreamer, so an owning pointer makes sense here. Using a reference for this is crufty. llvm-svn: 235752	2015-04-24 19:11:51 +00:00
Hans Wennborg	ec679a8b3b	Switch lowering: fix APInt overflow causing infinite loop / OOM llvm-svn: 235729	2015-04-24 16:53:55 +00:00
Reid Kleckner	2c3ccaacb7	[WinEH] Split the landingpad BB instead of cloning it This means we don't have to RAUW the landingpad instruction and landingpad BB, which is a nice win. llvm-svn: 235725	2015-04-24 16:22:19 +00:00
Matthias Braun	f2a08dcaf6	RegisterCoalescer: implicit phsreg uses are fine when rematerializing The target hooks should have already checked them. This change is necessary to enable the remateriailzation on R600. llvm-svn: 235673	2015-04-24 00:01:37 +00:00
Matthias Braun	43fb8a157b	RegisterCoalescer: Avoid unnecessary register class widening for some rematerializations I couldn't provide a testcase as none of the public targets has wide register classes with alot of subregisters and at the same time an instruction which "ReMaterializable" and "AsCheapAsAMove" (could probably be added for R600). llvm-svn: 235668	2015-04-23 23:24:36 +00:00
Reid Kleckner	5c5facc2ce	Re-commit "[SEH] Remove the old __C_specific_handler code now that WinEHPrepare works" This reverts commit r235617. r235649 should have addressed the problems. llvm-svn: 235667	2015-04-23 23:22:33 +00:00
Andrew Kaylor	20ae2a311f	[WinEH] Ignore filter clauses while mapping landing pad blocks. llvm-svn: 235656	2015-04-23 22:38:36 +00:00
Reid Kleckner	1ac225219c	Remove trivial assert to fix NDEBUG Werror builds llvm-svn: 235652	2015-04-23 21:36:32 +00:00
Reid Kleckner	e3af86e9d9	[WinEH] Replace more lpad value uses with undef We were asserting on code like this: extern "C" unsigned long _exception_code(); void might_crash(unsigned long); void foo() { __try { might_crash(0); } __except(1) { might_crash(_exception_code()); } } Gtest and many other libraries get the exception code from the __except block. What's supposed to happen here is that EAX is live into the __except block, and it contains the exception code. Eventually we'll represent that as a use of the landingpad ehptr value, but for now we can replace it with undef. llvm-svn: 235649	2015-04-23 21:22:30 +00:00
Quentin Colombet	796d906e06	[MachineCopyPropagation] Handle undef flags conservatively so that we do not remove copies that are useful after breaking some hardware dependencies. In other words, handle this kind of situations conservatively by assuming reg2 is redefined by the undef flag. reg1 = copy reg2 = inst reg2<undef> reg2 = copy reg1 Copy propagation used to remove the last copy. This is incorrect because the undef flag on reg2 in inst, allows next passes to put whatever trashed value in reg2 that may help. In practice we end up with this code: reg1 = copy reg2 reg2 = 0 = inst reg2<undef> reg2 = copy reg1 This fixes PR21743. llvm-svn: 235647	2015-04-23 21:17:39 +00:00
Andrew Kaylor	5f715522f1	[WinEH] Handle stubs for outlined functions that have only unreached terminators. llvm-svn: 235618	2015-04-23 18:37:39 +00:00
Reid Kleckner	909ea7e6b8	Revert "[SEH] Remove the old __C_specific_handler code now that WinEHPrepare works" We still have some "uses remain after removal" issues in -O0 builds. This reverts commit r235557. llvm-svn: 235617	2015-04-23 18:34:01 +00:00
Hans Wennborg	0867b151c9	Re-commit r235560: Switch lowering: extract jump tables and bit tests before building binary tree (PR22262) Third time's the charm. The previous commit was reverted as a reverse for-loop in SelectionDAGBuilder::lowerWorkItem did 'I--' on an iterator at the beginning of a vector, causing asserts when using debugging iterators. This commit fixes that. llvm-svn: 235608	2015-04-23 16:45:24 +00:00
Aaron Ballman	0be238cebd	Revert r235560; this commit was causing several failed assertions in Debug builds using MSVC's STL. The iterator is being used outside of its valid range. llvm-svn: 235597	2015-04-23 13:41:59 +00:00
Simon Pilgrim	86b034bae9	[DAGCombiner] Remove extra bitcasts surrounding vector shuffles Patch to remove extra bitcasts from shuffles, this is often a legacy of XformToShuffleWithZero being used to combine bitmaskings (of float vectors bitcast to integer vectors) into shuffles: bitcast(shuffle(bitcast(s0),bitcast(s1))) -> shuffle(s0,s1) Differential Revision: http://reviews.llvm.org/D9097 llvm-svn: 235578	2015-04-23 08:43:13 +00:00
Andrew Kaylor	43e1d76278	[WinEH] Don't skip landing pads that end with an unreachable instruction. llvm-svn: 235563	2015-04-23 00:20:44 +00:00
Hans Wennborg	15823d49b6	Switch lowering: extract jump tables and bit tests before building binary tree (PR22262) This is a re-commit of r235101, which also fixes the problems with the previous patch: - Switches with only a default case and non-fallthrough were handled incorrectly - The previous patch tickled a bug in PowerPC Early-Return Creation which is fixed here. > This is a major rewrite of the SelectionDAG switch lowering. The previous code > would lower switches as a binary tre, discovering clusters of cases > suitable for lowering by jump tables or bit tests as it went along. To increase > the likelihood of finding jump tables, the binary tree pivot was selected to > maximize case density on both sides of the pivot. > > By not selecting the pivot in the middle, the binary trees would not always > be balanced, leading to performance problems in the generated code. > > This patch rewrites the lowering to search for clusters of cases > suitable for jump tables or bit tests first, and then builds the binary > tree around those clusters. This way, the binary tree will always be balanced. > > This has the added benefit of decoupling the different aspects of the lowering: > tree building and jump table or bit tests finding are now easier to tweak > separately. > > For example, this will enable us to balance the tree based on profile info > in the future. > > The algorithm for finding jump tables is quadratic, whereas the previous algorithm > was O(n log n) for common cases, and quadratic only in the worst-case. This > doesn't seem to be major problem in practice, e.g. compiling a file consisting > of a 10k-case switch was only 30% slower, and such large switches should be rare > in practice. Compiling e.g. gcc.c showed no compile-time difference. If this > does turn out to be a problem, we could limit the search space of the algorithm. > > This commit also disables all optimizations during switch lowering in -O0. > > Differential Revision: http://reviews.llvm.org/D8649 llvm-svn: 235560	2015-04-22 23:14:56 +00:00
Reid Kleckner	64a2a6a473	[SEH] Remove the old __C_specific_handler code now that WinEHPrepare works This removes the -sehprepare flag and makes __C_specific_handler functions always to use WinEHPrepare. This was tested by building all of chromium_builder_tests and running a few tests that use SEH, but if something breaks, we can revert this. llvm-svn: 235557	2015-04-22 22:13:09 +00:00
Reid Kleckner	fd7df284b8	[WinEH] Demote values and phis live across exception handlers up front In particular, this handles SSA values that are live out of a handler. The existing code only handles values that are live in to a handler. It also handles phi nodes in the block where normal control should resume after the end of a catch handler. When EH return points have phi nodes, we need to split the return edge. It is impossible for phi elimination to emit copies in the previous block if that block gets outlined. The indirectbr that we leave in the function is only notional, and is eliminated from the MachineFunction CFG early on. Reviewers: majnemer, andrew.w.kaylor Differential Revision: http://reviews.llvm.org/D9158 llvm-svn: 235545	2015-04-22 21:05:21 +00:00
Luqman Aden	c76f470c2d	Test commit: fix typo in comment. llvm-svn: 235526	2015-04-22 17:42:37 +00:00
Olivier Sallenave	c587bee405	Fixed logic to enable complex FMA formation. llvm-svn: 235508	2015-04-22 14:07:26 +00:00
Hal Finkel	0d49cf2645	[DAGCombine] Disable select(c, load,load) for indexed loads This turned up after r235333, but was a pre-existing bug. The optimization which transforms select(c, load, load) into a load of a select of the addresses does not handle indexed loads (pre/post inc/dec). However, it did not check for them either, leading to a crash if it tried to transform one of them. llvm-svn: 235497	2015-04-22 11:32:25 +00:00
Lang Hames	65613a634a	[patchpoint] Add support for symbolic patchpoint targets to SelectionDAG and the X86 backend. The code generated for symbolic targets is identical to the code generated for constant targets, except that a relocation is emitted to fix up the actual target address at link-time. This allows IR and object files containing patchpoints to be cached across JIT-invocations where the target address may change. llvm-svn: 235483	2015-04-22 06:02:31 +00:00
Reid Kleckner	f14787dad8	[WinEH] Correctly handle inlined __finally blocks with captures We should also teach the inliner to collapse framerecover of frameaddress of the current frame down to an alloca, but that can happen later. llvm-svn: 235459	2015-04-22 00:07:52 +00:00
Duncan P. N. Exon Smith	aa861aa483	DebugInfo: Remove DIArray and DITypeArray typedefs Remove the `DIArray` and `DITypeArray` typedefs, preferring the underlying types (`DebugNodeArray` and `MDTypeRefArray`, respectively). llvm-svn: 235413	2015-04-21 20:07:38 +00:00
Duncan P. N. Exon Smith	60635e39b6	DebugInfo: Drop rest of DIDescriptor subclasses Delete the remaining subclasses of (the already deleted) `DIDescriptor`. Part of PR23080. llvm-svn: 235404	2015-04-21 18:44:06 +00:00
Duncan P. N. Exon Smith	d4a19a396d	DebugInfo: Assert dbg.declare/value insts are valid Remove early returns for when `getVariable()` is null, and just assert that it never happens. The Verifier already confirms that there's a valid variable on these intrinsics, so we should assume the debug info isn't broken. I also updated a check for a `!dbg` attachment, which the Verifier similarly guarantees. llvm-svn: 235400	2015-04-21 18:24:23 +00:00
Reid Kleckner	d2a1a51996	Re-land r235154-r235156 under the existing -sehprepare flag Keep the old SEH fan-in lowering on by default for now, since projects rely on it. This will make it easy to test this change with a simple flag flip. llvm-svn: 235399	2015-04-21 18:23:57 +00:00
Simon Pilgrim	860f08779c	CONCAT_VECTOR of BUILD_VECTOR - minor fix Fixed issue with the combine of CONCAT_VECTOR of 2 BUILD_VECTOR nodes - the optimisation wasn't ensuring that the scalar operands of both nodes were the same type/size for implicit truncation. Test case spotted by Patrik Hagglund llvm-svn: 235371	2015-04-21 08:05:43 +00:00
Pawel Bylica	57c2f7c756	Fix generic shift expansion when shift amount is 0 Summary: This fixes http://llvm.org/bugs/show_bug.cgi?id=16439. This is one possible way to approach this. The other would be to split InL>>(nbits-Amt) into (InL>>(nbits-1-Amt))>>1, which is also valid since since we only need to care about Amt up nbits-1. It's hard to tell which one is better since the shift might be expensive if this stage of expansion is not yet a legal machine integer, whereas comparisons with zero are relatively cheap at all sizes, but more expensive than a shift if the shift is on a legal machine type. Patch by Keno Fischer! Test Plan: regression test from http://reviews.llvm.org/D7752 Reviewers: chfast, resistor Reviewed By: chfast, resistor Subscribers: sanjoy, resistor, chfast, llvm-commits Differential Revision: http://reviews.llvm.org/D4978 llvm-svn: 235370	2015-04-21 06:28:36 +00:00
Andrew Kaylor	00e5d9ee5f	[WinEH] Fix problem with landing pad return values used in PHI nodes during outlining. llvm-svn: 235358	2015-04-20 22:53:42 +00:00
Duncan P. N. Exon Smith	2fbe13540a	DebugInfo: Delete subclasses of DIScope Delete subclasses of (the already defunct) `DIScope`, updating users to use the raw pointers from the `Metadata` hierarchy directly. llvm-svn: 235356	2015-04-20 22:10:08 +00:00
Andrew Kaylor	41758517bf	[WinEH] Fix problem with mapping shared empty handler blocks. Differential Revision: http://reviews.llvm.org/D9125 llvm-svn: 235354	2015-04-20 22:04:09 +00:00
Duncan P. N. Exon Smith	c62468859a	DebugInfo: Delete old subclasses of DIType Delete subclasses of (the already deleted) `DIType` in favour of directly using pointers from the `Metadata` hierarchy. While `DICompositeType` wraps `MDCompositeTypeBase` and `DIDerivedType` wraps `MDDerivedTypeBase`, most uses of each really meant the more specific `MDCompositeType` and `MDDerivedType`. llvm-svn: 235351	2015-04-20 21:17:32 +00:00
Duncan P. N. Exon Smith	698df36ab7	DwarfUnit: Split MDSubroutineType version of constructTypeDIE() The version of `constructTypeDIE()` for `MDSubroutineType` is unrelated to (and has different callers than) the `MDCompositeType`. Split the two in half. This simplifies an upcoming patch to delete `DICompositeType`. There shouldn't be any real functionality change here. `createTypeDIE()` is `cast<>`'ing where it didn't need to before, but that function in turn is only called for true `MDCompositeType`s. llvm-svn: 235349	2015-04-20 21:04:33 +00:00
Duncan P. N. Exon Smith	d89ef16aa9	DwarfUnit: Cleanup comments Update comment style in `DwarfUnit`. - Drop duplicated comments at definition, and update the comments at the declaration where the definition comments looked newer or more complete. - Drop the `functionName -` prefix. - Add `\brief` in a few places. - Remove a few comments entirely that weren't adding value (just turned the function name and arguments into a sentence). llvm-svn: 235345	2015-04-20 20:29:51 +00:00
Olivier Sallenave	b99c2eb0f0	Refactoring and enhancement to FMA combine. llvm-svn: 235344	2015-04-20 20:29:40 +00:00
Tom Stellard	69a7b91e95	DAGCombine: Remove redundant NaN checks around ISD::FSQRT This folds: (select (setcc x, -0.0, *lt), NaN, (fsqrt x)) -> ( fsqrt x) llvm-svn: 235333	2015-04-20 19:38:27 +00:00
Duncan P. N. Exon Smith	9928a909c6	DebugInfo: Remove DIType This is the last major parent class, so I'll probably start deleting classes in batches now. Looks like many of the references to the DI* hierarchy were updated organically along the way. llvm-svn: 235331	2015-04-20 18:52:06 +00:00
Andrew Kaylor	f18771bdfd	[WinEH] Fix memory leak with catch-all mapping. llvm-svn: 235328	2015-04-20 18:48:45 +00:00
Duncan P. N. Exon Smith	be9e4fe768	DebugInfo: Remove DIScope Replace uses of `DIScope` with `MDScope`. There was one spot where I've left an `MDScope` uninitialized (where `DIScope` would have been default-initialized to `nullptr`) -- this is intentional, since the if/else that follows should unconditional assign it to a value. llvm-svn: 235327	2015-04-20 18:32:29 +00:00
Duncan P. N. Exon Smith	848af387d8	DebugInfo: Remove typedefs for DITypeRef, etc. Remove typedefs for type refs: - DITypeRef => MDTypeRef - DIScopeRef => MDScopeRef - DIDescriptorRef => DebugNodeRef llvm-svn: 235323	2015-04-20 18:20:03 +00:00
Hal Finkel	1e5733bbed	[InlineAsm] Remove EarlyClobber on registers that are also inputs When an inline asm call has an output register marked as early-clobber, but that same register is also an input operand, what should we do? GCC accepts this, and is documented to accept this for read/write operands saying, "Furthermore, if the earlyclobber operand is also a read/write operand, then that operand is written only after it's used." For write-only operands, the situation seems less clear, but I have at least one existing codebase that assumes this will work, in part because it has syscall macros like this: ({ \ register uint64_t r0 __asm__ ("r0") = (__NR_ ## name); \ register uint64_t r3 __asm__ ("r3") = ((uint64_t) (arg0)); \ register uint64_t r4 __asm__ ("r4") = ((uint64_t) (arg1)); \ register uint64_t r5 __asm__ ("r5") = ((uint64_t) (arg2)); \ __asm__ __volatile__ \ ("sc" \ : "=&r"(r0),"=&r"(r3),"=&r"(r4),"=&r"(r5) \ : "0"(r0), "1"(r3), "2"(r4), "3"(r5) \ : "r6","r7","r8","r9","r10","r11","r12","cr0","memory"); \ r3; \ }) Furthermore, with register aliases and subregister relationships that only the backend knows about, rejecting this in the frontend seems like a difficult proposition (if we wanted to do so). However, keeping the early-clobber flag on the INLINEASM MI does not work for us, because it will cause the register's live interval to end to soon (so it will not appear defined to be used as an input). Fortunately, fixing this does not seem hard: When forming the INLINEASM MI, check to see if any of the early-clobber outputs are also inputs, and if so, remove the early-clobber flag. llvm-svn: 235283	2015-04-20 00:01:30 +00:00
Eric Christopher	d2e3ddad14	Remove CFIFuncName from TargetOptions as it is currently unused. llvm-svn: 235268	2015-04-19 03:21:04 +00:00
Eric Christopher	78804ab2df	Remove the CFIEnforcing flag from TargetOptions as it is unused. llvm-svn: 235267	2015-04-19 03:20:59 +00:00
Ahmed Bougacha	279e3ee954	[GlobalMerge] Look at uses to create smaller global sets. Instead of merging everything together, look at the users of GlobalVariables, and try to group them by function, to create sets of globals used "together". Using that information, a less-aggressive alternative is to keep merging everything together except globals that are only ever used alone, that is, those for which it's clearly non-profitable to merge with others. In my testing, grouping by Function is too aggressive, but grouping by BasicBlock is too conservative. Anything in-between isn't trivially available, so stick with Function grouping for now. cl::opts are added for testing; both enabled by default. A few of the testcases aren't testing the merging proper, but just various edge cases when merging does occur. Update them to use the previous grouping behavior. Also, one of the tests is unrelated to GlobalMerge; change it accordingly. While there, switch to r234666' flags rather than the brutal -O3. Differential Revision: http://reviews.llvm.org/D8070 llvm-svn: 235249	2015-04-18 01:21:58 +00:00
Duncan P. N. Exon Smith	7c60f20e49	DebugInfo: Delete DIDescriptor (but not its subclasses) Delete `DIDescriptor` and update the remaining users. I'll follow-up by deleting subclasses in manageable groups (top-down). llvm-svn: 235248	2015-04-18 00:35:36 +00:00
Andrew Kaylor	761fb44efe	Fix build wanrings and line endings llvm-svn: 235241	2015-04-17 23:20:24 +00:00
Duncan P. N. Exon Smith	ed557b55ee	DebugInfo: Remove DIDescriptor from the DebugInfo API Stop using `DIDescriptor` and its subclasses in the `DebugInfoFinder` API, as well as the rest of the API hanging around in `DebugInfo.h`. llvm-svn: 235240	2015-04-17 23:20:10 +00:00
Andrew Kaylor	ea8df61d4d	[WinEH] Fixes for a few cppeh failures. Differential Review: http://reviews.llvm.org/D9065 llvm-svn: 235239	2015-04-17 23:05:43 +00:00
Duncan P. N. Exon Smith	364a3005f2	AsmPrinter: Create a unified .debug_loc stream This commit removes `DebugLocList` and replaces it with `DebugLocStream`. - `DebugLocEntry` no longer contains its byte/comment streams. - The `DebugLocEntry` list for a variable/inlined-at pair is allocated on the stack, and released right after `DebugLocEntry::finalize()` (possible because of the refactoring in r231023). Now, only one list is in memory at a time now. - There's a single unified stream for the `.debug_loc` section that persists, stored in the new `DebugLocStream` data structure. The last point is important: this collapses the nested `SmallVector<>`s from `DebugLocList` into unified streams. We previously had something like the following: vec<tuple<Label, CU, vec<tuple<BeginSym, EndSym, vec<Value>, vec<char>, vec<string>>>>> A `SmallVector` can avoid allocations, but is statically fairly large for a vector: three pointers plus the size of the small storage, which is the number of elements in small mode times the element size). Nesting these is expensive, since an inner vector's size contributes to the element size of an outer one. (Nesting any vector is expensive...) In the old data structure, the outer vector's element size was 632B, excluding allocation costs for when the middle and inner vectors exceeded their small sizes. 312B of this was for the "three" pointers in the vector-tree beneath it. If you assume 1M functions with an average of 10 variable/inlined-at pairs each (in an LTO scenario), that's almost 6GB (besides inner allocations), with almost 3GB for the "three" pointers. This came up in a heap profile a little while ago of a `clang -flto -g` bootstrap, with `DwarfDebug::collectVariableInfo()` using something like 10-15% of the total memory. With this commit, we have: tuple<vec<tuple<Label, CU, Offset>>, vec<tuple<BeginSym, EndSym, Offset, Offset>>, vec<char>, vec<string>> The offsets are used to create `ArrayRef` slices of adjacent `SmallVector`s. This reduces the number of vectors to four (unrelated to the number of variable/inlined-at pairs), and caps the number of allocations at the same number. Besides saving memory and limiting allocations, this is NFC. I don't know my way around this code very well yet, but I wonder if we could go further: why stream to a side-table, instead of directly to the output stream? llvm-svn: 235229	2015-04-17 21:34:47 +00:00
Duncan P. N. Exon Smith	237662429d	Remove dead code, NFC llvm-svn: 235225	2015-04-17 21:06:49 +00:00
David Majnemer	dcd89368cb	[WinEH] Reusing HandlerType entries leads to small CatchHigh values CatchHigh may be smaller than TryHigh if we reuse an outlined catch handler for two different invokes with different EH states. We have no evidence which shows that CatchHigh must be greater than TryHigh or TryLow. We can revisit this if we turn out to be wrong. llvm-svn: 235223	2015-04-17 20:12:09 +00:00
Pirama Arumuga Nainar	50604a69e9	Fix build errors introduced by r235215 Summary: - Handle TypePromoteFloat in switch statements - Move an expression into an assert to avoid unused variable in non-assert builds. Reviewers: srhines, ab Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9086 llvm-svn: 235220	2015-04-17 19:51:44 +00:00
Pirama Arumuga Nainar	db7c07e2bf	Add support to promote f16 to f32 Summary: This patch adds legalization support to operate on FP16 as a load/store type and do operations on it as floats. Tests for ARM are added to test/CodeGen/ARM/fp16-promote.ll Reviewers: srhines, t.p.northover Differential Revision: http://reviews.llvm.org/D8755 llvm-svn: 235215	2015-04-17 18:36:25 +00:00
David Majnemer	2be05eef31	[WinEH] Allow CatchHigh to be equal to TryHigh Catch blocks which are empty may be in the same state as their try blocks. It is not meaningful to give the catch block its own state number in this case because it can't do anything exceptional. llvm-svn: 235212	2015-04-17 17:20:30 +00:00
Duncan P. N. Exon Smith	c0f7dd72b7	AsmPrinter: Store MDExpression directly instead of MDNode, NFC Clean up `DebugLocEntry::Value::Expression`'s type while I'm messing around in here anyway. llvm-svn: 235203	2015-04-17 16:36:10 +00:00
Duncan P. N. Exon Smith	546c8be967	AsmPrinter: Stop storing MDLocalVariable in DebugLocEntry Stop storing the `MDLocalVariable` in the `DebugLocEntry::Value`s. We generate the list of `DebugLocEntry`s separately for each variable/inlined-at pair, so the variable never actually changes here. This is effectively NFC (aside from saving some memory and CPU time). llvm-svn: 235202	2015-04-17 16:33:37 +00:00
Duncan P. N. Exon Smith	fba25d6e9b	AsmPrinter: Calculate type upfront for location lists, NFC We can calculate the variable type up front before calling `DebugLocEntry::finalize()`. In fact, since we only care about the type if it's an `MDBasicType`, don't even bother resolving it using the type identifier map. llvm-svn: 235201	2015-04-17 16:28:58 +00:00
James Molloy	a4ff7b2713	Fix TRUNCATE splitting helper logic. This is a followon to r233681 - I'd misunderstood the semantics of FTRUNC, and had confused it with (FP_ROUND ..., 0). Thanks for Ahmed Bougacha for his post-commit review! llvm-svn: 235191	2015-04-17 13:51:40 +00:00
Nico Weber	a762fa6c98	Revert r235154-r235156, they cause asserts when building win64 code (http://crbug.com/477988 ) llvm-svn: 235170	2015-04-17 09:10:43 +00:00
Reid Kleckner	69afb1f8ef	Fix unused variable warning llvm-svn: 235155	2015-04-17 01:03:30 +00:00
Reid Kleckner	d4523e3c51	[SEH] Reimplement x64 SEH using WinEHPrepare This now emits simple, unoptimized xdata tables for __C_specific_handler based on the handlers listed in @llvm.eh.actions calls produced by WinEHPrepare. This adds support for running __finally blocks when exceptions are thrown, and removes the old landingpad fan-in codepath. I ran some manual execution tests on small basic test cases with and without optimization, as well as on Chrome base_unittests, which uses a small amount of SEH. I'm sure there are bugs, and we may need to revert. llvm-svn: 235154	2015-04-17 01:01:27 +00:00
Duncan P. N. Exon Smith	7bb480dbc2	DebugInfo: Fix UserValue::match() in LiveDebugVariables after r235050 r235050 dropped the inlined-at field from `MDLocalVariable`, deferring to the `!dbg` attachments. Fix `UserValue` to take the `!dbg` into account when differentiating between variables. llvm-svn: 235140	2015-04-16 22:27:54 +00:00
Duncan P. N. Exon Smith	9f25633170	AsmPrinter: Remove dead code, NFC llvm-svn: 235139	2015-04-16 22:14:20 +00:00
Duncan P. N. Exon Smith	78a9527062	AsmPrinter: Simplify logic for debug info intrinsics' !dbg attachments These are required, so just assume they're there. llvm-svn: 235138	2015-04-16 22:12:59 +00:00
Reid Kleckner	1c130bb04b	[WinEH] Handle a landingpad, resume, and cleanup all rolled into a BB This happens a lot with simple cleanups after SimplifyCFG. llvm-svn: 235117	2015-04-16 17:02:23 +00:00
Hans Wennborg	a9e2057416	Revert the switch lowering change (r235101, r235103, r235106) Looks like it broke the sanitizer-ppc64-linux1 build. Reverting for now. llvm-svn: 235108	2015-04-16 15:43:26 +00:00
Hans Wennborg	d403664ed8	Switch lowering: extract jump tables and bit tests before building binary tree (PR22262) This is a major rewrite of the SelectionDAG switch lowering. The previous code would lower switches as a binary tre, discovering clusters of cases suitable for lowering by jump tables or bit tests as it went along. To increase the likelihood of finding jump tables, the binary tree pivot was selected to maximize case density on both sides of the pivot. By not selecting the pivot in the middle, the binary trees would not always be balanced, leading to performance problems in the generated code. This patch rewrites the lowering to search for clusters of cases suitable for jump tables or bit tests first, and then builds the binary tree around those clusters. This way, the binary tree will always be balanced. This has the added benefit of decoupling the different aspects of the lowering: tree building and jump table or bit tests finding are now easier to tweak separately. For example, this will enable us to balance the tree based on profile info in the future. The algorithm for finding jump tables is O(n^2), whereas the previous algorithm was O(n log n) for common cases, and quadratic only in the worst-case. This doesn't seem to be major problem in practice, e.g. compiling a file consisting of a 10k-case switch was only 30% slower, and such large switches should be rare in practice. Compiling e.g. gcc.c showed no compile-time difference. If this does turn out to be a problem, we could limit the search space of the algorithm. This commit also disables all optimizations during switch lowering in -O0. Differential Revision: http://reviews.llvm.org/D8649 llvm-svn: 235101	2015-04-16 14:49:23 +00:00
Simon Pilgrim	6bd5d3caa9	TRUNCATE constant folding - minor fix for rL233224 Fix for test case found by James Molloy - TRUNCATE of constant build vectors can be more simply achieved by simply replacing with a new build vector node with the truncated value type - no need to touch the scalar operands at all. llvm-svn: 235079	2015-04-16 08:21:09 +00:00
Ahmed Bougacha	c984b90c86	[CodeGen] Re-apply r234809 (concat of scalars), with an x86_mmx fix. The only type that isn't an integer, isn't floating point, and isn't a vector; ladies and gentlemen, the gift that keeps on giving: x86_mmx! Fixes PR23246. Original message (reverted in r235062): [CodeGen] Combine concat_vectors of scalars into build_vector. Combine something like: (v8i8 concat_vectors (v2i8 bitcast (i16)) x4) into: (v8i8 (bitcast (v4i16 BUILD_VECTOR (i16) x4))) If any of the scalars are floating point, use that throughout. Differential Revision: http://reviews.llvm.org/D8948 llvm-svn: 235072	2015-04-16 02:39:14 +00:00
Duncan P. N. Exon Smith	b0b0e4958b	DebugInfo: DIRef<> => TypedDebugNodeRef<> Delete `DIRef<>`, and replace the remaining uses of it with `TypedDebugNodeRef<>`. To minimize code churn, I've added typedefs from `MDTypeRef` to `DITypeRef` (etc.). llvm-svn: 235071	2015-04-16 02:24:01 +00:00
Duncan P. N. Exon Smith	f15c6f8032	DebugInfo: Gut DIDescriptor PR23080 is almost finished. With this commit, there's no consequential API in `DIDescriptor` and its subclasses. What's left? - Default-constructed to `nullptr`. - Handy `const_cast<>` (constructed from `const`, but accessors are non-`const`). I think the safe way to catch those is to delete the classes and fix compile errors. That'll be my next step, after I delete the `DITypeRef` (etc.) wrapper around `MDTypeRef`. llvm-svn: 235069	2015-04-16 01:53:33 +00:00
Duncan P. N. Exon Smith	b273d06b63	DebugInfo: Gut DIScope, DIEnumerator and DISubrange The only class the still has API left is `DIDescriptor` itself. llvm-svn: 235067	2015-04-16 01:37:00 +00:00
Duncan P. N. Exon Smith	b105564015	DebugInfo: Gut DIType and subclasses Continuing PR23080, gut `DIType` and its various subclasses, leaving behind thin wrappers around the pointer types in the new debug info hierarchy. llvm-svn: 235064	2015-04-16 01:01:28 +00:00
Nick Lewycky	b8557a972f	Revert r234809 because it caused PR23246. llvm-svn: 235062	2015-04-16 00:56:20 +00:00
Reid Kleckner	8676214025	[SEH] Deal with users of the old lpad for SEH catch-all blocks The way we split SEH catch-all blocks can leave some dead EH values behind at -O0. Try to remove them, and if we fail, replace them all with undef. Fixes a crash when removing the old unreachable landingpad which is still used by extractvalue instructions in the catch-all block. llvm-svn: 235061	2015-04-16 00:02:04 +00:00
Duncan P. N. Exon Smith	1d1a8e00b8	DebugInfo: Remove unnecessary API from DIDerivedType and DIType Remove the accessors of `DIDerivedType` that downcast to `MDDerivedType`, shifting the `cast<MDDerivedType>` into the callers. Also remove `DIType::isValid()`, which is really just a check against `nullptr` at this point. llvm-svn: 235059	2015-04-15 23:49:09 +00:00
Duncan P. N. Exon Smith	35ef22cf53	DebugInfo: Gut DICompileUnit and DIFile Continuing gutting `DIDescriptor` subclasses; this edition, `DICompileUnit` and `DIFile`. In the name of PR23080. llvm-svn: 235055	2015-04-15 23:19:27 +00:00
Duncan P. N. Exon Smith	768d96ca6f	DebugInfo: factor out dead variable in NDEBUG from r235050 Caught by the lld bot: http://lab.llvm.org:8011/builders/lld-x86_64-darwin13/builds/9832 llvm-svn: 235052	2015-04-15 22:42:43 +00:00
Duncan P. N. Exon Smith	62e0f454a0	DebugInfo: Remove 'inlinedAt:' field from MDLocalVariable Remove 'inlinedAt:' from MDLocalVariable. Besides saving some memory (variables with it seem to be single largest `Metadata` contributer to memory usage right now in -g -flto builds), this stops optimization and backend passes from having to change local variables. The 'inlinedAt:' field was used by the backend in two ways: 1. To tell the backend whether and into what a variable was inlined. 2. To create a unique id for each inlined variable. Instead, rely on the 'inlinedAt:' field of the intrinsic's `!dbg` attachment, and change the DWARF backend to use a typedef called `InlinedVariable` which is `std::pair<MDLocalVariable, MDLocation>`. This `DebugLoc` is already passed reliably through the backend (as verified by r234021). This commit removes the check from r234021, but I added a new check (that will survive) in r235048, and changed the `DIBuilder` API in r235041 to require a `!dbg` attachment whose 'scope:` is in the same `MDSubprogram` as the variable's. If this breaks your out-of-tree testcases, perhaps the script I used (mdlocalvariable-drop-inlinedat.sh) will help; I'll attach it to PR22778 in a moment. llvm-svn: 235050	2015-04-15 22:29:27 +00:00
Reid Kleckner	3e9fadfbc8	[WinEH] Try to make the MachineFunction CFG more accurate This avoids emitting code for unreachable landingpad blocks that contain calls to llvm.eh.actions and indirectbr. It's also a first step towards unifying the SEH and WinEH lowering codepaths. I'm keeping the old fan-in lowering of SEH around until the preparation version works well enough that we can switch over without breaking existing users. llvm-svn: 235037	2015-04-15 18:48:15 +00:00
Reid Kleckner	6e3b5d40fc	Reland "[WinEH] Use the parent function when computing frameescape labels" Fixed the test by removing extraneous quotes. llvm-svn: 235028	2015-04-15 17:47:26 +00:00
Reid Kleckner	7ce2baeb81	Revert "[WinEH] Use the parent function when computing frameescape labels" This reverts commit r235025. The test isn't passing yet. llvm-svn: 235027	2015-04-15 17:43:54 +00:00
Daniel Berlin	25db4f4141	Add range iterators for post order and inverse post order. Use them llvm-svn: 235026	2015-04-15 17:41:42 +00:00
Reid Kleckner	d0275ed8b4	[WinEH] Use the parent function when computing frameescape labels Fixes assertions in MC when a local label wasn't defined. llvm-svn: 235025	2015-04-15 17:32:01 +00:00
Chandler Carruth	26d3017b8e	[MBP] Spell the conditions the same way through out this if statement. NFC. llvm-svn: 235009	2015-04-15 13:39:42 +00:00
Chandler Carruth	cfb2b9d755	[MBP] Sink a comment into the if block to which it pertains. This makes the content of the comment make much more sense. llvm-svn: 235007	2015-04-15 13:26:41 +00:00
Chandler Carruth	9a512a48b2	[MBP] Fix a really misleading typo in a comment. llvm-svn: 235006	2015-04-15 13:19:54 +00:00
Richard Trieu	6b1aa5f5e1	Change range-based for-loops to be -Wrange-loop-analysis clean. No functionality change. llvm-svn: 234963	2015-04-15 01:21:15 +00:00
Rafael Espindola	5560a4cfbd	Use raw_pwrite_stream in the object writer/streamer. The ELF object writer will take advantage of that in the next commit. llvm-svn: 234950	2015-04-14 22:14:34 +00:00
Reid Kleckner	e5f13831d0	[WinEH] Avoid emitting xdata tables twice for cleanups Since adding invokes of llvm.donothing to cleanups, we come here now, and trivial EH cleanup usage from clang fails to compile. llvm-svn: 234948	2015-04-14 21:42:36 +00:00
David Blaikie	877354a2f7	DebugInfo: Pubnames: Do not include variable declarations in pubnames This causes badness for GDB which expects to find a definition in any compile_unit that has an entry for the variable in its pubnames. llvm-svn: 234915	2015-04-14 18:08:25 +00:00
Daniel Jasper	274928fec6	[MachineLICM] Use newer model of register pressure sets. TargetRegisterInfo::getRegPressureLimit has a note that it is an old model that relies on manually entered classes. Using the newer model of register pressure sets seems more appropriate. We might eventually even switch to lib/CodeGen/RegisterPressure.cpp, but we should probably do incremental changes here. Using the newer model also makes it easier to take regmasks into account which is necessary to fix llvm.org/PR23143. I am currently also preparing a patch for that, but would like to do this switch independently. Review: http://reviews.llvm.org/D8986 llvm-svn: 234880	2015-04-14 11:56:25 +00:00
Duncan P. N. Exon Smith	537b4a8159	DebugInfo: Gut DISubprogram and DILexicalBlock* Gut the `DIDescriptor` wrappers around `MDLocalScope` subclasses. Note that `DILexicalBlock` wraps `MDLexicalBlockBase`, not `MDLexicalBlock`. llvm-svn: 234850	2015-04-14 03:40:37 +00:00
Duncan P. N. Exon Smith	20caafbfd6	DebugInfo: Gut DINamespace and DITemplate*Parameter Continue gutting `DIDescriptor` subclasses, turning them into as-bare-as-possible pointer wrappers. llvm-svn: 234843	2015-04-14 03:01:27 +00:00
Duncan P. N. Exon Smith	7348ddaa74	DebugInfo: Gut DIVariable and DIGlobalVariable Gut all the non-pointer API from the variable wrappers, except an implicit conversion from `DIGlobalVariable` to `DIDescriptor`. Note that if you're updating out-of-tree code, `DIVariable` wraps `MDLocalVariable` (`MDVariable` is a common base class shared with `MDGlobalVariable`). llvm-svn: 234840	2015-04-14 02:22:36 +00:00
Duncan P. N. Exon Smith	32e7f2860b	DebugInfo: Move DIVariable::printExtendedName() to its only caller Move the local function `printDebugLoc()` along with it. llvm-svn: 234838	2015-04-14 02:09:32 +00:00
Duncan P. N. Exon Smith	571baebacb	DebugInfo: Inline DIVariable::isBlockByrefVariable() into its callers I don't think this API is helping much. Change the callers to call `MDType::isBlockByrefStruct()` directly. llvm-svn: 234837	2015-04-14 01:59:58 +00:00
Duncan P. N. Exon Smith	de8e427394	DebugInfo: Gut DIObjCProperty and DIImportedEntity Gut a couple more classes in the DIDescriptor hierarchy. Leave behind an implicit conversion to `DIDescriptor`, the old base class. llvm-svn: 234836	2015-04-14 01:46:44 +00:00
Duncan P. N. Exon Smith	b7e221ba55	DebugInfo: Gut DILocation This is along the same lines as r234832, but for `DILocation`. Clean out all accessors from `DILocation`. Any callers should be using `MDLocation` directly (e.g., via `operator->()`). llvm-svn: 234835	2015-04-14 01:35:55 +00:00
Duncan P. N. Exon Smith	6a0320a991	DebugInfo: Gut DIExpression Completely gut `DIExpression`, turning it into a simple wrapper around `MDExpression `. There are two bits of magic left: - It's constructed from `const MDExpression` but convertible to `MDExpression*`. - It's default-constructed to `nullptr`. Otherwise, it should behave quite like a raw pointer. Once I've done the same to the rest of the `DIDescriptor` subclasses, I'll come back to delete them entirely (and update call sites as necessary to deal with the missing magic). llvm-svn: 234832	2015-04-14 01:12:42 +00:00
Duncan P. N. Exon Smith	26e1cda1f4	DebugInfo: Remove DIObjCProperty attribute accessors, NFC There's only one user of the various `DIObjCProperty::is*Property()` accessors -- `DwarfUnit::constructTypeDIE()` -- and it's just using the reverse logic to reconstruct the bitfield. Drop this API and simplify the only caller. llvm-svn: 234818	2015-04-13 23:55:01 +00:00
Ahmed Bougacha	8ebcdb3bc3	[CodeGen] Combine concat_vectors of scalars into build_vector. Combine something like: (v8i8 concat_vectors (v2i8 bitcast (i16)) x4) into: (v8i8 (bitcast (v4i16 BUILD_VECTOR (i16) x4))) If any of the scalars are floating point, use that throughout. Differential Revision: http://reviews.llvm.org/D8948 llvm-svn: 234809	2015-04-13 22:57:21 +00:00
Duncan P. N. Exon Smith	745a5db444	SelectionDAG: Stop using DIVariable::isInlinedFnArgument() Instead of calling the somewhat confusingly-named `DIVariable::isInlinedFnArgument()`, do the check directly here. There's possibly a small functionality change here: instead of `dyn_cast<>`'ing `DV->getScope()` to `MDSubprogram`, I'm looking up the scope chain for the actual subprogram. I suspect that this is a no-op for function arguments so in practise there isn't a real difference. I've also added a `FIXME` to check the `inlinedAt:` chain instead, since I wonder if that would be more reliable than the `MDSubprogram::describes()` function. Since this was the only user of `DIVariable::isInlinedFnArgument()`, delete it. llvm-svn: 234799	2015-04-13 21:38:48 +00:00
Duncan P. N. Exon Smith	cca5f68e53	DebugInfo: Remove DIGlobalVariable::getGlobal() `DIGlobalVariable::getGlobal()` isn't really helpful, it just does a `dyn_cast_or_null<>`. Simplify its only user by doing the cast directly and delete the code. llvm-svn: 234796	2015-04-13 20:39:25 +00:00
Duncan P. N. Exon Smith	c82570b68b	Reapply "Verifier: Check for incompatible bit piece expressions" This reverts commit r234717, reapplying r234698 (in spirit). As described in r234717, the original `Verifier` check had a use-after-free. Instead of storing pointers to "interesting" debug info intrinsics whose bit piece expressions should be verified once we have typerefs, do a second traversal. I've added a testcase to catch the `llc` crasher. Original commit message: Verifier: Check for incompatible bit piece expressions Convert an assertion into a `Verifier` check. Bit piece expressions must fit inside the variable, and mustn't be the entire variable. Catching this in the verifier will help us find bugs sooner, and makes `DIVariable::getSizeInBits()` dead code. llvm-svn: 234776	2015-04-13 18:53:11 +00:00
Jan Vesely	ffcd968647	Revert revisions r234755, r234759, r234760 Revert "Remove default in fully-covered switch (to fix Clang -Werror -Wcovered-switch-default)" Revert "R600: Add carry and borrow instructions. Use them to implement UADDO/USUBO" Revert "LegalizeDAG: Try to use Overflow operations when expanding ADD/SUB" Using overflow operations fails CodeGen/Generic/2011-07-07-ScheduleDAGCrash.ll on hexagon, nvptx, and r600. Revert while I investigate. llvm-svn: 234768	2015-04-13 17:47:15 +00:00
Krzysztof Parzyszek	a46c36b8f4	Allow memory intrinsics to be tail calls llvm-svn: 234764	2015-04-13 17:16:45 +00:00
Matthias Braun	a283cb3265	DAGCombiner: Fix crash in select(select) opt. In case of different types used for the condition of the selects the select(select) -> select(and) normalisation cannot be performed. See also: http://reviews.llvm.org/D7622 llvm-svn: 234763	2015-04-13 17:16:33 +00:00
David Blaikie	155f38e0d8	Remove default in fully-covered switch (to fix Clang -Werror -Wcovered-switch-default) llvm-svn: 234760	2015-04-13 16:37:50 +00:00
Jan Vesely	a835555e40	LegalizeDAG: Try to use Overflow operations when expanding ADD/SUB v2: consider BooleanContents when processing overflow Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewers: resistor, jholewinsky (nvidia parts) Differential Revision: http://reviews.llvm.org/D6340 llvm-svn: 234755	2015-04-13 15:32:01 +00:00
John Brawn	e8fd6c8563	[ARM] Align global variables passed to memory intrinsics Fill in the TODO in CodeGenPrepare::OptimizeCallInst so that global variables that are passed to memory intrinsics are aligned in the same way that allocas are. Differential Revision: http://reviews.llvm.org/D8421 llvm-svn: 234735	2015-04-13 10:47:39 +00:00
Duncan P. N. Exon Smith	5cd454161c	Revert "Verifier: Check for incompatible bit piece expressions" This reverts commit r234698. This caused a use-after-free: `QueuedBitPieceExpressions` holds onto references to `DbgInfoIntrinsic`s and references them past where they're deleted (this is because the verifier is run as a function pass, and then `verifyTypeRefs()` is called during `doFinalization()`). I'll include a reduced crasher for `llc` when I recommit the check. llvm-svn: 234717	2015-04-13 00:06:28 +00:00
Duncan P. N. Exon Smith	5ad6ff76dc	Verifier: Check for incompatible bit piece expressions Convert an assertion into a `Verifier` check. Bit piece expressions must fit inside the variable, and mustn't be the entire variable. Catching this in the verifier will help us find bugs sooner, and makes `DIVariable::getSizeInBits()` dead code. llvm-svn: 234698	2015-04-11 19:58:35 +00:00
Benjamin Kramer	dd0ff85701	Remove empty non-virtual destructors or mark them =default when non-public These add no value but can make a class non-trivially copyable. NFC. llvm-svn: 234688	2015-04-11 15:32:26 +00:00
Alexander Kornienko	f817c1cb9a	Use 'override/final' instead of 'virtual' for overridden methods The patch is generated using clang-tidy misc-use-override check. This command was used: tools/clang/tools/extra/clang-tidy/tool/run-clang-tidy.py \ -checks='-*,misc-use-override' -header-filter='llvm\|clang' \ -j=32 -fix -format http://reviews.llvm.org/D8925 llvm-svn: 234679	2015-04-11 02:11:45 +00:00
Ahmed Bougacha	b96444efd1	[CodeGen] Split -enable-global-merge into ARM and AArch64 options. Currently, there's a single flag, checked by the pass itself. It can't force-enable the pass (and is on by default), because it might not even have been created, as that's the targets decision. Instead, have separate explicit flags, so that the decision is consistently made in the target. Keep the flag as a last-resort "force-disable GlobalMerge" for now, for backwards compatibility. llvm-svn: 234666	2015-04-11 00:06:36 +00:00
Reid Kleckner	9405ef0e1f	[WinEH] Recognize SEH finally block inserted by the frontend This allows winehprepare to build sensible llvm.eh.actions calls for SEH finally blocks. The pattern matching in this change is brittle and should be replaced with something more robust soon. In the meantime, this will let us write the code that produces __C_specific_handler xdata tables, which we need regardless of how we decide to get finally blocks through EH preparation. llvm-svn: 234663	2015-04-10 23:12:29 +00:00
Benjamin Kramer	b4bf14ceaa	[CodeGenPrepare] Report all changes made during instruction sinking r234638 chained another transform below which was tripping over the deleted instruction. Use after free found by asan in many regression tests. llvm-svn: 234654	2015-04-10 22:25:36 +00:00
Sanjoy Das	b6c5914308	[InstCombine][CodeGenPrep] Create llvm.uadd.with.overflow in CGP. Summary: This change moves creating calls to `llvm.uadd.with.overflow` from InstCombine to CodeGenPrep. Combining overflow check patterns into calls to the said intrinsic in InstCombine inhibits optimization because it introduces an intrinsic call that not all other transforms and analyses understand. Depends on D8888. Reviewers: majnemer, atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8889 llvm-svn: 234638	2015-04-10 21:07:09 +00:00
Reid Kleckner	6e48a826e8	[WinEH] Try to make outlining invokes work a little better WinEH currently turns invokes into calls. Long term, we will reconsider this, but for now, make sure we remap the operands and clone the successors of the new terminator. llvm-svn: 234608	2015-04-10 16:26:42 +00:00
Benjamin Kramer	619c4e57ba	Reduce dyn_cast<> to isa<> or cast<> where possible. No functional change intended. llvm-svn: 234586	2015-04-10 11:24:51 +00:00
David Majnemer	5c65f58f64	[WinEHPrepare] Don't rely on the order of IR The IPToState table must be emitted after we have generated labels for all functions in the table. Don't rely on the order of the list of globals. Instead, utilize WinEHFuncInfo to tell us how many catch handlers we expect to outline. Once we know we've visited all the catch handlers, emit the cppxdata. llvm-svn: 234566	2015-04-10 04:56:17 +00:00
Ahmed Bougacha	1ffe7c7d36	[AArch64] Promote f16 operations to f32. For the most common ones (such as fadd), we already did the promotion. Do the same thing for all the others. Currently, we'll just crash/assert on all these operations, as there's no hardware or libcall support whatsoever. f16 (half) is specified as an interchange - not arithmetic - format, and is expected to be promoted to single-precision for arithmetic operations. While there, teach the legalizer about promoting some of the (mostly floating-point) operations that we never needed before. Differential Revision: http://reviews.llvm.org/D8648 See related discussion on the thread for: http://reviews.llvm.org/D8755 llvm-svn: 234550	2015-04-10 00:08:48 +00:00
Rafael Espindola	5682ce2ceb	Simplify use of formatted_raw_ostream. formatted_raw_ostream is a wrapper over another stream to add column and line number tracking. It is used only for asm printing. This patch moves the its creation down to where we know we are printing assembly. This has the following advantages: * Simpler lifetime management: std::unique_ptr * We don't compute column and line number of object files :-) llvm-svn: 234535	2015-04-09 21:06:08 +00:00
Ahmed Bougacha	df43737782	[CodeGen] Combine concat_vector of trunc'd scalar to scalar_to_vector. We already do: concat_vectors(scalar, undef) -> scalar_to_vector(scalar) When the scalar is legal. When it's not, but is a truncated legal scalar, we can also do: concat_vectors(trunc(scalar), undef) -> scalar_to_vector(scalar) Which is equivalent, since the upper lanes are undef anyway. While there, teach the combine to look at more than 2 operands. Differential Revision: http://reviews.llvm.org/D8883 llvm-svn: 234530	2015-04-09 20:04:47 +00:00
Rafael Espindola	1c84271694	Revert "Refactoring and enhancement to FMA combine." This reverts commit r234513. It was failing on the bots. llvm-svn: 234518	2015-04-09 18:29:32 +00:00
Olivier Sallenave	53703d0862	Refactoring and enhancement to FMA combine. llvm-svn: 234513	2015-04-09 17:55:26 +00:00
Rafael Espindola	ee0dd4d289	This reverts commit r234460 and r234461. Revert "Add classof implementations to the raw_ostream classes." Revert "Use the cast machinery to remove dummy uses of formatted_raw_ostream." The underlying issue can be fixed without classof. llvm-svn: 234495	2015-04-09 15:54:59 +00:00
Rafael Espindola	132381f981	Use the cast machinery to remove dummy uses of formatted_raw_ostream. If we know we are producing an object, we don't need to wrap the stream in a formatted_raw_ostream anymore. llvm-svn: 234461	2015-04-09 02:28:12 +00:00
Andrew Kaylor	e104d89c8f	Formmatting correction llvm-svn: 234438	2015-04-08 21:22:46 +00:00
Andrew Kaylor	67d3c0359d	[WinEH] Minor bug fixes. Fixed insert point for allocas created for demoted values. Clear the nested landing pad list after it has been processed. llvm-svn: 234433	2015-04-08 20:57:22 +00:00
Akira Hatanaka	c6fab80536	[DAGCombine] Fix a bug in MergeConsecutiveStores. The bug manifests when there are two loads and two stores chained as follows in a DAG, (ld v3f32) -> (st f32) -> (ld v3f32) -> (st f32) and the stores' values are extracted from the preceding vector loads. MergeConsecutiveStores would replace the first store in the chain with the merged vector store, which would create a cycle between the merged store node and the last load node that appears in the chain. This commits fixes the bug by replacing the last store in the chain instead. rdar://problem/20275084 Differential Revision: http://reviews.llvm.org/D8849 llvm-svn: 234430	2015-04-08 20:34:53 +00:00
Daniel Jasper	018070c4c1	[MachineLICM] Cleanup, remove unused parameters. NFC. llvm-svn: 234392	2015-04-08 07:10:30 +00:00
Matthias Braun	9b9210264f	Oops, didn't mean to commit my debug fprintfs llvm-svn: 234385	2015-04-08 02:10:01 +00:00
Matthias Braun	1e61bbf022	LiveInterval: Fix computeFromMainRange() producing adjacent segments with same valno If two livesegments from different subranges happened to have the same definition they could possibly end up as two adjacent segments in the main liverange with the same value number which is not allowed. Detect such cases and fix them in the 2nd pass of computeFromMainRange() if necessary. No testcase as there is only an out-of-tree target where I can sensibly come up with one. llvm-svn: 234382	2015-04-08 01:41:10 +00:00
Andrew Kaylor	bb11132e65	[WinEH] Add invoke of llvm.donothing to outlined catch and cleanup handlers to identify their personality. Differential Review: http://reviews.llvm.org/D8835 llvm-svn: 234360	2015-04-07 21:30:23 +00:00
Reid Kleckner	f1853c65d9	[WinEH] Fix xdata generation when no catch object is present The lack of a catch object is indicated by a frame escape index of -1. Fixes PR23137. llvm-svn: 234346	2015-04-07 19:46:38 +00:00
Duncan P. N. Exon Smith	9a597ef199	Revert "Try a third time to fix MSVC build after r234290" This reverts commit r234295 (and r234294 and r234292 before it). I removed the implicit conversion to `MDTuple*` r234326, so there's no longer an ambiguity in `operator[]()`. I think MSVC should accept the original code now... llvm-svn: 234335	2015-04-07 18:07:41 +00:00
Daniel Jasper	e87e82bf79	Cleanup register pressure calculation in MachineLICM. There were four almost identical implementations of calculating/updating the register pressure for a certain MachineInstr. Cleanup to have a single implementation (well, controlled with two bool flags until this is cleaned up more). No functional changes intended. Tested by verify that there are no binary changes in the entire llvm test-suite. A new test was added separately in r234309 as it revealed a pre-existing error in the register pressure calculation. llvm-svn: 234325	2015-04-07 16:42:35 +00:00
Rafael Espindola	b91455b5c0	Refactor a lot of duplicated code for stub output. This also moves it earlier so that it they are produced before we print an end symbol for the data section. llvm-svn: 234315	2015-04-07 13:42:44 +00:00
Rafael Espindola	d1294d9bb3	Clear the stub map in getSortedStubs. This makes sure they are only output once (and frees a bit of memory). llvm-svn: 234313	2015-04-07 12:59:28 +00:00
Daniel Jasper	40c8194097	[MachineLICM] Remove obsolete comment about not considering reg pressure. llvm-svn: 234310	2015-04-07 11:43:32 +00:00
Duncan P. N. Exon Smith	1c0844bdfb	Try a third time to fix MSVC build after r234290 I have no idea what MSVC means with its error text here :(. http://lab.llvm.org:8011/builders/sanitizer-windows/builds/2310 llvm-svn: 234295	2015-04-07 05:03:47 +00:00
Duncan P. N. Exon Smith	8a4569bd2b	Try again to fix MSVC build after r234290 Still failing: http://lab.llvm.org:8011/builders/sanitizer-windows/builds/2309 llvm-svn: 234294	2015-04-07 04:49:13 +00:00
Duncan P. N. Exon Smith	4ddfd73aa1	Try to fix MSVC build after r234290 http://lab.llvm.org:8011/builders/sanitizer-windows/builds/2308 llvm-svn: 234292	2015-04-07 04:33:42 +00:00
Duncan P. N. Exon Smith	000fa2c646	DebugInfo: Remove DITypedArray<>, replace with typedefs Replace all uses of `DITypedArray<>` with `MDTupleTypedArrayWrapper<>` and `MDTypeRefArray`. The APIs are completely different, but the provided functionality is the same: treat an `MDTuple` as if it's an array of a particular element type. To simplify this patch a bit, I've temporarily typedef'ed `DebugNodeArray` to `DIArray` and `MDTypeRefArray` to `DITypeArray`. I've also temporarily conditionalized the accessors to check for null -- eventually these should be changed to asserts and the callers should check for null themselves. There's a tiny accompanying patch to clang. llvm-svn: 234290	2015-04-07 04:14:33 +00:00
Duncan P. N. Exon Smith	76c9184434	DebugInfo: Remove special iterators from DIExpression Remove special iterators from `DIExpression` in favour of same in `MDExpression`. There should be no functionality change here. Note that the APIs are slightly different: `getArg(unsigned)` counts from 0, not 1, in the `MDExpression` version of the iterator. llvm-svn: 234285	2015-04-07 03:45:57 +00:00
Duncan P. N. Exon Smith	e686f1591f	CodeGen: Stop using DIDescriptor::is*() and auto-casting Same as r234255, but for lib/CodeGen and lib/Target. llvm-svn: 234258	2015-04-06 23:27:40 +00:00
Rafael Espindola	d58de064b8	Use sext in fast isel. Fast isel used to zero extends immediates to 64 bits. This normally goes unnoticed because the value is truncated to 32 bits for output. Two cases were it is noticed: * We fail to use smaller encodings. * If the original constant was smaller than i32. In the tests using i1 constants, codegen would change to use -1, which is fine (and matches what regular isel does) since only the lowest bit is then used. Instead, this patch then changes the ir to use i8 constants, which looks more like what clang produces. llvm-svn: 234249	2015-04-06 22:29:07 +00:00
Duncan P. N. Exon Smith	a5099dce62	DebugInfo: Remove DIDescriptor::Verify() Remove `DIDescriptor::Verify()` and the `Verify()`s from subclasses. They had already been gutted, and just did an `isa<>` check. In a couple of cases I've temporarily dropped the check entirely, but subsequent commits are going to disallow conversions to the `DIDescriptor`s directly from `MDNode`, so the checks will come back in another form soon enough. llvm-svn: 234201	2015-04-06 19:49:39 +00:00
Reid Kleckner	b401941f3d	[WinEH] Don't sink allocas into child handlers The uselist isn't enough to infer anything about the lifetime of such allocas. If we want to re-add this optimization, we will need to leverage lifetime markers to do it. Fixes PR23122. llvm-svn: 234196	2015-04-06 18:50:38 +00:00
Simon Pilgrim	07e063e44c	[DAGCombiner] Add support for FCEIL, FFLOOR and FTRUNC vector constant folding Differential Revision: http://reviews.llvm.org/D8715 llvm-svn: 234179	2015-04-06 17:15:41 +00:00
Simon Pilgrim	bcf3bc2757	[DAGCombiner] Merge FMUL Scalar and Vector constant canonicalization to RHS. NFCI. llvm-svn: 234118	2015-04-05 14:30:37 +00:00
David Blaikie	4e5d47f436	[opaque pointer type] More GEP API migrations llvm-svn: 234108	2015-04-04 21:07:10 +00:00
Sanjay Patel	59f60a91b8	less space; NFC llvm-svn: 234106	2015-04-04 21:05:52 +00:00
Rafael Espindola	8ca44f0b5c	Implement unique sections with an unique ID. This allows the compiler/assembly programmer to switch back to a section. This in turn fixes the bootstrap failure on powerpc (tested on gcc110) without changing the ppc codegen at all. I will try to cleanup the various getELFSection overloads in a followup patch. Just using a default argument now would lead to ambiguities. llvm-svn: 234099	2015-04-04 18:02:01 +00:00
Simon Pilgrim	20b7aba04a	[DAGCombiner] Canonicalize vector constants for ADD/MUL/AND/OR/XOR re-association Scalar integers are commuted to move constants to the RHS for re-association - this ensures vectors do the same. llvm-svn: 234092	2015-04-04 10:20:31 +00:00
David Majnemer	7f5e714406	[WinEH] Fill out CatchHigh in the TryBlockMap Now all fields in the WinEH xdata have been filled out. llvm-svn: 234067	2015-04-03 23:37:34 +00:00
David Majnemer	69132a7fb2	[WinEH] Fill out .xdata for catch objects This add support for catching an exception such that an exception object available to the catch handler will be initialized by the runtime. llvm-svn: 234062	2015-04-03 22:49:05 +00:00
David Majnemer	3337064a47	[WinEH] Sink UnwindHelp completely out of IR We don't need to represent UnwindHelp in IR. Instead, we can use the knowledge that we are emitting the parent function to decide if we should create the UnwindHelp stack object. llvm-svn: 234061	2015-04-03 22:32:26 +00:00
Andrew Kaylor	7a0cec399d	Fixing a memory leak in WinEHPrepare llvm-svn: 234059	2015-04-03 21:44:17 +00:00
David Blaikie	aa41cd57e0	[opaque pointer type] More GEP IRBuilder API migrations... llvm-svn: 234058	2015-04-03 21:33:42 +00:00
Andrew Kaylor	a12eb15f1d	Fixing a build error llvm-svn: 234045	2015-04-03 19:55:30 +00:00
Andrew Kaylor	c2eb1426fc	Fixing build warnings. llvm-svn: 234043	2015-04-03 19:45:32 +00:00
Andrew Kaylor	aa92ab069c	[WinEH] Handle nested landing pads in outlined catch handlers Differential Revision: http://reviews.llvm.org/D8596 llvm-svn: 234041	2015-04-03 19:37:50 +00:00
Duncan P. N. Exon Smith	3bef6a3803	CodeGen: Assert that inlined-at locations agree As a follow-up to r234021, assert that a debug info intrinsic variable's `MDLocalVariable::getInlinedAt()` always matches the `MDLocation::getInlinedAt()` of its `!dbg` attachment. The goal here is to get rid of `MDLocalVariable::getInlinedAt()` entirely (PR22778), but I'll let these assertions bake for a while first. If you have an out-of-tree backend that just broke, you're probably attaching the wrong `DebugLoc` to a `DBG_VALUE` instruction. The one you want is the location that was attached to the corresponding `@llvm.dbg.declare` or `@llvm.dbg.value` call that you started with. llvm-svn: 234038	2015-04-03 19:20:26 +00:00
Reid Kleckner	aab30e1454	[WinEH] Fold cast into assertion based on review feedback llvm-svn: 234034	2015-04-03 18:18:06 +00:00
Duncan P. N. Exon Smith	66463cc5dc	SelectionDAG: Use specialized metadata nodes in EmitFuncArgumentDbgValue(), NFC Use `MDLocalVariable` and `MDExpression` directly for the arguments of `EmitFuncArgumentDbgValue()` to simplify a follow-up patch. llvm-svn: 234026	2015-04-03 17:11:42 +00:00
Duncan P. N. Exon Smith	c5bd3e01df	CodeGen: Fix MachineInstr::print() for DBG_VALUE Grab the `MDLocalVariable` from the second-to-last argument; the last argument is an `MDExpression`, and mixing them up will crash. llvm-svn: 234019	2015-04-03 16:23:04 +00:00
Daniel Jasper	efece52160	[MachineLICM] Small cleanup: Constify and rangeify. NFC. llvm-svn: 234018	2015-04-03 16:19:48 +00:00
Simon Pilgrim	ed2ba33ba0	[DAGCombiner] Combine shuffles of BUILD_VECTOR and SCALAR_TO_VECTOR This patch attempts to fold the shuffling of 'scalar source' inputs - BUILD_VECTOR and SCALAR_TO_VECTOR nodes - if the shuffle node is the only user. This folds away a lot of unnecessary shuffle nodes, and allows quite a bit of constant folding that was being missed. Differential Revision: http://reviews.llvm.org/D8516 llvm-svn: 234004	2015-04-03 10:02:21 +00:00
Reid Kleckner	7fab90f0e8	Fix unused variable in NDEBUG builds llvm-svn: 233978	2015-04-02 21:43:22 +00:00
Reid Kleckner	3567d27062	[WinEH] Make llvm.eh.actions use frameescape indices for catch params This makes it possible to use the same representation of llvm.eh.actions in outlined handlers as we use in the parent function because i32's are just constants that can be copied freely between functions. I had to add a sentinel alloca to the list of child allocas so that we don't try to sink the catch object into the handler. Normally, one would use nullptr for this kind of thing, but TinyPtrVector doesn't support null elements. More than that, it's elements have to have a suitable alignment. Therefore, I settled on this for my sentinel: AllocaInst getCatchObjectSentinel() { return static_cast<AllocaInst >(nullptr) + 1; } llvm-svn: 233947	2015-04-02 21:13:31 +00:00
David Blaikie	4a2e73b066	[opaque pointer type] API migration for GEP constant factories Require the pointee type to be passed explicitly and assert that it is correct. For now it's possible to pass nullptr here (and I've done so in a few places in this patch) but eventually that will be disallowed once all clients have been updated or removed. It'll be a long road to get all the way there... but if you have the cahnce to update your callers to pass the type explicitly without depending on a pointer's element type, that would be a good thing to do soon and a necessary thing to do eventually. llvm-svn: 233938	2015-04-02 18:55:32 +00:00
Philip Reames	4692f36306	[gcroot] Remove unused items from an enum These two were never implemented for gcroot, so there's no point in keeping them around now. llvm-svn: 233892	2015-04-02 05:02:16 +00:00
Philip Reames	2df7827c75	Teach gcroot how to handle dynamically realigned frames I'm playing with supporting custom stack map formats with statepoints. While doing so, I noticed that the existing implementation didn't indicate inherently unsized frames. This change essentially just ports the functionality that already exists for the default StackMaps section to custom stackmaps. llvm-svn: 233891	2015-04-02 05:00:40 +00:00
Andrew Kaylor	64622aa162	Fix WinEHPrepare bug with multiple catch handlers Differential Revision: http://reviews.llvm.org/D8682 llvm-svn: 233824	2015-04-01 17:21:25 +00:00
David Majnemer	e8eb9e6de3	[WinEH] Implement support for catch-all A catch (...) doesn't have a type descriptor. Instead, the 'adjectives' field has bit six set. llvm-svn: 233788	2015-04-01 05:20:42 +00:00
Jiangning Liu	b0f076910b	Fix PR23065. Avoid optimizing bitcast of build_vector with constant input to scalar_to_vector. llvm-svn: 233778	2015-04-01 01:52:38 +00:00
David Majnemer	d1079bf27a	[WinEH] ExitingScope is vacuously true if !PoppedCatches.empty() Remove a redundant condition, no functional change intended. llvm-svn: 233770	2015-03-31 22:43:56 +00:00
David Majnemer	a225a19dd0	[WinEH] Generate .xdata for catch handlers This lets us catch exceptions in simple cases. N.B. Things that do not work include (but are not limited to): - Throwing from within a catch handler. - Catching an object with a named catch parameter. - 'CatchHigh' is fictitious, we aren't sure of its purpose. - We aren't entirely efficient with regards to the number of EH states that we generate. - IP-to-State tables are sensitive to the order of emission. llvm-svn: 233767	2015-03-31 22:35:44 +00:00
Hal Finkel	17b6d77a5f	[SDAG] Handle non-integer preferred memset types for non-constant values The existing code in getMemsetValue only handled integer-preferred types when the fill value was not a constant. Make this more robust in two ways: 1. If the preferred type is a floating-point value, do the mul-splat trick on the corresponding integer type and then bitcast. 2. If the preferred type is a vector, do the mul-splat trick on one vector element, and then build a vector out of them. Fixes PR22754 (although, we should also turn off use of vector types at -O0). llvm-svn: 233749	2015-03-31 20:35:26 +00:00
Matthias Braun	a354cdd0c5	RegAllocGreedy: Allow target to specify register class ordering. Specify an allocation order with a register class. This is used by register allocators with a greedy heuristic. This is usefull as it is sometimes beneficial to color more constrained classes first. Differential Revision: http://reviews.llvm.org/D8626 llvm-svn: 233743	2015-03-31 19:57:53 +00:00
Matthias Braun	f5f89b9f7d	RegAllocGreedy: Improve live interval order in ReverseLocal mode When allocating live intervals in linear order and all of them are local to a single basic block you get an optimal coloring. This is also true if you reverse the order, but it is not true if you sort live ranges beginnings in reverse order, change to sort live range endings in reverse order. Take the following live ranges for example: \|---\| \|--------\| \|----------\| \|-------\| They get colored suboptimally with 3 registers if you sort the live range starting points in reverse order (but optimally with live range begins in order, or live range ends in reverse order). Apparently the previous strategy was intentional because of allocation time considerations. I am having a hard time replicating these effects, while I see substantial improvements in allocation quality with this change. No testcase as none of the (in tree) targets use reverse order mode. Differential Revision: http://reviews.llvm.org/D8625 llvm-svn: 233742	2015-03-31 19:57:49 +00:00
Sanjay Patel	d399d94837	typos; NFC llvm-svn: 233701	2015-03-31 16:17:51 +00:00
James Molloy	4c1b746771	[SDAG] Move TRUNCATE splitting logic into a helper, and use it more liberally. SplitVecOp_TRUNCATE has logic for recursively splitting oversize vectors that need more than one round of splitting to become legal. There are many other ISD nodes that could benefit from this logic, so factor it out and use it for FP_TO_UINT,FP_TO_SINT,SINT_TO_FP,UINT_TO_FP and FTRUNC. llvm-svn: 233681	2015-03-31 10:20:58 +00:00
Duncan P. N. Exon Smith	88f50087c8	DebugInfo: Remove LexicalBlockFile scope/context distinction Two things here: 1. I read `getScope()` and `getContext()` backwards in r233640. There was no need for `getScopeOfScope()`. Obviously not enough test coverage here (as I said in that commit, I'm going to come back to that), but anyway I'm reverting to the behaviour before r233640. 2. The callers that use `DILexicalBlockFile::getContext()` don't seem to care about the difference. Just have it redirect to `getScope()` so I can't get confused again. llvm-svn: 233650	2015-03-31 00:10:37 +00:00
Eric Christopher	f8019408dc	Replace the MCSubtargetInfo parameter with a Triple when creating an MCInstPrinter. Update all callers and use where we wanted a Triple previously. llvm-svn: 233648	2015-03-31 00:10:04 +00:00
Duncan P. N. Exon Smith	3386e0ea7b	LexicalScopes: Cleanup remaining uses of DebugLoc llvm-svn: 233644	2015-03-30 23:58:59 +00:00
Duncan P. N. Exon Smith	82eba746df	DebugLoc: Remove getFromDILexicalBlock() The only user of `DebugLoc::getFromDILexicalBlock()` was creating a new `MDLocation` as convenient API for passing an `MDScope`. Stop doing that, and remove the API. If in the future we actually want to create new DebugLocs, calling `MDLexicalBlock::get()` makes more sense. llvm-svn: 233643	2015-03-30 23:47:26 +00:00
Duncan P. N. Exon Smith	33af7a8f50	LexicalScopes: Use debug info hierarchy pervasively Pervasively use the types provided by the debug info hierarchy rather than `MDNode` in `LexicalScopes`. I noticed (again, I guess, based on comments in the implementation?) that `DILexicalBlockFile::getScope()` returns something different from `DILexicalBlockFile::getContext()`. I created a local helper for getting the same logic from `MDLexicalBlockFile` called `getScopeOfScope()`. I still don't really understand it, but I've added some FIXMEs and I'll come back to it (I suspect the way we encode these objects isn't really ideal). Note that my previous commit r233610 accidentally changed behaviour in `findLexicalScope()` -- it transitioned from a call to `DILexicalBlockFile::getScope()` to `MDLexicalBlockFile::getScope()` (sounds right, doesn't it?) -- so I've fixed that as a drive-by. No tests failed with my error, so it looks like we're missing some coverage here... when I come back to understand the logic, I'll see if I can add some. Other than the fix to `findLexicalScope()`, no functionality change. llvm-svn: 233640	2015-03-30 23:21:21 +00:00
David Majnemer	9a55539bef	Silence an unused variable warning. No functional change intended. llvm-svn: 233639	2015-03-30 23:14:45 +00:00
David Majnemer	cde33036ed	[WinEH] Run cleanup handlers when an exception is thrown Generate tables in the .xdata section representing what actions to take when an exception is thrown. This currently fills in state for cleanups, catch handlers are still unfinished. llvm-svn: 233636	2015-03-30 22:58:10 +00:00
Duncan P. N. Exon Smith	5a227fffe5	LexicalScopes: Use MDLocation directly instead of DebugLoc There's no benefit to using `DebugLoc` here. Moreover, this will let a follow-up commit work with `MDScope` directly instead of `DebugLoc`. llvm-svn: 233610	2015-03-30 21:54:46 +00:00
Quentin Colombet	6749ae36b4	[RegisterCoalescer] Fix a potential misuse of direct operand index in the terminal rule. Spot by code inspection. llvm-svn: 233606	2015-03-30 21:50:44 +00:00
Duncan P. N. Exon Smith	fd07a2af23	DwarfDebug: Avoid creating new DebugLocs in the backend Don't use `DebugLoc::getFnDebugLoc()`, which creates new `MDLocation`s, in the backend. We just want to grab the subprogram here anyway. llvm-svn: 233601	2015-03-30 21:32:28 +00:00
David Blaikie	3909da7f4b	[opaque pointer type] More IRBuilder::createGEP (non-inbounds) migrations: CodeGenPrepare and SimplifyLibCalls llvm-svn: 233596	2015-03-30 20:42:56 +00:00
Duncan P. N. Exon Smith	9dffcd04f7	CodeGen: Use the new DebugLoc API, NFC Update lib/CodeGen (and lib/Target) to use the new `DebugLoc` API. llvm-svn: 233582	2015-03-30 19:14:47 +00:00
Duncan P. N. Exon Smith	b525e1c07c	SelectionDAG: Reflow code to use early returns, NFC llvm-svn: 233577	2015-03-30 18:23:28 +00:00
Simon Pilgrim	dcbe1213c8	Use SDValue bool check to tidyup some possible vector folding ops. NFC. llvm-svn: 233498	2015-03-29 19:13:40 +00:00
Simon Pilgrim	d15c2805ab	Use SDValue bool check to tidyup some possible ReassociateOps. NFC. llvm-svn: 233495	2015-03-29 16:49:51 +00:00
Simon Pilgrim	7fdcc30e93	[DAGCombiner] Fixed incorrect test for buildvector of constant integers. DAGCombiner::ReassociateOps was correctly testing for an constant integer scalar but failed to correctly test for constant integer vectors (it was testing for any constant vector). llvm-svn: 233482	2015-03-28 18:31:31 +00:00
Ahmed Bougacha	a0f35592be	[CodeGen] "PromoteInteger" f32 to f64 doesn't make sense. The original f32->f64 promotion logic was refactored into roughly the currently shape in r37781. However, starting with r132263, the legalizer has been split into different kinds, and the previous "Promote" (which did the right thing) was search-and-replace'd into "PromoteInteger". The divide gradually deepened, with type legalization ("PromoteInteger") being separated from ops legalization ("Promote", which still works for floating point ops). Fast-forward to today: there's no in-tree target with legal f64 but illegal f32 (rather: no tests were harmed in the making of this patch). With such a target, i.e., if you trick the legalizer into going through the PromoteInteger path for FP, you get the expected brokenness. For instance, there's no PromoteIntRes_FADD (the name itself sounds wrong), so we'll just hit some assert in the PromoteInteger path. Don't pretend we can promote f32 to f64. Instead, always soften. llvm-svn: 233464	2015-03-28 01:22:37 +00:00
Andrew Kaylor	f7118ae810	Fixing a bug with optimized catch-all handlers in WinEHPrepare llvm-svn: 233439	2015-03-27 22:31:12 +00:00
Sanjay Patel	f176566a00	fix typo and 80-col; NFC llvm-svn: 233427	2015-03-27 21:45:18 +00:00
Ahmed Bougacha	faf8065a99	[CodeGen] Don't attempt a tail-call with a non-forwarded explicit sret. Tailcalls are only OK with forwarded sret pointers. With explicit sret, one approximation is to check that the pointer isn't an Instruction, as in that case it might point into some local memory (alloca). That's not OK with tailcalls. Explicit sret counterpart to r233409. Differential Revison: http://reviews.llvm.org/D8510 llvm-svn: 233410	2015-03-27 20:35:49 +00:00
Ahmed Bougacha	e2bd5d36b3	[CodeGen] Don't attempt a tail-call with implicit sret. Tailcalls are only OK with forwarded sret pointers. With sret demotion, they're not, as we'd have a pointer into a soon-to-be-dead stack frame. Differential Revison: http://reviews.llvm.org/D8510 llvm-svn: 233409	2015-03-27 20:28:30 +00:00
Quentin Colombet	2e27df717a	[RegisterCoalescer] Refine the terminal rule to still consider the terminal nodes. When a node is terminal it is pushed at the end of the list of the copies to coalesce instead of being completely ignored. In effect, this reduces its priority over non-terminal nodes. Because of that, we do not miss the rematerialization opportunities, nor the copies that can be merged with more complex, than the terminal rule, interference checks. Related to PR22768. llvm-svn: 233395	2015-03-27 18:37:15 +00:00
Yaron Keren	75e0c4b060	Remove superfluous .str() and replace std::string concatenation with Twine. llvm-svn: 233392	2015-03-27 17:51:30 +00:00
Andrew Trick	43adfb30d5	Complete the MachineScheduler fix made way back in r210390. "Fix the MachineScheduler's logic for updating ready times for in-order. Now the scheduler updates a node's ready time as soon as it is scheduled, before releasing dependent nodes." This fix was only made in one variant of the ScheduleDAGMI driver. Francois de Ferriere reported the issue in the other bit of code where it was also needed. I never got around to coming up with a test case, but it's an obvious fix that shouldn't be delayed any longer. I'll try to refactor this code a little better. I did verify performance on a wide variety of targets and saw no negative impact with this fix. llvm-svn: 233366	2015-03-27 06:10:13 +00:00
Philip Reames	e1bf27045d	Require a GC strategy be specified for functions which use gc.statepoint This was discussed a while back and I left it optional for migration. Since it's been far more than the 'week or two' that was discussed, time to actually make this manditory. llvm-svn: 233357	2015-03-27 05:09:33 +00:00
Philip Reames	f8f0933b48	Allow explicit spill slots to be specified for a gc.statepoint This patch adds support for explicitly provided spill slots in the GC arguments of a gc.statepoint. This is somewhat analogous to gcroot, but leverages the STATEPOINT MI node and StackMap infrastructure. The motivation for this is: 1) The stack spilling code for gc.statepoints hasn't advanced as fast as I'd like. One major option is to give up on doing spilling in the backend and do it at the IR level instead. We'd give up the ability to have gc values in registers, but that's a minor cost in practice. We are not neccessarily moving in that direction, but having the ability to prototype such a thing cheaply is interesting. 2) I want to port the gcroot lowering to use the statepoint infastructure. Given the metadata printers for gcroot expect a fixed set of stack roots, it's easiest to just reuse the explicit stack slots and pass them directly to the underlying statepoint. I'm holding off on the documentation for the new feature until I'm reasonable sure this is going to stick around. llvm-svn: 233356	2015-03-27 04:52:48 +00:00
David Majnemer	b919dd693f	WinEH: Create a parent frame alloca for HandlerType xdata tables We don't have any logic to emit those tables yet, so the SDAG lowering of this intrinsic is just a stub. We can see the intrinsic in the prepared IR, though. llvm-svn: 233354	2015-03-27 04:17:07 +00:00
Andrew Trick	e97ff5a2ad	Fix a bug in SelectionDAG scheduling backtracking code: PR22304. It can happen (by line CurSU->isPending = true; // This SU is not in AvailableQueue right now.) that a SUnit is mark as available but is not in the AvailableQueue. For SUnit being selected for scheduling both conditions must be met. This patch mainly defensively protects from invalid removing a node from a queue. Sometimes nodes are marked isAvailable but are not in the queue because they have been defered due to some hazard. Patch by Pawel Bylica! llvm-svn: 233351	2015-03-27 03:44:13 +00:00
Ahmed Bougacha	821880a7a1	[AsmPrinter] Don't assert on GOT equivalent non-constant users. We used to dyn_cast<Constant> in the recursive call, but cast<> in the initial one, and there can be non-Constant initial users. llvm-svn: 233346	2015-03-27 01:40:54 +00:00
Ahmed Bougacha	2a20e27057	Deduplicate a bunch of setOpActions into an MVT range-for. NFC. llvm-svn: 233330	2015-03-26 23:21:03 +00:00
Ahmed Bougacha	e85a2d34c6	[CodeGen] Report error rather than crash when unable to makeLibCall. Also, make the assumption explicit in the header. llvm-svn: 233329	2015-03-26 22:46:58 +00:00
Ahmed Bougacha	2721f62d50	[CodeGen] Don't pretend we can expand f16 libcalls. We used to mark a bunch of libm nodes as Expand for f16. There are no libcalls we can use for those, so we eventually just hit an unhelpful llvm_unreachable in ExpandFPLibCall. Instead, just ignore them altogether. If nothing else changes, we'll then get the more descriptive and pleasant "Cannot select" fatal error. There's an argument to be made for consistency, but f16 is already special in all the good ways, and as long as there's no f16 support in the ops expander (this patch), as well as the Soften/Expand float legalizers (which, when hit, will currently segfault), I think there's no point in even pretending we can legalize any of this. This shouldn't affect anything that's not already broken. llvm-svn: 233328	2015-03-26 22:44:58 +00:00
Sanjay Patel	5b305d2d66	revert inadvertent change llvm-svn: 233294	2015-03-26 17:19:24 +00:00
Sanjay Patel	4fa4a886d7	comment cleanup; NFC llvm-svn: 233293	2015-03-26 17:18:17 +00:00
Sanjay Patel	d95dd9e5fb	fix indent; NFC llvm-svn: 233288	2015-03-26 16:55:17 +00:00
Quentin Colombet	2c6e0597c6	[RegisterCoalescer] Add a rule to consider more profitable copies first when those are in the same basic block. The previous approach was the topological order of the basic block. By default this rule is disabled. Related to PR22768. llvm-svn: 233241	2015-03-26 01:01:48 +00:00
Simon Pilgrim	09f3ff9a0a	[DAGCombiner] Add support for TRUNCATE + FP_EXTEND vector constant folding This patch adds supports for the vector constant folding of TRUNCATE and FP_EXTEND instructions and tidies up the SINT_TO_FP and UINT_TO_FP instructions to match. It also moves the vector constant folding for the FNEG and FABS instructions to use the DAG.getNode() functionality like the other unary instructions. Differential Revision: http://reviews.llvm.org/D8593 llvm-svn: 233224	2015-03-25 22:30:31 +00:00
Matthias Braun	5d27ef6449	RegisterCoalescer: Fix implicit def handling in register coalescer If liveranges induced by an IMPLICIT_DEF get completely covered by a proper liverange the IMPLICIT_DEF instructions and its corresponding definitions have to be removed from the live ranges. This has to happen in the subregister live ranges as well (I didn't see this case earlier because in most programs only some subregisters are covered and the IMPLCIT_DEF won't get removed). No testcase, I spent hours trying to create one for one of the public targets, but ultimately failed because I couldn't manage to properly control the placement of COPY and IMPLICIT_DEF instructions from an .ll file. llvm-svn: 233217	2015-03-25 21:18:24 +00:00
Matthias Braun	e962e52a45	MachineVerifier: slightly simplify code that is only called with vregs llvm-svn: 233216	2015-03-25 21:18:22 +00:00
Reid Kleckner	7e9546b378	WinEH: Create an unwind help alloca for __CxxFrameHandler3 xdata tables We don't have any logic to emit those tables yet, so the sdag lowering of this intrinsic is just a stub. We can see the intrinsic in the prepared IR, though. llvm-svn: 233209	2015-03-25 20:10:36 +00:00
Paul Robinson	284f0451cf	'optnone' should not disable DAG combiner. Reverts the code change from r221168 and the relevant test. It was a mistake to disable the combiner, and based on the ultimate definition of 'optnone' we shouldn't have considered the test case as failing in the first place. llvm-svn: 233153	2015-03-25 00:10:24 +00:00
Reid Kleckner	11470c48d0	X86: Fix frameescape when not using an FP We can't use TargetFrameLowering::getFrameIndexOffset directly, because Win64 really wants the offset from the stack pointer at the end of the prologue. Instead, use X86FrameLowering::getFrameIndexOffsetFromSP(), which is a pretty close approximiation of that. It fails to handle cases with interestingly large stack alignments, which is pretty uncommon on Win64 and is TODO. llvm-svn: 233137	2015-03-24 23:46:01 +00:00
David Blaikie	68d535c45f	Opaque Pointer Types: GEP API migrations to specify the gep type explicitly The changes to InstCombine do seem a bit silly - it doesn't make anything obviously better to have the caller access the pointers element type (the thing I'm trying to remove) than the GEP itself, but it's a helpful migration step. This will allow me to more obviously lock down GEP (& Load, etc) API usage, then fix all the code that accesses pointer element types except the places that need to be removed (most of the InstCombines) anyway - at which point I'll need to just remove all that code because it won't be meaningful anymore (there will be no pointer types, so no bitcasts to combine) llvm-svn: 233126	2015-03-24 22:38:16 +00:00
Benjamin Kramer	722ff28643	Internalize the StackMapLiveness pass. No need to have its own header when it's not used anywhere. NFC. llvm-svn: 233072	2015-03-24 13:20:54 +00:00
Simon Pilgrim	481f4146cd	[SelectionDAG] Fixed issue with uitofp vector constant folding being treated as sitofp While the uitofp scalar constant folding treats an integer as an unsigned value (from lang ref): %X = sitofp i8 -1 to double ; yields double:-1.0 %Y = uitofp i8 -1 to double ; yields double:255.0 The vector constant folding was always using sitofp: %X = sitofp <2 x i8> <i8 -1, i8 -1> to <2 x double> ; yields <double -1.0, double -1.0> %Y = uitofp <2 x i8> <i8 -1, i8 -1> to <2 x double> ; yields <double -1.0, double -1.0> This patch fixes this so that the correct opcode is used for sitofp and uitofp. %X = sitofp <2 x i8> <i8 -1, i8 -1> to <2 x double> ; yields <double -1.0, double -1.0> %Y = uitofp <2 x i8> <i8 -1, i8 -1> to <2 x double> ; yields <double 255.0, double 255.0> Differential Revision: http://reviews.llvm.org/D8560 llvm-svn: 233033	2015-03-23 22:44:55 +00:00
Rafael Espindola	f2b408c64e	Refactor how passes get a symbol at the end of a section. There is now a canonical symbol at the end of a section that different passes can request. This also allows us to assert that we don't switch back to a section whose end symbol has already been printed. llvm-svn: 233026	2015-03-23 21:22:04 +00:00
Benjamin Kramer	799003bf8c	Re-sort includes with sort-includes.py and insert raw_ostream.h where it's used. llvm-svn: 232998	2015-03-23 19:32:43 +00:00
Benjamin Kramer	a8d61b104d	[winehprepare] Update and sort includes. NFC. llvm-svn: 232994	2015-03-23 18:57:17 +00:00
Benjamin Kramer	b85d3756a6	Another set of missing raw_ostream.h. Still no functional change. llvm-svn: 232993	2015-03-23 18:45:56 +00:00
Benjamin Kramer	de9f090e10	More missing includes only visible to MSVC. NFC. llvm-svn: 232981	2015-03-23 18:23:08 +00:00
Benjamin Kramer	16132e6faa	Purge unused includes throughout libSupport. NFC. llvm-svn: 232976	2015-03-23 18:07:13 +00:00
Benjamin Kramer	51f6096cf8	Move private classes into anonymous namespaces NFC. llvm-svn: 232944	2015-03-23 12:30:58 +00:00
Petar Jovanovic	5b4362276b	Fix sign extension for MIPS64 in makeLibCall function Fixing sign extension in makeLibCall for MIPS64. In MIPS64 architecture all 32 bit arguments (int, unsigned int, float 32 (soft float)) must be sign extended. This fixes test "MultiSource/Applications/oggenc/". Patch by Strahinja Petrovic. Differential Revision: http://reviews.llvm.org/D7791 llvm-svn: 232943	2015-03-23 12:28:13 +00:00
Hal Finkel	8f7c5a7f18	[SDAG] Don't widen VSETCC during type legalization for split operands Because the operands of a vector SETCC node can be of a different type from the result (and often are), it can happen that even if we'd prefer to widen the result type of the SETCC, the operands have been split instead. In this case, the SETCC result also must be split. This mirrors what is done in WidenVecRes_SELECT, and should be NFC elsewhere because if the operands are not widened the following calls to GetWidenedVector will assert (which is what was happening in the test case). llvm-svn: 232935	2015-03-23 08:22:43 +00:00
Eric Christopher	c5a85af3b2	Cache the Function dependent subtarget on the MachineFunction. As preparation for removing the getSubtargetImpl() call from TargetMachine go ahead and flip the switch on caching the function dependent subtarget and remove the bare getSubtargetImpl call from the X86 port. As part of this add a few tests that show we can generate code and assemble on X86 based on features/cpu on the Function. llvm-svn: 232879	2015-03-21 03:13:10 +00:00
Eric Christopher	cd53d6eda7	Change getISAEncoding to use the target triple to determine thumb-ness similar to the rest of the Module level asm printing infrastructure as debug info finalization happens after the function may be missing. llvm-svn: 232875	2015-03-21 03:13:01 +00:00
Ahmed Bougacha	7173b669b4	[CodeGen][IfCvt] Don't re-ifcvt blocks with unanalyzable terminators. If we couldn't analyze its terminator (i.e., it's an indirectbr, or some other weirdness), we can't safely re-if-convert a predicated block, because we can't tell whether the predicated terminator can fallthrough (it does). Currently, we would completely ignore the fallthrough successor. In the added testcase, this means we used to generate: ... @ %entry: cmp r5, #21 ittt ne @ %cc1f: cmpne r7, #42 @ %cc2t: strne.w r5, [r8] movne pc, r10 @ %cc1t: ... Whereas the successor of %cc1f was originally %bb1. With the fix, we get the correct: ... @ %entry: cmp r5, #21 itt eq @ %cc1t: streq.w r5, [r11] moveq pc, r0 @ %cc1f: cmp r7, #42 itt ne @ %cc2t: strne.w r5, [r8] movne pc, r10 @ %bb1: ... rdar://20192768 Differential Revision: http://reviews.llvm.org/D8509 llvm-svn: 232872	2015-03-21 01:23:15 +00:00
Andrew Kaylor	3170e5620e	Fixing a bug with WinEH PHI handling llvm-svn: 232851	2015-03-20 21:42:54 +00:00
Eric Christopher	594fa96a57	Remove dead calls and function arguments dealing with TRI in StackMaps. llvm-svn: 232847	2015-03-20 21:05:18 +00:00
Rafael Espindola	36a15cb975	Don't declare all text sections at the start of the .s The code this patch removes was there to make sure the text sections went before the dwarf sections. That is necessary because MachO uses offsets relative to the start of the file, so adding a section can change relaxations. The dwarf sections were being printed at the start just to produce symbols pointing at the start of those sections. The underlying issue was fixed in r231898. The dwarf sections are now printed when they are about to be used, which is after we printed the text sections. To make sure we don't regress, the patch makes the MachO streamer assert if CodeGen puts anything unexpected after the DWARF sections. llvm-svn: 232842	2015-03-20 20:00:01 +00:00
Duncan P. N. Exon Smith	23e56ecf26	AsmPrinter: Check subprogram before using it Check return of `getDISubprogram()` before using it. A WIP patch makes `DIDescriptor` accessors more strict (and would crash on this). llvm-svn: 232838	2015-03-20 19:50:00 +00:00
Duncan P. N. Exon Smith	d3a057733f	DwarfDebug: Check for null DebugLocs `DL` might be null, so check for that before using accessors. A WIP patch to make `DIDescriptors` more strict fails otherwise. As a bonus, I think the logic is easier to follow now (despite the extra nesting depth). llvm-svn: 232836	2015-03-20 19:37:03 +00:00
Hans Wennborg	90aa1a9653	SelectionDAGBuilder: Rangeify a loop. NFC. llvm-svn: 232831	2015-03-20 18:48:40 +00:00
Hans Wennborg	2bdc4cf35f	SelectionDAGBuilder::handleJTSwitchCase, simplify loop; NFC llvm-svn: 232830	2015-03-20 18:48:31 +00:00
John Brawn	1f26a47630	[ARM] Fix handling of thumb1 out-of-range frame offsets LocalStackSlotPass assumes that isFrameOffsetLegal doesn't change its answer when the base register changes. Unfortunately this isn't true in thumb1, where SP-based loads allow a larger offset than non-SP-based loads, and this causes the base register reuse code to generate instructions that are unencodable, causing an assertion failure. Solve this by adding a BaseReg parameter to isFrameOffsetLegal, which ARMBaseRegisterInfo can then make use of to give the correct answer. Differential Revision: http://reviews.llvm.org/D8419 llvm-svn: 232825	2015-03-20 17:20:07 +00:00
Eric Christopher	cef8e71394	Rewrite StackMap location handling to pre-compute the dwarf register numbers before emission. This removes a dependency on being able to access TRI at the module level and is similar to the DwarfExpression handling. I've modified the debug support into print/dump routines that'll do the same dumping but is now callable anywhere and if TRI isn't available will go ahead and just print out raw register numbers. llvm-svn: 232821	2015-03-20 16:03:42 +00:00
Eric Christopher	d43c5c75b6	At the beginning of doFinalization set the MachineFunction to nullptr so that users get an earlier dereferencing error and so that we can use it to conditionalize access to MachineFunction specific data. llvm-svn: 232820	2015-03-20 16:03:39 +00:00
Daniel Jasper	214997c63b	[MBP] Don't outline short optional branches With the option -outline-optional-branches, LLVM will place optional branches out of line (more details on r231230). With this patch, this is not done for short optional branches. A short optional branch is a branch containing a single block with an instruction count below a certain threshold (defaulting to 3). Still everything is guarded under -outline-optional-branches). Outlining a short branch can't significantly improve code locality. It can however decrease performance because of the additional jmp and in cases where the optional branch is hot. This fixes a compile time regression I have observed in a benchmark. Review: http://reviews.llvm.org/D8108 llvm-svn: 232802	2015-03-20 10:00:37 +00:00
Hans Wennborg	077845eb81	Rewrite SelectionDAGBuilder::Clusterify to run in linear time. NFC. It was previously repeatedly erasing elements from the middle of a vector, causing O(n^2) worst-case run-time. llvm-svn: 232789	2015-03-20 00:41:03 +00:00
Eric Christopher	d83003ea59	Use the cached subtarget on the MachineFunction when the AsmPrinter will have a MachineFunction, i.e. in places other than the module level doInitialize/doFinalize. llvm-svn: 232783	2015-03-19 23:27:42 +00:00
Eric Christopher	7585fb2d9f	Use the cached subtarget off of the machine function. llvm-svn: 232782	2015-03-19 23:06:21 +00:00
Owen Anderson	db4201235b	Fix a nasty bug in DAGCombine of STORE nodes. This is very related to the bug fixed in r174431. The problem is that SelectionDAG does not include alignment in the uniquing of loads and stores. When an otherwise no-op DAGCombine would increase the alignment of a load or store, the original node would be returned (with the alignment increased), which would cause the node not to be processed by any further DAGCombines. I don't have a direct testcase for this that manifests on an in-tree target, but I did see some noise in the tests for other targets and have updated them for it. llvm-svn: 232780	2015-03-19 22:48:57 +00:00
Eric Christopher	cf7b5f5fc5	Remove unused headers. llvm-svn: 232777	2015-03-19 22:36:38 +00:00
Eric Christopher	12cf76fe26	Add an MCSubtargetInfo variable to the TargetMachine. This enables us to remove calls to the subtarget from the TargetMachine and with a small hack for backends that require global subtarget information for module level code generation, e.g. mips abi flags, as mentioned in a fixme in the code. llvm-svn: 232776	2015-03-19 22:36:37 +00:00
Eric Christopher	72e23a219c	Add a TargetMachine local MCRegisterInfo and MCInstrInfo so that they can be used without a subtarget in constructing subtarget independent passes. llvm-svn: 232775	2015-03-19 22:36:32 +00:00
Reid Kleckner	c759fe90bc	WinEH: Make llvm.eh.actions emission match the EH docs This switches the sense of the i32 values and updates the test cases. We can also use CHECK-SAME to clean up some tests, and reduce the visual noise from bitcasts. llvm-svn: 232774	2015-03-19 22:31:02 +00:00

... 11 12 13 14 15 ...

19356 Commits