llvm-project

Commit Graph

Author	SHA1	Message	Date
Hubert Tong	01a2cb55f1	TrailingObjects::FixedSizeStorage constexpr fixes + tests Summary: This change fixes issues with `LLVM_CONSTEXPR` functions and `TrailingObjects::FixedSizeStorage`. In particular, some of the functions marked `LLVM_CONSTEXPR` used by `FixedSizeStorage` were not implemented such that they evaluate successfully as part of a constant expression despite constant arguments. This change also implements a more traditional template-meta path to accommodate MSVC, and adds unit tests for `FixedSizeStorage`. Drive-by fix: the access control for members of `TrailingObjectsImpl` is tightened. Reviewers: faisalv, rsmith, aaron.ballman Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D22668 llvm-svn: 277270	2016-07-30 14:01:00 +00:00
Hubert Tong	a216643cd3	MathExtras.h: add LLVM_CONSTEXPR where simple Summary: This change adds `LLVM_CONSTEXPR` to functions selected as follows: - the body is already valid under C++11 for a `constexpr` function, - the evaluation of the function, given constant arguments, will not fail during the evaluation of a constant expression, and - the above properties are easily verifiable at a glance. Note: the evaluation of the function cannot fail if the instantiation triggers a static assertion failure. Reviewers: faisalv, rsmith, aaron.ballman Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D22824 llvm-svn: 277269	2016-07-30 13:38:51 +00:00
Benjamin Kramer	96cb6bfa27	Update modulemap for Msf -> MSF rename. llvm-svn: 277267	2016-07-30 12:05:17 +00:00
Lang Hames	c31d594228	[Orc] Add support for updating stub targets to CompileOnDemandLayer. This makes it possible to implement re-optimization on top of the CompileOnDemandLayer. Test case to come in a future patch: This will need an execution test, and execution tests require a full working stack. The best option is to plumb this API up to the C Bindings stack and add a C bindings test for this. Patch by Sean Ogden. Thanks Sean! llvm-svn: 277257	2016-07-30 00:57:54 +00:00
Lang Hames	6ecbd16f00	[Support] Add storage specifier for MachO::NListType. This should fix UB warnings from the sanitizer bots: LLD performs bit manipulations on enums of this type, and these are UB if the underlying storage type isn't specified. llvm-svn: 277251	2016-07-29 23:17:53 +00:00
Tim Northover	5fb414d870	GlobalISel: support translation of intrinsic calls. These come in two variants for now: G_INTRINSIC and G_INTRINSIC_W_SIDE_EFFECTS. We may decide to split the latter up with finer-grained restrictions later, if necessary. llvm-svn: 277224	2016-07-29 22:32:36 +00:00
Rui Ueyama	7a5cdc6225	pdbdump: Dump Free Page Map contents. Differential Revision: https://reviews.llvm.org/D22974 llvm-svn: 277216	2016-07-29 21:38:00 +00:00
Zachary Turner	a3225b0451	[msf] Resubmit "Rename Msf -> MSF". Previously this change was submitted from a Windows machine, so changes made to the case of filenames and directory names did not survive the commit, and as a result the CMake source file names and the on-disk file names did not match on case-sensitive file systems. I'm resubmitting this patch from a Linux system, which hopefully allows the case changes to make it through unfettered. llvm-svn: 277213	2016-07-29 20:56:36 +00:00
Tim Northover	6b3bd61283	CodeGen: add new "intrinsic" MachineOperand kind. This will be used during GlobalISel, where we need a more robust and readable way to write tests than a simple immediate ID. llvm-svn: 277209	2016-07-29 20:32:59 +00:00
Adam Nemet	12937c361f	[LoopUnroll] Include hotness of region in opt remark LoopUnroll is a loop pass, so the analysis of OptimizationRemarkEmitter is added to the common function analysis passes that loop passes depend on. The BFI and indirectly BPI used in this pass is computed lazily so no overhead should be observed unless -pass-remarks-with-hotness is used. This is how the patch affects the O3 pipeline: Dominator Tree Construction Natural Loop Information Canonicalize natural loops Loop-Closed SSA Form Pass Basic Alias Analysis (stateless AA impl) Function Alias Analysis Results Scalar Evolution Analysis + Lazy Branch Probability Analysis + Lazy Block Frequency Analysis + Optimization Remark Emitter Loop Pass Manager Rotate Loops Loop Invariant Code Motion Unswitch loops Simplify the CFG Dominator Tree Construction Basic Alias Analysis (stateless AA impl) Function Alias Analysis Results Combine redundant instructions Natural Loop Information Canonicalize natural loops Loop-Closed SSA Form Pass Scalar Evolution Analysis + Lazy Branch Probability Analysis + Lazy Block Frequency Analysis + Optimization Remark Emitter Loop Pass Manager Induction Variable Simplification Recognize loop idioms Delete dead loops Unroll loops ... llvm-svn: 277203	2016-07-29 19:29:47 +00:00
Zachary Turner	334aec4dd2	Revert "[msf] Rename Msf to MSF." This reverts commit 4d1557ffac41e079bcb1abbcf04f512474dcd6fe. llvm-svn: 277194	2016-07-29 18:38:47 +00:00
Piotr Padlewski	5312b6f10c	Fixing broken MSVS builds llvm-svn: 277191	2016-07-29 18:28:07 +00:00
Zachary Turner	a010f5cef0	[msf] Rename Msf to MSF. In a previous patch, it was suggested to use all caps instead of rolling caps for initialisms, so this patch changes everything to do this. llvm-svn: 277190	2016-07-29 18:24:26 +00:00
Andrew Kaylor	b99d1cc7ed	Recommitting r275284: add support to inline __builtin_mempcpy Patch by Sunita Marathe Third try, now following fixes to MSan to handle mempcy in such a way that this commit won't break the MSan buildbots. (Thanks, Evegenii!) llvm-svn: 277189	2016-07-29 18:23:18 +00:00
Tim Northover	0d56e05a12	GlobalISel: make translate* functions take the most specialized class possible. NFC. llvm-svn: 277188	2016-07-29 18:11:21 +00:00
Tim Northover	69c2ba546f	GlobalISel: add generic conditional branch. Just the basic equivalent to DAG's condbr for now, we'll get to things like br_cc when we start doing more legalization. llvm-svn: 277184	2016-07-29 17:58:00 +00:00
Kevin Enderby	f4586039f6	The next step along the way to getting good error messages for bad archives. As mentioned in commit log for r276686 this next step is adding a new method in the ArchiveMemberHeader class to get the full name that does proper error checking, and can be use for error messages. To do this the name of ArchiveMemberHeader::getName() is changed to ArchiveMemberHeader::getRawName() to be consistent with Archive::Child::getRawName(). Then the “new” method is the addition of a new implementation of ArchiveMemberHeader::getName() which gets the full name and provides proper error checking. Which is mostly a rewrite of what was Archive::Child::getName() and cleaning up incorrect uses of llvm_unreachable() in the code which were actually just cases of errors in the input Archives. Then Archive::Child::getName() is changed to return Expected<> and use the new implementation of ArchiveMemberHeader::getName() . Also needed to change Archive::getMemoryBufferRef() with these changes to return Expected<> as well to propagate Errors up. As well as changing Archive::isThinMember() to return Expected<> . llvm-svn: 277177	2016-07-29 17:44:13 +00:00
Tim Northover	a51575ffa2	CodeGen: improve MachineInstrBuilder & MachineIRBuilder interface For MachineInstrBuilder, having to manually use RegState::Define is ugly and makes register definitions clunkier than they need to be, so this adds two convenience functions: addDef and addUse. For MachineIRBuilder, we want to avoid BuildMI's first-reg-is-def rule because it's hidden away and causes bugs. So this patch switches buildInstr to returning a MachineInstrBuilder and adding all operands via addDef/addUse. NFC. llvm-svn: 277176	2016-07-29 17:43:52 +00:00
Ahmed Bougacha	784e3423e6	[GlobalISel] Add G_XOR. llvm-svn: 277172	2016-07-29 16:56:20 +00:00
Ahmed Bougacha	5c98b60ecc	[GlobalISel] Add LLT raw_ostream operator<< overload. Helpful when debugging; will be used in the following commit. llvm-svn: 277170	2016-07-29 16:56:12 +00:00
Brendon Cahoon	254f889dc5	MachinePipeliner pass that implements Swing Modulo Scheduling Software pipelining is an optimization for improving ILP by overlapping loop iterations. Swing Modulo Scheduling (SMS) is an implementation of software pipelining that attempts to reduce register pressure and generate efficient pipelines with a low compile-time cost. This implementaion of SMS is a target-independent back-end pass. When enabled, the pass should run just prior to the register allocation pass, while the machine IR is in SSA form. If the pass is successful, then the original loop is replaced by the optimized loop. The optimized loop contains one or more prolog blocks, the pipelined kernel, and one or more epilog blocks. This pass is enabled for Hexagon only. To enable for other targets, a couple of target specific hooks must be implemented, and the pass needs to be called from the target's TargetMachine implementation. Differential Review: http://reviews.llvm.org/D16829 llvm-svn: 277169	2016-07-29 16:44:44 +00:00
Matt Masten	a6669a1e05	Initial support for vectorization using svml (short vector math library). Differential Revision: https://reviews.llvm.org/D19544 llvm-svn: 277166	2016-07-29 16:42:44 +00:00
Ahmed Bougacha	5831cc28b5	[GlobalISel] Auto-brief LowLevelType. NFC. llvm-svn: 277163	2016-07-29 16:11:06 +00:00
Ahmed Bougacha	9d95557128	[GlobalISel] Add LLT::operator!=(). llvm-svn: 277162	2016-07-29 16:11:04 +00:00
Ahmed Bougacha	8292bdf735	[GlobalISel] Fix LLT::unsized to match LLT(LabelTy). When coming from an IR label type, we set a 0 NumElements, but not when constructing an LLT using unsized(), causing comparisons to fail. Pick one variant and fix the other. llvm-svn: 277161	2016-07-29 16:11:02 +00:00
Ahmed Bougacha	9f986bf3a9	[GlobalISel] Add unittests for LowLevelType. llvm-svn: 277160	2016-07-29 16:10:57 +00:00
Sjoerd Meijer	0eb96ed0de	TargetInstrInfo: add virtual function getInstSizeInBytes This adds a target hook getInstSizeInBytes to TargetInstrInfo that a lot of subclasses already implement. Differential Revision: https://reviews.llvm.org/D22885 llvm-svn: 277126	2016-07-29 08:16:16 +00:00
David Majnemer	a926b3e71b	[ConstantFolding] Remove an unused ConstantFoldInstOperands overload No functional change is intended. llvm-svn: 277101	2016-07-29 03:27:33 +00:00
David Majnemer	d536f2328e	[ConstnatFolding] Teach the folder how to fold ConstantVector A ConstantVector can have ConstantExpr operands and vice versa. However, the folder had no ability to fold ConstantVectors which, in some cases, was an optimization barrier. Instead, rephrase the folder in terms of Constants instead of ConstantExprs and teach callers how to deal with failure. llvm-svn: 277099	2016-07-29 03:27:26 +00:00
Piotr Padlewski	c7a151464f	Fixed comment llvm-svn: 277091	2016-07-29 00:30:07 +00:00
Piotr Padlewski	84abc74f2c	Added ThinLTO inlining statistics Summary: copypasta doc of ImportedFunctionsInliningStatistics class \brief Calculate and dump ThinLTO specific inliner stats. The main statistics are: (1) Number of inlined imported functions, (2) Number of imported functions inlined into importing module (indirect), (3) Number of non imported functions inlined into importing module (indirect). The difference between first and the second is that first stat counts all performed inlines on imported functions, but the second one only the functions that have been eventually inlined to a function in the importing module (by a chain of inlines). Because llvm uses bottom-up inliner, it is possible to e.g. import function `A`, `B` and then inline `B` to `A`, and after this `A` might be too big to be inlined into some other function that calls it. It calculates this statistic by building graph, where the nodes are functions, and edges are performed inlines and then by marking the edges starting from not imported function. If `Verbose` is set to true, then it also dumps statistics per each inlined function, sorted by the greatest inlines count like - number of performed inlines - number of performed inlines to importing module Reviewers: eraman, tejohnson, mehdi_amini Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D22491 llvm-svn: 277089	2016-07-29 00:27:16 +00:00
Justin Lebar	9cbc301035	Revert "Don't invoke getName() from Function::isIntrinsic().", rL276942. This broke some out-of-tree AMDGPU tests that relied on the old behavior wherein isIntrinsic() would return true for any function that starts with "llvm.". And in general that change will not play nicely with out-of-tree backends. llvm-svn: 277087	2016-07-28 23:58:15 +00:00
Sanjoy Das	c6af5ead86	[IR] Introduce a non-integral pointer type Summary: This change adds a `ni` specifier in the `datalayout` string to denote pointers in some given address spaces as "non-integral", and adds some typing rules around these special pointers. Reviewers: majnemer, chandlerc, atrick, dberlin, eli.friedman, tstellarAMD, arsenm Subscribers: arsenm, mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D22488 llvm-svn: 277085	2016-07-28 23:43:38 +00:00
Adam Nemet	aa3506c5f0	[BPI] Add new LazyBPI analysis Summary: The motivation is the same as in D22141: In order to add the hotness attribute to optimization remarks we need BFI to be available in all passes that emit optimization remarks. BFI depends on BPI so unless we make this lazy as well we would still compute BPI unconditionally. The solution is to use the new LazyBPI pass in LazyBFI and only compute BPI when computation of BFI is requested by the client. I extended the laziness test using a LoopDistribute test to also cover BPI. Reviewers: hfinkel, davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D22835 llvm-svn: 277083	2016-07-28 23:31:12 +00:00
Michael Kuperstein	e45d4d9b35	[PM] Port LowerGuardIntrinsic to the new PM. llvm-svn: 277057	2016-07-28 22:08:41 +00:00
David Majnemer	3d32b7ed0d	[coroutines] Part 3 of N: Adding Boilerplate for Coroutine Passes This adds boilerplate code for all coroutine passes, the passes are no-ops for now. Also, a small test has been added to verify that passes execute in the expected order or not at all if coroutine support is disabled. Patch by Gor Nishanov! Differential Revision: https://reviews.llvm.org/D22847 llvm-svn: 277033	2016-07-28 21:04:31 +00:00
Zachary Turner	e98137c47f	[pdb] Fix some warnings that break -Werror builds. llvm-svn: 277021	2016-07-28 19:18:02 +00:00
Zachary Turner	d66889cbae	[pdb] Refactor library to more clearly separate reading/writing Reviewed By: amccarth, ruiu Differential Revision: https://reviews.llvm.org/D22693 llvm-svn: 277019	2016-07-28 19:12:28 +00:00
Zachary Turner	199f48a5f0	Get rid of IMsfStreamData class. This was a pure virtual base class whose purpose was to abstract away the notion of how you retrieve the layout of a discontiguous stream of blocks in an Msf file. This led to too many layers of abstraction making it difficult to figure out what was going on and extend things. Ultimately, a stream's layout is decided by its length and the array of block numbers that it lives on. So rather than have an abstract base class which can return this in any number of ways, it's more straightforward to simply store them as fields of a trivial struct, and also to give a more appropriate name. This patch does that. It renames IMsfStreamData to MsfStreamLayout, and deletes the 2 concrete implementations, DirectoryStreamData and IndexedStreamData. MsfStreamLayout is a trivial struct with the necessary data. llvm-svn: 277018	2016-07-28 19:11:09 +00:00
Matthias Braun	941a705b7b	MachineFunction: Return reference for getFrameInfo(); NFC getFrameInfo() never returns nullptr so we should use a reference instead of a pointer. llvm-svn: 277017	2016-07-28 18:40:00 +00:00
John Brawn	2853269224	Revert r276973 "Adjust Registry interface to not require plugins to export a registry" Buildbot failures when building with clang -Werror. Reverting while I try to figure this out. llvm-svn: 277008	2016-07-28 17:17:22 +00:00
Ahmed Bougacha	46c05fc861	[GlobalISel] Remove types on selected insts instead of using LLT(). LLT() has a particular meaning: it's one invalid type. But we really want selected instructions to have no type whatsoever. Also verify that types don't linger after ISel, and enable the verifier on the AArch64 select test. llvm-svn: 277001	2016-07-28 16:58:27 +00:00
Wei Ding	07e03712d3	AMDGPU : Add intrinsics for compare with the full wavefront result Differential Revision: http://reviews.llvm.org/D22482 llvm-svn: 276998	2016-07-28 16:42:13 +00:00
John Brawn	778c3c6c61	Reapply r276856 "Adjust Registry interface to not require plugins to export a registry" This version has two fixes compared to the original: * In Registry.h the template static members are instantiated before they are used, as clang gives an error if you do it the other way around. * The use of the Registry template in clang-tidy is updated in the same way as has been done everywhere else. Original commit message: Currently the Registry class contains the vestiges of a previous attempt to allow plugins to be used on Windows without using BUILD_SHARED_LIBS, where a plugin would have its own copy of a registry and export it to be imported by the tool that's loading the plugin. This only works if the plugin is entirely self-contained with the only interface between the plugin and tool being the registry, and in particular this conflicts with how IR pass plugins work. This patch changes things so that instead the add_node function of the registry is exported by the tool and then imported by the plugin, which solves this problem and also means that instead of every plugin having to export every registry they use instead LLVM only has to export the add_node functions. This allows plugins that use a registry to work on Windows if LLVM_EXPORT_SYMBOLS_FOR_PLUGINS is used. llvm-svn: 276973	2016-07-28 12:48:17 +00:00
Zijiao Ma	e56a53a9b3	Add unittests to {ARM \| AArch64}TargetParser. Add unittest to {ARM \| AArch64}TargetParser,and by the way correct problems as below: 1.Correct a incorrect indexing problem in AArch64TargetParser. The architecture enumeration is shared across ARM and AArch64 in original implementation.But In the code,I just used the index which was offset by the ARM, and this would index into the array incorrectly. To make AArch64 has its own arch enum,or we will do a lot of slowly iterating. 2.Correct a spelling error. The parameter of llvm::AArch64::getArchExtName. 3.Correct a writing mistake, in llvm::ARM::parseArchISA. Differential Revision: https://reviews.llvm.org/D21785 llvm-svn: 276957	2016-07-28 06:11:18 +00:00
David Majnemer	6e9b47bc8a	Add EP_CGSCCOptimizerLate extension point to PassManagerBuilder The EP_CGSCCOptimizerLate extension point allows adding CallGraphSCC passes at the end of the main CallGraphSCC passes and before any function simplification passes run by CGPassManager. Patch by Gor Nishanov! Differential Revision: https://reviews.llvm.org/D22897 llvm-svn: 276953	2016-07-28 03:28:43 +00:00
Justin Lebar	45bcdcbefb	Don't invoke getName() from Function::isIntrinsic(). Summary: getName() involves a hashtable lookup, so is expensive given how frequently isIntrinsic() is called. (In particular, many users cast to IntrinsicInstr or one of its subclasses before calling getIntrinsicID().) This has an incidental functional change: Before, isIntrinsic() would return true for any function whose name started with "llvm.", even if it wasn't properly an intrinsic. The new behavior seems more correct to me, because it's strange to say that isIntrinsic() is true, but getIntrinsicId() returns "not an intrinsic". Some callers want the old behavior -- they want to know whether the caller is a recognized intrinsic, or might be one in some other version of LLVM. For them, we added Function::hasLLVMReservedName(), which checks whether the name starts with "llvm.". This change is good for a 1.5% e2e speedup compiling a large Eigen benchmark. Reviewers: bogner Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D22065 llvm-svn: 276942	2016-07-27 23:46:57 +00:00
George Burgess IV	dbd35c44d4	[CFLAA] Add getModRefBehavior to CFLAnders. This patch lets CFLAnders respond to mod-ref queries. It also includes a small bugfix to CFLSteens. Patch by Jia Chen. Differential Revision: https://reviews.llvm.org/D22823 llvm-svn: 276939	2016-07-27 23:07:07 +00:00
Duncan P. N. Exon Smith	1723821d17	CodeGen: Make iterator-to-pointer conversion explicit, NFC Remove the implicit conversion from MachineInstrBundleIterator to MachineInstr, leaving behind an explicit conversion. I think* this is the last ilist_iterator-related implicit conversion to ilist_node subclass. If I'm right, I can finally dig in and fix the UB in ilist that these conversions were relying on. Note that the implicit users of this conversion have already been removed. If you have out-of-tree code that doesn't update, you might be able to buy some time by temporarily reverting this commit. llvm-svn: 276902	2016-07-27 18:45:18 +00:00
David Majnemer	22039e6858	Fix the build for libstdc++ 4.7 libstdc++ 4.7 doesn't have emplace. Use std::map::insert instead. llvm-svn: 276901	2016-07-27 18:25:12 +00:00
Reid Kleckner	46cb48c74a	Remove MCAsmInfo.h include from TargetOptions.h TargetOptions wants the ExceptionHandling enum. Move that to MCTargetOptions.h to avoid transitively including Dwarf.h everywhere in clang. Now you can add a DWARF tag without a full rebuild of clang semantic analysis. llvm-svn: 276883	2016-07-27 16:03:57 +00:00
Ahmed Bougacha	6756a2c953	[GlobalISel] Introduce an instruction selector. And implement it for AArch64, supporting x/w ADD/OR. Differential Revision: https://reviews.llvm.org/D22373 llvm-svn: 276875	2016-07-27 14:31:55 +00:00
Tim Northover	8658b3c393	GlobalISel: remove variable_ops from output list. The instance in the input operand list allows both inputs and outputs, but the one in (outs) is not treated specially which leads to the MachineVerifier invoking UB (looking at an invalid MCInstrDesc field). No functional change except in UBSan builds (maybe, who knows!), where it fixes the legalize-add.mir test. llvm-svn: 276872	2016-07-27 14:30:49 +00:00
Daniel Sanders	c5537427c2	[mips][ias] Check '$rs = $rd' constraints when both registers are in AsmText. Summary: This is one possible solution to the problem of ignoring constraints that Simon raised in D21473 but it's a bit of a hack. The integrated assembler currently ignores violations of the tied register constraints when the operands involved in a tie are both present in the AsmText. For example, 'dati $rs, $rt, $imm' with the '$rs = $rt' will silently replace $rt with $rs. So 'dati $2, $3, 1' is processed as if the user provided 'dati $2, $2, 1' without any diagnostic being emitted. This is difficult to solve properly because there are multiple parts of the matcher that are silently forcing these constraints to be met. Tied operands are rendered to instructions by cloning previously rendered operands but this is unnecessary because the matcher was already instructed to render the operand it would have cloned. This is also unnecessary because earlier code has already replaced the MCParsedOperand with the one it was tied to (so the parsed input is matched as if it were 'dati <RegIdx 2>, <RegIdx 2>, <Imm 1>'). As a result, it looks like fixing this properly amounts to a rewrite of the tied operand handling which affects all targets. This patch however, merely inserts a checking hook just before the substitution of MCParsedOperands and the Mips target overrides it. It's not possible to accurately check the registers are the same this early (because numeric registers haven't been bound to a register class yet) so it cheats a bit and checks that the tokens that produced the operand are lexically identical. This works because tied registers need to have the same register class but it does have a flaw. It will reject 'dati $4, $a0, 1' for violating the constraint even though $a0 ends up as the same register as $4. Reviewers: sdardis Subscribers: dsanders, llvm-commits, sdardis Differential Revision: https://reviews.llvm.org/D21994 llvm-svn: 276867	2016-07-27 13:49:44 +00:00
John Brawn	3839263204	Revert r276856 "Adjust Registry interface to not require plugins to export a registry" This is causing a huge pile of buildbot failures. llvm-svn: 276857	2016-07-27 11:41:18 +00:00
John Brawn	63aff61019	Adjust Registry interface to not require plugins to export a registry Currently the Registry class contains the vestiges of a previous attempt to allow plugins to be used on Windows without using BUILD_SHARED_LIBS, where a plugin would have its own copy of a registry and export it to be imported by the tool that's loading the plugin. This only works if the plugin is entirely self-contained with the only interface between the plugin and tool being the registry, and in particular this conflicts with how IR pass plugins work. This patch changes things so that instead the add_node function of the registry is exported by the tool and then imported by the plugin, which solves this problem and also means that instead of every plugin having to export every registry they use instead LLVM only has to export the add_node functions. This allows plugins that use a registry to work on Windows if LLVM_EXPORT_SYMBOLS_FOR_PLUGINS is used. Differential Revision: http://reviews.llvm.org/D21385 llvm-svn: 276856	2016-07-27 11:18:38 +00:00
Simon Pilgrim	470b81ca69	Removed unusued template function declaration that has no definition - fixes MSVC warning. llvm-svn: 276852	2016-07-27 10:11:05 +00:00
Sean Silva	285e0974f0	Refactor - CodeExtractor : Move check for valid block to static utility This lets you actually check to see if a block is valid before trying to extract. Patch by River Riddle! Differential Revision: https://reviews.llvm.org/D22699 llvm-svn: 276846	2016-07-27 08:02:46 +00:00
David Majnemer	7855719c10	[coroutines] Part 2 of N: Adding Coroutine Intrinsics This is the second patch in the coroutine series. It adds coroutine intrinsics and updates intrinsic cost in TargetTransformInfoImpl.h. Patch by Gor Nishanov! Differential Revision: https://reviews.llvm.org/D22659 llvm-svn: 276839	2016-07-27 05:12:35 +00:00
Sebastian Pop	18c964d7a4	add a verbose mode to Loop->print() to print all the basic blocks of a loop Differential Revision: https://reviews.llvm.org/D22817 llvm-svn: 276838	2016-07-27 05:02:17 +00:00
Sebastian Pop	9570bfd349	add function isLoopLatch Differential Revision: https://reviews.llvm.org/D22817 llvm-svn: 276837	2016-07-27 05:02:15 +00:00
Sebastian Pop	57a127a7d1	refactor code in verifyLoop: NFC. Use std::any_of as requested in https://reviews.llvm.org/D22816 llvm-svn: 276835	2016-07-27 04:36:06 +00:00
Sebastian Pop	e5730a27f2	Move assert as early as possible Patch written by Aditya Kumar. Differential Revision: https://reviews.llvm.org/D22816 llvm-svn: 276830	2016-07-27 03:30:11 +00:00
Andrew Kaylor	f990fa5f7b	Reverting r276771 due to MSan failures. llvm-svn: 276824	2016-07-27 01:19:24 +00:00
Hans Wennborg	685e8ff953	Revert r276136 "Use ValueOffsetPair to enhance value reuse during SCEV expansion." It causes Clang tests to fail after Windows self-host (PR28705). (Also reverts follow-up r276139.) llvm-svn: 276822	2016-07-26 23:25:13 +00:00
Tim Northover	ad2b717f2c	GlobalISel: add generic load and store instructions. Pretty straightforward, the only oddity is the MachineMemOperand (which it's surprisingly difficult to share code for). llvm-svn: 276799	2016-07-26 20:23:26 +00:00
David Majnemer	6774d612d4	[InstSimplify] Cast folding can be made more generic Use isEliminableCastPair to determine if a pair of casts are foldable. llvm-svn: 276777	2016-07-26 17:58:05 +00:00
Andrew Kaylor	3104a6bad0	Re-committing r275284: add support to inline __builtin_mempcpy Patch by Sunita Marathe Differential Revision: http://reviews.llvm.org/D21920 llvm-svn: 276771	2016-07-26 17:23:13 +00:00
Matt Arsenault	32fc527c65	AMDGPU: Add fp legacy instruction intrinsics This could use some additional optimization work to use mad/mac legacy. llvm-svn: 276764	2016-07-26 16:45:45 +00:00
Tim Northover	756eca35cf	GlobalISel: add specialized buildCopy function to MachineInstrBuilder. NFC. llvm-svn: 276763	2016-07-26 16:45:30 +00:00
Tim Northover	cc5f76226b	GlobalISel: give MachineInstrBuilder a uniform interface. NFC. Instead of an ad-hoc collection of "buildInstr" functions with varying numbers of registers, this uses variadic templates to provide for as many regs as needed! Also make IRtranslator use new "buildBr" function instead of some weird generic one that no-one else would really use. llvm-svn: 276762	2016-07-26 16:45:26 +00:00
Oliver Stannard	2171828a49	[ARM] Implement -mimplicit-it assembler option This option, compatible with gas's -mimplicit-it, controls the generation/checking of implicit IT blocks in ARM/Thumb assembly. This option allows two behaviours that were not possible before: - When in ARM mode, emit a warning when assembling a conditional instruction that is not in an IT block. This is enabled with -mimplicit-it=never and -mimplicit-it=thumb. - When in Thumb mode, automatically generate IT instructions when an instruction with a condition code appears outside of an IT block. This is enabled with -mimplicit-it=thumb and -mimplicit-it=always. The default option is -mimplicit-it=arm, which matches the existing behaviour (allow conditional ARM instructions outside IT blocks without warning, and error if a conditional Thumb instruction is outside an IT block). The general strategy for generating IT blocks in Thumb mode is to keep a small list of instructions which should be in the IT block, and only emit them when we encounter something in the input which means we cannot continue the block. This could be caused by: - A non-predicable instruction - An instruction with a condition not compatible with the IT block - The IT block already contains 4 instructions - A branch-like instruction (including ALU instructions with the PC as the destination), which cannot appear in the middle of an IT block - A label (branching into an IT block is not legal) - A change of section, architecture, ISA, etc - The end of the assembly file. Some of these, such as change of section and end of file, are parsed outside of the ARM asm parser, so I've added a new virtual function to AsmParser to ensure any previously-parsed instructions have been emitted. The ARM implementation of this flushes the currently pending IT block. We now have to try instruction matching up to 3 times, because we cannot know if the current IT block is valid before matching, and instruction matching changes depending on the IT block state (due to the 16-bit ALU instructions, which set the flags iff not in an IT block). In the common case of not having an open implicit IT block and the instruction being matched not needing one, we still only have to run the matcher once. I've removed the ITState.FirstCond variable, because it does not store any information that isn't already represented by CurPosition. I've also updated the comment on CurPosition to accurately describe it's meaning (which this patch doesn't change). Differential Revision: https://reviews.llvm.org/D22760 llvm-svn: 276747	2016-07-26 14:19:47 +00:00
David Majnemer	a90a621d1e	Reapply: [InstSimplify] Add support for bitcasts" This reverts commit r276700 and reapplies r276698. The relevant clang tests have been updated. llvm-svn: 276727	2016-07-26 05:52:29 +00:00
Amaury Sechet	06ac2f4a7e	Propery format doccomment in lto.h . NFC llvm-svn: 276725	2016-07-26 04:20:30 +00:00
Michael Kuperstein	d5617b5545	Attempt to pacify windows bots, again. llvm-svn: 276703	2016-07-25 22:29:04 +00:00
David Majnemer	6e06b577cc	Revert "[InstSimplify] Add support for bitcasts" This reverts commit r276698. Clang has tests which rely on the optimizer :( llvm-svn: 276700	2016-07-25 22:24:59 +00:00
David Majnemer	62611fd3f7	[InstSimplify] Add support for bitcasts BitCasts of BitCasts can be folded away as can BitCasts which don't change the type of the operand. llvm-svn: 276698	2016-07-25 22:04:58 +00:00
Tim Northover	7c9eba90ff	GlobalISel: add generic casts to IRTranslator This adds LLVM's 3 main cast instructions (inttoptr, ptrtoint, bitcast) to the IRTranslator. The first two are direct translations (with 2 MachineInstr types each). Since LLT discards information, a bitcast might become trivial and we emit a COPY in those cases instead. llvm-svn: 276690	2016-07-25 21:01:29 +00:00
Michael Kuperstein	39feb6290c	[PM] Port SymbolRewriter to the new PM Differential Revision: https://reviews.llvm.org/D22703 llvm-svn: 276687	2016-07-25 20:52:00 +00:00
Kevin Enderby	95b0842e64	Next step along the way to getting good error messages for bad archives. I consulted with Lang Hames on this work, and the goal was to add a bit of "where" in the archive the error occurred along with what the error was. So this step changes ArchiveMemberHeader into a class with a pointer to the archive header and the parent archive. Which allows the methods in the ArchiveMemberHeader to determine which member the header is for to include that information in the error message. For this first step the "where" is just the offset to the member in the archive. The next step will be a new method on ArchiveMemberHeader to get the full name, if possible, to be use in the error message. Which will now be possible as ArchiveMemberHeader contains a pointer to the Archive with its string table and its size, etc. so the full name can be determined from the header if it is valid. Also this change adds the missing checks the archive header is actually contained in the buffer and is not truncated, as well as if the terminating characters are correct in the header. And changes one error message in Archive::Child::getNext() where the name or offset to member is now added. llvm-svn: 276686	2016-07-25 20:36:36 +00:00
Jordan Rose	10697a7c34	Fix r276671 to not use a defaulted move constructor. MSVC won't provide the body of this move constructor and assignment operator, possibly because the copy constructor is banned. Just write it manually. llvm-svn: 276685	2016-07-25 20:34:25 +00:00
Jan Vesely	b64c8925e9	AMDGPU: Remove read_workdim intrinsic Differential revision: https://reviews.llvm.org/D22732 llvm-svn: 276682	2016-07-25 20:17:02 +00:00
Matt Arsenault	df3e224632	LiveIntervals: Return index from replaceMachineInstrInMaps Fixes weird asymmetry with insertion llvm-svn: 276678	2016-07-25 19:39:04 +00:00
Jordan Rose	f85a95fdcb	StringSwitch cannot be copied (take 2). This prevents StringSwitch from being used with 'auto', which is important because the inferred type is StringSwitch rather than the result type. This is a problem because StringSwitch stores addresses of temporary values rather than copying or moving the value into its own storage. This is a compromise that still allows wrapping StringSwitch in other temporary structures, which (unlike StringSwitch) may be non-trivial to set up and therefore want to at least be movable. (For an example, see QueryParser.cpp in clang-tools-extra.) Changing this uncovered the bug in PassBuilder, also in this patch. Clang doesn't seem to have any occurrences of the issue. Re-commit of r276652. llvm-svn: 276671	2016-07-25 18:34:51 +00:00
Zachary Turner	336e919b01	Add a modulemap for LLVMDebugInfoMsf. Differential Revision: https://reviews.llvm.org/D22769 llvm-svn: 276669	2016-07-25 18:18:59 +00:00
Michael Kuperstein	8f8e1d1bf6	Don't use iplist in SymbolRewriter. NFC. There didn't appear to be a good reason to use iplist in this case, a regular list of unique_ptr works just as well. Change made in preparation to a new PM port (since iplist is not moveable). llvm-svn: 276668	2016-07-25 18:10:54 +00:00
Jordan Rose	9978dec4c2	Revert "StringSwitch cannot be copied or moved." This reverts commit r276652. The clang-query tool is currently relying on this behavior. I'll try again later. llvm-svn: 276661	2016-07-25 17:28:33 +00:00
Joel Jones	373d7d30dd	MC] Provide an MCTargetOptions to implementors of MCAsmBackendCtorTy, NFC Some targets, notably AArch64 for ILP32, have different relocation encodings based upon the ABI. This is an enabling change, so a future patch can use the ABIName from MCTargetOptions to chose which relocations to use. Tested using check-llvm. The corresponding change to clang is in: http://reviews.llvm.org/D16538 Patch by: Joel Jones Differential Revision: https://reviews.llvm.org/D16213 llvm-svn: 276654	2016-07-25 17:18:28 +00:00
Jordan Rose	0cdbe7a572	StringSwitch cannot be copied or moved. ...but most importantly, it cannot be used well with 'auto', because the inferred type is StringSwitch rather than the result type. This is a problem because StringSwitch stores addresses of temporary values rather than copying or moving the value into its own storage. Changing this uncovered the bug in PassBuilder, also in this patch. Clang doesn't seem to have any occurrences of the issue. llvm-svn: 276652	2016-07-25 17:08:24 +00:00
Sean Silva	fe5abd5e0c	Fix : Partial Inliner requires AssumptionCacheTracker The public InlineFunction utility assumes that the passed in InlineFunctionInfo has a valid AssumptionCacheTracker. Patch by River Riddle! Differential Revision: https://reviews.llvm.org/D22706 llvm-svn: 276609	2016-07-25 05:00:00 +00:00
Elena Demikhovsky	376a18bd92	[Loop Vectorizer] Handling loops FP induction variables. Allowed loop vectorization with secondary FP IVs. Like this: float *A; float x = init; for (int i=0; i < N; ++i) { A[i] = x; x -= fp_inc; } The auto-vectorization is possible when the induction binary operator is "fast" or the function has "unsafe" attribute. Differential Revision: https://reviews.llvm.org/D21330 llvm-svn: 276554	2016-07-24 07:24:54 +00:00
Xinliang David Li	9239245401	[Profile] Use explicit flag to enable IR PGO Patch by Jake VanAdrighem Differential Revision: http://reviews.llvm.org/D22607 llvm-svn: 276516	2016-07-23 04:28:52 +00:00
Sean Silva	ab6a683765	Avoid using a raw AssumptionCacheTracker in various inliner functions. This unblocks the new PM part of River's patch in https://reviews.llvm.org/D22706 Conveniently, this same change was needed for D21921 and so these changes are just spun out from there. llvm-svn: 276515	2016-07-23 04:22:50 +00:00
Sanjoy Das	a7d9ec8751	[SCEV] Make isImpliedCondOperandsViaRanges smarter This change lets us prove things like "{X,+,10} s< 5000" implies "{X+7,+,10} does not sign overflow" It does this by replacing replacing getConstantDifference by computeConstantDifference (which is smarter) in isImpliedCondOperandsViaRanges. llvm-svn: 276505	2016-07-23 00:54:36 +00:00
Sanjoy Das	0b1af85cc2	[SCEV] Change the interface of computeConstantDifference; NFC This is in preparation of s/getConstantDifference/computeConstantDifference/ in a later change. llvm-svn: 276503	2016-07-23 00:28:56 +00:00
Adam Nemet	9e6e63fba2	[LoopDataPrefetch] Include hotness of region in opt remark llvm-svn: 276488	2016-07-22 22:53:17 +00:00
Tim Northover	98a56eb7f4	GlobalISel: allow multiple types on MachineInstrs. llvm-svn: 276481	2016-07-22 22:13:36 +00:00
Vedant Kumar	c9d3a173fb	[Coverage] Mark more methods const (NFC) llvm-svn: 276474	2016-07-22 21:11:55 +00:00
Anna Thomas	58d1192a22	Add invariant start call creation in IRBuilder.NFC Differential Revision: https://reviews.llvm.org/D22700 llvm-svn: 276471	2016-07-22 20:57:23 +00:00
Pete Cooper	fea2139740	Use RValue refs in APInt add/sub methods. This adds versions of operator + and - which are optimized for the LHS/RHS of the operator being RValue's. When an RValue is available, we can use its storage space instead of allocating new space. On code such as ConstantRange which makes heavy use of APInt's over 64-bits in size, this results in significant numbers of saved allocations. Thanks to David Blaikie for all the review and most of the code here. llvm-svn: 276470	2016-07-22 20:55:46 +00:00
Sanjoy Das	095f5b204f	[SCEV] Extract out a helper function; NFC The helper will get smarter in a later change, but right now this is just code reorganization. llvm-svn: 276467	2016-07-22 20:47:55 +00:00
Tim Northover	33b07d6725	GlobalISel: implement legalization pass, with just one transformation. This adds the actual MachineLegalizeHelper to do the work and a trivial pass wrapper that legalizes all instructions in a MachineFunction. Currently the only transformation supported is splitting up a vector G_ADD into one acting on smaller vectors. llvm-svn: 276461	2016-07-22 20:03:43 +00:00
Zachary Turner	e4a4f33daf	Make PDBFile store an msf::Layout. Previously it was storing all the fields of an msf::Layout as separate members. This is a trivial cleanup to make it store an msf::Layout directly. This makes the code more readable since it becomes clear which fields of PDBFile are actually the msf specific layout information in a sea of other bookkeeping fields. llvm-svn: 276460	2016-07-22 19:56:33 +00:00
Zachary Turner	e109dc63f9	[pdb] Have builders share a single BumpPtrAllocator. This makes it easier to have the writable and readable PDB interfaces share code since the read/write and write-only interfaces now share a single allocator, you don't have to worry about a builder building a read only interface and then having the read-only interface's data become corrupt when the builder goes out of scope. Now the allocator is specified explicitly to all constructors, so all interfaces can share a single allocator that is scoped appropriately. llvm-svn: 276459	2016-07-22 19:56:26 +00:00
Zachary Turner	bac69d33d0	[msf] Create LLVMDebugInfoMsf This provides a better layering of responsibilities among different aspects of PDB writing code. Some of the MSF related code was contained in CodeView, and some was in PDB prior to this. Further, we were often saying PDB when we meant MSF, and the two are actually independent of each other since in theory you can have other types of data besides PDB data in an MSF. So, this patch separates the MSF specific code into its own library, with no dependencies on anything else, and DebugInfoCodeView and DebugInfoPDB take dependencies on DebugInfoMsf. llvm-svn: 276458	2016-07-22 19:56:05 +00:00
Wei Mi	e04d0eff29	[PM] Port BreakCriticalEdges to the new PM. Differential Revision: https://reviews.llvm.org/D22688 llvm-svn: 276449	2016-07-22 18:04:25 +00:00
Anna Thomas	0be4a0e6a4	Invariant start/end intrinsics overloaded for address space Summary: The llvm.invariant.start and llvm.invariant.end intrinsics currently support specifying invariant memory objects only in the default address space. With this change, these intrinsics are overloaded for any adddress space for memory objects and we can use these llvm invariant intrinsics in non-default address spaces. Example: llvm.invariant.start.p1i8(i64 4, i8 addrspace(1)* %ptr) This overloaded intrinsic is needed for representing final or invariant memory in managed languages. Reviewers: apilipenko, reames Subscribers: llvm-commits llvm-svn: 276447	2016-07-22 17:49:40 +00:00
Matt Arsenault	8d718dcfda	AMDGPU: Add HSA dispatch id intrinsic llvm-svn: 276437	2016-07-22 17:01:30 +00:00
Tim Northover	bd5054602e	GlobalISel: implement alloca instruction llvm-svn: 276433	2016-07-22 16:59:52 +00:00
Xinliang David Li	c27f1b7182	[Profile] Cleanup: remove unused interface llvm-svn: 276431	2016-07-22 16:11:56 +00:00
Lang Hames	5e51a2e31a	[Support] Make ErrorAsOutParameter take an Error* rather than an Error&. This allows ErrorAsOutParameter to work better with "optional" errors. For example, consider a function where for certain input values it is known that the function can't fail. This can now be written as: Result foo(Arg X, Error Err) { ErrorAsOutParameter EAO(Err); if (<Error Condition>) { if (Err) Err = <report error>; else llvm_unreachable("Unexpected failure!"); } } Rather than having to construct an ErrorAsOutParameter under every conditional where Err is known to be non-null. llvm-svn: 276430	2016-07-22 16:11:25 +00:00
Zachary Turner	b383d628df	[pdb] Move file layout header structs to RawTypes.h This facilitates code reuse between the builder classes and the "frozen" read only versions of the classes used for parsing existing PDB files. llvm-svn: 276427	2016-07-22 15:46:46 +00:00
Zachary Turner	d218c26124	[pdb] Round-trip module & file info to/from YAML. This implements support for writing compiland and compiland source file info to a binary PDB. This is tested by adding support for dumping these fields from an existing PDB to yaml, reading them back in, and dumping them again and verifying the values are as expected. llvm-svn: 276426	2016-07-22 15:46:37 +00:00
Simon Pilgrim	ea0d4f9962	[X86][AVX] Added support for lowering to VBROADCASTF128/VBROADCASTI128 (reapplied) As reported on PR26235, we don't currently make use of the VBROADCASTF128/VBROADCASTI128 instructions (or the AVX512 equivalents) to load+splat a 128-bit vector to both lanes of a 256-bit vector. This patch enables lowering from subvector insertion/concatenation patterns and auto-upgrades the llvm.x86.avx.vbroadcastf128.pd.256 / llvm.x86.avx.vbroadcastf128.ps.256 intrinsics to match. We could possibly investigate using VBROADCASTF128/VBROADCASTI128 to load repeated constants as well (similar to how we already do for scalar broadcasts). Reapplied with fix for PR28657 - removed intrinsic definitions (clang companion patch to be be submitted shortly). Differential Revision: https://reviews.llvm.org/D22460 llvm-svn: 276416	2016-07-22 13:58:44 +00:00
Xinliang David Li	d382e9d82b	Sync up InstrProfData.inc with compiler-rt with fixes to references llvm-svn: 276388	2016-07-22 04:46:56 +00:00
Xinliang David Li	6e9dee999d	Revert 276386 llvm-svn: 276387	2016-07-22 04:31:26 +00:00
Xinliang David Li	a182e58265	Sync up InstrProfData.inc with compiler-rt llvm-svn: 276386	2016-07-22 04:18:17 +00:00
Pete Cooper	b2ba776aed	Avoid dsymutil calls to getFileNameByIndex. This change adds a hasFileAtIndex method. getChildDeclContext can first call this method, and if it returns true it knows it can then lookup the resolved path cache for the given file index. If we hit that cache then we don't even have to call getFileNameByIndex. Running dsymutil against the swift executable built from github gives a 20% performance improvement without any change in the binary. Differential Revision: https://reviews.llvm.org/D22655 Reviewed by friss. llvm-svn: 276380	2016-07-22 01:41:32 +00:00
Xinliang David Li	6f8c504f10	[Profile] deprecate __llvm_profile_override_default_filename This eliminates unncessary calls and init functions. Differential Revision: http://reviews.llvm.org/D22613 llvm-svn: 276354	2016-07-21 23:19:10 +00:00
Wei Mi	1cf58f8996	[PM] Port NaryReassociate to the new PM Differential Revision: https://reviews.llvm.org/D22648 llvm-svn: 276349	2016-07-21 22:28:52 +00:00
Rong Xu	97b68c5ebe	[PGO] Make needsComdatForCounter() available (NFC) Move needsComdatForCounter() to lib/ProfileData/InstrProf.cpp from lib/Transforms/Instrumentation/InstrProfiling.cpp to make is available for other files. Differential Revision: https://reviews.llvm.org/D22643 llvm-svn: 276330	2016-07-21 20:50:02 +00:00
Anna Thomas	c858faa244	Revert "Invariant start/end intrinsics overloaded for address space" This reverts commit r276316. llvm-svn: 276320	2016-07-21 19:06:28 +00:00
Anna Thomas	29b24dfe44	Invariant start/end intrinsics overloaded for address space Summary: The llvm.invariant.start and llvm.invariant.end intrinsics currently support specifying invariant memory objects only in the default address space. With this change, these intrinsics are overloaded for any adddress space for memory objects and we can use these llvm invariant intrinsics in non-default address spaces. Example: llvm.invariant.start.p1i8(i64 4, i8 addrspace(1)* %ptr) This overloaded intrinsic is needed for representing final or invariant memory in managed languages. Reviewers: tstellarAMD, reames, apilipenko Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D22519 llvm-svn: 276316	2016-07-21 18:41:44 +00:00
Quentin Colombet	2b59eab79f	[IRTranslator] Add G_SUB opcode. This commit adds a generic SUB opcode to global-isel. llvm-svn: 276308	2016-07-21 17:26:50 +00:00
Quentin Colombet	7bcc921dd8	[IRTranslator] Add G_AND opcode. This commit adds a generic AND opcode to global-isel. llvm-svn: 276297	2016-07-21 15:50:42 +00:00
Konstantin Zhuravlyov	155626238b	AMDGPU/SI: Add support for R_AMDGPU_ABS32 Differential Revision: https://reviews.llvm.org/D21646 llvm-svn: 276294	2016-07-21 15:29:19 +00:00
Benjamin Kramer	3f1edd7e0c	Weaken ThreadSafeRefCountedBase atomics. Doesn't make a difference on x86, but avoids memory barriers on weakly-ordered archs like PowerPC and ARM. llvm-svn: 276291	2016-07-21 15:06:50 +00:00
Benjamin Kramer	857754a1cb	[DenseMap] Add a C++17-style try_emplace method. This provides an elegant pattern to solve the "construct if not in map already" problem we have many times in LLVM. Without try_emplace we either have to rely on a sentinel value (nullptr) or do two lookups. llvm-svn: 276277	2016-07-21 13:37:53 +00:00
Benjamin Kramer	eab3d36753	Rename StringMap::emplace_second to try_emplace. Coincidentally this function maps to the C++17 try_emplace. Rename it for consistentcy with C++17 std::map. NFC. llvm-svn: 276276	2016-07-21 13:37:48 +00:00
Amaury Sechet	17b67cd1ad	Expose AttributeSetNode, use it to provide aggregate getter for attribute in the C API. Summary: See D19181 for context. Reviewers: whitequark, Wallbraker, jyknight, echristo, bkramer, void Subscribers: mehdi_amini Differential Revision: http://reviews.llvm.org/D21265 llvm-svn: 276236	2016-07-21 04:25:06 +00:00
Adam Nemet	cbe2a9b213	[OptDiag] Missed these when making the IR Value a const pointer llvm-svn: 276224	2016-07-21 01:11:12 +00:00
Adam Nemet	7cfd5971ab	[OptDiag,LV] Add hotness attribute to applied-optimization remarks Test coverage is provided by modifying the function in the FP-math testcase that we are allowed to vectorize. llvm-svn: 276223	2016-07-21 01:07:13 +00:00
Adam Nemet	0e0e2d5d26	[OptDiag,LV] Add hotness attribute to the derived analysis remarks This includes FPCompute and Aliasing. Testcase is based on no_fpmath.ll. llvm-svn: 276211	2016-07-20 23:50:32 +00:00
Tim Northover	cffc0d20fb	GlobalISel: Remove explicit enumerator values from .def file. They were all auto-incremented from 0 anyway, and I'm getting really annoying conflicts and runtime failures when different people add more for GlobalISel (and even when I'm refactoring my own patches). NFC. llvm-svn: 276204	2016-07-20 22:58:01 +00:00
Adam Nemet	5b3a5cf6b0	[OptDiag,LV] Add hotness attribute to analysis remarks The earlier change added hotness attribute to missed-optimization remarks. This follows up with the analysis remarks (the ones explaining the reason for the missed optimization). llvm-svn: 276192	2016-07-20 21:44:26 +00:00
Adam Nemet	6100d16e7d	[OptDiag] Take the IR Value as a const pointer This helps because LoopAccessReport is passed around as a const reference and we derive the basic block passed as the Value parameter from the instruction in LoopAccessReport. llvm-svn: 276191	2016-07-20 21:44:22 +00:00
Tim Northover	75ad077330	GlobalISel: implement Legalization querying framework. This adds an (incomplete, inefficient) framework for deciding what to do with some operation on a given type. llvm-svn: 276184	2016-07-20 21:13:29 +00:00
George Burgess IV	400ae40348	[MSSA] Add an overload for getClobberingMemoryAccess. A seemingly common use for the walker's getClobberingMemoryAccess function is: ``` MemoryAccess getClobber(MemorySSAWalker W, MemoryUseOrDef MUD) { const Instruction I = MUD->getMemoryInst(); return W->getClobberingMemoryAccess(I); } ``` Which is kind of redundant, since walkers will ultimately query MSSA to find out which MemoryAccess `I` maps to (...which is always `MUD`). So, this patch adds an overload of getClobberingMemoryAccess that accepts MemoryAccesses directly. As a result, the Instruction overload of getClobberingMemoryAccess becomes a lightweight wrapper around our new overload. Additionally, this patch un`virtual`izes the Instruction overload of getClobberingMemoryAccess, since there doesn't seem to be a walker that benefits from that being virtual, and I can't think of how else one would implement it. Happy to make it virtual again if we would benefit from doing so. llvm-svn: 276169	2016-07-20 19:51:34 +00:00
Tim Northover	d3f047a38f	GlobalISel: properly conditionalize LLT use. We can't guard the include of LowLevelType.h because getType and setType are (trivial) functions even when GlobalISel isn't built. llvm-svn: 276160	2016-07-20 19:17:29 +00:00
Tim Northover	62ae568bbb	GlobalISel: implement low-level type with just size & vector lanes. This should be all the low-level instruction selection needs to determine how to implement an operation, with the remaining context taken from the opcode (e.g. G_ADD vs G_FADD) or other flags not based on type (e.g. fast-math). llvm-svn: 276158	2016-07-20 19:09:30 +00:00
Adam Nemet	546675cc7f	[OptDiag] Fix function comment Function is not passed unlike in the original of this (llvm::emitOptimizationRemarkMissed). llvm-svn: 276150	2016-07-20 18:16:45 +00:00
Sanjay Patel	683170bf56	move decomposeBitTestICmp() to Transforms/Utils; NFC As noted in https://reviews.llvm.org/D22537 , we can use this functionality in visitSelectInstWithICmp() and InstSimplify, but currently we have duplicated code. llvm-svn: 276140	2016-07-20 17:18:45 +00:00
Wei Mi	db80c0c77f	Use ValueOffsetPair to enhance value reuse during SCEV expansion. In D12090, the ExprValueMap was added to reuse existing value during SCEV expansion. However, const folding and sext/zext distribution can make the reuse still difficult. A simplified case is: suppose we know S1 expands to V1 in ExprValueMap, and S1 = S2 + C_a S3 = S2 + C_b where C_a and C_b are different SCEVConstants. Then we'd like to expand S3 as V1 - C_a + C_b instead of expanding S2 literally. It is helpful when S2 is a complex SCEV expr and S2 has no entry in ExprValueMap, which is usually caused by the fact that S3 is generated from S1 after const folding. In order to do that, we represent ExprValueMap as a mapping from SCEV to ValueOffsetPair. We will save both S1->{V1, 0} and S2->{V1, C_a} into the ExprValueMap when we create SCEV for V1. When S3 is expanded, it will first expand S2 to V1 - C_a because of S2->{V1, C_a} in the map, then expand S3 to V1 - C_a + C_b. Differential Revision: https://reviews.llvm.org/D21313 llvm-svn: 276136	2016-07-20 16:40:33 +00:00
Sanjay Patel	be53c65fab	fix documentation comments; NFC llvm-svn: 276135	2016-07-20 16:30:55 +00:00
Adam Nemet	67c8929a2c	[LV] Add hotness attribute to missed-optimization remarks The new OptimizationRemarkEmitter analysis pass is hooked up to both new and old PM passes. llvm-svn: 276080	2016-07-20 04:03:43 +00:00
Matthias Braun	5b9722d6c7	Revert "RegScavenging: Add scavengeRegisterBackwards()" Reverting this commit for now as it seems to be causing failures on test-suite tests on the clang-ppc64le-linux-lnt bot. This reverts commit r276044. llvm-svn: 276068	2016-07-20 00:21:32 +00:00
Sean Silva	e3c18a5ae8	[PM] Port LoopUnroll. We just set PreserveLCSSA to always true since we don't have an analogous method `mustPreserveAnalysisID(LCSSA)`. Also port LoopInfo verifier pass to test LoopUnrollPass. llvm-svn: 276063	2016-07-19 23:54:23 +00:00
Kyle Butt	9e52c064c2	Codegen: Factor out canTailDuplicate canTailDuplicate accepts two blocks and returns true if the first can be duplicated into the second successfully. Use this function to encapsulate the heuristic. llvm-svn: 276062	2016-07-19 23:54:21 +00:00
Justin Lebar	7ab570ec3a	[ADT] Warn on unused results from ArrayRef and StringRef functions that read like they might mutate. Summary: Functions like "slice" and "drop_front" sound like they might mutate the underlying object, but they don't. Warning on unused results would have saved me an hour yesterday, and I'm sure I'm not the only one. LLVM and Clang are clean wrt this warning after D22540. Reviewers: majnemer Subscribers: sanjoy, chandlerc, llvm-commits Differential Revision: https://reviews.llvm.org/D22541 llvm-svn: 276058	2016-07-19 23:19:25 +00:00
Daniel Berlin	5c46b943db	Make MemorySSA::dominates/locallydominates constant time Summary: Make MemorySSA::dominates/locallydominates constant time Reviewers: george.burgess.iv, gberry Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D22527 llvm-svn: 276046	2016-07-19 22:49:43 +00:00
Chandler Carruth	2aff750cb8	Add AIX support to Path.inc, Host.h, and CMake. Patch by Andrew Paprocki! Differential Revision: https://reviews.llvm.org/D18359 llvm-svn: 276045	2016-07-19 22:46:39 +00:00
Matthias Braun	84fd4bee6c	RegScavenging: Add scavengeRegisterBackwards() This is a variant of scavengeRegister() that works for enterBasicBlockEnd()/backward(). The benefit of the backward mode is that it is not affected by incomplete kill flags. This patch also changes PrologEpilogInserter::doScavengeFrameVirtualRegs() to use the register scavenger in backwards mode. Differential Revision: http://reviews.llvm.org/D21885 llvm-svn: 276044	2016-07-19 22:37:09 +00:00
Matthias Braun	4cb68e1048	RegisterScavenger: Introduce backward() mode. This adds two pieces: - RegisterScavenger:::enterBasicBlockEnd() which behaves similar to enterBasicBlock() but starts tracking at the end of the basic block. - A RegisterScavenger::backward() method. It is subtly different from the existing unprocess() method which only considers uses with the kill flag set: If a value is dead at the end of a basic block with a last use inside the basic block, unprocess() will fail to mark it as live. However we cannot change/fix this behaviour because unprocess() needs to perform the exact reverse operation of forward(). Differential Revision: http://reviews.llvm.org/D21873 llvm-svn: 276043	2016-07-19 22:37:02 +00:00
Kevin Enderby	6524bd8c00	Next step along the way to getting good error messages for bad archives. This step builds on Lang Hames work to change Archive::child_iterator for better interoperation with Error/Expected. Building on that it is now possible to return an error message when the size field of an archive contains non-decimal characters. llvm-svn: 276025	2016-07-19 20:47:07 +00:00
Rafael Espindola	3816c53f04	Use posix_fallocate instead of ftruncate. This makes sure that space is actually available. With this change running lld on a full file system causes it to exit with failed to open foo: No space left on device instead of crashing with a sigbus. llvm-svn: 276017	2016-07-19 20:19:56 +00:00
David Majnemer	938a6c7ce0	[RegionInfo] Some cleanups - Use unique_ptr instead of managing a container of new'd pointers. - Use range based for loops. No functional change is intended. llvm-svn: 276001	2016-07-19 17:50:30 +00:00
Simon Pilgrim	0ea8d275cc	[X86][SSE] Reimplement SSE fp2si conversion intrinsics instead of using generic IR D20859 and D20860 attempted to replace the SSE (V)CVTTPS2DQ and VCVTTPD2DQ truncating conversions with generic IR instead. It turns out that the behaviour of these intrinsics is different enough from generic IR that this will cause problems, INF/NAN/out of range values are guaranteed to result in a 0x80000000 value - which plays havoc with constant folding which converts them to either zero or UNDEF. This is also an issue with the scalar implementations (which were already generic IR and what I was trying to match). This patch changes both scalar and packed versions back to using x86-specific builtins. It also deals with the other scalar conversion cases that are runtime rounding mode dependent and can have similar issues with constant folding. A companion clang patch is at D22105 Differential Revision: https://reviews.llvm.org/D22106 llvm-svn: 275981	2016-07-19 15:07:43 +00:00
Simon Pilgrim	766345e331	Get rid of VS2015 operator precedence warning. NFCI. llvm-svn: 275971	2016-07-19 12:26:51 +00:00
Daniel Sanders	2cb55d7dfd	[mips] Recognise the triple used by Debian stretch for mips64el. Summary: The triple used for this distribution is mips64el-linux-gnuabi64. Reviewers: sdardis Subscribers: sdardis, llvm-commits Differential Revision: https://reviews.llvm.org/D22406 llvm-svn: 275966	2016-07-19 10:22:19 +00:00
Tobias Grosser	3a49a8e13c	Style: drop some unnecessary ';' [NFC] llvm-svn: 275963	2016-07-19 09:01:46 +00:00
George Burgess IV	5f30897b7b	[MemorySSA] Update to the new shiny walker. This patch updates MemorySSA's use-optimizing walker to be more accurate and, in some cases, faster. Essentially, this changed our core walking algorithm from a cache-as-you-go DFS to an iteratively expanded DFS, with all of the caching happening at the end. Said expansion happens when we hit a Phi, P; we'll try to do the smallest amount of work possible to see if optimizing above that Phi is legal in the first place. If so, we'll expand the search to see if we can optimize to the next phi, etc. An iteratively expanded DFS lets us potentially quit earlier (because we don't assume that we can optimize above all phis) than our old walker. Additionally, because we don't cache as we go, we can now optimize above loops. As an added bonus, this patch adds a ton of verification (if EXPENSIVE_CHECKS are enabled), so finding bugs is easier. Differential Revision: https://reviews.llvm.org/D21777 llvm-svn: 275940	2016-07-19 01:29:15 +00:00
Vedant Kumar	e3a0bf5048	Retry: [llvm-profdata] Speed up merging by using a thread pool Add a "-j" option to llvm-profdata to control the number of threads used. Auto-detect NumThreads when it isn't specified, and avoid spawning threads when they wouldn't be beneficial. I tested this patch using a raw profile produced by clang (147MB). Here is the time taken to merge 4 copies together on my laptop: No thread pool: 112.87s user 5.92s system 97% cpu 2:01.08 total With 2 threads: 134.99s user 26.54s system 164% cpu 1:33.31 total Changes since the initial commit: - When handling odd-length inputs, call ThreadPool::wait() before merging the last profile. Should fix a race/off-by-one (see r275937). Differential Revision: https://reviews.llvm.org/D22438 llvm-svn: 275938	2016-07-19 01:17:20 +00:00
Vedant Kumar	21ab20e005	Revert "[llvm-profdata] Speed up merging by using a thread pool" This reverts commit r275921. It broke the ppc64be bot: http://lab.llvm.org:8011/builders/clang-ppc64be-linux-multistage/builds/3537 I'm not sure why it broke, but based on the output, it looks like an off-by-one (one profile left un-merged). llvm-svn: 275937	2016-07-19 00:57:09 +00:00
Matt Arsenault	4cb438b93c	TableGen: Allow custom register operand decoder method This is for a situation where the encoding for a register may be different depending on the specific operand. For some instructions, we want to apply additional restrictions beyond the encoding's constraints. In AMDGPU some operands are VSrc_32, using the VS_32 pseudo register class which accept VGPRs, SGPRs, or immediates in the encoding. Some specific instructions with the same encoding operand do not want to allow immediates or SGPRs, but the encoding format is different in this case than a regular VGPR_32 operand. This allows specifying the encoding should be treated the same without introducing yet another dummy register class. llvm-svn: 275929	2016-07-18 23:20:46 +00:00
Vedant Kumar	0bd9907581	[llvm-profdata] Speed up merging by using a thread pool Add a "-j" option to llvm-profdata to control the number of threads used. Auto-detect NumThreads when it isn't specified, and avoid spawning threads when they wouldn't be beneficial. I tested this patch using a raw profile produced by clang (147MB). Here is the time taken to merge 4 copies together on my laptop: No thread pool: 112.87s user 5.92s system 97% cpu 2:01.08 total With 2 threads: 134.99s user 26.54s system 164% cpu 1:33.31 total Differential Revision: https://reviews.llvm.org/D22438 llvm-svn: 275921	2016-07-18 22:02:39 +00:00
Dehao Chen	6132ee8502	[PM] Convert Loop Strength Reduce pass to new PM Summary: Convert Loop String Reduce pass to new PM Reviewers: davidxl, silvas Subscribers: junbuml, sanjoy, mzolotukhin, llvm-commits Differential Revision: https://reviews.llvm.org/D22468 llvm-svn: 275919	2016-07-18 21:41:50 +00:00
Mehdi Amini	4d74631ea4	Update doxygen description for `WriteBitcodeToFile()` API (NFC) llvm-svn: 275917	2016-07-18 21:29:24 +00:00
Teresa Johnson	2124157102	[PM] Port FunctionImport Pass to new PM Summary: Port FunctionImport Pass to new PM. Reviewers: mehdi_amini, davide Subscribers: davidxl, llvm-commits Differential Revision: https://reviews.llvm.org/D22475 llvm-svn: 275916	2016-07-18 21:22:24 +00:00
Justin Lebar	4133584504	Write isUInt using template specializations to work around an incorrect MSVC warning. Summary: Per D22441, MSVC warns on our old implementation of isUInt<64>. It sees uint64_t(1) << 64 and doesn't realize that it's not going to be executed. Writing as a template specialization is ugly, but prevents the warning. Reviewers: RKSimon Subscribers: majnemer, llvm-commits Differential Revision: https://reviews.llvm.org/D22472 llvm-svn: 275909	2016-07-18 20:40:35 +00:00
Matt Arsenault	c96e1deffa	AMDGPU: Add intrinsic for s_flbit_i32/v_ffbh_i32 llvm-svn: 275871	2016-07-18 18:35:05 +00:00
Matt Arsenault	4c519d3518	AMDGPU/R600: Replace barrier intrinsics llvm-svn: 275870	2016-07-18 18:34:59 +00:00
David Majnemer	a2a218fbd4	[MathExtras] Fix UB in minIntN We negated a value with a signed type which invited problems when that value was the most negative signed number. Use an unsigned type for the value instead. It will compute the same twos complement result without the UB. llvm-svn: 275815	2016-07-18 17:03:09 +00:00
Adam Nemet	b2593f78ca	[LoopDist] Port to new PM Summary: The direct motivation for the port is to ensure that the OptRemarkEmitter tests work with the new PM. This remains a function pass because we not only create multiple loops but could also version the original loop. In the test I need to invoke opt with -passes='require<aa>,loop-distribute'. LoopDistribute does not directly depend on AA however LAA does. LAA uses getCachedResult so I think we need manually pull in 'aa'. Reviewers: davidxl, silvas Subscribers: sanjoy, llvm-commits, mzolotukhin Differential Revision: https://reviews.llvm.org/D22437 llvm-svn: 275811	2016-07-18 16:29:27 +00:00
Adam Nemet	79ac42a5c9	[OptRemarkEmitter] Port to new PM Summary: The main goal is to able to start using the new OptRemarkEmitter analysis from the LoopVectorizer. Since the vectorizer was recently converted to the new PM, it makes sense to convert this analysis as well. This pass is currently tested through the LoopDistribution pass, so I am also porting LoopDistribution to get coverage for this analysis with the new PM. Reviewers: davidxl, silvas Subscribers: llvm-commits, mzolotukhin Differential Revision: https://reviews.llvm.org/D22436 llvm-svn: 275810	2016-07-18 16:29:21 +00:00
Simon Dardis	d32a2d30cb	[inlineasm] Propagate operand constraints to the backend When SelectionDAGISel transforms a node representing an inline asm block, memory constraint information is not preserved. This can cause constraints to be broken when a memory offset is of the form: offset + frame index when the frame is resolved. By propagating the constraints all the way to the backend, targets can enforce memory operands of inline assembly to conform to their constraints. For MIPSR6, some instructions had their offsets reduced to 9 bits from 16 bits such as ll/sc. This becomes problematic when using inline assembly to perform atomic operations, as an offset can generated that is too big to encode in the instruction. Reviewers: dsanders, vkalintris Differential Review: https://reviews.llvm.org/D21615 llvm-svn: 275786	2016-07-18 13:17:31 +00:00
Diana Picus	774d157a5d	[ARM] Honour ABI for rem under -O0 for EABI, GNUEABI, Android and Musl At higher optimization levels, we generate the libcall for DIVREM_Ix, which is fine: aeabi_{u\|i}divmod. At -O0 we generate the one for REM_Ix, which is the default {u}mod{q\|h\|s\|d}i3. This commit makes sure that we don't generate REM_Ix calls for ABIs that don't support them (i.e. where we need to use DIVREM_Ix instead). This is achieved by bailing out of FastISel, which can't handle non-double multi-reg returns, and letting the legalization infrastructure expand the REM_Ix calls. It also updates the divmod-eabi.ll test to run under -O0 as well, and adds some Windows checks to it to make sure we don't break things for it. Fixes PR27068 Differential Revision: https://reviews.llvm.org/D21926 llvm-svn: 275773	2016-07-18 06:48:25 +00:00
Justin Lebar	b59c1dd5cf	Avoid UB in maxIntN(64). Summary: Previously we were relying on 2's complement underflow in an int64_t. Now we cast to a uint64_t so we explicitly get the behavior we want. Reviewers: rnk Subscribers: dylanmckay, llvm-commits Differential Revision: https://reviews.llvm.org/D22445 llvm-svn: 275722	2016-07-17 18:19:26 +00:00
Justin Lebar	6df6bde694	Clean up some comments in MathExtras.h. Reviewers: rnk Subscribers: llvm-commits, dylanmckay Differential Revision: https://reviews.llvm.org/D22444 llvm-svn: 275721	2016-07-17 18:19:25 +00:00
Justin Lebar	ab549c8187	Add assertions checking SignExtend{32,64}'s bit width. Summary: The bit width must be greater than zero, otherwise we shift by the integer's width, which is UB. Also (more obviously) the width must be less than or equal to the integer's width, otherwise we shift by a negative number, which is also UB. Reviewers: rnk Subscribers: llvm-commits, dylanmckay Differential Revision: https://reviews.llvm.org/D22442 llvm-svn: 275720	2016-07-17 18:19:23 +00:00
Justin Lebar	cbba3c4aef	Fix isShiftedInt and isShiftedUint for widths > 32. Summary: Previously we were doing 1 << S. "1" is an int, so this doesn't work when S >= 32. This patch also adds some static_asserts to these functions to ensure that we don't hit UB by shifting left too much. Reviewers: rnk Subscribers: llvm-commits, dylanmckay Differential Revision: https://reviews.llvm.org/D22441 llvm-svn: 275719	2016-07-17 18:19:21 +00:00
Justin Lebar	f2d0066af7	Use a faster implementation of maxUIntN. Summary: On x86-64 with clang 3.8, before: mov edx, 1 mov cl, dil shl rdx, cl cmp rdi, 64 mov rax, -1 cmovne rax, rdx ret after: mov ecx, 64 sub ecx, edi mov rax, -1 shr rax, cl ret Reviewers: rnk Subscribers: dylanmckay, mkuper, llvm-commits Differential Revision: https://reviews.llvm.org/D22440 llvm-svn: 275718	2016-07-17 18:19:19 +00:00
Teresa Johnson	cd21a646f6	[ThinLTO] Perform profile-guided indirect call promotion Summary: To enable profile-guided indirect call promotion in ThinLTO mode, we simply add call graph edges for each profitable target from the profile to the summaries, then the summary-guided importing will consider the callee for importing as usual. Also we need to enable the indirect call promotion pass creation in the PassManagerBuilder when PerformThinLTO=true (we are in the ThinLTO backend), so that the newly imported functions are considered for promotion in the backends. The IC promotion profiles refer to callees by GUID, which required adding GUIDs to the per-module VST in bitcode (and assigning them valueIds similar to how they are assigned valueIds in the combined index). Reviewers: mehdi_amini, xur Subscribers: mehdi_amini, davidxl, llvm-commits Differential Revision: http://reviews.llvm.org/D21932 llvm-svn: 275707	2016-07-17 14:47:01 +00:00
Dehao Chen	1a44452b11	[PM] Convert IVUsers analysis to new pass manager. Summary: Convert IVUsers analysis to new pass manager. Reviewers: davidxl, silvas Subscribers: junbuml, sanjoy, llvm-commits, mzolotukhin Differential Revision: https://reviews.llvm.org/D22434 llvm-svn: 275698	2016-07-16 22:51:33 +00:00
Eric Christopher	7a0b7dfd05	Reword comment to be more clear. llvm-svn: 275659	2016-07-16 01:55:45 +00:00
Richard Smith	3bfed96632	Fix modules buildbot after r275633. llvm-svn: 275657	2016-07-16 01:05:39 +00:00
Justin Lebar	8d56f47cfe	Don't do uint64_t(1) << 64 in maxUIntN. Summary: This shift is undefined behavior (and, as compiled by clang, gives the wrong answer for maxUIntN(64)). Reviewers: mkuper Subscribers: llvm-commits, jroelofs, rsmith Differential Revision: https://reviews.llvm.org/D22430 llvm-svn: 275656	2016-07-16 00:59:41 +00:00
Vedant Kumar	7a4bd83c6c	[Support] Fix a doxygen comment (NFC) There was a missing "<" on a line, so its contents wrapped around into the description of the next argument. llvm-svn: 275638	2016-07-15 22:44:52 +00:00
Alexei Starovoitov	cfb51f54ba	BPF: Use official ELF e_machine value The same value for EM_BPF is being propagated to glibc, elfutils, and binutils. Signed-off-by: Richard Henderson <rth@twiddle.net> Signed-off-by: Alexei Starovoitov <ast@kernel.org> llvm-svn: 275633	2016-07-15 22:27:55 +00:00
Zachary Turner	b927e02e1b	[pdb] Teach MsfBuilder and other classes about the Free Page Map. Block 1 and 2 of an MSF file are bit vectors that represent the list of blocks allocated and free in the file. We had been using these blocks to write stream data and other data, so we mark them as the free page map now. We don't yet serialize these pages to the disk, but at least we make a note of what it is, and avoid writing random data to them. Doing this also necessitated cleaning up some of the tests to be more general and hardcode fewer values, which is nice. llvm-svn: 275629	2016-07-15 22:17:19 +00:00
Zachary Turner	5e534c7fb3	[pdb] Round trip the NameMap data structure to YAML. llvm-svn: 275628	2016-07-15 22:17:08 +00:00
Zachary Turner	faa554b2fd	[pdb] Use MsfBuilder to handle the writing PDBs. Previously we would read a PDB, then write some of it back out, but write the directory, super block, and other pertinent metadata back out unchanged. This generates incorrect PDBs since the amount of data written was not always the same as the amount of data read. This patch changes things to use the newly introduced `MsfBuilder` class to write out a correct and accurate set of Msf metadata for the data actually written, which opens up the door for adding and removing type records, symbol records, and other types of data to an existing PDB. llvm-svn: 275627	2016-07-15 22:16:56 +00:00
Matt Arsenault	11d3e21f2b	AMDGPU: Remove AMDGPU.ldexp llvm-svn: 275618	2016-07-15 21:26:56 +00:00
Matt Arsenault	09b2c4aee8	AMDGPU: Remove legacy rsq.clamped intrinsic Mesa still has a use of llvm.AMDGPU.rsq.f64 remaining. Also fix mismatch with non-IEEE rsq selecting to IEEE rsq. llvm-svn: 275617	2016-07-15 21:26:52 +00:00
Michael Zolotukhin	a78937afb2	Make processInstruction from LCSSA.cpp externally available. Summary: When a pass tries to keep LCSSA form it's often convenient to be able to update LCSSA for a set of instructions rather than for the entire loop. This patch makes the processInstruction from LCSSA externally available under a name formLCSSAForInstruction. Reviewers: chandlerc, sanjoy, hfinkel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D22378 llvm-svn: 275613	2016-07-15 21:08:41 +00:00
Zachary Turner	f52a899f4a	[pdb] Introduce MsfBuilder for laying out PDB files. Reviewed by: ruiu Differential Revision: https://reviews.llvm.org/D22308 llvm-svn: 275611	2016-07-15 20:43:38 +00:00
George Burgess IV	6d30aa03a0	[CFLAA] Add an initial CFLAnders implementation. This adds an incomplete anders-style implementation for CFLAA. It's incomplete in that it's missing interprocedural analysis, attrs handling, etc. and that it needs more tests. More tests and features will be added in future commits. Patch by Jia Chen. Differential Revision: https://reviews.llvm.org/D22291 llvm-svn: 275602	2016-07-15 19:53:25 +00:00
Justin Lebar	9c375817ac	[SelectionDAG] Get rid of bool parameters in SelectionDAG::getLoad, getStore, and friends. Summary: Instead, we take a single flags arg (a bitset). Also add a default 0 alignment, and change the order of arguments so the alignment comes before the flags. This greatly simplifies many callsites, and fixes a bug in AMDGPUISelLowering, wherein the order of the args to getLoad was inverted. It also greatly simplifies the process of adding another flag to getLoad. Reviewers: chandlerc, tstellarAMD Subscribers: jholewinski, arsenm, jyknight, dsanders, nemanjai, llvm-commits Differential Revision: http://reviews.llvm.org/D22249 llvm-svn: 275592	2016-07-15 18:27:10 +00:00
Justin Lebar	0af80cd6f0	[CodeGen] Take a MachineMemOperand::Flags in MachineFunction::getMachineMemOperand. Summary: Previously we took an unsigned. Hooray for type-safety. Reviewers: chandlerc Subscribers: dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D22282 llvm-svn: 275591	2016-07-15 18:26:59 +00:00
Sanjay Patel	32f900730c	fix documentation comments; NFC llvm-svn: 275587	2016-07-15 18:03:59 +00:00
Adam Nemet	aad816083e	[OptRemark,LDist] RFC: Add hotness attribute Summary: This is the first set of changes implementing the RFC from http://thread.gmane.org/gmane.comp.compilers.llvm.devel/98334 This is a cross-sectional patch; rather than implementing the hotness attribute for all optimization remarks and all passes in a patch set, it implements it for the 'missed-optimization' remark for Loop Distribution. My goal is to shake out the design issues before scaling it up to other types and passes. Hotness is computed as an integer as the multiplication of the block frequency with the function entry count. It's only printed in opt currently since clang prints the diagnostic fields directly. E.g.: remark: /tmp/t.c:3:3: loop not distributed: use -Rpass-analysis=loop-distribute for more info (hotness: 300) A new API added is similar to emitOptimizationRemarkMissed. The difference is that it additionally takes a code region that the diagnostic corresponds to. From this, hotness is computed using BFI. The new API is exposed via an analysis pass so that it can be made dependent on LazyBFI. (Thanks to Hal for the analysis pass idea.) This feature can all be enabled by setDiagnosticHotnessRequested in the LLVM context. If this is off, LazyBFI is not calculated (D22141) so there should be no overhead. A new command-line option is added to turn this on in opt. My plan is to switch all user of emitOptimizationRemark* to use this module instead. Reviewers: hfinkel Subscribers: rcox2, mzolotukhin, llvm-commits Differential Revision: http://reviews.llvm.org/D21771 llvm-svn: 275583	2016-07-15 17:23:20 +00:00
David Majnemer	a940f360cb	[AliasAnalysis] Give back AA results for fence instructions Calling getModRefInfo with a fence resulted in crashes because fences don't have a memory location. Add a new predicate to Instruction called isFenceLike which indicates that the instruction mutates memory but not any single memory location in particular. In practice, it is a proxy for the set of instructions which "mayWriteToMemory" but cannot be used with MemoryLocation::get. This fixes PR28570. llvm-svn: 275581	2016-07-15 17:19:24 +00:00
Dehao Chen	dcafd5ebfd	[PM] Convert LoopInstSimplify Pass to new PM Summary: Convert LoopInstSimplify to new PM. Unfortunately there is no exisiting unittest for this pass. Reviewers: davidxl, silvas Subscribers: silvas, llvm-commits, mzolotukhin Differential Revision: https://reviews.llvm.org/D22280 llvm-svn: 275576	2016-07-15 16:42:11 +00:00
Jacques Pienaar	71c30a14b7	Rename AnalyzeBranch* to analyzeBranch*. Summary: NFC. Rename AnalyzeBranch/AnalyzeBranchPredicate to analyzeBranch/analyzeBranchPredicate to follow LLVM coding style and be consistent with TargetInstrInfo's analyzeCompare and analyzeSelect. Reviewers: tstellarAMD, mcrosier Subscribers: mcrosier, jholewinski, jfb, arsenm, dschuff, jyknight, dsanders, nemanjai Differential Revision: https://reviews.llvm.org/D22409 llvm-svn: 275564	2016-07-15 14:41:04 +00:00
Igor Laevsky	ee40d1e8da	Re-submit r272891 "Prevent dangling pointer problems in BranchProbabilityInfo" Most possibly problem was caused by the same reason as PR28400. This change bypasses it by using CallbackVH instead of AssertingVH. Differential Revision: https://reviews.llvm.org/D20957 llvm-svn: 275563	2016-07-15 14:31:16 +00:00
Sebastian Pop	4177480aad	code hoisting pass based on GVN This pass hoists duplicated computations in the program. The primary goal of gvn-hoist is to reduce the size of functions before inline heuristics to reduce the total cost of function inlining. Pass written by Sebastian Pop, Aditya Kumar, Xiaoyu Hu, and Brian Rzycki. Important algorithmic contributions by Daniel Berlin under the form of reviews. Differential Revision: http://reviews.llvm.org/D19338 llvm-svn: 275561	2016-07-15 13:45:20 +00:00
Vedant Kumar	f681e2e506	[Coverage] Mark a few more methods const (NFC) llvm-svn: 275514	2016-07-15 01:19:33 +00:00
Peter Collingbourne	5c73220fa5	Move legacy LTO interface headers to legacy/ directory. Differential Revision: https://reviews.llvm.org/D22173 llvm-svn: 275476	2016-07-14 21:21:16 +00:00
Lang Hames	69f4902ba6	[Object] Change Archive::findSym to return an Expected<Optional<Child>>. As suggested by Rafael in review of D22079 - this was accidentally left out of the final commit (r275316). llvm-svn: 275469	2016-07-14 20:44:27 +00:00
Mehdi Amini	40f993df25	Add recently added TargetOptions::EnableIPRA member to operator== llvm-svn: 275467	2016-07-14 20:22:13 +00:00
Jun Bum Lim	c837af306e	[PM] Port Dead Loop Deletion Pass to the new PM Summary: Port Dead Loop Deletion Pass to the new pass manager. Reviewers: silvas, davide Subscribers: llvm-commits, sanjoy, mcrosier Differential Revision: https://reviews.llvm.org/D21483 llvm-svn: 275453	2016-07-14 18:28:29 +00:00
Justin Lebar	288b3376ae	[CodeGen] Refactor MachineMemOperand::Flags's target-specific flags. Summary: Make the target-specific flags in MachineMemOperand::Flags real, bona fide enum values. This simplifies users, prevents various constants from going out of sync, and avoids the false sense of security provided by declaring static members in classes and then forgetting to define them inside of cpp files. Reviewers: MatzeB Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D22372 llvm-svn: 275451	2016-07-14 18:15:20 +00:00
Ahmed Bougacha	83dc5cb6a9	[GlobalISel] Fix G_OR opcode after the addition of a TargetOpcode. r275367 fixed G_ADD and G_BR, but not G_OR. llvm-svn: 275444	2016-07-14 17:29:49 +00:00
Ahmed Bougacha	9e511525a0	[CodeGen] Simplify reg bank/class union is+get into dyn_cast. NFC. llvm-svn: 275443	2016-07-14 17:29:46 +00:00
Justin Lebar	efbf9c8d5a	[CodeGen] s/constexpr/LLVM_CONSTEXPR/ in MachineMemOperand.h. llvm-svn: 275441	2016-07-14 17:16:40 +00:00
Justin Lebar	a3b786a8c1	[CodeGen] Refactor MachineMemOperand's Flags enum. Summary: - Give it a shorter name (because we're going to refer to it often from SelectionDAG and friends). - Split the flags and alignment into separate variables. - Specialize FlagsEnumTraits for it, so we can do bitwise ops on it without losing type information. - Make some enum values constants in MachineMemOperand instead. MOMaxBits should not be a valid Flag. - Simplify some of the bitwise ops for dealing with Flags. Reviewers: chandlerc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D22281 llvm-svn: 275438	2016-07-14 17:07:44 +00:00
Ahmed Bougacha	6b943e33e6	[TableGen] Autobrief-ize Record. NFC. llvm-svn: 275425	2016-07-14 14:53:14 +00:00
Ahmed Bougacha	e8405ad5d0	[TableGen] Cleanup Record comments. NFC. LLVM doesn't use exceptions anymore. Also remove the implementation comments. Some of them diverged. llvm-svn: 275424	2016-07-14 14:53:11 +00:00
Nico Weber	755cd760cd	Revert r275401, it caused PR28551. llvm-svn: 275420	2016-07-14 14:41:25 +00:00
Sebastian Pop	63847d04e7	code hoisting pass based on GVN This pass hoists duplicated computations in the program. The primary goal of gvn-hoist is to reduce the size of functions before inline heuristics to reduce the total cost of function inlining. Pass written by Sebastian Pop, Aditya Kumar, Xiaoyu Hu, and Brian Rzycki. Important algorithmic contributions by Daniel Berlin under the form of reviews. Differential Revision: http://reviews.llvm.org/D19338 llvm-svn: 275401	2016-07-14 12:18:53 +00:00
Sjoerd Meijer	38c2cd0c14	This implements a more optimal algorithm for selecting a base constant in constant hoisting. It not only takes into account the number of uses and the cost of expressions in which constants appear, but now also the resulting integer range of the offsets. Thus, the algorithm maximizes the number of uses within an integer range that will enable more efficient code generation. On ARM, for example, this will enable code size optimisations because less negative offsets will be created. Negative offsets/immediates are not supported by Thumb1 thus preventing more compact instruction encoding. Differential Revision: http://reviews.llvm.org/D21183 llvm-svn: 275382	2016-07-14 07:44:20 +00:00
Dean Michael Berris	52735fc435	XRay: Add entry and exit sleds Summary: In this patch we implement the following parts of XRay: - Supporting a function attribute named 'function-instrument' which currently only supports 'xray-always'. We should be able to use this attribute for other instrumentation approaches. - Supporting a function attribute named 'xray-instruction-threshold' used to determine whether a function is instrumented with a minimum number of instructions (IR instruction counts). - X86-specific nop sleds as described in the white paper. - A machine function pass that adds the different instrumentation marker instructions at a very late stage. - A way of identifying which return opcode is considered "normal" for each architecture. There are some caveats here: 1) We don't handle PATCHABLE_RET in platforms other than x86_64 yet -- this means if IR used PATCHABLE_RET directly instead of a normal ret, instruction lowering for that platform might do the wrong thing. We think this should be handled at instruction selection time to by default be unpacked for platforms where XRay is not availble yet. 2) The generated section for X86 is different from what is described from the white paper for the sole reason that LLVM allows us to do this neatly. We're taking the opportunity to deviate from the white paper from this perspective to allow us to get richer information from the runtime library. Reviewers: sanjoy, eugenis, kcc, pcc, echristo, rnk Subscribers: niravd, majnemer, atrick, rnk, emaste, bmakam, mcrosier, mehdi_amini, llvm-commits Differential Revision: http://reviews.llvm.org/D19904 llvm-svn: 275367	2016-07-14 04:06:33 +00:00
Lang Hames	fc209623e9	[Object] Re-apply r275316 now that I have the corresponding LLD patch ready. llvm-svn: 275361	2016-07-14 02:24:01 +00:00
Adrian Prantl	0418ef2691	Synchronize LLVM and clang's ObjCDeclSpec::ObjCPropertyAttributeKind. This adds Clang-specific DWARF constants for nullability and ObjC class properties that are already generated by clang. This patch adds dwarfdump support and a more comprehensive testcase. <rdar://problem/27335745> llvm-svn: 275354	2016-07-14 00:41:18 +00:00
Lang Hames	ae610ab528	[Object] Revert r275316, Archive::child_iterator changes, while I update lld. Should fix the bots broken by r275316. llvm-svn: 275353	2016-07-14 00:37:04 +00:00
Justin Lebar	d5bbd856e2	Force a semicolon at the end of the LLVM_ENABLE_BITMASK_ENUMS_IN_NAMESPACE() macro. This silences a warning about an extra semicolon on gcc. llvm-svn: 275349	2016-07-13 23:52:19 +00:00
Mehdi Amini	cfed2564f7	Add EnableIPRA to TargetOptions, and move the cl::opt -enable-ipra to TargetMachine.cpp Avoid exposing a cl::opt in a public header and instead promote this option in the API. Alternatively, we could land the cl::opt in CommandFlags.h so that it is available to every tool, but we would still have to find an option for clang. llvm-svn: 275348	2016-07-13 23:39:46 +00:00
Mehdi Amini	4beea66232	[IPRA] Set callee saved registers to none for local function when IPRA is enabled. IPRA try to optimize caller saved register by propagating register usage information from callee to caller so it is beneficial to have caller saved registers compare to callee saved registers when IPRA is enabled. Please find more detailed explanation here https://groups.google.com/d/msg/llvm-dev/XRzGhJ9wtZg/tjAJqb0eEgAJ. This change makes local function do not have any callee preserved register when IPRA is enabled. A simple test case is also added to verify this change. Patch by Vivek Pandya <vivekvpandya@gmail.com> Differential Revision: http://reviews.llvm.org/D21561 llvm-svn: 275347	2016-07-13 23:39:34 +00:00
Vedant Kumar	ef345e1d3f	[Coverage] Return an ArrayRef to avoid copies (NFC) llvm-svn: 275338	2016-07-13 23:12:26 +00:00
Vedant Kumar	7fcc5472e2	[Coverage] Mark a few methods const (NFC) llvm-svn: 275337	2016-07-13 23:12:23 +00:00
Adam Nemet	7da74abf3d	[LAA] Don't hold on to DominatorTree in the analysis result llvm-svn: 275335	2016-07-13 22:36:35 +00:00
Adam Nemet	b49d9a56eb	[LAA] Don't hold on to TargetLibraryInfo in the analysis result llvm-svn: 275334	2016-07-13 22:36:27 +00:00
Matthias Braun	c4ab36abeb	MIRYamlMapping: Update stale comment llvm-svn: 275328	2016-07-13 22:23:19 +00:00
Adam Nemet	1824e411c6	[LAA] Don't hold on to DataLayout in the analysis result In fact, don't even pass this to the ctor since we can get it from the module. llvm-svn: 275326	2016-07-13 22:18:51 +00:00
Adam Nemet	6616ad08f6	[LAA] Don't hold on to LoopInfo in the analysis result llvm-svn: 275325	2016-07-13 22:18:48 +00:00
Adam Nemet	1556357677	[LAA] Don't hold on to AliasAnalysis in the analysis result llvm-svn: 275322	2016-07-13 21:39:09 +00:00
Teresa Johnson	6df48b34bf	Mark the textual headers in the module map for ProfileData Follow on to r275312. llvm-svn: 275319	2016-07-13 21:27:51 +00:00
Lang Hames	c2773e97d2	[Object] Change Archive::child_iterator for better interop with Error/Expected. See http://reviews.llvm.org/D22079 Changes the Archive::child_begin and Archive::children to require a reference to an Error. If iterator increment fails (because the archive header is damaged) the iterator will be set to 'end()', and the error stored in the given Error&. The Error value should be checked by the user immediately after the loop. E.g.: Error Err; for (auto &C : A->children(Err)) { // Do something with archive child C. } // Check the error immediately after the loop. if (Err) return Err; Failure to check the Error will result in an abort() when the Error goes out of scope (as guaranteed by the Error class). llvm-svn: 275316	2016-07-13 21:13:05 +00:00
Teresa Johnson	56a76961aa	Define a module map entry for ProfileData. As per Richard Smith, this should help avoid a modules bug exposed by my r275216 commit: http://lab.llvm.org:8011/builders/clang-x86_64-linux-selfhost-modules/builds/17560 llvm-svn: 275312	2016-07-13 20:19:09 +00:00
Andrew Kaylor	346dd7f1bd	Reverting r275284 due to platform-specific test failures llvm-svn: 275304	2016-07-13 19:09:16 +00:00
Justin Lebar	81edbbe259	[ADT] Add LLVM_MARK_AS_BITMASK_ENUM, used to enable bitwise operations on enums without static_cast. Summary: Normally when you do a bitwise operation on an enum value, you get back an instance of the underlying type (e.g. int). But using this macro, bitwise ops on your enum will return you back instances of the enum. This is particularly useful for enums which represent a combination of flags. Suppose you have a function which takes an int and a set of flags. One way to do this would be to take two numeric params: enum SomeFlags { F1 = 1, F2 = 2, F3 = 4, ... }; void Fn(int Num, int Flags); void foo() { Fn(42, F2 \| F3); } But now if you get the order of arguments wrong, you won't get an error. You might try to fix this by changing the signature of Fn so it accepts a SomeFlags arg: enum SomeFlags { F1 = 1, F2 = 2, F3 = 4, ... }; void Fn(int Num, SomeFlags Flags); void foo() { Fn(42, static_cast<SomeFlags>(F2 \| F3)); } But now we need a static cast after doing "F2 \| F3" because the result of that computation is the enum's underlying type. This patch adds a mechanism which gives us the safety of the second approach with the brevity of the first. enum SomeFlags { F1 = 1, F2 = 2, F3 = 4, ..., F_MAX = 128, LLVM_MARK_AS_BITMASK_ENUM(F_MAX) }; void Fn(int Num, SomeFlags Flags); void foo() { Fn(42, F2 \| F3); // No static_cast. } The LLVM_MARK_AS_BITMASK_ENUM macro enables overloads for bitwise operators on SomeFlags. Critically, these operators return the enum type, not its underlying type, so you don't need any static_casts. An advantage of this solution over the previously-proposed BitMask class [0, 1] is that we don't need any wrapper classes -- we can operate directly on the enum itself. The approach here is somewhat similar to OpenOffice's typed_flags_set [2]. But we skirt the need for a wrapper class (and a good deal of complexity) by judicious use of enable_if. We SFINAE on the presence of a particular enumerator (added by the LLVM_MARK_AS_BITMASK_ENUM macro) instead of using a traits class so that it's impossible to use the enum before the overloads are present. The solution here also seamlessly works across multiple namespaces. [0] http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20150622/283369.html [1] http://lists.llvm.org/pipermail/llvm-commits/attachments/20150623/073434b6/attachment.obj [2] https://cgit.freedesktop.org/libreoffice/core/tree/include/o3tl/typed_flags_set.hxx Reviewers: chandlerc, rsmith Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D22279 llvm-svn: 275292	2016-07-13 18:23:16 +00:00
Andrew Kaylor	12cccdd731	Fix for Bug 26903, adds support to inline __builtin_mempcpy Patch by Sunita Marathe Differential Revision: http://reviews.llvm.org/D21920 llvm-svn: 275284	2016-07-13 17:25:11 +00:00
Adam Nemet	c2f791d8a7	[BFI] Add new LazyBFI analysis pass Summary: This is necessary for D21771. In order to add the hotness attribute to optimization remarks we need BFI to be available in all passes that emit optimization remarks. However we don't want to pay for computing BFI unless the hotness attribute is requested. This is achieved by making BFI lazy at the very high-level through a new analysis pass -- BFI is not calculated unless requested. I am adding a test to check the laziness under D21771 where the first user of the analysis is added. Reviewers: hfinkel, dexonsmith, davidxl Subscribers: davidxl, dexonsmith, llvm-commits Differential Revision: http://reviews.llvm.org/D22141 llvm-svn: 275250	2016-07-13 05:01:48 +00:00
David Majnemer	17bdf445e4	[IR] Make getIndexedOffsetInType return a signed result A GEPed offset can go negative, the result of getIndexedOffsetInType should according be a signed type. llvm-svn: 275246	2016-07-13 03:42:38 +00:00
Dehao Chen	9cba1f4e7e	New pass manager for LICM. Summary: Port LICM to the new pass manager. Reviewers: davidxl, silvas Subscribers: krasin, vitalybuka, silvas, davide, sanjoy, llvm-commits, mehdi_amini Differential Revision: http://reviews.llvm.org/D21772 llvm-svn: 275222	2016-07-12 22:37:48 +00:00
Teresa Johnson	1e44b5d3ab	Refactor indirect call promotion profitability analysis (NFC) Summary: Refactored the profitability analysis out of the IC promotion pass and into lib/Analysis so that it can be accessed by the summary index builder in a follow-on patch to enable IC promotion in ThinLTO (D21932). Reviewers: davidxl, xur Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D22182 llvm-svn: 275216	2016-07-12 21:13:44 +00:00
Dehao Chen	b9f8e29290	[PM] Port LoopIdiomRecognize Pass to new PM Summary: Port LoopIdiomRecognize Pass to new PM Reviewers: davidxl Subscribers: davide, sanjoy, mzolotukhin, llvm-commits Differential Revision: http://reviews.llvm.org/D22250 llvm-svn: 275202	2016-07-12 18:45:51 +00:00
Wei Ding	5b2636a152	AMDGPU: Add LLVM IR Intrinsic for v_lerp_u8 Differential Revision: http://reviews.llvm.org/D22239 llvm-svn: 275197	2016-07-12 18:02:14 +00:00
Krzysztof Parzyszek	f5b9bb61f7	Add print/dump routines to LiveInterval::SubRange llvm-svn: 275194	2016-07-12 17:37:44 +00:00
Vitaly Buka	204dc533c5	Revert "New pass manager for LICM." Summary: This reverts commit r275118. Subscribers: sanjoy, mehdi_amini Differential Revision: http://reviews.llvm.org/D22259 llvm-svn: 275156	2016-07-12 06:25:32 +00:00
Craig Topper	a6e6febe2c	[AVX512] Remove masked logic op intrinsics and autoupgrade them to native IR. llvm-svn: 275155	2016-07-12 05:27:53 +00:00
Rui Ueyama	ef5ec2da4a	Re-enable TPI hash verification for enum records. We didn't read unique names correctly. As a result, we computed hashes on (non-)unique names instead of unique names. llvm-svn: 275150	2016-07-12 03:25:03 +00:00
Mehdi Amini	0da268d13a	Do not use bool in C header lto.h, use lto_bool_t instead llvm-svn: 275130	2016-07-11 23:55:01 +00:00
Mehdi Amini	e75aa6f674	Add a libLTO API to query a memory buffer and check if it contains ObjC categories The linker supports a feature to force load an object from a static archive if it defines an Objective-C category. This API supports this feature by looking at every section in the module to find if a category is defined in the module. llvm-svn: 275125	2016-07-11 23:10:18 +00:00
Dehao Chen	7ef5820fa3	New pass manager for LICM. Summary: Port LICM to the new pass manager. Reviewers: davidxl, silvas Subscribers: silvas, davide, sanjoy, llvm-commits, mehdi_amini Differential Revision: http://reviews.llvm.org/D21772 llvm-svn: 275118	2016-07-11 22:45:24 +00:00
Zachary Turner	dbeaea7b35	Refactor the PDB writing to use a builder approach llvm-svn: 275110	2016-07-11 21:45:26 +00:00
Sanjay Patel	bb7d87ee25	fix documentation comments; NFC llvm-svn: 275101	2016-07-11 20:50:39 +00:00
Alina Sbirlea	327955e057	Add TLI.allowsMisalignedMemoryAccesses to LoadStoreVectorizer Summary: Extend TTI to access TLI.allowsMisalignedMemoryAccesses(). Check condition when vectorizing load and store chains. Add additional parameters: AddressSpace, Alignment, Fast. Reviewers: llvm-commits, jlebar Subscribers: arsenm, mzolotukhin Differential Revision: http://reviews.llvm.org/D21935 llvm-svn: 275100	2016-07-11 20:46:17 +00:00
Chad Rosier	4f0dad1674	[IPRA] Properly compute register usage at call sites. Differential Revision: http://reviews.llvm.org/D21395 Patch by Vivek Pandya. PR28144 llvm-svn: 275087	2016-07-11 18:45:49 +00:00
Davide Italiano	e8ae0b5eb4	[PM/IPO] Port LowerTypeTests to the new PassManager. There's a little bit of churn in this patch because the initialization mechanism is now shared between the old and the new PM. Other than that, it's just a pretty mechanical translation. llvm-svn: 275082	2016-07-11 18:10:06 +00:00
Dehao Chen	9232f98279	Implement callsite-hotness based inline cost for Sample-based PGO Summary: For sample-based PGO, using BFI to calculate callsite count is sometime not accurate. This is because with sampling based approach, if a callsite resides in a hot loop deeply nested in a bunch of cold branches, the callsite's BFI frequency would be inaccurately calculated due to lack of samples in the cold branch. E.g. if (A1 && A2 && A3 && ..... && A10) { for (i=0; i < 100000000; i++) { callsite(); } } Assume that A1 to A100 are all 100% taken, and callsite has 1000 samples and thus is considerred hot. Because the loop's trip count is huge, it's normal that all branches outside the loop has no sample at all. As a result, we can only use static branch probability to derive the the frequency of the loop header. Assuming that static heuristic thinks each branch is 50% taken, then the count calculated from BFI will be 1/(2^10) of the actual value. In order to get more accurate callsite count, we directly annotate the weight on the call instruction, and directly use it when checking callsite hotness. Note that this mechanism can also be shared by instrumentation based callsite hotness analysis. The side benefit is that it breaks the dependency from Inliner to BFI as call count is embedded in the IR. Reviewers: davidxl, eraman, dnovillo Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D22118 llvm-svn: 275073	2016-07-11 16:48:54 +00:00
Nirav Dave	8603062ee4	Fix branch relaxation in 16-bit mode. Thread through MCSubtargetInfo to relaxInstruction function allowing relaxation to generate jumps with 16-bit sized immediates in 16-bit mode. This fixes PR22097. Reviewers: dwmw2, tstellarAMD, craig.topper, jyknight Subscribers: jfb, arsenm, jyknight, llvm-commits, dsanders Differential Revision: http://reviews.llvm.org/D20830 llvm-svn: 275068	2016-07-11 14:23:53 +00:00
Nirav Dave	53a72f4d3c	Provide support for preserving assembly comments Preserve assembly comments from input in output assembly and flags to toggle property. This is on by default for inline assembly and off in llvm-mc. Parsed comments are emitted immediately before an EOL which generally places them on the expected line. Reviewers: rtrieu, dwmw2, rnk, majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D20020 llvm-svn: 275058	2016-07-11 12:42:14 +00:00
Daniel Berlin	e64985fc94	Allow BasicBlockEdge to be used in DenseMap Summary: Add a DenseMapInfo specialization for BasicBlockEdge Reviewers: hfinkel, chandlerc, majnemer Differential Revision: http://reviews.llvm.org/D22207 llvm-svn: 275041	2016-07-11 04:37:53 +00:00
Hal Finkel	47646c0981	Add a 'Returned' intrinsic property corresponding to the 'returned' argument attribute This will be used by the upcoming llvm.noalias intrinsic. Differential Revision: http://reviews.llvm.org/D22201 llvm-svn: 275034	2016-07-11 01:28:42 +00:00
Hal Finkel	e87ad547ef	Add getReturnedArgOperand to Call/InvokeInst, CallSite In order to make the optimizer smarter about using the 'returned' argument attribute (generally, but motivated by my llvm.noalias intrinsic work), add a utility function to Call/InvokeInst, and CallSite, to make it easy to get the returned call argument (when one exists). P.S. There is already an unfortunate amount of code duplication between CallInst and InvokeInst, and this adds to it. We should probably clean that up separately. Differential Revision: http://reviews.llvm.org/D22204 llvm-svn: 275031	2016-07-10 23:01:32 +00:00
Sanjay Patel	fedc01ad76	[DAG] make isConstantSplatVector() available to the rest of lowering llvm-svn: 275025	2016-07-10 21:27:06 +00:00
Jan Vesely	2fa28c330c	AMDGPU/R600: Add implicitarg.ptr intrinsic Differential Revision: http://reviews.llvm.org/D21622 llvm-svn: 275024	2016-07-10 21:20:29 +00:00
Sanjay Patel	9bedcdb5f5	fix documentation comments; NFC llvm-svn: 275021	2016-07-10 21:02:16 +00:00
Marcin Koscielnicki	cf7cc724a7	[SystemZ] Utilize Test Data Class instructions. This adds a new SystemZ-specific intrinsic, llvm.s390.tdc.f(32\|64\|128), which maps straight to the test data class instructions. A new IR pass is added to recognize instructions that can be converted to TDC and perform the necessary replacements. Differential Revision: http://reviews.llvm.org/D21949 llvm-svn: 275016	2016-07-10 14:41:22 +00:00
Benjamin Kramer	da5b4cc339	[codeview] Drop unused private inheritance. There is no polymorphism here, and StreamRef already contains a StreamInterface pointer. Dropping the base class makes StreamRef more transparent to the compiler, for example it can find unused variables. llvm-svn: 275013	2016-07-10 10:17:36 +00:00
David Majnemer	1b79e9a5b9	[pdb] Sanity check the stream map Some abstractions in LLVM "know" that they are reading in-bounds, FixedStreamArray, and provide a simple result. This breaks down if the stream map is bogus. llvm-svn: 275010	2016-07-10 05:32:05 +00:00
David Majnemer	6211b1f1f9	[llvm-pdbdump] Propagate errors a little more consistently PDBFile::getBlockData didn't really return any indication that it failed. It merely returned an empty buffer. llvm-svn: 275009	2016-07-10 03:34:47 +00:00
Sean Silva	db90d4d9c1	[PM] Port LoopVectorize to the new PM. llvm-svn: 275000	2016-07-09 22:56:50 +00:00
Sean Silva	bf71035438	Fix up an include guard. This should have been done as part of the move in r274960. llvm-svn: 274999	2016-07-09 22:56:39 +00:00
Sanjay Patel	6170b4bebd	fix documentation comments; NFC llvm-svn: 274981	2016-07-09 18:52:07 +00:00
Craig Topper	45a59a08bc	[X86] Remove sse41 extract intrinsics. They are not used by clang and are not implemented by the x86 backend. llvm-svn: 274967	2016-07-09 04:38:30 +00:00
Craig Topper	70610cf7b6	[X86] Remove and autoupgrade 512-bit non-temporal store intrinsics. llvm-svn: 274966	2016-07-09 04:38:27 +00:00
Davide Italiano	92b933a55c	[PM] Port CrossDSOCFI to the new pass manager. llvm-svn: 274962	2016-07-09 03:25:35 +00:00
Sean Silva	0dacbd8f31	[PM] Fix a think-o. mv {Scalar,Vectorize}/SLPVectorize.h llvm-svn: 274960	2016-07-09 03:11:29 +00:00
Davide Italiano	cd96cfd8df	[PM] Port LoopSimplify to the new pass manager. While here move simplifyLoop() function to the new header, as suggested by Chandler in the review. Differential Revision: http://reviews.llvm.org/D21404 llvm-svn: 274959	2016-07-09 03:03:01 +00:00
George Burgess IV	c294d0dcc2	[CFLAA] Move the graph builder out from CFLSteens. NFC. Patch by Jia Chen. Differential Revision: http://reviews.llvm.org/D22022 llvm-svn: 274958	2016-07-09 02:54:42 +00:00
Anna Thomas	9ad45adfd7	Revert "InstCombine rule to fold truncs whose value is available" This reverts commit r274853. Caused failure in ppcBE build llvm-svn: 274943	2016-07-08 22:15:08 +00:00
Jingyue Wu	15f3e82d42	[TTI] Expose TTI::getGEPCost and use it in SLSR and NaryReassociate. NFC. llvm-svn: 274940	2016-07-08 21:48:05 +00:00
Adam Nemet	0a24048bb7	[BFI] Minor cleanup. NFC Use typedef Result in BlockFrequencyAnalysis::run. Fix typo in comment. llvm-svn: 274936	2016-07-08 21:24:13 +00:00
Xinliang David Li	07e08fa36b	[PM] name the new PM LAA class LoopAccessAnalysis (LAA) /NFC llvm-svn: 274934	2016-07-08 21:21:44 +00:00
Wei Mi	c022370767	Allow dead insts to be kept in DeadRemat only when they are rematerializable. Because isReallyTriviallyReMaterializableGeneric puts many limits on rematerializable instructions, this fix can prevent instructions with tied virtual operands and instructions with virtual register uses from being kept in DeadRemat, so as to workaround the live interval consistency problem for the dummy instructions kept in DeadRemat. But we still need to fix the live interval consistency problem. This patch is just a short time relieve. PR28464 has been filed as a reminder. Differential Revision: http://reviews.llvm.org/D19486 llvm-svn: 274928	2016-07-08 21:08:09 +00:00
Xinliang David Li	7853c1dd73	Rename LoopAccessAnalysis to LoopAccessLegacyAnalysis /NFC llvm-svn: 274927	2016-07-08 20:55:26 +00:00
Justin Bogner	068a8054ae	IR: Set a TargetPrefix for nvvm intrinsics Since these are named nvvm_* rather than nvptx_*, we also need to update getArchTypePrefix. It's a bit unusual for getArchTypePrefix not to match the backend name, but I think this fits the intent of the function in this case. llvm-svn: 274890	2016-07-08 17:25:18 +00:00
Anna Thomas	3124f6273a	InstCombine rule to fold truncs whose value is available We can fold truncs whose operand feeds from a load, if the trunc value is available through a prior load/store. This change is from: http://reviews.llvm.org/D21246, which folded the trunc but missed the bitcast or ptrtoint/inttoptr required in the RAUW call, when the load type didnt match the prior load/store type. Differential Revision: http://reviews.llvm.org/D21791 llvm-svn: 274853	2016-07-08 15:18:56 +00:00
Vassil Vassilev	bf70516503	[modules] Add missing includes. Patch by Cristina Cristescu! Reviewed by Adrian Prantl (D21985) llvm-svn: 274838	2016-07-08 12:00:08 +00:00
Craig Topper	f7bf6de0af	[AVX512] Remove and autoupgrade a duplicate set of 512-bit masked shift intrinsics. I'm not sure if clang ever used these builtin names or not. llvm-svn: 274827	2016-07-08 06:14:47 +00:00
Craig Topper	4f826b7ae5	[X86] Remove intrinsics that already have autoupgrade support. llvm-svn: 274826	2016-07-08 06:14:41 +00:00
Wei Mi	90d195a5fd	[PM] Port UnreachableBlockElim to the new Pass Manager Differential Revision: http://reviews.llvm.org/D22124 llvm-svn: 274824	2016-07-08 03:32:49 +00:00
Davide Italiano	16284df8ec	[PM] Port InstSimplify to the new pass manager. llvm-svn: 274796	2016-07-07 21:14:36 +00:00
Rui Ueyama	52a1dd76cb	Add a reference for Elf_Chdr type. llvm-svn: 274793	2016-07-07 20:19:19 +00:00
Tim Northover	917e744ea6	GlobalISel: remove redundant property setting. NFC. AsmString is empty by default. llvm-svn: 274789	2016-07-07 19:45:45 +00:00
Peter Collingbourne	73589f321b	ThinLTO: Do not take into account whether a definition has multiple copies when promoting. We currently do not touch a symbol's linkage in the case where a definition has a single copy. However, this code is effectively unnecessary: either the definition is not exported, in which case the internalize phase sets its linkage to internal, or it is exported, in which case we need to promote linkage to weak. Those two cases are already handled by existing code. I believe that the only real functional change here is in the case where we have a single definition which does not prevail (e.g. because the definition in a native object file prevails). In that case we now lower linkage to available_externally following the existing code path for that case. As a result we can remove the isExported function parameter from the thinLTOResolveWeakForLinkerInIndex function. Differential Revision: http://reviews.llvm.org/D21883 llvm-svn: 274784	2016-07-07 18:31:51 +00:00
Justin Lebar	e5c910f8ae	[NVVM] Rename __nvvm_bar0 builtin back to __syncthreads. __syncthreads was renamed to __nvvm_bar0 in r274664. But __syncthreads is part of our user-facing API, so we need to keep the name. This will momentarily break clang; we need a matching patch there. Patch by Justin Bogner. llvm-svn: 274779	2016-07-07 18:14:55 +00:00
Justin Bogner	a466cc33fa	NVPTX: Remove the legacy ptx intrinsics - Rename the ptx.read.* intrinsics to nvvm.read.ptx.sreg.* - some but not all of these registers were already accessible via the nvvm name. - Rename ptx.bar.sync nvvm.bar.sync, to match nvvm.bar0. There's a fair amount of code motion here, but it's all very mechanical. llvm-svn: 274769	2016-07-07 16:40:17 +00:00
Chandler Carruth	168800c97d	[LCG] Hoist the definitions of the stream operator friends to be inline friend definitions. Based on the experiments Sean Silva and Reid did, this seems the safest course of action and also will work around a questionable warning provided by GCC6 on the old form of the code. Thanks for Davide pointing out the issue and other suggesting ways to fix. llvm-svn: 274740	2016-07-07 07:52:07 +00:00
David Majnemer	7afb46d3c8	[LoopAccessAnalysis] Fix an integer overflow We were inappropriately using 32-bit types to account for quantities that can be far larger. Fixed in PR28443. llvm-svn: 274737	2016-07-07 06:24:36 +00:00
Rui Ueyama	830c078d8b	Define endianness-aware type for Elf_Chdr. llvm-svn: 274728	2016-07-07 03:53:00 +00:00
Sean Silva	59fe82f4ce	[PM] Port TailCallElim llvm-svn: 274708	2016-07-06 23:48:41 +00:00
Matt Arsenault	3e96a709f1	Fix missing member initializers This fixes the -Werror build with some combination of warning flags. llvm-svn: 274707	2016-07-06 23:30:54 +00:00
Sean Silva	b025d375a1	[PM] Port CorrelatedValuePropagation llvm-svn: 274705	2016-07-06 23:26:29 +00:00
Matthias Braun	332bb5c236	AArch64: Replace a RegScavenger instance with LivePhysRegs findScratchNonCalleeSaveRegister() just needs a simple liveness analysis, use LivePhysRegs for that as it is simpler and does not depend on the kill flags. This commit adds a convenience function available() to LivePhysRegs: This function returns true if the given register is not reserved and neither the register nor any of its aliases are alive. Differential Revision: http://reviews.llvm.org/D21865 llvm-svn: 274685	2016-07-06 21:31:27 +00:00
Chad Rosier	232e29ebea	[MemorySSA] Reinstate the legacy printer and verifier. Differential Revision: http://reviews.llvm.org/D22058 llvm-svn: 274679	2016-07-06 21:20:47 +00:00
Justin Bogner	a463537a36	NVPTX: Replace uses of cuda.syncthreads with nvvm.barrier0 Everywhere where cuda.syncthreads or __syncthreads is used, use the properly namespaced nvvm.barrier0 instead. llvm-svn: 274664	2016-07-06 20:02:45 +00:00
Justin Bogner	b3745b6d24	NVPTX: Make the llvm.nvvm.shfl intrinsics and builtin names consistent The intrinsics here use nvvm, but the builtins and tablegen variable names were using ptx. Stick to the modern names here. llvm-svn: 274662	2016-07-06 19:52:27 +00:00
Matthias Braun	f16acbd2f9	TailDuplicator: Remove live-in updating logic This logic was introduced in r157663 and does not make any sense to me. The motivating example in rdar://11538365 looks like this: This is the tail: BB#16: derived from LLVM BB %if.end68 Live Ins: %R0 %R4 %R5 Predecessors according to CFG: BB#15 BB#5 tBLXi pred:14, pred:%noreg, <ga:@CFRelease>, %R0<kill>, <regmask>, %LR<imp-def,dead>, %SP<imp-use>, %SP<imp-def> t2B <BB#20>, pred:14, pred:%noreg Successors according to CFG: BB#20 This is the predBB: BB#5: Live Ins: %R5 Predecessors according to CFG: BB#4 %R4<def> = t2MOVi 0, pred:14, pred:%noreg, opt:%noreg t2B <BB#16>, pred:14, pred:%noreg Successors according to CFG: BB#16 However this is invalid machine code to begin with, if %R0 is live-in to BB#16 then it must be live-in to BB#5 as well if BB#5 does not define it. We should not need logic to retroactively fix broken machine code and in fact the example from r157663 passes cleanly with the code removed and I do not see any (newly) failing tests with the machine verifier enabled. Differential Revision: http://reviews.llvm.org/D22031 llvm-svn: 274655	2016-07-06 18:55:10 +00:00
Zachary Turner	8848a7a6b2	[pdb] Round trip the PDB stream between YAML and binary PDB. This gets writing of the PDB stream working. llvm-svn: 274647	2016-07-06 18:05:57 +00:00
Michael Kuperstein	aa71bdd3af	[TTI] The cost model should not assume vector casts get completely scalarized The cost model should not assume vector casts get completely scalarized, since on targets that have vector support, the common case is a partial split up to the legal vector size. So, when a vector cast gets split, the resulting casts end up legal and cheap. Instead of pessimistically assuming scalarization, base TTI can use the costs the concrete TTI provides for the split vector, plus a fudge factor to account for the cost of the split itself. This fudge factor is currently 1 by default, except on AMDGPU where inserts and extracts are considered free. Differential Revision: http://reviews.llvm.org/D21251 llvm-svn: 274642	2016-07-06 17:30:56 +00:00
Zachary Turner	ca4f02ce53	Add a default parameter for getRegisteredOptions. llvm-svn: 274640	2016-07-06 17:25:16 +00:00
Reid Kleckner	dafc5d75ea	Prune RelocVisitor.h include to avoid including COFF.h from MCJIT.h This helps to mitigate the conflict between COFF.h and winnt.h, which is PR28399. llvm-svn: 274637	2016-07-06 16:56:42 +00:00
Craig Topper	226274ab60	[X86] Remove GCC builtin names from sse/avx packed fp cmp intrinsics so clang can special handle some of the immediate values. llvm-svn: 274607	2016-07-06 06:27:25 +00:00
Craig Topper	2839045e28	[AVX512] Remove GCC builtins from the vplzcntd/q intrinsics so we can emit native IR using the generic ctlz intrinsic in clang. llvm-svn: 274602	2016-07-06 04:24:24 +00:00
George Burgess IV	bfa401e5ad	[CFLAA] Split into Anders+Steens analysis. StratifiedSets (as implemented) is very fast, but its accuracy is also limited. If we take a more aggressive andersens-like approach, we can be way more accurate, but we'll also end up being slower. So, we've decided to split CFLAA into CFLSteensAA and CFLAndersAA. Long-term, we want to end up in a place where CFLSteens is queried first; if it can provide an answer, great (since queries are basically map lookups). Otherwise, we'll fall back to CFLAnders, BasicAA, etc. This patch splits everything out so we can try to do something like that when we get a reasonable CFLAnders implementation. Patch by Jia Chen. Differential Revision: http://reviews.llvm.org/D21910 llvm-svn: 274589	2016-07-06 00:26:41 +00:00
Tim Northover	e6ae6767d9	AArch64: TableGenerate system instruction operands. The way the named arguments for various system instructions are handled at the moment has a few problems: - Large-scale duplication between AArch64BaseInfo.h and AArch64BaseInfo.cpp - That weird Mapping class that I have no idea what I was on when I thought it was a good idea. - Searches are performed linearly through the entire list. - We print absolutely all registers in upper-case, even though some are canonically mixed case (SPSel for example). - The ARM ARM specifies sysregs in terms of 5 fields, but those are relegated to comments in our implementation, with a slightly opaque hex value indicating the canonical encoding LLVM will use. This adds a new TableGen backend to produce efficiently searchable tables, and switches AArch64 over to using that infrastructure. llvm-svn: 274576	2016-07-05 21:23:04 +00:00
Tim Northover	88403d7a84	TableGen: promote "code" type from syntactic sugar. It's being immediately converted to a "string", but being able to tell what type the field was originally can be useful in backends. llvm-svn: 274575	2016-07-05 21:22:55 +00:00
Simon Pilgrim	9769428e08	[X86][AVX512] Remove vector BROADCAST builtins. llvm-svn: 274555	2016-07-05 14:49:58 +00:00
Michael Zuckerman	bdc5f40dca	[LLVM][INTRINSICS] adding intrinsics of CLFLUSHOPT Differential Revision: http://reviews.llvm.org/D21789 llvm-svn: 274553	2016-07-05 14:42:12 +00:00
Lang Hames	2b1c093c43	[Support][Error] Make logAllUnhandledErrors take a Twine for the banner, rather than a const string&. llvm-svn: 274526	2016-07-04 22:47:53 +00:00
Craig Topper	5d16cd9d63	[AVX512] Remove masked VPERMD/VPERMQ/VPERMILPS/VPERMILPD intrinsics. They were autoupgraded to native IR in r274506 and r274506. llvm-svn: 274519	2016-07-04 19:58:38 +00:00
Nicolai Haehnle	84c9f9919a	Add writeonly IR attribute Summary: This complements the earlier addition of IntrWriteMem and IntrWriteArgMem LLVM intrinsic properties, see D18291. Also start using the attribute for memset, memcpy, and memmove intrinsics, and remove their special-casing in BasicAliasAnalysis. Reviewers: reames, joker.eph Subscribers: joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D18714 llvm-svn: 274485	2016-07-04 08:01:29 +00:00
Sean Silva	45835e731d	Remove dead TLI arg of isKnownNonNull and propagate deadness. NFC. This actually uncovered a surprisingly large chain of ultimately unused TLI args. From what I can gather, this argument is a remnant of when isKnownNonNull would look at the TLI directly. The current approach seems to be that InferFunctionAttrs runs early in the pipeline and uses TLI to annotate the TLI-dependent non-null information as return attributes. This also removes the dependence of functionattrs on TLI altogether. llvm-svn: 274455	2016-07-02 23:47:27 +00:00
Xinliang David Li	2ecff7dd5a	Fix wrong comment llvm-svn: 274453	2016-07-02 21:25:12 +00:00
Xinliang David Li	8a021317a2	[PM] Port LoopAccessInfo analysis to new PM It is implemented as a LoopAnalysis pass as discussed and agreed upon. llvm-svn: 274452	2016-07-02 21:18:40 +00:00
Simon Pilgrim	77dda7c2e0	[X86][AVX512] Converted the MOVDDUP/MOVSLDUP/MOVSHDUP masked intrinsics to generic IR llvm-svn: 274443	2016-07-02 17:16:41 +00:00
Pirama Arumuga Nainar	9c3aec2035	Add RenderScript ArchType Summary: Add renderscript32 and renderscript64 ArchTypes. This is to configure the ABI requirement on 32-bit RenderScript that 'long' types have 64-bit size and alignment. 64-bit RenderScript is the same as AArch64, but is added here for completeness. Reviewers: echristo, rsmith Subscribers: aemerson, jfb, rampitec, dschuff, mehdi_amini, llvm-commits, srhines Differential Revision: http://reviews.llvm.org/D21333 llvm-svn: 274412	2016-07-02 00:23:09 +00:00
Michael Kuperstein	071d8306b0	[PM] Port ConstantHoisting to the new Pass Manager Differential Revision: http://reviews.llvm.org/D21945 llvm-svn: 274411	2016-07-02 00:16:47 +00:00
Justin Bogner	0efaa349e4	IR: Set TargetPrefix for some X86 and AArch64 intrinsics where it was missing llvm-svn: 274390	2016-07-01 22:07:11 +00:00
Adrian Prantl	b340676439	Reapply "Define a module map entry for DebugInfo/CodeView." This reapplies r274313 with two additional #include directives needed when submodule visibility is enabled. Fixes PR28384. llvm-svn: 274358	2016-07-01 15:54:46 +00:00
Benjamin Kramer	b0b52fc4c6	function_refify. NFC. While there use emplace_back to create an expensive pair. llvm-svn: 274344	2016-07-01 11:05:15 +00:00
Craig Topper	2bd8b4b180	[CodeGen,Target] Remove the version of DAG.getVectorShuffle that takes a pointer to a mask array. Convert all callers to use the ArrayRef version. No functional change intended. For the most part this simplifies all callers. There were two places in X86 that needed an explicit makeArrayRef to shorten a statically sized array. llvm-svn: 274337	2016-07-01 06:54:47 +00:00
Eric Christopher	36e601c6dc	Add support for allowing us to create uniquely identified "COMDAT" or "ELF Group" sections while lowering. In particular, for ELF sections this is useful for creating function-specific groups that get merged into the same named section. Also use const Twine& instead of StringRef for the getELF functions while we're here. Differential Revision: http://reviews.llvm.org/D21743 llvm-svn: 274336	2016-07-01 06:07:38 +00:00
Xinliang David Li	94734eef33	[PM] refactor LoopAccessInfo code part-2 Differential Revision: http://reviews.llvm.org/D21636 llvm-svn: 274334	2016-07-01 05:59:55 +00:00
Adrian Prantl	257a676b8d	Revert "Define a module map entry for DebugInfo/CodeView." This reverts commit r274313. While this fixed the build on Darwin, it broke Linux with local submodule visibility. llvm-svn: 274328	2016-07-01 03:17:02 +00:00
Reid Kleckner	b5af11dfa3	[codeview] Add DISubprogram::ThisAdjustment Summary: This represents the adjustment applied to the implicit 'this' parameter in the prologue of a virtual method in the MS C++ ABI. The adjustment is always zero unless multiple inheritance is involved. This increases the size of DISubprogram by 8 bytes, unfortunately. The adjustment really is a signed 32-bit integer. If this size increase is too much, we could probably win it back by splitting out a subclass with info specific to virtual methods (virtuality, vindex, thisadjustment, containingType). Reviewers: aprantl, dexonsmith Subscribers: aaboud, amccarth, llvm-commits Differential Revision: http://reviews.llvm.org/D21614 llvm-svn: 274325	2016-07-01 02:41:21 +00:00
Matt Arsenault	370e8226c7	LoadStoreVectorizer: Check TTI for vec reg bit width llvm-svn: 274322	2016-07-01 02:07:22 +00:00
Duncan P. N. Exon Smith	9d1f156418	Revert "code hoisting pass based on GVN" This reverts commit r274305, since it breaks self-hosting: http://lab.llvm.org:8080/green/job/clang-stage1-configure-RA_build/22349/ http://lab.llvm.org:8011/builders/clang-x86_64-linux-selfhost-modules/builds/17232 Note that the blamelist on lab.llvm.org:8011 is incorrect. The previous build was r274299, but somehow r274305 wasn't included in the blamelist: http://lab.llvm.org:8011/builders/clang-x86_64-linux-selfhost-modules llvm-svn: 274320	2016-07-01 01:51:40 +00:00
Duncan P. N. Exon Smith	d26fdc83c9	CodeGen: Use MachineInstr& in LiveVariables API, NFC Change all the methods in LiveVariables that expect non-null MachineInstr* to take MachineInstr& and update the call sites. This clarifies the API, and designs away a class of iterator to pointer implicit conversions. llvm-svn: 274319	2016-07-01 01:51:32 +00:00
Adrian Prantl	8d62f7fc10	Define a module map entry for DebugInfo/CodeView. This fixes the -fmodules build. llvm-svn: 274313	2016-07-01 01:16:17 +00:00
Sebastian Pop	5c5798c57c	code hoisting pass based on GVN This pass hoists duplicated computations in the program. The primary goal of gvn-hoist is to reduce the size of functions before inline heuristics to reduce the total cost of function inlining. Pass written by Sebastian Pop, Aditya Kumar, Xiaoyu Hu, and Brian Rzycki. Important algorithmic contributions by Daniel Berlin under the form of reviews. Differential Revision: http://reviews.llvm.org/D19338 llvm-svn: 274305	2016-07-01 00:24:31 +00:00
Duncan P. N. Exon Smith	632987296f	Target: Remove unused arguments from overrideSchedPolicy, NFC TargetSubtargetInfo::overrideSchedPolicy takes two MachineInstr* arguments (begin and end) that invite implicit conversions from MachineInstrBundleIterator. One option would be to change their type to an iterator, but since they don't seem to have been used since the API was added in 2010, I'm deleting the dead code. llvm-svn: 274304	2016-07-01 00:23:27 +00:00
Matt Arsenault	08debb0244	Add LoadStoreVectorizer pass This was contributed by Apple, and I've been working on minimal cleanups and generalizing it. llvm-svn: 274293	2016-06-30 23:11:38 +00:00
Reid Kleckner	69ae6848e9	[TableGen] Use a SmallVector for Record::Values to avoid debug iterators Debug iterators are valuable so we don't want to turn them off completely. However, llvm-tblgen is critical to build speed, so we can skip them here. Regenerating X86GenSubtargetInfo.inc in a clang-cl self-host debug build now takes 39s instead of 1m29s. Helps PR28222 llvm-svn: 274288	2016-06-30 23:04:07 +00:00
Duncan P. N. Exon Smith	e4f5e4f4d1	CodeGen: Use MachineInstr& in TargetLowering, NFC This is a mechanical change to make TargetLowering API take MachineInstr& (instead of MachineInstr), since the argument is expected to be a valid MachineInstr. In one case, changed a parameter from MachineInstr to MachineBasicBlock::iterator, since it was used as an insertion point. As a side effect, this removes a bunch of MachineInstr* to MachineBasicBlock::iterator implicit conversions, a necessary step toward fixing PR26753. llvm-svn: 274287	2016-06-30 22:52:52 +00:00
Matt Arsenault	727e279ac4	SLPVectorizer: Move propagateMetadata to VectorUtils This will be re-used by the LoadStoreVectorizer. Fix handling of range metadata and testcase by Justin Lebar. llvm-svn: 274281	2016-06-30 21:17:59 +00:00
Justin Bogner	fbac64d678	CodeGen: Add the other BuildMI overload for MachineInstr& The change in r274193 missed this variant. llvm-svn: 274259	2016-06-30 18:32:12 +00:00
Rafael Espindola	d86e8bb0ed	Delete MCCodeGenInfo. MC doesn't really care about CodeGen stuff, so this was just complicating target initialization. llvm-svn: 274258	2016-06-30 18:25:11 +00:00
Lang Hames	759b30edfc	[Support] Fix a bug in ErrorList::join / joinErrors. When concatenating two error lists the ErrorList::join method (which is called by joinErrors) was failing to set the checked bit on the second error, leading to a 'failure to check error' assertion. llvm-svn: 274249	2016-06-30 17:43:06 +00:00
Zachary Turner	ab58ae8730	[pdb] Re-add code to write PDB files. Somehow all the functionality to write PDB files got removed, probably accidentally when uploading the patch perhaps the wrong one got uploaded. This re-adds all the code, as well as the corresponding test. llvm-svn: 274248	2016-06-30 17:43:00 +00:00
David Majnemer	9319cbc045	[CodeView] Implement support for bitfields in LLVM CodeView need to know the offset of the storage allocation for a bitfield. Encode this via the "extraData" field in DIDerivedType and introduced a new flag, DIFlagBitField, to indicate whether or not a member is a bitfield. This fixes PR28162. Differential Revision: http://reviews.llvm.org/D21782 llvm-svn: 274200	2016-06-30 03:00:20 +00:00
Chandler Carruth	758032726d	[ADT] Add a new data structure for managing a priority worklist where re-insertion of entries into the worklist moves them to the end. This is fairly similar to a SetVector, but helps in the case where in addition to not inserting duplicates you want to adjust the sequence of a pop-off-the-back worklist. I'm not at all attached to the name of this data structure if others have better suggestions, but this is one that David Majnemer brought up in IRC discussions that seems plausible. I've trimmed the interface down somewhat from SetVector's interface because several things make less sense here IMO: iteration primarily. I'd prefer to add these back as we have users that need them. My use case doesn't even need all of what is provided here. =] I've also included a basic unittest to make sure this functions reasonably. Differential Revision: http://reviews.llvm.org/D21866 llvm-svn: 274198	2016-06-30 02:32:20 +00:00
George Burgess IV	d86e38e1db	[CFLAA] Add support for ModRef queries. This patch makes CFLAA answer some ModRef queries. Because we don't distinguish between reading/writing when making StratifiedSets, we're unable to offer any of the readonly-related answers. Patch by Jia Chen. Differential Revision: http://reviews.llvm.org/D21858 llvm-svn: 274197	2016-06-30 02:11:26 +00:00
Matthias Braun	f7493393fc	RegisterScavenging: Code cleanup; NFC - Use range based for loops - No need for some !Reg checks: isPhysicalRegister() reports false for NoRegister anyway - Do not repeat function name in documentation comment. - Do not repeat documentation comment in implementation when we already have one at the declaration. - Factor some common subexpressions out. - Change file comments to use doxygen syntax. llvm-svn: 274194	2016-06-30 00:23:54 +00:00
Duncan P. N. Exon Smith	1bc348a97b	CodeGen: Add an explicit BuildMI overload for MachineInstr& Add an explicit overload to BuildMI for MachineInstr& to deal with insertions inside of instruction bundles. - Use it to re-implement MachineInstr* to give it coverage. - Document how the overload for MachineBasicBlock::instr_iterator differs from that for MachineBasicBlock::iterator (the previous (implicit) overload for MachineInstr&). - Add a comment explaining why the MachineInstr& and MachineInstr* overloads don't universally forward to the MachineBasicBlock::instr_iterator overload. Thanks to Justin for noticing the API quirk. While this doesn't fix any known bugs -- all uses of BuildMI with a MachineInstr& were previously using MachineBasicBlock::iterator -- it protects against future bugs. llvm-svn: 274193	2016-06-30 00:10:17 +00:00
Duncan P. N. Exon Smith	9cfc75c214	CodeGen: Use MachineInstr& in TargetInstrInfo, NFC This is mostly a mechanical change to make TargetInstrInfo API take MachineInstr& (instead of MachineInstr* or MachineBasicBlock::iterator) when the argument is expected to be a valid MachineInstr. This is a general API improvement. Although it would be possible to do this one function at a time, that would demand a quadratic amount of churn since many of these functions call each other. Instead I've done everything as a block and just updated what was necessary. This is mostly mechanical fixes: adding and removing `` and `&` operators. The only non-mechanical change is to split ARMBaseInstrInfo::getOperandLatencyImpl out from ARMBaseInstrInfo::getOperandLatency. Previously, the latter took a `MachineInstr` which it updated to the instruction bundle leader; now, the latter calls the former either with the same `MachineInstr&` or the bundle leader. As a side effect, this removes a bunch of MachineInstr* to MachineBasicBlock::iterator implicit conversions, a necessary step toward fixing PR26753. Note: I updated WebAssembly, Lanai, and AVR (despite being off-by-default) since it turned out to be easy. I couldn't run tests for AVR since llc doesn't link with it turned on. llvm-svn: 274189	2016-06-30 00:01:54 +00:00
Peter Collingbourne	068078121c	Add move constructor and move assignment to fix MSVC build. llvm-svn: 274186	2016-06-29 23:54:10 +00:00
Peter Collingbourne	8ec68fad33	Object: Replace NewArchiveIterator with a simpler NewArchiveMember class. NFCI. The NewArchiveIterator class has a problem: it requires too much context. Any memory buffers added to the archive must be stored within an Archive::Member, which must have an associated Archive. This makes it harder than necessary to create new archive members (or new archives entirely) from scratch using memory buffers. This patch replaces NewArchiveIterator with a NewArchiveMember class that stores just the memory buffer and the information that goes into the archive member header. Differential Revision: http://reviews.llvm.org/D21721 llvm-svn: 274183	2016-06-29 22:27:42 +00:00
Zachary Turner	07670b3e98	Resubmit "Update llvm command line parser to support subcommands." This fixes an issue where occurrence counts would be unexpectedly reset when parsing different parts of a command line multiple times. ORIGINAL COMMIT MESSAGE This allows command line tools to use syntaxes like the following: llvm-foo.exe command1 -o1 -o2 llvm-foo.exe command2 -p1 -p2 Where command1 and command2 contain completely different sets of valid options. This is backwards compatible with previous uses of llvm cl which did not support subcommands, as any option which specifies no optional subcommand (e.g. all existing code) goes into a special "top level" subcommand that expects dashed options to appear immediately after the program name. For example, code which is subcommand unaware would generate a command line such as the following, where no subcommand is specified: llvm-foo.exe -q1 -q2 The top level subcommand can co-exist with actual subcommands, as it is implemented as an actual subcommand which is searched if no explicit subcommand is specified. So llvm-foo.exe as specified above could be written so as to support all three aforementioned command lines simultaneously. There is one additional "special" subcommand called AllSubCommands, which can be used to inject an option into every subcommand. This is useful to support things like help, so that commands such as: llvm-foo.exe --help llvm-foo.exe command1 --help llvm-foo.exe command2 --help All work and display the help for the selected subcommand without having to explicitly go and write code to handle each one separately. This patch is submitted without an example of anything actually using subcommands, but a followup patch will convert the llvm-pdbdump tool to use subcommands. Reviewed By: beanz llvm-svn: 274171	2016-06-29 21:48:26 +00:00
Kevin Enderby	c60a321c6b	Change Archive::create() from ErrorOr<...> to Expected<...> and update its clients. This commit will break the next lld builds. I’ll be committing the matching change for lld next. llvm-svn: 274160	2016-06-29 20:35:44 +00:00
Tim Shen	aec68b263d	[InstCombine] Simplify and correct folding fcmps with the same children Summary: Take advantage of FCmpInst::Predicate's bit pattern and handle (fcmp , x, y) \| (fcmp , x, y) and (fcmp , x, y) & (fcmp , x, y) more consistently. Also fold more FCmpInst::FCMP_FALSE and FCmpInst::FCMP_TRUE to constants. Currently InstCombine wrongly folds (fcmp ogt, x, y) \| (fcmp ord, x, y) to (fcmp ogt, x, y); this patch also fixes that. Reviewers: spatel Subscribers: llvm-commits, iteratee, echristo Differential Revision: http://reviews.llvm.org/D21775 llvm-svn: 274156	2016-06-29 20:10:17 +00:00
Benjamin Kramer	832d042078	[ManagedStatic] Reimplement double-checked locking with std::atomic. This gets rid of the memory fence in the hot path (dereferencing the ManagedStatic), trading for an extra mutex lock in the cold path (when the ManagedStatic was uninitialized). Since this only happens on the first accesses it shouldn't matter much. On strict architectures like x86 this removes any atomic instructions from the hot path. Also remove the tsan annotations, tsan knows how standard atomics work so they should be unnecessary now. llvm-svn: 274131	2016-06-29 15:04:07 +00:00
Rafael Espindola	a99ccfce1a	Drop support for creating $stubs. They are created by ld64 since OS X 10.5. llvm-svn: 274130	2016-06-29 14:59:50 +00:00
Krzysztof Parzyszek	3da1078a9b	[Docs][CodeGenerator] Don't specify the number of operands in BuildMI Patch by Visoiu Mistrih Francis. Differential Revision: http://reviews.llvm.org/D21819 llvm-svn: 274128	2016-06-29 14:14:59 +00:00
Elena Demikhovsky	5e21c94f25	Reverted patch 273864 llvm-svn: 274115	2016-06-29 10:01:06 +00:00
Vedant Kumar	34e4e477c8	Revert "[Coverage] Move logic to encode filenames and mappings into llvm (NFC)" This reverts commit 520a8298d8ef676b5da617ba3d2c7fa37381e939 (r273055). This is breaking stage2 instrumented builds with "malformed coverage data" errors. llvm-svn: 274106	2016-06-29 05:33:26 +00:00
Vedant Kumar	a30139d50c	Revert "[Coverage] Clarify ownership of a MemoryBuffer in the reader (NFC)" This reverts commit 1037ef2574adde2103ad221d63834c3e1df4a776. llvm-svn: 274105	2016-06-29 05:33:24 +00:00
Adam Nemet	ad437fff53	[Diag] Add getter shouldAlwaysPrint. NFC For the new hotness attribute, the API will take the pass rather than the pass name so we can no longer play the trick of AlwaysPrint being a special pass name. This adds a getter to help the transition. There is also a corresponding clang patch. llvm-svn: 274100	2016-06-29 04:55:19 +00:00
Craig Topper	f067a043fb	[CodeGen] Make ShuffleVectorSDNode::commuteMask take a MutableArrayRef instead of SmallVectorImpl. NFC. llvm-svn: 274095	2016-06-29 03:29:06 +00:00
Davide Italiano	941685e9f4	[Triple] Add isLittleEndian(). This allows us to query about the endianness without having to look at DataLayout. The API will be used (and tested) in lld, in order to find out the endianness of BitcodeFiles. Briefly discussed with Rafael. llvm-svn: 274090	2016-06-29 01:56:27 +00:00
Kevin Enderby	42398051d8	Finish cleaning up most of the error handling in libObject’s MachOUniversalBinary and its clients to use the new llvm::Error model for error handling. Changed getAsArchive() from ErrorOr<...> to Expected<...> so now all interfaces there use the new llvm::Error model for return values. In the two places it had if (!Parent) this is actually a program error so changed from returning errorCodeToError(object_error::parse_failed) to calling report_fatal_error() with a message. In getObjectForArch() added error messages to its two llvm::Error return values instead of returning errorCodeToError(object_error::arch_not_found) with no error message. For the llvm-obdump, llvm-nm and llvm-size clients since the only binary files in Mach-O Universal Binaries that are supported are Mach-O files or archives with Mach-O objects, updated their logic to generate an error when a slice contains something like an ELF binary instead of ignoring it. And added a test case for that. The last error stuff to be cleaned up for libObject’s MachOUniversalBinary is the use of errorOrToExpected(Archive::create(ObjBuffer)) which needs Archive::create() to be changed from ErrorOr<...> to Expected<...> first, which I’ll work on next. llvm-svn: 274079	2016-06-28 23:16:13 +00:00
Adam Nemet	9c12639370	[Diag] Fix file comment llvm-svn: 274078	2016-06-28 23:06:39 +00:00
Manman Ren	d16490dfd1	Revert r274054 to try to appease the bot llvm-svn: 274072	2016-06-28 22:20:17 +00:00
Zachary Turner	2012d744f4	Update llvm command line parser to support subcommands. This allows command line tools to use syntaxes like the following: llvm-foo.exe command1 -o1 -o2 llvm-foo.exe command2 -p1 -p2 Where command1 and command2 contain completely different sets of valid options. This is backwards compatible with previous uses of llvm cl which did not support subcommands, as any option which specifies no optional subcommand (e.g. all existing code) goes into a special "top level" subcommand that expects dashed options to appear immediately after the program name. For example, code which is subcommand unaware would generate a command line such as the following, where no subcommand is specified: llvm-foo.exe -q1 -q2 The top level subcommand can co-exist with actual subcommands, as it is implemented as an actual subcommand which is searched if no explicit subcommand is specified. So llvm-foo.exe as specified above could be written so as to support all three aforementioned command lines simultaneously. There is one additional "special" subcommand called AllSubCommands, which can be used to inject an option into every subcommand. This is useful to support things like help, so that commands such as: llvm-foo.exe --help llvm-foo.exe command1 --help llvm-foo.exe command2 --help All work and display the help for the selected subcommand without having to explicitly go and write code to handle each one separately. This patch is submitted without an example of anything actually using subcommands, but a followup patch will convert the llvm-pdbdump tool to use subcommands. Reviewed By: beanz Differential Revision: http://reviews.llvm.org/D21485 llvm-svn: 274054	2016-06-28 20:09:47 +00:00
Artur Pilipenko	7ad95ec22d	Support arbitrary addrspace pointers in masked load/store intrinsics This is a resubmittion of 263158 change after fixing the existing problem with intrinsics mangling (see LTO and intrinsics mangling llvm-dev thread for details). This patch fixes the problem which occurs when loop-vectorize tries to use @llvm.masked.load/store intrinsic for a non-default addrspace pointer. It fails with "Calling a function with a bad signature!" assertion in CallInst constructor because it tries to pass a non-default addrspace pointer to the pointer argument which has default addrspace. The fix is to add pointer type as another overloaded type to @llvm.masked.load/store intrinsics. Reviewed By: reames Differential Revision: http://reviews.llvm.org/D17270 llvm-svn: 274043	2016-06-28 18:27:25 +00:00
Jacques Pienaar	f43266b868	[lanai] Update ELF number to correspond to the assigned number. Change EM_LANAI to correspond to machine number assigned by Xinuos. llvm-svn: 274042	2016-06-28 18:22:22 +00:00
Vassil Vassilev	92de57e7b2	[modules] Separate intrinsic_gen dependent headers from module LLVM_IR. Some headers in IR depend on tablegen generated code. Modules builds triggered generation of the LLVM_IR module (including headers dependant on intrinsic_gen), imposing a unnecessary build dependency. Reviewed by Richard Smith. llvm-svn: 274006	2016-06-28 12:27:09 +00:00
Vassil Vassilev	21858f21f0	Add missing includes. Patch by Cristina Cristescu. llvm-svn: 274004	2016-06-28 12:17:05 +00:00
Xinliang David Li	3e176c77ab	[BFI/MBFI]: cfg graph view with color scheme This patch enhances dot graph viewer to show hot regions with hot bbs/edges displayed in red. The ratio of the bb freq to the max freq of the function needs to be no less than the value specified by view-hot-freq-percent option. The default value is 10 (i.e. 10%). llvm-svn: 273996	2016-06-28 06:58:21 +00:00
Xinliang David Li	55415f2565	[BFI]: graph viewer code refactoring BFI and MBFI's dot traits class share most of the code and all future enhancement. This patch extracts common implementation into base class BFIDOTGraphTraitsBase. This patch also enables BFI graph to show branch probability on edges as MBFI does before. llvm-svn: 273990	2016-06-28 03:41:29 +00:00
Xinliang David Li	3264fdd3ca	[BFI]: code cleanup Expose getBPI interface from BFI impl and use it in graph viewer. This eliminates the dependency on old PM interface. llvm-svn: 273967	2016-06-28 00:15:45 +00:00
Michael Kuperstein	0f684b0d04	Remove stray comment. NFC. llvm-svn: 273966	2016-06-28 00:14:09 +00:00
Chandler Carruth	dca834089a	[PM] Improve the debugging and logging facilities of the CGSCC bits of the new pass manager. This adds operator<< overloads for the various bits of the LazyCallGraph, dump methods for use from the debugger, and debug logging using them to the CGSCC pass manager. Having this was essential for debugging the call graph update patch, and I've extracted what I could from that patch here to minimize the delta. llvm-svn: 273961	2016-06-27 23:26:08 +00:00
Rafael Espindola	3beef8d6db	Move shouldAssumeDSOLocal to Target. Should fix the shared library build. llvm-svn: 273958	2016-06-27 23:15:57 +00:00
Davide Italiano	bddbabb6c8	[MC] Garbage collect dead API: createELFObjectTargetWriter(). llvm-svn: 273953	2016-06-27 22:41:52 +00:00
Kevin Enderby	1051909df1	Change all but the last ErrorOr<...> use for MachOUniversalBinary to Expected<...> to allow a good error message to be produced. I added the one test case that the object file tools could produce an error message. The other two errors can’t be triggered if the input file is passed through sys::fs::identify_magic(). But the malformedError("bad magic number") does get triggered by the logic in llvm-dsymutil when dealing with a normal Mach-O file. The other "File too small ..." error would take a logic error currently to produce and is not tested for. llvm-svn: 273946	2016-06-27 21:39:39 +00:00
Rafael Espindola	f9e348bd59	Convert a few more comparisons to isPositionIndependent(). NFC. llvm-svn: 273945	2016-06-27 21:33:08 +00:00
Chris Bieneman	8ff0c11357	[yaml2obj] Remove --format option in favor of YAML tags Summary: Our YAML library's handling of tags isn't perfect, but it is good enough to get rid of the need for the --format argument to yaml2obj. This patch does exactly that. Instead of requiring --format, it infers the format based on the tags found in the object file. The supported tags are: !ELF !COFF !mach-o !fat-mach-o I have a corresponding patch that is quite large that fixes up all the in-tree test cases. Reviewers: rafael, Bigcheese, compnerd, silvas Subscribers: compnerd, llvm-commits Differential Revision: http://reviews.llvm.org/D21711 llvm-svn: 273915	2016-06-27 19:53:53 +00:00
Daniel Berlin	16ed57c86b	Factor out buildMemorySSA from getWalker. NFC. llvm-svn: 273901	2016-06-27 18:22:27 +00:00
Artur Pilipenko	72f76b8805	Revert -r273892 "Support arbitrary addrspace pointers in masked load/store intrinsics" since some of the clang tests don't expect to see the updated signatures. llvm-svn: 273895	2016-06-27 16:54:33 +00:00
Easwaran Raman	1832bf6aee	[PM] Port PartialInlining to the new PM Differential revision: http://reviews.llvm.org/D21699 llvm-svn: 273894	2016-06-27 16:50:18 +00:00
Artur Pilipenko	a36aa41519	Support arbitrary addrspace pointers in masked load/store intrinsics This is a resubmittion of 263158 change after fixing the existing problem with intrinsics mangling (see LTO and intrinsics mangling llvm-dev thread for details). This patch fixes the problem which occurs when loop-vectorize tries to use @llvm.masked.load/store intrinsic for a non-default addrspace pointer. It fails with "Calling a function with a bad signature!" assertion in CallInst constructor because it tries to pass a non-default addrspace pointer to the pointer argument which has default addrspace. The fix is to add pointer type as another overloaded type to @llvm.masked.load/store intrinsics. Reviewed By: reames Differential Revision: http://reviews.llvm.org/D17270 llvm-svn: 273892	2016-06-27 16:29:26 +00:00
Rafael Espindola	0db11db560	Move isPositionIndependent up to AsmPrinter. Use it in ppc too. llvm-svn: 273877	2016-06-27 14:19:45 +00:00
Benjamin Kramer	728cf09b0e	[IRBuilder] Drop unused CreateInvoke overloads. The arrayref overload is more flexible with virtually the same interface. NFC. llvm-svn: 273867	2016-06-27 12:25:26 +00:00
Elena Demikhovsky	4c58b2761a	Fixed consecutive memory access detection in Loop Vectorizer. It did not handle correctly cases without GEP. The following loop wasn't vectorized: for (int i=0; i<len; i++) to++ = from++; I use getPtrStride() to find Stride for memory access and return 0 is the Stride is not 1 or -1. Re-commit rL273257 - revision: http://reviews.llvm.org/D20789 llvm-svn: 273864	2016-06-27 11:19:23 +00:00
Pawel Bylica	6830401863	APInt: remove unsued param in private method. NFC Reviewers: davide Subscribers: davide, llvm-commits Differential Revision: http://reviews.llvm.org/D21638 llvm-svn: 273851	2016-06-27 08:31:48 +00:00
Rafael Espindola	ae0d866f56	Refactor a duplicated predicate. NFC. llvm-svn: 273826	2016-06-26 22:13:55 +00:00
Benjamin Kramer	aa2091505f	Apply clang-tidy's modernize-loop-convert to lib/Analysis. Only minor manual fixes. No functionality change intended. llvm-svn: 273816	2016-06-26 17:27:42 +00:00
Benjamin Kramer	135f735af1	Apply clang-tidy's modernize-loop-convert to most of lib/Transforms. Only minor manual fixes. No functionality change intended. llvm-svn: 273808	2016-06-26 12:28:59 +00:00
Sanjoy Das	90547f1d20	[RSForGC] Bring findBasePointer up to code; NFC Name-casing and minor style changes to bring the function up to the LLVM coding style. llvm-svn: 273791	2016-06-26 04:55:05 +00:00
David Majnemer	ad7b7e73a5	[Object, COFF] An import data directory might not consist soley of imports The last import is the penultimate entry, the last entry is nulled out. Data beyond the null entry should not be considered to hold import entries. This fixes PR28302. N.B. I am working on a reduced testcase, the one in PR28302 is too large. llvm-svn: 273790	2016-06-26 04:36:32 +00:00
Hubert Tong	ff243b588b	Reapply r273664 with workaround for MSVC Reviewers: rsmith, faisalv, aaron.ballman Subscribers: llvm-commits, cfe-commits, nwilson Differential Revision: http://reviews.llvm.org/D19770 llvm-svn: 273781	2016-06-25 11:23:59 +00:00
David Majnemer	e14e7bc4b8	Revert "[SimplifyCFG] Stop inserting calls to llvm.trap for UB" This reverts commit r273778, it seems to break UBSan :/ llvm-svn: 273779	2016-06-25 08:19:55 +00:00
David Majnemer	d346a37737	[SimplifyCFG] Stop inserting calls to llvm.trap for UB SimplifyCFG had logic to insert calls to llvm.trap for two very particular IR patterns: stores and invokes of undef/null. While InstCombine canonicalizes certain undefined behavior IR patterns to stores of undef, phase ordering means that this cannot be relied upon in general. There are much better tools than llvm.trap: UBSan and ASan. N.B. I could be argued into reverting this change if a clear argument as to why it is important that we synthesize llvm.trap for stores, I'd be hard pressed to see why it'd be useful for invokes... llvm-svn: 273778	2016-06-25 08:04:19 +00:00
NAKAMURA Takumi	c9f731c479	Fix a typo in FindAvailableLoadedValue, introduced by r273734. [-Wdocumentation] llvm-svn: 273774	2016-06-25 06:03:14 +00:00
Matthias Braun	cc676c47a3	MachineScheduler: Remember top/bottom choice in bidirectional scheduling Remember the last choice for the top/bottom scheduling boundary in bidirectional scheduling mode. The top choice should not change if we schedule at the bottom and vice versa. This allows us to improve compiletime: We only recalculate the best pick for one border and re-use the cached top-pick from the other border. Differential Revision: http://reviews.llvm.org/D19350 llvm-svn: 273766	2016-06-25 02:03:36 +00:00
David Majnemer	0f45572761	The absence of noreturn doesn't ensure mayReturn There are two separate issues: - LLVM doesn't consider infinite loops to be side effects: we happily hoist/sink above/below loops whose bounds are unknown. - The absence of the noreturn attribute is insufficient for us to know if a function will definitely return. Relying on noreturn in the middle-end for any property is an accident waiting to happen. llvm-svn: 273762	2016-06-25 00:55:12 +00:00
Peter Collingbourne	0312f614b1	IR: Introduce llvm.type.checked.load intrinsic. This intrinsic safely loads a function pointer from a virtual table pointer using type metadata. This intrinsic is used to implement control flow integrity in conjunction with virtual call optimization. The virtual call optimization pass will optimize away llvm.type.checked.load intrinsics associated with devirtualized calls, thereby removing the type check in cases where it is not needed to enforce the control flow integrity constraint. This patch also introduces the capability to copy type metadata between global variables, and teaches the virtual call optimization pass to do so. Differential Revision: http://reviews.llvm.org/D21121 llvm-svn: 273756	2016-06-25 00:23:04 +00:00
Matthias Braun	6ad3d05b68	MachineScheduler: Fully compare top/bottom candidates In bidirectional scheduling this gives more stable results than just comparing the "reason" fields of the top/bottom node because the reason field may be higher depending on what other nodes are in the queue. Differential Revision: http://reviews.llvm.org/D19401 llvm-svn: 273755	2016-06-25 00:23:00 +00:00
Michael Kuperstein	83b753d430	[PM] Port float2int to the new pass manager Differential Revision: http://reviews.llvm.org/D21704 llvm-svn: 273747	2016-06-24 23:32:02 +00:00
Eli Friedman	5a52856cc8	Fix documentation for FindAvailableLoadedValue. llvm-svn: 273734	2016-06-24 21:32:15 +00:00
Peter Collingbourne	7efd750607	IR: New representation for CFI and virtual call optimization pass metadata. The bitset metadata currently used in LLVM has a few problems: 1. It has the wrong name. The name "bitset" refers to an implementation detail of one use of the metadata (i.e. its original use case, CFI). This makes it harder to understand, as the name makes no sense in the context of virtual call optimization. 2. It is represented using a global named metadata node, rather than being directly associated with a global. This makes it harder to manipulate the metadata when rebuilding global variables, summarise it as part of ThinLTO and drop unused metadata when associated globals are dropped. For this reason, CFI does not currently work correctly when both CFI and vcall opt are enabled, as vcall opt needs to rebuild vtable globals, and fails to associate metadata with the rebuilt globals. As I understand it, the same problem could also affect ASan, which rebuilds globals with a red zone. This patch solves both of those problems in the following way: 1. Rename the metadata to "type metadata". This new name reflects how the metadata is currently being used (i.e. to represent type information for CFI and vtable opt). The new name is reflected in the name for the associated intrinsic (llvm.type.test) and pass (LowerTypeTests). 2. Attach metadata directly to the globals that it pertains to, rather than using the "llvm.bitsets" global metadata node as we are doing now. This is done using the newly introduced capability to attach metadata to global variables (r271348 and r271358). See also: http://lists.llvm.org/pipermail/llvm-dev/2016-June/100462.html Differential Revision: http://reviews.llvm.org/D21053 llvm-svn: 273729	2016-06-24 21:21:32 +00:00
Rafael Espindola	a895a0cd01	Add support for musl-libc on ARM Linux. Patch by Lei Zhang! llvm-svn: 273726	2016-06-24 21:14:33 +00:00
Chris Bieneman	cd7f886e06	[MachO] Fixing copy-paste error from r273719 Thanks Kevin! llvm-svn: 273725	2016-06-24 21:06:52 +00:00
George Burgess IV	fd1f2f8561	[MemorySSA] Move code around a bit. NFC. This patch moves MSSA's caching walker into MemorySSA, and moves the actual definition of MSSA's caching walker out of MemorySSA.h. This is done in preparation for the new walker, which should be out for review soonish. Also, this patch removes a field from UpwardsMemoryQuery and has a few lines of diff from clang-format'ing MemorySSA.cpp. llvm-svn: 273723	2016-06-24 21:02:12 +00:00
Chris Bieneman	93e7119380	[obj2yaml] [yaml2obj] Support for MachO Universal binaries This patch adds round-trip support for MachO Universal binaries to obj2yaml and yaml2obj. Universal binaries have a header and list of architecture structures, followed by a the individual object files at specified offsets. llvm-svn: 273719	2016-06-24 20:42:28 +00:00
Michael Kuperstein	82d5da5aac	[PM] Port PreISelIntrinsicLowering to the new PM llvm-svn: 273713	2016-06-24 20:13:42 +00:00
David Majnemer	f15064871a	[CodeView] Healthy paranoia around strings Make sure strings don't get too big for a record, truncate them if need-be. llvm-svn: 273710	2016-06-24 19:34:41 +00:00
Reid Kleckner	fbd5eef691	Revert "InstCombine rule to fold trunc when value available" This reverts commit r273608. Broke building code with sanitizers, where apparently these kinds of loads, casts, and truncations are common: http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux/builds/24502 http://crbug.com/623099 llvm-svn: 273703	2016-06-24 18:42:58 +00:00
Kevin Enderby	931cb65df2	Thread Expected<...> up from libObject’s getSymbolAddress() for symbols to allow a good error message to be produced. This is nearly the last libObject interface that used ErrorOr and the last one that appears in llvm/include/llvm/Object/MachO.h . For Mach-O objects this is just a clean up because it’s version of getSymbolAddress() can’t return an error. I will leave it to the experts on COFF and ELF to actually add meaning full error messages in their tests if they wish. And also leave it to these experts to change the last two ErrorOr interfaces in llvm/include/llvm/Object/ObjectFile.h for createCOFFObjectFile() and createELFObjectFile() if they wish. Since there are no test cases for COFF and ELF error cases with respect to getSymbolAddress() in the test suite this is no functional change (NFC). llvm-svn: 273701	2016-06-24 18:24:42 +00:00
Peter Collingbourne	4f7c16dd53	Linker: Copy metadata when linking declarations. Differential Revision: http://reviews.llvm.org/D21624 llvm-svn: 273692	2016-06-24 17:42:21 +00:00
Reid Kleckner	33848faa5e	[codeview] Use one byte for S_FRAMECOOKIE CookieKind and add flags byte We bailed out while printing codeview for an MSVC compiled SemaExprCXX.cpp that used this record. The MS reference headers look incorrect here, which is probably why we had this bug. They use a 32-bit enum as the field type, but the actual record appears to use one byte for the cookie kind followed by a flags byte. llvm-svn: 273691	2016-06-24 17:23:49 +00:00
Artur Pilipenko	6c7a8abf5c	Remangle intrinsics names when types are renamed This is a resubmittion of previously reverted rL273568. This is a fix for the problem mentioned in "LTO and intrinsics mangling" llvm-dev mail thread: http://lists.llvm.org/pipermail/llvm-dev/2016-April/098387.html Reviewers: mehdi_amini, reames Differential Revision: http://reviews.llvm.org/D19373 llvm-svn: 273686	2016-06-24 15:10:29 +00:00
Artur Pilipenko	b68b82117a	NFC. Move verifyIntrinsicIsVarArg from verifier to Intrinsic::matchIntrinsicVarArg since it will be reused for intrinsic remangling code llvm-svn: 273685	2016-06-24 14:47:27 +00:00
Chad Rosier	fd342808e0	[MachineDominatorTree] Add a MDT verifier. Differential Revision: http://reviews.llvm.org/D21657 llvm-svn: 273678	2016-06-24 13:32:22 +00:00
Hubert Tong	3e2c30d447	Revert r273664 Revert change until build issues with MSVC can be resolved. llvm-svn: 273670	2016-06-24 12:25:15 +00:00
Hubert Tong	034d2c92e8	Add FixedSizeStorage to TrailingObjects; NFC Summary: This change introduces two types, `FixedSizeStorage` and `FixedSizeStorageOwner`, which can be used to provide stack-allocated objects with trailing objects. Reviewers: rsmith, faisalv, aaron.ballman Subscribers: llvm-commits, cfe-commits, nwilson Differential Revision: http://reviews.llvm.org/D19770 llvm-svn: 273664	2016-06-24 11:34:16 +00:00
Simon Dardis	5f95c9af8d	Revert "Revert "[misched] Extend scheduler to handle unsupported features"" This reverts commit r273565. This was an over-eager revert. llvm-svn: 273658	2016-06-24 08:43:27 +00:00
David Majnemer	df68f032ae	Use the same underlying type for bitfields MSVC allocates fresh storage for consecutive bitfields with different underlying types. llvm-svn: 273645	2016-06-24 04:05:25 +00:00
Tom Stellard	14416ae6cd	Support/ELF: Add R_AMDGPU_GOTPCREL relocation Summary: We will start generating this in a future patch. Reviewers: arsenm, kzhuravl, rafael, ruiu, tony-tye Subscribers: arsenm, llvm-commits, kzhuravl Differential Revision: http://reviews.llvm.org/D21482 llvm-svn: 273628	2016-06-23 23:11:29 +00:00
Chandler Carruth	586fc7c5be	[LCG] Make the name of an SCC include more of the functions in it. This makes it much easier to debug issues when the logging contains the name of the SCC. It requires to create a temporary string, but for logging and debugging uses that seems fine. I've added logic to try to output all the function names with an elipsis if there are too many. This was helpful fro me in debugging issues with the new pass manager. llvm-svn: 273625	2016-06-23 22:51:14 +00:00
Anna Thomas	31a0b2088f	InstCombine rule to fold trunc when value available Summary: This instcombine rule folds away trunc operations that have value available from a prior load or store. This kind of code can be generated as a result of GVN widening the load or from source code as well. Reviewers: reames, majnemer, sanjoy Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D21246 llvm-svn: 273608	2016-06-23 20:22:22 +00:00
Vassil Vassilev	e3ffbc38d9	Typo. llvm-svn: 273592	2016-06-23 18:13:46 +00:00
Sanjoy Das	2951e6b314	[SCEV] Don't unnecessarily namespace; NFC llvm-svn: 273587	2016-06-23 18:03:32 +00:00
Nirav Dave	bfdb483755	Preserve DebugInfo when replacing values in DAGCombiner Recommiting after correcting over-eager Debug Value transfer fixing PR28270. [DAG] Previously debug values would transfer debuginfo for the selected start node for a replacement which allows for debug to be dropped. Push debug value transfer to occur with node/value replacement in SelectionDAG, remove now extraneous transfers of debug values. This refixes PR9817 which was being incompletely checked in the testsuite. Reviewers: jyknight Subscribers: dblaikie, llvm-commits Differential Revision: http://reviews.llvm.org/D21037 llvm-svn: 273585	2016-06-23 17:52:57 +00:00
Pablo Barrio	7a64346533	[ARM] Lower (select_cc k k (select_cc ~k ~k x)) into (SSAT l_k x) Summary: SSAT saturates an integer, making sure that its value lies within an interval [-k, k]. Since the constant is given to SSAT as the number of bytes set to one, k + 1 must be a power of 2, otherwise the optimization is not possible. Also, the select_cc must use < and > respectively so that they define an interval. Reviewers: mcrosier, jmolloy, rengolin Subscribers: aemerson, rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D21372 llvm-svn: 273581	2016-06-23 16:53:49 +00:00
Hans Wennborg	a63b50afb8	Revert r273568 "Remangle intrinsics names when types are renamed" It broke 2008-07-15-Bswap.ll and 2009-09-01-PostRAProlog.ll llvm-svn: 273574	2016-06-23 16:13:23 +00:00
Artur Pilipenko	f0c9f81379	Remangle intrinsics names when types are renamed This is a fix for the problem mentioned in "LTO and intrinsics mangling" llvm-dev mail thread: http://lists.llvm.org/pipermail/llvm-dev/2016-April/098387.html Reviewers: mehdi_amini, reames Differential Revision: http://reviews.llvm.org/D19373 llvm-svn: 273568	2016-06-23 15:25:09 +00:00
Simon Dardis	fcc7f6fad2	Revert "[misched] Extend scheduler to handle unsupported features" This reverts commit r273551. Patch contained a wrong check for isUnsupported. llvm-svn: 273565	2016-06-23 14:54:47 +00:00
Simon Dardis	081e4bb14c	[misched] Extend scheduler to handle unsupported features Currently isComplete = 1 requires that every instruction must be described, declared unsupported or marked as having no scheduling information for a processor. For some backends such as MIPS, this requirement entails long regex lists of instructions that are unsupported. This patch teaches Tablegen to skip over instructions that are associated with unsupported feature when checking if the scheduling model is complete. Patch by: Daniel Sanders Contributions by: Simon Dardis Reviewers: MatzeB Differential Reviewer: http://reviews.llvm.org/D20522 llvm-svn: 273551	2016-06-23 09:22:11 +00:00
Craig Topper	597aa42fec	[AVX512] Remove masked unpack intrinsics and autoupgrade to vectorshuffle and selects. llvm-svn: 273543	2016-06-23 07:37:33 +00:00
Vassil Vassilev	47867200a2	[modules] Good ol' JIT is gone. llvm-svn: 273541	2016-06-23 07:33:03 +00:00
Vassil Vassilev	12ee77c835	Add missing include. Should fix modules builds. llvm-svn: 273540	2016-06-23 07:30:12 +00:00
David Majnemer	8871e7a49f	[ADT] Add a range variant of std::transform This will be used in a followup change in clang. llvm-svn: 273520	2016-06-23 00:14:26 +00:00
Peter Collingbourne	6717803485	Revert r273456, "Preserve DebugInfo when replacing values in DAGCombiner" as it caused pr28270. llvm-svn: 273518	2016-06-23 00:06:17 +00:00
Reid Kleckner	858239d5f8	Prune some includes from headers and sink some inline functions MCSymbol.h shouldn't pull in MCAssembler.h, just MCFragment.h. MCLinkerOptimizationHint.h shouldn't need MCMachObjectWriter.h. The rest is fixing the fallout. llvm-svn: 273507	2016-06-22 23:23:08 +00:00
Xinliang David Li	ce030acb4e	[PM]: LoopAccessInfo simple refactoring To make definition of mov ctors easier. Differential Revision: http://reviews.llvm.org/D21563 llvm-svn: 273506	2016-06-22 23:20:59 +00:00
Chris Bieneman	8e783eb757	[MachO] Finish moving fat header swap functions to MachO.h This is a follow-up to r273479. At the time I wrote r273479 I didn't connect the dots that the functions I was adding had to exist somewhere. Turns out, they do. This finishes moving the functions to MachO.h. Existing MachO fat header tests like test/tools/llvm-readobj/Inputs/macho-universal-archive.x86_64.i386 execute this code. llvm-svn: 273502	2016-06-22 22:19:08 +00:00
Changpeng Fang	47efe1f6db	AMDGPU/SI: Define an intrinsic to expose ds_swizzle_b32 Reviewers: tstellarAMD, arsenm Differential Revision: http://reviews.llvm.org/D21533 llvm-svn: 273496	2016-06-22 21:33:49 +00:00
Pawel Bylica	f4437e9b48	Do not require __STDC_LIMIT_MACROS and others Summary: Do not require __STDC_LIMIT_MACROS and __STDC_CONSTANT_MACROS macros to be defined globally. They are not needed for C++11 compliant standard headers. Reviewers: joerg, jyknight Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D21553 llvm-svn: 273493	2016-06-22 21:15:51 +00:00
Chris Bieneman	4bf88afa52	[MachO] Adding a few missing swapStruct functions These are just missing swap functions for handling endian conversion. llvm-svn: 273478	2016-06-22 21:01:17 +00:00
Daniel Berlin	84d6372010	Update header documentation for API deliberately made public llvm-svn: 273473	2016-06-22 20:31:12 +00:00
Peter Collingbourne	6d88fde3af	IR: Introduce Module::global_objects(). This is a convenience iterator that allows clients to enumerate the GlobalObjects within a Module. Also start using it in a few places where it is obviously the right thing to use. Differential Revision: http://reviews.llvm.org/D21580 llvm-svn: 273470	2016-06-22 20:29:42 +00:00
Davide Italiano	53d457c615	[UpdateCompilerUsed] API rename and cleanup, suggested by Rafaael. * UpdateCompilerUsed() -> updateCompilerUsed() * ThinLTO doesn't use the API so we can remove the include * Clean up unused #include <functional> from the header * Rename #ifdef guard comment to be correct. llvm-svn: 273461	2016-06-22 19:50:42 +00:00
Xinliang David Li	30c50f3cea	[MBFI]: Add a new suboption for graph viewer -view-machine-block-freq-propagation-dags currently support integer and fraction as the suboptions. This patch adds the 'count' suboption to display actual profile count if available. llvm-svn: 273460	2016-06-22 19:26:44 +00:00
Sanjay Patel	a06d989552	[ValueTracking] improve ComputeNumSignBits for vector constants This is similar to the computeKnownBits improvement in rL268479. There's probably more we can do for vector logic instructions, but this should let us see non-splat constant masking ops that can become vector selects instead of and/andn/or sequences. Differential Revision: http://reviews.llvm.org/D21610 llvm-svn: 273459	2016-06-22 19:20:59 +00:00
Nirav Dave	96beb7dee5	Preserve DebugInfo when replacing values in DAGCombiner Recommiting after fixing over-aggressive assertion [DAG] Previously debug values would transfer debuginfo for the selected start node for a replacement which allows for debug to be dropped. Push debug value transfer to occur with node/value replacement in SelectionDAG, remove now extraneous transfers of debug values. This refixes PR9817 which was being incompletely checked in the testsuite. Reviewers: jyknight Subscribers: dblaikie, llvm-commits Differential Revision: http://reviews.llvm.org/D21037 llvm-svn: 273456	2016-06-22 19:03:26 +00:00
Wei Ding	0526e7f8d9	AMDGPU: Add convergent flag to INLINEASM instruction. Differential Revision: http://reviews.llvm.org/D21214 llvm-svn: 273455	2016-06-22 18:51:08 +00:00
Reid Kleckner	156a7239c1	[codeview] Add IntroducingVirtual debug info flag CodeView needs to know if a virtual method was introduced in the current class, and base classes may not have complete type information, so we need to thread this bit through from the frontend. llvm-svn: 273453	2016-06-22 18:31:14 +00:00
Reid Kleckner	3bd6c7d0e7	[codeview] Fix trivial bug in OneMethodRecord::isIntroducingVirtual These should be equality comparisons. Fixes assertions while self-hosting clang with codeview debug info. Ultimately this is going to be covered by real tests for virtual method emission, so I'm not adding a "don't crash on this input" test that I'll remove soon afterwards. llvm-svn: 273446	2016-06-22 17:32:59 +00:00
Xinliang David Li	b12b353a41	[BFI]: NFC refactoring move getBlockProfileCount implementation to the base class so that MBFI can share too. llvm-svn: 273442	2016-06-22 17:12:12 +00:00
Artur Pilipenko	bc552275e9	NFC. Move Verifier::verifyIntrinsicType to Intrinsics.h Move Verifier::verifyIntrinsicType to Intrinsics::matchIntrinsicsType. Will be used to accumulate overloaded types of a given intrinsic by the upcoming patch to fix intrinsics names when overloaded types are renamed. Reviewed By: reames Differential Revision: http://reviews.llvm.org/D19372 llvm-svn: 273424	2016-06-22 14:56:33 +00:00
Krzysztof Parzyszek	e116d500a7	[SDAG] Remove FixedArgs parameter from CallLoweringInfo::setCallee The setCallee function will set the number of fixed arguments based on the size of the argument list. The FixedArgs parameter was often explicitly set to 0, leading to a lack of consistent value for non- vararg functions. Differential Revision: http://reviews.llvm.org/D20376 llvm-svn: 273403	2016-06-22 12:54:25 +00:00
Davide Italiano	da5b8495e2	[LTO] Move UpdateCompilerUsed.h from lib/ to include/ I plan to use it in lld soon. Differential Revision: http://reviews.llvm.org/D21575 llvm-svn: 273380	2016-06-22 04:52:43 +00:00
Craig Topper	bb88afe17d	[X86] Remove GCC builtins from masked integer cmp and ucmp instrinsics so we can emit native IR in clang. llvm-svn: 273376	2016-06-22 04:47:42 +00:00
Peter Collingbourne	21521891a2	IR: Allow metadata attachments on declarations, and fix lazy loaded metadata issue with globals. This change is motivated by an upcoming change to the metadata representation used for CFI. The indirect function call checker needs type information for external function declarations in order to correctly generate jump table entries for such declarations. We currently associate such type information with declarations using a global metadata node, but I plan [1] to move all such metadata to global object attachments. In bitcode, metadata attachments for function declarations appear in the global metadata block. This seems reasonable to me because I expect metadata attachments on declarations to be uncommon. In the long term I'd also expect this to be the case for CFI, because we'd want to use some specialized bitcode format for this metadata that could be read as part of the ThinLTO thin-link phase, which would mean that it would not appear in the global metadata block. To solve the lazy loaded metadata issue I was seeing with D20147, I use the same bitcode representation for metadata attachments for global variables as I do for function declarations. Since there's a use case for metadata attachments in the global metadata block, we might as well use that representation for global variables as well, at least until we have a mechanism for lazy loading global variables. In the assembly format, the metadata attachments appear after the "declare" keyword in order to avoid a parsing ambiguity. [1] http://lists.llvm.org/pipermail/llvm-dev/2016-June/100462.html Differential Revision: http://reviews.llvm.org/D21052 llvm-svn: 273336	2016-06-21 23:42:48 +00:00
Vedant Kumar	7a7f5348a7	[Coverage] Clarify ownership of a MemoryBuffer in the reader (NFC) Pass a `MemoryBuffer &` to BinaryCoverageReader::create() instead of a `std::unique_ptr<MemoryBuffer> &`. This makes it easier to reason about the ownership of the buffer at a glance. llvm-svn: 273326	2016-06-21 22:22:33 +00:00
Jan Vesely	26c6bb103f	AMDGPU: Remove gcc builtin names from workitem intrinsics We'll need to emit these manually in clang to add range metadata Reviewers: arsenm Differential Revision: http://reviews.llvm.org/D20691 llvm-svn: 273318	2016-06-21 20:46:22 +00:00
Jan Vesely	fea814d531	AMDGPU: Add implicitarg.ptr intrinsic. Points to the start of implicit arguments (appended after explicit arguments) Differential Revision: http://reviews.llvm.org/D20297 llvm-svn: 273317	2016-06-21 20:46:20 +00:00
Daniel Berlin	1430026142	Add MemoryAccess creation and PHI creation APIs to MemorySSA Reviewers: george.burgess.iv, gberry, hfinkel Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D21463 llvm-svn: 273295	2016-06-21 18:39:20 +00:00
Reid Kleckner	5b335b864b	[codeview] Add support for splitting field list records over 64KB The basic structure is that once a list record goes over 64K, the last subrecord of the list is an LF_INDEX record that refers to the next record. Because the type record graph must be toplogically sorted, this means we have to emit them in reverse order. We build the type record in order of declaration, so this means that if we don't want extra copies, we need to detect when we were about to split a record, and leave space for a continuation subrecord that will point to the eventual split top-level record. Also adds dumping support for these records. Next we should make sure that large method overload lists work properly. llvm-svn: 273294	2016-06-21 18:33:01 +00:00
Krzysztof Parzyszek	4f1c86b77c	Fix typo, NFC llvm-svn: 273284	2016-06-21 16:16:52 +00:00
Etienne Bergeron	f6be62f2c8	[StackProtector] Fix computation of GSCookieOffset and EHCookieOffset with SEH4 Summary: Fix the computation of the offsets present in the scopetable when using the SEH (__except_handler4). This patch added an intrinsic to track the position of the allocation on the stack of the EHGuard. This position is needed when producing the ScopeTable. ``` struct _EH4_SCOPETABLE { DWORD GSCookieOffset; DWORD GSCookieXOROffset; DWORD EHCookieOffset; DWORD EHCookieXOROffset; _EH4_SCOPETABLE_RECORD ScopeRecord[1]; }; struct _EH4_SCOPETABLE_RECORD { DWORD EnclosingLevel; long (FilterFunc)(); union { void (HandlerAddress)(); void (*FinallyFunc)(); }; }; ``` The code to generate the EHCookie is added in `X86WinEHState.cpp`. Which is adding these instructions when using SEH4. ``` Lfunc_begin0: # BB#0: # %entry pushl %ebp movl %esp, %ebp pushl %ebx pushl %edi pushl %esi subl $28, %esp movl %ebp, %eax <<-- Loading FramePtr movl %esp, -36(%ebp) movl $-2, -16(%ebp) movl $L__ehtable$use_except_handler4_ssp, %ecx xorl ___security_cookie, %ecx movl %ecx, -20(%ebp) xorl ___security_cookie, %eax <<-- XOR FramePtr and Cookie movl %eax, -40(%ebp) <<-- Storing EHGuard leal -28(%ebp), %eax movl $__except_handler4, -24(%ebp) movl %fs:0, %ecx movl %ecx, -28(%ebp) movl %eax, %fs:0 movl $0, -16(%ebp) calll _may_throw_or_crash LBB1_1: # %cont movl -28(%ebp), %eax movl %eax, %fs:0 addl $28, %esp popl %esi popl %edi popl %ebx popl %ebp retl ``` And the corresponding offset is computed: ``` Luse_except_handler4_ssp$parent_frame_offset = -36 .p2align 2 L__ehtable$use_except_handler4_ssp: .long -2 # GSCookieOffset .long 0 # GSCookieXOROffset .long -40 # EHCookieOffset <<---- .long 0 # EHCookieXOROffset .long -2 # ToState .long _catchall_filt # FilterFunction .long LBB1_2 # ExceptionHandler ``` Clang is not yet producing function using SEH4, but it's a work in progress. This patch is a step toward having a valid implementation of SEH4. Unfortunately, it is not yet fully working. The EH registration block is not allocated at the right offset on the stack. Reviewers: rnk, majnemer Subscribers: llvm-commits, chrisha Differential Revision: http://reviews.llvm.org/D21231 llvm-svn: 273281	2016-06-21 15:58:55 +00:00
Daniel Sanders	bf2c03ee69	[arm+x86] Make GNU variants behave like GNU w.r.t combining sin+cos into sincos. Summary: canCombineSinCosLibcall() would previously combine sin+cos into sincos for GNUX32/GNUEABI/GNUEABIHF regardless of whether UnsafeFPMath were set or not. However, GNU would only combine them for UnsafeFPMath because sincos does not set errno like sin and cos do. It seems likely that this is an oversight. Reviewers: t.p.northover Subscribers: t.p.northover, aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D21431 llvm-svn: 273259	2016-06-21 12:29:03 +00:00
Elena Demikhovsky	a266cf0518	reverted the prev commit due to assertion failure llvm-svn: 273258	2016-06-21 12:10:11 +00:00
Elena Demikhovsky	9823c995bc	Fixed consecutive memory access detection in Loop Vectorizer. It did not handle correctly cases without GEP. The following loop wasn't vectorized: for (int i=0; i<len; i++) to++ = from++; I use getPtrStride() to find Stride for memory access and return 0 is the Stride is not 1 or -1. Differential revision: http://reviews.llvm.org/D20789 llvm-svn: 273257	2016-06-21 11:32:01 +00:00
James Y Knight	03c1415b8f	Revert "Change RelaxELFRelocations for llc." This reverts commit r273019. From email I sent to list: > I don't think this makes sense. Either the linker you're using supports > this feature, or it doesn't. Having it enabled for llc if your linker > doesn't support it is not fun. > > Further note that this also affects basically all other code using llvm > libraries -- other than Clang, which explicitly sets it back to false by > default, unless you set the ENABLE_X86_RELAX_RELOCATIONS cmake flag to > true. > > If you want to enable the relax mode across all llvm tools in some > circumstances, I think it should be via moving the cmake flag from clang > down into llvm. > > I'm going to revert this commit, since I both think it intrinsically > doesn't make sense to do this, and because it's breaking some of our > tools. llvm-svn: 273245	2016-06-21 05:40:41 +00:00
David Majnemer	e61e4bfd87	Replace silly uses of 'signed' with 'int' llvm-svn: 273244	2016-06-21 05:10:24 +00:00
Craig Topper	0a0fb0fda1	[AVX512] Remove the masked vpcmpeq/vcmpgt intrinsics and autoupgrade them to native icmps. llvm-svn: 273240	2016-06-21 03:53:24 +00:00
Reid Kleckner	0abaef604d	Use the same tag type across all PointerLikeTypeTraits specializations Works around a bug (PR28216) in Clang's MS mangling of templates with partial specializations. This mismatch was introduced in about six months ago in r256656. llvm-svn: 273223	2016-06-20 23:50:21 +00:00
George Burgess IV	87b2e41416	[CFLAA] Add interprocedural function summaries. This patch adds function summaries, so that we don't need to recompute various properties about function parameters/return values at each callsite of a function. It also adds many interprocedural tests for CFLAA. Patch by Jia Chen. Differential Revision: http://reviews.llvm.org/D21475#inline-182390 llvm-svn: 273219	2016-06-20 23:10:56 +00:00
Sanjay Patel	61ddbdcd02	don't repeat function names in documentation comments; NFC llvm-svn: 273209	2016-06-20 22:40:35 +00:00
Kevin Enderby	eb6d110c1d	Add support for Darwin’s 64-bit universal files with 64-bit offsets and sizes for the objects. Darwin added support in its Xcode 8.0 tools (released in the beta) for universal files where offsets and sizes for the objects are 64-bits to allow support for objects contained in universal files to be larger then 4gb. The change is very straight forward. There is a new magic number that differs by one bit, much like the 64-bit Mach-O files. Then there is a new structure that follow the fat_header that has the same layout but with the offset and size fields using 64-bit values instead of 32-bit values. rdar://26899493 llvm-svn: 273207	2016-06-20 22:16:18 +00:00
Easwaran Raman	8b65e86661	Remove interface to get/set MaxFunctionCount Differential revision: http://reviews.llvm.org/D19185 llvm-svn: 273203	2016-06-20 21:36:38 +00:00
Daniel Berlin	ada263dcd0	Rename to be consistent with other type names. NFC llvm-svn: 273194	2016-06-20 20:21:33 +00:00
Matt Arsenault	b6d8c37e1a	AMDGPU: Fold more custom nodes to undef This will help sneak undefs past GVN into the DAG for some tests. Also add missing intrinsic for rsq_legacy, even though the node was already selected to the instruction. Also start passing the debug location to intrinsic errors. llvm-svn: 273181	2016-06-20 18:33:56 +00:00
Matt Arsenault	ff98241f37	Generalize DiagnosticInfoStackSize to support other limits Backends may want to report errors on resources other than stack size. llvm-svn: 273177	2016-06-20 18:13:04 +00:00
Pankaj Gode	0aab2e398a	[AARCH64] Add support for Broadcom Vulcan Adding core tuning support for new Broadcom Vulcan core (ARMv8.1A). Differential Revision: http://reviews.llvm.org/D21500 llvm-svn: 273148	2016-06-20 11:13:31 +00:00
Vassil Vassilev	7b6d06c5d2	Add the corresponding modulemap entry, following up r273066. llvm-svn: 273112	2016-06-19 15:31:12 +00:00
Joerg Sonnenberger	2298203056	doesSetDirectiveSuppressesReloc -> doesSetDirectiveSuppressReloc, the former is grammatically incorrect. llvm-svn: 273100	2016-06-18 23:25:37 +00:00
Marcin Koscielnicki	3feda222c6	[sanitizers] Disable target-specific lowering of string functions. CodeGen has hooks that allow targets to emit specialized code instead of calls to memcmp, memchr, strcpy, stpcpy, strcmp, strlen, strnlen. When ASan/MSan/TSan/ESan is in use, this sidesteps its interceptors, resulting in uninstrumented memory accesses. To avoid that, make these sanitizers mark the calls as nobuiltin. Differential Revision: http://reviews.llvm.org/D19781 llvm-svn: 273083	2016-06-18 10:10:37 +00:00
Sean Silva	7cb30664fc	Add a super basic LazyCallGraph DOT printer. Access it through -passes=print-lcg-dot Let me know any suggestions for changing the rendering; I'm not particularly attached to what is implemented here. llvm-svn: 273082	2016-06-18 09:17:32 +00:00
Simon Pilgrim	f4b2af1b9f	[X86][SSE4A] Autoupgrade and remove MOVNTSD/MOVNTSS intrinsics Required better annotation of the instruction defs upon removal of the builtin intrinsic pattern. llvm-svn: 273077	2016-06-18 02:38:26 +00:00
Tom Stellard	f8db61c5f0	Support/ELF: Add AMDGPU relocation definitions to match documentation Reviewers: arsenm, kzhuravl, rafael Subscribers: llvm-commits, kzhuravl Differential Revision: http://reviews.llvm.org/D21443 llvm-svn: 273066	2016-06-17 22:38:08 +00:00
Adam Nemet	a9f09c6245	[LAA] Enable symbolic stride speculation for all LAA clients This is a functional change for LLE and LDist. The other clients (LV, LVerLICM) already had this explicitly enabled. The temporary boolean parameter to LAA is removed that allowed turning off speculation of symbolic strides. This makes LAA's caching interface LAA::getInfo only take the loop as the parameter. This makes the interface more friendly to the new Pass Manager. The flag -enable-mem-access-versioning is moved from LV to a LAA which now allows turning off speculation globally. llvm-svn: 273064	2016-06-17 22:35:41 +00:00
Matt Arsenault	eef67d531e	DiagnosticInfo: Allow unsupported be a warning Some unsupported features can be ignored, so don't force this to be a hard error. llvm-svn: 273061	2016-06-17 22:26:56 +00:00
Kevin Enderby	ae108ffb9a	Add support for Darwin’s static library table of contents with 64-bit offsets to the archive members. Darwin added support in its Xcode 8.0 tools (released in the beta) for static library table of contents with 64-bit offsets to the archive members. The change is very straight forward. The table of contents member is named ___.SYMDEF_64 or "___.SYMDEF_64 SORTED" and same layout is used but with fields using 64 bit values instead of 32 bit values. rdar://26869808 llvm-svn: 273058	2016-06-17 22:16:06 +00:00
Vedant Kumar	8039e92ac5	[Coverage] Move logic to encode filenames and mappings into llvm (NFC) Currently, frontends which emit source-based code coverage have to duplicate logic to encode filenames and raw coverage mappings properly. This violates an abstraction layer and forces frontends to copy tricky code. Introduce llvm::coverage::encodeFilenamesAndRawMappings() to take care of this. This will help us experiment with zlib-compressing coverage mapping data. llvm-svn: 273055	2016-06-17 21:53:31 +00:00
Reid Kleckner	604105bb90	[codeview] Add DIFlags for pointer to member representations Summary: This seems like the least intrusive way to pass this information through. Fixes PR28151 Reviewers: majnemer, aprantl, dblaikie Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D21444 llvm-svn: 273053	2016-06-17 21:31:33 +00:00
Benjamin Kramer	1afc1de406	Apply another batch of fixes from clang-tidy's performance-unnecessary-value-param. Contains some manual fixes. No functionality change intended. llvm-svn: 273047	2016-06-17 20:41:14 +00:00
Reid Kleckner	11582c59d7	[pdb] Don't error on missing FPO streams 64-bit PDBs never have FPO data. They have xdata instead. Also improve error recovery of stream summary dumping while I'm here. llvm-svn: 273046	2016-06-17 20:38:01 +00:00
Davide Italiano	b49aa5c0c4	[PM] Port MergedLoadStoreMotion to the new pass manager, take two. This is indeed a much cleaner approach (thanks to Daniel Berlin for pointing out), and also David/Sean for review. Differential Revision: http://reviews.llvm.org/D21454 llvm-svn: 273032	2016-06-17 19:10:09 +00:00
James Y Knight	148a6469dc	Support expanding partial-word cmpxchg to full-word cmpxchg in AtomicExpandPass. Many CPUs only have the ability to do a 4-byte cmpxchg (or ll/sc), not 1 or 2-byte. For those, you need to mask and shift the 1 or 2 byte values appropriately to use the 4-byte instruction. This change adds support for cmpxchg-based instruction sets (only SPARC, in LLVM). The support can be extended for LL/SC-based PPC and MIPS in the future, supplanting the ISel expansions those architectures currently use. Tests added for the IR transform and SPARCv9. Differential Revision: http://reviews.llvm.org/D21029 llvm-svn: 273025	2016-06-17 18:11:48 +00:00
Davide Italiano	4cccc488b7	[Codegen] Change PICLevel. We convert `Default` to `NotPIC` so that target independent code can reason about this correctly. Differential Revision: http://reviews.llvm.org/D21394 llvm-svn: 273024	2016-06-17 18:07:14 +00:00
Rafael Espindola	9f86baebe0	Change RelaxELFRelocations for llc. As a developer tool it makes sense for it to use the new relocations. llvm-svn: 273019	2016-06-17 17:43:41 +00:00
Rafael Espindola	f5e7d63add	Change RelaxELFRelocations' default. NFC to the existing clients since they all set it already. llvm-svn: 273017	2016-06-17 17:26:07 +00:00
Nirav Dave	fd91041ce1	Refactor and cleanup Assembly Parsing / Lexing Recommiting after fixing non-atomic insert to front of SmallVector in MCAsmLexer.h Add explicit Comment Token in Assembly Lexing for future support for outputting explicit comments from inline assembly. As part of this, CPPHash Directives are now explicitly distinguished from Hash line comments in Lexer. Line comments are recorded as EndOfStatement tokens, not Comment tokens to simplify compatibility with current TargetParsers. This slightly complicates comment output. This remove all lexing tasks out of the parser, does minor cleanup to remove extraneous newlines Asm Output, and some improvements white space handling. Reviewers: rtrieu, dwmw2, rnk Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D20009 llvm-svn: 273007	2016-06-17 16:06:17 +00:00
Simon Pilgrim	6a35e5ab97	[X86][SSE4A] Remove the GCCBuiltins from the movntsd/movntss intrinsic defs so we can emit native IR from clang. Clang-side sibling commit to follow. llvm-svn: 273002	2016-06-17 14:27:38 +00:00
Chandler Carruth	74a8a2214a	[PM] Run clang-format over various parts of the new pass manager code prior to some very substantial patches to isolate any formatting-only changes. llvm-svn: 272991	2016-06-17 07:15:29 +00:00
Ranjeet Singh	39d2d097d6	[ARM] Add support for mrrc/mrrc2 intrinsics. Reapplying patch as it was reverted when it was first committed because of an assertion failure when the mrrc2 intrinsic was called in ARM mode. The failure was happening because the instruction was being built in ARMISelDAGToDAG.cpp and the tablegen description for mrrc2 instruction doesn't allow you to use a predicate. The ARM architecture manuals do say that mrrc2 in ARM mode can be predicated with AL in assembly but this has no effect on the encoding of the instruction as the top 4 bits will always be 1111 not 1110 which is the encoding for the condition AL. Differential Revision: http://reviews.llvm.org/D21408 llvm-svn: 272982	2016-06-17 00:52:41 +00:00
Chandler Carruth	164a2aa6f4	[PM] Remove support for omitting the AnalysisManager argument to new pass manager passes' `run` methods. This removes a bunch of SFINAE goop from the pass manager and just requires pass authors to accept `AnalysisManager<IRUnitT> &` as a dead argument. This is a small price to pay for the simplicity of the system as a whole, despite the noise that changing it causes at this stage. This will also helpfull allow us to make the signature of the run methods much more flexible for different kinds af passes to support things like intelligently updating the pass's progression over IR units. While this touches many, many, files, the changes are really boring. Mostly made with the help of my trusty perl one liners. Thanks to Sean and Hal for bouncing ideas for this with me in IRC. llvm-svn: 272978	2016-06-17 00:11:01 +00:00
Adam Nemet	c953bb9953	[LV] Move management of symbolic strides to LAA. NFCI This is still NFCI, so the list of clients that allow symbolic stride speculation does not change (yes: LV and LoopVersioningLICM, no: LLE, LDist). However since the symbolic strides are now managed by LAA rather than passed by client a new bool parameter is used to enable symbolic stride speculation. The existing test Transforms/LoopVectorize/version-mem-access.ll checks that stride speculation is performed for LV. The previously added test Transforms/LoopLoadElim/symbolic-stride.ll ensures that no speculation is performed for LLE. The next patch will change the functionality and turn on symbolic stride speculation in all of LAA's clients and remove the bool parameter. llvm-svn: 272970	2016-06-16 22:57:55 +00:00
Evgeniy Stepanov	72d961a1da	[safestack] Fixup llvm.dbg.value when rewriting unsafe allocas. When moving unsafe allocas to the unsafe stack, dbg.declare intrinsics are updated to refer to the new location. This change does the same to dbg.value intrinsics. llvm-svn: 272968	2016-06-16 22:34:00 +00:00
Evgeniy Stepanov	660b1a49dc	Fix BitVector move ctor/assignment. Current implementation leaves the object in an invalid state. This reverts commit bf0c389ac683cd6c0e5959b16537e59e5f4589e3. llvm-svn: 272965	2016-06-16 21:45:13 +00:00
Matt Arsenault	8dad57cc49	TTI: Add hook for memory width to vectorize llvm-svn: 272964	2016-06-16 21:43:12 +00:00
Nirav Dave	280ecf6ff0	Revert "Refactor and cleanup Assembly Parsing / Lexing" Reverting for unexpected crashes on various platforms. This reverts commit r272953. llvm-svn: 272957	2016-06-16 21:19:23 +00:00
Nirav Dave	c19c3260df	Refactor and cleanup Assembly Parsing / Lexing Add explicit Comment Token in Assembly Lexing for future support for outputting explicit comments from inline assembly. As part of this, CPPHash Directives are now explicitly distinguished from Hash line comments in Lexer. Line comments are recorded as EndOfStatement tokens, not Comment tokens to simplify compatibility with current TargetParsers. This slightly complicates comment output. This remove all lexing tasks out of the parser, does minor cleanup to remove extraneous newlines Asm Output, and some improvements white space handling. Reviewers: rtrieu, dwmw2, rnk Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D20009 llvm-svn: 272953	2016-06-16 20:34:22 +00:00
Sanjoy Das	0ebc9616b4	NFC; refactor getFrameIndexReferenceFromSP Summary: ... into getFrameIndexReferencePreferSP. This change folds the fail-then-retry logic into getFrameIndexReferencePreferSP. There is a non-functional but behaviorial change in WinException -- earlier if `getFrameIndexReferenceFromSP` failed we'd trip an assert, but now we'll silently use the (wrong) offset from the base pointer. I could not write the assert I'd like to write ("FrameReg == StackRegister", like I've done in X86FrameLowering) since there is no easy way to get to the stack register from WinException (happy to be proven wrong here). One solution to this is to add a `bool OnlyStackPointer` parameter to `getFrameIndexReferenceFromSP` that asserts if it could not satisfy its promise of returning an offset from a stack pointer, but that seems overkill. Reviewers: rnk Subscribers: sanjoy, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D21427 llvm-svn: 272938	2016-06-16 18:54:06 +00:00
Sanjay Patel	0e9afea3c8	[x86] autoupgrade and remove AVX2 integer min/max intrinsics This will (hopefully very temporarily) break clang. The clang side of this should be the next commit. llvm-svn: 272932	2016-06-16 18:44:20 +00:00
Zachary Turner	01ee3dae04	Resubmit "[pdb] Change type visitor pattern to be dynamic." There was a regression introduced during type stream merging when visiting a field list record. This has been fixed in this patch. llvm-svn: 272929	2016-06-16 18:22:27 +00:00
Zachary Turner	73b0b2f555	Revert "[pdb] Change type visitor pattern to be dynamic." This reverts commit fb0dd311e1ad945827b8ffd5354f4810e2be1579. This breaks some llvm-readobj tests. llvm-svn: 272927	2016-06-16 18:09:04 +00:00
Zachary Turner	1f6372c429	[pdb] Change type visitor pattern to be dynamic. This allows better catching of compiler errors since we can use the override keyword to verify that methods are actually overridden. Also in this patch I've changed from storing a boolean Error code everywhere to returning an llvm::Error, to propagate richer error information up the call stack. Reviewed By: ruiu, rnk Differential Revision: http://reviews.llvm.org/D21410 llvm-svn: 272926	2016-06-16 18:00:28 +00:00
Davide Italiano	41315f7873	[PM] Revert the port of MergeLoadStoreMotion to the new pass manager. Daniel Berlin expressed some real concerns about the port and proposed and alternative approach. I'll revert this for now while working on a new patch, which I hope to put up for review shortly. Sorry for the churn. llvm-svn: 272925	2016-06-16 17:40:53 +00:00
Igor Laevsky	87f0d0e185	Revert r272891 "[JumpThreading] Prevent dangling pointer problems in BranchProbabilityInfo" It was causing failures in Profile-i386 and Profile-x86_64 tests. llvm-svn: 272912	2016-06-16 16:25:53 +00:00
Sanjay Patel	51ab757941	[x86] autoupgrade and remove SSE2/SSE41 integer min/max intrinsics Follow-up to: http://reviews.llvm.org/rL272806 http://reviews.llvm.org/rL272807 llvm-svn: 272907	2016-06-16 15:48:30 +00:00
Rui Ueyama	43ed08efa3	[codeview] Pass CVRecord to visitTypeBegin callback. Both parameters to visitTypeBegin are actually members of CVRecord, so we can just pass CVRecord instead of destructuring it. Differential Revision: http://reviews.llvm.org/D21435 llvm-svn: 272899	2016-06-16 14:47:23 +00:00
Rui Ueyama	b9095ae7ee	[codeview] Remove unused parameter. Differential Revision: http://reviews.llvm.org/D21433 llvm-svn: 272898	2016-06-16 14:41:22 +00:00
Rui Ueyama	5c7248c959	Implement pdb::hashBufferV8 hash function. llvm-svn: 272894	2016-06-16 13:48:16 +00:00
Igor Laevsky	c9179fd2c2	[JumpThreading] Prevent dangling pointer problems in BranchProbabilityInfo We should update results of the BranchProbabilityInfo after removing block in JumpThreading. Otherwise we will get dangling pointer inside BranchProbabilityInfo cache. Differential Revision: http://reviews.llvm.org/D20957 llvm-svn: 272891	2016-06-16 13:28:25 +00:00
Rui Ueyama	8b0ae136e2	[codeview] Use CVTypeVisitor instead of a hand-written switch-cases. Differential Revision: http://reviews.llvm.org/D21418 llvm-svn: 272888	2016-06-16 13:14:42 +00:00
Daniel Sanders	1d14864bb3	[llvm-objdump] Support detection of feature bits from the object and implement this for Mips. Summary: The Mips implementation only covers the feature bits described by the ELF e_flags so far. Mips stores additional feature bits such as MSA in the .MIPS.abiflags section. Also fixed a small bug this revealed where microMIPS wouldn't add the EF_MIPS_MICROMIPS flag when using -filetype=obj. Reviewers: echristo, rafael Subscribers: rafael, mehdi_amini, dsanders, sdardis, llvm-commits Differential Revision: http://reviews.llvm.org/D21125 llvm-svn: 272880	2016-06-16 09:17:03 +00:00
Adam Nemet	139ffba398	[LAA] Rename Strides to SymblicStrides in analyzeLoop. NFC This is to facilitate to move of SymblicStrides from LV to LAA. llvm-svn: 272879	2016-06-16 08:27:03 +00:00
Adam Nemet	bdbc5227ce	[LAA] Default getInfo to not speculate symbolic strides. NFC Soon we won't be passing Strides to getInfo and then we'll have fewer call sites to update. llvm-svn: 272878	2016-06-16 08:26:56 +00:00
Vassil Vassilev	e27a4f96fa	[modules] Combine Pass.h, PassSupport.h and PassAnalysisSupport.h into one module. The header files are designed to be used always together (through Pass.h). Addresses the first part of https://llvm.org/bugs/show_bug.cgi?id=27991 Patch by Cristina Cristescu and me. Reviewed by Richard Smith. llvm-svn: 272877	2016-06-16 08:00:29 +00:00
Eli Friedman	bd254a6f45	[InstCombine] Don't widen metadata on store-to-load forwarding The original check for load CSE or store-to-load forwarding is wrong when the forwarded stored value happened to be a load. Ref https://github.com/JuliaLang/julia/issues/16894 Differential Revision: http://reviews.llvm.org/D21271 Patch by Yichao Yu! llvm-svn: 272868	2016-06-16 02:33:42 +00:00
Xinliang David Li	1e16d61f1f	Address review feedbacks of AddDiscriminator change llvm-svn: 272850	2016-06-15 22:20:56 +00:00
Xinliang David Li	1eaecefaf9	[PM] Port Add discriminator pass to new PM llvm-svn: 272847	2016-06-15 21:51:30 +00:00
Rui Ueyama	5dbea9db10	[Codeview] Add a class for LF_UDT_MOD_SRC_LINE. Differential Revision: http://reviews.llvm.org/D21406 llvm-svn: 272843	2016-06-15 21:25:29 +00:00
Reid Kleckner	828c4f64e2	[codeview] Move deserialization methods out of line They aren't performance critical and don't need to be inline. llvm-svn: 272829	2016-06-15 20:30:34 +00:00
Matthias Braun	98ea88be42	Statistic: Add machine parseable json output - We lacked a short unique identifier for a statistics, so I renamed the current "Name" field that just contained the DEBUG_TYPE name of the current file to DebugType and added a new "Name" field that contains the C++ identifier of the statistic variable. - Add the -stats-json option which outputs statistics in json format. Differential Revision: http://reviews.llvm.org/D20995 llvm-svn: 272826	2016-06-15 20:19:16 +00:00
Reid Kleckner	a16fec18b0	[codeview] Use ArrayRef instead of a non-const vector reference llvm-svn: 272817	2016-06-15 18:48:35 +00:00
Amaury Sechet	6100adfeb5	Add support for string attributes in the C API. Summary: As per title. This completes the C API Attribute support. Reviewers: Wallbraker, whitequark, echristo, rafael, jyknight Subscribers: mehdi_amini Differential Revision: http://reviews.llvm.org/D21365 llvm-svn: 272811	2016-06-15 17:50:39 +00:00
Sanjay Patel	a6c6f09967	[x86, SSE] remove the GCCBuiltins from the integer min/max intrinsics This allows us to emit native IR in Clang (next commit). Also, update the intrinsic tests to show that codegen already knows how to handle the IR that Clang will soon produce. llvm-svn: 272806	2016-06-15 17:17:27 +00:00
Nirav Dave	194cb55f37	Revert "Preserve DebugInfo when replacing values in DAGCombiner" Reverting due to assertion failure in lib/CodeGen/SelectionDAG/InstrEmitter.cpp This reverts commit r272792. llvm-svn: 272799	2016-06-15 16:08:50 +00:00
Nirav Dave	a72e308403	Preserve DebugInfo when replacing values in DAGCombiner [DAG] Previously debug values would transfer debuginfo for the selected start node for a replacement which allows for debug to be dropped. Push debug value transfer to occur with node/value replacement in SelectionDAG, remove now extraneous transfers of debug values. This refixes PR9817 which was being incompletely checked in the testsuite. Reviewers: jyknight Subscribers: dblaikie, llvm-commits Differential Revision: http://reviews.llvm.org/D21037 llvm-svn: 272792	2016-06-15 14:50:08 +00:00
Ranjeet Singh	0db7be886e	Reverting r272778 because there's an assertion failure when running the test CodeGen/ARM/intrinsics-coprocessor.ll llvm-svn: 272791	2016-06-15 14:23:29 +00:00
Craig Topper	48b54c95ec	[AVX512] Remove the GCCBuiltins from the mask pcmpeq/pcmpgt intrinsics so we can emit native IR from clang. The intrinsics themselves can be removed in a future commit. llvm-svn: 272786	2016-06-15 14:06:28 +00:00
Ranjeet Singh	351364fe76	[ARM] Add support for mrrc/mrrc2 intrinsics. Differential Revision: http://reviews.llvm.org/D21178 llvm-svn: 272778	2016-06-15 11:32:24 +00:00

... 9 10 11 12 13 ...

28836 Commits