llvm-project

Commit Graph

Author	SHA1	Message	Date
Justin Bogner	af20fae2fb	ErrorOr: Be more explicit in the implicit conversion to bool docs llvm-svn: 221295	2014-11-04 21:01:48 +00:00
Arnaud A. de Grandmaison	829dd81377	[PBQP] Tweak spill costs and coalescing benefits This patch improves how the different costs (register, interference, spill and coalescing) relate to each other. The assumption is now that: - coalescing (or any other "side effect" of reg alloc) is negative, and instead of being derived from a spill cost, they use the block frequency info. - spill costs are in the [MinSpillCost:+inf( range - register or interference costs are in [0.0:MinSpillCost( or +inf The current MinSpillCost is set to 10.0, which is a random value high enough that the current constraint builders do not need to worry about when setting costs. It would however be worth adding a normalization step for register and interference costs as the last step in the constraint builder chain to ensure they are not greater than SpillMinCost (unless this has some sense for some architectures). This would work well with the current builder pipeline, where all costs are tweaked relative to each other, but could grow above MinSpillCost if the pipeline is deep enough. The current heuristic is tuned to depend on the number of uses of a live interval rather than a density of uses, as used by the greedy allocator. This heuristic provides a few percent improvement on a number of benchmarks (eembc, spec, ...) and will definitely need to change once spill placement is implemented: the current spill placement is really inefficient, so making the cost proportional to the number of uses is a clear win. llvm-svn: 221292	2014-11-04 20:51:24 +00:00
Rafael Espindola	649f172c17	Remove unused DisableRedZone option. Patch by Steve King. llvm-svn: 221282	2014-11-04 18:18:52 +00:00
Rafael Espindola	c1f30877e0	Remove FindProgramByName. NFC. llvm-svn: 221258	2014-11-04 12:35:47 +00:00
Michael J. Spencer	65ffd92f07	[Support][Program] Add findProgramByName(Name, OptionalPaths) llvm-svn: 221220	2014-11-04 01:29:29 +00:00
Reid Kleckner	bfe73d7506	Use the LLVM_GNUC_PREREQ macro instead of the longhand version check llvm-svn: 221219	2014-11-04 01:28:33 +00:00
Reid Kleckner	6847d05f06	Remove the END_WITH_NULL macro now that Clang doesn't use it llvm-svn: 221218	2014-11-04 01:15:53 +00:00
Reid Kleckner	e1e1df83f9	Rename END_WITH_NULL to LLVM_END_WITH_NULL and move to Compiler.h We shouldn't put this kind of attribute stuff in DataTypes.h. Leave the END_WITH_NULL name for now so I can update clang without making build spam. llvm-svn: 221215	2014-11-04 01:12:21 +00:00
Sanjoy Das	e839965faa	The patchpoint lowering logic would crash with live constants equal to the tombstone or empty keys of a DenseMap<int64_t, T>. This patch fixes the issue (and adds a test case). llvm-svn: 221214	2014-11-04 00:59:21 +00:00
Kevin Enderby	9907d0a3c2	Add the code and test cases for 32-bit Intel to llvm-objdump’s Mach-O symbolizer. llvm-svn: 221211	2014-11-04 00:43:16 +00:00
Hal Finkel	840257a49c	Use AA in LoadCombine LoadCombine can be smarter about aborting when a writing instruction is encountered, instead of aborting upon encountering any writing instruction, use an AliasSetTracker, and only abort when encountering some write that might alias with the loads that could potentially be combined. This was originally motivated by comments made (and a test case provided) by David Majnemer in response to PR21448. It turned out that LoadCombine was not responsible for that PR, but LoadCombine should also be improved so that unrelated stores (and @llvm.assume) don't interrupt load combining. llvm-svn: 221203	2014-11-03 23:19:16 +00:00
Reid Kleckner	2170429dce	Relax the LLVM_NOEXCEPT _MSC_VER version check back to 1900 Unconditional noexcept support was added in the VS 2013 Nov CTP. Given that there have been three CTPs since then, I don't think we need careful macro magic to target that specific tech preview. Instead, target the major release version number of 1900, which corresponds to the as-yet unreleased VS "14". llvm-svn: 221169	2014-11-03 18:22:42 +00:00
Duncan P. N. Exon Smith	3d5a02f677	IR: MDNode => Value: Instruction::getAllMetadataOtherThanDebugLoc() Change `Instruction::getAllMetadataOtherThanDebugLoc()` from a vector of `MDNode` to one of `Value`. Part of PR21433. llvm-svn: 221167	2014-11-03 18:13:57 +00:00
Oliver Stannard	cf6bfb1dd0	Revert r221150, as it broke sanitizer tests llvm-svn: 221151	2014-11-03 12:19:03 +00:00
Oliver Stannard	652ec6ee89	Emit .eh_frame with relocations to functions, rather than sections When LLVM emits DWARF call frame information, it currently creates a local, section-relative symbol in the code section, which is pointed to by a relocation on the .eh_frame section. However, for C++ we emit some functions in section groups, and the SysV ABI has some rules to make it easier to remove these sections (http://www.sco.com/developers/gabi/latest/ch4.sheader.html#section_group_rules): A symbol table entry with STB_LOCAL binding that is defined relative to one of a group's sections, and that is contained in a symbol table section that is not part of the group, must be discarded if the group members are discarded. References to this symbol table entry from outside the group are not allowed. This means that we need to use the function symbol for the relocation, not a temporary symbol. There was a comment in the code claiming that the local symbol was used to avoid creating a relocation, but a relocation must be created anyway as the code and CFI are in different sections. llvm-svn: 221150	2014-11-03 12:02:51 +00:00
Charlie Turner	164acd5e2d	Test commit. Fixes two typos. Change-Id: I129f647de8933e1d8f0dc9941bcb91602edce7e2 llvm-svn: 221148	2014-11-03 10:58:05 +00:00
Diego Novillo	fcd556074c	Use ErrorOr for the ::create factory on instrumented and sample profilers. Summary: As discussed in http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20141027/242445.html, the creation of reader and writer instances is better done using ErrorOr. There are no functional changes, but several callers needed to be adjusted. Reviewers: bogner Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6076 llvm-svn: 221120	2014-11-03 00:51:45 +00:00
Matt Arsenault	eb49216bba	Support REG_SEQUENCE in tablegen. The problem is mostly that variadic output instructions aren't handled, so it is rejected for having an inconsistent number of operands, and then the right number of operands isn't emitted. llvm-svn: 221117	2014-11-02 23:46:51 +00:00
David Majnemer	634ca236dc	InstCombine: Don't assume that m_ZExt matches an Instruction m_ZExt might bind against a ConstantExpr instead of an Instruction. Assuming this, using cast<Instruction>, results in InstCombine crashing. Instead, introduce ZExtOperator to bridge both Instruction and ConstantExpr ZExts. This fixes PR21445. llvm-svn: 221069	2014-11-01 23:46:05 +00:00
Daniel Sanders	8104b75c9f	Renamed CCState members that appear to misspell 'Processed' as 'Proceed'. NFC. Reviewers: rnk Reviewed By: rnk Subscribers: rnk, llvm-commits Differential Revision: http://reviews.llvm.org/D5978 llvm-svn: 221061	2014-11-01 19:32:23 +00:00
Daniel Sanders	ca80f1a05a	[tablegen] Add CustomCallingConv and use it to tablegen-erate the outermost parts of the Mips O32 implementation Summary: CustomCallingConv is simply a CallingConv that tablegen should not generate the implementation for. It allows regular CallingConv's to delegate to these custom functions. This is (currently) necessary for Mips and we cannot use CCCustom without having to adapt to the different API that CCCustom uses. This brings us a bit closer to being able to remove MipsCC::analyzeCallOperands and MipsCC::analyzeFormalArguments in favour of the common implementation. No functional change to the targets. Depends on D3341 Reviewers: vmedic Reviewed By: vmedic Subscribers: vmedic, llvm-commits Differential Revision: http://reviews.llvm.org/D5965 llvm-svn: 221052	2014-11-01 17:38:22 +00:00
Diego Novillo	d5336ae269	Add show and merge tools for sample PGO profiles. Summary: This patch extends the 'show' and 'merge' commands in llvm-profdata to handle sample PGO formats. Using the 'merge' command it is now possible to convert one sample PGO format to another. The only format that is currently not working is 'gcc'. I still need to implement support for it in lib/ProfileData. The changes in the sample profile support classes are needed for the merge operation. Reviewers: bogner Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6065 llvm-svn: 221032	2014-11-01 00:56:55 +00:00
Duncan P. N. Exon Smith	4abd1a0808	IR: MDNode => Value: Instruction::getAllMetadata() Change `Instruction::getAllMetadata()` to modify a vector of `Value` instead of `MDNode` and update call sites. This is part of PR21433. llvm-svn: 221027	2014-11-01 00:26:42 +00:00
Duncan P. N. Exon Smith	3872d0084c	IR: MDNode => Value: Instruction::getMetadata() Change `Instruction::getMetadata()` to return `Value` as part of PR21433. Update most callers to use `Instruction::getMDNode()`, which wraps the result in a `cast_or_null<MDNode>`. llvm-svn: 221024	2014-11-01 00:10:31 +00:00
Duncan P. N. Exon Smith	7c4fc4e5ae	IR: MDNode => Value: Add Instruction::getMDNode() Add `Instruction::getMDNode()` that casts to `MDNode` before changing `Instruction::getMetadata()` to return `Value`. This avoids adding `cast_or_null<MDNode>` boiler-plate throughout the code. Part of PR21433. llvm-svn: 221023	2014-10-31 23:58:04 +00:00
Reid Kleckner	da00cf5f73	Work around bugs in MSVC "14" CTP 3's conversion logic It appears to ignore or find ambiguous MachineInstrBuilder's conversion operators that allow conversion to MachineInstr* and MachineBasicBlock::bundle_iterator. As a workaround, add an explicit way to get the MachineInstr. llvm-svn: 221017	2014-10-31 23:19:46 +00:00
Reid Kleckner	81bef5a1a2	Define LLVM_NOEXCEPT with MSVC 14 CTP 3 or newer We have to use _MSC_FULL_VER here as CTP 2 and earlier didn't define noexcept to my knowledge. Fixes build error in lib/Support/Error.cpp when inheriting from std::error_category, which has a noexcept virtual method. llvm-svn: 221013	2014-10-31 23:02:40 +00:00
Lang Hames	f04de6ec48	[Object] Modify OwningBinary's interface to separate inspection from ownership. The getBinary and getBuffer method now return ordinary pointers of appropriate const-ness. Ownership is transferred by calling takeBinary(), which returns a pair of the Binary and a MemoryBuffer. llvm-svn: 221003	2014-10-31 21:37:49 +00:00
Duncan P. N. Exon Smith	e5d641ebca	IR: MDNode => Value: Instruction::setMetadata() Change `Instruction::setMetadata()` API to accept `Value` instead of `MDNode`. Part of PR21433. llvm-svn: 220994	2014-10-31 20:13:11 +00:00
Bill Schmidt	1ca69fa64d	[PowerPC] Initial VSX intrinsic support, with min/max for vector double Now that we have initial support for VSX, we can begin adding intrinsics for programmer access to VSX instructions. This patch adds basic support for VSX intrinsics in general, and tests it by implementing intrinsics for minimum and maximum for the vector double data type. The LLVM portion of this is quite straightforward. There is a companion patch for Clang. llvm-svn: 220988	2014-10-31 19:19:07 +00:00
Quentin Colombet	c32615dfef	[CodeGenPrepare] Move extractelement close to store if they can be combined. This patch adds an optimization in CodeGenPrepare to move an extractelement right before a store when the target can combine them. The optimization may promote any scalar operations to vector operations along the way to make that possible. Context Some targets use different register files for both vector and scalar operations. This means that transitioning from one domain to another may incur a copy from one register file to another. These copies are not coalescable and may be expensive. For example, according to the scheduling model, on cortex-A8 a vector to GPR move is 20 cycles. Motivating Example Let us consider an example: define void @foo(<2 x i32>* %addr1, i32* %dest) { %in1 = load <2 x i32>* %addr1, align 8 %extract = extractelement <2 x i32> %in1, i32 1 %out = or i32 %extract, 1 store i32 %out, i32* %dest, align 4 ret void } As it is, this IR generates the following assembly on armv7: vldr d16, [r0] @vector load vmov.32 r0, d16[1] @ cross-register-file copy: 20 cycles orr r0, r0, #1 @ scalar bitwise or str r0, [r1] @ scalar store bx lr Whereas we could generate much faster code: vldr d16, [r0] @ vector load vorr.i32 d16, #0x1 @ vector bitwise or vst1.32 {d16[1]}, [r1:32] @ vector extract + store bx lr Half of the computation made in the vector is useless, but this allows us to get rid of the expensive cross-register-file copy. Proposed Solution To avoid this cross-register-copy penalty, we promote the scalar operations to vector operations. The penalty will be removed if we manage to promote the whole chain of computation in the vector domain. Currently, we do that only when the chain of computation ends with a store and the target is able to combine an extract with a store. Stores are the most likely candidates, because other instructions produce values that would need to be promoted and so, extracted at some point[1]. Moreover, it is customary that targets feature stores that perform a vector extract (see AArch64 and X86 for instance). The proposed implementation relies on the TargetTransformInfo to decide whether or not it is beneficial to promote a chain of computation in the vector domain. Unfortunately, this interface is rather inaccurate for this level of details and although this optimization may be beneficial for X86 and AArch64, the inaccuracy will lead to the optimization being too aggressive. Basically in TargetTransformInfo, everything that is legal has a cost of 1, whereas, even if a vector type is legal, usually a vector operation is slightly more expensive than its scalar counterpart. That will lead to too many promotions that may not be counter balanced by the saving of the cross-register-file copy. For instance, on AArch64 this penalty is just 4 cycles. For now, the optimization is just enabled for ARM prior to v8, since those processors have a larger penalty on cross-register-file copies, and the scope is limited to basic blocks. Because of these two factors, we limit the effects of the inaccuracy. Indeed, I did not want to build up a fancy cost model with block frequency and everything on top of that. [1] We can imagine targets that can combine an extractelement with other instructions than just stores. If we want to go into that direction, the current interfaces must be augmented and, moreover, I think this becomes a global isel problem. Differential Revision: http://reviews.llvm.org/D5921 <rdar://problem/14170854> llvm-svn: 220978	2014-10-31 17:52:53 +00:00
David Majnemer	c7d7c6fb3a	Object, COFF: Cleanup symbol type code, improve binutils compatibility Do a better job classifying symbols. This increases the consistency between the COFF handling code and the ELF side of things. llvm-svn: 220952	2014-10-31 05:07:00 +00:00
NAKAMURA Takumi	c158884726	Threading.h: Give named parameters to llvm::call_once(flag,UserFn). [-Wdocumentation] llvm-svn: 220941	2014-10-31 00:54:20 +00:00
Chris Bieneman	14e2bcccfb	Removing the static initializer in ManagedStatic.cpp by using llvm_call_once to initialize the ManagedStatic mutex. Summary: This patch adds an llvm_call_once which is a wrapper around std::call_once on platforms where it is available and devoid of bugs. The patch also migrates the ManagedStatic mutex to be allocated using llvm_call_once. These changes are philosophically equivalent to the changes added in r219638, which were reverted due to a hang on Win32 which was the result of a bug in the Windows implementation of std::call_once. Reviewers: aaron.ballman, chapuni, chandlerc, rnk Reviewed By: rnk Subscribers: majnemer, llvm-commits Differential Revision: http://reviews.llvm.org/D5922 llvm-svn: 220932	2014-10-30 22:07:09 +00:00
Diego Novillo	047d5d1a0a	Fix comment spelling and tidy diagnostic call in profile reader. No functional changes. llvm-svn: 220922	2014-10-30 20:19:19 +00:00
Diego Novillo	c572e92c76	Add profile writing capabilities for sampling profiles. Summary: This patch finishes up support for handling sampling profiles in both text and binary formats. The new binary format uses uleb128 encoding to represent numeric values. This makes profiles files about 25% smaller. The profile writer class can write profiles in the existing text and the new binary format. In subsequent patches, I will add the capability to read (and perhaps write) profiles in the gcov format used by GCC. Additionally, I will be adding support in llvm-profdata to manipulate sampling profiles. There was a bit of refactoring needed to separate some code that was in the reader files, but is actually common to both the reader and writer. The new test checks that reading the same profile encoded as text or raw, produces the same results. Reviewers: bogner, dexonsmith Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6000 llvm-svn: 220915	2014-10-30 18:00:06 +00:00
Simon Atanasyan	6e3949a35c	[Mips] Add new Mips specific e_flags. No functional changes. llvm-svn: 220910	2014-10-30 14:56:02 +00:00
Jan Wen Voung	ce2164f45c	Fix getRelocationValueString to return the symbol name for EM_386. Summary: This helps llvm-objdump -r to print out the symbol name along with the relocation type on x86. Adjust existing tests from checking for "Unknown" to check for the symbol now. Test Plan: Adjusted test/Object tests. Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5987 llvm-svn: 220866	2014-10-29 18:37:13 +00:00
Peter Zotov	2481c75f8b	[C API] PR19859: Add functions to query and modify branches. Patch by Gabriel Radanne <drupyog@zoho.com>. llvm-svn: 220817	2014-10-28 19:46:56 +00:00
Peter Zotov	1d98e6ddef	[C API] PR19859: Add LLVMGetFCmpPredicate and LLVMConstRealGetDouble. Patch by Gabriel Radanne <drupyog@zoho.com>. llvm-svn: 220814	2014-10-28 19:46:44 +00:00
Reid Kleckner	9ccce99e1d	X86: Implement the vectorcall calling convention This is a Microsoft calling convention that supports both x86 and x86_64 subtargets. It passes vector and floating point arguments in XMM0-XMM5, and passes them indirectly once they are consumed. Homogenous vector aggregates of up to four elements can be passed in sequential vector registers, but this part is not implemented in LLVM and will be handled in Clang. On 32-bit x86, it is similar to fastcall in that it uses ecx:edx as integer register parameters and is callee cleanup. On x86_64, it delegates to the normal win64 calling convention. Reviewers: majnemer Differential Revision: http://reviews.llvm.org/D5943 llvm-svn: 220745	2014-10-28 01:29:26 +00:00
Rafael Espindola	9f8eff31db	Remove the PreserveSource linker mode. I noticed that it was untested, and forcing it on caused some tests to fail: LLVM :: Linker/metadata-a.ll LLVM :: Linker/prefixdata.ll LLVM :: Linker/type-unique-odr-a.ll LLVM :: Linker/type-unique-simple-a.ll LLVM :: Linker/type-unique-simple2-a.ll LLVM :: Linker/type-unique-simple2.ll LLVM :: Linker/type-unique-type-array-a.ll LLVM :: Linker/unnamed-addr1-a.ll LLVM :: Linker/visibility1.ll If it is to be resurrected, it has to be fixed and we should probably have a -preserve-source command line option in llvm-mc and run tests with and without it. llvm-svn: 220741	2014-10-28 00:24:16 +00:00
Rafael Espindola	4160f5d3ac	Make it easier to pass a custom diagnostic handler to the IR linker. llvm-svn: 220732	2014-10-27 23:02:10 +00:00
Jingyue Wu	53c1612ed1	[ScalarEvolution] Guard dump() with #if to be consistent with its definition in ScalarEvolution.cpp llvm-svn: 220721	2014-10-27 21:14:41 +00:00
Lang Hames	69fa70efb3	[PBQP] Remove a spurious 'typename' keyword. This was causing an error on MSVC. llvm-svn: 220690	2014-10-27 17:59:51 +00:00
Lang Hames	bfd1c3cfa5	[PBQP] Clarify ambiguous-looking typedef. This was causing an error on the hexagon bots. llvm-svn: 220689	2014-10-27 17:52:05 +00:00
Lang Hames	5fe30ca56f	[PBQP] Unique allowed-sets for nodes in the PBQP graph and use pairs of these sets as keys into a cache of interference matrix values in the Interference constraint adder. Creating interference matrices was one of the large remaining time-sinks in PBQP. Caching them reduces the total compile time (when using PBQP) on the nightly test suite by ~10%. llvm-svn: 220688	2014-10-27 17:44:25 +00:00
Michael Gottesman	d71825c3cb	Add MapVector::rbegin(), MapVector::rend() to complement MapVector::begin(), MapVector::end(). These just delegate to the underlying vector type in the MapVector. Also just add in some sanity unittests. llvm-svn: 220687	2014-10-27 17:20:53 +00:00
NAKAMURA Takumi	729be14435	Prune CRLF. llvm-svn: 220678	2014-10-27 12:37:26 +00:00
Lang Hames	5af35a9ee1	[PBQP] Tidying up as per Dave Blaikie's suggestions for r220642. llvm-svn: 220655	2014-10-26 22:12:02 +00:00
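
Commit c572e92c76 above notes that the new binary sample-profile format uses uleb128 encoding for numeric values, which makes profile files about 25% smaller. As a rough illustration of why a variable-length encoding saves space for small counters, here is a minimal sketch of unsigned LEB128 in Python. The function names `encode_uleb128` and `decode_uleb128` are hypothetical and chosen for this example; this is not LLVM's implementation and says nothing about the rest of the profile file layout.

```python
from typing import Tuple


def encode_uleb128(value: int) -> bytes:
    """Encode a non-negative integer as unsigned LEB128 (illustrative sketch)."""
    if value < 0:
        raise ValueError("ULEB128 encodes unsigned integers only")
    out = bytearray()
    while True:
        byte = value & 0x7F          # take the low 7 bits
        value >>= 7
        if value:
            out.append(byte | 0x80)  # high bit set: more bytes follow
        else:
            out.append(byte)         # high bit clear: last byte
            return bytes(out)


def decode_uleb128(data: bytes, offset: int = 0) -> Tuple[int, int]:
    """Decode one ULEB128 value; return (value, offset just past it)."""
    result = 0
    shift = 0
    while True:
        if offset >= len(data):
            raise ValueError("truncated ULEB128 value")
        byte = data[offset]
        offset += 1
        result |= (byte & 0x7F) << shift
        if not (byte & 0x80):
            return result, offset
        shift += 7
```

Values below 128 take a single byte here, whereas a fixed-width 64-bit field always takes eight, which is the general reason such an encoding tends to shrink files dominated by small numbers.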


21705 Commits