llvm-project

Commit Graph

Author	SHA1	Message	Date
Hal Finkel	e4eb78188c	Add ExpandFloatOp_FCOPYSIGN to handle ppcf128-related expansions We had previously been asserting when faced with a FCOPYSIGN f64, ppcf128 node because there was no way to expand the FCOPYSIGN node. Because ppcf128 is the sum of two doubles, and the first double must have the larger magnitude, we can take the sign from the first double. As a result, in addition to fixing the crash, this is also an optimization. llvm-svn: 188655	2013-08-19 06:55:37 +00:00
David Blaikie	715528be0b	DebugInfo: don't emit zero-length names for parameters We check this in many/all other cases, just missed this one it seems. Perhaps it'd be worth unifying this so we never emit zero-length DW_AT_names. llvm-svn: 188649	2013-08-19 03:34:03 +00:00
Jim Grosbach	06c2a68125	ARM: Fix more fast-isel verifier failures. Teach the generic instruction selection helper functions to constrain the register classes of their input operands. For non-physical register references, the generic code needs to be careful not to mess that up when replacing references to result registers. As the comment indicates for MachineRegisterInfo::replaceRegWith(), it's important to call constrainRegClass() first. rdar://12594152 llvm-svn: 188593	2013-08-16 23:37:31 +00:00
David Blaikie	d4e106e39d	DebugInfo: Allow the addition of other (such as static data) members to a record type after construction Plus a type cleanup & minor fix to enumerate members of declarations. llvm-svn: 188577	2013-08-16 20:42:14 +00:00
Richard Sandiford	0dec06a28c	[SystemZ] Use SRST to implement strlen and strnlen It would also make sense to use it for memchr; I'm working on that now. llvm-svn: 188547	2013-08-16 11:41:43 +00:00
Richard Sandiford	bb83a50f57	[SystemZ] Use MVST to implement strcpy and stpcpy llvm-svn: 188546	2013-08-16 11:29:37 +00:00
Richard Sandiford	ca23271010	[SystemZ] Use CLST to implement strcmp llvm-svn: 188544	2013-08-16 11:21:54 +00:00
Richard Sandiford	e3827751e2	[SystemZ] Fix handling of 64-bit memcmp results Generalize r188163 to cope with return types other than MVT::i32, just as the existing visitMemCmpCall code did. I've split this out into a subroutine so that it can be used for other upcoming patches. I also noticed that I'd used the wrong API to record the out chain. It's a load that uses DAG.getRoot() rather than getRoot(), so the out chain should go on PendingLoads. I don't have a testcase for that because we don't do any interesting scheduling on z yet. llvm-svn: 188540	2013-08-16 10:55:47 +00:00
Bill Wendling	33fae6935a	Make a few more things const. llvm-svn: 188484	2013-08-15 20:25:44 +00:00
Bill Wendling	2d092f05b4	Use a reference instead of making an unnecessary copy. Also use 'const'. llvm-svn: 188483	2013-08-15 20:21:49 +00:00
Craig Topper	d9c2783d8f	Replace getValueType().getSimpleVT() with getSimpleValueType(). llvm-svn: 188442	2013-08-15 02:44:19 +00:00
Mark Lacey	9d8103de7a	Auto-compute live intervals on demand. When new virtual registers are created during splitting/spilling, defer creation of the live interval until we need to use the live interval. Along with the recent commits to notify LiveRangeEdit when new virtual registers are created, this makes it possible for functions like TargetInstrInfo::loadRegFromStackSlot() and TargetInstrInfo::storeRegToStackSlot() to create multiple virtual registers as part of the process of generating loads/stores for different register classes, and then have the live intervals for those new registers computed when they are needed. llvm-svn: 188437	2013-08-14 23:50:16 +00:00
Mark Lacey	f367cd9239	Notify LiveRangeEdit of new virtual registers. Add a delegate class to MachineRegisterInfo with a single virtual function, MRI_NoteNewVirtualRegister(). Update LiveRangeEdit to inherit from this delegate class and override the definition of the callback with an implementation that tracks the newly created virtual registers. llvm-svn: 188435	2013-08-14 23:50:09 +00:00
Mark Lacey	f9ea88546f	Track new virtual registers by register number. Track new virtual registers by register number, rather than by the live interval created for them. This is the first step in separating the creation of new virtual registers and new live intervals. Eventually live intervals will be created and populated on demand after the virtual registers have been created and used in instructions. llvm-svn: 188434	2013-08-14 23:50:04 +00:00
David Blaikie	d0d6fcc923	DebugInfo: Prefer references over pointers, pass by const reference for a type that will grow in the future llvm-svn: 188422	2013-08-14 22:23:05 +00:00
Jakob Stoklund Olesen	4417c7b265	Remove unnecessary parameter to RenumberValues. Patch by Matthias Braun! llvm-svn: 188393	2013-08-14 17:28:52 +00:00
Jakob Stoklund Olesen	6d13b8fd85	Improve misleading comment. Patch by Matthias Braun! llvm-svn: 188391	2013-08-14 17:28:46 +00:00
Jakob Stoklund Olesen	874c412b6f	Remove declaration of nonexistant function. Patch by Matthias Braun! llvm-svn: 188390	2013-08-14 17:28:44 +00:00
Jakob Stoklund Olesen	21914ab441	LiveIntervalUnion is not used in RegAllocBase. Patch by Matthias Braun! llvm-svn: 188389	2013-08-14 17:28:42 +00:00
Jim Grosbach	327ccc787e	DAG: Combine (and (setne X, 0), (setne X, -1)) -> (setuge (add X, 1), 2) A common idiom is to use zero and all-ones as sentinal values and to check for both in a single conditional ("x != 0 && x != (unsigned)-1"). That generates code, for i32, like: testl %edi, %edi setne %al cmpl $-1, %edi setne %cl andb %al, %cl With this transform, we generate the simpler: incl %edi cmpl $1, %edi seta %al Similar improvements for other integer sizes and on other platforms. In general, combining the two setcc instructions into one is better. rdar://14689217 llvm-svn: 188315	2013-08-13 21:30:58 +00:00
Michael Gottesman	7a8017290a	Update makeLibCall to return both the call and the chain associated with the libcall instead of just the call. This allows us to specify libcalls that return void. LowerCallTo returns a pair with the return value of the call as the first element and the chain associated with the return value as the second element. If we lower a call that has a void return value, LowerCallTo returns an SDValue with a NULL SDNode and the chain for the call. Thus makeLibCall by just returning the first value makes it impossible for you to set up the chain so that the call is not eliminated as dead code. I also updated all references to makeLibCall to reflect the new return type. llvm-svn: 188300	2013-08-13 17:54:56 +00:00
Carlo Kok	bac096a614	Output DW_AT_stmt_list dwarf debug info as DW_FORM_sec_offset instead of DW_FORM_data4 as it is a section offset (fixes the coff/dwarf debug info statement locations) llvm-svn: 188297	2013-08-13 17:46:57 +00:00
Carlo Kok	fb849b0f21	For COFF only: dwarf debug info output a label reference as a section relative item only when it's one of dw_from strp, sec_offset, ref_addr or op_call_ref instead of going by size. llvm-svn: 188296	2013-08-13 17:45:53 +00:00
Evgeniy Stepanov	b59d82ac66	Pass DIEHash::collectAttributes output argument by-pointer instead of by-value. Before this, collectAttributes() was operating on a local object. llvm-svn: 188254	2013-08-13 07:57:01 +00:00
David Majnemer	3d96acb735	[-cxx-abi microsoft] Stick zero initialized symbols into the .bss section for COFF Summary: We need to do two things: - Initialize BSSSection in MCObjectFileInfo::InitCOFFMCObjectFileInfo - Teach TargetLoweringObjectFileCOFF::SelectSectionForGlobal what to do with it This fixes PR16861. Reviewers: rnk Reviewed By: rnk CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1361 llvm-svn: 188244	2013-08-13 01:23:53 +00:00
Eric Christopher	d29614f98d	Add the start of DIE hashing for DWARF4 type units and split dwarf CUs. Currently only hashes the name of CUs and the names of any children, but it's an obvious first step to show the framework. The testcase should continue to be correct, however, as it's an empty TU. llvm-svn: 188243	2013-08-13 01:21:55 +00:00
Eric Christopher	cede3db5ea	Reflow comment. llvm-svn: 188233	2013-08-12 23:59:24 +00:00
Eric Christopher	166294f37a	Remove empty constructor. llvm-svn: 188232	2013-08-12 23:59:18 +00:00
Michael Gottesman	3923bec37b	Fixed SelectionDAGBuilder.h C++ filetype declaration to use the canonical C++ instead of c++. llvm-svn: 188203	2013-08-12 21:02:02 +00:00
Michael Gottesman	f1d3b7c22e	Fixed another place in CodeGen where we had a typo in our editor C++ filetype declaration. llvm-svn: 188202	2013-08-12 20:52:06 +00:00
Michael Gottesman	1649a877e1	[branchfolding] Fix typo in C++ editor declaration. llvm-svn: 188201	2013-08-12 20:49:27 +00:00
Eric Christopher	60eb7696a9	Move the addition of the dwo_id as late as possible after everything has been finalized except for sizes and offsets. Update test accordingly. llvm-svn: 188199	2013-08-12 20:27:48 +00:00
Michael Gottesman	7dce16f69d	[stackprotector] Add in the stackprotector libcall. We support this libcall on all platforms except for OpenBSD (See lib/Codegen/StackProtector.cpp). llvm-svn: 188193	2013-08-12 18:45:38 +00:00
Richard Sandiford	564681c88d	[SystemZ] Use CLC and IPM to implement memcmp For now this is restricted to fixed-length comparisons with a length in the range [1, 256], as for memcpy() and MVC. llvm-svn: 188163	2013-08-12 10:28:10 +00:00
Tim Northover	707d68f082	Allow compatible extension attributes for tail calls If the tail-callee and caller give the same bits via the same signext/zeroext attribute then a tail-call should be allowed, since the extension has already been done by the callee. llvm-svn: 188159	2013-08-12 09:45:46 +00:00
Michael Gottesman	8afcf3a408	[stackprotector] Simplify SP Pass so that we emit different fail basic blocks for each fail condition. This patch decouples the stack protector pass so that we can support stack protector implementations that do not use the IR level generated stack protector fail basic block. No codesize increase is caused by this change since the MI level tail merge pass properly merges together the fail condition blocks (see the updated test). llvm-svn: 188105	2013-08-09 21:26:18 +00:00
Benjamin Kramer	df03449a0a	Make helper static and fix formatting. llvm-svn: 188074	2013-08-09 14:44:41 +00:00
Craig Topper	0ecb26a79e	Change asserts at the top of getVectorShuffle to check that LHS and RHS have the same type as the result. Previously the asserts were only checking that RHS and LHS were the same type and had the same element type as the result. All downstream code for ISD::VECTOR_SHUFFLE requires the types to be the same. Also removed one unnecessary check of matched element counts that was present in the code. llvm-svn: 188051	2013-08-09 04:37:24 +00:00
Hal Finkel	8ec43c6a0f	Set ISD::FROUND to Expand by default for all types For most libm ISD nodes, TargetLoweringBase::initActions sets the default scalar-type action to Expand, and leaves the vector-type action default as Legal. This is not appropriate for the new ISD::FROUND node (which no backend but PowerPC handles explicitly). Fixes PR16842. llvm-svn: 188048	2013-08-09 04:13:44 +00:00
Eric Christopher	ac886fe0f8	Update the CMake build files. llvm-svn: 188030	2013-08-08 23:51:31 +00:00
Eric Christopher	4573198b30	Move hash computation code into a separate class and file. No functional change intended. llvm-svn: 188028	2013-08-08 23:45:55 +00:00
Arnold Schwaighofer	c31c2de18b	Revert "Reapply r185872 now that the address sanitizer has been changed to support this." This reverts commit r187939. It broke an O0 build of a spec benchmark. llvm-svn: 188012	2013-08-08 21:04:16 +00:00
Eric Christopher	056b647d1f	For DW_TAG_template_type_parameter the actual passed in type could be void and therefore not have a type entry. Only add the type if it is non-void and provide a testcase. llvm-svn: 187966	2013-08-08 08:09:43 +00:00
Craig Topper	9a39b07a60	Remove AllUndef check from one of the loops in getVectorShuffle. It was already handled by the 'AllLHS && AllRHS' check after the previous loop. llvm-svn: 187965	2013-08-08 08:03:12 +00:00
Eric Christopher	49e17b2049	The conversion to bool is fine here, no need to check isType. llvm-svn: 187964	2013-08-08 07:40:42 +00:00
Eric Christopher	0df08e2ff9	Make sure that if we're going to attempt to add a type to a DIE that the type exists. Fix up cases where we weren't checking for optional types and add an assert to addType to make sure we catch this in the future. Fix up a testcase that was using the tag for DW_TAG_array_type when it meant DW_TAG_enumeration_type. llvm-svn: 187963	2013-08-08 07:40:37 +00:00
Eric Christopher	afb2c4114e	Change variable name and reflow formatting. llvm-svn: 187962	2013-08-08 07:40:31 +00:00
Craig Topper	309dfefb6f	Optimize mask generation for one of the DAG combiner shufflevector cases. llvm-svn: 187961	2013-08-08 07:38:55 +00:00
David Majnemer	f76d6b3712	Revert "coff also doesn't have a ReadOnlySection yet, (!)" This reverts commit r77814. We were sticking global constants in the .data section instead of in the .rdata section when emitting for COFF. This fixes PR16831. llvm-svn: 187956	2013-08-08 01:50:52 +00:00
Eric Christopher	d25f7fc4ae	Reflow for loop. llvm-svn: 187954	2013-08-08 01:41:05 +00:00
Eric Christopher	31b0576b01	Be more rigorous about the sizes of forms and attributes. llvm-svn: 187953	2013-08-08 01:41:00 +00:00
Bill Wendling	b80f9791e4	Reapply r185872 now that the address sanitizer has been changed to support this. Original commit message: Stop emitting weak symbols into the "coal" sections. The Mach-O linker has been able to support the weak-def bit on any symbol for quite a while now. The compiler however continued to place these symbols into a "coal" section, which required the linker to map them back to the base section name. Replace the sections like this: __TEXT/__textcoal_nt instead use __TEXT/__text __TEXT/__const_coal instead use __TEXT/__const __DATA/__datacoal_nt instead use __DATA/__data <rdar://problem/14265330> llvm-svn: 187939	2013-08-07 23:42:09 +00:00
Hal Finkel	171817ee8a	Add ISD::FROUND for libm round() All libm floating-point rounding functions, except for round(), had their own ISD nodes. Recent PowerPC cores have an instruction for round(), and so here I'm adding ISD::FROUND so that round() can be custom lowered as well. For the most part, this is straightforward. I've added an intrinsic and a matching ISD node just like those for nearbyint() and friends. The SelectionDAG pattern I've named frnd (because ISD::FP_ROUND has already claimed fround). This will be used by the PowerPC backend in a follow-up commit. llvm-svn: 187926	2013-08-07 22:49:12 +00:00
Eric Christopher	7af8baf678	Using the integrated assembler we'd fail to change section to the .tbss section for zerofill thread locals. Make sure we do this before emitting the zerofills. Fixes PR15972. llvm-svn: 187913	2013-08-07 21:13:06 +00:00
Andrew Trick	2f7667e018	Confusing comment typo. llvm-svn: 187895	2013-08-07 17:20:32 +00:00
Eric Christopher	341770d7ea	Remove some parens. No functional change. llvm-svn: 187872	2013-08-07 08:35:10 +00:00
Eric Christopher	8552e22b07	Add a way to grab a particular attribute out of a DIE. Use it when we're looking for a string in particular. Update comments as well. llvm-svn: 187844	2013-08-07 01:18:33 +00:00
Eric Christopher	af15f8dd5a	Move somewhat messy conditional out of line. No functional change. llvm-svn: 187843	2013-08-07 01:18:24 +00:00
Arnold Schwaighofer	a7cd6bf3bb	LoopVectorize: Allow vectorization of loops with lifetime markers Patch by Marc Jessome! llvm-svn: 187825	2013-08-06 22:37:52 +00:00
Tim Northover	a4415854db	Refactor isInTailCallPosition handling This change came about primarily because of two issues in the existing code. Niether of: define i64 @test1(i64 %val) { %in = trunc i64 %val to i32 tail call i32 @ret32(i32 returned %in) ret i64 %val } define i64 @test2(i64 %val) { tail call i32 @ret32(i32 returned undef) ret i32 42 } should be tail calls, and the function sameNoopInput is responsible. The main problem is that it is completely symmetric in the "tail call" and "ret" value, but in reality different things are allowed on each side. For these cases: 1. Any truncation should lead to a larger value being generated by "tail call" than needed by "ret". 2. Undef should only be allowed as a source for ret, not as a result of the call. Along the way I noticed that a mismatch between what this function treats as a valid truncation and what the backends see can lead to invalid calls as well (see x86-32 test case). This patch refactors the code so that instead of being based primarily on values which it recurses into when necessary, it starts by inspecting the type and considers each fundamental slot that the backend will see in turn. For example, given a pathological function that returned {{}, {{}, i32, {}}, i32} we would consider each "real" i32 in turn, and ask if it passes through unchanged. This is much closer to what the backend sees as a result of ComputeValueVTs. Aside from the bug fixes, this eliminates the recursion that's going on and, I believe, makes the bulk of the code significantly easier to understand. The trade-off is the nasty iterators needed to find the real types inside a returned value. llvm-svn: 187787	2013-08-06 09:12:35 +00:00
NAKAMURA Takumi	e359e85649	AsmPrinter/CMakeLists.txt: Add explicit dependency to intrinsics_gen here. llvm-svn: 187778	2013-08-06 05:56:39 +00:00
Eric Christopher	0062f2edc0	Recommit previous cleanup with a fix for c++98 ambiguity. llvm-svn: 187752	2013-08-05 22:32:28 +00:00
Tom Stellard	d42c594960	TargetLowering: Add getVectorIdxTy() function v2 This virtual function can be implemented by targets to specify the type to use for the index operand of INSERT_VECTOR_ELT, EXTRACT_VECTOR_ELT, INSERT_SUBVECTOR, EXTRACT_SUBVECTOR. The default implementation returns the result from TargetLowering::getPointerTy() The previous code was using TargetLowering::getPointerTy() for vector indices, because this is guaranteed to be legal on all targets. However, using TargetLowering::getPointerTy() can be a problem for targets with pointer sizes that differ across address spaces. On such targets, when vectors need to be loaded or stored to an address space other than the default 'zero' address space (which is the address space assumed by TargetLowering::getPointerTy()), having an index that is a different size than the pointer can lead to inefficient pointer calculations, (e.g. 64-bit adds for a 32-bit address space). There is no intended functionality change with this patch. llvm-svn: 187748	2013-08-05 22:22:01 +00:00
Eric Christopher	432c99af0b	Revert "Use existing builtin hashing functions to make this routine more" This reverts commit r187745. llvm-svn: 187747	2013-08-05 22:07:30 +00:00
Eric Christopher	d728355a1c	Use existing builtin hashing functions to make this routine more simple. llvm-svn: 187745	2013-08-05 22:00:50 +00:00
Eric Christopher	0369ad7053	Change parent hashing algorithm to be non-recursive and elaborate greatly on many comments in the code. llvm-svn: 187742	2013-08-05 21:40:57 +00:00
Benjamin Kramer	483b9fbddb	Don't leak passes if added outside of the area determined by Started/Stopped flags. llvm-svn: 187722	2013-08-05 11:11:11 +00:00
Carlo Kok	4382da983a	Bugfix for making the DWARF debug strings and labels to code emitted as secrel32 instead of long opcodes (only for coff). This makes them debuggable with GDB (with fix for 64bits msvc) llvm-svn: 187656	2013-08-02 16:14:15 +00:00
NAKAMURA Takumi	6fda3b4b86	Revert r187597, "Bugfix for making the DWARF debug strings and labels to code emitted as secrel32 instead of long opcodes (only for coff). This makes them debuggable with GDB." It broke x86_64-win32 builder in llvm/test/DebugInfo. llvm-svn: 187642	2013-08-02 03:46:05 +00:00
Bill Wendling	a5c536e1ee	Use function attributes to indicate that we don't want to realign the stack. Function attributes are the future! So just query whether we want to realign the stack directly from the function instead of through a random target options structure. llvm-svn: 187618	2013-08-01 21:42:05 +00:00
David Blaikie	a1ae0e6ecb	DebugInfo: Emit definitions for types with no members. The absence of members was a poor/incorrect proxy for "is definition". llvm-svn: 187607	2013-08-01 20:30:22 +00:00
Carlo Kok	afcc62024e	Bugfix for making the DWARF debug strings and labels to code emitted as secrel32 instead of long opcodes (only for coff). This makes them debuggable with GDB. fixes Bug 16249 - LLVM generates broken debug info on Windows llvm-svn: 187597	2013-08-01 18:38:14 +00:00
Eric Christopher	e6656ac870	Fix crashing on invalid inline asm with matching constraints. For a testcase like the following: typedef unsigned long uint64_t; typedef struct { uint64_t lo; uint64_t hi; } blob128_t; void add_128_to_128(const blob128_t in, blob128_t res) { asm ("PAND %1, %0" : "+Q"(res) : "Q"(in)); } where we'll fail to allocate the register for the output constraint, our matching input constraint will not find a register to match, and could try to search past the end of the current operands array. On the idea that we'd like to attempt to keep compilation going to find more errors in the module, change the error cases when we're visiting inline asm IR to return immediately and avoid trying to create a node in the DAG. This leaves us with only a single error message per inline asm instruction, but allows us to safely keep going in the general case. llvm-svn: 187470	2013-07-31 01:26:24 +00:00
Eric Christopher	029af15086	Reflow this to be easier to read. llvm-svn: 187459	2013-07-30 22:50:44 +00:00
Andrew Trick	c7934b3e37	Down-scale slot index distance to save bits. llvm-svn: 187438	2013-07-30 19:59:19 +00:00
Andrew Trick	9c17eab761	MI Sched: Track live-thru registers. When registers must be live throughout the scheduling region, increase the limit for the register class. Once we exceed the original limit, they will be spilled, and there's no point further reducing pressure. This isn't a perfect heuristics but avoids a situation where the scheduler could become trapped by trying to achieve the impossible. llvm-svn: 187436	2013-07-30 19:59:12 +00:00
Andrew Trick	d9761776bc	MI Sched fix: assert "Disconnected LRG within the scheduling region." llvm-svn: 187435	2013-07-30 19:59:08 +00:00
Quentin Colombet	6bf4baa408	[DAGCombiner] insert_vector_elt: Avoid building a vector twice. This patch prevents the following combine when the input vector is used more than once. insert_vector_elt (build_vector elt0, ..., eltN), NewEltIdx, idx => build_vector elt0, ..., NewEltIdx, ..., eltN The reasons are: - Building a vector may be expensive, so try to reuse the existing part of a vector instead of creating a new one (think big vectors). - elt0 to eltN now have two users instead of one. This may prevent some other optimizations. llvm-svn: 187396	2013-07-30 00:24:09 +00:00
Eric Christopher	e414ece79a	Fix a truly egregious thinko in anonymous namespace check, update testcase to make sure we generate debug info for walrus by adding a non-trivial constructor and verify that we don't emit an ODR signature for the type. llvm-svn: 187393	2013-07-29 23:53:08 +00:00
Eric Christopher	d853ea3142	Make sure we don't emit an ODR hash for types with no name and make sure the comments for each testcase are a bit easier to distinguish. llvm-svn: 187392	2013-07-29 23:53:05 +00:00
Eric Christopher	f8542ec305	Elaborate a bit on the type unit and ODR conditional code. llvm-svn: 187385	2013-07-29 22:24:32 +00:00
Nico Rieck	7fdaee8f15	Use proper section suffix for COFF weak symbols 32-bit symbols have "_" as global prefix, but when forming the name of COMDAT sections this prefix is ignored. The current behavior assumes that this prefix is always present which is not the case for 64-bit and names are truncated. llvm-svn: 187356	2013-07-29 13:58:39 +00:00
Benjamin Kramer	409afcf174	DwarfDebug: MD5 is always little endian, bswap on big endian platforms. This makes LLVM emit the same signature regardless of host and target endianess. llvm-svn: 187304	2013-07-27 14:14:43 +00:00
Chandler Carruth	2a1c0d2c03	Fix a memory leak in the debug emission by simply not allocating memory. There doesn't appear to be any reason to put this variable on the heap. I'm suspicious of the LexicalScope above that we stuff in a map and then delete afterward, but I'm just trying to get the valgrind bot clean. llvm-svn: 187301	2013-07-27 11:09:58 +00:00
Nick Lewycky	0b68245ec8	Reimplement isPotentiallyReachable to make nocapture deduction much stronger. Adds unit tests for it too. Split BasicBlockUtils into an analysis-half and a transforms-half, and put the analysis bits into a new Analysis/CFG.{h,cpp}. Promote isPotentiallyReachable into llvm::isPotentiallyReachable and move it into Analysis/CFG. llvm-svn: 187283	2013-07-27 01:24:00 +00:00
Tom Stellard	8b1e021e85	SimplifyCFG: Use parallel-and and parallel-or mode to consolidate branch conditions Merge consecutive if-regions if they contain identical statements. Both transformations reduce number of branches. The transformation is guarded by a target-hook, and is currently enabled only for +R600, but the correctness has been tested on X86 target using a variety of CPU benchmarks. Patch by: Mei Ye llvm-svn: 187278	2013-07-27 00:01:07 +00:00
Eric Christopher	219fb91499	Remove addLetterToHash, no functional change. llvm-svn: 187245	2013-07-26 21:07:18 +00:00
Eric Christopher	67646438c9	Add preliminary support for hashing DIEs and breaking them into type units. Initially this support is used in the computation of an ODR checker for C++. For now we're attaching it to the DIE, but in the future it will be attached to the type unit. This also starts breaking out types into the separation for type units, but without actually splitting the DIEs. In preparation for hashing the DIEs this adds a DIEString type that contains a StringRef with the string contained at the label. llvm-svn: 187213	2013-07-26 17:02:41 +00:00
Justin Holewinski	d3f2035a3c	Add a target legalize hook for SplitVectorOperand (again) CustomLowerNode was not being called during SplitVectorOperand, meaning custom legalization could not be used by targets. This also adds a test case for NVPTX that depends on this custom legalization. Differential Revision: http://llvm-reviews.chandlerc.com/D1195 Attempt to fix the buildbots by making the X86 test I just added platform independent llvm-svn: 187202	2013-07-26 13:28:29 +00:00
Rafael Espindola	1d812728cc	Revert "Add a target legalize hook for SplitVectorOperand" This reverts commit 187198. It broke the bots. The soft float test probably needs a -triple because of name differences. On the hard float test I am getting a "roundss $1, %xmm0, %xmm0", instead of "vroundss $1, %xmm0, %xmm0, %xmm0". llvm-svn: 187201	2013-07-26 13:18:16 +00:00
Justin Holewinski	f848a24e50	Add a target legalize hook for SplitVectorOperand CustomLowerNode was not being called during SplitVectorOperand, meaning custom legalization could not be used by targets. This also adds a test case for NVPTX that depends on this custom legalization. Differential Revision: http://llvm-reviews.chandlerc.com/D1195 llvm-svn: 187198	2013-07-26 12:46:39 +00:00
Andrew Trick	f4b1ee3492	RegAllocGreedy comment. llvm-svn: 187141	2013-07-25 18:35:22 +00:00
Andrew Trick	8bb0a251fd	Evict local live ranges if they can be reassigned. The previous change to local live range allocation also suppressed eviction of local ranges. In rare cases, this could result in more expensive register choices. This commit actually revives a feature that I added long ago: check if live ranges can be reassigned before eviction. But now it only happens in rare cases of evicting a local live range because another local live range wants a cheaper register. The benefit is improved code size for some benchmarks on x86 and armv7. I measured no significant compile time increase and performance changes are noise. llvm-svn: 187140	2013-07-25 18:35:19 +00:00
Andrew Trick	8485257d6d	Allocate local registers in order for optimal coloring. Also avoid locals evicting locals just because they want a cheaper register. Problem: MI Sched knows exactly how many registers we have and assumes they can be colored. In cases where we have large blocks, usually from unrolled loops, greedy coloring fails. This is a source of "regressions" from the MI Scheduler on x86. I noticed this issue on x86 where we have long chains of two-address defs in the same live range. It's easy to see this in matrix multiplication benchmarks like IRSmk and even the unit test misched-matmul.ll. A fundamental difference between the LLVM register allocator and conventional graph coloring is that in our model a live range can't discover its neighbors, it can only verify its neighbors. That's why we initially went for greedy coloring and added eviction to deal with the hard cases. However, for singly defined and two-address live ranges, we can optimally color without visiting neighbors simply by processing the live ranges in instruction order. Other beneficial side effects: It is much easier to understand and debug regalloc for large blocks when the live ranges are allocated in order. Yes, global allocation is still very confusing, but it's nice to be able to comprehend what happened locally. Heuristics could be added to bias register assignment based on instruction locality (think late register pairing, banks...). Intuituvely this will make some test cases that are on the threshold of register pressure more stable. llvm-svn: 187139	2013-07-25 18:35:14 +00:00
Adrian Prantl	e4daf52a63	typo. llvm-svn: 187135	2013-07-25 17:52:30 +00:00
Andrew Trick	401b6959ae	MI Sched: Register pressure heuristics. Consider which set is being increased or decreased before comparing. llvm-svn: 187110	2013-07-25 07:26:35 +00:00
Andrew Trick	27e5fea665	MI Sched: track register pressure by importance of the set, not weight of the units. llvm-svn: 187109	2013-07-25 07:26:32 +00:00
Andrew Trick	9706496b0d	Dump LIS before regalloc. MI sched changes them. llvm-svn: 187107	2013-07-25 07:26:26 +00:00
Bill Wendling	440e9d81bf	Replace the "NoFramePointerElimNonLeaf" target option with a function attribute. There's no need to specify a flag to omit frame pointer elimination on non-leaf nodes...(Honestly, I can't parse that option out.) Use the function attribute stuff instead. llvm-svn: 187093	2013-07-25 00:34:29 +00:00
Quentin Colombet	bdab227e53	Fix a bug in IfConverter with nested predicates. Prior to this patch, IfConverter may widen the cases where a sequence of instructions were executed because of the way it uses nested predicates. This result in incorrect execution. For instance, Let A be a basic block that flows conditionally into B and B be a predicated block. B can be predicated with A.BrToBPredicate into A iff B.Predicate is less "permissive" than A.BrToBPredicate, i.e., iff A.BrToBPredicate subsumes B.Predicate. The IfConverter was checking the opposite: B.Predicate subsumes A.BrToBPredicate. <rdar://problem/14379453> llvm-svn: 187071	2013-07-24 20:20:37 +00:00

1 2 3 4 5 ...

15343 Commits