llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	28caf2717a	don't depend on ADL. llvm-svn: 44351	2007-11-27 06:14:12 +00:00
Chris Lattner	f81d5886c6	Several changes: 1) Change the interface to TargetLowering::ExpandOperationResult to take and return entire NODES that need a result expanded, not just the value. This allows us to handle things like READCYCLECOUNTER, which returns two values. 2) Implement (extremely limited) support in LegalizeDAG::ExpandOp for MERGE_VALUES. 3) Reimplement custom lowering in LegalizeDAGTypes in terms of the new ExpandOperationResult. This makes the result simpler and fully general. 4) Implement (fully general) expand support for MERGE_VALUES in LegalizeDAGTypes. 5) Implement ExpandOperationResult support for ARM f64->i64 bitconvert and ARM i64 shifts, allowing them to work with LegalizeDAGTypes. 6) Implement ExpandOperationResult support for X86 READCYCLECOUNTER and FP_TO_SINT, allowing them to work with LegalizeDAGTypes. LegalizeDAGTypes now passes several more X86 codegen tests when enabled and when type legalization in LegalizeDAG is ifdef'd out. llvm-svn: 44300	2007-11-24 07:07:01 +00:00
Anton Korobeynikov	66b91e66ec	Implement necessary bits for flt_rounds gcc builtin. Codegen bits and llvm-gcc support will follow. llvm-svn: 44182	2007-11-15 23:25:33 +00:00
Duncan Sands	d4494352f8	This assertion was bogus. llvm-svn: 44167	2007-11-15 09:54:37 +00:00
Dale Johannesen	4646aa3e33	Make labels work in asm blocks; allow labels as parameters. Rename ValueRefList to ParamList in AsmParser, since its only use is for parameters. llvm-svn: 43734	2007-11-05 21:20:28 +00:00
Dan Gohman	d7917b6248	Add std:: to sort calls. llvm-svn: 43652	2007-11-02 22:24:01 +00:00
Dan Gohman	c981d72d1a	Change illegal uses of ++ to uses of STLExtra.h's next function. llvm-svn: 43651	2007-11-02 22:22:02 +00:00
Duncan Sands	44b8721de8	Executive summary: getTypeSize -> getTypeStoreSize / getABITypeSize. The meaning of getTypeSize was not clear - clarifying it is important now that we have x86 long double and arbitrary precision integers. The issue with long double is that it requires 80 bits, and this is not a multiple of its alignment. This gives a primitive type for which getTypeSize differed from getABITypeSize. For arbitrary precision integers it is even worse: there is the minimum number of bits needed to hold the type (eg: 36 for an i36), the maximum number of bits that will be overwritten when storing the type (40 bits for i36) and the ABI size (i.e. the storage size rounded up to a multiple of the alignment; 64 bits for i36). This patch removes getTypeSize (not really - it is still there but deprecated to allow for a gradual transition). Instead there is: (1) getTypeSizeInBits - a number of bits that suffices to hold all values of the type. For a primitive type, this is the minimum number of bits. For an i36 this is 36 bits. For x86 long double it is 80. This corresponds to gcc's TYPE_PRECISION. (2) getTypeStoreSizeInBits - the maximum number of bits that is written when storing the type (or read when reading it). For an i36 this is 40 bits, for an x86 long double it is 80 bits. This is the size alias analysis is interested in (getTypeStoreSize returns the number of bytes). There doesn't seem to be anything corresponding to this in gcc. (3) getABITypeSizeInBits - this is getTypeStoreSizeInBits rounded up to a multiple of the alignment. For an i36 this is 64, for an x86 long double this is 96 or 128 depending on the OS. This is the spacing between consecutive elements when you form an array out of this type (getABITypeSize returns the number of bytes). This is TYPE_SIZE in gcc. Since successive elements in a SequentialType (arrays, pointers and vectors) need to be aligned, the spacing between them will be given by getABITypeSize.
This means that the size of an array is the length times the getABITypeSize. It also means that GEP computations need to use getABITypeSize when computing offsets. Furthermore, if an alloca allocates several elements at once then these too need to be aligned, so the size of the alloca has to be the number of elements multiplied by getABITypeSize. Logically speaking this doesn't have to be the case when allocating just one element, but it is simpler to also use getABITypeSize in this case. So allocas and mallocs should use getABITypeSize. Finally, since gcc's only notion of size is that given by getABITypeSize, if you want to output assembler etc the same as gcc then getABITypeSize is the size you want. Since a store will overwrite no more than getTypeStoreSize bytes, and a read will read no more than that many bytes, this is the notion of size appropriate for alias analysis calculations. In this patch I have corrected all type size uses except some of those in ScalarReplAggregates, lib/Codegen, lib/Target (the hard cases). I will get around to auditing these too at some point, but I could do with some help. Finally, I made one change which I think wise but others might consider pointless and suboptimal: in an unpacked struct the amount of space allocated for a field is now given by the ABI size rather than getTypeStoreSize. I did this because every other place that reserves memory for a type (eg: alloca) now uses getABITypeSize, and I didn't want to make an exception for unpacked structs, i.e. I did it to make things more uniform. This only affects structs containing long doubles and arbitrary precision integers. If someone wants to pack these types more tightly they can always use a packed struct. llvm-svn: 43620	2007-11-01 20:53:16 +00:00
Bill Wendling	6d15b32c15	- Remove the hacky code that forces a memcpy. Alignment is taken care of in the FE. - Explicitly pass in the alignment of the load & store. - XFAIL 2007-10-23-UnalignedMemcpy.ll because llc has a bug that crashes on unaligned pointers. llvm-svn: 43398	2007-10-26 20:24:42 +00:00
Bill Wendling	38ccabcae9	Fix comment and use the "Size" variable that's already provided. llvm-svn: 43271	2007-10-23 23:36:57 +00:00
Bill Wendling	e3b859298a	If there's an unaligned memcpy to/from the stack, don't lower it. Just call the memcpy library function instead. llvm-svn: 43270	2007-10-23 23:32:40 +00:00
Bill Wendling	6f149c0571	This broke lots. Reverting. llvm-svn: 43264	2007-10-23 22:04:26 +00:00
Bill Wendling	8971440e56	Lowering a memcpy to the stack is killing PPC. The ARM and X86 backends already have their own custom memcpy lowering code. This code needs to be factored out into a target-independent lowering method with hooks to the backend. In the meantime, just call memcpy if we're trying to copy onto a stack. llvm-svn: 43262	2007-10-23 21:30:25 +00:00
Chris Lattner	3ea519e56d	rename ExpandOperation to ExpandOperationResult, as suggested by Duncan llvm-svn: 43177	2007-10-19 15:28:47 +00:00
Rafael Espindola	846c19dd70	Add support for byval function whose argument is not 32 bit aligned. To do this it is necessary to add a "always inline" argument to the memcpy node. For completeness I have also added this node to memmove and memset. I have also added getMem* functions, because the extra argument makes it cumbersome to use getNode and because I get confused by it :-) llvm-svn: 43172	2007-10-19 10:41:11 +00:00
Chris Lattner	579db81f1c	add a new target hook. llvm-svn: 43165	2007-10-19 03:31:45 +00:00
Chris Lattner	3cfb56d489	One mundane change: Change ReplaceAllUsesOfValueWith to optionally take a deleted nodes vector, instead of requiring it. One more significant change: Implement the start of a legalizer that just works on types. This legalizer is designed to run before the operation legalizer and ensure just that the input dag is transformed into an output dag whose operand and result types are all legal, even if the operations on those types are not. This design/impl has the following advantages: 1. When finished, this will significantly reduce the amount of code in LegalizeDAG.cpp. It will remove all the code related to promotion and expansion as well as splitting and scalarizing vectors. 2. The new code is very simple, idiomatic, and modular: unlike LegalizeDAG.cpp, it has no 3000 line long functions. :) 3. The implementation is completely iterative instead of recursive, good for hacking on large dags without blowing out your stack. 4. The implementation updates nodes in place when possible instead of deallocating and reallocating the entire graph that points to some mutated node. 5. The code nicely separates out handling of operations with invalid results from operations with invalid operands, making some cases simpler and easier to understand. 6. The new -debug-only=legalize-types option is very very handy :), allowing you to easily understand what legalize types is doing. This is not yet done. Until the ifdef added to SelectionDAGISel.cpp is enabled, this does nothing. However, this code is sufficient to legalize all of the code in 186.crafty, olden and freebench on an x86 machine. The biggest issues are: 1. Vectors aren't implemented at all yet 2. SoftFP is a mess, I need to talk to Evan about it. 3. No lowering to libcalls is implemented yet. 4. Various operations are missing etc. 5. There are FIXME's for stuff I hax0r'd out, like softfp. Hey, at least it is a step in the right direction :). 
If you'd like to help, just enable the #ifdef in SelectionDAGISel.cpp and compile code with it. If this explodes it will tell you what needs to be implemented. Help is certainly appreciated. Once this goes in, we can do three things: 1. Add a new pass of dag combine between the "type legalizer" and "operation legalizer" passes. This will let us catch some long-standing isel issues that we miss because operation legalization often obfuscates the dag with target-specific nodes. 2. We can rip out all of the type legalization code from LegalizeDAG.cpp, making it much smaller and simpler. When that happens we can then reimplement the core functionality left in it in a much more efficient and non-recursive way. 3. Once the whole legalizer is non-recursive, we can implement whole-function selectiondags maybe... llvm-svn: 42981	2007-10-15 06:10:22 +00:00
Arnold Schwaighofer	1f0da1fefb	Corrected many typing errors. And removed 'nest' parameter handling for fastcc from X86CallingConv.td. This means that nested functions are not supported for calling convention 'fastcc'. llvm-svn: 42934	2007-10-12 21:30:57 +00:00
Dan Gohman	e3583817ac	Fix some corner cases with vectors in copyToRegs and copyFromRegs. llvm-svn: 42907	2007-10-12 14:33:11 +00:00
Dan Gohman	be37007e64	Add intrinsics for sin, cos, and pow. These use llvm_anyfloat_ty, and so may be overloaded with vector types. And add a testcase for codegen for these. llvm-svn: 42885	2007-10-12 00:01:22 +00:00
Arnold Schwaighofer	9ccea99165	Added tail call optimization to the x86 back end. It can be enabled by passing -tailcallopt to llc. The optimization is performed if the following conditions are satisfied: * caller/callee are fastcc * elf/pic is disabled OR elf/pic enabled + callee is in module + callee has visibility protected or hidden llvm-svn: 42870	2007-10-11 19:40:01 +00:00
Dan Gohman	fadf40a655	In -debug mode, dump SelectionDAGs both before and after the optimization passes. llvm-svn: 42749	2007-10-08 15:12:17 +00:00
Dale Johannesen	4d4e77af8e	Rewrite sqrt and powi to use anyfloat. By popular demand. llvm-svn: 42537	2007-10-02 17:43:59 +00:00
Dale Johannesen	b6c05b1f90	Fix stride computations for long double arrays. llvm-svn: 42508	2007-10-01 23:08:35 +00:00
Dale Johannesen	25a00a63eb	Add sqrt and powi intrinsics for long double. llvm-svn: 42423	2007-09-28 01:08:20 +00:00
Dale Johannesen	b6d56401aa	Enable codegen for long double abs, sin, cos llvm-svn: 42368	2007-09-26 21:10:55 +00:00
Dale Johannesen	98d3a08d8f	Remove the assumption that FP's are either float or double from some of the many places in the optimizers it appears, and do something reasonable with x86 long double. Make APInt::dump() public, remove newline, use it to dump ConstantSDNode's. Allow APFloats in FoldingSet. Expand X86 backend handling of long doubles (conversions to/from int, mostly). llvm-svn: 41967	2007-09-14 22:26:36 +00:00
Duncan Sands	86e0119822	Fold the adjust_trampoline intrinsic into init_trampoline. There is now only one trampoline intrinsic. llvm-svn: 41841	2007-09-11 14:10:23 +00:00
Chris Lattner	33a7f51412	1. Don't call Value::getName(), which is slow. 2. Lower calls to fabs and friends to FABS nodes etc unless the function has internal linkage. Before we wouldn't lower if it had a definition, which is incorrect. This allows us to compile: define double @fabs(double %f) { %tmp2 = tail call double @fabs( double %f ) ret double %tmp2 } into: _fabs: fabs f1, f1 blr llvm-svn: 41805	2007-09-10 21:15:22 +00:00
Rafael Espindola	1de0c86717	Add support for having different alignment for objects on call frames. The x86-64 ABI states that objects passed on the stack have 8 byte alignment. Implement that. llvm-svn: 41768	2007-09-07 14:52:14 +00:00
Anton Korobeynikov	122bf4be7e	Split eh.select / eh.typeid.for intrinsics into i32/i64 versions. This is needed, because they just "mark" register liveins and we let frontend solve type issue, not lowering code :) llvm-svn: 41763	2007-09-07 11:39:35 +00:00
Dale Johannesen	bed9dc423c	Next round of APFloat changes. Use APFloat in UpgradeParser and AsmParser. Change all references to ConstantFP to use the APFloat interface rather than double. Remove the ConstantFP double interfaces. Use APFloat functions for constant folding arithmetic and comparisons. (There are still way too many places APFloat is just a wrapper around host float/double, but we're getting there.) llvm-svn: 41747	2007-09-06 18:13:44 +00:00
Duncan Sands	3c1b7fc056	Fix PR1628. When exception handling is turned on, labels are generated bracketing each call (not just invokes). This is used to generate entries in the exception table required by the C++ personality. However it gets in the way of tail-merging. This patch solves the problem by no longer placing labels around ordinary calls. Instead we generate entries in the exception table that cover every instruction in the function that wasn't covered by an invoke range (the range given by the labels around the invoke). As an optimization, such entries are only generated for parts of the function that contain a call, since for the moment those are the only instructions that can throw an exception [1]. As a happy consequence, we now get a smaller exception table, since the same region can cover many calls. While there, I also implemented folding of invoke ranges - successive ranges are merged when safe to do so. Finally, if a selector contains only a cleanup, there's a special shorthand for it - place a 0 in the call-site entry. I implemented this while there. As a result, the exception table output (excluding filters) is now optimal - it cannot be made smaller [2]. The problem with throw filters is that folding them optimally is hard, and the benefit of folding them is minimal. [1] I tested that having trapping instructions (eg divide by zero) in such a region doesn't cause trouble. [2] It could be made smaller with the help of higher layers, eg by having branch folding reorder basic blocks ending in invokes with the same landing pad so they follow each other. I don't know if this is worth doing. llvm-svn: 41718	2007-09-05 11:27:52 +00:00
Evan Cheng	e0cb6bb8da	Fix for PR1632. EHSELECTION always produces a i32 value. llvm-svn: 41712	2007-09-04 20:39:26 +00:00
Dan Gohman	81b62e1218	Add an option, -view-sunit-dags, for viewing the actual SUnit DAGs used by scheduling. llvm-svn: 41556	2007-08-28 20:32:58 +00:00
Dan Gohman	8dc0b93151	If the source and destination pointers in an llvm.memmove are known to not alias each other, it can be translated as an llvm.memcpy. llvm-svn: 41489	2007-08-27 16:26:13 +00:00
Duncan Sands	ef5a654216	There is an impedance matching problem between LLVM and gcc exception handling: if an exception unwinds through an invoke, then execution must branch to the invoke's unwind target. We previously tried to enforce this by appending a cleanup action to every selector, however this does not always work correctly due to an optimization in the C++ unwinding runtime: if only cleanups would be run while unwinding an exception, then the program just terminates without actually executing the cleanups, as invoke semantics would require. I was hoping this wouldn't be a problem, but in fact it turns out to be the cause of all the remaining failures in the LLVM testsuite (these also fail with -enable-correct-eh-support, so turning on -enable-eh didn't make things worse!). Instead we need to append a full-blown catch-all to the end of each selector. The correct way of doing this depends on the personality function, i.e. it is language dependent, so can only be done by gcc. Thus this patch which generalizes the eh.selector intrinsic so that it can handle all possible kinds of action table entries (before it didn't accommodate cleanups): now 0 indicates a cleanup, and filters have to be specified using the number of type infos plus one rather than the number of type infos. Related gcc patches will cause Ada to pass a cleanup (0) to force the selector to always fire, while C++ will use a C++ catch-all (null). llvm-svn: 41484	2007-08-27 15:47:50 +00:00
Chris Lattner	d8c9cb9182	rename isOperandValidForConstraint to LowerAsmOperandForConstraint, changing the interface to allow for future changes. llvm-svn: 41384	2007-08-25 00:47:38 +00:00
Anton Korobeynikov	97cdac8d19	Perform correct codegen for eh_dwarf_cfa intrinsic. llvm-svn: 41316	2007-08-23 07:21:06 +00:00
Rafael Espindola	9c3d20d823	Partial implementation of calling functions with byval arguments: the needed information is propagated to the DAG, and the X86-64 backend detects it and aborts. llvm-svn: 41179	2007-08-20 15:18:24 +00:00
Evan Cheng	95667c532c	- If a dynamic_stackalloc alignment requirement is <= stack alignment, then the alignment argument is ignored. - Always round up the size of the allocation to multiples of stack alignment to ensure the stack ptr is never left in an invalid state after a dynamic_stackalloc. llvm-svn: 41132	2007-08-16 23:46:29 +00:00
Dan Gohman	a17799a3bd	Fix EXTRACT_ELEMENT, EXTRACT_SUBVECTOR, and EXTRACT_VECTOR_ELT to use an intptr ValueType instead of i32 for the index operand in getCopyToParts. llvm-svn: 40987	2007-08-10 14:59:38 +00:00
Rafael Espindola	66011c17d5	propagate struct size and alignment of byval arguments to the DAG llvm-svn: 40986	2007-08-10 14:44:42 +00:00
Chandler Carruth	7132e00de7	This is the patch to provide clean intrinsic function overloading support in LLVM. It cleans up the intrinsic definitions and generally smooths the process for more complicated intrinsic writing. It will be used by the upcoming atomic intrinsics as well as vector and float intrinsics in the future. This also changes the syntax for llvm.bswap, llvm.part.set, llvm.part.select, and llvm.ct* intrinsics. They are automatically upgraded by both the LLVM ASM reader and the bitcode reader. The test cases have been updated, with special tests added to ensure the automatic upgrading is supported. llvm-svn: 40807	2007-08-04 01:51:18 +00:00
Chris Lattner	3ffe7187db	don't redefine a parameter llvm-svn: 40748	2007-08-02 18:08:16 +00:00
Dan Gohman	4ff9fb14f6	Fix a bug in getCopyFromParts turned up in the testcase for PR1132. llvm-svn: 40598	2007-07-30 19:09:17 +00:00
Duncan Sands	644f917358	Support for trampolines, except for X86 codegen which is still under discussion. llvm-svn: 40549	2007-07-27 12:58:54 +00:00
Dan Gohman	f0bb12848f	Add const to CanBeFoldedBy, CheckAndMask, and CheckOrMask. llvm-svn: 40480	2007-07-24 23:00:27 +00:00
Dan Gohman	a7b65c30a3	It's not necessary to do rounding for alloca operations when the requested alignment is equal to the stack alignment. llvm-svn: 40004	2007-07-18 16:29:46 +00:00
Dan Gohman	06c60b6032	Fix comments about vectors to use the current wording. llvm-svn: 39921	2007-07-16 14:29:03 +00:00
Anton Korobeynikov	383a324735	Long live the exception handling! This patch fills in the last necessary bits to enable exception handling in LLVM. Currently only on x86-32/linux. In fact, this patch adds the necessary intrinsics (and their lowering) which represent really weird target-specific gcc builtins used inside the unwinder. Once the corresponding llvm-gcc patch lands (easy), exceptions should be more or less workable. However, exception handling support should not be thought of as 'finished': I expect many small and not so small glitches everywhere. llvm-svn: 39855	2007-07-14 14:06:15 +00:00
Dale Johannesen	2182f06f2d	Skeleton of post-RA scheduler; doesn't do anything yet. Change name of -sched option and DEBUG_TYPE to pre-RA-sched; adjust testcases. llvm-svn: 39816	2007-07-13 17:13:54 +00:00
Dan Gohman	f8f531bf69	Change getCopyToParts and getCopyFromParts to always use target-endian register ordering, for both physical and virtual registers. Update the PPC target lowering for calls to expect registers for the call result to already be in target order. llvm-svn: 38471	2007-07-09 20:59:04 +00:00
Duncan Sands	9d97420473	The exception handling intrinsics return values, so must be lowered to a value, not nothing at all. Subtle point: I made eh_selector return 0 and eh_typeid_for return 1. This means that only cleanups (destructors) will be run as the exception unwinds [if eh_typeid_for returned 0 then it would be as if the first catch always matched, and the corresponding handler would be run], which is probably what you want in the CBE. llvm-svn: 37947	2007-07-06 14:46:23 +00:00
Rafael Espindola	b567e3ffb0	Add the byval attribute llvm-svn: 37940	2007-07-06 10:57:03 +00:00
Duncan Sands	003c0b1f90	Remove propagateEHRegister in favour of a more limited fix, that is adequate while PR1508 remains unresolved. llvm-svn: 37938	2007-07-06 09:18:59 +00:00
Duncan Sands	81df18a50a	Remove ExtractGlobalVariable - use StripPointerCasts instead. llvm-svn: 37937	2007-07-06 09:10:03 +00:00
Evan Cheng	fc7010d962	Workaround of getCopyToRegs and getCopyFromRegs bugs for big-endian machines. llvm-svn: 37935	2007-07-06 01:47:35 +00:00
Dan Gohman	d258e80583	Add a parameter to getCopyToParts and getCopyFromParts to specify whether endian swapping should be done, and update the code to use it. This fixes some register ordering issues on big-endian systems, such as PowerPC, introduced by the recent illegal by-val arguments changes. llvm-svn: 37921	2007-07-05 20:12:34 +00:00
Duncan Sands	fe80638417	Extend eh.selector to support both catches and filters. Drop the eh.filter intrinsic. llvm-svn: 37875	2007-07-04 20:52:51 +00:00
Dale Johannesen	a2b3c175db	Fix for PR 1505 (and 1489). Rewrite X87 register model to include f32 variants. Some factoring improvements forthcoming. llvm-svn: 37847	2007-07-03 00:53:03 +00:00
Dan Gohman	533dd16a7f	Replace ExpandScalarFormalArgs and ExpandScalarCallArgs with the newly refactored getCopyFromParts and getCopyToParts, which are more general. This effectively adds support for lowering illegal by-val vector call arguments. llvm-svn: 37843	2007-07-02 16:18:06 +00:00
Evan Cheng	fa68d069ad	Only do FNEG xform when the vector type is a floating point type. llvm-svn: 37818	2007-06-29 21:44:35 +00:00
David Greene	4c1e6f3804	Remove unnecessary attributions in comments. llvm-svn: 37799	2007-06-29 03:42:23 +00:00
David Greene	9468bfd932	Fix reference to cached end iterator invalidated by an erase operation. Uncovered by _GLIBCXX_DEBUG. llvm-svn: 37795	2007-06-29 02:49:11 +00:00
Dan Gohman	7867793aff	Add new TargetLowering code to provide the final register type that an illegal value type will be transformed to, for code that needs the register type after all transformations instead of just after the first transformation. Factor out the code that uses this information to do copy-from-regs and copy-to-regs for various purposes into separate functions so that they are done consistently. llvm-svn: 37781	2007-06-28 23:29:44 +00:00
Evan Cheng	77f541ddfd	Partial fix for PR1502: If a EH register is needed in a successor of landing pad, add it as livein to all the blocks in the paths between the landing pad and the specified block. llvm-svn: 37763	2007-06-27 18:45:32 +00:00
Dan Gohman	7139a48057	Use getVectorTypeBreakdown in FunctionLoweringInfo::CreateRegForValue to compute the number and type of registers needed for vector values instead of computing it manually. This fixes PR1529. llvm-svn: 37755	2007-06-27 14:34:07 +00:00
Dan Gohman	a866514528	Generalize MVT::ValueType and associated functions to be able to represent extended vector types. Remove the special SDNode opcodes used for pre-legalize vector operations, and the special MVT::Vector type used with them. Adjust lowering and legalize to work with the normal SDNode kinds instead, and to use the normal MVT functions to work with vector types instead of using the two special operands that the pre-legalize nodes held. This allows pre-legalize and post-legalize DAGs, and the code that operates on them, to be more consistent. Pre-legalize vector operators can be handled more consistently with scalar operators. And, -view-dag-combine1-dags and -view-legalize-dags now look prettier for vector code. llvm-svn: 37719	2007-06-25 16:23:39 +00:00
Dan Gohman	309d3d51b3	Move ComputeMaskedBits, MaskedValueIsZero, and ComputeNumSignBits from TargetLowering to SelectionDAG so that they have more convenient access to the current DAG, in preparation for the ValueType routines being changed from standalone functions to members of SelectionDAG for the pre-legalize vector type changes. llvm-svn: 37704	2007-06-22 14:59:07 +00:00
Dan Gohman	04deef3a49	Rename TargetLowering::getNumElements and friends to TargetLowering::getNumRegisters and similar, to avoid confusion with the actual number of elements for vector types. llvm-svn: 37687	2007-06-21 14:42:22 +00:00
Tanya Lattner	e199f97fa8	Codegen support (stripped out) for the annotate attribute. llvm-svn: 37608	2007-06-15 22:26:58 +00:00
Chris Lattner	f852e339b6	Fix CodeGen/X86/inline-asm-x-scalar.ll:test4, by retaining regclass info for tied register constraints. llvm-svn: 37601	2007-06-15 19:11:01 +00:00
Duncan Sands	92bf2c628c	Workaround for PR1508. llvm-svn: 37597	2007-06-15 19:04:19 +00:00
Dan Gohman	5c4413120f	Rename MVT::getVectorBaseType to MVT::getVectorElementType. llvm-svn: 37579	2007-06-14 22:58:02 +00:00
Duncan Sands	7413736a7e	Only correctly lower exception handling intrinsics if exception handling is turned on. Likewise for scanning of invokes to mark landing pads. llvm-svn: 37570	2007-06-13 16:53:21 +00:00
Dan Gohman	26455c4ae0	Introduce new SelectionDAG node opcodes VEXTRACT_SUBVECTOR and VCONCAT_VECTORS. Use these for CopyToReg and CopyFromReg legalizing in the case that the full register is to be split into subvectors instead of scalars. This replaces uses of VBIT_CONVERT to present values as vector-of-vector types in order to make whole subvectors accessible via BUILD_VECTOR and EXTRACT_VECTOR_ELT. This is in preparation for adding extended ValueType values, where having vector-of-vector types is undesirable. llvm-svn: 37569	2007-06-13 15:12:02 +00:00
Dan Gohman	cbd51c8b60	When creating CopyFromReg nodes, always use legal types. And use the correct types for the result vector, even though it is currently bitcasted to a different type immediately. llvm-svn: 37568	2007-06-13 14:55:16 +00:00
Duncan Sands	97f7236e70	The fix that was applied for PR1224 stops the compiler crashing but breaks exception handling. The problem described in PR1224 is that invoke is a terminator that can produce a value. The value may be needed in other blocks. The code that writes to registers values needed in other blocks runs before terminators are lowered (in this case invoke) so asserted because the value was not yet available. The fix that was applied was to do invoke lowering earlier, before writing values to registers. The problem this causes is that the code to copy values to registers can be output after the invoke call. If an exception is raised and control is passed to the landing pad then this copy-code will never execute. If the value is needed in some code path reached via the landing pad then that code will get something bogus. So revert the original fix and simply skip invoke values in the general copying to registers code. Instead copy the invoke value to a register in the invoke lowering code. llvm-svn: 37567	2007-06-13 05:51:31 +00:00
Dale Johannesen	9a4d987a5f	Do not change the size of function arguments. PR 1489. llvm-svn: 37496	2007-06-07 21:07:15 +00:00
Duncan Sands	61166501a1	Additional fix for PR1422: make sure the landing pad label is placed in the correct machine basic block - do not rely on the eh.exception intrinsic being in the landing pad: the loop optimizers can move it out. llvm-svn: 37463	2007-06-06 10:05:18 +00:00
Duncan Sands	c063f5f362	Integrate exception filter support and exception catch support. This simplifies the code in DwarfWriter, allows for multiple filters and makes it trivial to specify filters accompanied by cleanups or catch-all specifications (see next patch). What a deal! Patch blessed by Anton. llvm-svn: 37398	2007-06-02 16:53:42 +00:00
Duncan Sands	706421e712	Since TypeInfos are passed as i8 pointers, a NULL TypeInfo should be passed as a null i8 pointer not as a 0 i32. llvm-svn: 37383	2007-06-01 08:18:30 +00:00
Dan Gohman	30978078bf	Minor comment cleanups. llvm-svn: 37321	2007-05-24 14:36:04 +00:00
Anton Korobeynikov	3b327826db	Mark all calls as "could throw", when exceptions are enabled. Emit necessary LP info too. This fixes PR1439 llvm-svn: 37311	2007-05-23 11:08:31 +00:00
Dan Gohman	1796f1f8e9	Qualify several calls to functions in the MVT namespace, for consistency. llvm-svn: 37230	2007-05-18 17:52:13 +00:00
Chris Lattner	c7596efdad	Fix some subtle issues handling immediate values. This fixes test/CodeGen/ARM/2007-05-14-InlineAsmCstCrash.ll llvm-svn: 37069	2007-05-15 01:33:58 +00:00
Anton Korobeynikov	192d09c2d9	Do not assert, when case range split metric is zero and JTs are not allowed: just emit binary tree in this case. This fixes PR1403. llvm-svn: 36959	2007-05-09 20:07:08 +00:00
Duncan Sands	671e8c4444	Parameter attributes on invoke calls were being lost due to the wrong attribute index being used. Fix proposed by Anton Korobeynikov, who asked me to implement and commit it for him. This is PR1398. llvm-svn: 36906	2007-05-07 20:49:28 +00:00
Anton Korobeynikov	a8fd7fdc25	Detabify llvm-svn: 36891	2007-05-06 20:14:21 +00:00
Duncan Sands	4cb9eb81ef	A bitcast of a global variable may have been constant folded to a GEP - handle this case too. llvm-svn: 36745	2007-05-04 17:12:26 +00:00
Devang Patel	8c78a0bff0	Drop 'const' llvm-svn: 36662	2007-05-03 01:11:54 +00:00
Anton Korobeynikov	11940fbba3	Properly set arguments bitwidth of EHSELECT node llvm-svn: 36654	2007-05-02 22:15:48 +00:00
Devang Patel	e95c6ad802	Use 'static const char' instead of 'static const int'. Due to a darwin gcc bug, one version of the darwin linker coalesces static const int, which defeats PassID based pass identification. llvm-svn: 36652	2007-05-02 21:39:20 +00:00
Devang Patel	09f162ca6a	Do not use typeinfo to identify pass in pass manager. llvm-svn: 36632	2007-05-01 21:15:47 +00:00
Chris Lattner	8cfd33b647	Continue refactoring inline asm code. If there is an earlyclobber output register, preallocate all input registers and the early clobbered output. This fixes PR1357 and CodeGen/PowerPC/2007-04-30-InlineAsmEarlyClobber.ll llvm-svn: 36599	2007-04-30 21:11:17 +00:00
Chris Lattner	4333f8b1cf	refactor GetRegistersForValue to take OpInfo as an argument instead of various pieces of it. No functionality change. llvm-svn: 36592	2007-04-30 17:29:31 +00:00
Chris Lattner	ef07332504	refactor some code, no functionality change llvm-svn: 36590	2007-04-30 17:16:27 +00:00
Chris Lattner	412d61af43	generalize aggregate handling llvm-svn: 36568	2007-04-29 18:58:03 +00:00
Chris Lattner	401d8db381	memory operands that have a direct operand should have their stores created before the copies into physregs are done. This avoids having flag operands skip the store, causing cycles in the dag at sched time. This fixes infinite loops on these tests: test/CodeGen/Generic/2007-04-08-MultipleFrameIndices.ll for PR1308 test/CodeGen/PowerPC/2007-01-29-lbrx-asm.ll test/CodeGen/PowerPC/2007-01-31-InlineAsmAddrMode.ll test/CodeGen/X86/2006-07-12-InlineAsmQConstraint.ll for PR828 llvm-svn: 36547	2007-04-28 21:12:06 +00:00
Chris Lattner	de339fa55d	eliminate more redundant constraint type analysis llvm-svn: 36546	2007-04-28 21:03:16 +00:00
Chris Lattner	b2e55562ed	merge constraint type analysis stuff together. llvm-svn: 36545	2007-04-28 21:01:43 +00:00
Chris Lattner	d7e3b6c442	Significant refactoring of the inline asm stuff, to support future changes. No functionality change. llvm-svn: 36544	2007-04-28 20:49:53 +00:00
Chris Lattner	1deacd61f4	memory inputs to an inline asm are required to have an address available. If the operand is not already an indirect operand, spill it to a constant pool entry or a stack slot. This fixes PR1356 and CodeGen/X86/2007-04-27-InlineAsm-IntMemInput.ll llvm-svn: 36536	2007-04-28 06:42:38 +00:00
Chris Lattner	d102ed0ac6	Fix CodeGen/Generic/2007-04-27-LargeMemObject.ll and CodeGen/Generic/2007-04-27-InlineAsm-X-Dest.ll llvm-svn: 36534	2007-04-28 06:08:13 +00:00
Chris Lattner	4df3e8093b	Fix this to match change to InlineAsm class. llvm-svn: 36524	2007-04-28 04:05:59 +00:00
Chris Lattner	784fe9dbbb	improve EH global handling, patch by Duncan Sands. llvm-svn: 36499	2007-04-27 01:20:11 +00:00
Chris Lattner	8131ab7c0f	enable Anton's shift/and switch lowering stuff! It now passes ppc bootstrap successfully! woohoo... llvm-svn: 36496	2007-04-26 21:09:43 +00:00
Anton Korobeynikov	d7ae7f1659	Fix off-by-one bug, which prevents llvm-gcc bootstrap on ppc32 llvm-svn: 36490	2007-04-26 20:44:04 +00:00
Evan Cheng	15f269afa3	This was left out. Fixed sumarray-dbl. llvm-svn: 36445	2007-04-25 18:33:21 +00:00
Chris Lattner	cb0ed0cfbd	allow support for 64-bit stack objects llvm-svn: 36420	2007-04-25 04:08:28 +00:00
Bill Wendling	47917b697f	Assertion when using a 1-element vector for an add operation. Get the real vector type in this case. llvm-svn: 36402	2007-04-24 21:13:23 +00:00
Scott Michel	4cfa616cee	Use '-1U' where '-1UL' is obvious overkill, eliminating gcc warnings about tests always being true in the process. llvm-svn: 36387	2007-04-24 01:24:20 +00:00
Christopher Lamb	8af6d5896f	PR400 phase 2. Propagate attributed load/store information through DAGs. llvm-svn: 36356	2007-04-22 23:15:30 +00:00
Reid Spencer	0c1349e6bc	Revert Christopher Lamb's load/store alignment changes. llvm-svn: 36309	2007-04-21 18:36:27 +00:00
Christopher Lamb	bff50208c8	add support for alignment attributes on load/store instructions llvm-svn: 36301	2007-04-21 08:16:25 +00:00
Chris Lattner	6bd7b7b30b	disable switch lowering using shift/and. It still breaks ppc bootstrap for some reason. :( Will investigate. llvm-svn: 36011	2007-04-14 19:39:41 +00:00
Anton Korobeynikov	8a1a84f96e	Fix PR1325: Case range optimization was performed in the case it shouldn't. Also fix some "latent" bug on 64-bit platforms llvm-svn: 35990	2007-04-14 13:25:55 +00:00
Chris Lattner	7196f09edc	disable shift/and lowering to work around PR1325 for now. llvm-svn: 35985	2007-04-14 02:26:56 +00:00
Anton Korobeynikov	e288040abf	Fix PR1323 : we haven't updated phi nodes in good manner :) llvm-svn: 35963	2007-04-13 06:53:51 +00:00
Chris Lattner	5111499136	the result of an inline asm copy can be an arbitrary VT that the register class supports. In the case of vectors, this means we often get the wrong type (e.g. we get v4f32 instead of v8i16). Make sure to convert the vector result to the right type. This fixes CodeGen/X86/2007-04-11-InlineAsmVectorResult.ll llvm-svn: 35944	2007-04-12 06:00:20 +00:00
Reid Spencer	c6251a7dfd	For PR1284: Implement the "part_set" intrinsic. llvm-svn: 35938	2007-04-12 02:48:46 +00:00
Reid Spencer	a472f66dd0	For PR1146: Put the parameter attributes in their own ParamAttr name space. Adjust the rest of llvm as a result. llvm-svn: 35877	2007-04-11 02:44:20 +00:00
Chris Lattner	f269d84ca0	apparently some people commit without building the tree, or they forget to commit a LOT of files. llvm-svn: 35858	2007-04-10 03:20:39 +00:00
Jeff Cohen	e0bbbd3774	No longer needed. llvm-svn: 35850	2007-04-09 23:42:32 +00:00
Anton Korobeynikov	da964a2852	Use integer log for metric calculation llvm-svn: 35834	2007-04-09 21:57:03 +00:00
Jeff Cohen	0475f3b4e9	Unbreak VC++ build. llvm-svn: 35817	2007-04-09 14:32:59 +00:00
Anton Korobeynikov	506eaf7915	Next stage into switch lowering refactoring 1. Fix some bugs in the jump table lowering threshold 2. Implement much better metric for optimal pivot selection 3. Tune thresholds for different lowering methods 4. Implement shift-and trick for lowering small (<machine word length) cases with few destinations. Good testcase will follow. llvm-svn: 35816	2007-04-09 12:31:58 +00:00
Reid Spencer	71b79e3d99	For PR1146: Adapt handling of parameter attributes to use the new ParamAttrsList class. llvm-svn: 35814	2007-04-09 06:17:21 +00:00
Chris Lattner	7b2decfa0a	implement CodeGen/X86/inline-asm-x-scalar.ll:test3 llvm-svn: 35802	2007-04-09 05:31:20 +00:00
Chris Lattner	b49917da92	Fix PR1316 llvm-svn: 35783	2007-04-09 00:33:58 +00:00
Chris Lattner	e55ecfb870	Fix for CodeGen/X86/2007-04-08-InlineAsmCrash.ll and PR1314 llvm-svn: 35779	2007-04-08 22:23:26 +00:00
Chris Lattner	1c741e95d3	minor comment fix llvm-svn: 35696	2007-04-06 17:47:14 +00:00
Reid Spencer	85460acfbf	Change the bit_part_select (non)implementation from "return 0" to abort. llvm-svn: 35679	2007-04-05 01:20:18 +00:00
Reid Spencer	cce90f55ed	Implement the llvm.bit.part_select.iN.iN.iN overloaded intrinsic. llvm-svn: 35678	2007-04-04 23:48:25 +00:00
Anton Korobeynikov	915e61736b	Properly emit range comparisons for switch cases, where neighbour cases go to the same destination. Now we're producing really good code for switch-lower-feature.ll testcase llvm-svn: 35672	2007-04-04 21:14:49 +00:00
Reid Spencer	3a0843e734	For PR1297: Adjust for changes in the bit counting intrinsics. They all return i32 now so we have to trunc/zext the DAG node accordingly. llvm-svn: 35546	2007-04-01 07:34:11 +00:00
Chris Lattner	f6a6d3c8b0	move a bunch of code out of the sdisel pass into its own opt pass "codegenprepare". llvm-svn: 35529	2007-03-31 04:18:03 +00:00
Evan Cheng	4388043b25	Scale 1 is always ok. llvm-svn: 35407	2007-03-28 01:55:52 +00:00
Evan Cheng	07c42d43a2	GEP index sinking fixes: 1) Take address scale into consideration. e.g. i32* -> scale 4. 2) Examine all the users of GEP. 3) Generalize to inter-block GEP's (no longer uses loopinfo). 4) Don't do xform if GEP has other variable index(es). llvm-svn: 35403	2007-03-28 01:49:39 +00:00
Anton Korobeynikov	37a0bfe128	Remove dead code llvm-svn: 35380	2007-03-27 12:05:48 +00:00
Anton Korobeynikov	3a9d68181a	Split big monster into small helpers. No functionality change. llvm-svn: 35379	2007-03-27 11:29:11 +00:00
Evan Cheng	c42406b5ad	SDISel does not preserve all, it changes CFG and other info. llvm-svn: 35376	2007-03-27 00:53:36 +00:00
Anton Korobeynikov	7037826c86	First step of switch lowering refactoring: perform worklist-driven strategy, emit JT's where possible. llvm-svn: 35338	2007-03-25 15:07:15 +00:00
Chris Lattner	77f0479833	Implement support for vector operands to inline asm, implementing CodeGen/X86/2007-03-24-InlineAsmVectorOp.ll llvm-svn: 35332	2007-03-25 05:00:54 +00:00
Chris Lattner	d685514e2e	switch TargetLowering::getConstraintType to take the entire constraint, not just the first letter. No functionality change. llvm-svn: 35322	2007-03-25 02:14:49 +00:00
Dan Gohman	dcb291faa4	Change uses of Function::front to Function::getEntryBlock for readability. llvm-svn: 35265	2007-03-22 16:38:57 +00:00
Evan Cheng	550cf0369c	Minor bug. llvm-svn: 35219	2007-03-20 19:32:11 +00:00
Evan Cheng	a2465dfc07	Use SmallSet instead of std::set. llvm-svn: 35133	2007-03-17 08:53:30 +00:00
Evan Cheng	be22235790	If sdisel has decided to sink GEP index expression into any BB. Replace all uses in that BB. llvm-svn: 35132	2007-03-17 08:22:49 +00:00
Evan Cheng	c5bc763f50	Turn on GEP index sinking by default. llvm-svn: 35127	2007-03-16 18:32:30 +00:00
Evan Cheng	0a9d0cabaf	Stupid bug. llvm-svn: 35126	2007-03-16 17:50:20 +00:00
Evan Cheng	009ea54262	Sink a binary expression into its use blocks if it is a loop invariant computation used as GEP indexes and if the expression can be folded into target addressing mode of GEP load / store use types. llvm-svn: 35123	2007-03-16 08:46:27 +00:00
Chris Lattner	ce8aba03ee	implement support for floating point constants used as inline asm memory operands. llvm-svn: 35033	2007-03-08 22:29:47 +00:00
Chris Lattner	b7bc3f2d30	make this fail even in non-assert builds. llvm-svn: 35025	2007-03-08 07:07:03 +00:00
Anton Korobeynikov	ed4b303c10	Refactoring of formal parameter flags. Enable properly use of zext/sext/aext stuff. llvm-svn: 35008	2007-03-07 16:25:09 +00:00
Anton Korobeynikov	f0b9316552	Enumerate SDISel formal parameter attributes. Make use of new enumeration. llvm-svn: 34960	2007-03-06 06:10:33 +00:00
Jeff Cohen	b622c11f77	Unbreak VC++ build. llvm-svn: 34917	2007-03-05 00:00:42 +00:00
Jim Laskey	d5453d7b56	Lower eh filter intrinsic. llvm-svn: 34802	2007-03-01 20:24:30 +00:00
Jim Laskey	cf465fcebc	MERGE_VALUES unnecessary. llvm-svn: 34750	2007-02-28 18:37:04 +00:00
Chris Lattner	ab5d0ac02c	track signedness of formal argument, though we have a fixme here. llvm-svn: 34620	2007-02-26 02:56:58 +00:00
Jim Laskey	14059d958a	Fix for PR1224. llvm-svn: 34610	2007-02-25 21:43:59 +00:00
Chris Lattner	8c504cf9a0	optimize duplicate ValueMap lookups llvm-svn: 34599	2007-02-25 18:40:32 +00:00
Jim Laskey	e1d1c0590f	Deal with cases when MMI is not requested. llvm-svn: 34556	2007-02-24 09:45:44 +00:00
Jim Laskey	31fef788eb	Handle improper cast. llvm-svn: 34535	2007-02-23 21:45:01 +00:00
Jim Laskey	44c37e7dbf	Tighten up error checking of args. llvm-svn: 34493	2007-02-22 16:10:05 +00:00
Jim Laskey	504e99479c	Handle lowering invoke to call correctly. llvm-svn: 34492	2007-02-22 15:38:06 +00:00
Jim Laskey	4b37a4c712	Selection and lowering for exception handling. llvm-svn: 34481	2007-02-21 22:53:45 +00:00
Reid Spencer	09575bac2e	For PR1195: Change use of "packed" term to "vector" in comments, strings, variable names, etc. llvm-svn: 34300	2007-02-15 03:39:18 +00:00
Reid Spencer	d84d35ba70	For PR1195: Rename PackedType -> VectorType, ConstantPacked -> ConstantVector, and PackedTyID -> VectorTyID. No functional changes. llvm-svn: 34293	2007-02-15 02:26:10 +00:00
Chris Lattner	ab1812f806	fix a warning llvm-svn: 34272	2007-02-14 07:34:56 +00:00
Chris Lattner	1cf84d2745	Refix CodeGen/Generic/switch-lower.ll. In contrast to my previous patch, this doesn't miscompile lots of programs :) llvm-svn: 34268	2007-02-14 07:18:16 +00:00
Chris Lattner	945e437c65	Generalize TargetData strings, to support more interesting forms of data. Patch by Scott Michel. llvm-svn: 34266	2007-02-14 05:52:17 +00:00
Chris Lattner	2fbff4d2dc	revert my previous switch lowering change, which miscompiles a few programs. This will break a dj test until I have time to investigate. llvm-svn: 34247	2007-02-13 20:09:07 +00:00
Lauro Ramos Venancio	abde3cc16c	Add a space between // and the comment. llvm-svn: 34244	2007-02-13 18:10:13 +00:00
Lauro Ramos Venancio	9956dcffbe	Add "original alignment" to function arguments flags. llvm-svn: 34240	2007-02-13 13:50:08 +00:00
Chris Lattner	9056bae3be	Fix switch lowering to order cases in zext order, which is how we emit the comparisons. This fixes an infinite loop on CodeGen/Generic/switch-lower.ll and PR1197 llvm-svn: 34216	2007-02-13 01:05:56 +00:00
Chris Lattner	c473d8e431	Privatize StructLayout::MemberOffsets, adding an accessor llvm-svn: 34156	2007-02-10 19:55:17 +00:00
Evan Cheng	276b44b0f9	Add function live-ins to entry block live-in set. llvm-svn: 34112	2007-02-10 02:43:39 +00:00
Evan Cheng	de6083463d	Rename some variables to avoid confusion with SelectionDAGISel::BB. llvm-svn: 34110	2007-02-10 01:08:18 +00:00
Chris Lattner	289aa4495c	Switch ValueMap from std::map to DenseMap. llvm-svn: 33863	2007-02-04 01:35:11 +00:00
Chris Lattner	79084305ee	Switch NodeMap from std::map to DenseMap, this speeds up isel by 2.3% llvm-svn: 33862	2007-02-04 01:31:47 +00:00
Reid Spencer	2341c22ec7	Changes to support making the shift instructions be true BinaryOperators. This feature is needed in order to support shifts of more than 255 bits on large integer types. This changes the syntax for llvm assembly to make shl, ashr and lshr instructions look like a binary operator: shl i32 %X, 1 instead of shl i32 %X, i8 1 Additionally, this should help a few passes perform additional optimizations. llvm-svn: 33776	2007-02-02 02:16:23 +00:00
Chris Lattner	296a83cefb	Fit in 80 columns llvm-svn: 33745	2007-02-01 04:55:59 +00:00
Chris Lattner	e3eeb24a86	Emit a better assertion message for PR1133 llvm-svn: 33736	2007-02-01 01:21:12 +00:00
Reid Spencer	5301e7c605	For PR1136: Rename GlobalVariable::isExternal as isDeclaration to avoid confusion with external linkage types. llvm-svn: 33663	2007-01-30 20:08:39 +00:00
Chris Lattner	d27f95e08d	add initial support for handling inline asms with multiple constraints. This doesn't do the "right thing" but will probably work in most cases. This implements CodeGen/PowerPC/2007-01-29-lbrx-asm.ll. llvm-svn: 33643	2007-01-29 23:45:14 +00:00
Nate Begeman	eda5997cc8	Finish off bug 680, allowing targets to custom lower frame and return address nodes. llvm-svn: 33636	2007-01-29 22:58:52 +00:00
Anton Korobeynikov	06f7d4bec7	Arguments are counting from 1. not from 0. Maybe we should change numbering somehow? E.g. make return argument the last? llvm-svn: 33606	2007-01-28 18:01:49 +00:00
Anton Korobeynikov	9fa3839d29	More cleanup llvm-svn: 33605	2007-01-28 16:04:40 +00:00
Anton Korobeynikov	037c867b54	Propagate changes from my local tree. This patch includes: 1. New parameter attribute called 'inreg'. It has meaning "place this parameter in registers, if possible". This is some generalization of gcc's regparm(n) attribute. It's currently used only in X86-32 backend. 2. Completely rewritten CC handling/lowering code inside X86 backend. Merged stdcall + c CCs and fastcall + fast CC. 3. Dropped CSRET CC. We cannot add struct return variant for each target-specific CC (e.g. stdcall + csretcc and so on). 4. Instead of CSRET CC introduced 'sret' parameter attribute. Setting in on first attribute has meaning 'This is hidden pointer to structure return. Handle it gently'. 5. Fixed small bug in llvm-extract + add new feature to FunctionExtraction pass, which relinks all internal-linkaged callees from deleted function to external linkage. This will allow further linking everything together. NOTEs: 1. Documentation will be updated soon. 2. llvm-upgrade should be improved to translate csret => sret. Before this, there will be some unexpected test fails. llvm-svn: 33597	2007-01-28 13:31:35 +00:00
Jim Laskey	c56315c2b5	Change the MachineDebugInfo to MachineModuleInfo to better reflect usage for debugging and exception handling. llvm-svn: 33550	2007-01-26 21:22:28 +00:00
Jim Laskey	f9e5445ed4	Make LABEL a builtin opcode. llvm-svn: 33537	2007-01-26 14:34:52 +00:00
Reid Spencer	2eadb5310d	For PR970: Clean up handling of isFloatingPoint() and dealing with PackedType. Patch by Gordon Henriksen! llvm-svn: 33415	2007-01-21 00:29:26 +00:00
Chris Lattner	50ee0e40e5	Teach TargetData to handle 'preferred' alignment for each target, and use these alignment amounts to align scalars when we can. Patch by Scott Michel! llvm-svn: 33409	2007-01-20 22:35:55 +00:00
Zhou Sheng	75b871fb1e	For PR1043: Merge ConstantIntegral and ConstantBool into ConstantInt. Remove ConstantIntegral and ConstantBool from LLVM. llvm-svn: 33073	2007-01-11 12:24:14 +00:00
Chris Lattner	10cae15d8e	remove support for llvm.isunordered llvm-svn: 32992	2007-01-07 08:37:22 +00:00
Evan Cheng	8ec5283dc4	GEP subscript is interpreted as a signed value. llvm-svn: 32888	2007-01-05 01:46:20 +00:00
Chris Lattner	96035bed51	fix PowerPC/2007-01-04-ArgExtension.ll, a bug handling K&R prototypes with the recent signless changes. llvm-svn: 32884	2007-01-04 22:22:37 +00:00
Reid Spencer	e6f81876eb	Legalizer doesn't do an ANY_EXTEND if we don't ask for one so make sure that we default to an ANY_EXTEND if no parameter attribute is set on the result value of a function. llvm-svn: 32836	2007-01-03 16:49:33 +00:00
Reid Spencer	2a34b91666	Restore previous behavior of defaulting to ZEXT. This works around two things: (1) preventing PR1071 and (2) working around missing parameter attributes for bool type. (2) will be fixed shortly. When PR1071 is fixed, this patch should be undone. llvm-svn: 32831	2007-01-03 05:03:05 +00:00
Reid Spencer	0917adf614	Two changes: 1. Switch expression and cases are compared signed and are sign extended. 2. For function results needing extended, do SIGN_EXTEND if the SExtAttribute is set and ZERO_EXTEND if the ZExtAttribute is set, otherwise just let the Legalizer do ANY_EXTEND. This fixes the recent regression in kimwitu++ and probably the llvm-gcc bootstrap issue we had today. llvm-svn: 32830	2007-01-03 04:25:33 +00:00
Reid Spencer	e63b6518fa	For PR950: Three changes: 1. Convert signed integer types to signless versions. 2. Implement the @sext and @zext parameter attributes. Previously the type of an function parameter was used to determine whether it should be sign extended or zero extended before the call. This information is now communicated via the function type's parameter attributes. 3. The interface to LowerCallTo had to be changed in order to accommodate the parameter attribute information. Although it would have been convenient to pass in the FunctionType itself, there isn't always one present in the caller. Consequently, a signedness indication for the result type and for each parameter was provided for in the interface to this method. All implementations were changed to make the adjustment necessary. llvm-svn: 32788	2006-12-31 05:55:36 +00:00
Reid Spencer	266e42b312	For PR950: This patch removes the SetCC instructions and replaces them with the ICmp and FCmp instructions. The SetCondInst instruction has been removed and been replaced with ICmpInst and FCmpInst. llvm-svn: 32751	2006-12-23 06:05:41 +00:00
Evan Cheng	258657e64e	getLoad() and getStore() calls missed SVOffset operand. Thanks to Dan Gohman for pointing it out! llvm-svn: 32712	2006-12-20 01:27:29 +00:00
Chris Lattner	9bd5ed636c	Fix PR1049 and CodeGen/Generic/2006-12-16-InlineAsmCrash.ll by producing target constants instead of constants. Constants can get selected to li/movri instructions, which causes the scheduler to explode. llvm-svn: 32633	2006-12-16 21:14:48 +00:00
Evan Cheng	22cf89967b	More soft-fp work. llvm-svn: 32559	2006-12-13 20:57:08 +00:00
Reid Spencer	bfe26ffcfc	Replace CastInst::createInferredCast calls with more accurate cast creation calls. llvm-svn: 32521	2006-12-13 00:50:17 +00:00
Evan Cheng	634885f71e	Expand i32/i64 CopyToReg f32/f64 to BIT_CONVERT + CopyToReg. llvm-svn: 32493	2006-12-12 21:21:32 +00:00
Evan Cheng	0c0b78c18e	Expand formal arguments and call arguments recursively: e.g. f64 -> i64 -> 2 x i32. llvm-svn: 32476	2006-12-12 07:27:38 +00:00
Anton Korobeynikov	3b7c257cae	Cleaned setjmp/longjmp lowering interfaces. Now we're producing right code (both asm & cbe) for Mingw32 target. Removed autoconf checks for underscored versions of setjmp/longjmp. llvm-svn: 32415	2006-12-10 23:12:42 +00:00
Evan Cheng	4eee72471c	Preliminary soft float support. llvm-svn: 32394	2006-12-09 02:42:38 +00:00
Bill Wendling	22e978a736	Removing even more <iostream> includes. llvm-svn: 32320	2006-12-07 20:04:42 +00:00
Evan Cheng	feba507a97	Fix for PR1023 by Dan Gohman. llvm-svn: 32003	2006-11-29 01:58:12 +00:00
Evan Cheng	6e12a052ff	Fix for PR1022 (folding loads of static initializers) by Dan Gohman. llvm-svn: 32000	2006-11-29 01:38:07 +00:00
Chris Lattner	90f4238c38	add a hook to allow targets to hack on inline asms to lower them to llvm when they want to. llvm-svn: 31997	2006-11-29 01:12:32 +00:00
Evan Cheng	20350c4025	Change MachineInstr ctor's to take a TargetInstrDescriptor reference instead of opcode and number of operands. llvm-svn: 31947	2006-11-27 23:37:22 +00:00
Reid Spencer	6c38f0bb07	For PR950: The long awaited CAST patch. This introduces 12 new instructions into LLVM to replace the cast instruction. Corresponding changes throughout LLVM are provided. This passes llvm-test, llvm/test, and SPEC CPUINT2000 with the exception of 175.vpr which fails only on a slight floating point output difference. llvm-svn: 31931	2006-11-27 01:05:10 +00:00
Reid Spencer	d9436b6837	For PR950: First in a series of patches to convert SetCondInst into ICmpInst and FCmpInst using only two opcodes and having the instructions contain their predicate value. Nothing uses these classes yet. More patches to follow. llvm-svn: 31867	2006-11-20 01:22:35 +00:00
Chris Lattner	30d08801ef	remove dead #include llvm-svn: 31753	2006-11-15 17:51:15 +00:00
Chris Lattner	d5e604dbb2	commentate llvm-svn: 31627	2006-11-10 04:41:34 +00:00
Reid Spencer	fdff938a7e	For PR950: This patch converts the old SHR instruction into two instructions, AShr (Arithmetic) and LShr (Logical). The Shr instructions now are not dependent on the sign of their operands. llvm-svn: 31542	2006-11-08 06:47:33 +00:00
Reid Spencer	de46e48420	For PR786: Turn on -Wunused and -Wno-unused-parameter. Clean up most of the resulting fall out by removing unused variables. Remaining warnings have to do with unused functions (I didn't want to delete code without review) and unused variables in generated code. Maintainers should clean up the remaining issues when they see them. All changes pass DejaGnu tests and Olden. llvm-svn: 31380	2006-11-02 20:25:50 +00:00
Reid Spencer	7eb55b395f	For PR950: Replace the REM instruction with UREM, SREM and FREM. llvm-svn: 31369	2006-11-02 01:53:59 +00:00
Chris Lattner	55402d4403	Allow the getRegForInlineAsmConstraint method to return a register class with no fixed physreg. Treat this as permission to use any register in the register class. When this happens and it is safe, allow the llvm register allocator to allocate the register instead of doing it at isel time. This eliminates a ton of copies around common inline asms. For example: int test2(int Y, int X) { asm("foo %0, %1" : "=r"(X): "r"(X)); return X; } now compiles to: _test2: foo r3, r4 blr instead of: _test2: mr r2, r4 foo r2, r2 mr r3, r2 blr GCC produces: _test2: foo r4, r4 mr r3,r4 blr llvm-svn: 31366	2006-11-02 01:41:49 +00:00
Chris Lattner	fe43befeda	Compile CodeGen/PowerPC/fp-branch.ll to: _intcoord_cond_next55: LBB1_3: ;cond_next55 lis r2, ha16(LCPI1_0) lfs f0, lo16(LCPI1_0)(r2) fcmpu cr0, f1, f0 blt cr0, LBB1_2 ;cond_next62.exitStub LBB1_1: ;bb72.exitStub li r3, 1 blr LBB1_2: ;cond_next62.exitStub li r3, 0 blr instead of: _intcoord_cond_next55: LBB1_3: ;cond_next55 lis r2, ha16(LCPI1_0) lfs f0, lo16(LCPI1_0)(r2) fcmpu cr0, f1, f0 bge cr0, LBB1_1 ;bb72.exitStub LBB1_4: ;cond_next55 lis r2, ha16(LCPI1_0) lfs f0, lo16(LCPI1_0)(r2) fcmpu cr0, f1, f0 bnu cr0, LBB1_2 ;cond_next62.exitStub LBB1_1: ;bb72.exitStub li r3, 1 blr LBB1_2: ;cond_next62.exitStub li r3, 0 blr llvm-svn: 31330	2006-10-31 23:06:00 +00:00
Chris Lattner	427301fdae	look through isunordered to inline it into branch blocks. llvm-svn: 31328	2006-10-31 22:37:42 +00:00
Chris Lattner	6f043b90ea	TargetLowering::isOperandValidForConstraint llvm-svn: 31319	2006-10-31 19:41:18 +00:00
Chris Lattner	968f803928	Turn an assert into an error message. This is commonly triggered when we don't support a specific constraint yet. When this happens, print the unsupported constraint. llvm-svn: 31310	2006-10-31 07:33:13 +00:00
Evan Cheng	84a28d4e76	Lower jumptable to BR_JT. The legalizer can lower it to a BRIND or let the target custom lower it. llvm-svn: 31293	2006-10-30 08:00:44 +00:00
Chris Lattner	e60ae823e8	fix Generic/2006-10-29-Crash.ll llvm-svn: 31281	2006-10-29 21:01:20 +00:00
Chris Lattner	f31b9ef458	Fix a load folding issue that Evan noticed: there is no need to export values used by comparisons in the main block. llvm-svn: 31279	2006-10-29 18:23:37 +00:00
Chris Lattner	bba52191fa	split critical edges more carefully and intelligently. In particular, critical edges whose destinations are not phi nodes don't bother us. Also, share split edges, since the split edge can't have a phi. This significantly reduces the complexity of generated code in some cases. llvm-svn: 31274	2006-10-28 19:22:10 +00:00
Chris Lattner	3e6b1c6157	Split all critical edges before isel. This resolves issues with spill code being inserted on unsplit critical edges, which introduces (sometimes large amounts of) partially dead spill code. This also fixes PR925 + CodeGen/Generic/switch-crit-edge-constant.ll llvm-svn: 31260	2006-10-28 17:04:37 +00:00
Chris Lattner	84a035056e	Fix a bug in merged condition handling (CodeGen/Generic/2006-10-27-CondFolding.ll). Add many fewer CFG edges and PHI node entries. If there is a switch which has the same block as multiple destinations, only add that block once as a successor/phi node (in the jumptable case) llvm-svn: 31242	2006-10-27 23:50:33 +00:00
Chris Lattner	b9392fb635	remove debug code llvm-svn: 31233	2006-10-27 21:58:03 +00:00
Chris Lattner	f1b54fd7a5	Codegen cond&cond with two branches. This compiles (f.e.) PowerPC/and-branch.ll to: cmpwi cr0, r4, 4 bgt cr0, LBB1_2 ;UnifiedReturnBlock LBB1_3: ;entry cmplwi cr0, r3, 0 bne cr0, LBB1_2 ;UnifiedReturnBlock instead of: cmpwi cr7, r4, 4 mfcr r2 addic r4, r3, -1 subfe r3, r4, r3 rlwinm r2, r2, 30, 31, 31 or r2, r2, r3 cmplwi cr0, r2, 0 bne cr0, LBB1_2 ;UnifiedReturnBlock LBB1_1: ;cond_true llvm-svn: 31232	2006-10-27 21:54:23 +00:00
Chris Lattner	ed0110b949	Turn conditions like x<Y\|z==q into multiple blocks. This compiles Regression/CodeGen/X86/or-branch.ll into: _foo: subl $12, %esp call L_bar$stub movl 20(%esp), %eax movl 16(%esp), %ecx cmpl $5, %eax jl LBB1_1 #cond_true LBB1_3: #entry testl %ecx, %ecx jne LBB1_2 #UnifiedReturnBlock LBB1_1: #cond_true call L_bar$stub addl $12, %esp ret LBB1_2: #UnifiedReturnBlock addl $12, %esp ret instead of: _foo: subl $12, %esp call L_bar$stub movl 20(%esp), %eax movl 16(%esp), %ecx cmpl $4, %eax setg %al testl %ecx, %ecx setne %cl testb %cl, %al jne LBB1_2 #UnifiedReturnBlock LBB1_1: #cond_true call L_bar$stub addl $12, %esp ret LBB1_2: #UnifiedReturnBlock addl $12, %esp ret And on ppc to: cmpwi cr0, r29, 5 blt cr0, LBB1_1 ;cond_true LBB1_3: ;entry cmplwi cr0, r30, 0 bne cr0, LBB1_2 ;UnifiedReturnBlock instead of: cmpwi cr7, r4, 4 mfcr r2 addic r4, r3, -1 subfe r30, r4, r3 rlwinm r29, r2, 30, 31, 31 and r2, r29, r30 cmplwi cr0, r2, 0 bne cr0, LBB1_2 ;UnifiedReturnBlock llvm-svn: 31230	2006-10-27 21:36:01 +00:00
Reid Spencer	7e80b0b31e	For PR950: Make necessary changes to support DIV -> [SUF]Div. This changes llvm to have three division instructions: signed, unsigned, floating point. The bytecode and assembler are backwards compatible, however. llvm-svn: 31195	2006-10-26 06:15:43 +00:00
Chris Lattner	61bcf9154d	visitSwitchCase knows how to insert conditional branches well. Change visitBr to just call visitSwitchCase, eliminating duplicate logic. llvm-svn: 31167	2006-10-24 18:07:37 +00:00
Chris Lattner	963ddad31a	Generalize CaseBlock a bit more: Rename LHSBB/RHSBB to TrueBB/FalseBB. Allow the RHS value to be null, in which case the LHS is treated as a bool. llvm-svn: 31166	2006-10-24 17:57:59 +00:00
Chris Lattner	3f179d24c6	generalize 'CaseBlock'. It really allows any comparison to be inserted. llvm-svn: 31161	2006-10-24 17:03:35 +00:00
Chris Lattner	4c931502cc	Minor tweak. Instead of generating: movl 32(%esp), %eax cmpl $1, %eax je LBB1_1 #bb LBB1_4: #entry cmpl $2, %eax je LBB1_2 #bb2 jmp LBB1_3 #UnifiedReturnBlock LBB1_1: #bb notice that we would miss the fall through and emit this instead: movl 32(%esp), %eax cmpl $2, %eax je LBB1_2 #bb2 LBB1_4: #entry cmpl $1, %eax jne LBB1_3 #UnifiedReturnBlock LBB1_1: #bb llvm-svn: 31130	2006-10-23 18:38:22 +00:00
Chris Lattner	76a7bc8c55	Fix phi node updating for switches lowered to linear sequences of branches. llvm-svn: 31125	2006-10-22 23:00:53 +00:00
Chris Lattner	4c3ef4782d	disable this code for now, it's not yet safely updating phi nodes llvm-svn: 31124	2006-10-22 22:47:10 +00:00
Chris Lattner	6d6fc26257	Implement PR964 and Regression/CodeGen/Generic/SwitchLowering.ll llvm-svn: 31119	2006-10-22 21:36:53 +00:00
Reid Spencer	e0fc4dfc22	For PR950: This patch implements the first increment for the Signless Types feature. All changes pertain to removing the ConstantSInt and ConstantUInt classes in favor of just using ConstantInt. llvm-svn: 31063	2006-10-20 07:07:24 +00:00
Bill Wendling	be96e1cd09	Partially in response to PR926: insert the newly created machine basic blocks into the basic block list when lowering the switch inst. into a binary tree of if-then statements. This allows the "visitSwitchCase" func to allow for fall-through behavior. llvm-svn: 31057	2006-10-19 21:46:38 +00:00
Jim Laskey	dcb2b83886	Pass AliasAnalysis thru to DAGCombiner. llvm-svn: 30984	2006-10-16 20:52:31 +00:00
Evan Cheng	ab51cf2e78	Merge ISD::TRUNCSTORE to ISD::STORE. Switch to using StoreSDNode. llvm-svn: 30945	2006-10-13 21:14:26 +00:00

737 Commits