llvm-project

Commit Graph

Author	SHA1	Message	Date
Anton Korobeynikov	6bbbc4cbfa	For PR1839: add initial support for __builtin_trap. llvm-gcc part is missed as well as PPC codegen llvm-svn: 46001	2008-01-15 07:02:33 +00:00
Duncan Sands	08c728b519	Remove the assumption that byval has been applied to a pointer to a struct. llvm-svn: 45939	2008-01-13 21:19:59 +00:00
Gordon Henriksen	5180e85675	Enabling the target-independent garbage collection infrastructure by hooking it up to the various compiler pipelines. This doesn't actually add support for any GC algorithms, which means it temporarily breaks a few tests. To be fixed shortly. llvm-svn: 45669	2008-01-07 01:30:38 +00:00
Chris Lattner	a10fff51d9	Rename SSARegMap -> MachineRegisterInfo in keeping with the idea that "machine" classes are used to represent the current state of the code being compiled. Given this expanded name, we can start moving other stuff into it. For now, move the UsedPhysRegs and LiveIn/LoveOuts vectors from MachineFunction into it. Update all the clients to match. This also reduces some needless #includes, such as MachineModuleInfo from MachineFunction. llvm-svn: 45467	2007-12-31 04:13:23 +00:00
Chris Lattner	20421fe936	use simplified operand addition methods. llvm-svn: 45436	2007-12-30 00:57:42 +00:00
Chris Lattner	f3ebc3f3d2	Remove attribution from file headers, per discussion on llvmdev. llvm-svn: 45418	2007-12-29 20:36:04 +00:00
Duncan Sands	e9d8861cdf	Simplify LowerCallTo by using a callsite. llvm-svn: 45198	2007-12-19 09:48:52 +00:00
Duncan Sands	030bce7b83	The C++ exception handling personality function wants to know about calls that cannot throw ('nounwind'): if such a call does throw for some reason then the personality will terminate the program. The distinction between an ordinary call and a nounwind call is that an ordinary call gets an entry in the exception table but a nounwind call does not. This patch sets up the exception table appropriately. One oddity is that I've chosen to bracket nounwind calls with labels (like invokes) - the other choice would have been to bracket ordinary calls with labels. While bracketing ordinary calls is more natural (because bracketing by labels would then correspond exactly to getting an entry in the exception table), I didn't do it because introducing labels impedes some optimizations and I'm guessing that ordinary calls occur more often than nounwind calls. This fixes the gcc filter2 eh test, at least at -O0 (the inliner needs some tweaking at higher optimization levels). llvm-svn: 45197	2007-12-19 07:36:31 +00:00
Duncan Sands	b5a79d0eaa	Make invokes of inline asm legal. Teach codegen how to lower them (with no attempt made to be efficient, since they should only occur for unoptimized code). llvm-svn: 45108	2007-12-17 18:08:19 +00:00
Duncan Sands	38ef3a8ec7	Rather than having special rules like "intrinsics cannot throw exceptions", just mark intrinsics with the nounwind attribute. Likewise, mark intrinsics as readnone/readonly and get rid of special aliasing logic (which didn't use anything more than this anyway). llvm-svn: 44544	2007-12-03 20:06:50 +00:00
Duncan Sands	5208d1ab4a	Add some convenience methods for querying attributes, and use them. llvm-svn: 44403	2007-11-28 17:07:01 +00:00
Duncan Sands	ad0ea2d430	Fix PR1146: parameter attributes are longer part of the function type, instead they belong to functions and function calls. This is an updated and slightly corrected version of Reid Spencer's original patch. The only known problem is that auto-upgrading of bitcode files doesn't seem to work properly (see test/Bitcode/AutoUpgradeIntrinsics.ll). Hopefully a bitcode guru (who might that be? :) ) will fix it. llvm-svn: 44359	2007-11-27 13:23:08 +00:00
Chris Lattner	698b1cb28d	err, no really. llvm-svn: 44352	2007-11-27 06:14:32 +00:00
Chris Lattner	28caf2717a	don't depend on ADL. llvm-svn: 44351	2007-11-27 06:14:12 +00:00
Chris Lattner	f81d5886c6	Several changes: 1) Change the interface to TargetLowering::ExpandOperationResult to take and return entire NODES that need a result expanded, not just the value. This allows us to handle things like READCYCLECOUNTER, which returns two values. 2) Implement (extremely limited) support in LegalizeDAG::ExpandOp for MERGE_VALUES. 3) Reimplement custom lowering in LegalizeDAGTypes in terms of the new ExpandOperationResult. This makes the result simpler and fully general. 4) Implement (fully general) expand support for MERGE_VALUES in LegalizeDAGTypes. 5) Implement ExpandOperationResult support for ARM f64->i64 bitconvert and ARM i64 shifts, allowing them to work with LegalizeDAGTypes. 6) Implement ExpandOperationResult support for X86 READCYCLECOUNTER and FP_TO_SINT, allowing them to work with LegalizeDAGTypes. LegalizeDAGTypes now passes several more X86 codegen tests when enabled and when type legalization in LegalizeDAG is ifdef'd out. llvm-svn: 44300	2007-11-24 07:07:01 +00:00
Anton Korobeynikov	66b91e66ec	Implement necessary bits for flt_rounds gcc builtin. Codegen bits and llvm-gcc support will follow. llvm-svn: 44182	2007-11-15 23:25:33 +00:00
Duncan Sands	d4494352f8	This assertion was bogus. llvm-svn: 44167	2007-11-15 09:54:37 +00:00
Dale Johannesen	4646aa3e33	Make labels work in asm blocks; allow labels as parameters. Rename ValueRefList to ParamList in AsmParser, since its only use is for parameters. llvm-svn: 43734	2007-11-05 21:20:28 +00:00
Dan Gohman	d7917b6248	Add std:: to sort calls. llvm-svn: 43652	2007-11-02 22:24:01 +00:00
Dan Gohman	c981d72d1a	Change illegal uses of ++ to uses of STLExtra.h's next function. llvm-svn: 43651	2007-11-02 22:22:02 +00:00
Duncan Sands	44b8721de8	Executive summary: getTypeSize -> getTypeStoreSize / getABITypeSize. The meaning of getTypeSize was not clear - clarifying it is important now that we have x86 long double and arbitrary precision integers. The issue with long double is that it requires 80 bits, and this is not a multiple of its alignment. This gives a primitive type for which getTypeSize differed from getABITypeSize. For arbitrary precision integers it is even worse: there is the minimum number of bits needed to hold the type (eg: 36 for an i36), the maximum number of bits that will be overwriten when storing the type (40 bits for i36) and the ABI size (i.e. the storage size rounded up to a multiple of the alignment; 64 bits for i36). This patch removes getTypeSize (not really - it is still there but deprecated to allow for a gradual transition). Instead there is: (1) getTypeSizeInBits - a number of bits that suffices to hold all values of the type. For a primitive type, this is the minimum number of bits. For an i36 this is 36 bits. For x86 long double it is 80. This corresponds to gcc's TYPE_PRECISION. (2) getTypeStoreSizeInBits - the maximum number of bits that is written when storing the type (or read when reading it). For an i36 this is 40 bits, for an x86 long double it is 80 bits. This is the size alias analysis is interested in (getTypeStoreSize returns the number of bytes). There doesn't seem to be anything corresponding to this in gcc. (3) getABITypeSizeInBits - this is getTypeStoreSizeInBits rounded up to a multiple of the alignment. For an i36 this is 64, for an x86 long double this is 96 or 128 depending on the OS. This is the spacing between consecutive elements when you form an array out of this type (getABITypeSize returns the number of bytes). This is TYPE_SIZE in gcc. Since successive elements in a SequentialType (arrays, pointers and vectors) need to be aligned, the spacing between them will be given by getABITypeSize. This means that the size of an array is the length times the getABITypeSize. It also means that GEP computations need to use getABITypeSize when computing offsets. Furthermore, if an alloca allocates several elements at once then these too need to be aligned, so the size of the alloca has to be the number of elements multiplied by getABITypeSize. Logically speaking this doesn't have to be the case when allocating just one element, but it is simpler to also use getABITypeSize in this case. So alloca's and mallocs should use getABITypeSize. Finally, since gcc's only notion of size is that given by getABITypeSize, if you want to output assembler etc the same as gcc then getABITypeSize is the size you want. Since a store will overwrite no more than getTypeStoreSize bytes, and a read will read no more than that many bytes, this is the notion of size appropriate for alias analysis calculations. In this patch I have corrected all type size uses except some of those in ScalarReplAggregates, lib/Codegen, lib/Target (the hard cases). I will get around to auditing these too at some point, but I could do with some help. Finally, I made one change which I think wise but others might consider pointless and suboptimal: in an unpacked struct the amount of space allocated for a field is now given by the ABI size rather than getTypeStoreSize. I did this because every other place that reserves memory for a type (eg: alloca) now uses getABITypeSize, and I didn't want to make an exception for unpacked structs, i.e. I did it to make things more uniform. This only effects structs containing long doubles and arbitrary precision integers. If someone wants to pack these types more tightly they can always use a packed struct. llvm-svn: 43620	2007-11-01 20:53:16 +00:00
Bill Wendling	6d15b32c15	- Remove the hacky code that forces a memcpy. Alignment is taken care of in the FE. - Explicitly pass in the alignment of the load & store. - XFAIL 2007-10-23-UnalignedMemcpy.ll because llc has a bug that crashes on unaligned pointers. llvm-svn: 43398	2007-10-26 20:24:42 +00:00
Bill Wendling	38ccabcae9	Fix comment and use the "Size" variable that's already provided. llvm-svn: 43271	2007-10-23 23:36:57 +00:00
Bill Wendling	e3b859298a	If there's an unaligned memcpy to/from the stack, don't lower it. Just call the memcpy library function instead. llvm-svn: 43270	2007-10-23 23:32:40 +00:00
Bill Wendling	6f149c0571	This broke lots. Reverting. llvm-svn: 43264	2007-10-23 22:04:26 +00:00
Bill Wendling	8971440e56	Lowering a memcpy to the stack is killing PPC. The ARM and X86 backends already have their own custom memcpy lowering code. This code needs to be factored out into a target-independent lowering method with hooks to the backend. In the meantime, just call memcpy if we're trying to copy onto a stack. llvm-svn: 43262	2007-10-23 21:30:25 +00:00
Chris Lattner	3ea519e56d	rename ExpandOperation to ExpandOperationResult, as suggested by Duncan llvm-svn: 43177	2007-10-19 15:28:47 +00:00
Rafael Espindola	846c19dd70	Add support for byval function whose argument is not 32 bit aligned. To do this it is necessary to add a "always inline" argument to the memcpy node. For completeness I have also added this node to memmove and memset. I have also added getMem* functions, because the extra argument makes it cumbersome to use getNode and because I get confused by it :-) llvm-svn: 43172	2007-10-19 10:41:11 +00:00
Chris Lattner	579db81f1c	add a new target hook. llvm-svn: 43165	2007-10-19 03:31:45 +00:00
Chris Lattner	3cfb56d489	One mundane change: Change ReplaceAllUsesOfValueWith to optionally take a deleted nodes vector, instead of requiring it. One more significant change: Implement the start of a legalizer that just works on types. This legalizer is designed to run before the operation legalizer and ensure just that the input dag is transformed into an output dag whose operand and result types are all legal, even if the operations on those types are not. This design/impl has the following advantages: 1. When finished, this will significantly reduce the amount of code in LegalizeDAG.cpp. It will remove all the code related to promotion and expansion as well as splitting and scalarizing vectors. 2. The new code is very simple, idiomatic, and modular: unlike LegalizeDAG.cpp, it has no 3000 line long functions. :) 3. The implementation is completely iterative instead of recursive, good for hacking on large dags without blowing out your stack. 4. The implementation updates nodes in place when possible instead of deallocating and reallocating the entire graph that points to some mutated node. 5. The code nicely separates out handling of operations with invalid results from operations with invalid operands, making some cases simpler and easier to understand. 6. The new -debug-only=legalize-types option is very very handy :), allowing you to easily understand what legalize types is doing. This is not yet done. Until the ifdef added to SelectionDAGISel.cpp is enabled, this does nothing. However, this code is sufficient to legalize all of the code in 186.crafty, olden and freebench on an x86 machine. The biggest issues are: 1. Vectors aren't implemented at all yet 2. SoftFP is a mess, I need to talk to Evan about it. 3. No lowering to libcalls is implemented yet. 4. Various operations are missing etc. 5. There are FIXME's for stuff I hax0r'd out, like softfp. Hey, at least it is a step in the right direction :). If you'd like to help, just enable the #ifdef in SelectionDAGISel.cpp and compile code with it. If this explodes it will tell you what needs to be implemented. Help is certainly appreciated. Once this goes in, we can do three things: 1. Add a new pass of dag combine between the "type legalizer" and "operation legalizer" passes. This will let us catch some long-standing isel issues that we miss because operation legalization often obfuscates the dag with target-specific nodes. 2. We can rip out all of the type legalization code from LegalizeDAG.cpp, making it much smaller and simpler. When that happens we can then reimplement the core functionality left in it in a much more efficient and non-recursive way. 3. Once the whole legalizer is non-recursive, we can implement whole-function selectiondags maybe... llvm-svn: 42981	2007-10-15 06:10:22 +00:00
Arnold Schwaighofer	1f0da1fefb	Corrected many typing errors. And removed 'nest' parameter handling for fastcc from X86CallingConv.td. This means that nested functions are not supported for calling convention 'fastcc'. llvm-svn: 42934	2007-10-12 21:30:57 +00:00
Dan Gohman	e3583817ac	Fix some corner cases with vectors in copyToRegs and copyFromRegs. llvm-svn: 42907	2007-10-12 14:33:11 +00:00
Dan Gohman	be37007e64	Add intrinsics for sin, cos, and pow. These use llvm_anyfloat_ty, and so may be overloaded with vector types. And add a testcase for codegen for these. llvm-svn: 42885	2007-10-12 00:01:22 +00:00
Arnold Schwaighofer	9ccea99165	Added tail call optimization to the x86 back end. It can be enabled by passing -tailcallopt to llc. The optimization is performed if the following conditions are satisfied: * caller/callee are fastcc * elf/pic is disabled OR elf/pic enabled + callee is in module + callee has visibility protected or hidden llvm-svn: 42870	2007-10-11 19:40:01 +00:00
Dan Gohman	fadf40a655	In -debug mode, dump SelectionDAGs both before and after the optimization passes. llvm-svn: 42749	2007-10-08 15:12:17 +00:00
Dale Johannesen	4d4e77af8e	Rewrite sqrt and powi to use anyfloat. By popular demand. llvm-svn: 42537	2007-10-02 17:43:59 +00:00
Dale Johannesen	b6c05b1f90	Fix stride computations for long double arrays. llvm-svn: 42508	2007-10-01 23:08:35 +00:00
Dale Johannesen	25a00a63eb	Add sqrt and powi intrinsics for long double. llvm-svn: 42423	2007-09-28 01:08:20 +00:00
Dale Johannesen	b6d56401aa	Enable codegen for long double abs, sin, cos llvm-svn: 42368	2007-09-26 21:10:55 +00:00
Dale Johannesen	98d3a08d8f	Remove the assumption that FP's are either float or double from some of the many places in the optimizers it appears, and do something reasonable with x86 long double. Make APInt::dump() public, remove newline, use it to dump ConstantSDNode's. Allow APFloats in FoldingSet. Expand X86 backend handling of long doubles (conversions to/from int, mostly). llvm-svn: 41967	2007-09-14 22:26:36 +00:00
Duncan Sands	86e0119822	Fold the adjust_trampoline intrinsic into init_trampoline. There is now only one trampoline intrinsic. llvm-svn: 41841	2007-09-11 14:10:23 +00:00
Chris Lattner	33a7f51412	1. Don't call Value::getName(), which is slow. 2. Lower calls to fabs and friends to FABS nodes etc unless the function has internal linkage. Before we wouldn't lower if it had a definition, which is incorrect. This allows us to compile: define double @fabs(double %f) { %tmp2 = tail call double @fabs( double %f ) ret double %tmp2 } into: _fabs: fabs f1, f1 blr llvm-svn: 41805	2007-09-10 21:15:22 +00:00
Rafael Espindola	1de0c86717	Add support for having different alignment for objects on call frames. The x86-64 ABI states that objects passed on the stack have 8 byte alignment. Implement that. llvm-svn: 41768	2007-09-07 14:52:14 +00:00
Anton Korobeynikov	122bf4be7e	Split eh.select / eh.typeid.for intrinsics into i32/i64 versions. This is needed, because they just "mark" register liveins and we let frontend solve type issue, not lowering code :) llvm-svn: 41763	2007-09-07 11:39:35 +00:00
Dale Johannesen	bed9dc423c	Next round of APFloat changes. Use APFloat in UpgradeParser and AsmParser. Change all references to ConstantFP to use the APFloat interface rather than double. Remove the ConstantFP double interfaces. Use APFloat functions for constant folding arithmetic and comparisons. (There are still way too many places APFloat is just a wrapper around host float/double, but we're getting there.) llvm-svn: 41747	2007-09-06 18:13:44 +00:00
Duncan Sands	3c1b7fc056	Fix PR1628. When exception handling is turned on, labels are generated bracketing each call (not just invokes). This is used to generate entries in the exception table required by the C++ personality. However it gets in the way of tail-merging. This patch solves the problem by no longer placing labels around ordinary calls. Instead we generate entries in the exception table that cover every instruction in the function that wasn't covered by an invoke range (the range given by the labels around the invoke). As an optimization, such entries are only generated for parts of the function that contain a call, since for the moment those are the only instructions that can throw an exception [1]. As a happy consequence, we now get a smaller exception table, since the same region can cover many calls. While there, I also implemented folding of invoke ranges - successive ranges are merged when safe to do so. Finally, if a selector contains only a cleanup, there's a special shorthand for it - place a 0 in the call-site entry. I implemented this while there. As a result, the exception table output (excluding filters) is now optimal - it cannot be made smaller [2]. The problem with throw filters is that folding them optimally is hard, and the benefit of folding them is minimal. [1] I tested that having trapping instructions (eg divide by zero) in such a region doesn't cause trouble. [2] It could be made smaller with the help of higher layers, eg by having branch folding reorder basic blocks ending in invokes with the same landing pad so they follow each other. I don't know if this is worth doing. llvm-svn: 41718	2007-09-05 11:27:52 +00:00
Evan Cheng	e0cb6bb8da	Fix for PR1632. EHSELECTION always produces a i32 value. llvm-svn: 41712	2007-09-04 20:39:26 +00:00
Dan Gohman	81b62e1218	Add an option, -view-sunit-dags, for viewing the actual SUnit DAGs used by scheduling. llvm-svn: 41556	2007-08-28 20:32:58 +00:00
Dan Gohman	8dc0b93151	If the source and destination pointers in an llvm.memmove are known to not alias each other, it can be translated as an llvm.memcpy. llvm-svn: 41489	2007-08-27 16:26:13 +00:00
Duncan Sands	ef5a654216	There is an impedance matching problem between LLVM and gcc exception handling: if an exception unwinds through an invoke, then execution must branch to the invoke's unwind target. We previously tried to enforce this by appending a cleanup action to every selector, however this does not always work correctly due to an optimization in the C++ unwinding runtime: if only cleanups would be run while unwinding an exception, then the program just terminates without actually executing the cleanups, as invoke semantics would require. I was hoping this wouldn't be a problem, but in fact it turns out to be the cause of all the remaining failures in the LLVM testsuite (these also fail with -enable-correct-eh-support, so turning on -enable-eh didn't make things worse!). Instead we need to append a full-blown catch-all to the end of each selector. The correct way of doing this depends on the personality function, i.e. it is language dependent, so can only be done by gcc. Thus this patch which generalizes the eh.selector intrinsic so that it can handle all possible kinds of action table entries (before it didn't accomodate cleanups): now 0 indicates a cleanup, and filters have to be specified using the number of type infos plus one rather than the number of type infos. Related gcc patches will cause Ada to pass a cleanup (0) to force the selector to always fire, while C++ will use a C++ catch-all (null). llvm-svn: 41484	2007-08-27 15:47:50 +00:00
Chris Lattner	d8c9cb9182	rename isOperandValidForConstraint to LowerAsmOperandForConstraint, changing the interface to allow for future changes. llvm-svn: 41384	2007-08-25 00:47:38 +00:00
Anton Korobeynikov	97cdac8d19	Perform correct codegen for eh_dwarf_cfa intrinsic. llvm-svn: 41316	2007-08-23 07:21:06 +00:00
Rafael Espindola	9c3d20d823	Partial implementation of calling functions with byval arguments: ) The needed information is propagated to the DAG ) The X86-64 backend detects it and aborts llvm-svn: 41179	2007-08-20 15:18:24 +00:00
Evan Cheng	95667c532c	- If a dynamic_stackalloc alignment requirement is <= stack alignment, then the alignment argument is ignored. - Always round up the size of the allocation to multiples of stack alignment to ensure the stack ptr is never left in an invalid state after a dynamic_stackalloc. llvm-svn: 41132	2007-08-16 23:46:29 +00:00
Dan Gohman	a17799a3bd	Fix EXTRACT_ELEMENT, EXTRACT_SUBVECTOR, and EXTRACT_VECTOR_ELT to use an intptr ValueType instead of i32 for the index operand in getCopyToParts. llvm-svn: 40987	2007-08-10 14:59:38 +00:00
Rafael Espindola	66011c17d5	propagate struct size and alignment of byval arguments to the DAG llvm-svn: 40986	2007-08-10 14:44:42 +00:00
Chandler Carruth	7132e00de7	This is the patch to provide clean intrinsic function overloading support in LLVM. It cleans up the intrinsic definitions and generally smooths the process for more complicated intrinsic writing. It will be used by the upcoming atomic intrinsics as well as vector and float intrinsics in the future. This also changes the syntax for llvm.bswap, llvm.part.set, llvm.part.select, and llvm.ct* intrinsics. They are automatically upgraded by both the LLVM ASM reader and the bitcode reader. The test cases have been updated, with special tests added to ensure the automatic upgrading is supported. llvm-svn: 40807	2007-08-04 01:51:18 +00:00
Chris Lattner	3ffe7187db	don't redefine a parameter llvm-svn: 40748	2007-08-02 18:08:16 +00:00
Dan Gohman	4ff9fb14f6	Fix a bug in getCopyFromParts turned up in the testcase for PR1132. llvm-svn: 40598	2007-07-30 19:09:17 +00:00
Duncan Sands	644f917358	Support for trampolines, except for X86 codegen which is still under discussion. llvm-svn: 40549	2007-07-27 12:58:54 +00:00
Dan Gohman	f0bb12848f	Add const to CanBeFoldedBy, CheckAndMask, and CheckOrMask. llvm-svn: 40480	2007-07-24 23:00:27 +00:00
Dan Gohman	a7b65c30a3	It's not necessary to do rounding for alloca operations when the requested alignment is equal to the stack alignment. llvm-svn: 40004	2007-07-18 16:29:46 +00:00
Dan Gohman	06c60b6032	Fix comments about vectors to use the current wording. llvm-svn: 39921	2007-07-16 14:29:03 +00:00
Anton Korobeynikov	383a324735	Long live the exception handling! This patch fills the last necessary bits to enable exceptions handling in LLVM. Currently only on x86-32/linux. In fact, this patch adds necessary intrinsics (and their lowering) which represent really weird target-specific gcc builtins used inside unwinder. After corresponding llvm-gcc patch will land (easy) exceptions should be more or less workable. However, exceptions handling support should not be thought as 'finished': I expect many small and not so small glitches everywhere. llvm-svn: 39855	2007-07-14 14:06:15 +00:00
Dale Johannesen	2182f06f2d	Skeleton of post-RA scheduler; doesn't do anything yet. Change name of -sched option and DEBUG_TYPE to pre-RA-sched; adjust testcases. llvm-svn: 39816	2007-07-13 17:13:54 +00:00
Dan Gohman	f8f531bf69	Change getCopyToParts and getCopyFromParts to always use target-endian register ordering, for both physical and virtual registers. Update the PPC target lowering for calls to expect registers for the call result to already be in target order. llvm-svn: 38471	2007-07-09 20:59:04 +00:00
Duncan Sands	9d97420473	The exception handling intrinsics return values, so must be lowered to a value, not nothing at all. Subtle point: I made eh_selector return 0 and eh_typeid_for return 1. This means that only cleanups (destructors) will be run as the exception unwinds [if eh_typeid_for returned 0 then it would be as if the first catch always matched, and the corresponding handler would be run], which is probably want you want in the CBE. llvm-svn: 37947	2007-07-06 14:46:23 +00:00
Rafael Espindola	b567e3ffb0	Add the byval attribute llvm-svn: 37940	2007-07-06 10:57:03 +00:00
Duncan Sands	003c0b1f90	Remove propagateEHRegister in favour of a more limited fix, that is adequate while PR1508 remains unresolved. llvm-svn: 37938	2007-07-06 09:18:59 +00:00
Duncan Sands	81df18a50a	Remove ExtractGlobalVariable - use StripPointerCasts instead. llvm-svn: 37937	2007-07-06 09:10:03 +00:00
Evan Cheng	fc7010d962	Workaround of getCopyToRegs and getCopyFromRegs bugs for big-endian machines. llvm-svn: 37935	2007-07-06 01:47:35 +00:00
Dan Gohman	d258e80583	Add a parameter to getCopyToParts and getCopyFromParts to specify whether endian swapping should be done, and update the code to use it. This fixes some register ordering issues on big-endian systems, such as PowerPC, introduced by the recent illegal by-val arguments changes. llvm-svn: 37921	2007-07-05 20:12:34 +00:00
Duncan Sands	fe80638417	Extend eh.selector to support both catches and filters. Drop the eh.filter intrinsic. llvm-svn: 37875	2007-07-04 20:52:51 +00:00
Dale Johannesen	a2b3c175db	Fix for PR 1505 (and 1489). Rewrite X87 register model to include f32 variants. Some factoring improvments forthcoming. llvm-svn: 37847	2007-07-03 00:53:03 +00:00
Dan Gohman	533dd16a7f	Replace ExpandScalarFormalArgs and ExpandScalarCallArgs with the newly refactored getCopyFromParts and getCopyToParts, which are more general. This effectively adds support for lowering illegal by-val vector call arguments. llvm-svn: 37843	2007-07-02 16:18:06 +00:00
Evan Cheng	fa68d069ad	Only do FNEG xform when the vector type is a floating point type. llvm-svn: 37818	2007-06-29 21:44:35 +00:00
David Greene	4c1e6f3804	Remove unnecessary attributions in comments. llvm-svn: 37799	2007-06-29 03:42:23 +00:00
David Greene	9468bfd932	Fix reference to cached end iterator invalidated by an erase operation. Uncovered by _GLIBCXX_DEBUG. llvm-svn: 37795	2007-06-29 02:49:11 +00:00
Dan Gohman	7867793aff	Add new TargetLowering code to provide the final register type that an illegal value type will be transformed to, for code that needs the register type after all transformations instead of just after the first transformation. Factor out the code that uses this information to do copy-from-regs and copy-to-regs for various purposes into separate functions so that they are done consistently. llvm-svn: 37781	2007-06-28 23:29:44 +00:00
Evan Cheng	77f541ddfd	Partial fix for PR1502: If a EH register is needed in a successor of landing pad, add it as livein to all the blocks in the paths between the landing pad and the specified block. llvm-svn: 37763	2007-06-27 18:45:32 +00:00
Dan Gohman	7139a48057	Use getVectorTypeBreakdown in FunctionLoweringInfo::CreateRegForValue to compute the number and type of registers needed for vector values instead of computing it manually. This fixes PR1529. llvm-svn: 37755	2007-06-27 14:34:07 +00:00
Dan Gohman	a866514528	Generalize MVT::ValueType and associated functions to be able to represent extended vector types. Remove the special SDNode opcodes used for pre-legalize vector operations, and the special MVT::Vector type used with them. Adjust lowering and legalize to work with the normal SDNode kinds instead, and to use the normal MVT functions to work with vector types instead of using the two special operands that the pre-legalize nodes held. This allows pre-legalize and post-legalize DAGs, and the code that operates on them, to be more consistent. Pre-legalize vector operators can be handled more consistently with scalar operators. And, -view-dag-combine1-dags and -view-legalize-dags now look prettier for vector code. llvm-svn: 37719	2007-06-25 16:23:39 +00:00
Dan Gohman	309d3d51b3	Move ComputeMaskedBits, MaskedValueIsZero, and ComputeNumSignBits from TargetLowering to SelectionDAG so that they have more convenient access to the current DAG, in preparation for the ValueType routines being changed from standalone functions to members of SelectionDAG for the pre-legalize vector type changes. llvm-svn: 37704	2007-06-22 14:59:07 +00:00
Dan Gohman	04deef3a49	Rename TargetLowering::getNumElements and friends to TargetLowering::getNumRegisters and similar, to avoid confusion with the actual number of elements for vector types. llvm-svn: 37687	2007-06-21 14:42:22 +00:00
Tanya Lattner	e199f97fa8	Codegen support (stripped out) for the annotate attribute. llvm-svn: 37608	2007-06-15 22:26:58 +00:00
Chris Lattner	f852e339b6	Fix CodeGen/X86/inline-asm-x-scalar.ll:test4, by retaining regclass info for tied register constraints. llvm-svn: 37601	2007-06-15 19:11:01 +00:00
Duncan Sands	92bf2c628c	Workaround for PR1508. llvm-svn: 37597	2007-06-15 19:04:19 +00:00
Dan Gohman	5c4413120f	Rename MVT::getVectorBaseType to MVT::getVectorElementType. llvm-svn: 37579	2007-06-14 22:58:02 +00:00
Duncan Sands	7413736a7e	Only correctly lower exception handing intrinsics if exception handling is turned on. Likewise for scanning of invokes to mark landing pads. llvm-svn: 37570	2007-06-13 16:53:21 +00:00
Dan Gohman	26455c4ae0	Introduce new SelectionDAG node opcodes VEXTRACT_SUBVECTOR and VCONCAT_VECTORS. Use these for CopyToReg and CopyFromReg legalizing in the case that the full register is to be split into subvectors instead of scalars. This replaces uses of VBIT_CONVERT to present values as vector-of-vector types in order to make whole subvectors accessible via BUILD_VECTOR and EXTRACT_VECTOR_ELT. This is in preparation for adding extended ValueType values, where having vector-of-vector types is undesirable. llvm-svn: 37569	2007-06-13 15:12:02 +00:00
Dan Gohman	cbd51c8b60	When creating CopyFromReg nodes, always use legal types. And use the correct types for the result vector, even though it is currently bitcasted to a different type immediately. llvm-svn: 37568	2007-06-13 14:55:16 +00:00
Duncan Sands	97f7236e70	The fix that was applied for PR1224 stops the compiler crashing but breaks exception handling. The problem described in PR1224 is that invoke is a terminator that can produce a value. The value may be needed in other blocks. The code that writes to registers values needed in other blocks runs before terminators are lowered (in this case invoke) so asserted because the value was not yet available. The fix that was applied was to do invoke lowering earlier, before writing values to registers. The problem this causes is that the code to copy values to registers can be output after the invoke call. If an exception is raised and control is passed to the landing pad then this copy-code will never execute. If the value is needed in some code path reached via the landing pad then that code will get something bogus. So revert the original fix and simply skip invoke values in the general copying to registers code. Instead copy the invoke value to a register in the invoke lowering code. llvm-svn: 37567	2007-06-13 05:51:31 +00:00
Dale Johannesen	9a4d987a5f	Do not change the size of function arguments. PR 1489. llvm-svn: 37496	2007-06-07 21:07:15 +00:00
Duncan Sands	61166501a1	Additional fix for PR1422: make sure the landing pad label is placed in the correct machine basic block - do not rely on the eh.exception intrinsic being in the landing pad: the loop optimizers can move it out. llvm-svn: 37463	2007-06-06 10:05:18 +00:00
Duncan Sands	c063f5f362	Integrate exception filter support and exception catch support. This simplifies the code in DwarfWriter, allows for multiple filters and makes it trivial to specify filters accompanied by cleanups or catch-all specifications (see next patch). What a deal! Patch blessed by Anton. llvm-svn: 37398	2007-06-02 16:53:42 +00:00
Duncan Sands	706421e712	Since TypeInfos are passed as i8 pointers, a NULL TypeInfo should be passed as a null i8 pointer not as a 0 i32. llvm-svn: 37383	2007-06-01 08:18:30 +00:00
Dan Gohman	30978078bf	Minor comment cleanups. llvm-svn: 37321	2007-05-24 14:36:04 +00:00
Anton Korobeynikov	3b327826db	Mark all calls as "could throw", when exceptions are enabled. Emit necessary LP info too. This fixes PR1439 llvm-svn: 37311	2007-05-23 11:08:31 +00:00
Dan Gohman	1796f1f8e9	Qualify several calls to functions in the MVT namespace, for consistency. llvm-svn: 37230	2007-05-18 17:52:13 +00:00
Chris Lattner	c7596efdad	Fix some subtle issues handling immediate values. This fixes test/CodeGen/ARM/2007-05-14-InlineAsmCstCrash.ll llvm-svn: 37069	2007-05-15 01:33:58 +00:00
Anton Korobeynikov	192d09c2d9	Do not assert, when case range split metric is zero and JTs are not allowed: just emit binary tree in this case. This fixes PR1403. llvm-svn: 36959	2007-05-09 20:07:08 +00:00
Duncan Sands	671e8c4444	Parameter attributes on invoke calls were being lost due to the wrong attribute index being used. Fix proposed by Anton Korobeynikov, who asked me to implement and commit it for him. This is PR1398. llvm-svn: 36906	2007-05-07 20:49:28 +00:00
Anton Korobeynikov	a8fd7fdc25	Detabify llvm-svn: 36891	2007-05-06 20:14:21 +00:00
Duncan Sands	4cb9eb81ef	A bitcast of a global variable may have been constant folded to a GEP - handle this case too. llvm-svn: 36745	2007-05-04 17:12:26 +00:00
Devang Patel	8c78a0bff0	Drop 'const' llvm-svn: 36662	2007-05-03 01:11:54 +00:00
Anton Korobeynikov	11940fbba3	Properly set arguments bitwidth of EHSELECT node llvm-svn: 36654	2007-05-02 22:15:48 +00:00
Devang Patel	e95c6ad802	Use 'static const char' instead of 'static const int'. Due to darwin gcc bug, one version of darwin linker coalesces static const int, which defauts PassID based pass identification. llvm-svn: 36652	2007-05-02 21:39:20 +00:00
Devang Patel	09f162ca6a	Do not use typeinfo to identify pass in pass manager. llvm-svn: 36632	2007-05-01 21:15:47 +00:00
Chris Lattner	8cfd33b647	Continue refactoring inline asm code. If there is an earlyclobber output register, preallocate all input registers and the early clobbered output. This fixes PR1357 and CodeGen/PowerPC/2007-04-30-InlineAsmEarlyClobber.ll llvm-svn: 36599	2007-04-30 21:11:17 +00:00
Chris Lattner	4333f8b1cf	refactor GetRegistersForValue to take OpInfo as an argument instead of various pieces of it. No functionality change. llvm-svn: 36592	2007-04-30 17:29:31 +00:00
Chris Lattner	ef07332504	refactor some code, no functionality change llvm-svn: 36590	2007-04-30 17:16:27 +00:00
Chris Lattner	412d61af43	generalize aggregate handling llvm-svn: 36568	2007-04-29 18:58:03 +00:00
Chris Lattner	401d8db381	memory operands that have a direct operand should have their stores created before the copies into physregs are done. This avoids having flag operands skip the store, causing cycles in the dag at sched time. This fixes infinite loops on these tests: test/CodeGen/Generic/2007-04-08-MultipleFrameIndices.ll for PR1308 test/CodeGen/PowerPC/2007-01-29-lbrx-asm.ll test/CodeGen/PowerPC/2007-01-31-InlineAsmAddrMode.ll test/CodeGen/X86/2006-07-12-InlineAsmQConstraint.ll for PR828 llvm-svn: 36547	2007-04-28 21:12:06 +00:00
Chris Lattner	de339fa55d	eliminate more redundant constraint type analysis llvm-svn: 36546	2007-04-28 21:03:16 +00:00
Chris Lattner	b2e55562ed	merge constraint type analysis stuff together. llvm-svn: 36545	2007-04-28 21:01:43 +00:00
Chris Lattner	d7e3b6c442	Significant refactoring of the inline asm stuff, to support future changes. No functionality change. llvm-svn: 36544	2007-04-28 20:49:53 +00:00
Chris Lattner	1deacd61f4	memory inputs to an inline asm are required to have an address available. If the operand is not already an indirect operand, spill it to a constant pool entry or a stack slot. This fixes PR1356 and CodeGen/X86/2007-04-27-InlineAsm-IntMemInput.ll llvm-svn: 36536	2007-04-28 06:42:38 +00:00
Chris Lattner	d102ed0ac6	Fix CodeGen/Generic/2007-04-27-LargeMemObject.ll and CodeGen/Generic/2007-04-27-InlineAsm-X-Dest.ll llvm-svn: 36534	2007-04-28 06:08:13 +00:00
Chris Lattner	4df3e8093b	Fix this to match change to InlineAsm class. llvm-svn: 36524	2007-04-28 04:05:59 +00:00
Chris Lattner	784fe9dbbb	improve EH global handling, patch by Duncan Sands. llvm-svn: 36499	2007-04-27 01:20:11 +00:00
Chris Lattner	8131ab7c0f	enable Anton's shift/and switch lowering stuff! It now passes ppc bootstrap successfully! woohoo... llvm-svn: 36496	2007-04-26 21:09:43 +00:00
Anton Korobeynikov	d7ae7f1659	Fixx off-by-one bug, which prevents llvm-gcc bootstrap on ppc32 llvm-svn: 36490	2007-04-26 20:44:04 +00:00
Evan Cheng	15f269afa3	This was lefted out. Fixed sumarray-dbl. llvm-svn: 36445	2007-04-25 18:33:21 +00:00
Chris Lattner	cb0ed0cfbd	allow support for 64-bit stack objects llvm-svn: 36420	2007-04-25 04:08:28 +00:00
Bill Wendling	47917b697f	Assertion when using a 1-element vector for an add operation. Get the real vector type in this case. llvm-svn: 36402	2007-04-24 21:13:23 +00:00
Scott Michel	4cfa616cee	Use '-1U' where '-1UL' is obvious overkill, eliminating gcc warnings about tests always being true in the process. llvm-svn: 36387	2007-04-24 01:24:20 +00:00
Christopher Lamb	8af6d5896f	PR400 phase 2. Propagate attributed load/store information through DAGs. llvm-svn: 36356	2007-04-22 23:15:30 +00:00
Reid Spencer	0c1349e6bc	Revert Christopher Lamb's load/store alignment changes. llvm-svn: 36309	2007-04-21 18:36:27 +00:00
Christopher Lamb	bff50208c8	add support for alignment attributes on load/store instructions llvm-svn: 36301	2007-04-21 08:16:25 +00:00
Chris Lattner	6bd7b7b30b	disable switch lowering using shift/and. It still breaks ppc bootstrap for some reason. :( Will investigate. llvm-svn: 36011	2007-04-14 19:39:41 +00:00
Anton Korobeynikov	8a1a84f96e	Fix PR1325: Case range optimization was performed in the case it shouldn't. Also fix some "latent" bug on 64-bit platforms llvm-svn: 35990	2007-04-14 13:25:55 +00:00
Chris Lattner	7196f09edc	disable shift/and lowering to work around PR1325 for now. llvm-svn: 35985	2007-04-14 02:26:56 +00:00
Anton Korobeynikov	e288040abf	Fix PR1323 : we haven't updated phi nodes in good manner :) llvm-svn: 35963	2007-04-13 06:53:51 +00:00
Chris Lattner	5111499136	the result of an inline asm copy can be an arbitrary VT that the register class supports. In the case of vectors, this means we often get the wrong type (e.g. we get v4f32 instead of v8i16). Make sure to convert the vector result to the right type. This fixes CodeGen/X86/2007-04-11-InlineAsmVectorResult.ll llvm-svn: 35944	2007-04-12 06:00:20 +00:00
Reid Spencer	c6251a7dfd	For PR1284: Implement the "part_set" intrinsic. llvm-svn: 35938	2007-04-12 02:48:46 +00:00
Reid Spencer	a472f66dd0	For PR1146: Put the parameter attributes in their own ParamAttr name space. Adjust the rest of llvm as a result. llvm-svn: 35877	2007-04-11 02:44:20 +00:00
Chris Lattner	f269d84ca0	apparently some people commit without building the tree, or they forget to commit a LOT of files. llvm-svn: 35858	2007-04-10 03:20:39 +00:00
Jeff Cohen	e0bbbd3774	No longer needed. llvm-svn: 35850	2007-04-09 23:42:32 +00:00
Anton Korobeynikov	da964a2852	Use integer log for metric calculation llvm-svn: 35834	2007-04-09 21:57:03 +00:00
Jeff Cohen	0475f3b4e9	Unbreak VC++ build. llvm-svn: 35817	2007-04-09 14:32:59 +00:00
Anton Korobeynikov	506eaf7915	Next stage into switch lowering refactoring 1. Fix some bugs in the jump table lowering threshold 2. Implement much better metric for optimal pivot selection 3. Tune thresholds for different lowering methods 4. Implement shift-and trick for lowering small (<machine word length) cases with few destinations. Good testcase will follow. llvm-svn: 35816	2007-04-09 12:31:58 +00:00
Reid Spencer	71b79e3d99	For PR1146: Adapt handling of parameter attributes to use the new ParamAttrsList class. llvm-svn: 35814	2007-04-09 06:17:21 +00:00
Chris Lattner	7b2decfa0a	implement CodeGen/X86/inline-asm-x-scalar.ll:test3 llvm-svn: 35802	2007-04-09 05:31:20 +00:00
Chris Lattner	b49917da92	Fix PR1316 llvm-svn: 35783	2007-04-09 00:33:58 +00:00
Chris Lattner	e55ecfb870	Fix for CodeGen/X86/2007-04-08-InlineAsmCrash.ll and PR1314 llvm-svn: 35779	2007-04-08 22:23:26 +00:00
Chris Lattner	1c741e95d3	minor comment fix llvm-svn: 35696	2007-04-06 17:47:14 +00:00
Reid Spencer	85460acfbf	Change the bit_part_select (non)implementation from "return 0" to abort. llvm-svn: 35679	2007-04-05 01:20:18 +00:00
Reid Spencer	cce90f55ed	Implement the llvm.bit.part_select.iN.iN.iN overloaded intrinsic. llvm-svn: 35678	2007-04-04 23:48:25 +00:00
Anton Korobeynikov	915e61736b	Properly emit range comparisons for switch cases, where neighbour cases go to the same destination. Now we're producing really good code for switch-lower-feature.ll testcase llvm-svn: 35672	2007-04-04 21:14:49 +00:00
Reid Spencer	3a0843e734	For PR1297: Adjust for changes in the bit counting intrinsics. They all return i32 now so we have to trunc/zext the DAG node accordingly. llvm-svn: 35546	2007-04-01 07:34:11 +00:00
Chris Lattner	f6a6d3c8b0	move a bunch of code out of the sdisel pass into its own opt pass "codegenprepare". llvm-svn: 35529	2007-03-31 04:18:03 +00:00
Evan Cheng	4388043b25	Scale 1 is always ok. llvm-svn: 35407	2007-03-28 01:55:52 +00:00
Evan Cheng	07c42d43a2	GEP index sinking fixes: 1) Take address scale into consideration. e.g. i32* -> scale 4. 2) Examine all the users of GEP. 3) Generalize to inter-block GEP's (no longer uses loopinfo). 4) Don't do xform if GEP has other variable index(es). llvm-svn: 35403	2007-03-28 01:49:39 +00:00
Anton Korobeynikov	37a0bfe128	Remove dead code llvm-svn: 35380	2007-03-27 12:05:48 +00:00
Anton Korobeynikov	3a9d68181a	Split big monster into small helpers. No functionality change. llvm-svn: 35379	2007-03-27 11:29:11 +00:00
Evan Cheng	c42406b5ad	SDISel does not preserve all, it changes CFG and other info. llvm-svn: 35376	2007-03-27 00:53:36 +00:00
Anton Korobeynikov	7037826c86	First step of switch lowering refactoring: perform worklist-driven strategy, emit JT's where possible. llvm-svn: 35338	2007-03-25 15:07:15 +00:00
Chris Lattner	77f0479833	Implement support for vector operands to inline asm, implementing CodeGen/X86/2007-03-24-InlineAsmVectorOp.ll llvm-svn: 35332	2007-03-25 05:00:54 +00:00
Chris Lattner	d685514e2e	switch TargetLowering::getConstraintType to take the entire constraint, not just the first letter. No functionality change. llvm-svn: 35322	2007-03-25 02:14:49 +00:00
Dan Gohman	dcb291faa4	Change uses of Function::front to Function::getEntryBlock for readability. llvm-svn: 35265	2007-03-22 16:38:57 +00:00
Evan Cheng	550cf0369c	Minor bug. llvm-svn: 35219	2007-03-20 19:32:11 +00:00
Evan Cheng	a2465dfc07	Use SmallSet instead of std::set. llvm-svn: 35133	2007-03-17 08:53:30 +00:00
Evan Cheng	be22235790	If sdisel has decided to sink GEP index expression into any BB. Replace all uses in that BB. llvm-svn: 35132	2007-03-17 08:22:49 +00:00
Evan Cheng	c5bc763f50	Turn on GEP index sinking by default. llvm-svn: 35127	2007-03-16 18:32:30 +00:00
Evan Cheng	0a9d0cabaf	Stupid bug. llvm-svn: 35126	2007-03-16 17:50:20 +00:00
Evan Cheng	009ea54262	Sink a binary expression into its use blocks if it is a loop invariant computation used as GEP indexes and if the expression can be folded into target addressing mode of GEP load / store use types. llvm-svn: 35123	2007-03-16 08:46:27 +00:00
Chris Lattner	ce8aba03ee	implement support for floating point constants used as inline asm memory operands. llvm-svn: 35033	2007-03-08 22:29:47 +00:00
Chris Lattner	b7bc3f2d30	make this fail even in non-assert builds. llvm-svn: 35025	2007-03-08 07:07:03 +00:00
Anton Korobeynikov	ed4b303c10	Refactoring of formal parameter flags. Enable properly use of zext/sext/aext stuff. llvm-svn: 35008	2007-03-07 16:25:09 +00:00
Anton Korobeynikov	f0b9316552	Enumerate SDISel formal parameter attributes. Make use of new enumeration. llvm-svn: 34960	2007-03-06 06:10:33 +00:00
Jeff Cohen	b622c11f77	Unbreak VC++ build. llvm-svn: 34917	2007-03-05 00:00:42 +00:00
Jim Laskey	d5453d7b56	Lower eh filter intrinsic. llvm-svn: 34802	2007-03-01 20:24:30 +00:00
Jim Laskey	cf465fcebc	MERGE_VALUES unnecessary. llvm-svn: 34750	2007-02-28 18:37:04 +00:00
Chris Lattner	ab5d0ac02c	track signedness of formal argument, though we have a fixme here. llvm-svn: 34620	2007-02-26 02:56:58 +00:00
Jim Laskey	14059d958a	Fix for PR1224. llvm-svn: 34610	2007-02-25 21:43:59 +00:00
Chris Lattner	8c504cf9a0	optimize duplicate ValueMap lookups llvm-svn: 34599	2007-02-25 18:40:32 +00:00
Jim Laskey	e1d1c0590f	Deal with cases when MMI is not requested. llvm-svn: 34556	2007-02-24 09:45:44 +00:00
Jim Laskey	31fef788eb	Handle improper cast. llvm-svn: 34535	2007-02-23 21:45:01 +00:00
Jim Laskey	44c37e7dbf	Tighten up error checking of args. llvm-svn: 34493	2007-02-22 16:10:05 +00:00
Jim Laskey	504e99479c	Handle lowering invoke to call correctly. llvm-svn: 34492	2007-02-22 15:38:06 +00:00
Jim Laskey	4b37a4c712	Selection and lowering for exception handling. llvm-svn: 34481	2007-02-21 22:53:45 +00:00
Reid Spencer	09575bac2e	For PR1195: Change use of "packed" term to "vector" in comments, strings, variable names, etc. llvm-svn: 34300	2007-02-15 03:39:18 +00:00
Reid Spencer	d84d35ba70	For PR1195: Rename PackedType -> VectorType, ConstantPacked -> ConstantVector, and PackedTyID -> VectorTyID. No functional changes. llvm-svn: 34293	2007-02-15 02:26:10 +00:00
Chris Lattner	ab1812f806	fix a warning llvm-svn: 34272	2007-02-14 07:34:56 +00:00
Chris Lattner	1cf84d2745	Refix CodeGen/Generic/switch-lower.ll. In contrast to my previous patch, this doesn't miscompile lots of programs :) llvm-svn: 34268	2007-02-14 07:18:16 +00:00
Chris Lattner	945e437c65	Generalize TargetData strings, to support more interesting forms of data. Patch by Scott Michel. llvm-svn: 34266	2007-02-14 05:52:17 +00:00
Chris Lattner	2fbff4d2dc	revert my previous switch lowering change, which miscompiles a few programs. This will break a dj test until I have time to investigate. llvm-svn: 34247	2007-02-13 20:09:07 +00:00
Lauro Ramos Venancio	abde3cc16c	Add a space between // and the comment. llvm-svn: 34244	2007-02-13 18:10:13 +00:00
Lauro Ramos Venancio	9956dcffbe	Add "original alignment" to function arguments flags. llvm-svn: 34240	2007-02-13 13:50:08 +00:00
Chris Lattner	9056bae3be	Fix switch lowering to order cases in zext order, which is how we emit the comparisons. This fixes an infinite loop on CodeGen/Generic/switch-lower.ll and PR1197 llvm-svn: 34216	2007-02-13 01:05:56 +00:00
Chris Lattner	c473d8e431	Privatize StructLayout::MemberOffsets, adding an accessor llvm-svn: 34156	2007-02-10 19:55:17 +00:00
Evan Cheng	276b44b0f9	Add function live-ins to entry block live-in set. llvm-svn: 34112	2007-02-10 02:43:39 +00:00
Evan Cheng	de6083463d	Rename some variables to avoid confusion with SelectionDAGISel::BB. llvm-svn: 34110	2007-02-10 01:08:18 +00:00
Chris Lattner	289aa4495c	Switch VAlueMap from std::map to DenseMap. llvm-svn: 33863	2007-02-04 01:35:11 +00:00
Chris Lattner	79084305ee	Switch NodeMap from std::map to DenseMap, this speeds up isel by 2.3% llvm-svn: 33862	2007-02-04 01:31:47 +00:00
Reid Spencer	2341c22ec7	Changes to support making the shift instructions be true BinaryOperators. This feature is needed in order to support shifts of more than 255 bits on large integer types. This changes the syntax for llvm assembly to make shl, ashr and lshr instructions look like a binary operator: shl i32 %X, 1 instead of shl i32 %X, i8 1 Additionally, this should help a few passes perform additional optimizations. llvm-svn: 33776	2007-02-02 02:16:23 +00:00
Chris Lattner	296a83cefb	Fit in 80 columns llvm-svn: 33745	2007-02-01 04:55:59 +00:00
Chris Lattner	e3eeb24a86	Emit a better assertion message for PR1133 llvm-svn: 33736	2007-02-01 01:21:12 +00:00
Reid Spencer	5301e7c605	For PR1136: Rename GlobalVariable::isExternal as isDeclaration to avoid confusion with external linkage types. llvm-svn: 33663	2007-01-30 20:08:39 +00:00
Chris Lattner	d27f95e08d	add initial support for handling inline asms with multiple constraints. This doesn't do the "right thing" but will probably work in most cases. This implements CodeGen/PowerPC/2007-01-29-lbrx-asm.ll. llvm-svn: 33643	2007-01-29 23:45:14 +00:00
Nate Begeman	eda5997cc8	Finish off bug 680, allowing targets to custom lower frame and return address nodes. llvm-svn: 33636	2007-01-29 22:58:52 +00:00
Anton Korobeynikov	06f7d4bec7	Arguments are counting from 1. not from 0. Maybe we should change numbering somehow? E.g. make return argument the last? llvm-svn: 33606	2007-01-28 18:01:49 +00:00
Anton Korobeynikov	9fa3839d29	More cleanup llvm-svn: 33605	2007-01-28 16:04:40 +00:00
Anton Korobeynikov	037c867b54	Propagate changes from my local tree. This patch includes: 1. New parameter attribute called 'inreg'. It has meaning "place this parameter in registers, if possible". This is some generalization of gcc's regparm(n) attribute. It's currently used only in X86-32 backend. 2. Completely rewritten CC handling/lowering code inside X86 backend. Merged stdcall + c CCs and fastcall + fast CC. 3. Dropped CSRET CC. We cannot add struct return variant for each target-specific CC (e.g. stdcall + csretcc and so on). 4. Instead of CSRET CC introduced 'sret' parameter attribute. Setting in on first attribute has meaning 'This is hidden pointer to structure return. Handle it gently'. 5. Fixed small bug in llvm-extract + add new feature to FunctionExtraction pass, which relinks all internal-linkaged callees from deleted function to external linkage. This will allow further linking everything together. NOTEs: 1. Documentation will be updated soon. 2. llvm-upgrade should be improved to translate csret => sret. Before this, there will be some unexpected test fails. llvm-svn: 33597	2007-01-28 13:31:35 +00:00
Jim Laskey	c56315c2b5	Change the MachineDebugInfo to MachineModuleInfo to better reflect usage for debugging and exception handling. llvm-svn: 33550	2007-01-26 21:22:28 +00:00
Jim Laskey	f9e5445ed4	Make LABEL a builtin opcode. llvm-svn: 33537	2007-01-26 14:34:52 +00:00
Reid Spencer	2eadb5310d	For PR970: Clean up handling of isFloatingPoint() and dealing with PackedType. Patch by Gordon Henriksen! llvm-svn: 33415	2007-01-21 00:29:26 +00:00
Chris Lattner	50ee0e40e5	Teach TargetData to handle 'preferred' alignment for each target, and use these alignment amounts to align scalars when we can. Patch by Scott Michel! llvm-svn: 33409	2007-01-20 22:35:55 +00:00
Zhou Sheng	75b871fb1e	For PR1043: Merge ConstantIntegral and ConstantBool into ConstantInt. Remove ConstantIntegral and ConstantBool from LLVM. llvm-svn: 33073	2007-01-11 12:24:14 +00:00
Chris Lattner	10cae15d8e	remove support for llvm.isunordered llvm-svn: 32992	2007-01-07 08:37:22 +00:00
Evan Cheng	8ec5283dc4	GEP subscript is interpreted as a signed value. llvm-svn: 32888	2007-01-05 01:46:20 +00:00
Chris Lattner	96035bed51	fix PowerPC/2007-01-04-ArgExtension.ll, a bug handling K&R prototypes with the recent signless changes. llvm-svn: 32884	2007-01-04 22:22:37 +00:00
Reid Spencer	e6f81876eb	Legalizer doesn't do an ANY_EXTEND if we don't ask for one so make sure that we default to an ANY_EXTEND if no parameter attribute is set on the result value of a function. llvm-svn: 32836	2007-01-03 16:49:33 +00:00
Reid Spencer	2a34b91666	Restore previous behavior of defaulting to ZEXT. This works around two things: (1) preventing PR1071 and (2) working around missing parameter attributes for bool type. (2) will be fixed shortly. When PR1071 is fixed, this patch should be undone. llvm-svn: 32831	2007-01-03 05:03:05 +00:00
Reid Spencer	0917adf614	Two changes: 1. Switch expression and cases are compared signed and are sign extended. 2. For function results needing extended, do SIGN_EXTEND if the SExtAttribute is set and ZERO_EXTEND if the ZExtAttribute is set, otherwise just let the Legalizer do ANY_EXTEND. This fixes the recent regression in kimwitu++ and probably the llvm-gcc bootstrap issue we had today. llvm-svn: 32830	2007-01-03 04:25:33 +00:00
Reid Spencer	e63b6518fa	For PR950: Three changes: 1. Convert signed integer types to signless versions. 2. Implement the @sext and @zext parameter attributes. Previously the type of an function parameter was used to determine whether it should be sign extended or zero extended before the call. This information is now communicated via the function type's parameter attributes. 3. The interface to LowerCallTo had to be changed in order to accommodate the parameter attribute information. Although it would have been convenient to pass in the FunctionType itself, there isn't always one present in the caller. Consequently, a signedness indication for the result type and for each parameter was provided for in the interface to this method. All implementations were changed to make the adjustment necessary. llvm-svn: 32788	2006-12-31 05:55:36 +00:00
Reid Spencer	266e42b312	For PR950: This patch removes the SetCC instructions and replaces them with the ICmp and FCmp instructions. The SetCondInst instruction has been removed and been replaced with ICmpInst and FCmpInst. llvm-svn: 32751	2006-12-23 06:05:41 +00:00
Evan Cheng	258657e64e	getLoad() and getStore() calls missed SVOffset operand. Thanks to Dan Gohman for pointing it out! llvm-svn: 32712	2006-12-20 01:27:29 +00:00
Chris Lattner	9bd5ed636c	Fix PR1049 and CodeGen/Generic/2006-12-16-InlineAsmCrash.ll by producing target constants instead of constants. Constants can get selected to li/movri instructions, which causes the scheduler to explode. llvm-svn: 32633	2006-12-16 21:14:48 +00:00
Evan Cheng	22cf89967b	More soft-fp work. llvm-svn: 32559	2006-12-13 20:57:08 +00:00
Reid Spencer	bfe26ffcfc	Replace CastInst::createInferredCast calls with more accurate cast creation calls. llvm-svn: 32521	2006-12-13 00:50:17 +00:00
Evan Cheng	634885f71e	Expand i32/i64 CopyToReg f32/f64 to BIT_CONVERT + CopyToReg. llvm-svn: 32493	2006-12-12 21:21:32 +00:00
Evan Cheng	0c0b78c18e	Expand formal arguments and call arguments recursively: e.g. f64 -> i64 -> 2 x i32. llvm-svn: 32476	2006-12-12 07:27:38 +00:00
Anton Korobeynikov	3b7c257cae	Cleaned setjmp/longjmp lowering interfaces. Now we're producing right code (both asm & cbe) for Mingw32 target. Removed autoconf checks for underscored versions of setjmp/longjmp. llvm-svn: 32415	2006-12-10 23:12:42 +00:00
Evan Cheng	4eee72471c	Preliminary soft float support. llvm-svn: 32394	2006-12-09 02:42:38 +00:00
Bill Wendling	22e978a736	Removing even more <iostream> includes. llvm-svn: 32320	2006-12-07 20:04:42 +00:00
Evan Cheng	feba507a97	Fix for PR1023 by Dan Gohman. llvm-svn: 32003	2006-11-29 01:58:12 +00:00
Evan Cheng	6e12a052ff	Fix for PR1022 (folding loads of static initializers) by Dan Gohman. llvm-svn: 32000	2006-11-29 01:38:07 +00:00
Chris Lattner	90f4238c38	add a hook to allow targets to hack on inline asms to lower them to llvm when they want to. llvm-svn: 31997	2006-11-29 01:12:32 +00:00
Evan Cheng	20350c4025	Change MachineInstr ctor's to take a TargetInstrDescriptor reference instead of opcode and number of operands. llvm-svn: 31947	2006-11-27 23:37:22 +00:00
Reid Spencer	6c38f0bb07	For PR950: The long awaited CAST patch. This introduces 12 new instructions into LLVM to replace the cast instruction. Corresponding changes throughout LLVM are provided. This passes llvm-test, llvm/test, and SPEC CPUINT2000 with the exception of 175.vpr which fails only on a slight floating point output difference. llvm-svn: 31931	2006-11-27 01:05:10 +00:00
Reid Spencer	d9436b6837	For PR950: First in a series of patches to convert SetCondInst into ICmpInst and FCmpInst using only two opcodes and having the instructions contain their predicate value. Nothing uses these classes yet. More patches to follow. llvm-svn: 31867	2006-11-20 01:22:35 +00:00
Chris Lattner	30d08801ef	remove dead #include llvm-svn: 31753	2006-11-15 17:51:15 +00:00
Chris Lattner	d5e604dbb2	commentate llvm-svn: 31627	2006-11-10 04:41:34 +00:00
Reid Spencer	fdff938a7e	For PR950: This patch converts the old SHR instruction into two instructions, AShr (Arithmetic) and LShr (Logical). The Shr instructions now are not dependent on the sign of their operands. llvm-svn: 31542	2006-11-08 06:47:33 +00:00
Reid Spencer	de46e48420	For PR786: Turn on -Wunused and -Wno-unused-parameter. Clean up most of the resulting fall out by removing unused variables. Remaining warnings have to do with unused functions (I didn't want to delete code without review) and unused variables in generated code. Maintainers should clean up the remaining issues when they see them. All changes pass DejaGnu tests and Olden. llvm-svn: 31380	2006-11-02 20:25:50 +00:00
Reid Spencer	7eb55b395f	For PR950: Replace the REM instruction with UREM, SREM and FREM. llvm-svn: 31369	2006-11-02 01:53:59 +00:00
Chris Lattner	55402d4403	Allow the getRegForInlineAsmConstraint method to return a register class with no fixes physreg. Treat this as permission to use any register in the register class. When this happens and it is safe, allow the llvm register allcoator to allocate the register instead of doing it at isel time. This eliminates a ton of copies around common inline asms. For example: int test2(int Y, int X) { asm("foo %0, %1" : "=r"(X): "r"(X)); return X; } now compiles to: _test2: foo r3, r4 blr instead of: _test2: mr r2, r4 foo r2, r2 mr r3, r2 blr GCC produces: _test2: foo r4, r4 mr r3,r4 blr llvm-svn: 31366	2006-11-02 01:41:49 +00:00
Chris Lattner	fe43befeda	Compile CodeGen/PowerPC/fp-branch.ll to: _intcoord_cond_next55: LBB1_3: ;cond_next55 lis r2, ha16(LCPI1_0) lfs f0, lo16(LCPI1_0)(r2) fcmpu cr0, f1, f0 blt cr0, LBB1_2 ;cond_next62.exitStub LBB1_1: ;bb72.exitStub li r3, 1 blr LBB1_2: ;cond_next62.exitStub li r3, 0 blr instead of: _intcoord_cond_next55: LBB1_3: ;cond_next55 lis r2, ha16(LCPI1_0) lfs f0, lo16(LCPI1_0)(r2) fcmpu cr0, f1, f0 bge cr0, LBB1_1 ;bb72.exitStub LBB1_4: ;cond_next55 lis r2, ha16(LCPI1_0) lfs f0, lo16(LCPI1_0)(r2) fcmpu cr0, f1, f0 bnu cr0, LBB1_2 ;cond_next62.exitStub LBB1_1: ;bb72.exitStub li r3, 1 blr LBB1_2: ;cond_next62.exitStub li r3, 0 blr llvm-svn: 31330	2006-10-31 23:06:00 +00:00
Chris Lattner	427301fdae	look through isunordered to inline it into branch blocks. llvm-svn: 31328	2006-10-31 22:37:42 +00:00
Chris Lattner	6f043b90ea	TargetLowering::isOperandValidForConstraint llvm-svn: 31319	2006-10-31 19:41:18 +00:00
Chris Lattner	968f803928	Turn an assert into an error message. This is commonly triggered when we don't support a specific constraint yet. When this happens, print the unsupported constraint. llvm-svn: 31310	2006-10-31 07:33:13 +00:00
Evan Cheng	84a28d4e76	Lower jumptable to BR_JT. The legalizer can lower it to a BRIND or let the target custom lower it. llvm-svn: 31293	2006-10-30 08:00:44 +00:00
Chris Lattner	e60ae823e8	fix Generic/2006-10-29-Crash.ll llvm-svn: 31281	2006-10-29 21:01:20 +00:00
Chris Lattner	f31b9ef458	Fix a load folding issue that Evan noticed: there is no need to export values used by comparisons in the main block. llvm-svn: 31279	2006-10-29 18:23:37 +00:00
Chris Lattner	bba52191fa	split critical edges more carefully and intelligently. In particular, critical edges whose destinations are not phi nodes don't bother us. Also, share split edges, since the split edge can't have a phi. This significantly reduces the complexity of generated code in some cases. llvm-svn: 31274	2006-10-28 19:22:10 +00:00
Chris Lattner	3e6b1c6157	Split all critical edges before isel. This resolves issues with spill code being inserted on unsplit critical edges, which introduces (sometimes large amounts of) partially dead spill code. This also fixes PR925 + CodeGen/Generic/switch-crit-edge-constant.ll llvm-svn: 31260	2006-10-28 17:04:37 +00:00
Chris Lattner	84a035056e	Fix a bug in merged condition handling (CodeGen/Generic/2006-10-27-CondFolding.ll). Add many fewer CFG edges and PHI node entries. If there is a switch which has the same block as multiple destinations, only add that block once as a successor/phi node (in the jumptable case) llvm-svn: 31242	2006-10-27 23:50:33 +00:00
Chris Lattner	b9392fb635	remove debug code llvm-svn: 31233	2006-10-27 21:58:03 +00:00
Chris Lattner	f1b54fd7a5	Codegen cond&cond with two branches. This compiles (f.e.) PowerPC/and-branch.ll to: cmpwi cr0, r4, 4 bgt cr0, LBB1_2 ;UnifiedReturnBlock LBB1_3: ;entry cmplwi cr0, r3, 0 bne cr0, LBB1_2 ;UnifiedReturnBlock instead of: cmpwi cr7, r4, 4 mfcr r2 addic r4, r3, -1 subfe r3, r4, r3 rlwinm r2, r2, 30, 31, 31 or r2, r2, r3 cmplwi cr0, r2, 0 bne cr0, LBB1_2 ;UnifiedReturnBlock LBB1_1: ;cond_true llvm-svn: 31232	2006-10-27 21:54:23 +00:00
Chris Lattner	ed0110b949	Turn conditions like x<Y\|z==q into multiple blocks. This compiles Regression/CodeGen/X86/or-branch.ll into: _foo: subl $12, %esp call L_bar$stub movl 20(%esp), %eax movl 16(%esp), %ecx cmpl $5, %eax jl LBB1_1 #cond_true LBB1_3: #entry testl %ecx, %ecx jne LBB1_2 #UnifiedReturnBlock LBB1_1: #cond_true call L_bar$stub addl $12, %esp ret LBB1_2: #UnifiedReturnBlock addl $12, %esp ret instead of: _foo: subl $12, %esp call L_bar$stub movl 20(%esp), %eax movl 16(%esp), %ecx cmpl $4, %eax setg %al testl %ecx, %ecx setne %cl testb %cl, %al jne LBB1_2 #UnifiedReturnBlock LBB1_1: #cond_true call L_bar$stub addl $12, %esp ret LBB1_2: #UnifiedReturnBlock addl $12, %esp ret And on ppc to: cmpwi cr0, r29, 5 blt cr0, LBB1_1 ;cond_true LBB1_3: ;entry cmplwi cr0, r30, 0 bne cr0, LBB1_2 ;UnifiedReturnBlock instead of: cmpwi cr7, r4, 4 mfcr r2 addic r4, r3, -1 subfe r30, r4, r3 rlwinm r29, r2, 30, 31, 31 and r2, r29, r30 cmplwi cr0, r2, 0 bne cr0, LBB1_2 ;UnifiedReturnBlock llvm-svn: 31230	2006-10-27 21:36:01 +00:00
Reid Spencer	7e80b0b31e	For PR950: Make necessary changes to support DIV -> [SUF]Div. This changes llvm to have three division instructions: signed, unsigned, floating point. The bytecode and assembler are bacwards compatible, however. llvm-svn: 31195	2006-10-26 06:15:43 +00:00
Chris Lattner	61bcf9154d	visitSwitchCase knows how to insert conditional branches well. Change visitBr to just call visitSwitchCase, eliminating duplicate logic. llvm-svn: 31167	2006-10-24 18:07:37 +00:00
Chris Lattner	963ddad31a	Generalize CaseBlock a bit more: Rename LHSBB/RHSBB to TrueBB/FalseBB. Allow the RHS value to be null, in which case the LHS is treated as a bool. llvm-svn: 31166	2006-10-24 17:57:59 +00:00
Chris Lattner	3f179d24c6	generalize 'CaseBlock'. It really allows any comparison to be inserted. llvm-svn: 31161	2006-10-24 17:03:35 +00:00
Chris Lattner	4c931502cc	Minor tweak. Instead of generating: movl 32(%esp), %eax cmpl $1, %eax je LBB1_1 #bb LBB1_4: #entry cmpl $2, %eax je LBB1_2 #bb2 jmp LBB1_3 #UnifiedReturnBlock LBB1_1: #bb notice that we would miss the fall through and emit this instead: movl 32(%esp), %eax cmpl $2, %eax je LBB1_2 #bb2 LBB1_4: #entry cmpl $1, %eax jne LBB1_3 #UnifiedReturnBlock LBB1_1: #bb llvm-svn: 31130	2006-10-23 18:38:22 +00:00
Chris Lattner	76a7bc8c55	Fix phi node updating for switches lowered to linear sequences of branches. llvm-svn: 31125	2006-10-22 23:00:53 +00:00
Chris Lattner	4c3ef4782d	disable this code for now, it's not yet safely updating phi nodes llvm-svn: 31124	2006-10-22 22:47:10 +00:00
Chris Lattner	6d6fc26257	Implement PR964 and Regression/CodeGen/Generic/SwitchLowering.ll llvm-svn: 31119	2006-10-22 21:36:53 +00:00
Reid Spencer	e0fc4dfc22	For PR950: This patch implements the first increment for the Signless Types feature. All changes pertain to removing the ConstantSInt and ConstantUInt classes in favor of just using ConstantInt. llvm-svn: 31063	2006-10-20 07:07:24 +00:00
Bill Wendling	be96e1cd09	Partially in response to PR926: insert the newly created machine basic blocks into the basic block list when lowering the switch inst. into a binary tree of if-then statements. This allows the "visitSwitchCase" func to allow for fall-through behavior. llvm-svn: 31057	2006-10-19 21:46:38 +00:00
Jim Laskey	dcb2b83886	Pass AliasAnalysis thru to DAGCombiner. llvm-svn: 30984	2006-10-16 20:52:31 +00:00
Evan Cheng	ab51cf2e78	Merge ISD::TRUNCSTORE to ISD::STORE. Switch to using StoreSDNode. llvm-svn: 30945	2006-10-13 21:14:26 +00:00
Andrew Lenharth	a6bbf33cbf	Jimptables working again on alpha. As a bonus, use the GOT node instead of the AlphaISD::GOT for internal stuff. llvm-svn: 30873	2006-10-11 04:29:42 +00:00
Chris Lattner	6df349676e	add two helper methods. llvm-svn: 30869	2006-10-11 03:58:02 +00:00
Evan Cheng	e71fe34d75	Reflects ISD::LOAD / ISD::LOADX / LoadSDNode changes. llvm-svn: 30844	2006-10-09 20:57:25 +00:00
Chris Lattner	9d75324ddf	jump tables handle pic llvm-svn: 30776	2006-10-06 22:32:29 +00:00
Evan Cheng	df9ac47e5e	Make use of getStore(). llvm-svn: 30759	2006-10-05 23:01:46 +00:00
Evan Cheng	f80dfa83a0	Fix some typos that can cause a flag value to have more than one use. llvm-svn: 30727	2006-10-04 22:23:53 +00:00
Chris Lattner	a9caf95591	refactor critical edge breaking out into the SplitCritEdgesForPHIConstants method. This is a baby step towards fixing PR925. llvm-svn: 30643	2006-09-28 06:17:10 +00:00
Andrew Lenharth	c19ef92403	Comments on JumpTableness llvm-svn: 30615	2006-09-26 20:02:30 +00:00
Andrew Lenharth	783a4a9d86	Add support for other relocation bases to jump tables, as well as custom asm directives llvm-svn: 30593	2006-09-24 19:45:58 +00:00
Evan Cheng	77c0757f8b	PIC jump table entries are always 32-bit. This fixes PIC jump table support on X86-64. llvm-svn: 30590	2006-09-24 05:22:38 +00:00
Andrew Lenharth	c50458fb90	absolute addresses must match pointer size llvm-svn: 30461	2006-09-18 17:59:35 +00:00
Chris Lattner	84cc1f7cb8	If LSR went through a lot of trouble to put constants (e.g. the addr of a global in a specific BB, don't undo this!). This allows us to compile CodeGen/X86/loop-hoist.ll into: _foo: xorl %eax, %eax * movl L_Arr$non_lazy_ptr, %ecx movl 4(%esp), %edx LBB1_1: #cond_true movl %eax, (%ecx,%eax,4) incl %eax cmpl %edx, %eax jne LBB1_1 #cond_true LBB1_2: #return ret instead of: _foo: xorl %eax, %eax movl 4(%esp), %ecx LBB1_1: #cond_true * movl L_Arr$non_lazy_ptr, %edx movl %eax, (%edx,%eax,4) incl %eax cmpl %ecx, %eax jne LBB1_1 #cond_true LBB1_2: #return ret This was noticed in 464.h264ref. This doesn't usually affect PPC, but strikes X86 all the time. llvm-svn: 30290	2006-09-13 06:02:42 +00:00
Chris Lattner	2e0dfb0b16	This code was trying too hard. By eliminating redundant edges in the CFG due to switch cases going to the same place, it make #pred != #phi entries, breaking live interval analysis. This fixes 458.sjeng on x86 with llc. llvm-svn: 30236	2006-09-10 06:36:57 +00:00
Chris Lattner	f0359b343a	Implement the fpowi now by lowering to a libcall llvm-svn: 30225	2006-09-09 06:03:30 +00:00
Chris Lattner	707339a57b	Fix CodeGen/Generic/2006-09-06-SwitchLowering.ll, a bug where SDIsel inserted too many phi operands when lowering a switch to branches in some cases. llvm-svn: 30142	2006-09-07 01:59:34 +00:00
Chris Lattner	af23f9b5f6	Completely eliminate def&use operands. Now a register operand is EITHER a def operand or a use operand. llvm-svn: 30109	2006-09-05 02:31:13 +00:00
Chris Lattner	3d27be1333	s\|llvm/Support/Visibility.h\|llvm/Support/Compiler.h\| llvm-svn: 29911	2006-08-27 12:54:02 +00:00
Chris Lattner	65879caf07	minor changes. llvm-svn: 29740	2006-08-16 22:57:46 +00:00
Chris Lattner	bd8877744b	eliminate use of getNode that takes vector of valuetypes. llvm-svn: 29687	2006-08-14 23:53:35 +00:00
Chris Lattner	c24a1d3093	Start eliminating temporary vectors used to create DAG nodes. Instead, pass in the start of an array and a count of operands where applicable. In many cases, the number of operands is known, so this static array can be allocated on the stack, avoiding the heap. In many other cases, a SmallVector can be used, which has the same benefit in the common cases. I updated a lot of code calling getNode that takes a vector, but ran out of time. The rest of the code should be updated, and these methods should be removed. We should also do the same thing to eliminate the methods that take a vector of MVT::ValueTypes. It would be extra nice to convert the dagiselemitter to avoid creating vectors for operands when calling getTargetNode. llvm-svn: 29566	2006-08-08 02:23:42 +00:00
Chris Lattner	524c1a21f2	Work around a GCC 3.3.5 bug noticed by a user. llvm-svn: 29490	2006-08-03 00:18:59 +00:00
Jim Laskey	29e635d3c9	Final polish on machine pass registries. llvm-svn: 29471	2006-08-02 12:30:23 +00:00
Jim Laskey	17c67efe8a	Now that the ISel is available, it's possible to create a default instruction scheduler creator. llvm-svn: 29452	2006-08-01 19:14:14 +00:00
Jim Laskey	03593f72db	1. Change use of "Cache" to "Default". 2. Added argument to instruction scheduler creators so the creators can do special things. 3. Repaired target hazard code. 4. Misc. More to follow. llvm-svn: 29450	2006-08-01 18:29:48 +00:00
Jim Laskey	95eda5b1f3	Introducing plugable register allocators and instruction schedulers. llvm-svn: 29434	2006-08-01 14:21:23 +00:00
Evan Cheng	6ae6ac1216	PIC jump table entries are always 32-bit even in 64-bit mode. llvm-svn: 29422	2006-08-01 01:03:13 +00:00
Nate Begeman	efc312a5c7	Code cleanups, per review llvm-svn: 29347	2006-07-27 16:46:58 +00:00
Nate Begeman	787565024a	Support jump tables when in PIC relocation model llvm-svn: 29318	2006-07-27 01:13:04 +00:00
Chris Lattner	b030532910	Mems can be in the output list also. This is the second half of a fix for PR833 llvm-svn: 29224	2006-07-20 19:02:21 +00:00
Chris Lattner	996795b0dd	Use hidden visibility to make symbols in an anonymous namespace get dropped. This shrinks libllvmgcc.dylib another 67K llvm-svn: 28975	2006-06-28 23:17:24 +00:00
Evan Cheng	ef9e07d3f0	Consistency. EXTRACT_ELEMENT index operand should have ptr type. llvm-svn: 28795	2006-06-15 08:11:54 +00:00
Chris Lattner	32d92e004d	Make sure to update the CFG correctly if a switch only has a default dest. This fixes CodeGen/Generic/2006-06-12-LowerSwitchCrash.ll llvm-svn: 28755	2006-06-12 18:25:29 +00:00
Chris Lattner	c03a9259c0	Fix X86/inline-asm.ll:test2, a case where an input value was implicitly truncated. llvm-svn: 28733	2006-06-08 18:27:11 +00:00
Chris Lattner	705948d742	Fix Regression/CodeGen/X86/inline-asm.ll, a case where inline asm causes implement extension of a register. llvm-svn: 28731	2006-06-08 18:22:48 +00:00
Evan Cheng	21dee4e0b2	Make CALL node consistent with RET node. Signness of value has type MVT::i32 instead of MVT::i1. Either is fine except MVT::i32 is probably a legal type for most (if not all) platforms while MVT::i1 is not. llvm-svn: 28511	2006-05-26 23:13:20 +00:00
Evan Cheng	a2e9953c54	Change RET node to include signness information of the return values. e.g. RET chain, value1, sign1, value2, sign2 llvm-svn: 28509	2006-05-26 23:09:09 +00:00
Evan Cheng	4582771f3f	CALL node change: now including signness of every argument. llvm-svn: 28461	2006-05-25 00:55:32 +00:00
Evan Cheng	ac4f66ff24	-enable-unsafe-fp-math implies -enable-finite-only-fp-math llvm-svn: 28437	2006-05-23 18:18:46 +00:00
Vladimir Prus	df1d439849	Fix missing include llvm-svn: 28435	2006-05-23 13:43:15 +00:00
Evan Cheng	1c5b7d12df	Incorrect SETCC CondCode used for FP comparisons. llvm-svn: 28433	2006-05-23 06:40:47 +00:00
Chris Lattner	7949c2e8b2	Fix the result of the call to use a correct vbitconvert. There is no need to use getPackedTypeBreakdown at all here. llvm-svn: 28365	2006-05-17 20:49:36 +00:00
Chris Lattner	938155ca57	Correct a previous patch which broke CodeGen/PowerPC/vec_call.ll llvm-svn: 28364	2006-05-17 20:43:21 +00:00
Evan Cheng	751cd7653d	Fixed a LowerCallTo and LowerArguments bug. They were introducing illegal VBIT_VECTOR nodes. There were some confusion about the semantics of getPackedTypeBreakdown(). e.g. for <4 x f32> it returns 1 and v4f32, not 4, and f32. llvm-svn: 28352	2006-05-17 18:16:39 +00:00
Chris Lattner	b77ba73a29	Add support for calls that pass and return legal vectors. llvm-svn: 28340	2006-05-16 23:39:44 +00:00
Chris Lattner	aaa23d953f	Add a new ISD::CALL node, make the default impl of TargetLowering::LowerCallTo produce it. llvm-svn: 28338	2006-05-16 22:53:20 +00:00
Chris Lattner	3d82699605	Add a chain to FORMAL_ARGUMENTS. This is a minimal port of the X86 backend, it doesn't currently use/maintain the chain properly. Also, make the X86ISelLowering.cpp file 80-col clean. llvm-svn: 28320	2006-05-16 06:45:34 +00:00
Chris Lattner	957cb6733a	Move function-live-in-handling code from the sdisel code to the scheduler. This code should be emitted after legalize, so it can't be in sdisel. Note that the EmitFunctionEntryCode hook should be updated to operate on the DAG. The X86 backend is the only one currently using this hook. llvm-svn: 28315	2006-05-16 06:10:58 +00:00
Evan Cheng	d1915cfa6f	Revert an un-intended change llvm-svn: 28278	2006-05-13 05:53:47 +00:00
Chris Lattner	53cdb2f2b0	Remove dead vars llvm-svn: 28255	2006-05-12 18:06:45 +00:00
Evan Cheng	d38c22bdd3	Refactor scheduler code. Move register-reduction list scheduler to a separate file. Added an initial implementation of top-down register pressure reduction list scheduler. llvm-svn: 28226	2006-05-11 23:55:42 +00:00
Nate Begeman	d7a19102d1	Make emission of jump tables a bit less conservative; they are now required to be only 31.25% dense, rather than 75% dense. llvm-svn: 28165	2006-05-08 16:51:36 +00:00
Chris Lattner	21cd99024a	When inserting casts, be careful of where we put them. We cannot insert a cast immediately before a PHI node. This fixes Regression/CodeGen/Generic/2006-05-06-GEP-Cast-Sink-Crash.ll llvm-svn: 28143	2006-05-06 09:10:37 +00:00
Chris Lattner	3e3f2c63c3	More aggressively sink GEP offsets into loops. For example, before we generated: movl 8(%esp), %eax movl %eax, %edx addl $4316, %edx cmpb $1, %cl ja LBB1_2 #cond_false LBB1_1: #cond_true movl L_QuantizationTables720$non_lazy_ptr, %ecx movl %ecx, (%edx) movl L_QNOtoQuantTableShift720$non_lazy_ptr, %edx movl %edx, 4460(%eax) ret ... Now we generate: movl 8(%esp), %eax cmpb $1, %cl ja LBB1_2 #cond_false LBB1_1: #cond_true movl L_QuantizationTables720$non_lazy_ptr, %ecx movl %ecx, 4316(%eax) movl L_QNOtoQuantTableShift720$non_lazy_ptr, %ecx movl %ecx, 4460(%eax) ret ... which uses one fewer register. llvm-svn: 28129	2006-05-05 21:17:49 +00:00
Chris Lattner	7a3ecf7993	Sink noop copies into the basic block that uses them. This reduces the number of cross-block live ranges, and allows the bb-at-a-time selector to always coallesce these away, at isel time. This reduces the load on the coallescer and register allocator. For example on a codec on X86, we went from: 1643 asm-printer - Number of machine instrs printed 419 liveintervals - Number of loads/stores folded into instructions 1144 liveintervals - Number of identity moves eliminated after coalescing 1022 liveintervals - Number of interval joins performed 282 liveintervals - Number of intervals after coalescing 1304 liveintervals - Number of original intervals 86 regalloc - Number of times we had to backtrack 1.90232 regalloc - Ratio of intervals processed over total intervals 40 spiller - Number of values reused 182 spiller - Number of loads added 121 spiller - Number of stores added 132 spiller - Number of register spills 6 twoaddressinstruction - Number of instructions commuted to coalesce 360 twoaddressinstruction - Number of two-address instructions to: 1636 asm-printer - Number of machine instrs printed 403 liveintervals - Number of loads/stores folded into instructions 1155 liveintervals - Number of identity moves eliminated after coalescing 1033 liveintervals - Number of interval joins performed 279 liveintervals - Number of intervals after coalescing 1312 liveintervals - Number of original intervals 76 regalloc - Number of times we had to backtrack 1.88998 regalloc - Ratio of intervals processed over total intervals 1 spiller - Number of copies elided 41 spiller - Number of values reused 191 spiller - Number of loads added 114 spiller - Number of stores added 128 spiller - Number of register spills 4 twoaddressinstruction - Number of instructions commuted to coalesce 356 twoaddressinstruction - Number of two-address instructions On this testcase, this change provides a modest reduction in spill code, regalloc iterations, and total instructions emitted. It increases the number of register coallesces. llvm-svn: 28115	2006-05-05 01:04:50 +00:00
Nate Begeman	df4883971e	Finish up the initial jump table implementation by allowing jump tables to not be 100% dense. Increase the minimum threshold for the number of cases in a switch statement from 4 to 6 in order to create a jump table. llvm-svn: 28079	2006-05-03 03:48:02 +00:00
Owen Anderson	20a631fde7	Refactor TargetMachine, pushing handling of TargetData into the target-specific subclasses. This has one caller-visible change: getTargetData() now returns a pointer instead of a reference. This fixes PR 759. llvm-svn: 28074	2006-05-03 01:29:57 +00:00
Evan Cheng	c5e8ce8b8c	Remove the temporary option: -no-isel-fold-inflight llvm-svn: 28012	2006-04-28 18:54:11 +00:00
Evan Cheng	d43c5c6046	TargetLowering::LowerArguments should return a VBIT_CONVERT of FORMAL_ARGUMENTS SDOperand in the return result vector. llvm-svn: 28009	2006-04-28 05:25:15 +00:00
Evan Cheng	51ab4498e7	Added a temporary option -no-isel-fold-inflight to control whether a "inflight" node can be folded. llvm-svn: 28003	2006-04-28 02:09:19 +00:00
Evan Cheng	3784f3c57c	Insert a VBIT_CONVERT between a FORMAL_ARGUMENT node and its vector uses (VAND, VADD, etc.). Legalizer will assert otherwise. llvm-svn: 27991	2006-04-27 08:29:42 +00:00
Evan Cheng	9618df1190	Don't forget return void. llvm-svn: 27974	2006-04-25 23:03:35 +00:00
Nate Begeman	866b4b4d45	Fix the updating of the machine CFG when a PHI node was in a successor of the jump table's range check block. This re-enables 100% dense jump tables by default on PPC & x86 llvm-svn: 27952	2006-04-23 06:26:20 +00:00
Nate Begeman	ecb1dafd3d	Turn of jump tables for a bit, there are still some issues to work out with updating the machine CFG. llvm-svn: 27949	2006-04-22 23:51:56 +00:00
Nate Begeman	4ca2ea5b43	JumpTable support! What this represents is working asm and jit support for x86 and ppc for 100% dense switch statements when relocations are non-PIC. This support will be extended and enhanced in the coming days to support PIC, and less dense forms of jump tables. llvm-svn: 27947	2006-04-22 18:53:45 +00:00
Chris Lattner	b21d3bfd1f	The BFS scheduler is apparently nondeterminstic (causes many llvmgcc bootstrap miscompares). Switch RISC targets to use the list-td scheduler, which isn't. llvm-svn: 27933	2006-04-21 17:16:16 +00:00
Chris Lattner	d3b504ae10	Implement support for the formal_arguments node. To get this, targets shouldcustom legalize it and remove their XXXTargetLowering::LowerArguments overload llvm-svn: 27604	2006-04-12 16:20:43 +00:00
Chris Lattner	02274a5265	Add code generator support for VSELECT llvm-svn: 27542	2006-04-08 22:22:57 +00:00
Chris Lattner	098c01e94e	Codegen shufflevector as VVECTOR_SHUFFLE llvm-svn: 27529	2006-04-08 04:15:24 +00:00
Chris Lattner	aa3185f12e	Stub out shufflevector llvm-svn: 27514	2006-04-08 01:19:25 +00:00
Chris Lattner	4a2413a590	Make a vector live across blocks have the correct Vec type. This fixes CodeGen/X86/2006-04-04-CrossBlockCrash.ll llvm-svn: 27436	2006-04-05 06:54:42 +00:00
Chris Lattner	a9c59156be	Intrinsics that just load from memory can be treated like loads: they don't have to serialize against each other. This allows us to schedule lvx's across each other, for example. llvm-svn: 27346	2006-04-02 03:41:14 +00:00
Chris Lattner	ef598059f2	Add a new -view-legalize-dags command line option llvm-svn: 27342	2006-04-02 03:07:27 +00:00
Chris Lattner	bec582f4cd	Prefer larger register classes over smaller ones when a register occurs in multiple register classes. This fixes PowerPC/2006-04-01-FloatDoubleExtend.ll llvm-svn: 27334	2006-04-02 00:24:45 +00:00
Chris Lattner	ba38035e21	Make sure to pass enough values to phi nodes when we are dealing with decimated vectors. This fixes UnitTests/Vector/sumarray-dbl.c llvm-svn: 27280	2006-03-31 02:12:18 +00:00
Chris Lattner	5fe1f54c17	Significantly improve handling of vectors that are live across basic blocks, handling cases where the vector elements need promotion, expansion, and when the vector type itself needs to be decimated. llvm-svn: 27278	2006-03-31 02:06:56 +00:00
Chris Lattner	67271869a8	Bug fixes: handle constantexpr insert/extract element operations Handle constantpacked vectors with constantexpr elements. This fixes CodeGen/Generic/vector-constantexpr.ll llvm-svn: 27241	2006-03-29 00:11:43 +00:00
Jim Laskey	67a636c587	More bulletproofing of llvm.dbg.declare. llvm-svn: 27224	2006-03-28 13:45:20 +00:00
Chris Lattner	e55d171ccd	Tblgen doesn't like multiple SDNode<> definitions that map to the sameenum value. Split them into separate enums. llvm-svn: 27201	2006-03-28 00:40:33 +00:00
Jim Laskey	d387cc5cde	Reactivate llvm.dbg.declare. llvm-svn: 27192	2006-03-27 23:31:10 +00:00
Chris Lattner	5bb1d90afd	Disable dbg_declare, it currently breaks the CFE build llvm-svn: 27182	2006-03-27 21:36:03 +00:00
Nate Begeman	ed728c1291	SelectionDAGISel can now natively handle Switch instructions, in the same manner that the LowerSwitch LLVM to LLVM pass does: emitting a binary search tree of basic blocks. The new approach has several advantages: it is faster, it generates significantly smaller code in many cases, and it paves the way for implementing dense switch tables as a jump table by handling switches directly in the instruction selector. This functionality is currently only enabled on x86, but should be safe for every target. In anticipation of making it the default, the cfg is now properly updated in the x86, ppc, and sparc select lowering code. llvm-svn: 27156	2006-03-27 01:32:24 +00:00
Jim Laskey	7092888bcc	Bullet proof against undefined args produced by upgrading ols-style debug info. llvm-svn: 27155	2006-03-26 22:46:27 +00:00
Chris Lattner	313229c74b	fix inverted conditional llvm-svn: 27089	2006-03-24 22:49:42 +00:00
Jim Laskey	53f1ecc560	Rename for truth in advertising. llvm-svn: 27063	2006-03-24 09:50:27 +00:00
Chris Lattner	d96b09a7b9	Lower target intrinsics into an INTRINSIC node llvm-svn: 27035	2006-03-24 02:22:33 +00:00
Jim Laskey	a8bdac875d	Handle new forms of llvm.dbg intrinsics. llvm-svn: 26988	2006-03-23 18:06:46 +00:00
Chris Lattner	b893d04a67	Fix a typo llvm-svn: 26965	2006-03-22 22:20:49 +00:00
Chris Lattner	2f4119a608	Implement simple support for vector casting. This can currently only handle casts between legal vector types. llvm-svn: 26961	2006-03-22 20:09:35 +00:00
Chris Lattner	7c0cd8cafc	add some trivial support for extractelement. llvm-svn: 26928	2006-03-21 20:44:12 +00:00
Chris Lattner	672a42d731	Add a hacky workaround for crashes due to vectors live across blocks. Note that this code won't work for vectors that aren't legal on the target. Improvements coming. llvm-svn: 26925	2006-03-21 19:20:37 +00:00
Chris Lattner	29b2301460	implement basic support for INSERT_VECTOR_ELT. llvm-svn: 26849	2006-03-19 01:17:20 +00:00
Chris Lattner	f4e1a53647	Rename ConstantVec -> BUILD_VECTOR and VConstant -> VBUILD_VECTOR. Allow*BUILD_VECTOR to take variable inputs. llvm-svn: 26847	2006-03-19 00:52:58 +00:00
Chris Lattner	c16b05e67d	implement vector.ll:test_undef llvm-svn: 26845	2006-03-19 00:20:20 +00:00
Chris Lattner	32206f54c6	Change the structure of lowering vector stuff. Note: This breaks some things. llvm-svn: 26840	2006-03-18 01:44:44 +00:00
Nate Begeman	bb01d4f272	Remove BRTWOWAY* Make the PPC backend not dependent on BRTWOWAY_CC and make the branch selector smarter about the code it generates, fixing a case in the readme. llvm-svn: 26814	2006-03-17 01:40:33 +00:00
Chris Lattner	7ececaad83	Fix a problem fully scalarizing values. llvm-svn: 26811	2006-03-16 23:05:19 +00:00
Chris Lattner	8471b15706	Add support for CopyFromReg from vector values. Note: this doesn't support illegal vector types yet! llvm-svn: 26799	2006-03-16 19:57:50 +00:00
Chris Lattner	49409cb925	Teach CreateRegForValue how to handle vector types. llvm-svn: 26798	2006-03-16 19:51:18 +00:00
Chris Lattner	4024c00ce7	add support for vector->vector casts llvm-svn: 26788	2006-03-15 22:19:46 +00:00
Jim Laskey	acb6e34277	Handle the removal of the debug chain. llvm-svn: 26729	2006-03-13 13:07:37 +00:00
Evan Cheng	38280c0020	Added a parameter to control whether Constant::getStringValue() would chop off the result string at the first null terminator. llvm-svn: 26704	2006-03-10 23:52:03 +00:00
Chris Lattner	d3ef6c290a	scrape out bits of llvm-db llvm-svn: 26701	2006-03-10 22:48:19 +00:00
Chris Lattner	5255d04357	Simplify the interface to the schedulers, to not pass the selected heuristicin. llvm-svn: 26692	2006-03-10 07:49:12 +00:00
Chris Lattner	213209a248	remove dbg_declare, it's not used yet. llvm-svn: 26659	2006-03-09 20:02:42 +00:00
Jim Laskey	2698f0de7a	Get rid of the multiple copies of getStringValue. Now a Constant:: method. llvm-svn: 26616	2006-03-08 18:11:07 +00:00
Chris Lattner	543832d39d	Change the interface for getting a target HazardRecognizer to be more clean. llvm-svn: 26608	2006-03-08 04:25:59 +00:00
Chris Lattner	47639dbb93	Hoist the HazardRecognizer out of the ScheduleDAGList.cpp file to where targets can implement them. Make the top-down scheduler non-g5-specific. Remove the old testing hazard recognizer. llvm-svn: 26569	2006-03-06 00:22:00 +00:00
Chris Lattner	98ecb8ec61	Split the list scheduler into top-down and bottom-up pieces. The priority function of the top-down scheduler are completely bogus currently, and having (future) PPC specific in this file is also wrong, but this is a small incremental step. llvm-svn: 26552	2006-03-05 21:10:33 +00:00
Chris Lattner	5c1ba2ac08	Codegen copysign[f] into a FCOPYSIGN node llvm-svn: 26542	2006-03-05 05:09:38 +00:00
Evan Cheng	3bf916ddd9	Add more vector NodeTypes: VSDIV, VUDIV, VAND, VOR, and VXOR. llvm-svn: 26504	2006-03-03 07:01:07 +00:00
Chris Lattner	ad3c974a77	remove the read/write port/io intrinsics. llvm-svn: 26479	2006-03-03 00:19:58 +00:00
Chris Lattner	093c159efb	Split memcpy/memset/memmove intrinsics into i32/i64 versions, resolving PR709, and paving the way for future progress. llvm-svn: 26476	2006-03-03 00:00:25 +00:00
Evan Cheng	b97aab4371	Vector ops lowering. llvm-svn: 26436	2006-03-01 01:09:54 +00:00
Chris Lattner	9fed5b6122	Add support for output memory constraints. llvm-svn: 26410	2006-02-27 23:45:39 +00:00
Jeff Cohen	83c22e0d75	Get VC++ building again. llvm-svn: 26351	2006-02-24 02:52:40 +00:00
Chris Lattner	dcf785bf46	Implement (most of) selection of inline asm memory operands. llvm-svn: 26350	2006-02-24 02:13:54 +00:00
Chris Lattner	7ef7a64ebb	Lower C_Memory operands. llvm-svn: 26346	2006-02-24 01:11:24 +00:00
Chris Lattner	e7c0ffb3a0	Fix an endianness problem on big-endian targets with expanded operands to inline asms. Mark some methods const. llvm-svn: 26334	2006-02-23 20:06:57 +00:00
Chris Lattner	571d9647c6	Record all of the expanded registers in the DAG and machine instr, fixing several bugs in inline asm expanded operands. llvm-svn: 26332	2006-02-23 19:21:04 +00:00
Chris Lattner	b1124f3c76	This fixes a couple of problems with expansion llvm-svn: 26318	2006-02-22 23:09:03 +00:00
Chris Lattner	6f87d18be9	Change a whole bunch of code to be built around RegsForValue instead of a single register number. This fully implements promotion for inline asms, expand is close but not quite right yet. llvm-svn: 26316	2006-02-22 22:37:12 +00:00
Chris Lattner	7ad77dfc2a	split register class handling from explicit physreg handling. llvm-svn: 26308	2006-02-22 00:56:39 +00:00
Chris Lattner	5c79f98f15	Adjust to changes in getRegForInlineAsmConstraint prototype llvm-svn: 26306	2006-02-21 23:12:12 +00:00
Evan Cheng	c3dcf5a4d7	Dumb bug. Code sees a memcpy from X+c so it increments src offset. But it turns out not to point to a constant string but it forgot change the offset back. llvm-svn: 26242	2006-02-16 23:11:42 +00:00
Evan Cheng	42c01c8d39	If the false case is the current basic block, then this is a self loop. We do not want to emit "Loop: ... brcond Out; br Loop", as it adds an extra instruction in the loop. Instead, invert the condition and emit "Loop: ... br!cond Loop; br Out. Generalize the fix by moving it from PPCDAGToDAGISel to SelectionDAGLowering. llvm-svn: 26231	2006-02-16 08:27:56 +00:00
Evan Cheng	93e4865d4b	Remove an unused function parameter. llvm-svn: 26221	2006-02-15 22:12:35 +00:00
Evan Cheng	6781b6e62e	Turn a memcpy from string constant into a series of stores of constant values. llvm-svn: 26219	2006-02-15 21:59:04 +00:00
Evan Cheng	e2038bdeee	Lower memcpy with small constant size operand into a series of load / store ops. llvm-svn: 26195	2006-02-15 01:54:51 +00:00
Evan Cheng	0451499b3c	Doh again! llvm-svn: 26188	2006-02-14 23:05:54 +00:00
Evan Cheng	db2a7a736a	Keep to < 80 cols llvm-svn: 26177	2006-02-14 20:12:38 +00:00
Evan Cheng	038521ef76	Missed a break so memcpy cases fell through to memset. Doh. llvm-svn: 26176	2006-02-14 19:45:56 +00:00
Evan Cheng	d502610604	Fixed a build breakage. llvm-svn: 26175	2006-02-14 09:11:59 +00:00
Evan Cheng	4b40a42653	Rename maxStoresPerMemSet to maxStoresPerMemset, etc. llvm-svn: 26174	2006-02-14 08:38:30 +00:00
Evan Cheng	81fcea8aa2	Expand memset dst, c, size to a series of stores if size falls below the target specific theshold, e.g. 16 for x86. llvm-svn: 26171	2006-02-14 08:22:34 +00:00
Chris Lattner	1784a9d267	now that libcalls don't suck, we can remove this hack llvm-svn: 26164	2006-02-14 05:39:35 +00:00
Jim Laskey	390c63e9d9	Rename to better reflect usage (current and planned.) llvm-svn: 26145	2006-02-13 12:50:39 +00:00
Jim Laskey	5995d0160c	Reorg for integration with gcc4. Old style debug info will not be passed though to SelIDAG. llvm-svn: 26115	2006-02-11 01:01:30 +00:00
Evan Cheng	f9adce90bf	Get rid of some memory leaks identified by Valgrind llvm-svn: 25960	2006-02-04 06:49:00 +00:00
Chris Lattner	3b48431333	Add initial support for immediates. This allows us to compile this: int %rlwnm(int %A, int %B) { %C = call int asm "rlwnm $0, $1, $2, $3, $4", "=r,r,r,n,n"(int %A, int %B, int 4, int 17) ret int %C } into: _rlwnm: or r2, r3, r3 or r3, r4, r4 rlwnm r2, r2, r3, 4, 17 ;; note the immediates :) or r3, r2, r2 blr llvm-svn: 25955	2006-02-04 02:26:14 +00:00
Chris Lattner	65ad53feb3	Initial early support for non-register operands, like immediates llvm-svn: 25952	2006-02-04 02:16:44 +00:00
Chris Lattner	f68fd20286	remove some #ifdef'd out code, which should properly be in the dag combiner anyway. llvm-svn: 25941	2006-02-03 20:13:59 +00:00
Chris Lattner	7f5880b1c7	Implement matching constraints. We can now say things like this: %C = call int asm "xyz $0, $1, $2, $3", "=r,r,r,0"(int %A, int %B, int 4) and get: xyz r2, r3, r4, r2 note that the r2's are pinned together. Yaay for 2-address instructions. 2342 ---------------------------------------------------------------------- llvm-svn: 25893	2006-02-02 00:25:23 +00:00
Chris Lattner	1558fc64f9	Implement simple register assignment for inline asms. This allows us to compile: int %test(int %A, int %B) { %C = call int asm "xyz $0, $1, $2", "=r,r,r"(int %A, int %B) ret int %C } into: (0x8906130, LLVM BB @0x8902220): %r2 = OR4 %r3, %r3 %r3 = OR4 %r4, %r4 INLINEASM <es:xyz $0, $1, $2>, %r2<def>, %r2, %r3 %r3 = OR4 %r2, %r2 BLR which asmprints as: _test: or r2, r3, r3 or r3, r4, r4 xyz $0, $1, $2 ;; need to print the operands now :) or r3, r2, r2 blr llvm-svn: 25878	2006-02-01 18:59:47 +00:00
Chris Lattner	3a5ed55187	adjust to changes in InlineAsm interface. Fix a few minor bugs. llvm-svn: 25865	2006-02-01 01:28:23 +00:00
Chris Lattner	2e56e89452	Handle physreg input/outputs. We now compile this: int %test_cpuid(int %op) { %B = alloca int %C = alloca int %D = alloca int %A = call int asm "cpuid", "=eax,==ebx,==ecx,==edx,eax"(int* %B, int* %C, int* %D, int %op) %Bv = load int* %B %Cv = load int* %C %Dv = load int* %D %x = add int %A, %Bv %y = add int %x, %Cv %z = add int %y, %Dv ret int %z } to this: _test_cpuid: sub %ESP, 16 mov DWORD PTR [%ESP], %EBX mov %EAX, DWORD PTR [%ESP + 20] cpuid mov DWORD PTR [%ESP + 8], %ECX mov DWORD PTR [%ESP + 12], %EBX mov DWORD PTR [%ESP + 4], %EDX mov %ECX, DWORD PTR [%ESP + 12] add %EAX, %ECX mov %ECX, DWORD PTR [%ESP + 8] add %EAX, %ECX mov %ECX, DWORD PTR [%ESP + 4] add %EAX, %ECX mov %EBX, DWORD PTR [%ESP] add %ESP, 16 ret ... note the proper register allocation. :) it is unclear to me why the loads aren't folded into the adds. llvm-svn: 25827	2006-01-31 02:03:41 +00:00
Chris Lattner	98ed05c81d	remove method I just added llvm-svn: 25728	2006-01-28 03:43:09 +00:00
Chris Lattner	43b867dd3b	add a new callback llvm-svn: 25727	2006-01-28 03:37:03 +00:00
Nate Begeman	595ec734fc	Implement Promote for VAARG, and allow it to be custom promoted for people who don't want the default behavior (Alpha). llvm-svn: 25726	2006-01-28 03:14:31 +00:00
Nate Begeman	8c47c3a3b1	Remove TLI.LowerReturnTo, and just let targets custom lower ISD::RET for the same functionality. This addresses another piece of bug 680. Next, on to fixing Alpha VAARG, which I broke last time. llvm-svn: 25696	2006-01-27 21:09:22 +00:00
Chris Lattner	476e67be14	initial selectiondag support for new INLINEASM node. Note that inline asms with outputs or inputs are not supported yet. :) llvm-svn: 25664	2006-01-26 22:24:51 +00:00
Nate Begeman	e74795cd70	First part of bug 680: Remove TLI.LowerVA* and replace it with SDNodes that are lowered the same way as everything else. llvm-svn: 25606	2006-01-25 18:21:52 +00:00
Evan Cheng	a6eff8a432	If scheduler choice is the default (-sched=default), use target scheduling preference to determine which scheduler to use. SchedulingForLatency == Breadth first; SchedulingForRegPressure == bottom up register reduction list scheduler. llvm-svn: 25599	2006-01-25 09:12:57 +00:00
Jim Laskey	b8566fa10a	Typo. llvm-svn: 25545	2006-01-23 13:34:04 +00:00
Evan Cheng	31272347d4	Skeleton of the list schedule. llvm-svn: 25544	2006-01-23 08:26:10 +00:00
Evan Cheng	c1e1d9724d	Factor out more instruction scheduler code to the base class. llvm-svn: 25532	2006-01-23 07:01:07 +00:00
Chris Lattner	deda32a786	Fix bugs lowering stackrestore, fixing 2004-08-12-InlinerAndAllocas.c on PPC. llvm-svn: 25522	2006-01-23 05:22:07 +00:00
Chris Lattner	e23928c67f	Fix a bug in a recent refactor that caused a bunch of programs to miscompile or the compiler to crash. llvm-svn: 25503	2006-01-21 19:12:11 +00:00
Evan Cheng	739a6a456e	Do some code refactoring on Jim's scheduler in preparation of the new list scheduler. llvm-svn: 25493	2006-01-21 02:32:06 +00:00
Chris Lattner	222ceabbee	If the target doesn't support f32 natively, insert the FP_EXTEND in target-indep code, so that the LowerReturn code doesn't have to handle it. llvm-svn: 25482	2006-01-20 18:38:32 +00:00
Chris Lattner	e2ee190821	Temporary work around for a libcall insertion bug: If a target doesn't support FSIN/FCOS nodes, do not lower sin/cos to them. llvm-svn: 25425	2006-01-18 21:50:14 +00:00
Robert Bocchino	03e95af9f7	Support for the insertelement operation. llvm-svn: 25405	2006-01-17 20:06:42 +00:00
Reid Spencer	b4f9a6f110	For PR411: This patch is an incremental step towards supporting a flat symbol table. It de-overloads the intrinsic functions by providing type-specific intrinsics and arranging for automatically upgrading from the old overloaded name to the new non-overloaded name. Specifically: llvm.isunordered -> llvm.isunordered.f32, llvm.isunordered.f64 llvm.sqrt -> llvm.sqrt.f32, llvm.sqrt.f64 llvm.ctpop -> llvm.ctpop.i8, llvm.ctpop.i16, llvm.ctpop.i32, llvm.ctpop.i64 llvm.ctlz -> llvm.ctlz.i8, llvm.ctlz.i16, llvm.ctlz.i32, llvm.ctlz.i64 llvm.cttz -> llvm.cttz.i8, llvm.cttz.i16, llvm.cttz.i32, llvm.cttz.i64 New code should not use the overloaded intrinsic names. Warnings will be emitted if they are used. llvm-svn: 25366	2006-01-16 21:12:35 +00:00
Nate Begeman	542c3c17a9	Remove some duplicated code llvm-svn: 25313	2006-01-14 03:18:27 +00:00
Nate Begeman	2fba8a3aaa	bswap implementation llvm-svn: 25312	2006-01-14 03:14:10 +00:00
Chris Lattner	b32664583b	Compile llvm.stacksave/restore into STACKSAVE/STACKRESTORE nodes, and allow targets to custom expand them as they desire. llvm-svn: 25273	2006-01-13 02:50:02 +00:00
Chris Lattner	6c9c250dcd	Add "support" for stacksave/stackrestore to the dag isel llvm-svn: 25268	2006-01-13 02:24:42 +00:00
Robert Bocchino	2c966e7617	Added selection DAG support for the extractelement operation. llvm-svn: 25179	2006-01-10 19:04:57 +00:00
Jim Laskey	219d559824	Applied some recommend changes from sabre. The dominate one beginning "let the pass manager do it's thing." Fixes crash when compiling -g files and suppresses dwarf statements if no debug info is present. llvm-svn: 25100	2006-01-04 22:28:25 +00:00
Chris Lattner	44c07ed61a	enable the gep isel opt llvm-svn: 24910	2005-12-21 19:36:36 +00:00
Chris Lattner	803a575616	Lower ConstantAggregateZero into zeros llvm-svn: 24890	2005-12-21 02:43:26 +00:00
Jim Laskey	7c462768ed	Added source file/line correspondence for dwarf (PowerPC only at this point.) llvm-svn: 24748	2005-12-16 22:45:29 +00:00
Chris Lattner	5d4e61dd87	Don't lump the filename and working dir together llvm-svn: 24697	2005-12-13 17:40:33 +00:00
Chris Lattner	9e8b633ec1	Accept and ignore prefetches for now llvm-svn: 24678	2005-12-12 22:51:16 +00:00
Chris Lattner	f1a54c0d14	Minor tweak to get isel opt llvm-svn: 24663	2005-12-11 09:05:13 +00:00
Chris Lattner	be73d6eece	improve code insertion in two ways: 1. Only forward subst offsets into loads and stores, not into arbitrary things, where it will likely become a load. 2. If the source is a cast from pointer, forward subst the cast as well, allowing us to fold the cast away (improving cases when the cast is from an alloca or global). This hasn't been fully tested, but does appear to further reduce register pressure and improve code. Lets let the testers grind on it a bit. :) llvm-svn: 24640	2005-12-08 08:00:12 +00:00
Nate Begeman	ae89d862f5	Fix a crash where ConstantVec nodes were being generated with the wrong type when the target did not support them. Also teach Legalize how to expand ConstantVecs. This allows us to generate _test: lwz r2, 12(r3) lwz r4, 8(r3) lwz r5, 4(r3) lwz r6, 0(r3) addi r2, r2, 4 addi r4, r4, 3 addi r5, r5, 2 addi r6, r6, 1 stw r2, 12(r3) stw r4, 8(r3) stw r5, 4(r3) stw r6, 0(r3) blr For: void %test(%v4i %P) { %T = load %v4i %P %S = add %v4i %T, <int 1, int 2, int 3, int 4> store %v4i %S, %v4i * %P ret void } On PowerPC. llvm-svn: 24633	2005-12-07 19:48:11 +00:00
Nate Begeman	41b1cdc771	Teach the SelectionDAG ISel how to turn ConstantPacked values into constant nodes with vector types. Also teach the asm printer how to print ConstantPacked constant pool entries. This allows us to generate altivec code such as the following, which adds a vector constantto a packed float. LCPI1_0: <4 x float> < float 0.0e+0, float 0.0e+0, float 0.0e+0, float 1.0e+0 > .space 4 .space 4 .space 4 .long 1065353216 ; float 1 .text .align 4 .globl _foo _foo: lis r2, ha16(LCPI1_0) la r2, lo16(LCPI1_0)(r2) li r4, 0 lvx v0, r4, r2 lvx v1, r4, r3 vaddfp v0, v1, v0 stvx v0, r4, r3 blr For the llvm code: void %foo(<4 x float> * %a) { entry: %tmp1 = load <4 x float> * %a; %tmp2 = add <4 x float> %tmp1, < float 0.0, float 0.0, float 0.0, float 1.0 > store <4 x float> %tmp2, <4 x float> *%a ret void } llvm-svn: 24616	2005-12-06 06:18:55 +00:00
Chris Lattner	3539778883	Fix the #1 code quality problem that I have seen on X86 (and it also affects PPC and other targets). In a particular, consider code like this: struct Vector3 { double x, y, z; }; struct Matrix3 { Vector3 a, b, c; }; double dot(Vector3 &a, Vector3 &b) { return a.x * b.x + a.y * b.y + a.z * b.z; } Vector3 mul(Vector3 &a, Matrix3 &b) { Vector3 r; r.x = dot( a, b.a ); r.y = dot( a, b.b ); r.z = dot( a, b.c ); return r; } void transform(Matrix3 &m, Vector3 *x, int n) { for (int i = 0; i < n; i++) x[i] = mul( x[i], m ); } we compile transform to a loop with all of the GEP instructions for indexing into 'm' pulled out of the loop (9 of them). Because isel occurs a bb at a time we are unable to fold the constant index into the loads in the loop, leading to PPC code that looks like this: LBB3_1: ; no_exit.preheader li r2, 0 addi r6, r3, 64 ;; 9 values live across the loop body! addi r7, r3, 56 addi r8, r3, 48 addi r9, r3, 40 addi r10, r3, 32 addi r11, r3, 24 addi r12, r3, 16 addi r30, r3, 8 LBB3_2: ; no_exit lfd f0, 0(r30) lfd f1, 8(r4) fmul f0, f1, f0 lfd f2, 0(r3) ;; no constant indices folded into the loads! lfd f3, 0(r4) lfd f4, 0(r10) lfd f5, 0(r6) lfd f6, 0(r7) lfd f7, 0(r8) lfd f8, 0(r9) lfd f9, 0(r11) lfd f10, 0(r12) lfd f11, 16(r4) fmadd f0, f3, f2, f0 fmul f2, f1, f4 fmadd f0, f11, f10, f0 fmadd f2, f3, f9, f2 fmul f1, f1, f6 stfd f0, 0(r4) fmadd f0, f11, f8, f2 fmadd f1, f3, f7, f1 stfd f0, 8(r4) fmadd f0, f11, f5, f1 addi r29, r4, 24 stfd f0, 16(r4) addi r2, r2, 1 cmpw cr0, r2, r5 or r4, r29, r29 bne cr0, LBB3_2 ; no_exit uh, yuck. With this patch, we now sink the constant offsets into the loop, producing this code: LBB3_1: ; no_exit.preheader li r2, 0 LBB3_2: ; no_exit lfd f0, 8(r3) lfd f1, 8(r4) fmul f0, f1, f0 lfd f2, 0(r3) lfd f3, 0(r4) lfd f4, 32(r3) ;; much nicer. lfd f5, 64(r3) lfd f6, 56(r3) lfd f7, 48(r3) lfd f8, 40(r3) lfd f9, 24(r3) lfd f10, 16(r3) lfd f11, 16(r4) fmadd f0, f3, f2, f0 fmul f2, f1, f4 fmadd f0, f11, f10, f0 fmadd f2, f3, f9, f2 fmul f1, f1, f6 stfd f0, 0(r4) fmadd f0, f11, f8, f2 fmadd f1, f3, f7, f1 stfd f0, 8(r4) fmadd f0, f11, f5, f1 addi r6, r4, 24 stfd f0, 16(r4) addi r2, r2, 1 cmpw cr0, r2, r5 or r4, r6, r6 bne cr0, LBB3_2 ; no_exit This is much nicer as it reduces register pressure in the loop a lot. On X86, this takes the function from having 9 spilled registers to 2. This should help some spec programs on X86 (gzip?) This is currently only enabled with -enable-gep-isel-opt to allow perf testing tonight. llvm-svn: 24606	2005-12-05 07:10:48 +00:00
Chris Lattner	8782b782cd	dbg.stoppoint returns a value, don't forget to init it llvm-svn: 24583	2005-12-03 18:50:48 +00:00
Nate Begeman	1064d6ec43	First chunk of actually generating vector code for packed types. These changes allow us to generate the following code: _foo: li r2, 0 lvx v0, r2, r3 vaddfp v0, v0, v0 stvx v0, r2, r3 blr for this llvm: void %foo(<4 x float>* %a) { entry: %tmp1 = load <4 x float>* %a %tmp2 = add <4 x float> %tmp1, %tmp1 store <4 x float> %tmp2, <4 x float>* %a ret void } llvm-svn: 24534	2005-11-30 08:22:07 +00:00
Reid Spencer	3fd1b4c9bf	Fix a problem with llvm-ranlib that (on some platforms) caused the archive file to become corrupted due to interactions between mmap'd memory segments and file descriptors closing. The problem is completely avoiding by using a third temporary file. Patch provided by Evan Jones llvm-svn: 24527	2005-11-30 05:21:10 +00:00
Chris Lattner	435b402e1f	Add support for a new STRING and LOCATION node for line number support, patch contributed by Daniel Berlin, with a few cleanups here and there by me. llvm-svn: 24515	2005-11-29 06:21:05 +00:00
Nate Begeman	d37c13154a	Check in code to scalarize arbitrarily wide packed types for some simple vector operations (load, add, sub, mul). This allows us to codegen: void %foo(<4 x float> * %a) { entry: %tmp1 = load <4 x float> * %a; %tmp2 = add <4 x float> %tmp1, %tmp1 store <4 x float> %tmp2, <4 x float> *%a ret void } on ppc as: _foo: lfs f0, 12(r3) lfs f1, 8(r3) lfs f2, 4(r3) lfs f3, 0(r3) fadds f0, f0, f0 fadds f1, f1, f1 fadds f2, f2, f2 fadds f3, f3, f3 stfs f0, 12(r3) stfs f1, 8(r3) stfs f2, 4(r3) stfs f3, 0(r3) blr llvm-svn: 24484	2005-11-22 18:16:00 +00:00
Nate Begeman	07890bbec4	Rather than attempting to legalize 1 x float, make sure the SD ISel never generates it. Make MVT::Vector expand-only, and remove the code in Legalize that attempts to legalize it. The plan for supporting N x Type is to continually epxand it in ExpandOp until it gets down to 2 x Type, where it will be scalarized into a pair of scalars. llvm-svn: 24482	2005-11-22 01:29:36 +00:00
Chris Lattner	19baba67b5	Unbreak codegen of bools. This should fix the llc/jit/llc-beta failures from last night. llvm-svn: 24427	2005-11-19 18:40:42 +00:00
Nate Begeman	b2e089c31b	Teach LLVM how to scalarize packed types. Currently, this only works on packed types with an element count of 1, although more generic support is coming. This allows LLVM to turn the following code: void %foo(<1 x float> * %a) { entry: %tmp1 = load <1 x float> * %a; %tmp2 = add <1 x float> %tmp1, %tmp1 store <1 x float> %tmp2, <1 x float> *%a ret void } Into: _foo: lfs f0, 0(r3) fadds f0, f0, f0 stfs f0, 0(r3) blr llvm-svn: 24416	2005-11-19 00:36:38 +00:00
Nate Begeman	127321b14c	Split out the shift code from visitBinary. llvm-svn: 24412	2005-11-18 07:42:56 +00:00
Chris Lattner	f2b62f317c	when debugging lower dbg intrinsics to calls llvm-svn: 24377	2005-11-16 07:22:30 +00:00
Andrew Lenharth	de1b5d6baa	added a chain output llvm-svn: 24306	2005-11-11 22:48:54 +00:00
Andrew Lenharth	01aa56397d	continued readcyclecounter support llvm-svn: 24300	2005-11-11 16:47:30 +00:00
Chris Lattner	cd6f0f47f2	Refactor intrinsic lowering stuff out of visitCall llvm-svn: 24261	2005-11-09 19:44:01 +00:00
Chris Lattner	41fd6d5d27	Fix CodeGen/X86/shift-folding.ll:test3 on X86 llvm-svn: 24256	2005-11-09 16:50:40 +00:00
Chris Lattner	b7cad90e55	Avoid creating a token factor node in trivially redundant cases. This eliminates almost one node per block in common cases. llvm-svn: 24254	2005-11-09 05:03:03 +00:00
Chris Lattner	43535a19b1	Handle GEP's a bit more intelligently. Fold constant indices early and turn power-of-two multiplies into shifts early to improve compile time. llvm-svn: 24253	2005-11-09 04:45:33 +00:00
Nate Begeman	3ee3e69556	Add the necessary support to the ISel to allow targets to codegen the new alignment information appropriately. Includes code for PowerPC to support fixed-size allocas with alignment larger than the stack. Support for arbitrarily aligned dynamic allocas coming soon. llvm-svn: 24224	2005-11-06 09:00:38 +00:00
Chris Lattner	6871b23d02	Significantly simplify this code and make it more aggressive. Instead of having a special case hack for X86, make the hack more general: if an incoming argument register is not used in any block other than the entry block, don't copy it to a vreg. This helps us compile code like this: %struct.foo = type { int, int, [0 x ubyte] } int %test(%struct.foo* %X) { %tmp1 = getelementptr %struct.foo* %X, int 0, uint 2, int 100 %tmp = load ubyte* %tmp1 ; <ubyte> [#uses=1] %tmp2 = cast ubyte %tmp to int ; <int> [#uses=1] ret int %tmp2 } to: _test: lbz r3, 108(r3) blr instead of: _test: lbz r2, 108(r3) or r3, r2, r2 blr The (dead) copy emitted to copy r3 into a vreg for extra-block uses was increasing the live range of r3 past the load, preventing the coallescing. This implements CodeGen/PowerPC/reg-coallesce-simple.ll llvm-svn: 24115	2005-10-30 19:42:35 +00:00
Nate Begeman	78afac2ddd	Add the ability to lower return instructions to TargetLowering. This allows us to lower legal return types to something else, to meet ABI requirements (such as that i64 be returned in two i32 regs on Darwin/ppc). llvm-svn: 23802	2005-10-18 23:23:37 +00:00
Chris Lattner	0a71a9ac86	Fix Generic/2005-10-18-ZeroSizeStackObject.ll by not requesting a zero sized stack object if either the array size or the type size is zero. llvm-svn: 23801	2005-10-18 22:14:06 +00:00
Chris Lattner	8396a308a7	remove hack llvm-svn: 23797	2005-10-18 22:11:42 +00:00
Chris Lattner	bcfebebf22	Enable Nate's excellent DAG combiner work by default. This allows the removal of a bunch of ad-hoc and crufty code from SelectionDAG.cpp. llvm-svn: 23682	2005-10-10 16:47:10 +00:00
Chris Lattner	6bd8fd09b6	make sure that -view-isel-dags is the input to the isel, not the input to the second phase of dag combining llvm-svn: 23631	2005-10-05 06:09:10 +00:00
Jeff Cohen	f8a5e5ae6e	Fix VC++ warnings. llvm-svn: 23579	2005-10-01 03:57:14 +00:00
Chris Lattner	6f3b577ee6	Add FP versions of the binary operators, keeping the int and fp worlds seperate. Though I have done extensive testing, it is possible that this will break things in configs I can't test. Please let me know if this causes a problem and I'll fix it ASAP. llvm-svn: 23504	2005-09-28 22:28:18 +00:00
Chris Lattner	0fd8f9fbc9	If the target prefers it, use _setjmp/_longjmp should be used instead of setjmp/longjmp for llvm.setjmp/llvm.longjmp. llvm-svn: 23481	2005-09-27 22:15:53 +00:00
Chris Lattner	d4382f0afa	If a function has liveins, and if the target requested that they be plopped into particular vregs, emit copies into the entry MBB. llvm-svn: 23331	2005-09-13 19:30:54 +00:00
Nate Begeman	007c650699	Add an option to the DAG Combiner to enable it for beta runs, and turn on that option for PowerPC's beta. llvm-svn: 23253	2005-09-07 00:15:36 +00:00
Chris Lattner	b0b4ec5655	Don't create zero sized stack objects even for array allocas with a zero number of elements. llvm-svn: 23219	2005-09-02 18:41:28 +00:00
Chris Lattner	b6cde17d29	Fix the release build, noticed by Eric van Riet Paap llvm-svn: 23215	2005-09-02 07:09:28 +00:00
Chris Lattner	a66403dbf7	For values that are live across basic blocks and need promotion, use ANY_EXTEND instead of ZERO_EXTEND to eliminate extraneous extensions. This eliminates dead zero extensions on formal arguments and other cases on PPC, implementing the newly tightened up test/Regression/CodeGen/PowerPC/small-arguments.ll test. llvm-svn: 23205	2005-09-02 00:19:37 +00:00
Chris Lattner	975f5c9f46	It is NDEBUG not _NDEBUG llvm-svn: 23186	2005-09-01 18:44:10 +00:00
Chris Lattner	075250bda1	Disable this code, which broke many tests last night llvm-svn: 23114	2005-08-27 16:16:51 +00:00
Chris Lattner	e7a2998064	Don't copy regs that are only used in the entry block into a vreg. This changes the code generated for: short %test(short %A) { %B = xor short %A, -32768 ret short %B } to: _test: xori r2, r3, 32768 xoris r2, r2, 65535 extsh r3, r2 blr instead of: _test: rlwinm r2, r3, 0, 16, 31 xori r2, r3, 32768 xoris r2, r2, 65535 extsh r3, r2 blr llvm-svn: 23109	2005-08-26 22:49:59 +00:00
Chris Lattner	13d7c252e5	Call the InsertAtEndOfBasicBlock hook if the usesCustomDAGSchedInserter flag is set on an instruction. llvm-svn: 23098	2005-08-26 20:54:47 +00:00
Chris Lattner	99282c7b92	Make -view-isel-dags show the dag before instruction selecting, in case the target isel crashes due to unimplemented features like calls :) llvm-svn: 22997	2005-08-24 00:34:29 +00:00
Chris Lattner	7f9e078d11	Fix a problem where constant expr shifts would not have their shift amount promoted to the right type. This fixes: IA64/2005-08-22-LegalizerCrash.ll llvm-svn: 22969	2005-08-22 17:28:31 +00:00
Chris Lattner	1a908c8920	Enable critical edge splitting by default llvm-svn: 22863	2005-08-18 17:35:14 +00:00
Chris Lattner	c9950c11a9	Add a new beta option for critical edge splitting, to avoid a problem that Nate noticed in yacr2 (and I know occurs in other places as well). This is still rough, as the critical edge blocks are not intelligently placed but is added to get some idea to see if this improves performance. llvm-svn: 22825	2005-08-17 06:37:43 +00:00
Chris Lattner	ba28c2733f	Fix a regression on X86, where FP values can be promoted too. llvm-svn: 22822	2005-08-17 06:06:25 +00:00
Chris Lattner	33182325f5	Eliminate the RegSDNode class, which 3 nodes (CopyFromReg/CopyToReg/ImplicitDef) used to tack a register number onto the node. Instead of doing this, make a new node, RegisterSDNode, which is a leaf containing a register number. These three operations just become normal DAG nodes now, instead of requiring special handling. Note that with this change, it is no longer correct to make illegal CopyFromReg/CopyToReg nodes. The legalizer will not touch them, and this is bad, so don't do it. :) llvm-svn: 22806	2005-08-16 21:55:35 +00:00
Chris Lattner	d47675ed24	Eliminate the SetCCSDNode in favor of a CondCodeSDNode class. This pulls the CC out of the SetCC operation, making SETCC a standard ternary operation and CC's a standard DAG leaf. This will make it possible for other node to use CC's as operands in the future... llvm-svn: 22728	2005-08-09 20:20:18 +00:00
Jeff Cohen	5f4ef3c5a8	Eliminate all remaining tabs and trailing spaces. llvm-svn: 22523	2005-07-27 06:12:32 +00:00
Nate Begeman	1ac40a1245	Remove unnecessary FP_EXTEND. This causes worse codegen for SSE. llvm-svn: 22469	2005-07-19 16:50:03 +00:00
Chris Lattner	f5473e44a9	Make several cleanups to Andrews varargs change: 1. Pass Value*'s into lowering methods so that the proper pointers can be added to load/stores from the valist 2. Intrinsics that return void should only return a token chain, not a token chain/retval pair. 3. Rename LowerVAArgNext -> LowerVAArg, because VANext is long gone. llvm-svn: 22338	2005-07-05 19:57:53 +00:00
Andrew Lenharth	2edc1881ac	restore old srcValueNode behavior and try to to work around it llvm-svn: 22315	2005-06-29 18:54:02 +00:00
Andrew Lenharth	8192568fbc	tracking the instructions causing loads and stores provides more information than just the pointer being loaded or stored llvm-svn: 22311	2005-06-29 15:57:19 +00:00
Andrew Lenharth	253145299b	If we support structs as va_list, we must pass pointers to them to va_copy See last commit for LangRef, this implements it on all targets. llvm-svn: 22273	2005-06-22 21:04:42 +00:00
Andrew Lenharth	9144ec4764	core changes for varargs llvm-svn: 22254	2005-06-18 18:34:52 +00:00
Chris Lattner	e4f71d036f	Fix construction of ioport intrinsics, fixing X86/io.llx and io-port.llx llvm-svn: 22026	2005-05-14 13:56:55 +00:00
Chris Lattner	96c262e24b	Eliminate special purpose hacks for dynamic_stack_alloc. llvm-svn: 22015	2005-05-14 07:29:57 +00:00
Chris Lattner	29dcc71d83	LowerOperation takes a dag llvm-svn: 22004	2005-05-14 05:50:48 +00:00
Chris Lattner	cbefe72fb2	Align doubles on 8-byte boundaries if possible. llvm-svn: 21993	2005-05-13 23:14:17 +00:00
Chris Lattner	2e77db6af6	Add an isTailCall flag to LowerCallTo llvm-svn: 21958	2005-05-13 18:50:42 +00:00
Chris Lattner	d0b0ecca3f	Emit function entry code after lowering hte arguments. llvm-svn: 21931	2005-05-13 07:33:32 +00:00
Chris Lattner	0220b2952f	Allow targets to emit code into the entry block of each function llvm-svn: 21930	2005-05-13 07:23:21 +00:00
Chris Lattner	111778e665	Pass calling convention to use into lower call to llvm-svn: 21900	2005-05-12 19:56:57 +00:00
Chris Lattner	490769c5b6	wrap long line llvm-svn: 21870	2005-05-11 18:57:06 +00:00
Chris Lattner	2d8b55c476	The semantics of cast X to bool are a comparison against zero, not a truncation! llvm-svn: 21833	2005-05-09 22:17:13 +00:00
Chris Lattner	20eaeae966	Add support for matching the READPORT, WRITEPORT, READIO, WRITEIO intrinsics llvm-svn: 21825	2005-05-09 20:22:36 +00:00
Chris Lattner	57d294f2ac	Don't use the load/store instruction as the source pointer, use the pointer being stored/loaded through! llvm-svn: 21806	2005-05-09 04:28:51 +00:00
Chris Lattner	f5675a0813	wrap long lines llvm-svn: 21804	2005-05-09 04:08:33 +00:00
Chris Lattner	7876156ba0	When hitting an unsupported intrinsic, actually print it Lower debug info to noops. llvm-svn: 21698	2005-05-05 17:55:17 +00:00
Andrew Lenharth	5e177826fd	Implement count leading zeros (ctlz), count trailing zeros (cttz), and count population (ctpop). Generic lowering is implemented, however only promotion is implemented for SelectionDAG at the moment. More coming soon. llvm-svn: 21676	2005-05-03 17:19:30 +00:00
Chris Lattner	8002640eab	Codegen and legalize sin/cos/llvm.sqrt as FSIN/FCOS/FSQRT calls. This patch was contributed by Morten Ofstad, with some minor tweaks and bug fixes added by me. llvm-svn: 21636	2005-04-30 04:43:14 +00:00
Andrew Lenharth	4a73c2cfdc	Implement Value* tracking for loads and stores in the selection DAG. This enables one to use alias analysis in the backends. (TRUNK)Stores and (EXT\|ZEXT\|SEXT)Loads have an extra SDOperand which is a SrcValueSDNode which contains the Value. Note that if the operation is introduced by the backend, it will still have the operand, but the value will be null. llvm-svn: 21599	2005-04-27 20:10:01 +00:00
Misha Brukman	774511633d	Convert tabs to spaces llvm-svn: 21439	2005-04-22 04:01:18 +00:00
Misha Brukman	835702a094	Remove trailing whitespace llvm-svn: 21420	2005-04-21 22:36:52 +00:00
Nate Begeman	af1c0f7a00	Fold shift by size larger than type size to undef Make llvm undef values generate ISD::UNDEF nodes llvm-svn: 21261	2005-04-12 23:12:17 +00:00
Chris Lattner	8a98c7f337	Emit BRCONDTWOWAY when possible. llvm-svn: 21167	2005-04-09 03:30:29 +00:00
Chris Lattner	0c14000760	transform fabs/fabsf calls into FABS nodes. llvm-svn: 21014	2005-04-02 05:26:53 +00:00
Chris Lattner	f68fd0b533	Turn -0.0 - X -> fneg llvm-svn: 21011	2005-04-02 05:04:50 +00:00
Andrew Lenharth	dec53920b4	PCMarker support for DAG and Alpha llvm-svn: 20965	2005-03-31 21:24:06 +00:00
Chris Lattner	5ca31d9831	Instead of setting up the CFG edges at selectiondag construction time, set them up after the code has been emitted. This allows targets to select one mbb as multiple mbb's as needed. llvm-svn: 20937	2005-03-30 01:10:47 +00:00
Chris Lattner	db45f7d763	Fix a bug that andrew noticed where we do not correctly sign/zero extend returned integer values all of the way to 64-bits (we only did it to 32-bits leaving the top bits undefined). This causes problems for targets like alpha whose ABI's define the top bits too. llvm-svn: 20926	2005-03-29 19:09:56 +00:00
Nate Begeman	f656525cb6	Change interface to LowerCallTo to take a boolean isVarArg argument. llvm-svn: 20842	2005-03-26 01:29:23 +00:00
Chris Lattner	531f9e92d4	This mega patch converts us from using Function::a{iterator\|begin\|end} to using Function::arg_{iterator\|begin\|end}. Likewise Module::g* -> Module::global_*. This patch is contributed by Gabor Greif, thanks! llvm-svn: 20597	2005-03-15 04:54:21 +00:00
Misha Brukman	73e929f89d	Fix compilation errors with VS 2005, patch by Aaron Gray. llvm-svn: 20231	2005-02-17 21:39:27 +00:00
Chris Lattner	0c56a548ed	Don't sink argument loads into loops or other bad places. This disables folding of argument loads with instructions that are not in the entry block. llvm-svn: 20228	2005-02-17 19:40:32 +00:00
Chris Lattner	ffcb0ae329	Adjust to changes in SelectionDAG interface. llvm-svn: 19779	2005-01-23 04:36:26 +00:00
Chris Lattner	eccb73d57f	Get this to work for 64-bit systems. llvm-svn: 19763	2005-01-22 23:04:37 +00:00
Chris Lattner	96c26751ec	Support targets that do not use i8 shift amounts. llvm-svn: 19707	2005-01-19 22:31:21 +00:00
Chris Lattner	9f2c4a5200	Teach legalize to promote copy(from\|to)reg, instead of making the isel pass do it. This results in better code on X86 for floats (because if strict precision is not required, we can elide some more expensive double -> float conversions like the old isel did), and allows other targets to emit CopyFromRegs that are not legal for arguments. llvm-svn: 19668	2005-01-18 17:54:55 +00:00
Chris Lattner	b07e2d2084	Allow setcc operations to have nonbool types. llvm-svn: 19656	2005-01-18 02:52:03 +00:00
Chris Lattner	4d9651c760	Non-volatile loads can be freely reordered against each other. This fixes X86/reg-pressure.ll again, and allows us to do nice things in other cases. For example, we now codegen this sort of thing: int %loadload(int %X, int %Y) { %Z = load int* %Y %Y = load int* %X ;; load between %Z and store %Q = add int %Z, 1 store int %Q, int* %Y ret int %Y } Into this: loadload: mov %EAX, DWORD PTR [%ESP + 4] mov %EAX, DWORD PTR [%EAX] mov %ECX, DWORD PTR [%ESP + 8] inc DWORD PTR [%ECX] ret where we weren't able to form the 'inc [mem]' before. This also lets the instruction selector emit loads in any order it wants to, which can be good for register pressure as well. llvm-svn: 19644	2005-01-17 22:19:26 +00:00
Chris Lattner	4108bb01cf	Don't call SelectionDAG.getRoot() directly, go through a forwarding method. llvm-svn: 19642	2005-01-17 19:43:36 +00:00
Chris Lattner	e3c2cf4854	Implement a target independent optimization to codegen arguments only into the basic block that uses them if possible. This is a big win on X86, as it lets us fold the argument loads into instructions and reduce register pressure (by not loading all of the arguments in the entry block). For this (contrived to show the optimization) testcase: int %argtest(int %A, int %B) { %X = sub int 12345, %A br label %L L: %Y = add int %X, %B ret int %Y } we used to produce: argtest: mov %ECX, DWORD PTR [%ESP + 4] mov %EAX, 12345 sub %EAX, %ECX mov %EDX, DWORD PTR [%ESP + 8] .LBBargtest_1: # L add %EAX, %EDX ret now we produce: argtest: mov %EAX, 12345 sub %EAX, DWORD PTR [%ESP + 4] .LBBargtest_1: # L add %EAX, DWORD PTR [%ESP + 8] ret This also fixes the FIXME in the code. BTW, this occurs in real code. 164.gzip shrinks from 8623 to 8608 lines of .s file. The stack frame in huft_build shrinks from 1644->1628 bytes, inflate_codes shrinks from 116->108 bytes, and inflate_block from 2620->2612, due to fewer spills. Take that alkis. :-) llvm-svn: 19639	2005-01-17 17:55:19 +00:00
Chris Lattner	16f64df93a	Refactor code into a new method. llvm-svn: 19635	2005-01-17 17:15:02 +00:00
Chris Lattner	897cd7dc0a	add method stub llvm-svn: 19612	2005-01-16 07:28:41 +00:00
Chris Lattner	209f585033	Add support for promoted registers being live across blocks. llvm-svn: 19595	2005-01-16 02:23:07 +00:00
Chris Lattner	d58384fca6	Use the new TLI method to get this. llvm-svn: 19582	2005-01-16 01:11:19 +00:00
Chris Lattner	a8d34fb8c6	Add support for targets that require promotions. llvm-svn: 19579	2005-01-16 00:37:38 +00:00
Chris Lattner	1001c6e2cd	Add new SIGN_EXTEND_INREG, ZERO_EXTEND_INREG, and FP_ROUND_INREG operators. llvm-svn: 19568	2005-01-15 06:17:04 +00:00
Chris Lattner	3b8e719d1d	Adjust to CopyFromReg changes, implement deletion of truncating/extending stores/loads. llvm-svn: 19562	2005-01-14 22:38:01 +00:00
Chris Lattner	e727af06c8	Add new ImplicitDef node, rename CopyRegSDNode class to RegSDNode. llvm-svn: 19535	2005-01-13 20:50:02 +00:00
Chris Lattner	2451684678	Don't forget the existing root. llvm-svn: 19531	2005-01-13 19:53:14 +00:00
Chris Lattner	718b5c2f82	Codegen independent ops as being independent. llvm-svn: 19528	2005-01-13 17:59:43 +00:00
Chris Lattner	e05a461f1d	Add an option to view the selection dags as they are generated. llvm-svn: 19498	2005-01-12 03:41:21 +00:00
Chris Lattner	613f79fcbb	add an assertion, avoid creating copyfromreg/copytoreg pairs that are the same for PHI nodes. llvm-svn: 19484	2005-01-11 22:03:46 +00:00
Chris Lattner	875def9b71	Turn memset/memcpy/memmove into the corresponding operations. llvm-svn: 19463	2005-01-11 05:56:49 +00:00
Chris Lattner	a2c5d9168c	Handle static alloca arguments to PHI nodes. llvm-svn: 19409	2005-01-09 01:16:24 +00:00
Chris Lattner	58cfd7945d	Use new interfaces to correctly lower varargs and return/frame address intrinsics. llvm-svn: 19407	2005-01-09 00:00:49 +00:00
Chris Lattner	18d2b34637	Add support for llvm.setjmp and longjmp. Only 3 SingleSource/UnitTests fail now. llvm-svn: 19404	2005-01-08 22:48:57 +00:00
Chris Lattner	d006195517	Silence VS warnings. llvm-svn: 19384	2005-01-08 19:52:31 +00:00
Chris Lattner	1f45cd7418	Adjust to changes in LowerCAllTo interfaces llvm-svn: 19374	2005-01-08 19:26:18 +00:00
Chris Lattner	2a6db3c351	Add support for FP->INT conversions and back. llvm-svn: 19369	2005-01-08 08:08:56 +00:00
Chris Lattner	19a83990e1	Implement support for long GEP indices on 32-bit archs and support for int GEP indices on 64-bit archs. llvm-svn: 19354	2005-01-07 21:56:57 +00:00
Chris Lattner	8ea875fb05	Fix handling of dead PHI nodes. llvm-svn: 19349	2005-01-07 21:34:19 +00:00
Chris Lattner	7a60d91953	Initial implementation of the SelectionDAGISel class. This contains most of the code for lowering from LLVM code to a SelectionDAG. llvm-svn: 19331	2005-01-07 07:47:53 +00:00

... 10 11 12 13 14 ...

1100 Commits