llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	f47e30627a	Enhance the truncstore optimization code to handle shifted values and propagate demanded bits through them in simple cases. This allows this code: void foo(char *P) { strcpy(P, "abc"); } to compile to: _foo: ldrb r3, [r1] ldrb r2, [r1, #+1] ldrb r12, [r1, #+2]! ldrb r1, [r1, #+1] strb r1, [r0, #+3] strb r2, [r0, #+1] strb r12, [r0, #+2] strb r3, [r0] bx lr instead of: _foo: ldrb r3, [r1, #+3] ldrb r2, [r1, #+2] orr r3, r2, r3, lsl #8 ldrb r2, [r1, #+1] ldrb r1, [r1] orr r2, r1, r2, lsl #8 orr r3, r2, r3, lsl #16 strb r3, [r0] mov r2, r3, lsr #24 strb r2, [r0, #+3] mov r2, r3, lsr #16 strb r2, [r0, #+2] mov r3, r3, lsr #8 strb r3, [r0, #+1] bx lr testcase here: test/CodeGen/ARM/truncstore-dag-combine.ll This also helps occasionally for X86 and other cases not involving unaligned load/stores. llvm-svn: 42954	2007-10-13 06:58:48 +00:00
Chris Lattner	5e6fe054a2	Add a simple optimization to simplify the input to truncate and truncstore instructions, based on the knowledge that they don't demand the top bits. llvm-svn: 42952	2007-10-13 06:35:54 +00:00
Arnold Schwaighofer	1f0da1fefb	Corrected many typing errors. And removed 'nest' parameter handling for fastcc from X86CallingConv.td. This means that nested functions are not supported for calling convention 'fastcc'. llvm-svn: 42934	2007-10-12 21:30:57 +00:00
Dale Johannesen	61c574fc51	ppc long double. Implement fabs and fneg. llvm-svn: 42924	2007-10-12 19:02:17 +00:00
Dale Johannesen	a1a4a9ebfa	Implement i64->ppcf128 conversions. llvm-svn: 42919	2007-10-12 17:52:03 +00:00
Dan Gohman	e3583817ac	Fix some corner cases with vectors in copyToRegs and copyFromRegs. llvm-svn: 42907	2007-10-12 14:33:11 +00:00
Dan Gohman	4f056f3c10	Add support to SplitVectorOp for powi, where the second operand is a scalar integer. llvm-svn: 42906	2007-10-12 14:13:46 +00:00
Evan Cheng	aa2d6ef81d	EXTRACT_SUBREG coalescing support. The coalescer now treats EXTRACT_SUBREG like (almost) a register copy. However, it always coalesced to the register of the RHS (the super-register). All uses of the result of a EXTRACT_SUBREG are sub- register uses which adds subtle complications to load folding, spiller rewrite, etc. llvm-svn: 42899	2007-10-12 08:50:34 +00:00
Dale Johannesen	05ff9e8cda	PPC long double. Implement a couple more conversions. llvm-svn: 42888	2007-10-12 01:37:08 +00:00
Dan Gohman	be37007e64	Add intrinsics for sin, cos, and pow. These use llvm_anyfloat_ty, and so may be overloaded with vector types. And add a testcase for codegen for these. llvm-svn: 42885	2007-10-12 00:01:22 +00:00
Dan Gohman	2a7de41682	Codegen support for vector intrinsics. Factor out the code that expands the "nasty scalar code" for unrolling vectors into a separate routine, teach it how to handle mixed vector/scalar operands, as seen in powi, and use it for several operators, including sin, cos, powi, and pow. Add support in SplitVectorOp for fpow, fpowi and for several unary operators. llvm-svn: 42884	2007-10-11 23:57:53 +00:00
Dale Johannesen	6472eb63c2	Implement ppc long double->uint conversion. Make ppc long double constants print. llvm-svn: 42882	2007-10-11 23:32:15 +00:00
Dan Gohman	fd66486950	Add runtime library names for pow. llvm-svn: 42880	2007-10-11 23:09:10 +00:00
Dan Gohman	daee002438	Add an ISD::FPOW node type. llvm-svn: 42879	2007-10-11 23:06:37 +00:00
Arnold Schwaighofer	9ccea99165	Added tail call optimization to the x86 back end. It can be enabled by passing -tailcallopt to llc. The optimization is performed if the following conditions are satisfied: * caller/callee are fastcc * elf/pic is disabled OR elf/pic enabled + callee is in module + callee has visibility protected or hidden llvm-svn: 42870	2007-10-11 19:40:01 +00:00
Dale Johannesen	007aa378ad	Next PPC long double bits. First cut at constants. No compile-time support for constant operations yet, just format transformations. Make readers and writers work. Split constants into 2 doubles in Legalize. llvm-svn: 42865	2007-10-11 18:07:22 +00:00
Duncan Sands	56ab90d3ad	Correct swapped arguments to getConstant. llvm-svn: 42824	2007-10-10 09:54:50 +00:00
Dale Johannesen	666323eacd	Next PPC long double bits: ppcf128->i32 conversion. Surprisingly complicated. Adds getTargetNode for 2 outputs, no inputs (missing). llvm-svn: 42822	2007-10-10 01:01:31 +00:00
Dan Gohman	a160361c85	Migrate X86 and ARM from using X86ISD::{,I}DIV and ARMISD::MULHILO{U,S} to use ISD::{S,U}DIVREM and ISD::{S,U}MUL_HIO. Move the lowering code associated with these operators into target-independent in LegalizeDAG.cpp and TargetLowering.cpp. llvm-svn: 42762	2007-10-08 18:33:35 +00:00
Dan Gohman	5c6d0c3b99	DAGCombiner support for UDIVREM/SDIVREM and UMUL_LOHI/SMUL_LOHI. Check if one of the two results unneeded so see if a simpler operator could bs used. Also check to see if each of the two computations could be simplified if they were split into separate operators. Factor out the code that calls visit() so that it can be used for this purpose. llvm-svn: 42759	2007-10-08 17:57:15 +00:00
Dan Gohman	b08c8bfe41	Add convenience overloads of SelectionDAG::getNode that take a SDVTList and individual SDOperand operands. llvm-svn: 42753	2007-10-08 15:49:58 +00:00
Dan Gohman	fadf40a655	In -debug mode, dump SelectionDAGs both before and after the optimization passes. llvm-svn: 42749	2007-10-08 15:12:17 +00:00
Neil Booth	5f00973393	convertFromInteger, as originally written, expected sign-extended input. APInt unfortunately zero-extends signed integers, so Dale modified the function to expect zero-extended input. Make this assumption explicit in the function name. llvm-svn: 42732	2007-10-07 11:45:55 +00:00
Evan Cheng	0de312dd7d	Reapply 42677. llvm-svn: 42692	2007-10-06 08:19:55 +00:00
Chris Lattner	82217bd155	revert evan's patch until the header is committed llvm-svn: 42686	2007-10-06 06:08:17 +00:00
Evan Cheng	f4b5d491df	Added DAG xforms. e.g. (vextract (v4f32 s2v (f32 load $addr)), 0) -> (f32 load $addr) (vextract (v4i32 bc (v4f32 s2v (f32 load $addr))), 0) -> (i32 load $addr) Remove x86 specific patterns. llvm-svn: 42677	2007-10-06 02:46:29 +00:00
Dale Johannesen	f864ac96d8	Next powerpc long double bits. Comparisons work, although not well, and shortening FP converts. llvm-svn: 42672	2007-10-06 01:24:11 +00:00
Dale Johannesen	c0154c06d6	First round of ppc long double. call/return and basic arithmetic works. Rename RTLIB long double functions to distinguish different flavors of long double; the lib functions have different names, alas. llvm-svn: 42644	2007-10-05 20:04:43 +00:00
Dan Gohman	12334acbfb	Legalize support for MUL_LOHI and DIVREM. llvm-svn: 42636	2007-10-05 14:17:22 +00:00
Dan Gohman	2682bb6df2	Fix a typo in a comment. llvm-svn: 42635	2007-10-05 14:11:58 +00:00
Dan Gohman	1a77dfba15	Provide names for MUL_LOHI and DIVREM operators. llvm-svn: 42634	2007-10-05 14:11:04 +00:00
Evan Cheng	84d0ebc10a	Chain producing nodes cannot be moved, not chain reading nodes. llvm-svn: 42627	2007-10-05 01:42:35 +00:00
Evan Cheng	991cf47221	Oops. Didn't mean to leave this in. llvm-svn: 42626	2007-10-05 01:39:40 +00:00
Evan Cheng	79e9713b11	If a node that defines a physical register that is expensive to copy. The scheduler will try a number of tricks in order to avoid generating the copies. This may not be possible in case the node produces a chain value that prevent movement. Try unfolding the load from the node before to allow it to be moved / cloned. llvm-svn: 42625	2007-10-05 01:39:18 +00:00
Evan Cheng	4852303bdb	Add a variant of getTargetNode() that takes a vector of MVT::ValueType. llvm-svn: 42620	2007-10-05 01:10:49 +00:00
Evan Cheng	fd11ef4665	Silence a warning. llvm-svn: 42619	2007-10-05 01:09:32 +00:00
Dan Gohman	c731c97fac	Use empty() member functions when that's what's being tested for instead of comparing begin() and end(). llvm-svn: 42585	2007-10-03 19:26:29 +00:00
Dale Johannesen	4d4e77af8e	Rewrite sqrt and powi to use anyfloat. By popular demand. llvm-svn: 42537	2007-10-02 17:43:59 +00:00
Dale Johannesen	b6c05b1f90	Fix stride computations for long double arrays. llvm-svn: 42508	2007-10-01 23:08:35 +00:00
Evan Cheng	a3a67596f6	Remove simple scheduler. llvm-svn: 42499	2007-10-01 20:44:07 +00:00
Dale Johannesen	c0855f8a88	remove dup comment llvm-svn: 42486	2007-09-30 19:08:12 +00:00
Dale Johannesen	9150652b21	Constant fold int-to-long-double conversions; use APFloat for int-to-float/double; use round-to-nearest for these (implementation-defined, seems to match gcc). llvm-svn: 42484	2007-09-30 18:19:03 +00:00
Dan Gohman	a90183e7d1	Teach SplitVectorOp how to split INSERT_VECTOR_ELT. llvm-svn: 42457	2007-09-28 23:53:40 +00:00
Evan Cheng	a5e595d23a	If two instructions are both two-address code, favors (schedule closer to terminator) the one that has a CopyToReg use. This fixes 2006-05-11-InstrSched.ll with -new-cc-modeling-scheme. llvm-svn: 42453	2007-09-28 22:32:30 +00:00
Evan Cheng	f72693f36e	Remove a poor scheduling heuristic. llvm-svn: 42443	2007-09-28 19:37:35 +00:00
Evan Cheng	038dcc5136	Trim some unneeded fields. llvm-svn: 42442	2007-09-28 19:24:24 +00:00
Dale Johannesen	789b5a505b	Fix long double -> uint64 conversion. llvm-svn: 42440	2007-09-28 18:44:17 +00:00
Dale Johannesen	25a00a63eb	Add sqrt and powi intrinsics for long double. llvm-svn: 42423	2007-09-28 01:08:20 +00:00
Evan Cheng	e6f92253f5	Avoid inserting a live register more than once. llvm-svn: 42410	2007-09-27 18:46:06 +00:00
Evan Cheng	75439b3b78	Silence a compiler warning. llvm-svn: 42389	2007-09-27 07:35:39 +00:00
Evan Cheng	bde499be60	Boogs. llvm-svn: 42388	2007-09-27 07:29:27 +00:00
Evan Cheng	1ec79b41db	Be smarter about which node to force schedule. Reduce # of duplications + copies; Added statistics. llvm-svn: 42387	2007-09-27 07:09:03 +00:00
Evan Cheng	cfd5f82890	Backtracking only when it won't create a cycle. llvm-svn: 42384	2007-09-27 00:25:29 +00:00
Evan Cheng	8e136a9dc4	- Move getPhysicalRegisterRegClass() from ScheduleDAG to MRegisterInfo. - Added ability to emit cross class register copies to the BBRU scheduler. - More aggressive backtracking. llvm-svn: 42375	2007-09-26 21:36:17 +00:00
Dale Johannesen	b6d56401aa	Enable codegen for long double abs, sin, cos llvm-svn: 42368	2007-09-26 21:10:55 +00:00
Dale Johannesen	f04d37d3a9	Fix f80 UNDEF. llvm-svn: 42359	2007-09-26 17:26:49 +00:00
Evan Cheng	c1e4e3743b	Allow copyRegToReg to emit cross register classes copies. Tested with "make check"! llvm-svn: 42346	2007-09-26 06:25:56 +00:00
Dan Gohman	5e1a428344	Move the setOperationAction(ISD::DEBUG_LOC, MVT::Other, Expand) and the check to see if the assembler supports .loc from X86TargetLowering into the superclass TargetLowering. llvm-svn: 42297	2007-09-25 15:10:49 +00:00
Evan Cheng	5924bf7d3b	Added major new capabilities to scheduler (only BURR for now) to support physical register dependency. The BURR scheduler can now backtrace and duplicate instructions in order to avoid "expensive / impossible to copy" values (e.g. status flag EFLAGS for x86) from being clobbered. llvm-svn: 42284	2007-09-25 01:54:36 +00:00
Dan Gohman	6002818999	Use the correct result value type instead of using getValueType(0) in ExpandEXTRACT_VECTOR_ELT and SplitVectorOp. This fixes an abort in the included testcase. llvm-svn: 42264	2007-09-24 15:54:53 +00:00
Chris Lattner	10671ad650	initialize isstore/isload fields in ctor, fixing PR1695 llvm-svn: 42222	2007-09-22 07:02:12 +00:00
Dale Johannesen	4230512f32	Change APFloat::convertFromInteger to take the incoming bit width instead of number of words allocated, which makes it actually work for int->APF conversions. Adjust callers. Add const to one of the APInt constructors to prevent surprising match when called with const argument. llvm-svn: 42210	2007-09-21 22:09:37 +00:00
Chris Lattner	b3d01d2f56	initialize SetCCResultContents, fixing PR1693 llvm-svn: 42193	2007-09-21 17:06:39 +00:00
Dale Johannesen	7d67e547b5	More long double fixes. x86_64 should build now. llvm-svn: 42155	2007-09-19 23:55:34 +00:00
Dale Johannesen	b59d25fe54	Fix longdouble -> uint conversion. llvm-svn: 42143	2007-09-19 17:53:26 +00:00
Evan Cheng	0effc3a6b8	Use struct SDep instead of std::pair for SUnit pred and succ lists. First step in tracking physical register output dependencies. llvm-svn: 42125	2007-09-19 01:38:40 +00:00
Evan Cheng	e2e8f2d96b	Fix a bogus splat xform: shuffle <undef, undef, x, undef>, <undef, undef, undef, undef>, <2, 2, 2, 2> != <undef, undef, x, undef> llvm-svn: 42111	2007-09-18 21:54:37 +00:00
Dale Johannesen	af12b57405	Prevent crash on long double. llvm-svn: 42103	2007-09-18 18:36:59 +00:00
Devang Patel	00064e1bab	Do not hide APInt::dump() inside #ifndef NDEBUG. llvm-svn: 42068	2007-09-17 22:24:00 +00:00
Devang Patel	77ae4d358f	This is not ideal but unbreaks build failure. APInt::dump() is inside #ifndef NDEBUG, however SelectionDAG dump() routines are not. llvm-svn: 42047	2007-09-17 20:03:03 +00:00
Dale Johannesen	7f724e9b94	Adjust per revew comments. llvm-svn: 42002	2007-09-16 16:51:49 +00:00
Dale Johannesen	98d3a08d8f	Remove the assumption that FP's are either float or double from some of the many places in the optimizers it appears, and do something reasonable with x86 long double. Make APInt::dump() public, remove newline, use it to dump ConstantSDNode's. Allow APFloats in FoldingSet. Expand X86 backend handling of long doubles (conversions to/from int, mostly). llvm-svn: 41967	2007-09-14 22:26:36 +00:00
Chris Lattner	7955bbd9fd	Fix build problems on Cygwin (PR1652), patch by Patrick Walton. llvm-svn: 41923	2007-09-13 06:09:48 +00:00
Evan Cheng	100c8d6c8f	Bug fixes. llvm-svn: 41900	2007-09-13 00:06:00 +00:00
Evan Cheng	57ff158255	Remove dead code. llvm-svn: 41899	2007-09-12 23:45:46 +00:00
Evan Cheng	bb6a574def	Yet another getTargetNode variant. llvm-svn: 41898	2007-09-12 23:39:49 +00:00
Dale Johannesen	028084efe5	Revise previous patch per review comments. Next round of x87 long double stuff. Getting close now, basically works. llvm-svn: 41875	2007-09-12 03:30:33 +00:00
Dale Johannesen	245dceb06d	Add APInt interfaces to APFloat (allows directly access to bits). Use them in place of float and double interfaces where appropriate. First bits of x86 long double constants handling (untested, probably does not work). llvm-svn: 41858	2007-09-11 18:32:33 +00:00
Duncan Sands	86e0119822	Fold the adjust_trampoline intrinsic into init_trampoline. There is now only one trampoline intrinsic. llvm-svn: 41841	2007-09-11 14:10:23 +00:00
Chris Lattner	58c227bd09	Emit: cmpl %eax, %ecx setae %al movzbl %al, %eax instead of: cmpl %eax, %ecx setb %al xorb $1, %al movzbl %al, %eax when using logical not of a C comparison. llvm-svn: 41807	2007-09-10 21:39:07 +00:00
Chris Lattner	33a7f51412	1. Don't call Value::getName(), which is slow. 2. Lower calls to fabs and friends to FABS nodes etc unless the function has internal linkage. Before we wouldn't lower if it had a definition, which is incorrect. This allows us to compile: define double @fabs(double %f) { %tmp2 = tail call double @fabs( double %f ) ret double %tmp2 } into: _fabs: fabs f1, f1 blr llvm-svn: 41805	2007-09-10 21:15:22 +00:00
Dale Johannesen	29e6ac4281	Implement misaligned FP loads and stores. llvm-svn: 41786	2007-09-08 19:29:23 +00:00
Rafael Espindola	1de0c86717	Add support for having different alignment for objects on call frames. The x86-64 ABI states that objects passed on the stack have 8 byte alignment. Implement that. llvm-svn: 41768	2007-09-07 14:52:14 +00:00
Anton Korobeynikov	122bf4be7e	Split eh.select / eh.typeid.for intrinsics into i32/i64 versions. This is needed, because they just "mark" register liveins and we let frontend solve type issue, not lowering code :) llvm-svn: 41763	2007-09-07 11:39:35 +00:00
Owen Anderson	e2f23a3abf	Add lengthof and endof templates that hide a lot of sizeof computations. Patch by Sterling Stein! llvm-svn: 41758	2007-09-07 04:06:50 +00:00
Dale Johannesen	bed9dc423c	Next round of APFloat changes. Use APFloat in UpgradeParser and AsmParser. Change all references to ConstantFP to use the APFloat interface rather than double. Remove the ConstantFP double interfaces. Use APFloat functions for constant folding arithmetic and comparisons. (There are still way too many places APFloat is just a wrapper around host float/double, but we're getting there.) llvm-svn: 41747	2007-09-06 18:13:44 +00:00
Duncan Sands	3c1b7fc056	Fix PR1628. When exception handling is turned on, labels are generated bracketing each call (not just invokes). This is used to generate entries in the exception table required by the C++ personality. However it gets in the way of tail-merging. This patch solves the problem by no longer placing labels around ordinary calls. Instead we generate entries in the exception table that cover every instruction in the function that wasn't covered by an invoke range (the range given by the labels around the invoke). As an optimization, such entries are only generated for parts of the function that contain a call, since for the moment those are the only instructions that can throw an exception [1]. As a happy consequence, we now get a smaller exception table, since the same region can cover many calls. While there, I also implemented folding of invoke ranges - successive ranges are merged when safe to do so. Finally, if a selector contains only a cleanup, there's a special shorthand for it - place a 0 in the call-site entry. I implemented this while there. As a result, the exception table output (excluding filters) is now optimal - it cannot be made smaller [2]. The problem with throw filters is that folding them optimally is hard, and the benefit of folding them is minimal. [1] I tested that having trapping instructions (eg divide by zero) in such a region doesn't cause trouble. [2] It could be made smaller with the help of higher layers, eg by having branch folding reorder basic blocks ending in invokes with the same landing pad so they follow each other. I don't know if this is worth doing. llvm-svn: 41718	2007-09-05 11:27:52 +00:00
Evan Cheng	e0cb6bb8da	Fix for PR1632. EHSELECTION always produces a i32 value. llvm-svn: 41712	2007-09-04 20:39:26 +00:00
Dale Johannesen	446b900192	Add mod, copysign, abs operations to APFloat. Implement some constant folding in SelectionDAG and DAGCombiner using APFloat. Remove double versions of constructor and getValue from ConstantFPSDNode. llvm-svn: 41664	2007-08-31 23:34:27 +00:00
Dale Johannesen	da7469f2b5	Revise per review of previous patch. llvm-svn: 41645	2007-08-31 17:03:33 +00:00
Dale Johannesen	3cf889f75e	Enhance APFloat to retain bits of NaNs (fixes oggenc). Use APFloat interfaces for more references, mostly of ConstantFPSDNode. llvm-svn: 41632	2007-08-31 04:03:46 +00:00
Dale Johannesen	d246b2ca5c	Change LegalFPImmediates to use APFloat. Add APFloat interfaces to ConstantFP, SelectionDAG. Fix integer bit in double->APFloat conversion. Convert LegalizeDAG to use APFloat interface in ConstantFPSDNode uses. llvm-svn: 41587	2007-08-30 00:23:21 +00:00
Anton Korobeynikov	2bdec2a5ee	Fix use of declaration inside case block llvm-svn: 41584	2007-08-29 23:18:48 +00:00
Anton Korobeynikov	830b1cb4e9	Lower FRAME_TO_ADDR_OFFSET to zero by default (if not custom lowered) llvm-svn: 41578	2007-08-29 19:28:29 +00:00
Dan Gohman	81b62e1218	Add an option, -view-sunit-dags, for viewing the actual SUnit DAGs used by scheduling. llvm-svn: 41556	2007-08-28 20:32:58 +00:00
Dan Gohman	9625d812c9	Make DAGCombiner's global alias analysis query more precise in the case where both pointers have non-zero offsets. llvm-svn: 41491	2007-08-27 16:32:11 +00:00
Dan Gohman	8dc0b93151	If the source and destination pointers in an llvm.memmove are known to not alias each other, it can be translated as an llvm.memcpy. llvm-svn: 41489	2007-08-27 16:26:13 +00:00
Duncan Sands	ef5a654216	There is an impedance matching problem between LLVM and gcc exception handling: if an exception unwinds through an invoke, then execution must branch to the invoke's unwind target. We previously tried to enforce this by appending a cleanup action to every selector, however this does not always work correctly due to an optimization in the C++ unwinding runtime: if only cleanups would be run while unwinding an exception, then the program just terminates without actually executing the cleanups, as invoke semantics would require. I was hoping this wouldn't be a problem, but in fact it turns out to be the cause of all the remaining failures in the LLVM testsuite (these also fail with -enable-correct-eh-support, so turning on -enable-eh didn't make things worse!). Instead we need to append a full-blown catch-all to the end of each selector. The correct way of doing this depends on the personality function, i.e. it is language dependent, so can only be done by gcc. Thus this patch which generalizes the eh.selector intrinsic so that it can handle all possible kinds of action table entries (before it didn't accomodate cleanups): now 0 indicates a cleanup, and filters have to be specified using the number of type infos plus one rather than the number of type infos. Related gcc patches will cause Ada to pass a cleanup (0) to force the selector to always fire, while C++ will use a C++ catch-all (null). llvm-svn: 41484	2007-08-27 15:47:50 +00:00
Dale Johannesen	b6d2bec418	Revise per review comments. llvm-svn: 41409	2007-08-26 01:18:27 +00:00
Dale Johannesen	2cfcf70f82	Add APFloat interface to ConstantFPSDNode. Change over uses in DAGCombiner. Fix interfaces to work with APFloats. llvm-svn: 41407	2007-08-25 22:10:57 +00:00
Chris Lattner	2ed652f11d	Allow target constants to be illegal types. The target should know how to handle them. This fixes test/CodeGen/Generic/asm-large-immediate.ll llvm-svn: 41388	2007-08-25 01:00:22 +00:00
Chris Lattner	dbfc4e4b07	Teach the dag scheduler to handle inline asm nodes with multi-value immediate operands. llvm-svn: 41386	2007-08-25 00:53:07 +00:00
Chris Lattner	d8c9cb9182	rename isOperandValidForConstraint to LowerAsmOperandForConstraint, changing the interface to allow for future changes. llvm-svn: 41384	2007-08-25 00:47:38 +00:00
Dale Johannesen	bdea32d812	Poison APFloat::operator==. Replace existing uses with bitwiseIsEqual. This means backing out the preceding change to Constants.cpp, alas. llvm-svn: 41378	2007-08-24 22:09:56 +00:00
Dale Johannesen	7891d8edf0	Use APFloat internally for ConstantFPSDNode. llvm-svn: 41372	2007-08-24 20:59:15 +00:00
Anton Korobeynikov	97cdac8d19	Perform correct codegen for eh_dwarf_cfa intrinsic. llvm-svn: 41316	2007-08-23 07:21:06 +00:00
Dan Gohman	54a187ea8b	Minor cleanups to reduce some spurious differences between different scheduler implementations. llvm-svn: 41191	2007-08-20 19:28:38 +00:00
Rafael Espindola	9c3d20d823	Partial implementation of calling functions with byval arguments: ) The needed information is propagated to the DAG ) The X86-64 backend detects it and aborts llvm-svn: 41179	2007-08-20 15:18:24 +00:00
Evan Cheng	f5a23abf37	Fold C ? 0 : 1 to ~C or zext(~C) or trunc(~C) depending the types. llvm-svn: 41163	2007-08-18 05:57:05 +00:00
Evan Cheng	cb6d65e1bf	Avoid issue on 64-bit hosts. llvm-svn: 41143	2007-08-17 18:02:22 +00:00
David Greene	81db5acab0	Fix GLIBCXX_DEBUG error of comparing two singular iterators llvm-svn: 41139	2007-08-17 15:13:55 +00:00
Evan Cheng	631ccc6144	If dynamic_stackalloc alignment is > stack alignment, first issue an instruction to align the stack ptr before the decrement. llvm-svn: 41133	2007-08-16 23:50:06 +00:00
Evan Cheng	95667c532c	- If a dynamic_stackalloc alignment requirement is <= stack alignment, then the alignment argument is ignored. - Always round up the size of the allocation to multiples of stack alignment to ensure the stack ptr is never left in an invalid state after a dynamic_stackalloc. llvm-svn: 41132	2007-08-16 23:46:29 +00:00
Lauro Ramos Venancio	a392cd2fde	Implement FPOWI ExpandOp. Fix PR1287. llvm-svn: 41112	2007-08-15 22:13:27 +00:00
Dan Gohman	a17799a3bd	Fix EXTRACT_ELEMENT, EXTRACT_SUBVECTOR, and EXTRACT_VECTOR_ELT to use an intptr ValueType instead of i32 for the index operand in getCopyToParts. llvm-svn: 40987	2007-08-10 14:59:38 +00:00
Rafael Espindola	66011c17d5	propagate struct size and alignment of byval arguments to the DAG llvm-svn: 40986	2007-08-10 14:44:42 +00:00
Dale Johannesen	c339e45274	Update per review comments. llvm-svn: 40965	2007-08-09 17:27:48 +00:00
Dale Johannesen	ba1a98a4e0	long double 9 of N. This finishes up the X86-32 bits (constants are still not handled). Adds ConvertActions to control fp-to-fp conversions (these are currently defaulted for all other targets, so no changes there). llvm-svn: 40958	2007-08-09 01:04:01 +00:00
Scott Michel	9d09c5ccda	If a target really needs to custom lower constants, it should be allowed to do so. llvm-svn: 40955	2007-08-08 23:23:31 +00:00
Chandler Carruth	7132e00de7	This is the patch to provide clean intrinsic function overloading support in LLVM. It cleans up the intrinsic definitions and generally smooths the process for more complicated intrinsic writing. It will be used by the upcoming atomic intrinsics as well as vector and float intrinsics in the future. This also changes the syntax for llvm.bswap, llvm.part.set, llvm.part.select, and llvm.ct* intrinsics. They are automatically upgraded by both the LLVM ASM reader and the bitcode reader. The test cases have been updated, with special tests added to ensure the automatic upgrading is supported. llvm-svn: 40807	2007-08-04 01:51:18 +00:00
Chris Lattner	3ffe7187db	don't redefine a parameter llvm-svn: 40748	2007-08-02 18:08:16 +00:00
Evan Cheng	358c3d1dac	Do not emit copies for physical register output if it's not used. llvm-svn: 40722	2007-08-02 05:29:38 +00:00
Scott Michel	5b80ecbcf5	Style police: Expand the tabs to spaces! llvm-svn: 40712	2007-08-02 02:22:46 +00:00
Evan Cheng	c5549fc3a0	Instead of adding copyfromreg's to handle physical definitions. Now isel can simply specify them as results and let scheduledag handle them. That is, instead of SDOperand Flag = DAG.getTargetNode(Opc, MVT::i32, MVT::Flag, ...) SDOperand Result = DAG.getCopyFromReg(Chain, X86::EAX, MVT::i32, Flag) Just write: SDOperand Result = DAG.getTargetNode(Opc, MVT::i32, MVT::i32, ...) And let scheduledag emit the move from X86::EAX to a virtual register. llvm-svn: 40710	2007-08-02 00:28:15 +00:00
Lauro Ramos Venancio	0db4418a5f	Expand unaligned loads/stores when the target doesn't support them. (PR1548) llvm-svn: 40682	2007-08-01 19:34:21 +00:00
Scott Michel	34e2d22d63	- Allow custom lowering for CTPOP, CTTZ, CTLZ. - Fixed an existing unexpanded tab. llvm-svn: 40605	2007-07-30 21:00:31 +00:00
Dan Gohman	4ff9fb14f6	Fix a bug in getCopyFromParts turned up in the testcase for PR1132. llvm-svn: 40598	2007-07-30 19:09:17 +00:00
Duncan Sands	644f917358	Support for trampolines, except for X86 codegen which is still under discussion. llvm-svn: 40549	2007-07-27 12:58:54 +00:00
Dan Gohman	30f060be80	Fix the alias analysis query in DAGCombiner to not add in two offsets. The SrcValueOffset values are the real offsets from the SrcValue base pointers. llvm-svn: 40534	2007-07-26 16:14:06 +00:00
Christopher Lamb	18603b03e1	Teach DAG scheduling how to properly emit subreg insert/extract machine instructions. PR1350 llvm-svn: 40520	2007-07-26 08:12:07 +00:00
Christopher Lamb	a8fc0e527b	Add selection DAG nodes for subreg insert/extract. PR1350 llvm-svn: 40516	2007-07-26 07:34:40 +00:00
Christopher Lamb	3fead96121	Fix infinite recursion for when extract_vector_elt is legal. Unfortunately no public targets use this code-path, so no test. llvm-svn: 40510	2007-07-26 03:33:13 +00:00
Dan Gohman	f0bb12848f	Add const to CanBeFoldedBy, CheckAndMask, and CheckOrMask. llvm-svn: 40480	2007-07-24 23:00:27 +00:00
Dan Gohman	b6a8ae20c7	Fix some uses of dyn_cast to be uses of cast. llvm-svn: 40443	2007-07-23 20:24:29 +00:00
Duncan Sands	85ec2af554	As pointed out by g++-4.2, the original code didn't do what it thought it was doing. llvm-svn: 40044	2007-07-19 07:31:58 +00:00
Dan Gohman	a7b65c30a3	It's not necessary to do rounding for alloca operations when the requested alignment is equal to the stack alignment. llvm-svn: 40004	2007-07-18 16:29:46 +00:00
Dan Gohman	06c60b6032	Fix comments about vectors to use the current wording. llvm-svn: 39921	2007-07-16 14:29:03 +00:00
Nick Lewycky	d20f485866	Fix the build. Patch from Holger Schurig. llvm-svn: 39856	2007-07-14 15:11:14 +00:00
Anton Korobeynikov	383a324735	Long live the exception handling! This patch fills the last necessary bits to enable exceptions handling in LLVM. Currently only on x86-32/linux. In fact, this patch adds necessary intrinsics (and their lowering) which represent really weird target-specific gcc builtins used inside unwinder. After corresponding llvm-gcc patch will land (easy) exceptions should be more or less workable. However, exceptions handling support should not be thought as 'finished': I expect many small and not so small glitches everywhere. llvm-svn: 39855	2007-07-14 14:06:15 +00:00
Dan Gohman	ff72788863	Fix the comment for LegalizeOp to more accurately reflect what it does. llvm-svn: 39827	2007-07-13 20:14:11 +00:00
Dan Gohman	80f9f077e3	Don't call SimplifyVBinOp for non-vector operations, following earlier review feedback. This theoretically makes the common (scalar) case more efficient. llvm-svn: 39823	2007-07-13 20:03:40 +00:00
Dale Johannesen	2182f06f2d	Skeleton of post-RA scheduler; doesn't do anything yet. Change name of -sched option and DEBUG_TYPE to pre-RA-sched; adjust testcases. llvm-svn: 39816	2007-07-13 17:13:54 +00:00
Dan Gohman	60d6f96da3	Change the peep for EXTRACT_VECTOR_ELT of BUILD_PAIR to look for the new CONCAT_VECTORS node type instead, as that's what legalize uses now. And add a peep for EXTRACT_VECTOR_ELT of INSERT_VECTOR_ELT. llvm-svn: 38503	2007-07-10 18:20:44 +00:00
Evan Cheng	5e9084207f	If the operand is marked M_OPTIONAL_DEF_OPERAND, then it's a def. llvm-svn: 38496	2007-07-10 17:52:20 +00:00
Dan Gohman	adb3d37c07	Fix a bug in the folding of binary operators to undef. Thanks to Lauro for spotting this! llvm-svn: 38491	2007-07-10 15:19:29 +00:00
Dan Gohman	fa91282dbf	Fix the folding of undef in several binary operators to recognize undef in either the left or right operand. llvm-svn: 38489	2007-07-10 14:20:37 +00:00
Evan Cheng	ff6f279adf	When a node value is only used by a CopyToReg, use the user's dest. This should not be restricted to nodes that produce only a single value. llvm-svn: 38485	2007-07-10 07:08:32 +00:00
Evan Cheng	32aad49b24	Move DenseMapKeyInfo<SDOperand> from LegalizeDAG.cpp to SelectionDAGNodes.h llvm-svn: 38484	2007-07-10 06:59:55 +00:00
Dan Gohman	2af3063337	Preserve volatililty and alignment information when lowering or simplifying loads and stores. llvm-svn: 38473	2007-07-09 22:18:38 +00:00
Dan Gohman	f8f531bf69	Change getCopyToParts and getCopyFromParts to always use target-endian register ordering, for both physical and virtual registers. Update the PPC target lowering for calls to expect registers for the call result to already be in target order. llvm-svn: 38471	2007-07-09 20:59:04 +00:00
Dan Gohman	6decfbf133	Initialize the IndexedModeActions array with memset before updating it with calls to setIndexedLoadAction/setIndexedStoreAction, which only update a few bits at a time. This avoids ostensible undefined behavior of operationg on values which may be trap-representations, and as a practical matter fixes errors from valgrind, which doesn't track uninitialized memory with bit granularity. llvm-svn: 38468	2007-07-09 20:49:44 +00:00
Chris Lattner	6caf8fdd04	Fix this warning: DAGCombiner.cpp: In member function 'llvm::SDOperand<unnamed>::DAGCombiner::visitOR(llvm::SDNode*)': DAGCombiner.cpp:1608: warning: passing negative value '-0x00000000000000001' for argument 1 to 'llvm::SDOperand llvm::SelectionDAG::getConstant(uint64_t, llvm::MVT::ValueType, bool)' oiy. llvm-svn: 38458	2007-07-09 16:16:34 +00:00
Duncan Sands	9d97420473	The exception handling intrinsics return values, so must be lowered to a value, not nothing at all. Subtle point: I made eh_selector return 0 and eh_typeid_for return 1. This means that only cleanups (destructors) will be run as the exception unwinds [if eh_typeid_for returned 0 then it would be as if the first catch always matched, and the corresponding handler would be run], which is probably want you want in the CBE. llvm-svn: 37947	2007-07-06 14:46:23 +00:00
Rafael Espindola	b567e3ffb0	Add the byval attribute llvm-svn: 37940	2007-07-06 10:57:03 +00:00
Duncan Sands	003c0b1f90	Remove propagateEHRegister in favour of a more limited fix, that is adequate while PR1508 remains unresolved. llvm-svn: 37938	2007-07-06 09:18:59 +00:00
Duncan Sands	81df18a50a	Remove ExtractGlobalVariable - use StripPointerCasts instead. llvm-svn: 37937	2007-07-06 09:10:03 +00:00
Evan Cheng	fc7010d962	Workaround of getCopyToRegs and getCopyFromRegs bugs for big-endian machines. llvm-svn: 37935	2007-07-06 01:47:35 +00:00
Evan Cheng	642be16bbf	Change CalculateHeights and CalculateDepths to be non-recursive. llvm-svn: 37934	2007-07-06 01:37:28 +00:00
Dan Gohman	a282694acf	Make the debug string for ISD::MERGE_VALUES consistent with the others. llvm-svn: 37922	2007-07-05 20:15:43 +00:00
Dan Gohman	d258e80583	Add a parameter to getCopyToParts and getCopyFromParts to specify whether endian swapping should be done, and update the code to use it. This fixes some register ordering issues on big-endian systems, such as PowerPC, introduced by the recent illegal by-val arguments changes. llvm-svn: 37921	2007-07-05 20:12:34 +00:00
Duncan Sands	fe80638417	Extend eh.selector to support both catches and filters. Drop the eh.filter intrinsic. llvm-svn: 37875	2007-07-04 20:52:51 +00:00
Dan Gohman	06563a8702	Fix several over-aggressive folds for undef nodes in dagcombine, to follow the rules for undef used in instcombine. llvm-svn: 37851	2007-07-03 14:03:57 +00:00
Dale Johannesen	a2b3c175db	Fix for PR 1505 (and 1489). Rewrite X87 register model to include f32 variants. Some factoring improvments forthcoming. llvm-svn: 37847	2007-07-03 00:53:03 +00:00
Dan Gohman	533dd16a7f	Replace ExpandScalarFormalArgs and ExpandScalarCallArgs with the newly refactored getCopyFromParts and getCopyToParts, which are more general. This effectively adds support for lowering illegal by-val vector call arguments. llvm-svn: 37843	2007-07-02 16:18:06 +00:00
Dan Gohman	9a70823375	Teach GetNegatedExpression to negate 0-B to B in UnsafeFPMath mode, and visitFSUB to fold 0-B to -B in UnsafeFPMath mode. Also change visitFNEG to use isNegatibleForFree/GetNegatedExpression instead of doing a subset of the same thing manually. This fixes test/CodeGen/X86/negative-sin.ll. llvm-svn: 37842	2007-07-02 15:48:56 +00:00
Evan Cheng	fa68d069ad	Only do FNEG xform when the vector type is a floating point type. llvm-svn: 37818	2007-06-29 21:44:35 +00:00
David Greene	cf2a51e8db	Remove unused variables. llvm-svn: 37816	2007-06-29 21:42:03 +00:00
Evan Cheng	9458e6a551	Fix a vector FP constant CSE bug. llvm-svn: 37814	2007-06-29 21:36:04 +00:00
David Greene	4c1e6f3804	Remove unnecessary attributions in comments. llvm-svn: 37799	2007-06-29 03:42:23 +00:00
David Greene	9468bfd932	Fix reference to cached end iterator invalidated by an erase operation. Uncovered by _GLIBCXX_DEBUG. llvm-svn: 37795	2007-06-29 02:49:11 +00:00
David Greene	5b6f755575	Remove the "special tie breaker" because it resulted in inconsistent ordering and thus violated the strict weak ordering requirement of priority_queue. Uncovered by _GLIBCXX_DEBUG. llvm-svn: 37794	2007-06-29 02:48:09 +00:00
Dan Gohman	0de7694de6	Fix an assertion failure in legalizing bitcast operators on targets where vectors are split down to single elements as part of legalization. llvm-svn: 37785	2007-06-29 00:09:08 +00:00
Dan Gohman	7867793aff	Add new TargetLowering code to provide the final register type that an illegal value type will be transformed to, for code that needs the register type after all transformations instead of just after the first transformation. Factor out the code that uses this information to do copy-from-regs and copy-to-regs for various purposes into separate functions so that they are done consistently. llvm-svn: 37781	2007-06-28 23:29:44 +00:00
Evan Cheng	77f541ddfd	Partial fix for PR1502: If a EH register is needed in a successor of landing pad, add it as livein to all the blocks in the paths between the landing pad and the specified block. llvm-svn: 37763	2007-06-27 18:45:32 +00:00
Dan Gohman	3b62d7265d	Rename ("shrinkify") MVT::isExtendedValueType to MVT::isExtendedVT. llvm-svn: 37758	2007-06-27 16:08:04 +00:00
Dan Gohman	7139a48057	Use getVectorTypeBreakdown in FunctionLoweringInfo::CreateRegForValue to compute the number and type of registers needed for vector values instead of computing it manually. This fixes PR1529. llvm-svn: 37755	2007-06-27 14:34:07 +00:00
Dan Gohman	f4e86da3a6	Make the comment for ScalarizeVectorOp mention that it is only for use with single-element vectors. llvm-svn: 37752	2007-06-27 14:06:22 +00:00
Dan Gohman	a866514528	Generalize MVT::ValueType and associated functions to be able to represent extended vector types. Remove the special SDNode opcodes used for pre-legalize vector operations, and the special MVT::Vector type used with them. Adjust lowering and legalize to work with the normal SDNode kinds instead, and to use the normal MVT functions to work with vector types instead of using the two special operands that the pre-legalize nodes held. This allows pre-legalize and post-legalize DAGs, and the code that operates on them, to be more consistent. Pre-legalize vector operators can be handled more consistently with scalar operators. And, -view-dag-combine1-dags and -view-legalize-dags now look prettier for vector code. llvm-svn: 37719	2007-06-25 16:23:39 +00:00
Dan Gohman	309d3d51b3	Move ComputeMaskedBits, MaskedValueIsZero, and ComputeNumSignBits from TargetLowering to SelectionDAG so that they have more convenient access to the current DAG, in preparation for the ValueType routines being changed from standalone functions to members of SelectionDAG for the pre-legalize vector type changes. llvm-svn: 37704	2007-06-22 14:59:07 +00:00
Evan Cheng	e3c4419953	std::set is really really terrible. Switch to SmallPtrSet to reduce compile time. For Duraid's example. The overall isel time is reduced from 0.6255 sec to 0.1876 sec. llvm-svn: 37701	2007-06-22 01:35:51 +00:00
Dan Gohman	8e8d34b220	Tidy up ValueType names in comments. llvm-svn: 37688	2007-06-21 14:48:26 +00:00
Dan Gohman	04deef3a49	Rename TargetLowering::getNumElements and friends to TargetLowering::getNumRegisters and similar, to avoid confusion with the actual number of elements for vector types. llvm-svn: 37687	2007-06-21 14:42:22 +00:00
Evan Cheng	aa5f5d960d	Xforms: (add (select cc, 0, c), x) -> (select cc, x, (add, x, c)) (sub x, (select cc, 0, c)) -> (select cc, x, (sub, x, c)) llvm-svn: 37685	2007-06-21 07:39:16 +00:00
Dan Gohman	a7644dd9b9	Pass a SelectionDAG into SDNode::dump everywhere it's used, in prepration for needing the DAG node to print pre-legalize extended value types, and to get better debug messages with target-specific nodes. llvm-svn: 37656	2007-06-19 14:13:56 +00:00
Chris Lattner	26be02febf	add isVarArg to CCState llvm-svn: 37640	2007-06-19 00:11:09 +00:00
Chris Lattner	e31adc8ab9	make ComputeTopDownOrdering significantly faster and use less stack space by making it non-recursive llvm-svn: 37629	2007-06-18 21:28:10 +00:00
Dan Gohman	8c7333266c	Make chain dependencies blue, in addition to being dashed. llvm-svn: 37626	2007-06-18 15:30:16 +00:00
Tanya Lattner	e199f97fa8	Codegen support (stripped out) for the annotate attribute. llvm-svn: 37608	2007-06-15 22:26:58 +00:00
Chris Lattner	f852e339b6	Fix CodeGen/X86/inline-asm-x-scalar.ll:test4, by retaining regclass info for tied register constraints. llvm-svn: 37601	2007-06-15 19:11:01 +00:00
Duncan Sands	92bf2c628c	Workaround for PR1508. llvm-svn: 37597	2007-06-15 19:04:19 +00:00
Dan Gohman	5c4413120f	Rename MVT::getVectorBaseType to MVT::getVectorElementType. llvm-svn: 37579	2007-06-14 22:58:02 +00:00
Duncan Sands	7413736a7e	Only correctly lower exception handing intrinsics if exception handling is turned on. Likewise for scanning of invokes to mark landing pads. llvm-svn: 37570	2007-06-13 16:53:21 +00:00
Dan Gohman	26455c4ae0	Introduce new SelectionDAG node opcodes VEXTRACT_SUBVECTOR and VCONCAT_VECTORS. Use these for CopyToReg and CopyFromReg legalizing in the case that the full register is to be split into subvectors instead of scalars. This replaces uses of VBIT_CONVERT to present values as vector-of-vector types in order to make whole subvectors accessible via BUILD_VECTOR and EXTRACT_VECTOR_ELT. This is in preparation for adding extended ValueType values, where having vector-of-vector types is undesirable. llvm-svn: 37569	2007-06-13 15:12:02 +00:00
Dan Gohman	cbd51c8b60	When creating CopyFromReg nodes, always use legal types. And use the correct types for the result vector, even though it is currently bitcasted to a different type immediately. llvm-svn: 37568	2007-06-13 14:55:16 +00:00
Duncan Sands	97f7236e70	The fix that was applied for PR1224 stops the compiler crashing but breaks exception handling. The problem described in PR1224 is that invoke is a terminator that can produce a value. The value may be needed in other blocks. The code that writes to registers values needed in other blocks runs before terminators are lowered (in this case invoke) so asserted because the value was not yet available. The fix that was applied was to do invoke lowering earlier, before writing values to registers. The problem this causes is that the code to copy values to registers can be output after the invoke call. If an exception is raised and control is passed to the landing pad then this copy-code will never execute. If the value is needed in some code path reached via the landing pad then that code will get something bogus. So revert the original fix and simply skip invoke values in the general copying to registers code. Instead copy the invoke value to a register in the invoke lowering code. llvm-svn: 37567	2007-06-13 05:51:31 +00:00
Dale Johannesen	9a4d987a5f	Do not change the size of function arguments. PR 1489. llvm-svn: 37496	2007-06-07 21:07:15 +00:00
Duncan Sands	61166501a1	Additional fix for PR1422: make sure the landing pad label is placed in the correct machine basic block - do not rely on the eh.exception intrinsic being in the landing pad: the loop optimizers can move it out. llvm-svn: 37463	2007-06-06 10:05:18 +00:00
Dan Gohman	b4c2690446	Pass the DAG to SDNode::dump to let it do more detailed dumps in some cases. llvm-svn: 37413	2007-06-04 16:17:33 +00:00
Dan Gohman	92a7f3a65e	Resolve implicit alignment before computing the FoldingSet information so that the CSE map always contains explicit alignment information. This allows more loads to be CSE'd when there is a mix of explicit-alignment loads and implicit-alignment loads. Also, in SelectionDAG::FindModifiedNodeSlot, add the operands to the FoldingSetNodeID before the load/store information instead of after, so that it matches what is done elsewhere. llvm-svn: 37411	2007-06-04 15:49:41 +00:00
Duncan Sands	c063f5f362	Integrate exception filter support and exception catch support. This simplifies the code in DwarfWriter, allows for multiple filters and makes it trivial to specify filters accompanied by cleanups or catch-all specifications (see next patch). What a deal! Patch blessed by Anton. llvm-svn: 37398	2007-06-02 16:53:42 +00:00
Duncan Sands	706421e712	Since TypeInfos are passed as i8 pointers, a NULL TypeInfo should be passed as a null i8 pointer not as a 0 i32. llvm-svn: 37383	2007-06-01 08:18:30 +00:00
Chris Lattner	397c4d9ef6	Fix CodeGen/PowerPC/2007-05-30-dagcombine-miscomp.ll, and PR1473. llvm-svn: 37362	2007-05-30 16:30:06 +00:00
Chris Lattner	4698083b96	tighten up recursion depth again llvm-svn: 37330	2007-05-25 02:19:06 +00:00
Dan Gohman	30978078bf	Minor comment cleanups. llvm-svn: 37321	2007-05-24 14:36:04 +00:00
Dan Gohman	703e0f8608	Add explicit qualification for namespace MVT members. llvm-svn: 37320	2007-05-24 14:33:05 +00:00
Evan Cheng	a4d187b8ce	Fix a typo that caused combiner to create mal-formed pre-indexed store where value store is the same as the base pointer. llvm-svn: 37318	2007-05-24 02:35:39 +00:00
Anton Korobeynikov	3b327826db	Mark all calls as "could throw", when exceptions are enabled. Emit necessary LP info too. This fixes PR1439 llvm-svn: 37311	2007-05-23 11:08:31 +00:00
Chris Lattner	6509c0673f	prevent exponential recursion in isNegatibleForFree llvm-svn: 37310	2007-05-23 07:35:22 +00:00
Chris Lattner	1fa8276e70	same patch as the previous one, but the symmetric case llvm-svn: 37249	2007-05-19 00:46:51 +00:00
Chris Lattner	b08cbbd737	Disable the (A == (B-A)) -> 2*A == B xform when the sub has multiple uses (in this case, the xform introduces an extra operation). This compiles PowerPC/compare-duplicate.ll into: _test: subf r2, r3, r4 cmplw cr0, r2, r3 bne cr0, LBB1_2 ;F instead of: _test: slwi r2, r3, 1 subf r3, r3, r4 cmplw cr0, r4, r2 bne cr0, LBB1_2 ;F This is target independent of course. llvm-svn: 37246	2007-05-19 00:43:44 +00:00
Dan Gohman	b539df3389	Qualify calls to getTypeForValueType with MVT:: too. llvm-svn: 37233	2007-05-18 18:41:29 +00:00
Dan Gohman	1796f1f8e9	Qualify several calls to functions in the MVT namespace, for consistency. llvm-svn: 37230	2007-05-18 17:52:13 +00:00
Chris Lattner	0184f88deb	disable MaskedValueIsZero, ComputeMaskedBits, and SimplifyDemandedBits for i128 integers. The 64-bit masks are not wide enough to represent the results. These should be converted to APInt someday. llvm-svn: 37169	2007-05-17 18:19:23 +00:00
Chris Lattner	2135bc08d6	add expand support for ADDC/SUBC/ADDE/SUBE so we can codegen 128-bit add/sub on 32-bit (or less) targets llvm-svn: 37168	2007-05-17 18:15:41 +00:00
Evan Cheng	429178d727	Add target hook to specify block size limit for if-conversion. llvm-svn: 37134	2007-05-16 23:45:53 +00:00
Dale Johannesen	7a6c175e7a	Don't fold bitconvert(load) for preinc/postdec loads. Likewise stores. llvm-svn: 37130	2007-05-16 22:45:30 +00:00
Chris Lattner	48fb92f75d	Use a ptr set instead of a linear search to unique TokenFactor operands. This fixes PR1423 llvm-svn: 37102	2007-05-16 06:37:59 +00:00
Evan Cheng	288f133c71	Bug fix: should check ABI alignment, not pref. alignment. llvm-svn: 37094	2007-05-16 02:04:50 +00:00
Lauro Ramos Venancio	3f142cbca2	Fix an infinite recursion in GetNegatedExpression. llvm-svn: 37086	2007-05-15 17:05:43 +00:00
Chris Lattner	c7596efdad	Fix some subtle issues handling immediate values. This fixes test/CodeGen/ARM/2007-05-14-InlineAsmCstCrash.ll llvm-svn: 37069	2007-05-15 01:33:58 +00:00
Chris Lattner	e49c974a7c	implement a simple fneg optimization/propagation thing. This compiles: CodeGen/PowerPC/fneg.ll into: _t4: fmul f0, f3, f4 fmadd f1, f1, f2, f0 blr instead of: _t4: fneg f0, f3 fmul f0, f0, f4 fmsub f1, f1, f2, f0 blr llvm-svn: 37054	2007-05-14 22:04:50 +00:00
Evan Cheng	f325c2a65e	Can't fold the bit_convert is the store is a truncating store. llvm-svn: 36962	2007-05-09 21:49:47 +00:00
Anton Korobeynikov	192d09c2d9	Do not assert, when case range split metric is zero and JTs are not allowed: just emit binary tree in this case. This fixes PR1403. llvm-svn: 36959	2007-05-09 20:07:08 +00:00
Evan Cheng	562e45692e	Forgot a check. llvm-svn: 36910	2007-05-07 21:36:06 +00:00
Evan Cheng	a4cf58a103	Enable a couple of xforms: - (store (bitconvert v)) -> (store v) if resultant store does not require higher alignment - (bitconvert (load v)) -> (load (bitconvert*)v) if resultant load does not require higher alignment llvm-svn: 36908	2007-05-07 21:27:48 +00:00
Duncan Sands	671e8c4444	Parameter attributes on invoke calls were being lost due to the wrong attribute index being used. Fix proposed by Anton Korobeynikov, who asked me to implement and commit it for him. This is PR1398. llvm-svn: 36906	2007-05-07 20:49:28 +00:00
Anton Korobeynikov	a8fd7fdc25	Detabify llvm-svn: 36891	2007-05-06 20:14:21 +00:00
Chris Lattner	07e6f3257c	Propagate alignment/volatility in two places. Implement support for expanding a bitcast from an illegal vector type to a legal one (e.g. 4xi32 -> 4xf32 in SSE1). This fixes PR1371 and CodeGen/X86/2007-05-05-VecCastExpand.ll llvm-svn: 36787	2007-05-05 19:39:05 +00:00
Duncan Sands	4cb9eb81ef	A bitcast of a global variable may have been constant folded to a GEP - handle this case too. llvm-svn: 36745	2007-05-04 17:12:26 +00:00
Evan Cheng	044a0a8cfb	Don't create indexed load / store with zero offset! llvm-svn: 36716	2007-05-03 23:52:19 +00:00
Chris Lattner	44a2ed66b1	Allow i/s to match (gv+c). This fixes CodeGen/PowerPC/2007-05-03-InlineAsm-S-Constraint.ll and PR1382 llvm-svn: 36672	2007-05-03 16:54:34 +00:00
Devang Patel	8c78a0bff0	Drop 'const' llvm-svn: 36662	2007-05-03 01:11:54 +00:00
Anton Korobeynikov	11940fbba3	Properly set arguments bitwidth of EHSELECT node llvm-svn: 36654	2007-05-02 22:15:48 +00:00
Devang Patel	e95c6ad802	Use 'static const char' instead of 'static const int'. Due to darwin gcc bug, one version of darwin linker coalesces static const int, which defauts PassID based pass identification. llvm-svn: 36652	2007-05-02 21:39:20 +00:00
Devang Patel	09f162ca6a	Do not use typeinfo to identify pass in pass manager. llvm-svn: 36632	2007-05-01 21:15:47 +00:00
Evan Cheng	b68343cdd8	Forgot about chain result; also UNDEF cannot have multiple values. llvm-svn: 36622	2007-05-01 08:53:39 +00:00
Evan Cheng	a684cd23a5	* Only turn a load to UNDEF if all of its outputs have no uses (indexed loads produce two results.) * Do not touch volatile loads. llvm-svn: 36604	2007-05-01 00:38:21 +00:00
Chris Lattner	8cfd33b647	Continue refactoring inline asm code. If there is an earlyclobber output register, preallocate all input registers and the early clobbered output. This fixes PR1357 and CodeGen/PowerPC/2007-04-30-InlineAsmEarlyClobber.ll llvm-svn: 36599	2007-04-30 21:11:17 +00:00
Chris Lattner	4333f8b1cf	refactor GetRegistersForValue to take OpInfo as an argument instead of various pieces of it. No functionality change. llvm-svn: 36592	2007-04-30 17:29:31 +00:00
Chris Lattner	ef07332504	refactor some code, no functionality change llvm-svn: 36590	2007-04-30 17:16:27 +00:00
Chris Lattner	412d61af43	generalize aggregate handling llvm-svn: 36568	2007-04-29 18:58:03 +00:00
Chris Lattner	401d8db381	memory operands that have a direct operand should have their stores created before the copies into physregs are done. This avoids having flag operands skip the store, causing cycles in the dag at sched time. This fixes infinite loops on these tests: test/CodeGen/Generic/2007-04-08-MultipleFrameIndices.ll for PR1308 test/CodeGen/PowerPC/2007-01-29-lbrx-asm.ll test/CodeGen/PowerPC/2007-01-31-InlineAsmAddrMode.ll test/CodeGen/X86/2006-07-12-InlineAsmQConstraint.ll for PR828 llvm-svn: 36547	2007-04-28 21:12:06 +00:00
Chris Lattner	de339fa55d	eliminate more redundant constraint type analysis llvm-svn: 36546	2007-04-28 21:03:16 +00:00
Chris Lattner	b2e55562ed	merge constraint type analysis stuff together. llvm-svn: 36545	2007-04-28 21:01:43 +00:00
Chris Lattner	d7e3b6c442	Significant refactoring of the inline asm stuff, to support future changes. No functionality change. llvm-svn: 36544	2007-04-28 20:49:53 +00:00
Chris Lattner	1deacd61f4	memory inputs to an inline asm are required to have an address available. If the operand is not already an indirect operand, spill it to a constant pool entry or a stack slot. This fixes PR1356 and CodeGen/X86/2007-04-27-InlineAsm-IntMemInput.ll llvm-svn: 36536	2007-04-28 06:42:38 +00:00
Chris Lattner	d102ed0ac6	Fix CodeGen/Generic/2007-04-27-LargeMemObject.ll and CodeGen/Generic/2007-04-27-InlineAsm-X-Dest.ll llvm-svn: 36534	2007-04-28 06:08:13 +00:00
Chris Lattner	4df3e8093b	Fix this to match change to InlineAsm class. llvm-svn: 36524	2007-04-28 04:05:59 +00:00
Chris Lattner	1cbe208cda	Fix incorrect legalization of EHSELECTOR. This fixes CodeGen/Generic/2007-04-14-EHSelectorCrash.ll and PR1326 llvm-svn: 36510	2007-04-27 17:12:52 +00:00
Evan Cheng	bf535fc8bd	Expand UINT_TO_FP in turns of SINT_TO_FP when UINTTOFP_* libcalls are not available. llvm-svn: 36501	2007-04-27 07:33:31 +00:00
Chris Lattner	784fe9dbbb	improve EH global handling, patch by Duncan Sands. llvm-svn: 36499	2007-04-27 01:20:11 +00:00
Chris Lattner	8131ab7c0f	enable Anton's shift/and switch lowering stuff! It now passes ppc bootstrap successfully! woohoo... llvm-svn: 36496	2007-04-26 21:09:43 +00:00
Anton Korobeynikov	d7ae7f1659	Fixx off-by-one bug, which prevents llvm-gcc bootstrap on ppc32 llvm-svn: 36490	2007-04-26 20:44:04 +00:00
Dan Gohman	e131e3ac02	Fix a typo in a comment. llvm-svn: 36485	2007-04-26 19:40:56 +00:00
Evan Cheng	15f269afa3	This was lefted out. Fixed sumarray-dbl. llvm-svn: 36445	2007-04-25 18:33:21 +00:00
Chris Lattner	cb0ed0cfbd	allow support for 64-bit stack objects llvm-svn: 36420	2007-04-25 04:08:28 +00:00
Chris Lattner	01a26c74ae	Be more careful about folding op(x, undef) when we have vector operands. This fixes CodeGen/X86/2007-04-24-VectorCrash.ll llvm-svn: 36413	2007-04-25 00:00:45 +00:00
Bill Wendling	47917b697f	Assertion when using a 1-element vector for an add operation. Get the real vector type in this case. llvm-svn: 36402	2007-04-24 21:13:23 +00:00
Scott Michel	4cfa616cee	Use '-1U' where '-1UL' is obvious overkill, eliminating gcc warnings about tests always being true in the process. llvm-svn: 36387	2007-04-24 01:24:20 +00:00
Christopher Lamb	8af6d5896f	PR400 phase 2. Propagate attributed load/store information through DAGs. llvm-svn: 36356	2007-04-22 23:15:30 +00:00
Lauro Ramos Venancio	4e91908f17	X86 TLS: Implement review feedback. llvm-svn: 36318	2007-04-21 20:56:26 +00:00
Reid Spencer	0c1349e6bc	Revert Christopher Lamb's load/store alignment changes. llvm-svn: 36309	2007-04-21 18:36:27 +00:00
Christopher Lamb	bff50208c8	add support for alignment attributes on load/store instructions llvm-svn: 36301	2007-04-21 08:16:25 +00:00
Lauro Ramos Venancio	94314be0e0	Allow the lowering of ISD::GLOBAL_OFFSET_TABLE. llvm-svn: 36290	2007-04-20 23:02:39 +00:00
Lauro Ramos Venancio	2518889872	Implement "general dynamic", "initial exec" and "local exec" TLS models for X86 32 bits. llvm-svn: 36283	2007-04-20 21:38:10 +00:00
Chris Lattner	f03c90bee6	allow SRL to simplify its operands, as it doesn't demand all bits as input. llvm-svn: 36245	2007-04-18 03:06:49 +00:00
Chris Lattner	bf14f20632	When replacing a node in SimplifyDemandedBits, if the old node used any single-use nodes, they will be dead soon. Make sure to remove them before processing other nodes. This implements CodeGen/X86/shl_elim.ll llvm-svn: 36244	2007-04-18 03:05:22 +00:00
Chris Lattner	15c1b820cc	fix a pasto llvm-svn: 36242	2007-04-18 03:01:40 +00:00
Chris Lattner	4aff52bf3d	Fix a bug in my previous patch, grabbing the shift amount width from the wrong operand. llvm-svn: 36223	2007-04-17 22:53:02 +00:00
Chris Lattner	9a861a8550	Fold (x << c1)>> c2 into a single shift if the bits shifted out aren't used. This compiles: int baz(long long a) { return (short)(((int)(a >>24)) >> 9); } into: _baz: srwi r2, r3, 1 extsh r3, r2 blr on PPC, instead of: _baz: slwi r2, r3, 8 srwi r2, r2, 9 extsh r3, r2 blr GCC produces: _baz: srwi r10,r4,24 insrwi r10,r3,24,0 srawi r9,r3,24 srawi r3,r10,9 extsh r3,r3 blr This implements CodeGen/PowerPC/shl_elim.ll llvm-svn: 36221	2007-04-17 21:14:16 +00:00
Chris Lattner	9ad5915559	SIGN_EXTEND_INREG does not demand its top bits. Give SimplifyDemandedBits a chance to hack on it. This compiles: int baz(long long a) { return (short)(((int)(a >>24)) >> 9); } into: _baz: slwi r2, r3, 8 srwi r2, r2, 9 extsh r3, r2 blr instead of: _baz: srwi r2, r4, 24 rlwimi r2, r3, 8, 0, 23 srwi r2, r2, 9 extsh r3, r2 blr This implements CodeGen/PowerPC/sign_ext_inreg1.ll llvm-svn: 36212	2007-04-17 19:03:21 +00:00
Anton Korobeynikov	fb80151c42	Removed tabs everywhere except autogenerated & external files. Add make target for tabs checking. llvm-svn: 36146	2007-04-16 18:10:23 +00:00
Chris Lattner	6bd7b7b30b	disable switch lowering using shift/and. It still breaks ppc bootstrap for some reason. :( Will investigate. llvm-svn: 36011	2007-04-14 19:39:41 +00:00
Anton Korobeynikov	8a1a84f96e	Fix PR1325: Case range optimization was performed in the case it shouldn't. Also fix some "latent" bug on 64-bit platforms llvm-svn: 35990	2007-04-14 13:25:55 +00:00
Chris Lattner	7196f09edc	disable shift/and lowering to work around PR1325 for now. llvm-svn: 35985	2007-04-14 02:26:56 +00:00
Anton Korobeynikov	e288040abf	Fix PR1323 : we haven't updated phi nodes in good manner :) llvm-svn: 35963	2007-04-13 06:53:51 +00:00
Chris Lattner	5111499136	the result of an inline asm copy can be an arbitrary VT that the register class supports. In the case of vectors, this means we often get the wrong type (e.g. we get v4f32 instead of v8i16). Make sure to convert the vector result to the right type. This fixes CodeGen/X86/2007-04-11-InlineAsmVectorResult.ll llvm-svn: 35944	2007-04-12 06:00:20 +00:00
Chris Lattner	a77cb3ce68	fold noop vbitconvert instructions llvm-svn: 35943	2007-04-12 05:58:43 +00:00
Chris Lattner	784a68a702	Fix weirdness handling single element vectors. llvm-svn: 35941	2007-04-12 04:44:28 +00:00
Reid Spencer	c6251a7dfd	For PR1284: Implement the "part_set" intrinsic. llvm-svn: 35938	2007-04-12 02:48:46 +00:00
Chris Lattner	18e4ac4107	fix an infinite loop compiling ldecod, notice by JeffC. llvm-svn: 35910	2007-04-11 16:51:53 +00:00
Chris Lattner	a083ffcad7	Fix this harder. llvm-svn: 35888	2007-04-11 06:50:51 +00:00
Chris Lattner	c5f85d3738	don't create shifts by zero, fix some problems with my previous patch llvm-svn: 35887	2007-04-11 06:43:25 +00:00
Chris Lattner	65786b078c	Teach the codegen to turn [aez]ext (setcc) -> selectcc of 1/0, which often allows other simplifications. For example, this compiles: int isnegative(unsigned int X) { return !(X < 2147483648U); } Into this code: x86: movl 4(%esp), %eax shrl $31, %eax ret arm: mov r0, r0, lsr #31 bx lr thumb: lsr r0, r0, #31 bx lr instead of: x86: cmpl $0, 4(%esp) sets %al movzbl %al, %eax ret arm: mov r3, #0 cmp r0, #0 movlt r3, #1 mov r0, r3 bx lr thumb: mov r2, #1 mov r1, #0 cmp r0, #0 blt LBB1_2 @entry LBB1_1: @entry cpy r2, r1 LBB1_2: @entry cpy r0, r2 bx lr Testcase here: test/CodeGen/Generic/ispositive.ll llvm-svn: 35883	2007-04-11 05:32:27 +00:00
Chris Lattner	41189c63cc	Codegen integer abs more efficiently using the trick from the PPC CWG. This improves codegen on many architectures. Tests committed as CodeGen/*/iabs.ll X86 Old: X86 New: _test: _test: movl 4(%esp), %ecx movl 4(%esp), %eax movl %ecx, %eax movl %eax, %ecx negl %eax sarl $31, %ecx testl %ecx, %ecx addl %ecx, %eax cmovns %ecx, %eax xorl %ecx, %eax ret ret PPC Old: PPC New: _test: _test: cmpwi cr0, r3, -1 srawi r2, r3, 31 neg r2, r3 add r3, r3, r2 bgt cr0, LBB1_2 ; xor r3, r3, r2 LBB1_1: ; blr mr r3, r2 LBB1_2: ; blr ARM Old: ARM New: _test: _test: rsb r3, r0, #0 add r3, r0, r0, asr #31 cmp r0, #0 eor r0, r3, r0, asr #31 movge r3, r0 bx lr mov r0, r3 bx lr Thumb Old: Thumb New: _test: _test: neg r2, r0 asr r2, r0, #31 cmp r0, #0 add r0, r0, r2 bge LBB1_2 eor r0, r2 LBB1_1: @ bx lr cpy r0, r2 LBB1_2: @ bx lr Sparc Old: Sparc New: test: test: save -96, %o6, %o6 save -96, %o6, %o6 sethi 0, %l0 sra %i0, 31, %l0 sub %l0, %i0, %l0 add %i0, %l0, %l1 subcc %i0, -1, %l1 xor %l1, %l0, %i0 bg .BB1_2 restore %g0, %g0, %g0 nop retl .BB1_1: nop or %g0, %l0, %i0 .BB1_2: restore %g0, %g0, %g0 retl nop It also helps alpha/ia64 :) llvm-svn: 35881	2007-04-11 05:11:38 +00:00
Reid Spencer	a472f66dd0	For PR1146: Put the parameter attributes in their own ParamAttr name space. Adjust the rest of llvm as a result. llvm-svn: 35877	2007-04-11 02:44:20 +00:00
Chris Lattner	f269d84ca0	apparently some people commit without building the tree, or they forget to commit a LOT of files. llvm-svn: 35858	2007-04-10 03:20:39 +00:00
Jeff Cohen	e0bbbd3774	No longer needed. llvm-svn: 35850	2007-04-09 23:42:32 +00:00
Chris Lattner	35f0417ec1	remove dead target hooks. llvm-svn: 35847	2007-04-09 23:34:08 +00:00
Chris Lattner	39f65335d5	remove some dead target hooks, subsumed by isLegalAddressingMode llvm-svn: 35840	2007-04-09 22:27:04 +00:00
Anton Korobeynikov	da964a2852	Use integer log for metric calculation llvm-svn: 35834	2007-04-09 21:57:03 +00:00
Jeff Cohen	0475f3b4e9	Unbreak VC++ build. llvm-svn: 35817	2007-04-09 14:32:59 +00:00
Anton Korobeynikov	506eaf7915	Next stage into switch lowering refactoring 1. Fix some bugs in the jump table lowering threshold 2. Implement much better metric for optimal pivot selection 3. Tune thresholds for different lowering methods 4. Implement shift-and trick for lowering small (<machine word length) cases with few destinations. Good testcase will follow. llvm-svn: 35816	2007-04-09 12:31:58 +00:00
Reid Spencer	71b79e3d99	For PR1146: Adapt handling of parameter attributes to use the new ParamAttrsList class. llvm-svn: 35814	2007-04-09 06:17:21 +00:00
Chris Lattner	7b2decfa0a	implement CodeGen/X86/inline-asm-x-scalar.ll:test3 llvm-svn: 35802	2007-04-09 05:31:20 +00:00
Chris Lattner	18d6718e78	add some assertions llvm-svn: 35800	2007-04-09 05:23:13 +00:00
Chris Lattner	b49917da92	Fix PR1316 llvm-svn: 35783	2007-04-09 00:33:58 +00:00
Chris Lattner	e55ecfb870	Fix for CodeGen/X86/2007-04-08-InlineAsmCrash.ll and PR1314 llvm-svn: 35779	2007-04-08 22:23:26 +00:00
Chris Lattner	1c741e95d3	minor comment fix llvm-svn: 35696	2007-04-06 17:47:14 +00:00
Reid Spencer	85460acfbf	Change the bit_part_select (non)implementation from "return 0" to abort. llvm-svn: 35679	2007-04-05 01:20:18 +00:00
Reid Spencer	cce90f55ed	Implement the llvm.bit.part_select.iN.iN.iN overloaded intrinsic. llvm-svn: 35678	2007-04-04 23:48:25 +00:00
Anton Korobeynikov	915e61736b	Properly emit range comparisons for switch cases, where neighbour cases go to the same destination. Now we're producing really good code for switch-lower-feature.ll testcase llvm-svn: 35672	2007-04-04 21:14:49 +00:00
Scott Michel	16627a542f	1. Insert custom lowering hooks for ISD::ROTR and ISD::ROTL. 2. Help DAGCombiner recognize zero/sign/any-extended versions of ROTR and ROTL patterns. This was motivated by the X86/rotate.ll testcase, which should now generate code for other platforms (and soon-to-come platforms.) Rewrote code slightly to make it easier to read. llvm-svn: 35605	2007-04-02 21:36:32 +00:00
Reid Spencer	3a0843e734	For PR1297: Adjust for changes in the bit counting intrinsics. They all return i32 now so we have to trunc/zext the DAG node accordingly. llvm-svn: 35546	2007-04-01 07:34:11 +00:00
Reid Spencer	a090ffb2ab	For PR1297: Change getOperationName to return std::string instead of const char* llvm-svn: 35545	2007-04-01 07:32:19 +00:00
Chris Lattner	f6a6d3c8b0	move a bunch of code out of the sdisel pass into its own opt pass "codegenprepare". llvm-svn: 35529	2007-03-31 04:18:03 +00:00
Chris Lattner	f2d71d49e2	switch TL::getValueType to use MVT::getValueType. llvm-svn: 35527	2007-03-31 04:05:24 +00:00
Chris Lattner	ac3f81508c	add one addressing mode description hook to rule them all. llvm-svn: 35520	2007-03-30 23:14:50 +00:00
Dale Johannesen	4bbd2eefba	Fix incorrect combination of different loads. Reenable zext-over-truncate combination. llvm-svn: 35517	2007-03-30 21:38:07 +00:00
Evan Cheng	ccee35fd0d	Disable load width reduction xform of variant (zext (truncate load x)) for big endian targets until llvm-gcc build issue has been resolved. llvm-svn: 35449	2007-03-29 07:56:46 +00:00
Evan Cheng	4388043b25	Scale 1 is always ok. llvm-svn: 35407	2007-03-28 01:55:52 +00:00
Evan Cheng	c2cba18f2b	Remove isLegalAddressImmediate. llvm-svn: 35406	2007-03-28 01:53:55 +00:00
Evan Cheng	07c42d43a2	GEP index sinking fixes: 1) Take address scale into consideration. e.g. i32* -> scale 4. 2) Examine all the users of GEP. 3) Generalize to inter-block GEP's (no longer uses loopinfo). 4) Don't do xform if GEP has other variable index(es). llvm-svn: 35403	2007-03-28 01:49:39 +00:00
Anton Korobeynikov	37a0bfe128	Remove dead code llvm-svn: 35380	2007-03-27 12:05:48 +00:00
Anton Korobeynikov	3a9d68181a	Split big monster into small helpers. No functionality change. llvm-svn: 35379	2007-03-27 11:29:11 +00:00
Evan Cheng	c42406b5ad	SDISel does not preserve all, it changes CFG and other info. llvm-svn: 35376	2007-03-27 00:53:36 +00:00
Evan Cheng	8275f0e0af	SIGN_EXTEND_INREG requires one extra operand, a ValueType node. llvm-svn: 35350	2007-03-26 07:12:51 +00:00
Anton Korobeynikov	7037826c86	First step of switch lowering refactoring: perform worklist-driven strategy, emit JT's where possible. llvm-svn: 35338	2007-03-25 15:07:15 +00:00
Chris Lattner	77f0479833	Implement support for vector operands to inline asm, implementing CodeGen/X86/2007-03-24-InlineAsmVectorOp.ll llvm-svn: 35332	2007-03-25 05:00:54 +00:00
Chris Lattner	3d7efa2586	implement initial support for the silly X constraint. Testcase here: CodeGen/X86/2007-03-24-InlineAsmXConstraint.ll llvm-svn: 35327	2007-03-25 04:35:41 +00:00
Chris Lattner	843e44503c	Implement CodeGen/X86/2007-03-24-InlineAsmMultiRegConstraint.ll llvm-svn: 35324	2007-03-25 02:18:14 +00:00
Chris Lattner	d685514e2e	switch TargetLowering::getConstraintType to take the entire constraint, not just the first letter. No functionality change. llvm-svn: 35322	2007-03-25 02:14:49 +00:00
Chris Lattner	2a991268f7	don't rely on ADL llvm-svn: 35299	2007-03-24 17:37:03 +00:00
Evan Cheng	b7051f596a	Adjust offset to compensate for big endian machines. llvm-svn: 35293	2007-03-24 00:02:43 +00:00
Evan Cheng	a883b58caf	Make sure SEXTLOAD of the specific type is supported on the target. llvm-svn: 35289	2007-03-23 22:13:36 +00:00
Evan Cheng	e2f5f24e8e	Also replace uses of SRL if that's also folded during ReduceLoadWidth(). llvm-svn: 35286	2007-03-23 20:55:21 +00:00
Evan Cheng	a824e79f06	A couple of bug fixes for reducing load width xform: 1. Address offset is in bytes. 2. Make sure truncate node uses are replaced with new load. llvm-svn: 35274	2007-03-23 02:16:52 +00:00
Dan Gohman	dcb291faa4	Change uses of Function::front to Function::getEntryBlock for readability. llvm-svn: 35265	2007-03-22 16:38:57 +00:00
Evan Cheng	464dc9b74c	More opportunities to reduce load size. llvm-svn: 35254	2007-03-22 01:54:19 +00:00
Dale Johannesen	0c6bb5eab7	repair x86 performance, dejagnu problems from previous change llvm-svn: 35245	2007-03-21 21:51:52 +00:00
Evan Cheng	d63baead9b	fold (truncate (srl (load x), c)) -> (smaller load (x+c/vt bits)) llvm-svn: 35239	2007-03-21 20:14:05 +00:00
Dale Johannesen	bacf4acf65	do not share old induction variables when this would result in invalid instructions (that would have to be split later) llvm-svn: 35227	2007-03-20 21:54:54 +00:00
Jeff Cohen	1baf5c84ab	Fix some VC++ warnings. llvm-svn: 35224	2007-03-20 20:43:18 +00:00
Lauro Ramos Venancio	971aa18867	Code clean up. llvm-svn: 35220	2007-03-20 20:09:03 +00:00
Evan Cheng	550cf0369c	Minor bug. llvm-svn: 35219	2007-03-20 19:32:11 +00:00
Lauro Ramos Venancio	25878b45f5	CopyToReg source operand can be a physical register. llvm-svn: 35213	2007-03-20 16:46:44 +00:00
Evan Cheng	a2465dfc07	Use SmallSet instead of std::set. llvm-svn: 35133	2007-03-17 08:53:30 +00:00
Evan Cheng	be22235790	If sdisel has decided to sink GEP index expression into any BB. Replace all uses in that BB. llvm-svn: 35132	2007-03-17 08:22:49 +00:00
Evan Cheng	c5bc763f50	Turn on GEP index sinking by default. llvm-svn: 35127	2007-03-16 18:32:30 +00:00
Evan Cheng	0a9d0cabaf	Stupid bug. llvm-svn: 35126	2007-03-16 17:50:20 +00:00
Evan Cheng	009ea54262	Sink a binary expression into its use blocks if it is a loop invariant computation used as GEP indexes and if the expression can be folded into target addressing mode of GEP load / store use types. llvm-svn: 35123	2007-03-16 08:46:27 +00:00
Evan Cheng	a2a2fd1e55	Added isLegalAddressExpression hook to test if the given expression can be folded into target addressing mode for the given type. llvm-svn: 35121	2007-03-16 08:42:32 +00:00
Evan Cheng	b9e3db67fb	Estimate a cost using the possible number of scratch registers required and use it as a late BURR scheduling tie-breaker. Intuitively, it's good to push down instructions whose results are liveout so their long live ranges won't conflict with other values which are needed inside the BB. Further prioritize liveout instructions by the number of operands which are calculated within the BB. llvm-svn: 35109	2007-03-14 22:43:40 +00:00
Evan Cheng	2874855302	Try schedule def + use closer whne Sethi-Ullman numbers are the same. e.g. t1 = op t2, c1 t3 = op t4, c2 and the following instructions are both ready. t2 = op c3 t4 = op c4 Then schedule t2 = op first. i.e. t4 = op c4 t2 = op c3 t1 = op t2, c1 t3 = op t4, c2 This creates more short live intervals which work better with the register allocator. llvm-svn: 35089	2007-03-13 23:25:11 +00:00
Evan Cheng	b7004fd889	More flexible TargetLowering LSR hooks for testing whether an immediate is a legal target address immediate or scale. llvm-svn: 35076	2007-03-12 23:37:10 +00:00
Chris Lattner	ce8aba03ee	implement support for floating point constants used as inline asm memory operands. llvm-svn: 35033	2007-03-08 22:29:47 +00:00
Chris Lattner	b7bc3f2d30	make this fail even in non-assert builds. llvm-svn: 35025	2007-03-08 07:07:03 +00:00
Anton Korobeynikov	ed4b303c10	Refactoring of formal parameter flags. Enable properly use of zext/sext/aext stuff. llvm-svn: 35008	2007-03-07 16:25:09 +00:00
Evan Cheng	8a1d09d079	Avoid combining indexed load further. llvm-svn: 35005	2007-03-07 08:07:03 +00:00
Chris Lattner	13780ac7db	big endian 32-bit systems (e.g. ppc32) want to return the high reg first, not the lo-reg first. This is fallout from my ppc calling conv change yesterday, it fixes test/ExecutionEngine/2003-05-06-LivenessClobber.llx llvm-svn: 34983	2007-03-06 20:01:06 +00:00
Anton Korobeynikov	f0b9316552	Enumerate SDISel formal parameter attributes. Make use of new enumeration. llvm-svn: 34960	2007-03-06 06:10:33 +00:00
Jeff Cohen	b622c11f77	Unbreak VC++ build. llvm-svn: 34917	2007-03-05 00:00:42 +00:00
Chris Lattner	47206667c0	fold away addc nodes when we know there cannot be a carry-out. llvm-svn: 34913	2007-03-04 20:40:38 +00:00
Chris Lattner	2dcc6e7f58	generalize llvm-svn: 34910	2007-03-04 20:08:45 +00:00
Chris Lattner	e2e13caeb2	canonicalize constants to the RHS of addc/adde. If nothing uses the carry out of addc, turn it into add. This allows us to compile: long long test(long long A, unsigned B) { return (A + ((long long)B << 32)) & 123; } into: _test: movl $123, %eax andl 4(%esp), %eax xorl %edx, %edx ret instead of: _test: xorl %edx, %edx movl %edx, %eax addl 4(%esp), %eax ;; add of zero andl $123, %eax ret llvm-svn: 34909	2007-03-04 20:03:15 +00:00
Chris Lattner	362621c7ae	eliminate some ops if they have an undef RHS llvm-svn: 34908	2007-03-04 20:01:46 +00:00
Chris Lattner	ca401aac31	Fix CodeGen/Generic/fpowi-promote.ll and PR1239 llvm-svn: 34893	2007-03-03 23:43:21 +00:00
Chris Lattner	567b9254cd	Add an expand action for ISD label which just deletes the label. This "fixes" PR1238. llvm-svn: 34890	2007-03-03 19:21:38 +00:00
Jim Laskey	d5453d7b56	Lower eh filter intrinsic. llvm-svn: 34802	2007-03-01 20:24:30 +00:00
Jim Laskey	644af6b68f	Chain is on second operand. llvm-svn: 34759	2007-02-28 20:43:58 +00:00
Jim Laskey	cf465fcebc	MERGE_VALUES unnecessary. llvm-svn: 34750	2007-02-28 18:37:04 +00:00
Chris Lattner	74bb92902e	add methods for analysis of call results and return nodes. llvm-svn: 34738	2007-02-28 07:09:40 +00:00
Chris Lattner	e74744143f	add methods to analyze calls and formals. llvm-svn: 34736	2007-02-28 06:56:37 +00:00
Chris Lattner	9f059194a7	Minor refactoring of CC Lowering interfaces llvm-svn: 34656	2007-02-27 05:13:54 +00:00
Chris Lattner	dc3adc83e7	move CC Lowering stuff to its own public interface llvm-svn: 34655	2007-02-27 04:43:02 +00:00
Chris Lattner	fce448f856	Fold (sext (truncate x)) more aggressively, by avoiding creation of a sextinreg if not needed. This is useful in two cases: before legalize, it avoids creating a sextinreg that will be trivially removed. After legalize if the target doesn't support sextinreg, the trunc/sext would not have been removed before. llvm-svn: 34621	2007-02-26 03:13:59 +00:00
Chris Lattner	ab5d0ac02c	track signedness of formal argument, though we have a fixme here. llvm-svn: 34620	2007-02-26 02:56:58 +00:00
Jim Laskey	14059d958a	Fix for PR1224. llvm-svn: 34610	2007-02-25 21:43:59 +00:00
Chris Lattner	8c504cf9a0	optimize duplicate ValueMap lookups llvm-svn: 34599	2007-02-25 18:40:32 +00:00
Chris Lattner	387f464121	fold trivial token factor nodes. This allows us to compile test/CodeGen/X86/fp-stack-ret.ll into: movl 4(%esp), %eax fldl (%eax) ret instead of: subl $12, %esp movl 16(%esp), %eax movsd (%eax), %xmm0 movsd %xmm0, (%esp) fldl (%esp) addl $12, %esp ret by eliminating a token factor that blocked a check. llvm-svn: 34584	2007-02-25 08:24:27 +00:00
Chris Lattner	168c5856bf	initialize a instance variable llvm-svn: 34567	2007-02-25 01:28:05 +00:00
Jim Laskey	e1d1c0590f	Deal with cases when MMI is not requested. llvm-svn: 34556	2007-02-24 09:45:44 +00:00
Jim Laskey	b869ab6f31	Drop unused operand. llvm-svn: 34555	2007-02-24 09:44:17 +00:00
Chris Lattner	d7ef3f804d	Fix CodeGen/Generic/2007-02-23-DAGCombine-Miscompile.ll and PR1219 llvm-svn: 34551	2007-02-24 02:09:29 +00:00
Jim Laskey	31fef788eb	Handle improper cast. llvm-svn: 34535	2007-02-23 21:45:01 +00:00
Jim Laskey	3e3a65b764	Need to init. llvm-svn: 34499	2007-02-22 18:04:49 +00:00
Jim Laskey	44c37e7dbf	Tighten up error checking of args. llvm-svn: 34493	2007-02-22 16:10:05 +00:00
Jim Laskey	504e99479c	Handle lowering invoke to call correctly. llvm-svn: 34492	2007-02-22 15:38:06 +00:00
Jim Laskey	7f5872c455	Simplify lowering and selection of exception ops. llvm-svn: 34491	2007-02-22 15:37:19 +00:00
Jim Laskey	4b37a4c712	Selection and lowering for exception handling. llvm-svn: 34481	2007-02-21 22:53:45 +00:00
Chris Lattner	56e5fea163	print target nodes nicely llvm-svn: 34369	2007-02-17 06:38:37 +00:00
Chris Lattner	a9f917af59	Implement i/n/s constraints correctly. This fixes test/CodeGen/PowerPC/2007-02-16-InlineAsmNConstraint.ll llvm-svn: 34368	2007-02-17 06:00:35 +00:00
Chris Lattner	68dcec6fea	fix indentation llvm-svn: 34307	2007-02-15 18:19:15 +00:00
Chris Lattner	21ebae3394	Apply B Scott Michel's patch for PR1184, which improves diagnostics in an abort case. llvm-svn: 34306	2007-02-15 18:17:56 +00:00
Reid Spencer	09575bac2e	For PR1195: Change use of "packed" term to "vector" in comments, strings, variable names, etc. llvm-svn: 34300	2007-02-15 03:39:18 +00:00
Reid Spencer	d84d35ba70	For PR1195: Rename PackedType -> VectorType, ConstantPacked -> ConstantVector, and PackedTyID -> VectorTyID. No functional changes. llvm-svn: 34293	2007-02-15 02:26:10 +00:00
Chris Lattner	ab1812f806	fix a warning llvm-svn: 34272	2007-02-14 07:34:56 +00:00
Chris Lattner	1cf84d2745	Refix CodeGen/Generic/switch-lower.ll. In contrast to my previous patch, this doesn't miscompile lots of programs :) llvm-svn: 34268	2007-02-14 07:18:16 +00:00
Chris Lattner	945e437c65	Generalize TargetData strings, to support more interesting forms of data. Patch by Scott Michel. llvm-svn: 34266	2007-02-14 05:52:17 +00:00
Chris Lattner	59b27fa371	implement expand of truncate. This allows truncates from i128 to i64 to be supported on 32-bit hosts. llvm-svn: 34257	2007-02-13 23:55:16 +00:00
Chris Lattner	d08d31f68a	Fix PR1198, by adding initial i128 support. Patch by Dan Gohman. llvm-svn: 34256	2007-02-13 23:41:38 +00:00
Chris Lattner	2fbff4d2dc	revert my previous switch lowering change, which miscompiles a few programs. This will break a dj test until I have time to investigate. llvm-svn: 34247	2007-02-13 20:09:07 +00:00
Lauro Ramos Venancio	abde3cc16c	Add a space between // and the comment. llvm-svn: 34244	2007-02-13 18:10:13 +00:00
Lauro Ramos Venancio	9956dcffbe	Add "original alignment" to function arguments flags. llvm-svn: 34240	2007-02-13 13:50:08 +00:00
Chris Lattner	9056bae3be	Fix switch lowering to order cases in zext order, which is how we emit the comparisons. This fixes an infinite loop on CodeGen/Generic/switch-lower.ll and PR1197 llvm-svn: 34216	2007-02-13 01:05:56 +00:00
Chris Lattner	c473d8e431	Privatize StructLayout::MemberOffsets, adding an accessor llvm-svn: 34156	2007-02-10 19:55:17 +00:00
Evan Cheng	276b44b0f9	Add function live-ins to entry block live-in set. llvm-svn: 34112	2007-02-10 02:43:39 +00:00
Evan Cheng	de6083463d	Rename some variables to avoid confusion with SelectionDAGISel::BB. llvm-svn: 34110	2007-02-10 01:08:18 +00:00
Evan Cheng	93049457ee	Make use of TLI.SimplifySetCC() in LegalizeSetCCOperands(). llvm-svn: 34066	2007-02-08 22:16:19 +00:00
Evan Cheng	92658d5648	Move SimplifySetCC to TargetLowering and allow it to be shared with legalizer. llvm-svn: 34065	2007-02-08 22:13:59 +00:00
Chris Lattner	19083a4671	switch the VRBaseMap in the scheduler from an std::map to a DenseMap. This speeds up the isel pass from 2.5570s to 2.4722s on kc++ (3.4%). llvm-svn: 33879	2007-02-04 08:47:20 +00:00
Chris Lattner	9af2c86bc8	Introduce new UnarySDNode/BinarySDNode/TernarySDNode nodes, which coallocate their operands with the node itself. This reduces malloc traffic for operand lists. This reduces isel time on kc++ from 2.6164 to 2.5570s, about 2.3%. llvm-svn: 33878	2007-02-04 08:35:21 +00:00
Chris Lattner	22639f3d90	eliminate the SDNode::setValueTypes method. llvm-svn: 33876	2007-02-04 07:37:24 +00:00
Chris Lattner	f17b4222e2	eliminate a bunch of duplicate ctors and helper functions. llvm-svn: 33875	2007-02-04 07:28:00 +00:00
Chris Lattner	edfc7e5fa2	move MorphNode to out of line and merge setNodeOperands into it. There is no behavior or performance change here. llvm-svn: 33869	2007-02-04 02:49:29 +00:00
Chris Lattner	3bf17b6fa5	simplify MorphNodeTo to take a VTList operand. llvm-svn: 33868	2007-02-04 02:41:42 +00:00
Chris Lattner	486edfbc6f	eliminate some extraneous methods in SDNode llvm-svn: 33867	2007-02-04 02:32:44 +00:00
Chris Lattner	20754cc579	Give each selectiondag node class a home for it's vtable and rtti info llvm-svn: 33866	2007-02-04 02:23:32 +00:00
Chris Lattner	289aa4495c	Switch VAlueMap from std::map to DenseMap. llvm-svn: 33863	2007-02-04 01:35:11 +00:00
Chris Lattner	79084305ee	Switch NodeMap from std::map to DenseMap, this speeds up isel by 2.3% llvm-svn: 33862	2007-02-04 01:31:47 +00:00
Chris Lattner	94c44c96d3	swtich vector-> smallvector, speeding up selectiondag stuff 1% llvm-svn: 33861	2007-02-04 01:20:02 +00:00
Chris Lattner	4b0ddb22e9	Switch promoted/expanded ops over to using a DenseMap. Vector related maps aren't worth it. llvm-svn: 33860	2007-02-04 01:17:38 +00:00
Chris Lattner	ed39c86176	switch LegalizedNodes from std::map to a DenseMap. This speeds up isel time as a whole on kc++ by 11%. llvm-svn: 33857	2007-02-04 00:50:02 +00:00
Chris Lattner	ebeb48d4bc	Eliminate some malloc traffic from LegalizeAllNodesNotLeadingTo, speeding up isel on kimwitu by 0.7%. llvm-svn: 33853	2007-02-04 00:27:56 +00:00
Chris Lattner	cba058ce51	Eliminate some std::sets. This speeds up isel of kimwitu by about 0.9% llvm-svn: 33852	2007-02-04 00:24:41 +00:00
Chris Lattner	feec7137ce	Switch SelectionDAG::ReplaceAllUsesOfValueWith to use a SmallSetVector for the users set (most nodes have 1 or 2 users). This speeds up the isel pass 3.2% on kimwitu. llvm-svn: 33849	2007-02-04 00:14:31 +00:00
Chris Lattner	0a30b1f00f	switch the sched unit map over to use a DenseMap instead of std::map. This speeds up isel as a whole time by 2.6%. llvm-svn: 33810	2007-02-03 01:34:13 +00:00
Chris Lattner	e83030b9c8	Switch ComputeTopDownOrdering over to using a densemap. This speeds up isel as a whole by 3.3%. llvm-svn: 33809	2007-02-03 01:12:36 +00:00
Evan Cheng	f309d13677	Pasto llvm-svn: 33806	2007-02-03 00:43:46 +00:00
Reid Spencer	2341c22ec7	Changes to support making the shift instructions be true BinaryOperators. This feature is needed in order to support shifts of more than 255 bits on large integer types. This changes the syntax for llvm assembly to make shl, ashr and lshr instructions look like a binary operator: shl i32 %X, 1 instead of shl i32 %X, i8 1 Additionally, this should help a few passes perform additional optimizations. llvm-svn: 33776	2007-02-02 02:16:23 +00:00
Anton Korobeynikov	1b4e6015b4	Fixed uninitialized stuff inside LegalizeDAG. Fortunately, the only affected part is codegen of "memove" inside x86 backend. This fixes PR1144 llvm-svn: 33752	2007-02-01 08:39:52 +00:00
Chris Lattner	296a83cefb	Fit in 80 columns llvm-svn: 33745	2007-02-01 04:55:59 +00:00
Chris Lattner	e3eeb24a86	Emit a better assertion message for PR1133 llvm-svn: 33736	2007-02-01 01:21:12 +00:00
Evan Cheng	53026f1d5a	Allow the target to override the ISD::CondCode that's to be used to test the result of the comparison libcall against zero. llvm-svn: 33701	2007-01-31 09:29:11 +00:00
Reid Spencer	5301e7c605	For PR1136: Rename GlobalVariable::isExternal as isDeclaration to avoid confusion with external linkage types. llvm-svn: 33663	2007-01-30 20:08:39 +00:00
Chris Lattner	d27f95e08d	add initial support for handling inline asms with multiple constraints. This doesn't do the "right thing" but will probably work in most cases. This implements CodeGen/PowerPC/2007-01-29-lbrx-asm.ll. llvm-svn: 33643	2007-01-29 23:45:14 +00:00
Nate Begeman	eda5997cc8	Finish off bug 680, allowing targets to custom lower frame and return address nodes. llvm-svn: 33636	2007-01-29 22:58:52 +00:00
Anton Korobeynikov	06f7d4bec7	Arguments are counting from 1. not from 0. Maybe we should change numbering somehow? E.g. make return argument the last? llvm-svn: 33606	2007-01-28 18:01:49 +00:00
Anton Korobeynikov	9fa3839d29	More cleanup llvm-svn: 33605	2007-01-28 16:04:40 +00:00
Anton Korobeynikov	037c867b54	Propagate changes from my local tree. This patch includes: 1. New parameter attribute called 'inreg'. It has meaning "place this parameter in registers, if possible". This is some generalization of gcc's regparm(n) attribute. It's currently used only in X86-32 backend. 2. Completely rewritten CC handling/lowering code inside X86 backend. Merged stdcall + c CCs and fastcall + fast CC. 3. Dropped CSRET CC. We cannot add struct return variant for each target-specific CC (e.g. stdcall + csretcc and so on). 4. Instead of CSRET CC introduced 'sret' parameter attribute. Setting in on first attribute has meaning 'This is hidden pointer to structure return. Handle it gently'. 5. Fixed small bug in llvm-extract + add new feature to FunctionExtraction pass, which relinks all internal-linkaged callees from deleted function to external linkage. This will allow further linking everything together. NOTEs: 1. Documentation will be updated soon. 2. llvm-upgrade should be improved to translate csret => sret. Before this, there will be some unexpected test fails. llvm-svn: 33597	2007-01-28 13:31:35 +00:00
Jim Laskey	c56315c2b5	Change the MachineDebugInfo to MachineModuleInfo to better reflect usage for debugging and exception handling. llvm-svn: 33550	2007-01-26 21:22:28 +00:00
Jim Laskey	f9e5445ed4	Make LABEL a builtin opcode. llvm-svn: 33537	2007-01-26 14:34:52 +00:00
Evan Cheng	f5c96fabf9	Renamed getTypeAlignmentShift() to getPreferredTypeAlignmentShift(). llvm-svn: 33482	2007-01-24 07:03:39 +00:00
Evan Cheng	be48a47d9d	Remove the DoubleTy special case. llvm-svn: 33449	2007-01-22 23:13:55 +00:00
Reid Spencer	2eadb5310d	For PR970: Clean up handling of isFloatingPoint() and dealing with PackedType. Patch by Gordon Henriksen! llvm-svn: 33415	2007-01-21 00:29:26 +00:00
Chris Lattner	50ee0e40e5	Teach TargetData to handle 'preferred' alignment for each target, and use these alignment amounts to align scalars when we can. Patch by Scott Michel! llvm-svn: 33409	2007-01-20 22:35:55 +00:00
Evan Cheng	00a640dbe0	Fix for PR1108: type of insert_vector_elt index operand is PtrVT, not MVT::i32. llvm-svn: 33398	2007-01-20 10:10:26 +00:00
Reid Spencer	a94d394ad2	For PR1043: This is the final patch for this PR. It implements some minor cleanup in the use of IntegerType, to wit: 1. Type::getIntegerTypeMask -> IntegerType::getBitMask 2. Type::IntTy changed to IntegerType from Type* 3. ConstantInt::getType() returns IntegerType* now, not Type* This also fixes PR1120. Patch by Sheng Zhou. llvm-svn: 33370	2007-01-19 21:13:56 +00:00
Evan Cheng	9201100b29	Remove this xform: (shl (add x, c1), c2) -> (add (shl x, c2), c1<<c2) Replace it with: (add (shl (add x, c1), c2), ) -> (add (add (shl x, c2), c1<<c2), ) This fixes test/CodeGen/ARM/smul.ll llvm-svn: 33361	2007-01-19 17:51:44 +00:00
Chris Lattner	4dc4489286	Fix PR1114 and CodeGen/Generic/2007-01-15-LoadSelectCycle.ll by being careful when folding "c ? load p : load q" that C doesn't reach either load. If so, folding this into load (c ? p : q) will induce a cycle in the graph. llvm-svn: 33251	2007-01-16 05:59:59 +00:00
Chris Lattner	f70c5cd5db	add options to view the dags before the first or second pass of dag combine. llvm-svn: 33249	2007-01-16 04:55:25 +00:00
Reid Spencer	a8a0f2cf68	Compensate for loss of DerivedTypes.h in TargetLowering.h llvm-svn: 33159	2007-01-12 23:31:12 +00:00
Reid Spencer	ddf1421b8e	Move a function out of line. llvm-svn: 33158	2007-01-12 23:30:31 +00:00
Evan Cheng	61a4be88b4	Minor fix. llvm-svn: 33149	2007-01-12 22:51:10 +00:00
Evan Cheng	31cbddf28a	Store default libgcc routine names and allow them to be redefined by target. llvm-svn: 33105	2007-01-12 02:11:51 +00:00
Zhou Sheng	75b871fb1e	For PR1043: Merge ConstantIntegral and ConstantBool into ConstantInt. Remove ConstantIntegral and ConstantBool from LLVM. llvm-svn: 33073	2007-01-11 12:24:14 +00:00
Evan Cheng	6730f03370	Naming consistency. llvm-svn: 33026	2007-01-08 23:55:53 +00:00
Evan Cheng	961bbd393b	Fix for PR1075: bottom-up register-reduction scheduling actually increases register pressure. - Fixed bugs in sethi-ullman number computation and priority queue comparison functions. - Separate code that handles priority computation special cases from SU number computation. llvm-svn: 33025	2007-01-08 23:50:38 +00:00
Chris Lattner	0199fd6d59	Implement some trivial FP foldings when -enable-unsafe-fp-math is specified. This implements CodeGen/PowerPC/unsafe-math.ll llvm-svn: 33024	2007-01-08 23:04:05 +00:00
Chris Lattner	10cae15d8e	remove support for llvm.isunordered llvm-svn: 32992	2007-01-07 08:37:22 +00:00
Evan Cheng	5f80c450f3	Expand fcopysign to the bitwise sequence if select is marked as expensive. llvm-svn: 32940	2007-01-05 23:33:44 +00:00
Evan Cheng	3b841ddbe0	Bug in ExpandFCOPYSIGNToBitwiseOps(). Clear the old sign bit of operand 0 before or'ing in the sign bit of operand 1. llvm-svn: 32930	2007-01-05 21:31:51 +00:00
Evan Cheng	376c9c4c97	CopyToReg source operand can be a register as well. e.g. Copy from GlobalBaseReg. llvm-svn: 32929	2007-01-05 20:59:06 +00:00
Evan Cheng	8ec5283dc4	GEP subscript is interpreted as a signed value. llvm-svn: 32888	2007-01-05 01:46:20 +00:00
Chris Lattner	96035bed51	fix PowerPC/2007-01-04-ArgExtension.ll, a bug handling K&R prototypes with the recent signless changes. llvm-svn: 32884	2007-01-04 22:22:37 +00:00
Evan Cheng	003feb03d5	Expand fcopysign to a series of bitwise of operations when it's profitable to do so. llvm-svn: 32881	2007-01-04 21:56:39 +00:00
Reid Spencer	e6f81876eb	Legalizer doesn't do an ANY_EXTEND if we don't ask for one so make sure that we default to an ANY_EXTEND if no parameter attribute is set on the result value of a function. llvm-svn: 32836	2007-01-03 16:49:33 +00:00
Reid Spencer	2a34b91666	Restore previous behavior of defaulting to ZEXT. This works around two things: (1) preventing PR1071 and (2) working around missing parameter attributes for bool type. (2) will be fixed shortly. When PR1071 is fixed, this patch should be undone. llvm-svn: 32831	2007-01-03 05:03:05 +00:00
Reid Spencer	0917adf614	Two changes: 1. Switch expression and cases are compared signed and are sign extended. 2. For function results needing extended, do SIGN_EXTEND if the SExtAttribute is set and ZERO_EXTEND if the ZExtAttribute is set, otherwise just let the Legalizer do ANY_EXTEND. This fixes the recent regression in kimwitu++ and probably the llvm-gcc bootstrap issue we had today. llvm-svn: 32830	2007-01-03 04:25:33 +00:00
Reid Spencer	791864c6a5	Clean up from recent changes. Comment the new parameter to ExpandLibCall. Consolidate some lines of code and remove duplication. llvm-svn: 32829	2007-01-03 04:22:32 +00:00
Reid Spencer	e63b6518fa	For PR950: Three changes: 1. Convert signed integer types to signless versions. 2. Implement the @sext and @zext parameter attributes. Previously the type of an function parameter was used to determine whether it should be sign extended or zero extended before the call. This information is now communicated via the function type's parameter attributes. 3. The interface to LowerCallTo had to be changed in order to accommodate the parameter attribute information. Although it would have been convenient to pass in the FunctionType itself, there isn't always one present in the caller. Consequently, a signedness indication for the result type and for each parameter was provided for in the interface to this method. All implementations were changed to make the adjustment necessary. llvm-svn: 32788	2006-12-31 05:55:36 +00:00
Reid Spencer	266e42b312	For PR950: This patch removes the SetCC instructions and replaces them with the ICmp and FCmp instructions. The SetCondInst instruction has been removed and been replaced with ICmpInst and FCmpInst. llvm-svn: 32751	2006-12-23 06:05:41 +00:00
Evan Cheng	258657e64e	getLoad() and getStore() calls missed SVOffset operand. Thanks to Dan Gohman for pointing it out! llvm-svn: 32712	2006-12-20 01:27:29 +00:00
Chris Lattner	aee775a6b7	Eliminate static ctors from Statistics llvm-svn: 32698	2006-12-19 22:41:21 +00:00
Evan Cheng	9ad6edf2ec	May need to promote the operand (either sign_extend_inreg or and) before expanding a {s\|u}int_to_fp. llvm-svn: 32665	2006-12-19 01:44:04 +00:00
Evan Cheng	adc80f98cf	LegalizeSetCCOperands() may end up inserting libcalls. They need to be properly serialized. Do not clear LastCallSEQ_END until that is done. llvm-svn: 32659	2006-12-18 22:55:34 +00:00
Bill Wendling	e33ce528da	Fixed so that it dereferences the ostream pointer. llvm-svn: 32640	2006-12-17 11:15:53 +00:00
Bill Wendling	a77f14265b	Added an automatic cast to "std::ostream" etc. from OStream. We then can rework the hacks that had us passing OStream in. We pass in std::ostream instead, check for null, and then dispatch to the correct print() method. llvm-svn: 32636	2006-12-17 05:15:13 +00:00
Chris Lattner	9bd5ed636c	Fix PR1049 and CodeGen/Generic/2006-12-16-InlineAsmCrash.ll by producing target constants instead of constants. Constants can get selected to li/movri instructions, which causes the scheduler to explode. llvm-svn: 32633	2006-12-16 21:14:48 +00:00
Evan Cheng	28cf4277bb	Cannot combine an indexed load / store any further. llvm-svn: 32629	2006-12-16 06:25:23 +00:00
Evan Cheng	851e589eda	Expand FP undef llvm-svn: 32623	2006-12-16 02:20:50 +00:00
Evan Cheng	860004688a	Allow promoted FP_TO_UINT / FP_TO_SINT to expand operand. llvm-svn: 32621	2006-12-16 02:10:30 +00:00
Evan Cheng	388cbbf000	Expand fabs / fneg to and / xor. llvm-svn: 32619	2006-12-16 00:52:40 +00:00
Evan Cheng	884bc09d10	Fix select_cc, select expansion to soft-fp bugs. llvm-svn: 32616	2006-12-15 22:42:55 +00:00
Jim Laskey	26df19ace6	This code was usurping the sextload expand in teh legalizer. Just make sure the right conditions are checked. llvm-svn: 32611	2006-12-15 21:38:30 +00:00
Chris Lattner	b1a9492ed7	silence a bogus warning llvm-svn: 32597	2006-12-15 07:36:19 +00:00
Evan Cheng	35fdd5ffe1	Expand FP compares to soft-fp call(s) llvm-svn: 32590	2006-12-15 02:59:56 +00:00
Jim Laskey	70323a8146	1. Tidy up jump table info. 2. Allow the jit to handle PIC relocable jump tables. llvm-svn: 32581	2006-12-14 19:17:33 +00:00
Evan Cheng	22cf89967b	More soft-fp work. llvm-svn: 32559	2006-12-13 20:57:08 +00:00
Evan Cheng	e370e0eb09	Expand (f64 extload f32) to (f64 fp_ext (load f32)) if f64 type action is expand. llvm-svn: 32527	2006-12-13 03:19:57 +00:00
Evan Cheng	f3a80c6235	Expand fsqrt, fsin, and fcos to libcalls. llvm-svn: 32526	2006-12-13 02:38:13 +00:00
Evan Cheng	0a5b805f6d	Expand f32 / f64 to i32 / i64 conversion to soft-fp library calls. llvm-svn: 32523	2006-12-13 01:57:55 +00:00
Reid Spencer	bfe26ffcfc	Replace CastInst::createInferredCast calls with more accurate cast creation calls. llvm-svn: 32521	2006-12-13 00:50:17 +00:00
Evan Cheng	3766fc60da	Expand FP constant to integers if FP types are not legal. llvm-svn: 32497	2006-12-12 22:19:28 +00:00
Evan Cheng	97a750fc47	Soft fp FNEG, SINT_TO_FP, UINT_TO_FP libcall expansion. llvm-svn: 32495	2006-12-12 21:51:17 +00:00
Evan Cheng	47833a1d28	Expand ConstantFP to load from CP if float types are being expanded. llvm-svn: 32494	2006-12-12 21:32:44 +00:00
Evan Cheng	634885f71e	Expand i32/i64 CopyToReg f32/f64 to BIT_CONVERT + CopyToReg. llvm-svn: 32493	2006-12-12 21:21:32 +00:00
Evan Cheng	0076ca0da9	- When expanding a bit_convert whose src operand is also to be expanded and its expansion result type is equal to the result type of the bit_convert, e.g. (i64 bit_convert (f64 op)) if FP is not legal returns the result of the expanded source operand. - Store f32 / f64 may be expanded to a single store i32/i64. llvm-svn: 32490	2006-12-12 19:53:13 +00:00
Evan Cheng	0c0b78c18e	Expand formal arguments and call arguments recursively: e.g. f64 -> i64 -> 2 x i32. llvm-svn: 32476	2006-12-12 07:27:38 +00:00
Chris Lattner	2f96e7d241	fit in 80 cols llvm-svn: 32474	2006-12-12 05:22:21 +00:00
Chris Lattner	080881614d	this can only be fptrunc. llvm-svn: 32473	2006-12-12 05:21:51 +00:00
Chris Lattner	6ba11fbd75	Revert Nate's patch to fix X86/store-fp-constant.ll. With the dag combiner and legalizer separated like they currently are, I don't see a way to handle this xform. llvm-svn: 32466	2006-12-12 04:18:56 +00:00
Chris Lattner	b7524b6d0e	make this code more aggressive about turning store fpimm into store int imm. This is not sufficient to fix X86/store-fp-constant.ll llvm-svn: 32465	2006-12-12 04:16:14 +00:00
Reid Spencer	3c49edcaa1	Change inferred cast creation calls to more specific cast creations. llvm-svn: 32460	2006-12-12 01:17:41 +00:00
Evan Cheng	3432ab97c1	Re-apply changes that were backed out and fix a naughty typo. llvm-svn: 32442	2006-12-11 19:27:14 +00:00
John Criswell	b3b285185f	It seems the llvm::OStream class does not handle stream manipulators. For now, just grab the stream and perform the output on it directly. llvm-svn: 32441	2006-12-11 19:15:36 +00:00
Chris Lattner	e9a203c4e5	Revert changes that broke oggenc on ppc llvm-svn: 32440	2006-12-11 18:53:38 +00:00
Evan Cheng	218369881f	Don't convert store double C, Ptr to store long C, Ptr if i64 is not a legal type. llvm-svn: 32434	2006-12-11 17:25:19 +00:00
Evan Cheng	f4bec95b58	f32 / f64 node is expanded to one i32 / i64 node. llvm-svn: 32433	2006-12-11 06:50:04 +00:00
Evan Cheng	f6b01fdb48	Clean up some bad code. llvm-svn: 32432	2006-12-11 06:25:26 +00:00
Nate Begeman	8e20c760fa	Move something that should be in the dag combiner from the legalizer to the dag combiner. llvm-svn: 32431	2006-12-11 02:23:46 +00:00
Anton Korobeynikov	3b7c257cae	Cleaned setjmp/longjmp lowering interfaces. Now we're producing right code (both asm & cbe) for Mingw32 target. Removed autoconf checks for underscored versions of setjmp/longjmp. llvm-svn: 32415	2006-12-10 23:12:42 +00:00
Evan Cheng	4eee72471c	Preliminary soft float support. llvm-svn: 32394	2006-12-09 02:42:38 +00:00
Chris Lattner	d9f04e4875	Fix CodeGen/PowerPC/2006-12-07-SelectCrash.ll on PPC64 llvm-svn: 32336	2006-12-07 22:36:47 +00:00
Bill Wendling	355fc5ad50	Removed more <iostream> includes llvm-svn: 32321	2006-12-07 20:28:15 +00:00
Bill Wendling	22e978a736	Removing even more <iostream> includes. llvm-svn: 32320	2006-12-07 20:04:42 +00:00
Chris Lattner	700b873130	Detemplatize the Statistic class. The only type it is instantiated with is 'unsigned'. llvm-svn: 32279	2006-12-06 17:46:33 +00:00
Jeff Cohen	cc08c83186	Unbreak VC++ build. llvm-svn: 32113	2006-12-02 02:22:01 +00:00
Evan Cheng	67fc141db5	Match TargetInstrInfo changes. llvm-svn: 32098	2006-12-01 21:52:58 +00:00
Evan Cheng	a743fada65	Avoid inifinite looping if READCYCLECOUNTER isn't custom lowered. llvm-svn: 32022	2006-11-29 19:13:47 +00:00
Evan Cheng	6973993e9c	Allow target to custom lower READCYCLECOUNTER (when it doesn't have to be expanded). llvm-svn: 32016	2006-11-29 08:26:18 +00:00
Evan Cheng	feba507a97	Fix for PR1023 by Dan Gohman. llvm-svn: 32003	2006-11-29 01:58:12 +00:00
Evan Cheng	6e12a052ff	Fix for PR1022 (folding loads of static initializers) by Dan Gohman. llvm-svn: 32000	2006-11-29 01:38:07 +00:00
Chris Lattner	90f4238c38	add a hook to allow targets to hack on inline asms to lower them to llvm when they want to. llvm-svn: 31997	2006-11-29 01:12:32 +00:00
Chris Lattner	3abb63651b	Fix PR1016 llvm-svn: 31950	2006-11-28 01:03:30 +00:00
Evan Cheng	20350c4025	Change MachineInstr ctor's to take a TargetInstrDescriptor reference instead of opcode and number of operands. llvm-svn: 31947	2006-11-27 23:37:22 +00:00
Chris Lattner	5d5916b4d1	Fix the dag combiner bug corresponding to PR1014. llvm-svn: 31943	2006-11-27 21:50:02 +00:00
Chris Lattner	3da631f29a	For better or worse, load from i1 is assumed to be zero extended. Do not form a load from i1 from larger loads that may not be zext'd. llvm-svn: 31933	2006-11-27 04:40:53 +00:00
Chris Lattner	db18938355	If a brcond condition is promoted, make sure to zero extend it, even if not expanded into BR_CC. llvm-svn: 31932	2006-11-27 04:39:56 +00:00
Reid Spencer	6c38f0bb07	For PR950: The long awaited CAST patch. This introduces 12 new instructions into LLVM to replace the cast instruction. Corresponding changes throughout LLVM are provided. This passes llvm-test, llvm/test, and SPEC CPUINT2000 with the exception of 175.vpr which fails only on a slight floating point output difference. llvm-svn: 31931	2006-11-27 01:05:10 +00:00
Chris Lattner	3676a994ca	Fix PR1011 and CodeGen/Generic/2006-11-20-DAGCombineCrash.ll llvm-svn: 31878	2006-11-20 18:05:46 +00:00
Reid Spencer	d9436b6837	For PR950: First in a series of patches to convert SetCondInst into ICmpInst and FCmpInst using only two opcodes and having the instructions contain their predicate value. Nothing uses these classes yet. More patches to follow. llvm-svn: 31867	2006-11-20 01:22:35 +00:00
Jim Laskey	da0add3fd0	Fixing the ENABLE_OPTIMIZED=1 DISABLE_ASSERTIONS=1 build. llvm-svn: 31822	2006-11-17 13:07:55 +00:00
Evan Cheng	f64da389f8	Fix an incorrectly inverted condition. llvm-svn: 31773	2006-11-16 00:08:20 +00:00
Chris Lattner	30d08801ef	remove dead #include llvm-svn: 31753	2006-11-15 17:51:15 +00:00
Evan Cheng	dbd3d294e6	Matches MachineInstr changes. llvm-svn: 31712	2006-11-13 23:36:35 +00:00
Reid Spencer	2230144a75	Make an assert comment match the tested assertion. llvm-svn: 31686	2006-11-11 20:07:59 +00:00
Evan Cheng	979bbf48d5	Add methods to add implicit def use operands to a MI. llvm-svn: 31675	2006-11-11 10:20:02 +00:00
Chris Lattner	a0a8003f59	disallow preinc of a frameindex. This is not profitable and causes 2-addr pass to explode. This fixes a bunch of llc-beta failures on ppc last night. llvm-svn: 31661	2006-11-11 01:00:15 +00:00
Chris Lattner	eabc15c1d8	reduce indentation by using early exits. No functionality change. llvm-svn: 31660	2006-11-11 00:56:29 +00:00
Chris Lattner	ffad2166e1	move big chunks of code out-of-line, no functionality change. llvm-svn: 31658	2006-11-11 00:39:41 +00:00
Chris Lattner	4eac5f59e6	Fix a dag combiner bug exposed by my recent instcombine patch. This fixes CodeGen/Generic/2006-11-10-DAGCombineMiscompile.ll and PPC gsm/toast llvm-svn: 31644	2006-11-10 21:37:15 +00:00
Evan Cheng	8c9c6d71ed	Add implicit def / use operands to MachineInstr. llvm-svn: 31633	2006-11-10 08:43:01 +00:00
Evan Cheng	13440b025c	When forming a pre-indexed store, make sure ptr isn't the same or is a pred of value being stored. It would cause a cycle. llvm-svn: 31631	2006-11-10 08:28:11 +00:00
Chris Lattner	d5e604dbb2	commentate llvm-svn: 31627	2006-11-10 04:41:34 +00:00
Evan Cheng	6878378390	Don't attempt expensive pre-/post- indexed dag combine if target does not support them. llvm-svn: 31598	2006-11-09 19:10:46 +00:00
Evan Cheng	d550248f2c	Add a mechanism to specify whether a target supports a particular indexed load / store. llvm-svn: 31597	2006-11-09 18:56:43 +00:00
Evan Cheng	c034f14fbe	Rename ISD::MemOpAddrMode to ISD::MemIndexedMode llvm-svn: 31596	2006-11-09 18:44:21 +00:00
Evan Cheng	b15000736c	Rename ISD::MemOpAddrMode to ISD::MemIndexedMode llvm-svn: 31595	2006-11-09 17:55:04 +00:00
Evan Cheng	b58e06bc9e	getPostIndexedAddressParts change: passes in load/store instead of its loaded / stored VT. llvm-svn: 31584	2006-11-09 04:29:46 +00:00
Evan Cheng	85e54223cd	Match more post-indexed ops. llvm-svn: 31569	2006-11-08 20:27:27 +00:00
Jim Laskey	61feeb90f9	Remove redundant <cmath>. llvm-svn: 31561	2006-11-08 19:16:44 +00:00
Evan Cheng	0303cb9b33	- When performing pre-/post- indexed load/store transformation, do not worry about whether the new base ptr would be live below the load/store. Let two address pass split it back to non-indexed ops. - Minor tweaks / fixes. llvm-svn: 31544	2006-11-08 08:30:28 +00:00
Evan Cheng	6072435756	Fixed a minor bug preventing some pre-indexed load / store transformation. llvm-svn: 31543	2006-11-08 06:56:05 +00:00
Reid Spencer	fdff938a7e	For PR950: This patch converts the old SHR instruction into two instructions, AShr (Arithmetic) and LShr (Logical). The Shr instructions now are not dependent on the sign of their operands. llvm-svn: 31542	2006-11-08 06:47:33 +00:00
Evan Cheng	d48f7dd250	Fix a obscure post-indexed load / store dag combine bug. llvm-svn: 31537	2006-11-08 02:38:55 +00:00
Evan Cheng	60c6846d21	Add post-indexed load / store transformations. llvm-svn: 31498	2006-11-07 09:03:05 +00:00
Chris Lattner	94c231f453	Fix PR988 and CodeGen/Generic/2006-11-06-MemIntrinsicExpand.ll. The low part goes in the first operand of expandop, not the second one. llvm-svn: 31487	2006-11-07 04:11:44 +00:00
Evan Cheng	f24d15f969	Remove dead code; added a missing null ptr check. llvm-svn: 31478	2006-11-06 21:33:46 +00:00
Evan Cheng	eb99bd736a	Add comment. llvm-svn: 31473	2006-11-06 08:14:30 +00:00
Jeff Cohen	7d6f3db3e2	Unbreak VC++ build. llvm-svn: 31464	2006-11-05 19:31:28 +00:00
Evan Cheng	33157700d9	Added pre-indexed store support. llvm-svn: 31459	2006-11-05 09:31:14 +00:00
Evan Cheng	1a1e23eff7	Added getIndexedStore. llvm-svn: 31458	2006-11-05 09:30:09 +00:00
Evan Cheng	fd2c5dd806	Changes to use operand constraints to process two-address instructions. llvm-svn: 31453	2006-11-04 09:44:31 +00:00
Evan Cheng	9456dd8b81	Fix comments. llvm-svn: 31414	2006-11-03 07:31:32 +00:00
Evan Cheng	1dfd26a151	Rename llvm-svn: 31413	2006-11-03 07:21:16 +00:00
Reid Spencer	52f958741a	Remove dead variable. Fix 80 column violations. llvm-svn: 31412	2006-11-03 03:30:34 +00:00
Evan Cheng	357017f4a9	Added DAG combiner transformation to generate pre-indexed loads. llvm-svn: 31410	2006-11-03 03:06:21 +00:00
Evan Cheng	c176f038b9	Added isPredecessor. llvm-svn: 31409	2006-11-03 03:05:24 +00:00
Chris Lattner	cd7b92251d	silence warning llvm-svn: 31397	2006-11-03 01:28:29 +00:00
Reid Spencer	de46e48420	For PR786: Turn on -Wunused and -Wno-unused-parameter. Clean up most of the resulting fall out by removing unused variables. Remaining warnings have to do with unused functions (I didn't want to delete code without review) and unused variables in generated code. Maintainers should clean up the remaining issues when they see them. All changes pass DejaGnu tests and Olden. llvm-svn: 31380	2006-11-02 20:25:50 +00:00
Reid Spencer	7eb55b395f	For PR950: Replace the REM instruction with UREM, SREM and FREM. llvm-svn: 31369	2006-11-02 01:53:59 +00:00
Chris Lattner	55402d4403	Allow the getRegForInlineAsmConstraint method to return a register class with no fixes physreg. Treat this as permission to use any register in the register class. When this happens and it is safe, allow the llvm register allcoator to allocate the register instead of doing it at isel time. This eliminates a ton of copies around common inline asms. For example: int test2(int Y, int X) { asm("foo %0, %1" : "=r"(X): "r"(X)); return X; } now compiles to: _test2: foo r3, r4 blr instead of: _test2: mr r2, r4 foo r2, r2 mr r3, r2 blr GCC produces: _test2: foo r4, r4 mr r3,r4 blr llvm-svn: 31366	2006-11-02 01:41:49 +00:00
Evan Cheng	1359196c4e	Clean up. llvm-svn: 31359	2006-11-01 22:39:30 +00:00
Evan Cheng	47218fab42	CopyFromReg starts a live range so its use should not be considered a floater. llvm-svn: 31356	2006-11-01 22:17:06 +00:00
Evan Cheng	415f365e5c	Print jumptable index. llvm-svn: 31340	2006-11-01 04:48:30 +00:00
Chris Lattner	fe43befeda	Compile CodeGen/PowerPC/fp-branch.ll to: _intcoord_cond_next55: LBB1_3: ;cond_next55 lis r2, ha16(LCPI1_0) lfs f0, lo16(LCPI1_0)(r2) fcmpu cr0, f1, f0 blt cr0, LBB1_2 ;cond_next62.exitStub LBB1_1: ;bb72.exitStub li r3, 1 blr LBB1_2: ;cond_next62.exitStub li r3, 0 blr instead of: _intcoord_cond_next55: LBB1_3: ;cond_next55 lis r2, ha16(LCPI1_0) lfs f0, lo16(LCPI1_0)(r2) fcmpu cr0, f1, f0 bge cr0, LBB1_1 ;bb72.exitStub LBB1_4: ;cond_next55 lis r2, ha16(LCPI1_0) lfs f0, lo16(LCPI1_0)(r2) fcmpu cr0, f1, f0 bnu cr0, LBB1_2 ;cond_next62.exitStub LBB1_1: ;bb72.exitStub li r3, 1 blr LBB1_2: ;cond_next62.exitStub li r3, 0 blr llvm-svn: 31330	2006-10-31 23:06:00 +00:00
Chris Lattner	427301fdae	look through isunordered to inline it into branch blocks. llvm-svn: 31328	2006-10-31 22:37:42 +00:00
Chris Lattner	1fd360e13a	handle global address constant sdnodes llvm-svn: 31323	2006-10-31 20:01:56 +00:00
Chris Lattner	6f043b90ea	TargetLowering::isOperandValidForConstraint llvm-svn: 31319	2006-10-31 19:41:18 +00:00
Chris Lattner	8c6949e5b2	Change the prototype for TargetLowering::isOperandValidForConstraint llvm-svn: 31318	2006-10-31 19:40:43 +00:00
Chris Lattner	968f803928	Turn an assert into an error message. This is commonly triggered when we don't support a specific constraint yet. When this happens, print the unsupported constraint. llvm-svn: 31310	2006-10-31 07:33:13 +00:00
Evan Cheng	e6d584765f	Fix a typo which can break jumptables. llvm-svn: 31305	2006-10-31 02:31:00 +00:00
Evan Cheng	84a28d4e76	Lower jumptable to BR_JT. The legalizer can lower it to a BRIND or let the target custom lower it. llvm-svn: 31293	2006-10-30 08:00:44 +00:00
Evan Cheng	c3e695137d	Added a new SDNode type: BR_JT for jumptable branch. llvm-svn: 31292	2006-10-30 07:59:36 +00:00
Chris Lattner	e60ae823e8	fix Generic/2006-10-29-Crash.ll llvm-svn: 31281	2006-10-29 21:01:20 +00:00
Chris Lattner	f31b9ef458	Fix a load folding issue that Evan noticed: there is no need to export values used by comparisons in the main block. llvm-svn: 31279	2006-10-29 18:23:37 +00:00
Evan Cheng	7ab6123c42	VLOAD is not the LoadSDNode opcode. llvm-svn: 31276	2006-10-29 06:14:47 +00:00
Nick Lewycky	dc146a9fb9	Remove spurious case. EXTLOAD is not one of the node opcodes. llvm-svn: 31275	2006-10-29 02:26:30 +00:00
Chris Lattner	bba52191fa	split critical edges more carefully and intelligently. In particular, critical edges whose destinations are not phi nodes don't bother us. Also, share split edges, since the split edge can't have a phi. This significantly reduces the complexity of generated code in some cases. llvm-svn: 31274	2006-10-28 19:22:10 +00:00
Jim Laskey	eef273a16f	Load and stores have not been uniqued properly. llvm-svn: 31261	2006-10-28 17:25:28 +00:00
Chris Lattner	3e6b1c6157	Split all critical edges before isel. This resolves issues with spill code being inserted on unsplit critical edges, which introduces (sometimes large amounts of) partially dead spill code. This also fixes PR925 + CodeGen/Generic/switch-crit-edge-constant.ll llvm-svn: 31260	2006-10-28 17:04:37 +00:00
Chris Lattner	b78eb6c8d1	Fix a serious bug that caused any x86 vector stuff to infinite loop llvm-svn: 31254	2006-10-28 06:15:26 +00:00
Jim Laskey	bd0f088743	Clean up. llvm-svn: 31243	2006-10-27 23:52:51 +00:00
Chris Lattner	84a035056e	Fix a bug in merged condition handling (CodeGen/Generic/2006-10-27-CondFolding.ll). Add many fewer CFG edges and PHI node entries. If there is a switch which has the same block as multiple destinations, only add that block once as a successor/phi node (in the jumptable case) llvm-svn: 31242	2006-10-27 23:50:33 +00:00
Jim Laskey	f576b42bb2	Switch over from SelectionNodeCSEMap to FoldingSet. llvm-svn: 31240	2006-10-27 23:46:08 +00:00
Chris Lattner	b9392fb635	remove debug code llvm-svn: 31233	2006-10-27 21:58:03 +00:00
Chris Lattner	f1b54fd7a5	Codegen cond&cond with two branches. This compiles (f.e.) PowerPC/and-branch.ll to: cmpwi cr0, r4, 4 bgt cr0, LBB1_2 ;UnifiedReturnBlock LBB1_3: ;entry cmplwi cr0, r3, 0 bne cr0, LBB1_2 ;UnifiedReturnBlock instead of: cmpwi cr7, r4, 4 mfcr r2 addic r4, r3, -1 subfe r3, r4, r3 rlwinm r2, r2, 30, 31, 31 or r2, r2, r3 cmplwi cr0, r2, 0 bne cr0, LBB1_2 ;UnifiedReturnBlock LBB1_1: ;cond_true llvm-svn: 31232	2006-10-27 21:54:23 +00:00
Chris Lattner	ed0110b949	Turn conditions like x<Y\|z==q into multiple blocks. This compiles Regression/CodeGen/X86/or-branch.ll into: _foo: subl $12, %esp call L_bar$stub movl 20(%esp), %eax movl 16(%esp), %ecx cmpl $5, %eax jl LBB1_1 #cond_true LBB1_3: #entry testl %ecx, %ecx jne LBB1_2 #UnifiedReturnBlock LBB1_1: #cond_true call L_bar$stub addl $12, %esp ret LBB1_2: #UnifiedReturnBlock addl $12, %esp ret instead of: _foo: subl $12, %esp call L_bar$stub movl 20(%esp), %eax movl 16(%esp), %ecx cmpl $4, %eax setg %al testl %ecx, %ecx setne %cl testb %cl, %al jne LBB1_2 #UnifiedReturnBlock LBB1_1: #cond_true call L_bar$stub addl $12, %esp ret LBB1_2: #UnifiedReturnBlock addl $12, %esp ret And on ppc to: cmpwi cr0, r29, 5 blt cr0, LBB1_1 ;cond_true LBB1_3: ;entry cmplwi cr0, r30, 0 bne cr0, LBB1_2 ;UnifiedReturnBlock instead of: cmpwi cr7, r4, 4 mfcr r2 addic r4, r3, -1 subfe r30, r4, r3 rlwinm r29, r2, 30, 31, 31 and r2, r29, r30 cmplwi cr0, r2, 0 bne cr0, LBB1_2 ;UnifiedReturnBlock llvm-svn: 31230	2006-10-27 21:36:01 +00:00
Evan Cheng	96d6bf50ae	getPreIndexedLoad -> getIndexedLoad. llvm-svn: 31209	2006-10-26 21:53:40 +00:00
Reid Spencer	7e80b0b31e	For PR950: Make necessary changes to support DIV -> [SUF]Div. This changes llvm to have three division instructions: signed, unsigned, floating point. The bytecode and assembler are bacwards compatible, however. llvm-svn: 31195	2006-10-26 06:15:43 +00:00
Chris Lattner	61bcf9154d	visitSwitchCase knows how to insert conditional branches well. Change visitBr to just call visitSwitchCase, eliminating duplicate logic. llvm-svn: 31167	2006-10-24 18:07:37 +00:00
Chris Lattner	963ddad31a	Generalize CaseBlock a bit more: Rename LHSBB/RHSBB to TrueBB/FalseBB. Allow the RHS value to be null, in which case the LHS is treated as a bool. llvm-svn: 31166	2006-10-24 17:57:59 +00:00
Chris Lattner	3f179d24c6	generalize 'CaseBlock'. It really allows any comparison to be inserted. llvm-svn: 31161	2006-10-24 17:03:35 +00:00
Chris Lattner	4c931502cc	Minor tweak. Instead of generating: movl 32(%esp), %eax cmpl $1, %eax je LBB1_1 #bb LBB1_4: #entry cmpl $2, %eax je LBB1_2 #bb2 jmp LBB1_3 #UnifiedReturnBlock LBB1_1: #bb notice that we would miss the fall through and emit this instead: movl 32(%esp), %eax cmpl $2, %eax je LBB1_2 #bb2 LBB1_4: #entry cmpl $1, %eax jne LBB1_3 #UnifiedReturnBlock LBB1_1: #bb llvm-svn: 31130	2006-10-23 18:38:22 +00:00
Chris Lattner	76a7bc8c55	Fix phi node updating for switches lowered to linear sequences of branches. llvm-svn: 31125	2006-10-22 23:00:53 +00:00
Chris Lattner	4c3ef4782d	disable this code for now, it's not yet safely updating phi nodes llvm-svn: 31124	2006-10-22 22:47:10 +00:00
Chris Lattner	6d6fc26257	Implement PR964 and Regression/CodeGen/Generic/SwitchLowering.ll llvm-svn: 31119	2006-10-22 21:36:53 +00:00
Chris Lattner	c5ab6ce613	Make flag and chain edges visually distinguishable from value edges in DOT output. llvm-svn: 31067	2006-10-20 18:06:09 +00:00
Reid Spencer	e0fc4dfc22	For PR950: This patch implements the first increment for the Signless Types feature. All changes pertain to removing the ConstantSInt and ConstantUInt classes in favor of just using ConstantInt. llvm-svn: 31063	2006-10-20 07:07:24 +00:00
Bill Wendling	be96e1cd09	Partially in response to PR926: insert the newly created machine basic blocks into the basic block list when lowering the switch inst. into a binary tree of if-then statements. This allows the "visitSwitchCase" func to allow for fall-through behavior. llvm-svn: 31057	2006-10-19 21:46:38 +00:00
Jim Laskey	55e4dcad36	Add option for controlling inclusion of global AA. llvm-svn: 31040	2006-10-18 19:08:31 +00:00
Jim Laskey	a15b0ebb5e	Use global info for alias analysis. llvm-svn: 31035	2006-10-18 12:29:57 +00:00
Chris Lattner	78fd0f83ff	Trivial patch to speed up legalizing common i64 constants. llvm-svn: 31020	2006-10-17 21:47:13 +00:00
Chris Lattner	327b88b102	Fix CodeGen/PowerPC/2006-10-17-brcc-miscompile.ll llvm-svn: 31019	2006-10-17 21:24:15 +00:00
Evan Cheng	2f4ddce75c	Fix printer for StoreSDNode. llvm-svn: 31017	2006-10-17 21:18:26 +00:00
Evan Cheng	1839d76f69	Reflect MemOpAddrMode change; added a helper to create pre-indexed load. llvm-svn: 31016	2006-10-17 21:14:32 +00:00
Jim Laskey	e7d2c24a7d	Make it simplier to dump DAGs while in DAGCombiner. Remove a nasty optimization. llvm-svn: 31009	2006-10-17 19:33:52 +00:00
Evan Cheng	1e3a39cd08	Make sure operand does have size and element type operands. llvm-svn: 30999	2006-10-17 17:06:35 +00:00
Evan Cheng	f3ae00a64a	Be careful when looking through a vbit_convert. Optimizing this: (vector_shuffle (vbitconvert (vbuildvector (copyfromreg v4f32), 1, v4f32), 4, f32), (undef, undef, undef, undef), (0, 0, 0, 0), 4, f32) to the vbitconvert is a very bad idea. llvm-svn: 30989	2006-10-16 22:49:37 +00:00
Jim Laskey	dcb2b83886	Pass AliasAnalysis thru to DAGCombiner. llvm-svn: 30984	2006-10-16 20:52:31 +00:00
Jim Laskey	3bf4f3bd60	Tidy up after truncstore changes. llvm-svn: 30961	2006-10-14 12:14:27 +00:00
Evan Cheng	47fbeda5ce	Debug tweak. llvm-svn: 30959	2006-10-14 08:34:06 +00:00
Chris Lattner	6a1b2de8c4	Make sure that the node returned by SimplifySetCC is added to the worklist so that it can be deleted if unused. llvm-svn: 30955	2006-10-14 03:52:46 +00:00
Chris Lattner	0626bd2fbc	fold setcc of a setcc. llvm-svn: 30953	2006-10-14 01:02:29 +00:00
Chris Lattner	bd9acad805	When SimplifySetCC was moved to the DAGCombiner, it was never removed from SelectionDAG and it has since bitrotted. Remove the copy from SelectionDAG. Next, remove the constant folding piece of DAGCombiner::SimplifySetCC into a new FoldSetCC method which can be used by getNode() and SimplifySetCC. This fixes obscure bugs. llvm-svn: 30952	2006-10-14 00:41:01 +00:00
Jim Laskey	dcf983ce41	Reduce the workload by not adding chain users to work list. llvm-svn: 30948	2006-10-13 23:32:28 +00:00
Chris Lattner	45ffb1eb70	Fix a bug where we incorrectly turned '(X & 0) == 0' into '(X & 0) >> -1', which is undefined. "0" isn't a power of 2. llvm-svn: 30947	2006-10-13 22:46:18 +00:00
Evan Cheng	ab51cf2e78	Merge ISD::TRUNCSTORE to ISD::STORE. Switch to using StoreSDNode. llvm-svn: 30945	2006-10-13 21:14:26 +00:00
Chris Lattner	d0620d2773	Lower X%C into X/C+stuff. This allows the 'division by a constant' logic to apply to rems as well as divs. This fixes PR945 and speeds up ReedSolomon from 14.57s to 10.90s (which is now faster than gcc). It compiles CodeGen/X86/rem.ll into: _test1: subl $4, %esp movl %esi, (%esp) movl $2155905153, %ecx movl 8(%esp), %esi movl %esi, %eax imull %ecx addl %esi, %edx movl %edx, %eax shrl $31, %eax sarl $7, %edx addl %eax, %edx imull $255, %edx, %eax subl %eax, %esi movl %esi, %eax movl (%esp), %esi addl $4, %esp ret _test2: movl 4(%esp), %eax movl %eax, %ecx sarl $31, %ecx shrl $24, %ecx addl %eax, %ecx andl $4294967040, %ecx subl %ecx, %eax ret _test3: subl $4, %esp movl %esi, (%esp) movl $2155905153, %ecx movl 8(%esp), %esi movl %esi, %eax mull %ecx shrl $7, %edx imull $255, %edx, %eax subl %eax, %esi movl %esi, %eax movl (%esp), %esi addl $4, %esp ret instead of div/idiv instructions. llvm-svn: 30920	2006-10-12 20:58:32 +00:00
Evan Cheng	a731cb674a	Add RemoveDeadNode to remove a dead node and its (potentially) dead operands. llvm-svn: 30916	2006-10-12 20:34:05 +00:00
Chris Lattner	2e33fb453b	add a minor dag combine noticed when looking at PR945 llvm-svn: 30915	2006-10-12 20:23:19 +00:00
Jim Laskey	df2ccc395e	D'oh - need to use the rigth kind of store. llvm-svn: 30903	2006-10-12 15:22:24 +00:00
Jim Laskey	a13b9c7aa4	Alias analysis of TRUNCSTORE. llvm-svn: 30889	2006-10-11 18:55:16 +00:00
Jim Laskey	6a4c6d3a7a	Typo llvm-svn: 30884	2006-10-11 17:52:19 +00:00
Jim Laskey	0f7c328ae7	Handle aliasing of loadext. llvm-svn: 30883	2006-10-11 17:47:52 +00:00
Jim Laskey	08edf332ed	Fix regression in combiner alias analysis. llvm-svn: 30880	2006-10-11 13:47:09 +00:00
Evan Cheng	d35734bd1f	Naming consistency. llvm-svn: 30878	2006-10-11 07:10:22 +00:00
Andrew Lenharth	a6bbf33cbf	Jimptables working again on alpha. As a bonus, use the GOT node instead of the AlphaISD::GOT for internal stuff. llvm-svn: 30873	2006-10-11 04:29:42 +00:00
Chris Lattner	6df349676e	add two helper methods. llvm-svn: 30869	2006-10-11 03:58:02 +00:00
Evan Cheng	2da4671e05	FindModifiedNodeSlot needs to add LoadSDNode ivars to create proper SelectionDAGCSEMap ID. llvm-svn: 30866	2006-10-11 01:47:58 +00:00
Evan Cheng	7994aec7b5	Also update getNodeLabel for LoadSDNode. llvm-svn: 30861	2006-10-10 20:11:26 +00:00
Evan Cheng	fe858538c0	SDNode::dump should also print out extension type and VT. llvm-svn: 30860	2006-10-10 20:05:10 +00:00
Chris Lattner	8438429c96	Fix another bug in extload promotion. llvm-svn: 30857	2006-10-10 18:54:19 +00:00
Evan Cheng	dc6a3aab71	Fix a bug introduced by my LOAD/LOADX changes. llvm-svn: 30853	2006-10-10 07:51:21 +00:00
Evan Cheng	e71fe34d75	Reflects ISD::LOAD / ISD::LOADX / LoadSDNode changes. llvm-svn: 30844	2006-10-09 20:57:25 +00:00
Chris Lattner	5ab6d8b3fc	Eliminate more token factors by taking advantage of transitivity: if TF depends on A and B, and A depends on B, TF just needs to depend on A. With Jim's alias-analysis stuff enabled, this compiles the testcase in PR892 into: __Z4test3Val: subl $44, %esp call L__Z3foov$stub movl %edx, 28(%esp) movl %eax, 32(%esp) movl %eax, 24(%esp) movl %edx, 36(%esp) movl 52(%esp), %ecx movl %ecx, 4(%esp) movl %eax, 8(%esp) movl %edx, 12(%esp) movl 48(%esp), %eax movl %eax, (%esp) call L__Z3bar3ValS_$stub addl $44, %esp ret instead of: __Z4test3Val: subl $44, %esp call L__Z3foov$stub movl %eax, 24(%esp) movl %edx, 28(%esp) movl 24(%esp), %eax movl %eax, 32(%esp) movl 28(%esp), %eax movl %eax, 36(%esp) movl 32(%esp), %eax movl 36(%esp), %ecx movl 52(%esp), %edx movl %edx, 4(%esp) movl %eax, 8(%esp) movl %ecx, 12(%esp) movl 48(%esp), %eax movl %eax, (%esp) call L__Z3bar3ValS_$stub addl $44, %esp ret llvm-svn: 30821	2006-10-08 22:57:01 +00:00
Jim Laskey	0463e08005	Combiner alias analysis passes Multisource (release-asserts.) llvm-svn: 30818	2006-10-07 23:37:56 +00:00
Chris Lattner	f9f90bc239	Fix a bug legalizing zero-extending i64 loads into 32-bit loads. The bottom part was always forced to be sextload, even when we needed an zextload. llvm-svn: 30782	2006-10-07 00:58:36 +00:00
Chris Lattner	a389a612bb	initialize ivar llvm-svn: 30780	2006-10-06 22:52:08 +00:00
Chris Lattner	9d75324ddf	jump tables handle pic llvm-svn: 30776	2006-10-06 22:32:29 +00:00
Chris Lattner	f5839a0816	Fix a miscompilation of: long long foo(long long X) { return (long long)(signed char)(int)X; } Instead of: _foo: extsb r2, r4 srawi r3, r4, 31 mr r4, r2 blr we now produce: _foo: extsb r4, r4 srawi r3, r4, 31 blr This fixes a miscompilation in ConstantFolding.cpp. llvm-svn: 30768	2006-10-06 17:34:12 +00:00
Evan Cheng	df9ac47e5e	Make use of getStore(). llvm-svn: 30759	2006-10-05 23:01:46 +00:00
Evan Cheng	af309d29b1	Add getStore() helper function to create ISD::STORE nodes. llvm-svn: 30758	2006-10-05 22:57:11 +00:00
Jim Laskey	6549d22ef9	Alias analysis code clean ups. llvm-svn: 30753	2006-10-05 15:07:25 +00:00
Evan Cheng	f80dfa83a0	Fix some typos that can cause a flag value to have more than one use. llvm-svn: 30727	2006-10-04 22:23:53 +00:00
Jim Laskey	708d0db2d8	More extensive alias analysis. llvm-svn: 30721	2006-10-04 16:53:27 +00:00
Evan Cheng	5d9fd977d3	Combine ISD::EXTLOAD, ISD::SEXTLOAD, ISD::ZEXTLOAD into ISD::LOADX. Add an extra operand to LOADX to specify the exact value extension type. llvm-svn: 30714	2006-10-04 00:56:09 +00:00
Evan Cheng	91d76cb27f	Fix an obvious typo. llvm-svn: 30711	2006-10-03 23:08:27 +00:00
Jim Laskey	e73a22514d	Debugging kruft llvm-svn: 30688	2006-10-02 13:01:17 +00:00
Jim Laskey	1368c265da	Add ability to annotate (color) nodes in a viewGraph. llvm-svn: 30686	2006-10-02 12:26:53 +00:00
Chris Lattner	a9caf95591	refactor critical edge breaking out into the SplitCritEdgesForPHIConstants method. This is a baby step towards fixing PR925. llvm-svn: 30643	2006-09-28 06:17:10 +00:00
Andrew Lenharth	c19ef92403	Comments on JumpTableness llvm-svn: 30615	2006-09-26 20:02:30 +00:00
Jim Laskey	60832693a7	Load chain check is not needed llvm-svn: 30613	2006-09-26 17:44:58 +00:00
Jim Laskey	dde51671e5	Chain can be any operand llvm-svn: 30611	2006-09-26 09:32:41 +00:00
Jim Laskey	5f3e0af9d0	Wrong size for load llvm-svn: 30610	2006-09-26 08:14:06 +00:00
Jim Laskey	b4a864d533	Can't move a load node if it's chain is not used. llvm-svn: 30609	2006-09-26 07:37:42 +00:00
Jim Laskey	7aa0638aa9	Accidental enable of bad code llvm-svn: 30601	2006-09-25 21:11:32 +00:00
Jim Laskey	b5534e5c28	Fix chain dropping in load and drop unused stores in ret blocks. llvm-svn: 30600	2006-09-25 19:32:58 +00:00
Jim Laskey	d07be232ba	Core antialiasing for load and store. llvm-svn: 30597	2006-09-25 16:29:54 +00:00
Andrew Lenharth	783a4a9d86	Add support for other relocation bases to jump tables, as well as custom asm directives llvm-svn: 30593	2006-09-24 19:45:58 +00:00
Evan Cheng	77c0757f8b	PIC jump table entries are always 32-bit. This fixes PIC jump table support on X86-64. llvm-svn: 30590	2006-09-24 05:22:38 +00:00
Evan Cheng	449a0c7e33	Make it work for DAG combine of multi-value nodes. llvm-svn: 30573	2006-09-21 19:04:05 +00:00
Jim Laskey	35f7eebb49	core corrections llvm-svn: 30570	2006-09-21 17:35:47 +00:00
Jim Laskey	5d19d59017	Basic "in frame" alias analysis. llvm-svn: 30568	2006-09-21 16:28:59 +00:00
Chris Lattner	082db3f9aa	fold (aext (and (trunc x), cst)) -> (and x, cst). llvm-svn: 30561	2006-09-21 06:40:43 +00:00
Chris Lattner	fa9f92cf65	Check the right value type. This fixes 186.crafty on x86 llvm-svn: 30560	2006-09-21 06:17:39 +00:00
Chris Lattner	8d8a3bf9c9	Compile: int %test(ulong %tmp) { %tmp = load ulong %tmp ; <ulong> [#uses=1] %tmp.mask = shr ulong %tmp, ubyte 50 ; <ulong> [#uses=1] %tmp.mask = cast ulong %tmp.mask to ubyte %tmp2 = and ubyte %tmp.mask, 3 ; <ubyte> [#uses=1] %tmp2 = cast ubyte %tmp2 to int ; <int> [#uses=1] ret int %tmp2 } to: _test: movl 4(%esp), %eax movl 4(%eax), %eax shrl $18, %eax andl $3, %eax ret instead of: _test: movl 4(%esp), %eax movl 4(%eax), %eax shrl $18, %eax # TRUNCATE movb %al, %al andb $3, %al movzbl %al, %eax ret llvm-svn: 30558	2006-09-21 06:14:31 +00:00
Chris Lattner	a31f0a622b	Generalize (zext (truncate x)) and (sext (truncate x)) folding to work when the src/dst are not the same size. This catches things like "truncate 32-bit X to 8 bits, then zext to 16", which happens a bit on X86. llvm-svn: 30557	2006-09-21 06:00:20 +00:00
Chris Lattner	c8cd62d381	Compile: int test3(int a, int b) { return (a < 0) ? a : 0; } to: _test3: srawi r2, r3, 31 and r3, r2, r3 blr instead of: _test3: cmpwi cr0, r3, 1 li r2, 0 blt cr0, LBB2_2 ;entry LBB2_1: ;entry mr r3, r2 LBB2_2: ;entry blr This implements: PowerPC/select_lt0.ll:seli32_a_a llvm-svn: 30517	2006-09-20 06:41:35 +00:00
Chris Lattner	8746e2cd57	Fold the full generality of (any_extend (truncate x)) llvm-svn: 30514	2006-09-20 06:29:17 +00:00
Chris Lattner	8b68decb27	Two things: 1. teach SimplifySetCC that '(srl (ctlz x), 5) == 0' is really x != 0. 2. Teach visitSELECT_CC to use SimplifySetCC instead of calling it and ignoring the result. This allows us to compile: bool %test(ulong %x) { %tmp = setlt ulong %x, 4294967296 ret bool %tmp } to: _test: cntlzw r2, r3 cmplwi cr0, r3, 1 srwi r2, r2, 5 li r3, 0 beq cr0, LBB1_2 ; LBB1_1: ; mr r3, r2 LBB1_2: ; blr instead of: _test: addi r2, r3, -1 cntlzw r2, r2 cntlzw r3, r3 srwi r2, r2, 5 cmplwi cr0, r2, 0 srwi r2, r3, 5 li r3, 0 bne cr0, LBB1_2 ; LBB1_1: ; mr r3, r2 LBB1_2: ; blr This isn't wonderful, but it's an improvement. llvm-svn: 30513	2006-09-20 06:19:26 +00:00
Chris Lattner	875ea0cdbd	Expand 64-bit shifts more optimally if we know that the high bit of the shift amount is one or zero. For example, for: long long foo1(long long X, int C) { return X << (C\|32); } long long foo2(long long X, int C) { return X << (C&~32); } we get: _foo1: movb $31, %cl movl 4(%esp), %edx andb 12(%esp), %cl shll %cl, %edx xorl %eax, %eax ret _foo2: movb $223, %cl movl 4(%esp), %eax movl 8(%esp), %edx andb 12(%esp), %cl shldl %cl, %eax, %edx shll %cl, %eax ret instead of: _foo1: subl $4, %esp movl %ebx, (%esp) movb $32, %bl movl 8(%esp), %eax movl 12(%esp), %edx movb %bl, %cl orb 16(%esp), %cl shldl %cl, %eax, %edx shll %cl, %eax xorl %ecx, %ecx testb %bl, %bl cmovne %eax, %edx cmovne %ecx, %eax movl (%esp), %ebx addl $4, %esp ret _foo2: subl $4, %esp movl %ebx, (%esp) movb $223, %cl movl 8(%esp), %eax movl 12(%esp), %edx andb 16(%esp), %cl shldl %cl, %eax, %edx shll %cl, %eax xorl %ecx, %ecx xorb %bl, %bl testb %bl, %bl cmovne %eax, %edx cmovne %ecx, %eax movl (%esp), %ebx addl $4, %esp ret llvm-svn: 30506	2006-09-20 03:38:48 +00:00
Chris Lattner	5a42ebcff3	Fold extract_element(cst) to cst llvm-svn: 30478	2006-09-19 05:02:39 +00:00
Chris Lattner	4c059f4962	Minor speedup for legalize by avoiding some malloc traffic llvm-svn: 30477	2006-09-19 04:51:23 +00:00
Evan Cheng	1fc7c363e6	Fix a typo. llvm-svn: 30474	2006-09-18 23:28:33 +00:00
Evan Cheng	4bfaf0bd2c	Allow i32 UDIV, SDIV, UREM, SREM to be expanded into libcalls. llvm-svn: 30470	2006-09-18 21:49:04 +00:00
Andrew Lenharth	c50458fb90	absolute addresses must match pointer size llvm-svn: 30461	2006-09-18 17:59:35 +00:00
Chris Lattner	e50f5d1fb1	Oh yeah, this is needed too llvm-svn: 30407	2006-09-16 05:08:34 +00:00
Chris Lattner	1b63391fdf	simplify control flow, no functionality change llvm-svn: 30403	2006-09-16 00:21:44 +00:00
Chris Lattner	fbadbda6ba	Allow custom expand of mul llvm-svn: 30402	2006-09-16 00:09:24 +00:00
Chris Lattner	46d710e6ea	Fold (X & C1) \| (Y & C2) -> (X\|Y) & C3 when possible. This implements CodeGen/X86/and-or-fold.ll llvm-svn: 30379	2006-09-14 21:11:37 +00:00
Chris Lattner	97614c86ce	Split rotate matching code out to its own function. Make it stronger, by matching things like ((x >> c1) & c2) \| ((x << c3) & c4) to (rot x, c5) & c6 llvm-svn: 30376	2006-09-14 20:50:57 +00:00
Chris Lattner	84cc1f7cb8	If LSR went through a lot of trouble to put constants (e.g. the addr of a global in a specific BB, don't undo this!). This allows us to compile CodeGen/X86/loop-hoist.ll into: _foo: xorl %eax, %eax * movl L_Arr$non_lazy_ptr, %ecx movl 4(%esp), %edx LBB1_1: #cond_true movl %eax, (%ecx,%eax,4) incl %eax cmpl %edx, %eax jne LBB1_1 #cond_true LBB1_2: #return ret instead of: _foo: xorl %eax, %eax movl 4(%esp), %ecx LBB1_1: #cond_true * movl L_Arr$non_lazy_ptr, %edx movl %eax, (%edx,%eax,4) incl %eax cmpl %ecx, %eax jne LBB1_1 #cond_true LBB1_2: #return ret This was noticed in 464.h264ref. This doesn't usually affect PPC, but strikes X86 all the time. llvm-svn: 30290	2006-09-13 06:02:42 +00:00
Chris Lattner	72b503bcad	Compile X << 1 (where X is a long-long) to: addl %ecx, %ecx adcl %eax, %eax instead of: movl %ecx, %edx addl %edx, %edx shrl $31, %ecx addl %eax, %eax orl %ecx, %eax and to: addc r5, r5, r5 adde r4, r4, r4 instead of: slwi r2,r9,1 srwi r0,r11,31 slwi r3,r11,1 or r2,r0,r2 on PPC. llvm-svn: 30284	2006-09-13 03:50:39 +00:00
Evan Cheng	45fe3bc72c	Added support for machine specific constantpool values. These are useful for representing expressions that can only be resolved at link time, etc. llvm-svn: 30278	2006-09-12 21:00:35 +00:00
Chris Lattner	2e0dfb0b16	This code was trying too hard. By eliminating redundant edges in the CFG due to switch cases going to the same place, it make #pred != #phi entries, breaking live interval analysis. This fixes 458.sjeng on x86 with llc. llvm-svn: 30236	2006-09-10 06:36:57 +00:00
Chris Lattner	f0359b343a	Implement the fpowi now by lowering to a libcall llvm-svn: 30225	2006-09-09 06:03:30 +00:00
Chris Lattner	e4bbb6c341	Allow targets to custom lower expanded BIT_CONVERT's llvm-svn: 30217	2006-09-09 00:20:27 +00:00
Chris Lattner	707339a57b	Fix CodeGen/Generic/2006-09-06-SwitchLowering.ll, a bug where SDIsel inserted too many phi operands when lowering a switch to branches in some cases. llvm-svn: 30142	2006-09-07 01:59:34 +00:00
Chris Lattner	0dce3311c4	Change the default to 0, which means 'default'. llvm-svn: 30114	2006-09-05 17:39:15 +00:00
Chris Lattner	af23f9b5f6	Completely eliminate def&use operands. Now a register operand is EITHER a def operand or a use operand. llvm-svn: 30109	2006-09-05 02:31:13 +00:00
Duraid Madina	373be1d1a2	forgot this llvm-svn: 30097	2006-09-04 07:44:11 +00:00
Evan Cheng	e93762d36e	Allow legalizer to expand ISD::MUL using only MULHS in the rare case that is possible and the target only supports MULHS. llvm-svn: 30022	2006-09-01 18:17:58 +00:00
Evan Cheng	31305c45da	DAG combiner fix for rotates. Previously the outer-most condition checks for ROTL availability. This prevents it from forming ROTR for targets that has ROTR only. llvm-svn: 29997	2006-08-31 07:41:12 +00:00
Evan Cheng	e5570a4c3f	Move isCommutativeBinOp from SelectionDAG.cpp and DAGCombiner.cpp out. Make it a static method of SelectionDAG. llvm-svn: 29951	2006-08-29 06:42:35 +00:00
Chris Lattner	3d27be1333	s\|llvm/Support/Visibility.h\|llvm/Support/Compiler.h\| llvm-svn: 29911	2006-08-27 12:54:02 +00:00
Evan Cheng	849f4bf8dd	Eliminate SelectNodeTo() and getTargetNode() variants which take more than 3 SDOperand operands. They are replaced by versions which take an array of SDOperand and the number of operands. llvm-svn: 29905	2006-08-27 08:08:54 +00:00
Evan Cheng	34b70eea5c	SelectNodeTo now returns a SDNode*. llvm-svn: 29901	2006-08-26 08:00:10 +00:00
Chris Lattner	451b099113	Fix PR861 llvm-svn: 29796	2006-08-21 20:24:53 +00:00
Chris Lattner	d86418ab20	switch the SUnit pred/succ sets from being std::sets to being smallvectors. This reduces selectiondag time on kc++ from 5.43s to 4.98s (9%). More significantly, this speeds up the default ppc scheduler from ~1571ms to 1063ms, a 33% speedup. llvm-svn: 29743	2006-08-17 00:09:56 +00:00
Chris Lattner	65879caf07	minor changes. llvm-svn: 29740	2006-08-16 22:57:46 +00:00
Chris Lattner	a4f3625c23	Use the appropriate typedef llvm-svn: 29730	2006-08-16 20:59:32 +00:00
Chris Lattner	a5a3eafbd0	Start using SDVTList more consistently llvm-svn: 29711	2006-08-15 19:11:05 +00:00
Chris Lattner	f98411a220	add a new SDVTList type and new SelectionDAG::getVTList methods to streamline the creation of canonical VTLists. llvm-svn: 29709	2006-08-15 17:46:01 +00:00
Chris Lattner	bd8877744b	eliminate use of getNode that takes vector of valuetypes. llvm-svn: 29687	2006-08-14 23:53:35 +00:00
Chris Lattner	3bf4be453f	Add a new getNode() method that takes a pointer to an already-intern'd list of value-type nodes. This avoids having to do mallocs for std::vectors of valuetypes when a node returns more than one type. llvm-svn: 29685	2006-08-14 23:31:51 +00:00
Chris Lattner	e93a39f2d7	remove SelectionDAG::InsertISelMapEntry, it is dead llvm-svn: 29677	2006-08-14 22:24:39 +00:00
Chris Lattner	63268f0672	Add code to resize the CSEMap hash table. This doesn't speedup codegen of kimwitu, but seems like a good idea from a "avoid performance cliffs" standpoint :) llvm-svn: 29675	2006-08-14 22:19:25 +00:00
Chris Lattner	8e37283d8b	Add the actual constant to the hash for ConstantPool nodes. Thanks to Rafael Espindola for pointing this out. llvm-svn: 29669	2006-08-14 20:12:44 +00:00
Chris Lattner	0a60294fa0	Switch to using SuperFastHash instead of adding all elements together. This doesn't significantly improve performance but it helps a small amount. llvm-svn: 29642	2006-08-12 01:07:10 +00:00
Chris Lattner	04aa034f38	Switch NodeID to track 32-bit chunks instead of 8-bit chunks, for a 2.5% speedup in isel time. llvm-svn: 29640	2006-08-11 23:55:53 +00:00
Chris Lattner	0c2e5412bb	Remove 8 more std::map's. llvm-svn: 29631	2006-08-11 21:55:30 +00:00
Chris Lattner	3f16b201e2	Move the BBNodes, GlobalValues, TargetGlobalValues, Constants, TargetConstants, RegNodes, and ValueNodes maps into the CSEMap. llvm-svn: 29626	2006-08-11 21:01:22 +00:00
Chris Lattner	fcb16470ec	eliminate the NullaryOps map, use CSEMap instead. llvm-svn: 29621	2006-08-11 18:38:11 +00:00
Chris Lattner	6f22ebd8be	change internal impl of dag combiner so that calls to CombineTo never have to make a temporary vector. llvm-svn: 29618	2006-08-11 17:56:38 +00:00
Chris Lattner	a2f4086828	Change one ReplaceAllUsesWith method to take an array of operands to replace instead of a vector of operands. llvm-svn: 29616	2006-08-11 17:46:28 +00:00
Chris Lattner	c24a1d3093	Start eliminating temporary vectors used to create DAG nodes. Instead, pass in the start of an array and a count of operands where applicable. In many cases, the number of operands is known, so this static array can be allocated on the stack, avoiding the heap. In many other cases, a SmallVector can be used, which has the same benefit in the common cases. I updated a lot of code calling getNode that takes a vector, but ran out of time. The rest of the code should be updated, and these methods should be removed. We should also do the same thing to eliminate the methods that take a vector of MVT::ValueTypes. It would be extra nice to convert the dagiselemitter to avoid creating vectors for operands when calling getTargetNode. llvm-svn: 29566	2006-08-08 02:23:42 +00:00
Chris Lattner	97af9d5d3a	Eliminate some malloc traffic by allocating vectors on the stack. Change some method that took std::vector<SDOperand> to take a pointer to a first operand and #operands. This speeds up isel on kc++ by about 3%. llvm-svn: 29561	2006-08-08 01:09:31 +00:00
Chris Lattner	1ee75ce65d	Revamp the "CSEMap" datastructure used in the SelectionDAG class. This eliminates a bunch of std::map's in the SelectionDAG, replacing them with a home-grown hashtable. This is still a work in progress: not all the maps have been moved over and the hashtable never resizes. That said, this still speeds up llc 20% on kimwitu++ with -fast -regalloc=local using a release build. llvm-svn: 29550	2006-08-07 23:03:03 +00:00
Evan Cheng	445b91a041	Clear TopOrder before assigning topological order. Some clean ups. llvm-svn: 29546	2006-08-07 22:13:29 +00:00
Evan Cheng	1640ae5a84	Reverse the FlaggedNodes after scanning up for flagged preds or else the order would be reversed. llvm-svn: 29545	2006-08-07 22:12:12 +00:00
Chris Lattner	8927c875bb	Make SelectionDAG::RemoveDeadNodes iterative instead of recursive, which also make it simpler. llvm-svn: 29524	2006-08-04 17:45:20 +00:00
Jim Laskey	a5b707e3ad	Copy the liveins for the first block. PR859 llvm-svn: 29511	2006-08-03 20:51:06 +00:00
Chris Lattner	524c1a21f2	Work around a GCC 3.3.5 bug noticed by a user. llvm-svn: 29490	2006-08-03 00:18:59 +00:00
Evan Cheng	bba1ebda32	- Change AssignTopologicalOrder to return vector of SDNode* by reference. - Tweak implementation to avoid using std::map. llvm-svn: 29479	2006-08-02 22:00:34 +00:00
Jim Laskey	29e635d3c9	Final polish on machine pass registries. llvm-svn: 29471	2006-08-02 12:30:23 +00:00
Jim Laskey	17c67efe8a	Now that the ISel is available, it's possible to create a default instruction scheduler creator. llvm-svn: 29452	2006-08-01 19:14:14 +00:00
Jim Laskey	03593f72db	1. Change use of "Cache" to "Default". 2. Added argument to instruction scheduler creators so the creators can do special things. 3. Repaired target hazard code. 4. Misc. More to follow. llvm-svn: 29450	2006-08-01 18:29:48 +00:00
Jim Laskey	95eda5b1f3	Introducing plugable register allocators and instruction schedulers. llvm-svn: 29434	2006-08-01 14:21:23 +00:00
Evan Cheng	9631a60020	Added AssignTopologicalOrder() to assign each node an unique id based on their topological order. llvm-svn: 29431	2006-08-01 08:20:41 +00:00
Evan Cheng	6ae6ac1216	PIC jump table entries are always 32-bit even in 64-bit mode. llvm-svn: 29422	2006-08-01 01:03:13 +00:00
Evan Cheng	b572401bea	Remove InFlightSet hack. No longer needed. llvm-svn: 29373	2006-07-28 00:47:19 +00:00
Nate Begeman	efc312a5c7	Code cleanups, per review llvm-svn: 29347	2006-07-27 16:46:58 +00:00
Evan Cheng	acb606ff33	AssignNodeIds should return unsigned. llvm-svn: 29343	2006-07-27 07:36:47 +00:00
Evan Cheng	29eefc164c	AssignNodeIds assign each node in the DAG an unique id. llvm-svn: 29337	2006-07-27 06:39:06 +00:00
Chris Lattner	85ea83e821	Add some advice llvm-svn: 29324	2006-07-27 04:24:14 +00:00
Nate Begeman	787565024a	Support jump tables when in PIC relocation model llvm-svn: 29318	2006-07-27 01:13:04 +00:00
Chris Lattner	4488f0c303	Fix a case where LegalizeAllNodesNotLeadingTo could take exponential time. This manifested itself as really long time to compile Regression/CodeGen/Generic/2003-05-28-ManyArgs.ll on ppc. This is PR847. llvm-svn: 29313	2006-07-26 23:55:56 +00:00
Reid Spencer	421475cd3b	For PR780: 1. Move IncludeFile.h to System library 2. Move IncludeFile.cpp to System library 3. #1 and #2 required to prevent cyclic library dependencies for libSystem 4. Convert all existing uses of Support/IncludeFile.h to System/IncludeFile.h 5. Add IncludeFile support to various lib/System classes. 6. Add new lib/System classes to LinkAllVMCore.h All this in an attempt to pull in lib/System to what's required for VMCore llvm-svn: 29287	2006-07-26 16:18:00 +00:00
Reid Spencer	658b9476f0	Initialize some variables the compiler warns about. llvm-svn: 29277	2006-07-25 20:44:41 +00:00
Jim Laskey	4e153f1b91	Use an enumeration to eliminate data relocations. llvm-svn: 29249	2006-07-21 20:57:35 +00:00
Evan Cheng	7c970b98d0	If a shuffle is a splat, check if the argument is a build_vector with all elements being the same. If so, return the argument. llvm-svn: 29242	2006-07-21 08:25:53 +00:00
Chris Lattner	55782c6c41	Build more debugger/selectiondag libraries as archives instead of .o files. This works around bugs in some versions of the cygwin linker. Patch contributed by Anton Korobeynikov. llvm-svn: 29239	2006-07-21 00:10:47 +00:00
Evan Cheng	8472e0c4af	If a shuffle is unary, i.e. one of the vector argument is not needed, turn the operand into a undef and adjust mask accordingly. llvm-svn: 29232	2006-07-20 22:44:41 +00:00
Chris Lattner	b030532910	Mems can be in the output list also. This is the second half of a fix for PR833 llvm-svn: 29224	2006-07-20 19:02:21 +00:00
Andrew Lenharth	ec104a2b41	80 cols llvm-svn: 29221	2006-07-20 17:43:27 +00:00
Andrew Lenharth	c496b418b5	Reduce number of exported symbols llvm-svn: 29220	2006-07-20 17:28:38 +00:00
Chris Lattner	c0973edc69	Add an out-of-line virtual method for the sdnode class to give it a home. llvm-svn: 29192	2006-07-19 00:00:37 +00:00
Jim Laskey	f7300b2706	It was pointed out that DEBUG() is only available with -debug. llvm-svn: 29106	2006-07-11 18:25:13 +00:00
Jim Laskey	c3d341ea98	Ensure that dump calls that are associated with asserts are removed from non-debug build. llvm-svn: 29105	2006-07-11 17:58:07 +00:00
Chris Lattner	1b8ea1f5ba	Fix CodeGen/Alpha/2006-07-03-ASMFormalLowering.ll and PR818. llvm-svn: 29099	2006-07-11 01:40:09 +00:00
Evan Cheng	d19938834b	Ugly hack! Add helper functions InsertInFlightSetEntry and RemoveInFlightSetEntry. They are used in place of direct set operators to reduce instruction selection function stack size. llvm-svn: 28987	2006-06-29 23:57:05 +00:00
Chris Lattner	996795b0dd	Use hidden visibility to make symbols in an anonymous namespace get dropped. This shrinks libllvmgcc.dylib another 67K llvm-svn: 28975	2006-06-28 23:17:24 +00:00
Chris Lattner	e097e6f7c7	Shave another 27K off libllvmgcc.dylib with visibility hidden llvm-svn: 28973	2006-06-28 22:17:39 +00:00
Chris Lattner	54a34cd20b	Mark these two classes as hidden, shrinking libllbmgcc.dylib by 25K llvm-svn: 28970	2006-06-28 21:58:30 +00:00
Chris Lattner	710b3d5ea1	Fix CodeGen/Generic/2006-06-28-SimplifySetCCCrash.ll llvm-svn: 28965	2006-06-28 18:29:47 +00:00
Reid Spencer	ee7eaa25cf	For PR801: Refactor the Graph writing code to use a common implementation which is now in lib/Support/GraphWriter.cpp. This completes the PR. Patch by Anton Korobeynikov. Thanks, Anton! llvm-svn: 28925	2006-06-27 16:49:46 +00:00
Evan Cheng	ef9e07d3f0	Consistency. EXTRACT_ELEMENT index operand should have ptr type. llvm-svn: 28795	2006-06-15 08:11:54 +00:00
Evan Cheng	55772ccfd6	Instructions with variable operands (variable_ops) can have a number required operands. e.g. def CALL32r : I<0xFF, MRM2r, (ops GR32:$dst, variable_ops), "call {*}$dst", [(X86call GR32:$dst)]>; TableGen should emit operand informations for the "required" operands. Added a target instruction info flag M_VARIABLE_OPS to indicate the target instruction may have more operands in addition to the minimum required operands. llvm-svn: 28791	2006-06-15 07:22:16 +00:00
Chris Lattner	32d92e004d	Make sure to update the CFG correctly if a switch only has a default dest. This fixes CodeGen/Generic/2006-06-12-LowerSwitchCrash.ll llvm-svn: 28755	2006-06-12 18:25:29 +00:00
Andrew Lenharth	0e57b2cb92	Start on my todo list llvm-svn: 28752	2006-06-12 16:07:18 +00:00
Chris Lattner	c03a9259c0	Fix X86/inline-asm.ll:test2, a case where an input value was implicitly truncated. llvm-svn: 28733	2006-06-08 18:27:11 +00:00
Chris Lattner	705948d742	Fix Regression/CodeGen/X86/inline-asm.ll, a case where inline asm causes implement extension of a register. llvm-svn: 28731	2006-06-08 18:22:48 +00:00
Reid Spencer	614cb2ff82	For PR798: Provide GraphViz support for MingW32. Patch provided by Anton Korobeynikov llvm-svn: 28688	2006-06-05 16:26:06 +00:00
Reid Spencer	a647c7ff42	Use archive libraries instead of object files for VMCore, BCReader, BCWriter, and bzip2 libraries. Adjust the various makefiles to accommodate these changes. This was done to speed up link times. llvm-svn: 28610	2006-06-01 01:30:27 +00:00
Evan Cheng	0c0996a97b	commuteInstruction() does not always create a new MI! llvm-svn: 28592	2006-05-31 18:03:39 +00:00
Evan Cheng	9d91caa053	Eliminate a memory leak. llvm-svn: 28585	2006-05-31 07:13:03 +00:00
Evan Cheng	64d2846017	visitVBinOp: Can't fold divide by zero! llvm-svn: 28584	2006-05-31 06:08:35 +00:00
Evan Cheng	d12c97d23a	Make sure the register pressure reduction schedulers work for non-uniform latency targets, e.g. PPC32. llvm-svn: 28561	2006-05-30 18:05:39 +00:00
Evan Cheng	61e9f0d680	When a priority_queue is empty, the behavior of top() operator is non-deterministic. Returns NULL when it's empty! llvm-svn: 28560	2006-05-30 18:04:34 +00:00
Chris Lattner	8f872d2091	Fix a nasty dag combiner bug that caused nondeterminstic crashes (MY FAVORITE!): SimplifySelectOps would eliminate a Select, delete it, then return true. The clients would see that it did something and return null. The top level would see a null return, and decide that nothing happened, proceeding to process the node in other ways: boom. The fix is simple: clients of SimplifySelectOps should return the select node itself. In order to catch really obnoxious boogs like this in the future, add an assert that nodes are not deleted. We do this by checking for a sentry node type that the SDNode dtor sets when a node is destroyed. llvm-svn: 28514	2006-05-27 00:43:02 +00:00
Evan Cheng	21dee4e0b2	Make CALL node consistent with RET node. Signness of value has type MVT::i32 instead of MVT::i1. Either is fine except MVT::i32 is probably a legal type for most (if not all) platforms while MVT::i1 is not. llvm-svn: 28511	2006-05-26 23:13:20 +00:00
Evan Cheng	a2e9953c54	Change RET node to include signness information of the return values. e.g. RET chain, value1, sign1, value2, sign2 llvm-svn: 28509	2006-05-26 23:09:09 +00:00
Evan Cheng	009f5f55f7	Turn on -sched-commute-nodes by default. llvm-svn: 28465	2006-05-25 08:37:31 +00:00
Evan Cheng	4582771f3f	CALL node change: now including signness of every argument. llvm-svn: 28461	2006-05-25 00:55:32 +00:00
Chris Lattner	aa2372562e	Patches to make the LLVM sources more -pedantic clean. Patch provided by Anton Korobeynikov! This is a step towards closing PR786. llvm-svn: 28447	2006-05-24 17:04:05 +00:00
Evan Cheng	ac4f66ff24	-enable-unsafe-fp-math implies -enable-finite-only-fp-math llvm-svn: 28437	2006-05-23 18:18:46 +00:00
Vladimir Prus	df1d439849	Fix missing include llvm-svn: 28435	2006-05-23 13:43:15 +00:00
Evan Cheng	1c5b7d12df	Incorrect SETCC CondCode used for FP comparisons. llvm-svn: 28433	2006-05-23 06:40:47 +00:00
Evan Cheng	d8e2f6ebc1	lib/Target/Target.td llvm-svn: 28386	2006-05-18 20:42:07 +00:00
Chris Lattner	7949c2e8b2	Fix the result of the call to use a correct vbitconvert. There is no need to use getPackedTypeBreakdown at all here. llvm-svn: 28365	2006-05-17 20:49:36 +00:00
Chris Lattner	938155ca57	Correct a previous patch which broke CodeGen/PowerPC/vec_call.ll llvm-svn: 28364	2006-05-17 20:43:21 +00:00
Evan Cheng	751cd7653d	Fixed a LowerCallTo and LowerArguments bug. They were introducing illegal VBIT_VECTOR nodes. There were some confusion about the semantics of getPackedTypeBreakdown(). e.g. for <4 x f32> it returns 1 and v4f32, not 4, and f32. llvm-svn: 28352	2006-05-17 18:16:39 +00:00
Chris Lattner	62f1b83c0e	When we legalize target nodes, do not use getNode to create a new node, use UpdateNodeOperands to just update the operands! This is important because getNode will allocate a new node if the node returns a flag and this breaks assumptions in the legalizer that you can legalize some things multiple times and get exactly the same results. This latent bug was exposed by my ppc patch last night, and this fixes gsm/toast. llvm-svn: 28348	2006-05-17 18:00:08 +00:00
Chris Lattner	a1cec0106a	Add an assertion, avoid some unneeded work for each call. No functionality change. llvm-svn: 28347	2006-05-17 17:55:45 +00:00
Chris Lattner	b77ba73a29	Add support for calls that pass and return legal vectors. llvm-svn: 28340	2006-05-16 23:39:44 +00:00
Chris Lattner	aaa23d953f	Add a new ISD::CALL node, make the default impl of TargetLowering::LowerCallTo produce it. llvm-svn: 28338	2006-05-16 22:53:20 +00:00
Andrew Lenharth	1dc9ec5874	Move this code to a common place llvm-svn: 28329	2006-05-16 17:42:15 +00:00
Chris Lattner	3d82699605	Add a chain to FORMAL_ARGUMENTS. This is a minimal port of the X86 backend, it doesn't currently use/maintain the chain properly. Also, make the X86ISelLowering.cpp file 80-col clean. llvm-svn: 28320	2006-05-16 06:45:34 +00:00
Chris Lattner	957cb6733a	Move function-live-in-handling code from the sdisel code to the scheduler. This code should be emitted after legalize, so it can't be in sdisel. Note that the EmitFunctionEntryCode hook should be updated to operate on the DAG. The X86 backend is the only one currently using this hook. llvm-svn: 28315	2006-05-16 06:10:58 +00:00
Chris Lattner	5f0edfb849	Legalize FORMAL_ARGUMENTS nodes correctly, we don't want to legalize them once for each argument. llvm-svn: 28313	2006-05-16 05:49:56 +00:00
Evan Cheng	99f2f79e2f	Fixing 2006-05-01-SchedCausingSpills.ll; some clean up llvm-svn: 28279	2006-05-13 08:22:24 +00:00
Evan Cheng	d1915cfa6f	Revert an un-intended change llvm-svn: 28278	2006-05-13 05:53:47 +00:00
Chris Lattner	69a0ce6261	Merge identical code. llvm-svn: 28274	2006-05-13 02:11:14 +00:00
Chris Lattner	53cdb2f2b0	Remove dead vars llvm-svn: 28255	2006-05-12 18:06:45 +00:00
Chris Lattner	da076e41ab	remove dead vars llvm-svn: 28254	2006-05-12 18:04:28 +00:00
Chris Lattner	afe72481f6	Comment out dead variables llvm-svn: 28252	2006-05-12 17:57:54 +00:00
Chris Lattner	8c02c3f41a	Compile: %tmp152 = setgt uint %tmp144, %tmp149 ; <bool> [#uses=1] %tmp159 = setlt uint %tmp144, %tmp149 ; <bool> [#uses=1] %bothcond2 = or bool %tmp152, %tmp159 ; <bool> [#uses=1] To setne, not setune, which causes an assertion fault. llvm-svn: 28244	2006-05-12 17:03:46 +00:00
Owen Anderson	8c2c1e90c4	Refactor a bunch of includes so that TargetMachine.h doesn't have to include TargetData.h. This should make recompiles a bit faster with my current TargetData tinkering. llvm-svn: 28238	2006-05-12 06:33:49 +00:00
Evan Cheng	095c9d9b7f	Duh. That could take a long time. llvm-svn: 28235	2006-05-12 06:05:18 +00:00
Chris Lattner	66adee93aa	Two simplifications for token factor nodes: simplify tf(x,x) -> x. simplify tf(x,y,y,z) -> tf(x,y,z). llvm-svn: 28233	2006-05-12 05:01:37 +00:00
Evan Cheng	afed73eebe	Add capability to scheduler to commute nodes for profit. If a two-address code whose first operand has uses below, it should be commuted when possible. llvm-svn: 28230	2006-05-12 01:58:24 +00:00
Evan Cheng	d38c22bdd3	Refactor scheduler code. Move register-reduction list scheduler to a separate file. Added an initial implementation of top-down register pressure reduction list scheduler. llvm-svn: 28226	2006-05-11 23:55:42 +00:00
Evan Cheng	9665ba053f	Templatify RegReductionPriorityQueue llvm-svn: 28212	2006-05-10 06:16:44 +00:00
Nate Begeman	1a225d23ae	Fix PR773 llvm-svn: 28207	2006-05-09 18:20:51 +00:00
Evan Cheng	7d693898ee	Add pseudo dependency to force a def&use operand to be scheduled last (unless the distance between the def and another use is much longer). This is under option control for now "-sched-lower-defnuse". llvm-svn: 28201	2006-05-09 07:13:34 +00:00
Evan Cheng	2c74848af1	Debugging info llvm-svn: 28200	2006-05-09 06:55:15 +00:00
Chris Lattner	446e1ef26a	Make the case I just checked in stronger. Now we compile this: short test2(short X, short x) { int Y = (short)(X+x); return Y >> 1; } to: _test2: add r2, r3, r4 extsh r2, r2 srawi r3, r2, 1 blr instead of: _test2: add r2, r3, r4 extsh r2, r2 srwi r2, r2, 1 extsh r3, r2 blr llvm-svn: 28175	2006-05-08 21:18:59 +00:00
Chris Lattner	29062da0ac	Implement and_sext.ll:test3, generating: _test4: srawi r3, r3, 16 blr instead of: _test4: srwi r2, r3, 16 extsh r3, r2 blr for: short test4(unsigned X) { return (X >> 16); } llvm-svn: 28174	2006-05-08 20:59:41 +00:00
Chris Lattner	2935d8190c	Compile this: short test4(unsigned X) { return (X >> 16); } to: _test4: movl 4(%esp), %eax sarl $16, %eax ret instead of: _test4: movl $-65536, %eax andl 4(%esp), %eax sarl $16, %eax ret llvm-svn: 28171	2006-05-08 20:51:54 +00:00
Chris Lattner	78da6792e7	Fold shifts with undef operands. llvm-svn: 28167	2006-05-08 17:29:49 +00:00
Nate Begeman	d7a19102d1	Make emission of jump tables a bit less conservative; they are now required to be only 31.25% dense, rather than 75% dense. llvm-svn: 28165	2006-05-08 16:51:36 +00:00
Nate Begeman	e5ce5bb6da	Fix PR772 llvm-svn: 28161	2006-05-08 01:35:01 +00:00
Chris Lattner	7e7bcf3a54	Simplify some code, add a couple minor missed folds llvm-svn: 28152	2006-05-06 23:06:26 +00:00
Chris Lattner	751817c54f	constant fold sign_extend_inreg llvm-svn: 28151	2006-05-06 23:05:41 +00:00
Chris Lattner	2a4d7b845b	remove cases handled elsewhere llvm-svn: 28150	2006-05-06 22:43:44 +00:00
Chris Lattner	1ecb2a2dac	Use the new TargetLowering::ComputeNumSignBits method to eliminate sign_extend_inreg operations. Though ComputeNumSignBits is still rudimentary, this is enough to compile this: short test(short X, short x) { int Y = X+x; return (Y >> 1); } short test2(short X, short x) { int Y = (short)(X+x); return Y >> 1; } into: _test: add r2, r3, r4 srawi r3, r2, 1 blr _test2: add r2, r3, r4 extsh r2, r2 srawi r3, r2, 1 blr instead of: _test: add r2, r3, r4 srawi r2, r2, 1 extsh r3, r2 blr _test2: add r2, r3, r4 extsh r2, r2 srawi r2, r2, 1 extsh r3, r2 blr llvm-svn: 28146	2006-05-06 09:30:03 +00:00
Chris Lattner	21cd99024a	When inserting casts, be careful of where we put them. We cannot insert a cast immediately before a PHI node. This fixes Regression/CodeGen/Generic/2006-05-06-GEP-Cast-Sink-Crash.ll llvm-svn: 28143	2006-05-06 09:10:37 +00:00
Chris Lattner	907e392dba	Fold trunc(any_ext). This gives stuff like: 27,28c27 < movzwl %di, %edi < movl %edi, %ebx --- > movw %di, %bx llvm-svn: 28137	2006-05-05 22:56:26 +00:00
Chris Lattner	57f8c5a387	Shrink shifts when possible. llvm-svn: 28136	2006-05-05 22:53:17 +00:00
Chris Lattner	3d26577396	Fold (fpext (load x)) -> (extload x) llvm-svn: 28130	2006-05-05 21:34:35 +00:00
Chris Lattner	3e3f2c63c3	More aggressively sink GEP offsets into loops. For example, before we generated: movl 8(%esp), %eax movl %eax, %edx addl $4316, %edx cmpb $1, %cl ja LBB1_2 #cond_false LBB1_1: #cond_true movl L_QuantizationTables720$non_lazy_ptr, %ecx movl %ecx, (%edx) movl L_QNOtoQuantTableShift720$non_lazy_ptr, %edx movl %edx, 4460(%eax) ret ... Now we generate: movl 8(%esp), %eax cmpb $1, %cl ja LBB1_2 #cond_false LBB1_1: #cond_true movl L_QuantizationTables720$non_lazy_ptr, %ecx movl %ecx, 4316(%eax) movl L_QNOtoQuantTableShift720$non_lazy_ptr, %ecx movl %ecx, 4460(%eax) ret ... which uses one fewer register. llvm-svn: 28129	2006-05-05 21:17:49 +00:00
Chris Lattner	25a5283a86	Fold some common code. llvm-svn: 28124	2006-05-05 06:32:04 +00:00
Chris Lattner	002ee91457	Implement: // fold (and (sext x), (sext y)) -> (sext (and x, y)) // fold (or (sext x), (sext y)) -> (sext (or x, y)) // fold (xor (sext x), (sext y)) -> (sext (xor x, y)) // fold (and (aext x), (aext y)) -> (aext (and x, y)) // fold (or (aext x), (aext y)) -> (aext (or x, y)) // fold (xor (aext x), (aext y)) -> (aext (xor x, y)) llvm-svn: 28123	2006-05-05 06:31:05 +00:00
Chris Lattner	5ac4293606	Pull and through and/or/xor. This compiles some bitfield code to: mov EAX, DWORD PTR [ESP + 4] mov ECX, DWORD PTR [EAX] mov EDX, ECX add EDX, EDX or EDX, ECX and EDX, -2147483648 and ECX, 2147483647 or EDX, ECX mov DWORD PTR [EAX], EDX ret instead of: sub ESP, 4 mov DWORD PTR [ESP], ESI mov EAX, DWORD PTR [ESP + 8] mov ECX, DWORD PTR [EAX] mov EDX, ECX add EDX, EDX mov ESI, ECX and ESI, -2147483648 and EDX, -2147483648 or EDX, ESI and ECX, 2147483647 or EDX, ECX mov DWORD PTR [EAX], EDX mov ESI, DWORD PTR [ESP] add ESP, 4 ret llvm-svn: 28122	2006-05-05 06:10:43 +00:00
Chris Lattner	812646aa0c	Implement a variety of simplifications for ANY_EXTEND. llvm-svn: 28121	2006-05-05 05:58:59 +00:00
Chris Lattner	8d6fc20181	Factor some code, add these transformations: // fold (and (trunc x), (trunc y)) -> (trunc (and x, y)) // fold (or (trunc x), (trunc y)) -> (trunc (or x, y)) // fold (xor (trunc x), (trunc y)) -> (trunc (xor x, y)) llvm-svn: 28120	2006-05-05 05:51:50 +00:00
Jeff Cohen	78a7f0e05e	Fix VC++ compilation error. llvm-svn: 28117	2006-05-05 01:47:05 +00:00
Chris Lattner	7a3ecf7993	Sink noop copies into the basic block that uses them. This reduces the number of cross-block live ranges, and allows the bb-at-a-time selector to always coallesce these away, at isel time. This reduces the load on the coallescer and register allocator. For example on a codec on X86, we went from: 1643 asm-printer - Number of machine instrs printed 419 liveintervals - Number of loads/stores folded into instructions 1144 liveintervals - Number of identity moves eliminated after coalescing 1022 liveintervals - Number of interval joins performed 282 liveintervals - Number of intervals after coalescing 1304 liveintervals - Number of original intervals 86 regalloc - Number of times we had to backtrack 1.90232 regalloc - Ratio of intervals processed over total intervals 40 spiller - Number of values reused 182 spiller - Number of loads added 121 spiller - Number of stores added 132 spiller - Number of register spills 6 twoaddressinstruction - Number of instructions commuted to coalesce 360 twoaddressinstruction - Number of two-address instructions to: 1636 asm-printer - Number of machine instrs printed 403 liveintervals - Number of loads/stores folded into instructions 1155 liveintervals - Number of identity moves eliminated after coalescing 1033 liveintervals - Number of interval joins performed 279 liveintervals - Number of intervals after coalescing 1312 liveintervals - Number of original intervals 76 regalloc - Number of times we had to backtrack 1.88998 regalloc - Ratio of intervals processed over total intervals 1 spiller - Number of copies elided 41 spiller - Number of values reused 191 spiller - Number of loads added 114 spiller - Number of stores added 128 spiller - Number of register spills 4 twoaddressinstruction - Number of instructions commuted to coalesce 356 twoaddressinstruction - Number of two-address instructions On this testcase, this change provides a modest reduction in spill code, regalloc iterations, and total instructions emitted. It increases the number of register coallesces. llvm-svn: 28115	2006-05-05 01:04:50 +00:00
Evan Cheng	9add880566	Initial support for register pressure aware scheduling. The register reduction scheduler can go into a "vertical mode" (i.e. traversing up the two-address chain, etc.) when the register pressure is low. This does seem to reduce the number of spills in the cases I've looked at. But with x86, it's no guarantee the performance of the code improves. It can be turned on with -sched-vertically option. llvm-svn: 28108	2006-05-04 19:16:39 +00:00
Chris Lattner	469647bf38	Remove and simplify some more machineinstr/machineoperand stuff. llvm-svn: 28105	2006-05-04 18:16:01 +00:00
Chris Lattner	10b71c0d08	Rename MO_VirtualRegister -> MO_Register. Clean up immediate handling. llvm-svn: 28104	2006-05-04 18:05:43 +00:00
Chris Lattner	940cc978ef	Remove a bunch more SparcV9 specific stuff llvm-svn: 28093	2006-05-04 01:15:02 +00:00
Nate Begeman	df4883971e	Finish up the initial jump table implementation by allowing jump tables to not be 100% dense. Increase the minimum threshold for the number of cases in a switch statement from 4 to 6 in order to create a jump table. llvm-svn: 28079	2006-05-03 03:48:02 +00:00
Evan Cheng	ffef8b9412	Bottom up register pressure reduction work: clean up some hacks and enhanced the heuristic to further reduce spills for several test cases. (Note, it may not necessarily translate to runtime win!) llvm-svn: 28076	2006-05-03 02:10:45 +00:00
Owen Anderson	20a631fde7	Refactor TargetMachine, pushing handling of TargetData into the target-specific subclasses. This has one caller-visible change: getTargetData() now returns a pointer instead of a reference. This fixes PR 759. llvm-svn: 28074	2006-05-03 01:29:57 +00:00
Evan Cheng	0d084fb9ca	Dis-favor stores more llvm-svn: 28035	2006-05-01 09:20:44 +00:00
Evan Cheng	24e795496d	Bottom up register-pressure reduction scheduler now pushes store operations up the schedule. This helps code that looks like this: loads ... computations (first set) ... stores (first set) ... loads computations (seccond set) ... stores (seccond set) ... Without this change, the stores and computations are more likely to interleave: loads ... loads ... computations (first set) ... computations (second set) ... computations (first set) ... stores (first set) ... computations (second set) ... stores (stores set) ... This can increase the number of spills if we are unlucky. llvm-svn: 28033	2006-05-01 09:14:40 +00:00
Evan Cheng	10ff7b27ce	Didn't mean ScheduleDAGList.cpp to make the last checkin. llvm-svn: 28030	2006-05-01 08:56:34 +00:00
Evan Cheng	a656242690	Remove temp. option -spiller-check-liveout, it didn't cause any failure nor performance regressions. llvm-svn: 28029	2006-05-01 08:54:57 +00:00
Chris Lattner	2b48a94413	Remove a bogus transformation. This fixes SingleSource/UnitTests/2006-01-23-InitializedBitField.c with some changes I have to the new CFE. llvm-svn: 28022	2006-04-28 23:33:20 +00:00
Evan Cheng	c5e8ce8b8c	Remove the temporary option: -no-isel-fold-inflight llvm-svn: 28012	2006-04-28 18:54:11 +00:00
Evan Cheng	d43c5c6046	TargetLowering::LowerArguments should return a VBIT_CONVERT of FORMAL_ARGUMENTS SDOperand in the return result vector. llvm-svn: 28009	2006-04-28 05:25:15 +00:00
Evan Cheng	51ab4498e7	Added a temporary option -no-isel-fold-inflight to control whether a "inflight" node can be folded. llvm-svn: 28003	2006-04-28 02:09:19 +00:00
Evan Cheng	3784f3c57c	Insert a VBIT_CONVERT between a FORMAL_ARGUMENT node and its vector uses (VAND, VADD, etc.). Legalizer will assert otherwise. llvm-svn: 27991	2006-04-27 08:29:42 +00:00
Chris Lattner	393d96a56c	Fix Regression/CodeGen/Generic/2006-04-26-SetCCAnd.ll and PR748. llvm-svn: 27987	2006-04-27 05:01:07 +00:00
Evan Cheng	9618df1190	Don't forget return void. llvm-svn: 27974	2006-04-25 23:03:35 +00:00
Nate Begeman	866b4b4d45	Fix the updating of the machine CFG when a PHI node was in a successor of the jump table's range check block. This re-enables 100% dense jump tables by default on PPC & x86 llvm-svn: 27952	2006-04-23 06:26:20 +00:00
Nate Begeman	ecb1dafd3d	Turn of jump tables for a bit, there are still some issues to work out with updating the machine CFG. llvm-svn: 27949	2006-04-22 23:51:56 +00:00
Nate Begeman	4ca2ea5b43	JumpTable support! What this represents is working asm and jit support for x86 and ppc for 100% dense switch statements when relocations are non-PIC. This support will be extended and enhanced in the coming days to support PIC, and less dense forms of jump tables. llvm-svn: 27947	2006-04-22 18:53:45 +00:00
Chris Lattner	b21d3bfd1f	The BFS scheduler is apparently nondeterminstic (causes many llvmgcc bootstrap miscompares). Switch RISC targets to use the list-td scheduler, which isn't. llvm-svn: 27933	2006-04-21 17:16:16 +00:00
Chris Lattner	662e940f73	Fix a couple more memory issues llvm-svn: 27930	2006-04-21 15:32:26 +00:00
Chris Lattner	cc47ab3305	Fix a really subtle and obnoxious memory bug that caused issues with an llvm-gcc4 boostrap. Whenever a node is deleted by the dag combiner, it must be returned by the visit function, or the dag combiner will not know that the node has been processed (and will, e.g., send it to the target dag combine xforms). llvm-svn: 27922	2006-04-20 23:55:59 +00:00
Evan Cheng	a320abc494	Turn a VAND into a VECTOR_SHUFFLE is applicable. DAG combiner can turn a VAND V, <-1, 0, -1, -1>, i.e. vector clear elements, into a vector shuffle with a zero vector. It only does so when TLI tells it the xform is profitable. llvm-svn: 27874	2006-04-20 08:56:16 +00:00
Chris Lattner	bc1b262725	Implement folding of a bunch of binops with undef llvm-svn: 27863	2006-04-20 05:39:12 +00:00
Chris Lattner	73eb58e1a2	Simplify some code llvm-svn: 27846	2006-04-19 23:17:50 +00:00
Chris Lattner	916ae0775e	Fix handling of calls in functions that use vectors. This fixes a crash on the code in GCC PR26546. llvm-svn: 27780	2006-04-17 22:10:08 +00:00
Chris Lattner	326870b40b	Codegen insertelement with constant insertion points as scalar_to_vector and a shuffle. For this: void %test2(<4 x float>* %F, float %f) { %tmp = load <4 x float>* %F ; <<4 x float>> [#uses=2] %tmp3 = add <4 x float> %tmp, %tmp ; <<4 x float>> [#uses=1] %tmp2 = insertelement <4 x float> %tmp3, float %f, uint 2 ; <<4 x float>> [#uses=2] %tmp6 = add <4 x float> %tmp2, %tmp2 ; <<4 x float>> [#uses=1] store <4 x float> %tmp6, <4 x float>* %F ret void } we now get this on X86 (which will get better): _test2: movl 4(%esp), %eax movaps (%eax), %xmm0 addps %xmm0, %xmm0 movaps %xmm0, %xmm1 shufps $3, %xmm1, %xmm1 movaps %xmm0, %xmm2 shufps $1, %xmm2, %xmm2 unpcklps %xmm1, %xmm2 movss 8(%esp), %xmm1 unpcklps %xmm1, %xmm0 unpcklps %xmm2, %xmm0 addps %xmm0, %xmm0 movaps %xmm0, (%eax) ret instead of: _test2: subl $28, %esp movl 32(%esp), %eax movaps (%eax), %xmm0 addps %xmm0, %xmm0 movaps %xmm0, (%esp) movss 36(%esp), %xmm0 movss %xmm0, 8(%esp) movaps (%esp), %xmm0 addps %xmm0, %xmm0 movaps %xmm0, (%eax) addl $28, %esp ret llvm-svn: 27765	2006-04-17 19:21:01 +00:00
Chris Lattner	91226e5799	Add support for promoting stores from one legal type to another, allowing us to write one pattern for vector stores instead of 4. llvm-svn: 27730	2006-04-16 01:36:45 +00:00
Chris Lattner	7e7ad593cc	Make these predicates return true for bit_convert(buildvector)'s as well as buildvectors. llvm-svn: 27723	2006-04-15 23:38:00 +00:00
Chris Lattner	086e986e94	Make this assertion better llvm-svn: 27695	2006-04-14 06:08:35 +00:00
Evan Cheng	119266ea92	Promote vector AND, OR, and XOR llvm-svn: 27632	2006-04-12 21:20:24 +00:00
Evan Cheng	be8a8933e6	Vector type promotion for ISD::LOAD and ISD::SELECT llvm-svn: 27606	2006-04-12 16:33:18 +00:00
Chris Lattner	d3b504ae10	Implement support for the formal_arguments node. To get this, targets shouldcustom legalize it and remove their XXXTargetLowering::LowerArguments overload llvm-svn: 27604	2006-04-12 16:20:43 +00:00
Chris Lattner	417b96b6dd	Don't memoize vloads in the load map! Don't memoize them anywhere here, let getNode do it. This fixes CodeGen/Generic/2006-04-11-vecload.ll llvm-svn: 27602	2006-04-12 03:25:41 +00:00
Evan Cheng	7256b0ae05	Only get Tmp2 for cases where number of operands is > 1. Fixed return void. llvm-svn: 27586	2006-04-11 06:33:39 +00:00
Chris Lattner	6cf3bbbe17	add some todos llvm-svn: 27580	2006-04-11 02:00:08 +00:00
Chris Lattner	2eb22eef7d	Add basic support for legalizing returns of vectors llvm-svn: 27578	2006-04-11 01:31:51 +00:00
Evan Cheng	cb73b8d419	Missing break llvm-svn: 27559	2006-04-10 18:54:36 +00:00
Chris Lattner	02274a5265	Add code generator support for VSELECT llvm-svn: 27542	2006-04-08 22:22:57 +00:00
Chris Lattner	e1401e3610	Canonicalize vvector_shuffle(x,x) -> vvector_shuffle(x,undef) to enable patterns to match again :) llvm-svn: 27533	2006-04-08 05:34:25 +00:00
Chris Lattner	098c01e94e	Codegen shufflevector as VVECTOR_SHUFFLE llvm-svn: 27529	2006-04-08 04:15:24 +00:00
Chris Lattner	101ea66813	add a sanity check: LegalizeOp should return a value that is the same type as its input. llvm-svn: 27528	2006-04-08 04:13:17 +00:00
Evan Cheng	78e3d565af	INSERT_VECTOR_ELT lowering bug: store vector to $esp store element to $esp + sizeof(VT) * index load vector from $esp The bug is VT is the type of the vector element, not the type of the vector! llvm-svn: 27517	2006-04-08 01:46:37 +00:00
Chris Lattner	aa3185f12e	Stub out shufflevector llvm-svn: 27514	2006-04-08 01:19:25 +00:00
Evan Cheng	613996c55e	1. If both vector operands of a vector_shuffle are undef, turn it into an undef. 2. A shuffle mask element can also be an undef. llvm-svn: 27472	2006-04-06 23:20:43 +00:00
Chris Lattner	4a2413a590	Make a vector live across blocks have the correct Vec type. This fixes CodeGen/X86/2006-04-04-CrossBlockCrash.ll llvm-svn: 27436	2006-04-05 06:54:42 +00:00
Evan Cheng	9fa8959dce	Exapnd a VECTOR_SHUFFLE to a BUILD_VECTOR if target asks for it to be expanded or custom lowering fails. llvm-svn: 27432	2006-04-05 06:07:11 +00:00
Chris Lattner	4ea52cac01	Do not create ZEXTLOAD's unless we are before legalize or the operation is legal. llvm-svn: 27402	2006-04-04 17:39:18 +00:00
Chris Lattner	6be79823e7	* Add supprot for SCALAR_TO_VECTOR operations where the input needs to be promoted/expanded (e.g. SCALAR_TO_VECTOR from i8/i16 on PPC). * Add support for targets to request that VECTOR_SHUFFLE nodes be promoted to a canonical type, for example, we only want v16i8 shuffles on PPC. * Move isShuffleLegal out of TLI into Legalize. * Teach isShuffleLegal to allow shuffles that need to be promoted. llvm-svn: 27399	2006-04-04 17:23:26 +00:00
Chris Lattner	a9e77d14c7	Constant fold bitconvert(undef) llvm-svn: 27391	2006-04-04 01:02:22 +00:00
Chris Lattner	e1e3adf802	Add a missing check, this fixes UnitTests/Vector/sumarray.c llvm-svn: 27375	2006-04-03 17:29:28 +00:00
Chris Lattner	04c00fc844	Add a missing check, which broke a bunch of vector tests. llvm-svn: 27374	2006-04-03 17:21:50 +00:00
Andrew Lenharth	94f012f606	back this out llvm-svn: 27367	2006-04-03 03:16:50 +00:00
Andrew Lenharth	015eaf5f33	This should be a win of every arch llvm-svn: 27364	2006-04-02 21:42:45 +00:00
Chris Lattner	4993249a04	Add a little dag combine to compile this: int %AreSecondAndThirdElementsBothNegative(<4 x float>* %in) { entry: %tmp1 = load <4 x float>* %in ; <<4 x float>> [#uses=1] %tmp = tail call int %llvm.ppc.altivec.vcmpgefp.p( int 1, <4 x float> < float 0x7FF8000000000000, float 0.000000e+00, float 0.000000e+00, float 0x7FF8000000000000 >, <4 x float> %tmp1 ) ; <int> [#uses=1] %tmp = seteq int %tmp, 0 ; <bool> [#uses=1] %tmp3 = cast bool %tmp to int ; <int> [#uses=1] ret int %tmp3 } into this: _AreSecondAndThirdElementsBothNegative: mfspr r2, 256 oris r4, r2, 49152 mtspr 256, r4 li r4, lo16(LCPI1_0) lis r5, ha16(LCPI1_0) lvx v0, 0, r3 lvx v1, r5, r4 vcmpgefp. v0, v1, v0 mfcr r3, 2 rlwinm r3, r3, 27, 31, 31 mtspr 256, r2 blr instead of this: _AreSecondAndThirdElementsBothNegative: mfspr r2, 256 oris r4, r2, 49152 mtspr 256, r4 li r4, lo16(LCPI1_0) lis r5, ha16(LCPI1_0) lvx v0, 0, r3 lvx v1, r5, r4 vcmpgefp. v0, v1, v0 mfcr r3, 2 rlwinm r3, r3, 27, 31, 31 xori r3, r3, 1 cntlzw r3, r3 srwi r3, r3, 5 mtspr 256, r2 blr llvm-svn: 27356	2006-04-02 06:11:11 +00:00
Chris Lattner	42a5fca47e	Implement promotion for EXTRACT_VECTOR_ELT, allowing v16i8 multiplies to work with PowerPC. llvm-svn: 27349	2006-04-02 05:06:04 +00:00
Chris Lattner	87f080949b	Implement the Expand action for binary vector operations to break the binop into elements and operate on each piece. This allows generic vector integer multiplies to work on PPC, though the generated code is horrible. llvm-svn: 27347	2006-04-02 03:57:31 +00:00
Chris Lattner	a9c59156be	Intrinsics that just load from memory can be treated like loads: they don't have to serialize against each other. This allows us to schedule lvx's across each other, for example. llvm-svn: 27346	2006-04-02 03:41:14 +00:00
Chris Lattner	0442a18758	Constant fold all of the vector binops. This allows us to compile this: "vector unsigned char mergeLowHigh = (vector unsigned char) ( 8, 9, 10, 11, 16, 17, 18, 19, 12, 13, 14, 15, 20, 21, 22, 23 ); vector unsigned char mergeHighLow = vec_xor( mergeLowHigh, vec_splat_u8(8));" aka: void %test2(<16 x sbyte>* %P) { store <16 x sbyte> cast (<4 x int> xor (<4 x int> cast (<16 x ubyte> < ubyte 8, ubyte 9, ubyte 10, ubyte 11, ubyte 16, ubyte 17, ubyte 18, ubyte 19, ubyte 12, ubyte 13, ubyte 14, ubyte 15, ubyte 20, ubyte 21, ubyte 22, ubyte 23 > to <4 x int>), <4 x int> cast (<16 x sbyte> < sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8 > to <4 x int>)) to <16 x sbyte>), <16 x sbyte> * %P ret void } into this: _test2: mfspr r2, 256 oris r4, r2, 32768 mtspr 256, r4 li r4, lo16(LCPI2_0) lis r5, ha16(LCPI2_0) lvx v0, r5, r4 stvx v0, 0, r3 mtspr 256, r2 blr instead of this: _test2: mfspr r2, 256 oris r4, r2, 49152 mtspr 256, r4 li r4, lo16(LCPI2_0) lis r5, ha16(LCPI2_0) vspltisb v0, 8 lvx v1, r5, r4 vxor v0, v1, v0 stvx v0, 0, r3 mtspr 256, r2 blr ... which occurs here: http://developer.apple.com/hardware/ve/calcspeed.html llvm-svn: 27343	2006-04-02 03:25:57 +00:00
Chris Lattner	ef598059f2	Add a new -view-legalize-dags command line option llvm-svn: 27342	2006-04-02 03:07:27 +00:00
Chris Lattner	e4e64b6b85	Implement constant folding of bit_convert of arbitrary constant vbuild_vector nodes. llvm-svn: 27341	2006-04-02 02:53:43 +00:00
Chris Lattner	1c22728787	These entries already exist llvm-svn: 27340	2006-04-02 02:51:27 +00:00
Chris Lattner	1985e1cbb8	Add some missing node names llvm-svn: 27339	2006-04-02 02:41:18 +00:00
Chris Lattner	bec582f4cd	Prefer larger register classes over smaller ones when a register occurs in multiple register classes. This fixes PowerPC/2006-04-01-FloatDoubleExtend.ll llvm-svn: 27334	2006-04-02 00:24:45 +00:00
Chris Lattner	39dcf1a9e2	Delete identity shuffles, implementing CodeGen/Generic/vector-identity-shuffle.ll llvm-svn: 27317	2006-03-31 22:16:43 +00:00
Chris Lattner	d9e4daabd2	Do not endian swap split vector loads. This fixes UnitTests/Vector/sumarray-dbl on PPC. Now all UnitTests/Vector/* tests pass on PPC. llvm-svn: 27299	2006-03-31 18:22:37 +00:00
Chris Lattner	8d90f526d7	Do not endian swap the operands to a store if the operands came from a vector. This fixes UnitTests/Vector/simple.c with altivec. llvm-svn: 27298	2006-03-31 18:20:46 +00:00
Chris Lattner	7e30af3887	Remove dead *extloads. This allows us to codegen vector.ll:test_extract_elt to: test_extract_elt: alloc r3 = ar.pfs,0,1,0,0 adds r8 = 12, r32 ;; ldfs f8 = [r8] mov ar.pfs = r3 br.ret.sptk.many rp instead of: test_extract_elt: alloc r3 = ar.pfs,0,1,0,0 adds r8 = 28, r32 adds r9 = 24, r32 adds r10 = 20, r32 adds r11 = 16, r32 ;; ldfs f6 = [r8] ;; ldfs f6 = [r9] adds r8 = 12, r32 adds r9 = 8, r32 adds r14 = 4, r32 ;; ldfs f6 = [r10] ;; ldfs f6 = [r11] ldfs f8 = [r8] ;; ldfs f6 = [r9] ;; ldfs f6 = [r14] ;; ldfs f6 = [r32] mov ar.pfs = r3 br.ret.sptk.many rp llvm-svn: 27297	2006-03-31 18:10:41 +00:00
Chris Lattner	2d8551c85b	Delete dead loads in the dag. This allows us to compile vector.ll:test_extract_elt2 into: _test_extract_elt2: lfd f1, 32(r3) blr instead of: _test_extract_elt2: lfd f0, 56(r3) lfd f0, 48(r3) lfd f0, 40(r3) lfd f1, 32(r3) lfd f0, 24(r3) lfd f0, 16(r3) lfd f0, 8(r3) lfd f0, 0(r3) blr llvm-svn: 27296	2006-03-31 18:06:18 +00:00
Chris Lattner	6f42325dca	Implement PromoteOp for VEXTRACT_VECTOR_ELT. Thsi fixes Generic/vector.ll:test_extract_elt on non-sse X86 systems. llvm-svn: 27294	2006-03-31 17:55:51 +00:00
Chris Lattner	8e1fcab2bc	Scalarized vector stores need not be legal, e.g. if the vector element type needs to be promoted or expanded. Relegalize the scalar store once created. This fixes CodeGen/Generic/vector.ll:test1 on non-SSE x86 targets. llvm-svn: 27293	2006-03-31 17:37:22 +00:00
Chris Lattner	ba38035e21	Make sure to pass enough values to phi nodes when we are dealing with decimated vectors. This fixes UnitTests/Vector/sumarray-dbl.c llvm-svn: 27280	2006-03-31 02:12:18 +00:00
Chris Lattner	5fe1f54c17	Significantly improve handling of vectors that are live across basic blocks, handling cases where the vector elements need promotion, expansion, and when the vector type itself needs to be decimated. llvm-svn: 27278	2006-03-31 02:06:56 +00:00
Evan Cheng	168e45b0b3	Expand INSERT_VECTOR_ELT to store vec, sp; store elt, sp+k; vec = load sp; llvm-svn: 27274	2006-03-31 01:27:51 +00:00
Chris Lattner	67271869a8	Bug fixes: handle constantexpr insert/extract element operations Handle constantpacked vectors with constantexpr elements. This fixes CodeGen/Generic/vector-constantexpr.ll llvm-svn: 27241	2006-03-29 00:11:43 +00:00
Chris Lattner	20e619fba3	When building a VVECTOR_SHUFFLE node from extract_element operations, make sure to build it as SHUFFLE(X, undef, mask), not SHUFFLE(X, X, mask). The later is not canonical form, and prevents the PPC splat pattern from matching. For a particular splat, we go from generating this: li r10, lo16(LCPI1_0) lis r11, ha16(LCPI1_0) lvx v3, r11, r10 vperm v3, v2, v2, v3 to generating: vspltw v3, v2, 3 llvm-svn: 27236	2006-03-28 22:19:47 +00:00
Chris Lattner	a46dfe80c8	Canonicalize VECTOR_SHUFFLE(X, X, Y) -> VECTOR_SHUFFLE(X,undef,Y') llvm-svn: 27235	2006-03-28 22:11:53 +00:00
Chris Lattner	c9992548fc	Turn a series of extract_element's feeding a build_vector into a vector_shuffle node. For this: void test(__m128 res, __m128 A, __m128 B) { res = _mm_unpacklo_ps(A, B); } we now produce this code: _test: movl 8(%esp), %eax movaps (%eax), %xmm0 movl 12(%esp), %eax unpcklps (%eax), %xmm0 movl 4(%esp), %eax movaps %xmm0, (%eax) ret instead of this: _test: subl $76, %esp movl 88(%esp), %eax movaps (%eax), %xmm0 movaps %xmm0, (%esp) movaps %xmm0, 32(%esp) movss 4(%esp), %xmm0 movss 32(%esp), %xmm1 unpcklps %xmm0, %xmm1 movl 84(%esp), %eax movaps (%eax), %xmm0 movaps %xmm0, 16(%esp) movaps %xmm0, 48(%esp) movss 20(%esp), %xmm0 movss 48(%esp), %xmm2 unpcklps %xmm0, %xmm2 unpcklps %xmm1, %xmm2 movl 80(%esp), %eax movaps %xmm2, (%eax) addl $76, %esp ret GCC produces this (with -fomit-frame-pointer): _test: subl $12, %esp movl 20(%esp), %eax movaps (%eax), %xmm0 movl 24(%esp), %eax unpcklps (%eax), %xmm0 movl 16(%esp), %eax movaps %xmm0, (%eax) addl $12, %esp ret llvm-svn: 27233	2006-03-28 20:28:38 +00:00
Chris Lattner	f6f94d3bce	Teach Legalize how to pack VVECTOR_SHUFFLE nodes into VECTOR_SHUFFLE nodes. llvm-svn: 27232	2006-03-28 20:24:43 +00:00
Chris Lattner	8d57da2ffc	new node llvm-svn: 27231	2006-03-28 19:54:42 +00:00
Chris Lattner	b7163598f9	Don't crash on X^X if X is a vector. Instead, produce a vector of zeros. llvm-svn: 27229	2006-03-28 19:11:05 +00:00
Chris Lattner	ffec47ebff	Add an assertion llvm-svn: 27228	2006-03-28 19:04:49 +00:00
Jim Laskey	67a636c587	More bulletproofing of llvm.dbg.declare. llvm-svn: 27224	2006-03-28 13:45:20 +00:00
Chris Lattner	e55d171ccd	Tblgen doesn't like multiple SDNode<> definitions that map to the sameenum value. Split them into separate enums. llvm-svn: 27201	2006-03-28 00:40:33 +00:00
Jim Laskey	d387cc5cde	Reactivate llvm.dbg.declare. llvm-svn: 27192	2006-03-27 23:31:10 +00:00
Chris Lattner	5bb1d90afd	Disable dbg_declare, it currently breaks the CFE build llvm-svn: 27182	2006-03-27 21:36:03 +00:00
Chris Lattner	d5f94c9574	Fix legalization of intrinsics with chain and result values llvm-svn: 27181	2006-03-27 20:28:29 +00:00
Chris Lattner	0e84f1e532	Unbreak the build on non-apple compilers :-( llvm-svn: 27173	2006-03-27 16:10:59 +00:00
Evan Cheng	d09b05b0bc	Try again llvm-svn: 27171	2006-03-27 08:10:26 +00:00
Evan Cheng	64efb35c32	Incorrect check for FP all one's llvm-svn: 27169	2006-03-27 07:26:17 +00:00
Evan Cheng	c70e33cd6e	Change isBuildVectorAllOnesInteger to isBuildVectorAllOnes. Also check for floating point cases. llvm-svn: 27165	2006-03-27 06:58:47 +00:00
Chris Lattner	52fcad3a37	Instead of printing "INTRINSIC" on intrinsic node, print the intrinsic name. llvm-svn: 27164	2006-03-27 06:45:25 +00:00
Nate Begeman	ed728c1291	SelectionDAGISel can now natively handle Switch instructions, in the same manner that the LowerSwitch LLVM to LLVM pass does: emitting a binary search tree of basic blocks. The new approach has several advantages: it is faster, it generates significantly smaller code in many cases, and it paves the way for implementing dense switch tables as a jump table by handling switches directly in the instruction selector. This functionality is currently only enabled on x86, but should be safe for every target. In anticipation of making it the default, the cfg is now properly updated in the x86, ppc, and sparc select lowering code. llvm-svn: 27156	2006-03-27 01:32:24 +00:00
Jim Laskey	7092888bcc	Bullet proof against undefined args produced by upgrading ols-style debug info. llvm-svn: 27155	2006-03-26 22:46:27 +00:00
Evan Cheng	a67899195f	Add ISD::isBuildVectorAllZeros predicate llvm-svn: 27147	2006-03-26 09:50:58 +00:00
Chris Lattner	30ee72586d	Allow targets to custom lower their own intrinsics if desired. llvm-svn: 27146	2006-03-26 09:12:51 +00:00
Chris Lattner	f6e3b957b8	Fix a bug in ISD::isBuildVectorAllOnesInteger that caused it to always return false llvm-svn: 27131	2006-03-25 22:59:28 +00:00
Chris Lattner	c2d2811a07	Implement the ISD::isBuildVectorAllOnesInteger predicate llvm-svn: 27130	2006-03-25 22:57:01 +00:00
Chris Lattner	dc1eab5886	Don't call SimplifyDemandedBits on vectors llvm-svn: 27128	2006-03-25 22:19:00 +00:00
Chris Lattner	313229c74b	fix inverted conditional llvm-svn: 27089	2006-03-24 22:49:42 +00:00
Evan Cheng	68d9bf26c8	Only to vector shuffle for {x,x,y,y} cases when SCALAR_TO_VECTOR is free. llvm-svn: 27071	2006-03-24 18:45:20 +00:00
Jim Laskey	53f1ecc560	Rename for truth in advertising. llvm-svn: 27063	2006-03-24 09:50:27 +00:00
Chris Lattner	77e271cb4e	prefer to generate constant pool loads over splats. This prevents us from using a splat for {1.0,1.0,1.0,1.0} llvm-svn: 27055	2006-03-24 07:29:17 +00:00
Chris Lattner	87b1dddb1c	fix spello llvm-svn: 27053	2006-03-24 07:15:07 +00:00
Chris Lattner	a4f6805a86	legalize vbit_convert nodes whose result is a legal type. Legalize intrinsic nodes. llvm-svn: 27036	2006-03-24 02:26:29 +00:00
Chris Lattner	d96b09a7b9	Lower target intrinsics into an INTRINSIC node llvm-svn: 27035	2006-03-24 02:22:33 +00:00
Chris Lattner	6b05290922	fix some bogus assertions: noop bitconverts are legal llvm-svn: 27032	2006-03-24 02:20:47 +00:00
Evan Cheng	1d2e995fc1	Lower BUILD_VECTOR to VECTOR_SHUFFLE if there are two distinct nodes (and if the target can handle it). Issue two SCALAR_TO_VECTOR ops followed by a VECTOR_SHUFFLE to select from the two vectors. llvm-svn: 27023	2006-03-24 01:17:21 +00:00
Chris Lattner	ebac9a4adf	Identify the INTRINSIC node llvm-svn: 27020	2006-03-24 01:04:30 +00:00
Chris Lattner	d7c4e7d255	add support for splitting casts. This implements CodeGen/Generic/vector.ll:test_cast_2. llvm-svn: 26999	2006-03-23 21:16:34 +00:00
Jim Laskey	a8bdac875d	Handle new forms of llvm.dbg intrinsics. llvm-svn: 26988	2006-03-23 18:06:46 +00:00
Chris Lattner	9ea1b3f9fd	simplify some code llvm-svn: 26972	2006-03-23 05:29:04 +00:00
Chris Lattner	b893d04a67	Fix a typo llvm-svn: 26965	2006-03-22 22:20:49 +00:00
Chris Lattner	2f4119a608	Implement simple support for vector casting. This can currently only handle casts between legal vector types. llvm-svn: 26961	2006-03-22 20:09:35 +00:00
Chris Lattner	8fa445a89d	Endianness does not affect the order of vector fields. This fixes SingleSource/UnitTests/Vector/build.c llvm-svn: 26936	2006-03-22 01:46:54 +00:00
Chris Lattner	5be4352124	Enclose some variables in a scope to avoid error with some gcc versions llvm-svn: 26934	2006-03-22 00:12:37 +00:00
Chris Lattner	340a6b5c26	add expand support for extractelement llvm-svn: 26931	2006-03-21 21:02:03 +00:00
Chris Lattner	7c0cd8cafc	add some trivial support for extractelement. llvm-svn: 26928	2006-03-21 20:44:12 +00:00
Chris Lattner	672a42d731	Add a hacky workaround for crashes due to vectors live across blocks. Note that this code won't work for vectors that aren't legal on the target. Improvements coming. llvm-svn: 26925	2006-03-21 19:20:37 +00:00
Chris Lattner	21e68c8001	If a target supports splatting with SHUFFLE_VECTOR, lower to it from BUILD_VECTOR(x,x,x,x) llvm-svn: 26885	2006-03-20 01:52:29 +00:00
Chris Lattner	6b20104410	TargetData doesn't know the alignment of vectors :( llvm-svn: 26884	2006-03-20 01:51:46 +00:00
Chris Lattner	00f0589bc0	Add very basic support for VECTOR_SHUFFLE llvm-svn: 26880	2006-03-19 23:56:04 +00:00
Chris Lattner	79fb91cc69	Allow SCALAR_TO_VECTOR to be custom lowered. llvm-svn: 26867	2006-03-19 06:47:21 +00:00
Chris Lattner	9cdc5a0ce7	Add SCALAR_TO_VECTOR support llvm-svn: 26866	2006-03-19 06:31:19 +00:00
Chris Lattner	eb5b2e705c	Don't bother storing undef elements of BUILD_VECTOR's llvm-svn: 26858	2006-03-19 05:46:04 +00:00
Chris Lattner	5d3ff12c8f	Implement expand of BUILD_VECTOR containing variable elements. This implements CodeGen/Generic/vector.ll:test_variable_buildvector llvm-svn: 26852	2006-03-19 04:18:56 +00:00
Chris Lattner	5336a59e4b	fold insertelement(buildvector) -> buildvector if the inserted element # is a constant. This implements test_constant_insert in CodeGen/Generic/vector.ll llvm-svn: 26851	2006-03-19 01:27:56 +00:00
Chris Lattner	29b2301460	implement basic support for INSERT_VECTOR_ELT. llvm-svn: 26849	2006-03-19 01:17:20 +00:00
Chris Lattner	f4e1a53647	Rename ConstantVec -> BUILD_VECTOR and VConstant -> VBUILD_VECTOR. Allow*BUILD_VECTOR to take variable inputs. llvm-svn: 26847	2006-03-19 00:52:58 +00:00
Chris Lattner	c16b05e67d	implement vector.ll:test_undef llvm-svn: 26845	2006-03-19 00:20:20 +00:00
Chris Lattner	93640543a9	Fix the remaining bugs in the vector expansion rework I commited yesterday. This fixes CodeGen/Generic/vector.ll llvm-svn: 26843	2006-03-19 00:07:49 +00:00
Chris Lattner	32206f54c6	Change the structure of lowering vector stuff. Note: This breaks some things. llvm-svn: 26840	2006-03-18 01:44:44 +00:00
Chris Lattner	98931bc381	add a couple enum values llvm-svn: 26830	2006-03-17 19:53:59 +00:00
Nate Begeman	bb01d4f272	Remove BRTWOWAY* Make the PPC backend not dependent on BRTWOWAY_CC and make the branch selector smarter about the code it generates, fixing a case in the readme. llvm-svn: 26814	2006-03-17 01:40:33 +00:00
Chris Lattner	7ececaad83	Fix a problem fully scalarizing values. llvm-svn: 26811	2006-03-16 23:05:19 +00:00
Chris Lattner	8471b15706	Add support for CopyFromReg from vector values. Note: this doesn't support illegal vector types yet! llvm-svn: 26799	2006-03-16 19:57:50 +00:00
Chris Lattner	49409cb925	Teach CreateRegForValue how to handle vector types. llvm-svn: 26798	2006-03-16 19:51:18 +00:00
Chris Lattner	4024c00ce7	add support for vector->vector casts llvm-svn: 26788	2006-03-15 22:19:46 +00:00
Chris Lattner	cad70c3e46	Add a note, this code should be moved to the dag combiner. llvm-svn: 26787	2006-03-15 22:19:18 +00:00
Chris Lattner	68ac09d5cb	make sure dead token factor nodes are removed by the dag combiner. llvm-svn: 26731	2006-03-13 18:37:30 +00:00
Jim Laskey	acb6e34277	Handle the removal of the debug chain. llvm-svn: 26729	2006-03-13 13:07:37 +00:00
Chris Lattner	d8c2a48d58	Fold X+Y -> X\|Y when safe. This implements: Regression/CodeGen/PowerPC/and_add.ll a case that occurs with dynamic allocas of constant size. llvm-svn: 26727	2006-03-13 06:51:27 +00:00
Chris Lattner	8bb6cb7d7b	add a couple of missing folds llvm-svn: 26724	2006-03-13 06:26:26 +00:00
Chris Lattner	994d8e6bd4	For targets with FABS/FNEG support, lower copysign to an integer load, a select and FABS/FNEG. This speeds up a trivial (aka stupid) copysign benchmark I wrote from 6.73s to 2.64s, woo. llvm-svn: 26723	2006-03-13 06:08:38 +00:00
Chris Lattner	a767dbf197	Don't advance the hazard recognizer when there are no hazards and no instructions to be emitted. Don't add one to the latency of a completed instruction if the latency of the op is 0. llvm-svn: 26718	2006-03-12 09:01:41 +00:00
Chris Lattner	86a9b60a25	Chain operands aren't real uses: they don't require the full latency of the predecessor to finish before they can start. llvm-svn: 26717	2006-03-12 03:52:09 +00:00
Chris Lattner	572003ca15	As a pending queue data structure to keep track of instructions whose operands have all issued, but whose results are not yet available. This allows us to compile: int G; int test(int A, int B, int* P) { return (G+A)*(B+1); } to: _test: lis r2, ha16(L_G$non_lazy_ptr) addi r4, r4, 1 lwz r2, lo16(L_G$non_lazy_ptr)(r2) lwz r2, 0(r2) add r2, r2, r3 mullw r3, r2, r4 blr instead of this, which has a stall between the lis/lwz: _test: lis r2, ha16(L_G$non_lazy_ptr) lwz r2, lo16(L_G$non_lazy_ptr)(r2) addi r4, r4, 1 lwz r2, 0(r2) add r2, r2, r3 mullw r3, r2, r4 blr llvm-svn: 26716	2006-03-12 00:38:57 +00:00
Chris Lattner	356183d91e	rename priorityqueue -> availablequeue. When a node is scheduled, remember which cycle it lands on. llvm-svn: 26714	2006-03-11 22:44:37 +00:00
Chris Lattner	063086b0f4	Make CurrCycle a local var instead of an instance var llvm-svn: 26713	2006-03-11 22:34:41 +00:00
Chris Lattner	9995a0c019	Move some methods around so that BU specific code is together, TD specific code is together, and direction independent code is together. llvm-svn: 26712	2006-03-11 22:28:35 +00:00
Chris Lattner	578d8fcb59	merge preds/chainpreds -> preds set merge succs/chainsuccs -> succs set This has no functionality change, simplifies the code, and reduces the size of sunits. llvm-svn: 26711	2006-03-11 22:24:20 +00:00
Evan Cheng	38280c0020	Added a parameter to control whether Constant::getStringValue() would chop off the result string at the first null terminator. llvm-svn: 26704	2006-03-10 23:52:03 +00:00
Chris Lattner	d3ef6c290a	scrape out bits of llvm-db llvm-svn: 26701	2006-03-10 22:48:19 +00:00
Chris Lattner	f918e15362	Move simple-selector-specific types to the simple selector. llvm-svn: 26693	2006-03-10 07:51:18 +00:00
Chris Lattner	5255d04357	Simplify the interface to the schedulers, to not pass the selected heuristicin. llvm-svn: 26692	2006-03-10 07:49:12 +00:00
Chris Lattner	a5b93b8c6d	Move some simple-sched-specific instance vars to the simple scheduler. llvm-svn: 26690	2006-03-10 07:42:02 +00:00
Chris Lattner	e015178de1	prune #includes llvm-svn: 26689	2006-03-10 07:37:35 +00:00
Chris Lattner	4b70ff7876	move some simple scheduler methods into the simple scheduler llvm-svn: 26688	2006-03-10 07:35:21 +00:00
Chris Lattner	dc2f135f5c	Make EmitNode take a SDNode instead of a NodeInfo* llvm-svn: 26687	2006-03-10 07:28:36 +00:00
Chris Lattner	b9d8fa0342	Move the VRBase field from NodeInfo to being a separate, explicit, map. llvm-svn: 26686	2006-03-10 07:25:12 +00:00
Chris Lattner	c48cfba44b	no need to build groups anymore llvm-svn: 26684	2006-03-10 07:15:58 +00:00
Chris Lattner	6f82fe8106	Create SUnits directly from the SelectionDAG. llvm-svn: 26683	2006-03-10 07:13:32 +00:00
Chris Lattner	2f8c7c3d55	Push PrepareNodeInfo/IdentifyGroups down the inheritance hierarchy llvm-svn: 26682	2006-03-10 06:34:51 +00:00
Chris Lattner	349e9ddccc	Teach the latency scheduler some new tricks. In particular, to break ties, keep track of a sense of "mobility", i.e. how many other nodes scheduling one node will free up. For something like this: float testadd(float X, float Y, float Z, float W, float V) { return (X+Y)(Z+W)+V; } For example, this makes us schedule X then Y, not X then *Z. The former allows us to issue the add, the later only lets us issue other loads. This turns the above code from this: _testadd: lfs f0, 0(r3) lfs f1, 0(r6) lfs f2, 0(r4) lfs f3, 0(r5) fadds f0, f0, f2 fadds f1, f3, f1 lfs f2, 0(r7) fmadds f1, f0, f1, f2 blr into this: _testadd: lfs f0, 0(r6) lfs f1, 0(r5) fadds f0, f1, f0 lfs f1, 0(r4) lfs f2, 0(r3) fadds f1, f2, f1 lfs f2, 0(r7) fmadds f1, f1, f0, f2 blr llvm-svn: 26680	2006-03-10 05:51:05 +00:00
Chris Lattner	25e2556b71	add an aggregate method for reinserting scheduled nodes, add a callback for priority impls that want to be notified when a node is scheduled llvm-svn: 26678	2006-03-10 04:32:49 +00:00
Jeff Cohen	6ce97687f7	Fix VC++ build breakage. llvm-svn: 26676	2006-03-10 03:57:45 +00:00
Chris Lattner	213209a248	remove dbg_declare, it's not used yet. llvm-svn: 26659	2006-03-09 20:02:42 +00:00
Chris Lattner	c6c9e65301	remove temporary option llvm-svn: 26646	2006-03-09 17:31:22 +00:00
Chris Lattner	d17d77aa1d	yes yes, enabled debug output is bad llvm-svn: 26637	2006-03-09 07:39:25 +00:00
Chris Lattner	6398c13128	switch the t-d scheduler to use a really dumb and trivial critical path latency priority function. llvm-svn: 26636	2006-03-09 07:38:27 +00:00
Chris Lattner	d4130375c0	Pull latency information for target instructions out of the latency tables. :) Only enable this with -use-sched-latencies, I'll enable it by default with a clean nightly tester run tonight. PPC is the only target that provides latency info currently. llvm-svn: 26634	2006-03-09 07:15:18 +00:00
Chris Lattner	da6aafeef4	don't copy all itinerary data llvm-svn: 26633	2006-03-09 07:13:00 +00:00
Chris Lattner	399bee27f0	PriorityQueue is an instance var, use it. llvm-svn: 26632	2006-03-09 06:48:37 +00:00
Chris Lattner	9e95accf4e	add some comments llvm-svn: 26631	2006-03-09 06:37:29 +00:00
Chris Lattner	9df647539d	Refactor the priority mechanism one step further: now that it is a separate class, sever its implementation from the interface. Now we can provide new implementations of the same interface (priority computation) without touching the scheduler itself. llvm-svn: 26630	2006-03-09 06:35:14 +00:00
Jim Laskey	2698f0de7a	Get rid of the multiple copies of getStringValue. Now a Constant:: method. llvm-svn: 26616	2006-03-08 18:11:07 +00:00
Chris Lattner	fd22d42945	Split the priority function computation and priority queue management out of the ScheduleDAGList class into a new SchedulingPriorityQueue class. llvm-svn: 26613	2006-03-08 05:18:27 +00:00
Chris Lattner	42e2026cb0	switch from an explicitly managed list of SUnits to a simple vector of sunits llvm-svn: 26612	2006-03-08 04:54:34 +00:00
Chris Lattner	12c6d89204	Shrinkify some fields, fit to 80 columns llvm-svn: 26611	2006-03-08 04:41:06 +00:00
Chris Lattner	3fe975b846	revert the previous patch, didn't mean to check it in yet llvm-svn: 26610	2006-03-08 04:39:05 +00:00
Chris Lattner	af5e26c980	remove "Slot", it is dead llvm-svn: 26609	2006-03-08 04:37:58 +00:00
Chris Lattner	543832d39d	Change the interface for getting a target HazardRecognizer to be more clean. llvm-svn: 26608	2006-03-08 04:25:59 +00:00
Chris Lattner	0c801bd1cf	Fix some formatting, when looking for hazards, prefer target nodes over things like copyfromreg. llvm-svn: 26586	2006-03-07 05:40:43 +00:00
Chris Lattner	01aa752a36	update file comment llvm-svn: 26573	2006-03-06 17:58:04 +00:00
Evan Cheng	a00c61932d	Remove some code that doesn't make sense llvm-svn: 26572	2006-03-06 07:31:44 +00:00
Evan Cheng	c5c0658aa6	Remove SUnit::Priority1: it is re-calculated on demand as number of live range to be generated. llvm-svn: 26570	2006-03-06 06:08:54 +00:00
Chris Lattner	47639dbb93	Hoist the HazardRecognizer out of the ScheduleDAGList.cpp file to where targets can implement them. Make the top-down scheduler non-g5-specific. Remove the old testing hazard recognizer. llvm-svn: 26569	2006-03-06 00:22:00 +00:00
Chris Lattner	00b52ea8f9	Comment fixes llvm-svn: 26567	2006-03-05 23:59:20 +00:00
Chris Lattner	80268aaeed	Don't depend on the C99 copysign function, implement it ourselves. llvm-svn: 26566	2006-03-05 23:57:58 +00:00
Chris Lattner	2d945ba4c7	When a hazard recognizer needs noops to be inserted, do so. This represents noops as null pointers in the instruction sequence. llvm-svn: 26564	2006-03-05 23:51:47 +00:00
Chris Lattner	fa5e1c9c26	Implement G5HazardRecognizer as a trivial thing that wants 5 cycles between copyfromreg nodes. Clearly useful! llvm-svn: 26559	2006-03-05 23:13:56 +00:00
Chris Lattner	e50c092b7c	Add basic hazard recognizer support. noop insertion isn't complete yet though. llvm-svn: 26558	2006-03-05 22:45:01 +00:00
Jeff Cohen	55e2aac24b	Fix VC++ compilation error. llvm-svn: 26554	2006-03-05 21:43:37 +00:00
Chris Lattner	98ecb8ec61	Split the list scheduler into top-down and bottom-up pieces. The priority function of the top-down scheduler are completely bogus currently, and having (future) PPC specific in this file is also wrong, but this is a small incremental step. llvm-svn: 26552	2006-03-05 21:10:33 +00:00
Chris Lattner	7a36d97518	Move the available queue to being inside the ListSchedule method, since it bounds its lifetime. llvm-svn: 26550	2006-03-05 20:21:55 +00:00
Chris Lattner	bdaf4f38b5	Reinstate this now that the offending opposite xform has been removed. llvm-svn: 26548	2006-03-05 19:53:55 +00:00
Chris Lattner	c610e62e46	print arbitrary constant pool entries llvm-svn: 26545	2006-03-05 09:38:03 +00:00
Evan Cheng	d428e22c07	Back out fold (shl (add x, c1), c2) -> (add (shl x, c2), c1<<c2) for now. It's causing an infinite loop compiling ldecod on x86 / Darwin. llvm-svn: 26544	2006-03-05 07:30:16 +00:00
Chris Lattner	3bc4050217	Add some simple copysign folds llvm-svn: 26543	2006-03-05 05:30:57 +00:00
Chris Lattner	5c1ba2ac08	Codegen copysign[f] into a FCOPYSIGN node llvm-svn: 26542	2006-03-05 05:09:38 +00:00
Chris Lattner	f29f5204cc	fold (mul (add x, c1), c2) -> (add (mul x, c2), c1*c2) fold (shl (add x, c1), c2) -> (add (shl x, c2), c1<<c2) This allows us to compile CodeGen/PowerPC/addi-reassoc.ll into: _test1: slwi r2, r4, 4 add r2, r2, r3 lwz r3, 36(r2) blr _test2: mulli r2, r4, 5 add r2, r2, r3 lbz r2, 11(r2) extsb r3, r2 blr instead of: _test1: addi r2, r4, 2 slwi r2, r2, 4 add r2, r3, r2 lwz r3, 4(r2) blr _test2: addi r2, r4, 2 mulli r2, r2, 5 add r2, r3, r2 lbz r2, 1(r2) extsb r3, r2 blr llvm-svn: 26535	2006-03-04 23:33:26 +00:00
Evan Cheng	3bf916ddd9	Add more vector NodeTypes: VSDIV, VUDIV, VAND, VOR, and VXOR. llvm-svn: 26504	2006-03-03 07:01:07 +00:00
Evan Cheng	23e75f5b49	SDOperand::isOperand should not be a forwarding. It must check *this against N's operands. llvm-svn: 26502	2006-03-03 06:42:32 +00:00
Evan Cheng	6b08ae8497	Added isOperand(N): true if this is an operand of N llvm-svn: 26501	2006-03-03 06:24:54 +00:00
Evan Cheng	5e9a695026	A bit more tweaking llvm-svn: 26500	2006-03-03 06:23:43 +00:00
Jeff Cohen	55c1173a6c	Fix VC++ compilation errors. llvm-svn: 26498	2006-03-03 03:25:07 +00:00
Chris Lattner	ad3c974a77	remove the read/write port/io intrinsics. llvm-svn: 26479	2006-03-03 00:19:58 +00:00
Chris Lattner	093c159efb	Split memcpy/memset/memmove intrinsics into i32/i64 versions, resolving PR709, and paving the way for future progress. llvm-svn: 26476	2006-03-03 00:00:25 +00:00
Evan Cheng	4e3904f637	- Fixed some priority calculation bugs that were causing bug 478. Among them: a predecessor appearing more than once in the operand list was counted as multiple predecessor; priority1 should be updated during scheduling; CycleBound was updated after the node is inserted into priority queue; one of the tie breaking condition was flipped. - Take into consideration of two address opcodes. If a predecessor is a def&use operand, it should have a higher priority. - Scheduler should also favor floaters, i.e. nodes that do not have real predecessors such as MOV32ri. - The scheduling fixes / tweaks fixed bug 478: .text .align 4 .globl _f _f: movl 4(%esp), %eax movl 8(%esp), %ecx movl %eax, %edx imull %ecx, %edx imull %eax, %eax imull %ecx, %ecx addl %eax, %ecx leal (%ecx,%edx,2), %eax ret It is also a slight performance win (1% - 3%) for most tests. llvm-svn: 26470	2006-03-02 21:38:29 +00:00
Chris Lattner	0db2f2c689	Fix CodeGen/Generic/2006-03-01-dagcombineinfloop.ll, an infinite loop in the dag combiner on 176.gcc on x86. llvm-svn: 26459	2006-03-01 21:47:21 +00:00
Chris Lattner	232024edb8	Fix a typo evan noticed llvm-svn: 26454	2006-03-01 19:55:35 +00:00
Chris Lattner	bc1c85beea	Add support for target-specific dag combines llvm-svn: 26443	2006-03-01 04:53:38 +00:00
Chris Lattner	fbcd62d3bb	Add a new AddToWorkList method, start using it llvm-svn: 26441	2006-03-01 04:03:14 +00:00
Chris Lattner	324871ef1a	Pull shifts by a constant through multiplies (a form of reassociation), implementing Regression/CodeGen/X86/mul-shift-reassoc.ll llvm-svn: 26440	2006-03-01 03:44:24 +00:00
Evan Cheng	b97aab4371	Vector ops lowering. llvm-svn: 26436	2006-03-01 01:09:54 +00:00
Evan Cheng	be85e89ec4	- Added VConstant as an abstract version of ConstantVec. - All abstrct vector nodes must have # of elements and element type as their first two operands. llvm-svn: 26432	2006-03-01 00:51:13 +00:00
Chris Lattner	f0032b350c	Compile: unsigned foo4(unsigned short P) { return P & 255; } unsigned foo5(short P) { return P & 255; } to: _foo4: lbz r3,1(r3) blr _foo5: lbz r3,1(r3) blr not: _foo4: lhz r2, 0(r3) rlwinm r3, r2, 0, 24, 31 blr _foo5: lhz r2, 0(r3) rlwinm r3, r2, 0, 24, 31 blr llvm-svn: 26419	2006-02-28 06:49:37 +00:00
Chris Lattner	bdbc4476d9	Fold "and (LOAD P), 255" -> zextload. This allows us to compile: unsigned foo3(unsigned P) { return P & 255; } as: _foo3: lbz r3, 3(r3) blr instead of: _foo3: lwz r2, 0(r3) rlwinm r3, r2, 0, 24, 31 blr and: unsigned short foo2(float a) { return a; } as: _foo2: fctiwz f0, f1 stfd f0, -8(r1) lhz r3, -2(r1) blr instead of: _foo2: fctiwz f0, f1 stfd f0, -8(r1) lwz r2, -4(r1) rlwinm r3, r2, 0, 16, 31 blr llvm-svn: 26417	2006-02-28 06:35:35 +00:00
Chris Lattner	0f8a727c49	fold (sra (sra x, c1), c2) -> (sra x, c1+c2) llvm-svn: 26416	2006-02-28 06:23:04 +00:00
Chris Lattner	9fed5b6122	Add support for output memory constraints. llvm-svn: 26410	2006-02-27 23:45:39 +00:00
Chris Lattner	47ee42829d	remove some completed notes llvm-svn: 26390	2006-02-27 00:39:31 +00:00
Evan Cheng	9f9662b86e	Print ConstantPoolSDNode offset field. llvm-svn: 26381	2006-02-26 08:36:57 +00:00
Evan Cheng	ed169db8a5	Added an offset field to ConstantPoolSDNode. llvm-svn: 26371	2006-02-25 09:54:52 +00:00
Chris Lattner	5af3fdec12	Pass all the flags to the asm printer, not just the # operands. llvm-svn: 26362	2006-02-24 19:50:58 +00:00
Chris Lattner	2f8a794b13	rename NumOps -> NumVals to avoid shadowing a NumOps var in an outer scope. Add support for addressing modes. llvm-svn: 26361	2006-02-24 19:18:20 +00:00
Chris Lattner	86c51000db	Refactor operand adding out to a new AddOperand method llvm-svn: 26358	2006-02-24 18:54:03 +00:00
Jeff Cohen	83c22e0d75	Get VC++ building again. llvm-svn: 26351	2006-02-24 02:52:40 +00:00
Chris Lattner	dcf785bf46	Implement (most of) selection of inline asm memory operands. llvm-svn: 26350	2006-02-24 02:13:54 +00:00
Chris Lattner	7ef7a64ebb	Lower C_Memory operands. llvm-svn: 26346	2006-02-24 01:11:24 +00:00
Chris Lattner	e7c0ffb3a0	Fix an endianness problem on big-endian targets with expanded operands to inline asms. Mark some methods const. llvm-svn: 26334	2006-02-23 20:06:57 +00:00
Chris Lattner	571d9647c6	Record all of the expanded registers in the DAG and machine instr, fixing several bugs in inline asm expanded operands. llvm-svn: 26332	2006-02-23 19:21:04 +00:00
Chris Lattner	b1124f3c76	This fixes a couple of problems with expansion llvm-svn: 26318	2006-02-22 23:09:03 +00:00
Chris Lattner	6f87d18be9	Change a whole bunch of code to be built around RegsForValue instead of a single register number. This fully implements promotion for inline asms, expand is close but not quite right yet. llvm-svn: 26316	2006-02-22 22:37:12 +00:00
Chris Lattner	7ad77dfc2a	split register class handling from explicit physreg handling. llvm-svn: 26308	2006-02-22 00:56:39 +00:00
Chris Lattner	5c79f98f15	Adjust to changes in getRegForInlineAsmConstraint prototype llvm-svn: 26306	2006-02-21 23:12:12 +00:00
Chris Lattner	301f45cf6f	Fix a problem Nate and Duraid reported where simplifying nodes can cause them to get ressurected, in which case, deleting the undead nodes is unfriendly. llvm-svn: 26291	2006-02-20 06:51:04 +00:00
Chris Lattner	486d1bc5ed	Fix a problem on itanium with memset. The value to set has been promoted to i64 before this code, so zero_ext doesn't work. llvm-svn: 26290	2006-02-20 06:38:35 +00:00
Nate Begeman	abac61603f	Add checks to make sure we don't create bogus extend nodes, and fix a bug where we were doing exactly that which was causing failures on x86 and alpha. llvm-svn: 26284	2006-02-18 02:40:58 +00:00
Chris Lattner	375e1a71cc	Fix a tricky issue in the SimplifyDemandedBits code where CombineTo wasn't exactly the API we wanted to call into. This fixes the crash on crafty last night. llvm-svn: 26269	2006-02-17 21:58:01 +00:00
Nate Begeman	fb5dbadf15	Clean up DemandedBitsAreZero interface Make more use of the new mask helpers in valuetypes.h Combine (sra (srl x, c1), c1) -> sext_inreg if legal llvm-svn: 26263	2006-02-17 19:54:08 +00:00
Nate Begeman	57b3567552	Don't expand sdiv by power of two before legalize, since it will likely generate illegal nodes. llvm-svn: 26261	2006-02-17 07:26:20 +00:00
Nate Begeman	5965bd19f8	kill ADD_PARTS & SUB_PARTS and replace them with fancy new ADDC, ADDE, SUBC and SUBE nodes that actually expose what's going on and allow for significant simplifications in the targets. llvm-svn: 26255	2006-02-17 05:43:56 +00:00
Chris Lattner	9ec392b2aa	Fix another miscompilation exposed by lencode, where we lowered i64->f32 conversions to __floatdidf instead of __floatdisf on targets that support f32 but not i64 (e.g. sparc). llvm-svn: 26254	2006-02-17 04:32:33 +00:00
Evan Cheng	c3dcf5a4d7	Dumb bug. Code sees a memcpy from X+c so it increments src offset. But it turns out not to point to a constant string but it forgot change the offset back. llvm-svn: 26242	2006-02-16 23:11:42 +00:00
Nate Begeman	8a77efe4f7	Rework the SelectionDAG-based implementations of SimplifyDemandedBits and ComputeMaskedBits to match the new improved versions in instcombine. Tested against all of multisource/benchmarks on ppc. llvm-svn: 26238	2006-02-16 21:11:51 +00:00
Evan Cheng	42c01c8d39	If the false case is the current basic block, then this is a self loop. We do not want to emit "Loop: ... brcond Out; br Loop", as it adds an extra instruction in the loop. Instead, invert the condition and emit "Loop: ... br!cond Loop; br Out. Generalize the fix by moving it from PPCDAGToDAGISel to SelectionDAGLowering. llvm-svn: 26231	2006-02-16 08:27:56 +00:00
Chris Lattner	471627c49d	Lowering of sdiv X, pow2 was broken, this fixes it. This patch is written by Nate, I'm just committing it for him. llvm-svn: 26230	2006-02-16 08:02:36 +00:00
Evan Cheng	93e4865d4b	Remove an unused function parameter. llvm-svn: 26221	2006-02-15 22:12:35 +00:00
Evan Cheng	6781b6e62e	Turn a memcpy from string constant into a series of stores of constant values. llvm-svn: 26219	2006-02-15 21:59:04 +00:00
Jim Laskey	2eea436192	Should not combine ISD::LOCATIONs until we have scheme to remove from MachineDebugInfo tables. llvm-svn: 26216	2006-02-15 19:34:44 +00:00
Evan Cheng	e2038bdeee	Lower memcpy with small constant size operand into a series of load / store ops. llvm-svn: 26195	2006-02-15 01:54:51 +00:00
Evan Cheng	0451499b3c	Doh again! llvm-svn: 26188	2006-02-14 23:05:54 +00:00
Evan Cheng	db2a7a736a	Keep to < 80 cols llvm-svn: 26177	2006-02-14 20:12:38 +00:00
Evan Cheng	038521ef76	Missed a break so memcpy cases fell through to memset. Doh. llvm-svn: 26176	2006-02-14 19:45:56 +00:00
Evan Cheng	d502610604	Fixed a build breakage. llvm-svn: 26175	2006-02-14 09:11:59 +00:00
Evan Cheng	4b40a42653	Rename maxStoresPerMemSet to maxStoresPerMemset, etc. llvm-svn: 26174	2006-02-14 08:38:30 +00:00
Evan Cheng	81fcea8aa2	Expand memset dst, c, size to a series of stores if size falls below the target specific theshold, e.g. 16 for x86. llvm-svn: 26171	2006-02-14 08:22:34 +00:00
Chris Lattner	1784a9d267	now that libcalls don't suck, we can remove this hack llvm-svn: 26164	2006-02-14 05:39:35 +00:00
Chris Lattner	8e2ee7358f	Fix a latent bug in the call sequence handling stuff. Some targets (e.g. x86) create these nodes with flag results. Remember that we legalized them. llvm-svn: 26156	2006-02-14 00:55:02 +00:00
Jim Laskey	390c63e9d9	Rename to better reflect usage (current and planned.) llvm-svn: 26145	2006-02-13 12:50:39 +00:00
Chris Lattner	462505fc5f	Completely rewrite libcall insertion by the legalizer, providing the following handy-dandy properties: 1. it is always correct now 2. it is much faster than before 3. it is easier to understand This implementation builds off of the recent simplifications of the legalizer that made it single-pass instead of iterative. This fixes JM/lencod, JM/ldecod, and CodeGen/Generic/2006-02-12-InsertLibcall.ll (at least on PPC). llvm-svn: 26144	2006-02-13 09:18:02 +00:00
Jim Laskey	5995d0160c	Reorg for integration with gcc4. Old style debug info will not be passed though to SelIDAG. llvm-svn: 26115	2006-02-11 01:01:30 +00:00
Evan Cheng	a1ef3ec5b5	Added SelectionDAG::InsertISelMapEntry(). This is used to workaround the gcc problem where it inline the map insertion call too aggressively. Before this change it was producing a frame size of 24k for Select_store(), now it's down to 10k (by calling this method rather than calling the map insertion operator). llvm-svn: 26094	2006-02-09 22:11:03 +00:00
Evan Cheng	d3f1db93c1	More changes to reduce frame size. Move all getTargetNode() out of SelectionDAG.h into SelectionDAG.cpp. This prevents them from being inlined. Change getTargetNode() so they return SDNode * instead of SDOperand to prevent copying. It should also help compilation speed. llvm-svn: 26083	2006-02-09 07:15:23 +00:00
Chris Lattner	4576bb74d5	Make MachineConstantPool entries alignments explicit llvm-svn: 26071	2006-02-09 02:23:13 +00:00
Chris Lattner	a10e23c19f	Compile this: xori r6, r2, 1 rlwinm r6, r6, 0, 31, 31 cmpwi cr0, r6, 0 bne cr0, LBB1_3 ; endif to this: rlwinm r6, r2, 0, 31, 31 cmpwi cr0, r6, 0 beq cr0, LBB1_3 ; endif llvm-svn: 26047	2006-02-08 02:13:15 +00:00
Nate Begeman	8c9cd461df	Back out previous commit, it isn't safe. llvm-svn: 26006	2006-02-05 08:23:00 +00:00
Nate Begeman	3dc8b89493	fold c1 << (x + c2) into (c1 << c2) << x. fix a warning. llvm-svn: 26005	2006-02-05 08:07:24 +00:00
Nate Begeman	c89fdf1eb3	Handle urem by shifted powers of 2. llvm-svn: 26001	2006-02-05 07:36:48 +00:00
Nate Begeman	25d178bece	handle combining A / (B << N) into A >>u (log2(B)+N) when B is a power of 2 llvm-svn: 26000	2006-02-05 07:20:23 +00:00
Evan Cheng	d37645c07d	* Added SDNode::isOnlyUse(). * Fix hasNUsesOfValue(), it should be const. llvm-svn: 25990	2006-02-05 06:29:23 +00:00
Jeff Cohen	95ae171d5b	Fix VC++ warning. llvm-svn: 25975	2006-02-04 16:20:31 +00:00
Evan Cheng	f9adce90bf	Get rid of some memory leaks identified by Valgrind llvm-svn: 25960	2006-02-04 06:49:00 +00:00
Chris Lattner	3b48431333	Add initial support for immediates. This allows us to compile this: int %rlwnm(int %A, int %B) { %C = call int asm "rlwnm $0, $1, $2, $3, $4", "=r,r,r,n,n"(int %A, int %B, int 4, int 17) ret int %C } into: _rlwnm: or r2, r3, r3 or r3, r4, r4 rlwnm r2, r2, r3, 4, 17 ;; note the immediates :) or r3, r2, r2 blr llvm-svn: 25955	2006-02-04 02:26:14 +00:00
Chris Lattner	65ad53feb3	Initial early support for non-register operands, like immediates llvm-svn: 25952	2006-02-04 02:16:44 +00:00
Nate Begeman	dc7bba9ffe	Add a framework for eliminating instructions that produces undemanded bits. llvm-svn: 25945	2006-02-03 22:24:05 +00:00
Chris Lattner	f68fd20286	remove some #ifdef'd out code, which should properly be in the dag combiner anyway. llvm-svn: 25941	2006-02-03 20:13:59 +00:00
Chris Lattner	6091407783	remove dead fn llvm-svn: 25935	2006-02-03 06:51:34 +00:00
Nate Begeman	22e251abf1	Add common code for reassociating ops in the dag combiner llvm-svn: 25934	2006-02-03 06:46:56 +00:00
Evan Cheng	02b5b9cdd6	Added case HANDLENODE to getOperationName(). llvm-svn: 25920	2006-02-03 01:33:01 +00:00
Chris Lattner	49beaf40fc	Turn any_extend nodes into zero_extend nodes when it allows us to remove an and instruction. This allows us to compile stuff like this: bool %X(int %X) { %Y = add int %X, 14 %Z = setne int %Y, 12345 ret bool %Z } to this: _X: cmpl $12331, 4(%esp) setne %al movzbl %al, %eax ret instead of this: _X: cmpl $12331, 4(%esp) setne %al movzbl %al, %eax andl $1, %eax ret This occurs quite a bit with the X86 backend. For example, 25 times in lambda, 30 times in 177.mesa, 14 times in galgel, 70 times in fma3d, 25 times in vpr, several hundred times in gcc, ~45 times in crafty, ~60 times in parser, ~140 times in eon, 110 times in perlbmk, 55 on gap, 16 times on bzip2, 14 times on twolf, and 1-2 times in many other SPEC2K programs. llvm-svn: 25901	2006-02-02 07:17:31 +00:00
Chris Lattner	49ce35542f	add two dag combines: (C1-X) == C2 --> X == C1-C2 (X+C1) == C2 --> X == C2-C1 This allows us to compile this: bool %X(int %X) { %Y = add int %X, 14 %Z = setne int %Y, 12345 ret bool %Z } into this: _X: cmpl $12331, 4(%esp) setne %al movzbl %al, %eax andl $1, %eax ret not this: _X: movl $14, %eax addl 4(%esp), %eax cmpl $12345, %eax setne %al movzbl %al, %eax andl $1, %eax ret Testcase here: Regression/CodeGen/X86/compare-add.ll nukage of the and coming up next. llvm-svn: 25898	2006-02-02 06:36:13 +00:00
Chris Lattner	0bd74558ae	make -debug output less newliney llvm-svn: 25895	2006-02-02 00:38:08 +00:00
Chris Lattner	7f5880b1c7	Implement matching constraints. We can now say things like this: %C = call int asm "xyz $0, $1, $2, $3", "=r,r,r,0"(int %A, int %B, int 4) and get: xyz r2, r3, r4, r2 note that the r2's are pinned together. Yaay for 2-address instructions. 2342 ---------------------------------------------------------------------- llvm-svn: 25893	2006-02-02 00:25:23 +00:00
Nate Begeman	01bd9d9911	* empty log message * llvm-svn: 25879	2006-02-01 19:05:15 +00:00
Chris Lattner	1558fc64f9	Implement simple register assignment for inline asms. This allows us to compile: int %test(int %A, int %B) { %C = call int asm "xyz $0, $1, $2", "=r,r,r"(int %A, int %B) ret int %C } into: (0x8906130, LLVM BB @0x8902220): %r2 = OR4 %r3, %r3 %r3 = OR4 %r4, %r4 INLINEASM <es:xyz $0, $1, $2>, %r2<def>, %r2, %r3 %r3 = OR4 %r2, %r2 BLR which asmprints as: _test: or r2, r3, r3 or r3, r4, r4 xyz $0, $1, $2 ;; need to print the operands now :) or r3, r2, r2 blr llvm-svn: 25878	2006-02-01 18:59:47 +00:00
Nate Begeman	7e7f439f85	Fix some of the stuff in the PPC README file, and clean up legalization of the SELECT_CC, BR_CC, and BRTWOWAY_CC nodes. llvm-svn: 25875	2006-02-01 07:19:44 +00:00
Chris Lattner	3a5ed55187	adjust to changes in InlineAsm interface. Fix a few minor bugs. llvm-svn: 25865	2006-02-01 01:28:23 +00:00
Evan Cheng	32be2dc0af	Allow the specification of explicit alignments for constant pool entries. llvm-svn: 25855	2006-01-31 22:23:14 +00:00
Evan Cheng	2443ab932d	Allow custom lowering of fabs. I forgot to check in this change which caused several test failures. llvm-svn: 25852	2006-01-31 18:14:25 +00:00
Chris Lattner	e9721b2984	Only insert an AND when converting from BR_COND to BRCC if needed. llvm-svn: 25832	2006-01-31 05:04:52 +00:00
Chris Lattner	2e56e89452	Handle physreg input/outputs. We now compile this: int %test_cpuid(int %op) { %B = alloca int %C = alloca int %D = alloca int %A = call int asm "cpuid", "=eax,==ebx,==ecx,==edx,eax"(int* %B, int* %C, int* %D, int %op) %Bv = load int* %B %Cv = load int* %C %Dv = load int* %D %x = add int %A, %Bv %y = add int %x, %Cv %z = add int %y, %Dv ret int %z } to this: _test_cpuid: sub %ESP, 16 mov DWORD PTR [%ESP], %EBX mov %EAX, DWORD PTR [%ESP + 20] cpuid mov DWORD PTR [%ESP + 8], %ECX mov DWORD PTR [%ESP + 12], %EBX mov DWORD PTR [%ESP + 4], %EDX mov %ECX, DWORD PTR [%ESP + 12] add %EAX, %ECX mov %ECX, DWORD PTR [%ESP + 8] add %EAX, %ECX mov %ECX, DWORD PTR [%ESP + 4] add %EAX, %ECX mov %EBX, DWORD PTR [%ESP] add %ESP, 16 ret ... note the proper register allocation. :) it is unclear to me why the loads aren't folded into the adds. llvm-svn: 25827	2006-01-31 02:03:41 +00:00
Chris Lattner	f263a23735	Fix a bug in my legalizer reworking that caused the X86 backend to not get a chance to custom legalize setcc, which broke a bunch of C++ Codes. Testcase here: CodeGen/X86/2006-01-30-LongSetcc.ll llvm-svn: 25821	2006-01-30 22:43:50 +00:00
Chris Lattner	d6f5ae4455	don't insert an and node if it isn't needed here, this can prevent folding of lowered target nodes. llvm-svn: 25804	2006-01-30 04:22:28 +00:00
Chris Lattner	f0b24d2dc0	Move MaskedValueIsZero from the DAGCombiner to the TargetLowering interface,making isMaskedValueZeroForTargetNode simpler, and useable from other partsof the compiler. llvm-svn: 25803	2006-01-30 04:09:27 +00:00
Chris Lattner	3b40e64aa3	pass the address of MaskedValueIsZero into isMaskedValueZeroForTargetNode, to permit recursion llvm-svn: 25799	2006-01-30 03:49:37 +00:00
Chris Lattner	4d1ea71a31	Fix RET of promoted values on targets that custom expand RET to a target node. llvm-svn: 25794	2006-01-29 21:02:23 +00:00
Chris Lattner	2c748afd6c	cleanups to the ValueTypeActions interface llvm-svn: 25785	2006-01-29 08:42:06 +00:00
Chris Lattner	ccb4476c87	Remove some special case hacks for CALLSEQ_*, using UpdateNodeOperands instead. llvm-svn: 25780	2006-01-29 07:58:15 +00:00
Chris Lattner	2f292789dc	Allow custom expansion of ConstantVec nodes. PPC will use this in the future. llvm-svn: 25774	2006-01-29 06:34:16 +00:00
Chris Lattner	758b0ac54b	Legalize ConstantFP into TargetConstantFP when the target allows. Implement custom expansion of ConstantFP nodes. llvm-svn: 25772	2006-01-29 06:26:56 +00:00
Chris Lattner	678da98835	eliminate uses of SelectionDAG::getBR2Way_CC llvm-svn: 25767	2006-01-29 06:00:45 +00:00
Chris Lattner	d02b05473c	Use the new "UpdateNodeOperands" method to simplify LegalizeDAG and make it faster. This cuts about 120 lines of code out of the legalizer (mostly code checking to see if operands have changed). It also fixes an ugly performance issue, where the legalizer cloned the entire graph after any change. Now the "UpdateNodeOperands" method gives it a chance to reuse nodes if the operands of a node change but not its opcode or valuetypes. This speeds up instruction selection time on kimwitu++ by about 8.2% with a release build. llvm-svn: 25746	2006-01-28 10:58:55 +00:00
Chris Lattner	580b12ad34	add another method variant llvm-svn: 25744	2006-01-28 10:09:25 +00:00
Chris Lattner	f34156e8cb	add some methods for updating nodes llvm-svn: 25742	2006-01-28 09:32:45 +00:00
Chris Lattner	eb63751499	minor tweaks llvm-svn: 25740	2006-01-28 08:31:04 +00:00
Chris Lattner	689bdcc9cf	move a bunch of code, no other change. llvm-svn: 25739	2006-01-28 08:25:58 +00:00
Chris Lattner	fcfda5a174	remove a couple more now-extraneous legalizeop's llvm-svn: 25738	2006-01-28 08:22:56 +00:00
Chris Lattner	364b89a784	fix a bug llvm-svn: 25737	2006-01-28 07:42:08 +00:00
Chris Lattner	9dcce6da8e	Several major changes: 1. Pull out the expand cases for BSWAP and CT* into a separate function, reducing the size of LegalizeOp. 2. Fix a bug where expand(bswap i64) was wrong when i64 is legal. 3. Changed LegalizeOp/PromoteOp so that the legalizer never needs to be iterative. It now operates in a single pass over the nodes. 4. Simplify a LOT of code, with a net reduction of ~280 lines. llvm-svn: 25736	2006-01-28 07:39:30 +00:00
Chris Lattner	fd4a7f76a9	Eliminate the need for ExpandOp to set 'needsanotheriteration', as it already relegalizes the stuff it returns. Add the ability to custom expand ADD/SUB, so that targets don't need to deal with ADD_PARTS/SUB_PARTS if they don't want. Fix some obscure potential bugs and simplify code. llvm-svn: 25732	2006-01-28 05:07:51 +00:00
Chris Lattner	10f677508f	Instead of making callers of ExpandLibCall legalize the result, make ExpandLibCall do it itself. llvm-svn: 25731	2006-01-28 04:28:26 +00:00
Chris Lattner	a593acfe66	Eliminate the need to do another iteration of the legalizer after inserting a libcall. llvm-svn: 25730	2006-01-28 04:23:12 +00:00
Chris Lattner	98ed05c81d	remove method I just added llvm-svn: 25728	2006-01-28 03:43:09 +00:00
Chris Lattner	43b867dd3b	add a new callback llvm-svn: 25727	2006-01-28 03:37:03 +00:00
Nate Begeman	595ec734fc	Implement Promote for VAARG, and allow it to be custom promoted for people who don't want the default behavior (Alpha). llvm-svn: 25726	2006-01-28 03:14:31 +00:00
Nate Begeman	af397cec0b	Add a missing case to the dag combiner. llvm-svn: 25723	2006-01-28 01:06:30 +00:00
Chris Lattner	fb16a62fba	Remove the ISD::CALL and ISD::TAILCALL nodes llvm-svn: 25721	2006-01-28 00:18:58 +00:00
Nate Begeman	8c47c3a3b1	Remove TLI.LowerReturnTo, and just let targets custom lower ISD::RET for the same functionality. This addresses another piece of bug 680. Next, on to fixing Alpha VAARG, which I broke last time. llvm-svn: 25696	2006-01-27 21:09:22 +00:00
Chris Lattner	4df279cfda	Teach the scheduler to emit the appropriate INLINEASM MachineInstr for an ISD::INLINEASM node. llvm-svn: 25668	2006-01-26 23:28:04 +00:00
Chris Lattner	476e67be14	initial selectiondag support for new INLINEASM node. Note that inline asms with outputs or inputs are not supported yet. :) llvm-svn: 25664	2006-01-26 22:24:51 +00:00
Evan Cheng	c4c339c3d0	Clean up some code; improve efficiency; and fixed a potential bug involving chain successors. llvm-svn: 25630	2006-01-26 00:30:29 +00:00
Reid Spencer	5edde66863	Don't break the optimized build (by incorrect placement of #endif) llvm-svn: 25613	2006-01-25 21:49:13 +00:00
Evan Cheng	1880f8db02	No need to keep track of top and bottom nodes in a group since the vector is already in order. Thanks Jim for pointing it out. llvm-svn: 25608	2006-01-25 18:54:24 +00:00
Nate Begeman	e74795cd70	First part of bug 680: Remove TLI.LowerVA* and replace it with SDNodes that are lowered the same way as everything else. llvm-svn: 25606	2006-01-25 18:21:52 +00:00
Jeff Cohen	fb20616aa6	Fix VC++ compilation error. llvm-svn: 25604	2006-01-25 17:17:49 +00:00
Evan Cheng	ab49556cf4	Bottom up register usage reducing list scheduler. llvm-svn: 25601	2006-01-25 09:14:32 +00:00
Evan Cheng	fbc88a624a	Keep track of bottom / top element of a set of flagged nodes. llvm-svn: 25600	2006-01-25 09:13:41 +00:00
Evan Cheng	a6eff8a432	If scheduler choice is the default (-sched=default), use target scheduling preference to determine which scheduler to use. SchedulingForLatency == Breadth first; SchedulingForRegPressure == bottom up register reduction list scheduler. llvm-svn: 25599	2006-01-25 09:12:57 +00:00
Chris Lattner	f9a1e3aadc	Fix an infinite loop I caused by making sure to legalize the flag operand of CALLSEQ_* nodes llvm-svn: 25582	2006-01-24 05:48:21 +00:00
Jeff Cohen	12f8441c03	Fix VC++ compilation error. llvm-svn: 25577	2006-01-24 04:43:17 +00:00
Andrew Lenharth	683352382e	another couple selects llvm-svn: 25551	2006-01-23 21:51:14 +00:00
Andrew Lenharth	c28563874c	another selectto llvm-svn: 25548	2006-01-23 20:59:12 +00:00
Jim Laskey	b8566fa10a	Typo. llvm-svn: 25545	2006-01-23 13:34:04 +00:00
Evan Cheng	31272347d4	Skeleton of the list schedule. llvm-svn: 25544	2006-01-23 08:26:10 +00:00
Evan Cheng	421cfe8006	Minor clean up. llvm-svn: 25543	2006-01-23 08:25:34 +00:00
Chris Lattner	763dfd7723	Fix Regression/CodeGen/SparcV8/2006-01-22-BitConvertLegalize.ll by making sure that the result of expanding a BIT_CONVERT node is itself legalized. llvm-svn: 25538	2006-01-23 07:30:46 +00:00
Evan Cheng	87063b9986	Remove a couple of unnecessary #include's llvm-svn: 25535	2006-01-23 07:21:01 +00:00
Evan Cheng	c1e1d9724d	Factor out more instruction scheduler code to the base class. llvm-svn: 25532	2006-01-23 07:01:07 +00:00
Chris Lattner	deda32a786	Fix bugs lowering stackrestore, fixing 2004-08-12-InlinerAndAllocas.c on PPC. llvm-svn: 25522	2006-01-23 05:22:07 +00:00
Chris Lattner	de02d7727f	Add explicit #includes of <iostream> llvm-svn: 25515	2006-01-22 23:41:00 +00:00
Chris Lattner	e23928c67f	Fix a bug in a recent refactor that caused a bunch of programs to miscompile or the compiler to crash. llvm-svn: 25503	2006-01-21 19:12:11 +00:00
Chris Lattner	44cab00045	Fix CodeGen/PowerPC/2006-01-20-ShiftPartsCrash.ll llvm-svn: 25496	2006-01-21 04:27:00 +00:00
Evan Cheng	739a6a456e	Do some code refactoring on Jim's scheduler in preparation of the new list scheduler. llvm-svn: 25493	2006-01-21 02:32:06 +00:00
Chris Lattner	15afe462a8	remove some unintentionally committed code llvm-svn: 25483	2006-01-20 18:40:10 +00:00
Chris Lattner	222ceabbee	If the target doesn't support f32 natively, insert the FP_EXTEND in target-indep code, so that the LowerReturn code doesn't have to handle it. llvm-svn: 25482	2006-01-20 18:38:32 +00:00
Evan Cheng	13e8c9d6de	Another typo llvm-svn: 25440	2006-01-19 04:54:52 +00:00
Andrew Lenharth	7599b6e4af	was ignoring the legalized chain in this case, fixed SPASS on alpha llvm-svn: 25428	2006-01-18 23:19:08 +00:00
Nate Begeman	569c439567	Get rid of code in the DAGCombiner that is duplicated in SelectionDAG.cpp Now all constant folding in the code generator is in one place. llvm-svn: 25426	2006-01-18 22:35:16 +00:00
Chris Lattner	e2ee190821	Temporary work around for a libcall insertion bug: If a target doesn't support FSIN/FCOS nodes, do not lower sin/cos to them. llvm-svn: 25425	2006-01-18 21:50:14 +00:00
Chris Lattner	5fee908be5	Fix a backwards conditional that caused an inf loop in some cases. This fixes: test/Regression/CodeGen/Generic/2005-01-18-SetUO-InfLoop.ll llvm-svn: 25419	2006-01-18 19:13:41 +00:00
Robert Bocchino	03e95af9f7	Support for the insertelement operation. llvm-svn: 25405	2006-01-17 20:06:42 +00:00
Evan Cheng	6f86a7db07	Bug fix: missing LegalizeOp() on newly created nodes. llvm-svn: 25401	2006-01-17 19:47:13 +00:00
Jim Laskey	b9966029fe	Adding basic support for Dwarf line number debug information. I promise to keep future commits smaller. llvm-svn: 25396	2006-01-17 17:31:53 +00:00
Reid Spencer	b4f9a6f110	For PR411: This patch is an incremental step towards supporting a flat symbol table. It de-overloads the intrinsic functions by providing type-specific intrinsics and arranging for automatically upgrading from the old overloaded name to the new non-overloaded name. Specifically: llvm.isunordered -> llvm.isunordered.f32, llvm.isunordered.f64 llvm.sqrt -> llvm.sqrt.f32, llvm.sqrt.f64 llvm.ctpop -> llvm.ctpop.i8, llvm.ctpop.i16, llvm.ctpop.i32, llvm.ctpop.i64 llvm.ctlz -> llvm.ctlz.i8, llvm.ctlz.i16, llvm.ctlz.i32, llvm.ctlz.i64 llvm.cttz -> llvm.cttz.i8, llvm.cttz.i16, llvm.cttz.i32, llvm.cttz.i64 New code should not use the overloaded intrinsic names. Warnings will be emitted if they are used. llvm-svn: 25366	2006-01-16 21:12:35 +00:00
Nate Begeman	1e1eb5ee6c	Constant fold ctpop/ctlz/cttz, and a couple other small cleanups llvm-svn: 25357	2006-01-16 08:07:10 +00:00
Nate Begeman	2642a35f4c	Expand case for 64b Legalize, even though no one should end up using this (itanium supports bswap natively, alpha should custom lower it using the VAX floating point swapload, ha ha). llvm-svn: 25356	2006-01-16 07:59:13 +00:00
Chris Lattner	fcdb420baf	Disable two transformations that contribute to bus errors on SparcV8. llvm-svn: 25339	2006-01-15 18:58:59 +00:00
Chris Lattner	59b82f9848	Allow the target to specify 'expand' if they just require the amount to be subtracted from the stack pointer. llvm-svn: 25331	2006-01-15 08:54:32 +00:00
Chris Lattner	2d59142613	Fix custom lowering of dynamic_stackalloc llvm-svn: 25329	2006-01-15 08:43:08 +00:00
Chris Lattner	9597b33d58	add a missing node name llvm-svn: 25327	2006-01-15 08:39:35 +00:00
Chris Lattner	02011c9a4f	Token chain results are not always the first or last result. Consider copyfromreg nodes, where they are the middle result (the flag result is last) llvm-svn: 25325	2006-01-14 22:41:46 +00:00
Nate Begeman	542c3c17a9	Remove some duplicated code llvm-svn: 25313	2006-01-14 03:18:27 +00:00
Nate Begeman	2fba8a3aaa	bswap implementation llvm-svn: 25312	2006-01-14 03:14:10 +00:00
Chris Lattner	ed9b3e1c0a	If a target specified a stack pointer with setStackPointerRegisterToSaveRestore, lower STACKSAVE/STACKRESTORE into a copy from/to that register. llvm-svn: 25276	2006-01-13 17:48:44 +00:00
Chris Lattner	b32664583b	Compile llvm.stacksave/restore into STACKSAVE/STACKRESTORE nodes, and allow targets to custom expand them as they desire. llvm-svn: 25273	2006-01-13 02:50:02 +00:00
Chris Lattner	a5110e854d	add stacksave/stackrestore nodes llvm-svn: 25270	2006-01-13 02:39:42 +00:00
Chris Lattner	6c9c250dcd	Add "support" for stacksave/stackrestore to the dag isel llvm-svn: 25268	2006-01-13 02:24:42 +00:00
Chris Lattner	3470b5dee6	Add a simple missing fold to produce this: subfic r3, r2, 33 instead of this: subfic r2, r2, 32 addi r3, r2, 1 llvm-svn: 25255	2006-01-12 20:22:43 +00:00
Chris Lattner	b1ee616de9	Don't create rotate instructions in unsupported types, because we don't have promote/expand code yet. This fixes the 177.mesa failure on PPC. llvm-svn: 25250	2006-01-12 18:57:33 +00:00
Evan Cheng	7f4ec8274f	Allow custom lowering of DYNAMIC_STACKALLOC. llvm-svn: 25224	2006-01-11 22:14:47 +00:00
Evan Cheng	982493300e	ignore register #0 llvm-svn: 25223	2006-01-11 22:13:48 +00:00
Nate Begeman	1b8121b227	Add bswap, rotl, and rotr nodes Add dag combiner code to recognize rotl, rotr Add ppc code to match rotl Targets should add rotl/rotr patterns if they have them llvm-svn: 25222	2006-01-11 21:21:00 +00:00
Chris Lattner	fb5f46541c	silence a warning llvm-svn: 25184	2006-01-10 19:43:26 +00:00
Robert Bocchino	2c966e7617	Added selection DAG support for the extractelement operation. llvm-svn: 25179	2006-01-10 19:04:57 +00:00
Chris Lattner	90ba544826	Fix an exponential function in libcall insertion to not be exponential. :) llvm-svn: 25165	2006-01-09 23:21:49 +00:00
Evan Cheng	870e4f8e38	* Allow custom lowering of ADD_PARTS, SUB_PARTS, SHL_PARTS, SRA_PARTS, and SRL_PARTS. * Fix a bug that caused *_PARTS to be custom lowered twice. llvm-svn: 25157	2006-01-09 18:31:59 +00:00
Evan Cheng	53a1f57fc5	New getNode() variants. llvm-svn: 25156	2006-01-09 18:29:18 +00:00
Chris Lattner	fae8afb77f	Unbreak the build :( llvm-svn: 25124	2006-01-06 05:47:48 +00:00
Evan Cheng	85c973cda9	Revert the previous check-in. Leave shl x, 1 along for target to deal with. llvm-svn: 25121	2006-01-06 01:56:02 +00:00
Evan Cheng	b03f9b32d2	fold (shl x, 1) -> (add x, x) llvm-svn: 25120	2006-01-06 01:06:31 +00:00
Evan Cheng	f35b1c837f	Support for custom lowering of ISD::RET. llvm-svn: 25116	2006-01-06 00:41:43 +00:00
Jim Laskey	762e9ec06c	Added initial support for DEBUG_LABEL allowing debug specific labels to be inserted in the code. llvm-svn: 25104	2006-01-05 01:25:28 +00:00
Jim Laskey	219d559824	Applied some recommend changes from sabre. The dominate one beginning "let the pass manager do it's thing." Fixes crash when compiling -g files and suppresses dwarf statements if no debug info is present. llvm-svn: 25100	2006-01-04 22:28:25 +00:00
Jim Laskey	0da76a676a	Add unique id to debug location for debug label use (work in progress.) llvm-svn: 25096	2006-01-04 15:04:11 +00:00
Jim Laskey	6f9ff633a6	Change how MachineDebugInfo is fetched. llvm-svn: 25089	2006-01-04 13:42:59 +00:00
Nate Begeman	164db3a7eb	Make sure to pass the offset into the new node, so that we don't silently drop it on the floor. llvm-svn: 25044	2005-12-30 00:10:38 +00:00
Duraid Madina	fb6a914ca7	purity++ llvm-svn: 25041	2005-12-29 05:59:19 +00:00
Andrew Lenharth	30db2ec59f	allow custom lowering to return null for legal results llvm-svn: 25007	2005-12-25 01:07:37 +00:00
Andrew Lenharth	7259426d88	Support Custom lowering of a few more operations. Alpha needs to custom lower DIV and REM llvm-svn: 25006	2005-12-24 23:42:32 +00:00
Jim Laskey	bdba3e2a46	Remove redundant debug locations. llvm-svn: 24995	2005-12-23 20:08:28 +00:00
Chris Lattner	c7037abc5b	unbreak the build :-/ llvm-svn: 24992	2005-12-23 16:12:20 +00:00
Evan Cheng	31d15fa093	Allow custom lowering of LOAD, EXTLOAD, ZEXTLOAD, STORE, and TRUNCSTORE. Not currently used. llvm-svn: 24988	2005-12-23 07:29:34 +00:00
Chris Lattner	26943b9691	Simplify store(bitconv(x)) to store(x). This allows us to compile this: void bar(double Y, double X) { X = Y; } to this: bar: save -96, %o6, %o6 st %i1, [%i2+4] st %i0, [%i2] restore %g0, %g0, %g0 retl nop instead of this: bar: save -104, %o6, %o6 st %i1, [%i6+-4] st %i0, [%i6+-8] ldd [%i6+-8], %f0 std %f0, [%i2] restore %g0, %g0, %g0 retl nop on sparcv8. llvm-svn: 24983	2005-12-23 05:48:07 +00:00
Chris Lattner	54560f6887	fold (conv (load x)) -> (load (conv)x). This allows us to compile this: void foo(double); void bar(double X) { foo(*X); } To this: bar: save -96, %o6, %o6 ld [%i0+4], %o1 ld [%i0], %o0 call foo nop restore %g0, %g0, %g0 retl nop instead of this: bar: save -104, %o6, %o6 ldd [%i0], %f0 std %f0, [%i6+-8] ld [%i6+-4], %o1 ld [%i6+-8], %o0 call foo nop restore %g0, %g0, %g0 retl nop on SparcV8. llvm-svn: 24982	2005-12-23 05:44:41 +00:00
Chris Lattner	efbbedbf4a	Fold bitconv(bitconv(x)) -> x. We now compile this: void foo(double); void bar(double X) { foo(X); } to this: bar: save -96, %o6, %o6 or %g0, %i0, %o0 or %g0, %i1, %o1 call foo nop restore %g0, %g0, %g0 retl nop instead of this: bar: save -112, %o6, %o6 st %i1, [%i6+-4] st %i0, [%i6+-8] ldd [%i6+-8], %f0 std %f0, [%i6+-16] ld [%i6+-12], %o1 ld [%i6+-16], %o0 call foo nop restore %g0, %g0, %g0 retl nop on V8. llvm-svn: 24981	2005-12-23 05:37:50 +00:00
Chris Lattner	a187460552	constant fold bits_convert in getNode and in the dag combiner for fp<->int conversions. This allows V8 to compiles this: void %test() { call float %test2( float 1.000000e+00, float 2.000000e+00, double 3.000000e+00, double* null ) ret void } into: test: save -96, %o6, %o6 sethi 0, %o3 sethi 1049088, %o2 sethi 1048576, %o1 sethi `1040384`, %o0 or %g0, %o3, %o4 call test2 nop restore %g0, %g0, %g0 retl nop instead of: test: save -112, %o6, %o6 sethi 0, %o4 sethi 1049088, %l0 st %o4, [%i6+-12] st %l0, [%i6+-16] ld [%i6+-12], %o3 ld [%i6+-16], %o2 sethi 1048576, %o1 sethi `1040384`, %o0 call test2 nop restore %g0, %g0, %g0 retl nop llvm-svn: 24980	2005-12-23 05:30:37 +00:00
Chris Lattner	884eb3adc3	Fix a pasto llvm-svn: 24973	2005-12-23 00:52:30 +00:00
Chris Lattner	9eae8d5d03	fix a thinko in the bit_convert handling code llvm-svn: 24972	2005-12-23 00:50:25 +00:00
Chris Lattner	36e663d6e1	add very simple support for the BIT_CONVERT node llvm-svn: 24970	2005-12-23 00:16:34 +00:00
Chris Lattner	177d7af5d5	remove dead code llvm-svn: 24965	2005-12-22 21:16:08 +00:00
Chris Lattner	1408c05a8b	The 81st column doesn't like code in it. llvm-svn: 24943	2005-12-22 05:23:45 +00:00
Evan Cheng	9cdc16c6d3	* Fix a GlobalAddress lowering bug. * Teach DAG combiner about X86ISD::SETCC by adding a TargetLowering hook. llvm-svn: 24921	2005-12-21 23:05:39 +00:00
Jim Laskey	9e296bee9a	Disengage DEBUG_LOC from non-PPC targets. llvm-svn: 24919	2005-12-21 20:51:37 +00:00
Evan Cheng	c1583dbd63	* Added support for X86 RET with an additional operand to specify number of bytes to pop off stack. * Added support for X86 SETCC. llvm-svn: 24917	2005-12-21 20:21:51 +00:00
Chris Lattner	0fab459362	make sure to relegalize all cases llvm-svn: 24911	2005-12-21 19:40:42 +00:00
Chris Lattner	44c07ed61a	enable the gep isel opt llvm-svn: 24910	2005-12-21 19:36:36 +00:00
Chris Lattner	ac12f68424	fix a bug I introduced that broke recursive expansion of nodes (e.g. scalarizing vectors) llvm-svn: 24905	2005-12-21 18:02:52 +00:00
Chris Lattner	803a575616	Lower ConstantAggregateZero into zeros llvm-svn: 24890	2005-12-21 02:43:26 +00:00
Evan Cheng	6af02635a7	Added a hook to print out names of target specific DAG nodes. llvm-svn: 24877	2005-12-20 06:22:03 +00:00
Chris Lattner	2af3ee4bdd	Fix a nasty latent bug in the legalizer that was triggered by my patch last night, breaking crafty and twolf. Make sure that the newly found legal nodes are themselves not re-legalized until the next iteration. Also, since this functionality exists now, we can reduce number of legalizer iterations by depending on this behavior instead of having to misuse 'do another iteration' to get the same effect. llvm-svn: 24875	2005-12-20 00:53:54 +00:00
Evan Cheng	6fc31046aa	X86 conditional branch support. llvm-svn: 24870	2005-12-19 23:12:38 +00:00
Evan Cheng	9fd9541367	Print out opcode number if it's an unknown target node. llvm-svn: 24869	2005-12-19 23:11:49 +00:00
Chris Lattner	50b2d302d5	Fix a case where the DAG Combiner would accidentally CSE flag-producing nodes, creating graphs that cannot be scheduled. llvm-svn: 24866	2005-12-19 22:21:21 +00:00
Jim Laskey	9b9688aeb8	Amend comment. llvm-svn: 24861	2005-12-19 16:32:26 +00:00
Jim Laskey	ce23987e6b	Create a strong dependency for loads following stores. This will leave a latency period between the two. llvm-svn: 24860	2005-12-19 16:30:13 +00:00
Chris Lattner	c06da626b4	Make sure to relegalize new nodes llvm-svn: 24843	2005-12-18 23:54:29 +00:00
Jeff Cohen	c7cb351aac	Keep VC++ happy. llvm-svn: 24835	2005-12-18 22:20:05 +00:00
Chris Lattner	ebcfa0c210	More corrections for flagged copyto/from reg llvm-svn: 24828	2005-12-18 15:36:21 +00:00
Chris Lattner	e3c67e97c7	legalize copytoreg and copyfromreg nodes that have flag operands correctly. llvm-svn: 24826	2005-12-18 15:27:43 +00:00
Jim Laskey	c97b7d0be9	Fix a bug Sabre was having where the DAG root was a group. The group dominator needed to be added to the ordering list, not the first member of the group. llvm-svn: 24816	2005-12-18 04:40:52 +00:00
Jim Laskey	e220821deb	Groups were not emitted if the dominator node and the node in the ordering list were not the same node. Ultimately the test was bogus. llvm-svn: 24815	2005-12-18 03:59:21 +00:00
Chris Lattner	cf12118965	Simplify code llvm-svn: 24806	2005-12-18 01:03:46 +00:00
Chris Lattner	bf0bd99e03	allow custom expansion of BR_CC llvm-svn: 24804	2005-12-17 23:46:46 +00:00
Evan Cheng	225a4d0d6d	X86 lowers SELECT to a cmp / test followed by a conditional move. llvm-svn: 24754	2005-12-17 01:21:05 +00:00
Jim Laskey	7c462768ed	Added source file/line correspondence for dwarf (PowerPC only at this point.) llvm-svn: 24748	2005-12-16 22:45:29 +00:00
Chris Lattner	83e4407379	Don't create SEXTLOAD/ZEXTLOAD instructions that the target doesn't support if after legalize. This fixes IA64 failures. llvm-svn: 24725	2005-12-15 19:02:38 +00:00
Chris Lattner	d39c60fcc8	When folding loads into ops, immediately replace uses of the op with the load. This reduces number of worklist iterations and avoid missing optimizations depending on folding of things into sext_inreg nodes (which aren't supported by all targets). Tested by Regression/CodeGen/X86/extend.ll:test2 llvm-svn: 24712	2005-12-14 19:25:30 +00:00
Chris Lattner	7dac1083da	Fix the (zext (zextload)) case to trigger, similarly for sign extends. Allow (zext (truncate)) to apply after legalize if the target supports AND (which all do). This compiles short %foo() { %tmp.0 = load ubyte* %X ; <ubyte> [#uses=1] %tmp.3 = cast ubyte %tmp.0 to short ; <short> [#uses=1] ret short %tmp.3 } to: _foo: movzbl _X, %eax ret instead of: _foo: movzbl _X, %eax movzbl %al, %eax ret thanks to Evan for pointing this out. llvm-svn: 24709	2005-12-14 19:05:06 +00:00
Chris Lattner	f753d1a574	Fix a miscompilation in crafty due to a recent patch llvm-svn: 24706	2005-12-14 07:58:38 +00:00
Evan Cheng	bce7c47306	Fold (zext (load x) to (zextload x). llvm-svn: 24702	2005-12-14 02:19:23 +00:00
Chris Lattner	5d4e61dd87	Don't lump the filename and working dir together llvm-svn: 24697	2005-12-13 17:40:33 +00:00
Nate Begeman	956aef45c9	Lowering constant pool entries on ppc exposed a bug in the recently added ConstantVec legalizing code, which would return constantpool nodes that were not of the target's pointer type. llvm-svn: 24691	2005-12-13 03:03:23 +00:00
Chris Lattner	9e8b633ec1	Accept and ignore prefetches for now llvm-svn: 24678	2005-12-12 22:51:16 +00:00
Chris Lattner	b42ce7ca63	Fix CodeGen/Generic/2005-12-12-ExpandSextInreg.ll llvm-svn: 24677	2005-12-12 22:27:43 +00:00
Chris Lattner	f1a54c0d14	Minor tweak to get isel opt llvm-svn: 24663	2005-12-11 09:05:13 +00:00
Nate Begeman	4e56db674c	Add support for TargetConstantPool nodes to the dag isel emitter, and use them in the PPC backend, to simplify some logic out of Select and SelectAddr. llvm-svn: 24657	2005-12-10 02:36:00 +00:00
Evan Cheng	dadc1057ac	Added new getNode and getTargetNode variants for X86 stores. llvm-svn: 24653	2005-12-10 00:37:58 +00:00
Chris Lattner	268d457b69	Teach legalize how to promote sext_inreg to fix a problem Andrew pointed out to me. llvm-svn: 24644	2005-12-09 17:32:47 +00:00
Chris Lattner	be73d6eece	improve code insertion in two ways: 1. Only forward subst offsets into loads and stores, not into arbitrary things, where it will likely become a load. 2. If the source is a cast from pointer, forward subst the cast as well, allowing us to fold the cast away (improving cases when the cast is from an alloca or global). This hasn't been fully tested, but does appear to further reduce register pressure and improve code. Lets let the testers grind on it a bit. :) llvm-svn: 24640	2005-12-08 08:00:12 +00:00
Nate Begeman	ae89d862f5	Fix a crash where ConstantVec nodes were being generated with the wrong type when the target did not support them. Also teach Legalize how to expand ConstantVecs. This allows us to generate _test: lwz r2, 12(r3) lwz r4, 8(r3) lwz r5, 4(r3) lwz r6, 0(r3) addi r2, r2, 4 addi r4, r4, 3 addi r5, r5, 2 addi r6, r6, 1 stw r2, 12(r3) stw r4, 8(r3) stw r5, 4(r3) stw r6, 0(r3) blr For: void %test(%v4i %P) { %T = load %v4i %P %S = add %v4i %T, <int 1, int 2, int 3, int 4> store %v4i %S, %v4i * %P ret void } On PowerPC. llvm-svn: 24633	2005-12-07 19:48:11 +00:00
Chris Lattner	57c882edf8	Only transform (sext (truncate x)) -> (sextinreg x) if before legalize or if the target supports the resultant sextinreg llvm-svn: 24632	2005-12-07 18:02:05 +00:00
Chris Lattner	cbd3d01a43	Teach the dag combiner to turn a truncate/sign_extend pair into a sextinreg when the types match up. This allows the X86 backend to compile: sbyte %toggle_value(sbyte* %tmp.1) { %tmp.2 = load sbyte* %tmp.1 ret sbyte %tmp.2 } to this: _toggle_value: mov %EAX, DWORD PTR [%ESP + 4] movsx %EAX, BYTE PTR [%EAX] ret instead of this: _toggle_value: mov %EAX, DWORD PTR [%ESP + 4] movsx %EAX, BYTE PTR [%EAX] movsx %EAX, %AL ret noticed in Shootout/objinst. -Chris llvm-svn: 24630	2005-12-07 07:11:03 +00:00
Nate Begeman	41b1cdc771	Teach the SelectionDAG ISel how to turn ConstantPacked values into constant nodes with vector types. Also teach the asm printer how to print ConstantPacked constant pool entries. This allows us to generate altivec code such as the following, which adds a vector constantto a packed float. LCPI1_0: <4 x float> < float 0.0e+0, float 0.0e+0, float 0.0e+0, float 1.0e+0 > .space 4 .space 4 .space 4 .long 1065353216 ; float 1 .text .align 4 .globl _foo _foo: lis r2, ha16(LCPI1_0) la r2, lo16(LCPI1_0)(r2) li r4, 0 lvx v0, r4, r2 lvx v1, r4, r3 vaddfp v0, v1, v0 stvx v0, r4, r3 blr For the llvm code: void %foo(<4 x float> * %a) { entry: %tmp1 = load <4 x float> * %a; %tmp2 = add <4 x float> %tmp1, < float 0.0, float 0.0, float 0.0, float 1.0 > store <4 x float> %tmp2, <4 x float> *%a ret void } llvm-svn: 24616	2005-12-06 06:18:55 +00:00
Chris Lattner	3539778883	Fix the #1 code quality problem that I have seen on X86 (and it also affects PPC and other targets). In a particular, consider code like this: struct Vector3 { double x, y, z; }; struct Matrix3 { Vector3 a, b, c; }; double dot(Vector3 &a, Vector3 &b) { return a.x * b.x + a.y * b.y + a.z * b.z; } Vector3 mul(Vector3 &a, Matrix3 &b) { Vector3 r; r.x = dot( a, b.a ); r.y = dot( a, b.b ); r.z = dot( a, b.c ); return r; } void transform(Matrix3 &m, Vector3 *x, int n) { for (int i = 0; i < n; i++) x[i] = mul( x[i], m ); } we compile transform to a loop with all of the GEP instructions for indexing into 'm' pulled out of the loop (9 of them). Because isel occurs a bb at a time we are unable to fold the constant index into the loads in the loop, leading to PPC code that looks like this: LBB3_1: ; no_exit.preheader li r2, 0 addi r6, r3, 64 ;; 9 values live across the loop body! addi r7, r3, 56 addi r8, r3, 48 addi r9, r3, 40 addi r10, r3, 32 addi r11, r3, 24 addi r12, r3, 16 addi r30, r3, 8 LBB3_2: ; no_exit lfd f0, 0(r30) lfd f1, 8(r4) fmul f0, f1, f0 lfd f2, 0(r3) ;; no constant indices folded into the loads! lfd f3, 0(r4) lfd f4, 0(r10) lfd f5, 0(r6) lfd f6, 0(r7) lfd f7, 0(r8) lfd f8, 0(r9) lfd f9, 0(r11) lfd f10, 0(r12) lfd f11, 16(r4) fmadd f0, f3, f2, f0 fmul f2, f1, f4 fmadd f0, f11, f10, f0 fmadd f2, f3, f9, f2 fmul f1, f1, f6 stfd f0, 0(r4) fmadd f0, f11, f8, f2 fmadd f1, f3, f7, f1 stfd f0, 8(r4) fmadd f0, f11, f5, f1 addi r29, r4, 24 stfd f0, 16(r4) addi r2, r2, 1 cmpw cr0, r2, r5 or r4, r29, r29 bne cr0, LBB3_2 ; no_exit uh, yuck. With this patch, we now sink the constant offsets into the loop, producing this code: LBB3_1: ; no_exit.preheader li r2, 0 LBB3_2: ; no_exit lfd f0, 8(r3) lfd f1, 8(r4) fmul f0, f1, f0 lfd f2, 0(r3) lfd f3, 0(r4) lfd f4, 32(r3) ;; much nicer. lfd f5, 64(r3) lfd f6, 56(r3) lfd f7, 48(r3) lfd f8, 40(r3) lfd f9, 24(r3) lfd f10, 16(r3) lfd f11, 16(r4) fmadd f0, f3, f2, f0 fmul f2, f1, f4 fmadd f0, f11, f10, f0 fmadd f2, f3, f9, f2 fmul f1, f1, f6 stfd f0, 0(r4) fmadd f0, f11, f8, f2 fmadd f1, f3, f7, f1 stfd f0, 8(r4) fmadd f0, f11, f5, f1 addi r6, r4, 24 stfd f0, 16(r4) addi r2, r2, 1 cmpw cr0, r2, r5 or r4, r6, r6 bne cr0, LBB3_2 ; no_exit This is much nicer as it reduces register pressure in the loop a lot. On X86, this takes the function from having 9 spilled registers to 2. This should help some spec programs on X86 (gzip?) This is currently only enabled with -enable-gep-isel-opt to allow perf testing tonight. llvm-svn: 24606	2005-12-05 07:10:48 +00:00
Chris Lattner	8782b782cd	dbg.stoppoint returns a value, don't forget to init it llvm-svn: 24583	2005-12-03 18:50:48 +00:00
Andrew Lenharth	f9b27d7011	bah, must generate all results llvm-svn: 24574	2005-12-02 06:08:08 +00:00
Andrew Lenharth	73420b3795	cycle counter fix llvm-svn: 24573	2005-12-02 04:56:24 +00:00
Chris Lattner	0142afd6c1	Don't remove two operand, two result nodes from the binary ops map. These should come from the arbitrary ops map. This fixes Regression/CodeGen/PowerPC/2005-12-01-Crash.ll llvm-svn: 24571	2005-12-01 23:14:50 +00:00
Chris Lattner	05b0b4575b	Promote line and column number information for our friendly 64-bit targets. llvm-svn: 24568	2005-12-01 18:21:35 +00:00
Chris Lattner	9d0d715e83	This is a bugfix for SelectNodeTo. In certain situations, we could be selecting a node and use a mix of getTargetNode() and SelectNodeTo. Because SelectNodeTo didn't check the CSE maps for a preexisting node and didn't insert its result into the CSE maps, we would sometimes miss a CSE opportunity. This is extremely rare, but worth fixing for completeness. llvm-svn: 24565	2005-12-01 18:00:57 +00:00
Nate Begeman	006bb04f3a	Support multiple ValueTypes per RegisterClass, needed for upcoming vector work. This change has no effect on generated code. llvm-svn: 24563	2005-12-01 04:51:06 +00:00
Chris Lattner	be5dd5da19	Make SelectNodeTo return N llvm-svn: 24548	2005-11-30 22:45:14 +00:00
Chris Lattner	c174048430	CALLSEQ_START/END nodes don't get memoized, do not add them in when replaceAllUses'ing. llvm-svn: 24539	2005-11-30 18:20:52 +00:00
Andrew Lenharth	6ee8566cae	At long last, you can say that f32 isn't supported for setcc llvm-svn: 24537	2005-11-30 17:12:26 +00:00
Nate Begeman	1064d6ec43	First chunk of actually generating vector code for packed types. These changes allow us to generate the following code: _foo: li r2, 0 lvx v0, r2, r3 vaddfp v0, v0, v0 stvx v0, r2, r3 blr for this llvm: void %foo(<4 x float>* %a) { entry: %tmp1 = load <4 x float>* %a %tmp2 = add <4 x float> %tmp1, %tmp1 store <4 x float> %tmp2, <4 x float>* %a ret void } llvm-svn: 24534	2005-11-30 08:22:07 +00:00
Andrew Lenharth	8d17c70171	add support for custom lowering SINT_TO_FP llvm-svn: 24531	2005-11-30 06:43:03 +00:00
Reid Spencer	3fd1b4c9bf	Fix a problem with llvm-ranlib that (on some platforms) caused the archive file to become corrupted due to interactions between mmap'd memory segments and file descriptors closing. The problem is completely avoiding by using a third temporary file. Patch provided by Evan Jones llvm-svn: 24527	2005-11-30 05:21:10 +00:00
Evan Cheng	11d61613af	Fixed a bug introduced by my last commit: TargetGlobalValues should key on GlobalValue * and index pair. Update getGlobalAddress() for symmetry. llvm-svn: 24524	2005-11-30 02:49:21 +00:00
Evan Cheng	0e0de2f3f0	Added an index field to GlobalAddressSDNode so it can represent X+12, etc. llvm-svn: 24523	2005-11-30 02:04:11 +00:00
Chris Lattner	435b402e1f	Add support for a new STRING and LOCATION node for line number support, patch contributed by Daniel Berlin, with a few cleanups here and there by me. llvm-svn: 24515	2005-11-29 06:21:05 +00:00
Nate Begeman	89b049af90	Add the majority of the vector machien value types we expect to support, and make a few changes to the legalization machinery to support more than 16 types. llvm-svn: 24511	2005-11-29 05:45:29 +00:00
Nate Begeman	d37c13154a	Check in code to scalarize arbitrarily wide packed types for some simple vector operations (load, add, sub, mul). This allows us to codegen: void %foo(<4 x float> * %a) { entry: %tmp1 = load <4 x float> * %a; %tmp2 = add <4 x float> %tmp1, %tmp1 store <4 x float> %tmp2, <4 x float> *%a ret void } on ppc as: _foo: lfs f0, 12(r3) lfs f1, 8(r3) lfs f2, 4(r3) lfs f3, 0(r3) fadds f0, f0, f0 fadds f1, f1, f1 fadds f2, f2, f2 fadds f3, f3, f3 stfs f0, 12(r3) stfs f1, 8(r3) stfs f2, 4(r3) stfs f3, 0(r3) blr llvm-svn: 24484	2005-11-22 18:16:00 +00:00
Nate Begeman	07890bbec4	Rather than attempting to legalize 1 x float, make sure the SD ISel never generates it. Make MVT::Vector expand-only, and remove the code in Legalize that attempts to legalize it. The plan for supporting N x Type is to continually epxand it in ExpandOp until it gets down to 2 x Type, where it will be scalarized into a pair of scalars. llvm-svn: 24482	2005-11-22 01:29:36 +00:00
Chris Lattner	44c28c22b7	Legalize MERGE_VALUES, expand READCYCLECOUNTER correctly, so it doesn't break control dependence. llvm-svn: 24437	2005-11-20 22:56:56 +00:00
Andrew Lenharth	627cbd49b1	The first patch of X86 support for read cycle counter llvm-svn: 24429	2005-11-20 21:32:07 +00:00
Chris Lattner	a8d37d748f	more progress towards bug 291 being finished. Patch by Owen Anderson, HAVE_GV case fixed up by me. llvm-svn: 24428	2005-11-20 03:45:52 +00:00
Chris Lattner	19baba67b5	Unbreak codegen of bools. This should fix the llc/jit/llc-beta failures from last night. llvm-svn: 24427	2005-11-19 18:40:42 +00:00
Chris Lattner	377bdbff91	Improve Selection DAG printer portability. Patch by Owen Anderson! llvm-svn: 24425	2005-11-19 07:44:09 +00:00
Chris Lattner	a22eae0163	Teach the graph viewer to handle register operands that are zero. llvm-svn: 24421	2005-11-19 06:58:46 +00:00
Chris Lattner	301015a703	Silence a bogus warning llvm-svn: 24420	2005-11-19 05:51:46 +00:00
Chris Lattner	f090f7eb0e	Add some method variants, patch by Evan Cheng llvm-svn: 24418	2005-11-19 01:44:53 +00:00
Nate Begeman	b2e089c31b	Teach LLVM how to scalarize packed types. Currently, this only works on packed types with an element count of 1, although more generic support is coming. This allows LLVM to turn the following code: void %foo(<1 x float> * %a) { entry: %tmp1 = load <1 x float> * %a; %tmp2 = add <1 x float> %tmp1, %tmp1 store <1 x float> %tmp2, <1 x float> *%a ret void } Into: _foo: lfs f0, 0(r3) fadds f0, f0, f0 stfs f0, 0(r3) blr llvm-svn: 24416	2005-11-19 00:36:38 +00:00
Nate Begeman	127321b14c	Split out the shift code from visitBinary. llvm-svn: 24412	2005-11-18 07:42:56 +00:00
Chris Lattner	45ca1c0194	Allow targets to custom legalize leaf nodes like GlobalAddress. llvm-svn: 24387	2005-11-17 06:41:44 +00:00
Chris Lattner	4ff65ec745	Teach legalize about targetglobaladdress llvm-svn: 24385	2005-11-17 05:52:24 +00:00
Chris Lattner	f2b62f317c	when debugging lower dbg intrinsics to calls llvm-svn: 24377	2005-11-16 07:22:30 +00:00
Jeff Cohen	cf1f782a2f	Fix operator precedence bug caught by VC++. llvm-svn: 24318	2005-11-12 00:59:01 +00:00
Andrew Lenharth	de1b5d6baa	added a chain output llvm-svn: 24306	2005-11-11 22:48:54 +00:00
Andrew Lenharth	01aa56397d	continued readcyclecounter support llvm-svn: 24300	2005-11-11 16:47:30 +00:00
Chris Lattner	bf4f233214	Switch the allnodes list from a vector of pointers to an ilist of nodes.This eliminates the vector, allows constant time removal of a node froma graph, and makes iteration over the all nodes list stable when adding nodes to the graph. llvm-svn: 24263	2005-11-09 23:47:37 +00:00
Chris Lattner	cd6f0f47f2	Refactor intrinsic lowering stuff out of visitCall llvm-svn: 24261	2005-11-09 19:44:01 +00:00
Chris Lattner	af3aefa10e	Handle the trivial (but common) two-op case more efficiently llvm-svn: 24259	2005-11-09 18:48:57 +00:00
Chris Lattner	41fd6d5d27	Fix CodeGen/X86/shift-folding.ll:test3 on X86 llvm-svn: 24256	2005-11-09 16:50:40 +00:00
Chris Lattner	b7cad90e55	Avoid creating a token factor node in trivially redundant cases. This eliminates almost one node per block in common cases. llvm-svn: 24254	2005-11-09 05:03:03 +00:00
Chris Lattner	43535a19b1	Handle GEP's a bit more intelligently. Fold constant indices early and turn power-of-two multiplies into shifts early to improve compile time. llvm-svn: 24253	2005-11-09 04:45:33 +00:00
Chris Lattner	c4d6050db6	Allocate the right amount of memory for this vector up front. llvm-svn: 24252	2005-11-08 23:32:44 +00:00
Chris Lattner	88fa11c3d5	Change the ValueList array for each node to be shared instead of individuallyallocated. Further, in the common case where a node has a single value, justreference an element from a small array. This is a small compile-time win. llvm-svn: 24251	2005-11-08 23:30:28 +00:00
Chris Lattner	7e4b5d33cb	Switch the operandlist/valuelist from being vectors to being just an array.This saves 12 bytes from SDNode, but doesn't speed things up substantially (our graphs apparently already fit within the cache on my g5). In any case this reduces memory usage. llvm-svn: 24249	2005-11-08 22:07:03 +00:00
Chris Lattner	3ba38cba64	Explicitly initialize some instance vars llvm-svn: 24247	2005-11-08 21:54:57 +00:00
Chris Lattner	aba48dd34c	Clean up RemoveDeadNodes significantly, by eliminating the need for a temporary set and eliminating the need to iterate whenever something is removed (which can be really slow in some cases). Thx to Jim for pointing out something silly I was getting stuck on. :) llvm-svn: 24241	2005-11-08 18:52:27 +00:00
Jim Laskey	1d2f26adcc	Let's try ignoring resource utilization on the backward pass. llvm-svn: 24231	2005-11-07 19:08:53 +00:00
Nate Begeman	3ee3e69556	Add the necessary support to the ISel to allow targets to codegen the new alignment information appropriately. Includes code for PowerPC to support fixed-size allocas with alignment larger than the stack. Support for arbitrarily aligned dynamic allocas coming soon. llvm-svn: 24224	2005-11-06 09:00:38 +00:00
Jim Laskey	904dbb4a27	Fix logic bug in finding retry slot in tally. llvm-svn: 24188	2005-11-05 00:01:25 +00:00
Jim Laskey	ded4759d81	Fix a warning llvm-svn: 24187	2005-11-04 18:26:02 +00:00
Jim Laskey	e682b677c1	Scheduling now uses itinerary data. llvm-svn: 24180	2005-11-04 04:05:35 +00:00
Nate Begeman	ee065281e8	Fix a crash that Andrew noticed, and add a pair of braces to unfconfuse XCode's indenting. llvm-svn: 24159	2005-11-02 18:42:59 +00:00
Chris Lattner	17df608719	Fix a source of undefined behavior when dealing with 64-bit types. This may fix PR652. Thanks to Andrew for tracking down the problem. llvm-svn: 24145	2005-11-02 01:47:04 +00:00
Jim Laskey	5ce0538253	1. Embed and not inherit vector for NodeGroup. 2. Iterate operands and not uses (performance.) 3. Some long pending comment changes. llvm-svn: 24119	2005-10-31 12:49:09 +00:00
Chris Lattner	6871b23d02	Significantly simplify this code and make it more aggressive. Instead of having a special case hack for X86, make the hack more general: if an incoming argument register is not used in any block other than the entry block, don't copy it to a vreg. This helps us compile code like this: %struct.foo = type { int, int, [0 x ubyte] } int %test(%struct.foo* %X) { %tmp1 = getelementptr %struct.foo* %X, int 0, uint 2, int 100 %tmp = load ubyte* %tmp1 ; <ubyte> [#uses=1] %tmp2 = cast ubyte %tmp to int ; <int> [#uses=1] ret int %tmp2 } to: _test: lbz r3, 108(r3) blr instead of: _test: lbz r2, 108(r3) or r3, r2, r2 blr The (dead) copy emitted to copy r3 into a vreg for extra-block uses was increasing the live range of r3 past the load, preventing the coallescing. This implements CodeGen/PowerPC/reg-coallesce-simple.ll llvm-svn: 24115	2005-10-30 19:42:35 +00:00
Chris Lattner	dd5663dfa0	Reduce the number of copies emitted as machine instructions by generating results in vregs that will need them. In the case of something like this: CopyToReg((add X, Y), reg1024), we no longer emit code like this: reg1025 = add X, Y reg1024 = reg 1025 Instead, we emit: reg1024 = add X, Y Whoa! :) llvm-svn: 24111	2005-10-30 18:54:27 +00:00
Chris Lattner	a70878d4fb	Codegen mul by negative power of two with a shift and negate. This implements test/Regression/CodeGen/PowerPC/mul-neg-power-2.ll, producing: _foo: slwi r2, r3, 1 subfic r3, r2, 63 blr instead of: _foo: mulli r2, r3, -2 addi r3, r2, 63 blr llvm-svn: 24106	2005-10-30 06:41:49 +00:00
Chris Lattner	4b6d583d7a	Fix DSE to not nuke dead stores unless they redundant store is the same VT as the killing one. Fix fixes PR491 llvm-svn: 24034	2005-10-27 07:10:34 +00:00
Chris Lattner	d8c5c066a1	Add a simple xform that is useful for bitfield operations. llvm-svn: 24029	2005-10-27 05:06:38 +00:00
Nate Begeman	d8f2a1a0f3	Allow custom lowered FP_TO_SINT ops in the check for whether a larger FP_TO_SINT is preferred to a larger FP_TO_UINT. This seems to be begging for a TLI.isOperationCustom() helper function. llvm-svn: 23992	2005-10-25 23:47:25 +00:00
Chris Lattner	3b409a85eb	Clear a bit in this file that was causing a miscompilation of 178.galgel. llvm-svn: 23980	2005-10-25 18:57:30 +00:00
Andrew Lenharth	4b3932aa89	add TargetExternalSymbol llvm-svn: 23886	2005-10-23 03:40:17 +00:00
Chris Lattner	9faa5b7a9a	BuildSDIV and BuildUDIV only work for i32/i64, but they don't check that the input is that type, this caused a failure on gs on X86 last night. Move the hard checks into Build[US]Div since that is where decisions like this should be made. llvm-svn: 23881	2005-10-22 18:50:15 +00:00
Chris Lattner	75ea5b10bf	add a case missing from the dag combiner that exposed the failure on 2005-10-21-longlonggtu.ll. llvm-svn: 23875	2005-10-21 21:23:25 +00:00
Nate Begeman	8f62cd32ad	Fix a typo in the dag combiner, so that this can work on i64 targets llvm-svn: 23856	2005-10-21 01:51:45 +00:00
Nate Begeman	4dd383120f	Invert the TargetLowering flag that controls divide by consant expansion. Add a new flag to TargetLowering indicating if the target has really cheap signed division by powers of two, make ppc use it. This will probably go away in the future. Implement some more ISD::SDIV folds in the dag combiner Remove now dead code in the x86 backend. llvm-svn: 23853	2005-10-21 00:02:42 +00:00
Nate Begeman	7efe53d90b	Fix a couple bugs in the const div stuff where we'd generate MULHS/MULHU for types that aren't legal, and fail a divisor is less than zero comparison, which would cause us to drop a subtract. llvm-svn: 23846	2005-10-20 17:45:03 +00:00
Chris Lattner	a6efeb01f9	don't use llabs with apparently VC++ doesn't have llvm-svn: 23845	2005-10-20 17:01:00 +00:00
Nate Begeman	c6f067a8c4	Move the target constant divide optimization up into the dag combiner, so that the nodes can be folded with other nodes, and we can not duplicate code in every backend. Alpha will probably want this too. llvm-svn: 23835	2005-10-20 02:15:44 +00:00
Nate Begeman	5172ce641e	Teach Legalize how to do something with EXTRACT_ELEMENT when the type of the pair of elements is a legal type. llvm-svn: 23804	2005-10-19 00:06:56 +00:00
Nate Begeman	78afac2ddd	Add the ability to lower return instructions to TargetLowering. This allows us to lower legal return types to something else, to meet ABI requirements (such as that i64 be returned in two i32 regs on Darwin/ppc). llvm-svn: 23802	2005-10-18 23:23:37 +00:00
Chris Lattner	0a71a9ac86	Fix Generic/2005-10-18-ZeroSizeStackObject.ll by not requesting a zero sized stack object if either the array size or the type size is zero. llvm-svn: 23801	2005-10-18 22:14:06 +00:00
Chris Lattner	8396a308a7	remove hack llvm-svn: 23797	2005-10-18 22:11:42 +00:00
Chris Lattner	6c14c35bd7	Fold (select C, load A, load B) -> load (select C, A, B). This happens quite a lot throughout many programs. In particular, specfp triggers it a bunch for constant FP nodes when you have code like cond ? 1.0 : -1.0. If the PPC ISel exposed the loads implicit in pic references to external globals, we would be able to eliminate a load in cases like this as well: %X = external global int %Y = external global int int* %test4(bool %C) { %G = select bool %C, int* %X, int* %Y ret int* %G } Note that this breaks things that use SrcValue's (see the fixme), but since nothing uses them yet, this is ok. Also, simplify some code to use hasOneUse() on an SDOperand instead of hasNUsesOfValue directly. llvm-svn: 23781	2005-10-18 06:04:22 +00:00
Nate Begeman	418c6e4045	Implement some feedback from Chris re: constant canonicalization llvm-svn: 23777	2005-10-18 00:28:13 +00:00
Nate Begeman	bd5f41a6a6	Legalize BUILD_PAIR appropriately for upcoming 64 bit PowerPC work. llvm-svn: 23776	2005-10-18 00:27:41 +00:00
Nate Begeman	ec48a1bfbd	fold fmul X, +2.0 -> fadd X, X; llvm-svn: 23774	2005-10-17 20:40:11 +00:00
Chris Lattner	eeb2bda2fa	add a trivial fold llvm-svn: 23764	2005-10-17 01:07:11 +00:00
Chris Lattner	e540800d5a	Fix this logic. llvm-svn: 23756	2005-10-15 22:35:40 +00:00
Chris Lattner	17cc9edd33	Add a case we were missing that was causing us to fail CodeGen/PowerPC/rlwinm.ll:test3 llvm-svn: 23755	2005-10-15 22:18:08 +00:00
Chris Lattner	b986f471be	Use getExtLoad here instead of getNode, as extloads produce two values. This fixes a legalize failure on SPASS for itanium. llvm-svn: 23747	2005-10-15 20:24:07 +00:00
Nate Begeman	6e673b24d3	fold sext_in_reg, sext_in_reg where both have the same VT. This was popping up in Fourinarow. llvm-svn: 23722	2005-10-14 01:29:07 +00:00
Nate Begeman	d59e5a7abb	Relax the checking on zextload generation a bit, since as sabre pointed out you could be AND'ing with the result of a shift that shifts out all the bits you care about, in addition to a constant. Also, move over an add/sub_parts fold from legalize to the dag combiner, where it works for things other than constants. Woot! llvm-svn: 23720	2005-10-14 01:12:21 +00:00
Chris Lattner	b8282987f4	Fix the trunc(load) case, finally allowing crafty and povray to pass llvm-svn: 23718	2005-10-13 22:10:05 +00:00
Chris Lattner	dbc5ae3109	Fix some bugs in (sext (load x)) llvm-svn: 23717	2005-10-13 21:52:31 +00:00
Chris Lattner	258521d7ea	When ExpandOp'ing a [SZ]EXTLOAD, make sure to remember that the chain is also legal. Add support for ExpandOp'ing raw EXTLOADs too. llvm-svn: 23716	2005-10-13 21:44:47 +00:00
Chris Lattner	d23f4b7411	Implement PromoteOp for *EXTLOAD, allowing MallocBench/gs to Legalize llvm-svn: 23715	2005-10-13 20:07:41 +00:00
Nate Begeman	8e022b3d89	Fix the remaining DAGCombiner issues pointed out by sabre. This should fix the remainder of the failures introduced by my patch last night. llvm-svn: 23714	2005-10-13 18:34:58 +00:00
Chris Lattner	a80f1f6e72	Fix a minor bug in the dag combiner that broke pcompress2 and some other tests. llvm-svn: 23713	2005-10-13 18:16:34 +00:00
Nate Begeman	c3a89c5259	Add support to Legalize for expanding i64 sextload/zextload into hi and lo parts. This should fix the crafty and signed long long unit test failure on x86 last night. llvm-svn: 23711	2005-10-13 17:15:37 +00:00
Jim Laskey	5d7a50ac44	Inhibit instructions from being pushed before function calls. This will minimize unnecessary spilling. llvm-svn: 23710	2005-10-13 16:44:00 +00:00
Nate Begeman	02b23c6065	Move some Legalize functionality over to the DAGCombiner where it belongs. Kill some dead code. llvm-svn: 23706	2005-10-13 03:11:28 +00:00
Nate Begeman	70d28c5e32	Fix a potential bug with two combine-to's back to back that chris pointed out, where after the first CombineTo() call, the node the second CombineTo wishes to replace may no longer exist. Fix a very real bug with the truncated load optimization on little endian targets, which do not need a byte offset added to the load. llvm-svn: 23704	2005-10-12 23:18:53 +00:00
Nate Begeman	8caf81d617	More cool stuff for the dag combiner. We can now finally handle things like turning: _foo: fctiwz f0, f1 stfd f0, -8(r1) lwz r2, -4(r1) rlwinm r3, r2, 0, 16, 31 blr into _foo: fctiwz f0,f1 stfd f0,-8(r1) lhz r3,-2(r1) blr Also removed an unncessary constraint from sra -> srl conversion, which should take care of hte only reason we would ever need to handle sra in MaskedValueIsZero, AFAIK. llvm-svn: 23703	2005-10-12 20:40:40 +00:00
Jim Laskey	63b1419b74	Finally committing to the new scheduler. Still -sched=none by default. llvm-svn: 23702	2005-10-12 18:29:35 +00:00
Chris Lattner	514f058be1	Fix a powerpc crash on CodeGen/Generic/llvm-ct-intrinsics.ll llvm-svn: 23694	2005-10-11 17:56:34 +00:00
Chris Lattner	c38fb8e2a1	Add a canonicalization that got lost, fixing PowerPC/fold-li.ll:SUB llvm-svn: 23693	2005-10-11 06:07:15 +00:00
Chris Lattner	cc6e53e6ee	clean up some corner cases llvm-svn: 23692	2005-10-10 23:00:08 +00:00
Chris Lattner	04c737091f	Implement trivial DSE. If two stores are neighbors and store to the same location, replace them with a new store of the last value. This occurs in the same neighborhood in 197.parser, speeding it up about 1.5% llvm-svn: 23691	2005-10-10 22:31:19 +00:00
Chris Lattner	e260ed8628	Add support for CombineTo, allowing the dag combiner to replace nodes with multiple results. Use this support to implement trivial store->load forwarding, implementing CodeGen/PowerPC/store-load-fwd.ll. Though this is the most simple case and can be extended in the future, it is still useful. For example, it speeds up 197.parser by 6.2% by avoiding an LSU reject in xalloc: stw r6, lo16(l5_end_of_array)(r2) addi r2, r5, -4 stwx r5, r4, r2 - lwzx r5, r4, r2 - rlwinm r5, r5, 0, 0, 30 stwx r5, r4, r2 lwz r2, -4(r4) ori r2, r2, 1 llvm-svn: 23690	2005-10-10 22:04:48 +00:00
Nate Begeman	6828ed9bfd	Teach the DAGCombiner several new tricks, teaching it how to turn sext_inreg into zext_inreg based on the signbit (fires a lot), srem into urem, etc. llvm-svn: 23688	2005-10-10 21:26:48 +00:00
Chris Lattner	7730924067	Fix comment llvm-svn: 23686	2005-10-10 16:52:03 +00:00
Chris Lattner	3d1d4a3d12	Add ISD::ADD to MaskedValueIsZero llvm-svn: 23685	2005-10-10 16:51:40 +00:00
Chris Lattner	56e44a6da5	This function is now dead llvm-svn: 23684	2005-10-10 16:49:22 +00:00
Chris Lattner	bcfebebf22	Enable Nate's excellent DAG combiner work by default. This allows the removal of a bunch of ad-hoc and crufty code from SelectionDAG.cpp. llvm-svn: 23682	2005-10-10 16:47:10 +00:00
Chris Lattner	6a49b7cabb	add a todo for something I noticed llvm-svn: 23679	2005-10-09 22:59:08 +00:00
Chris Lattner	1d3dc00674	(X & Y) & C == 0 if either X&C or Y&C are zero llvm-svn: 23678	2005-10-09 22:12:36 +00:00
Chris Lattner	0832f2635a	When emiting a CopyFromReg and the source is already a vreg, do not bother creating a new vreg and inserting a copy: just use the input vreg directly. This speeds up the compile (e.g. about 5% on mesa with a debug build of llc) by not adding a bunch of copies and vregs to be coallesced away. On mesa, for example, this reduces the number of intervals from 168601 to 129040 going into the coallescer. llvm-svn: 23671	2005-10-09 05:58:56 +00:00
Nate Begeman	2042aa5b92	Lo and behold, the last bits of SelectionDAG.cpp have been moved over. llvm-svn: 23665	2005-10-08 00:29:44 +00:00
Chris Lattner	be4bbca0ba	remove debugging code llvm-svn: 23663	2005-10-07 15:31:26 +00:00
Chris Lattner	fb12624a3f	implement CodeGen/PowerPC/div-2.ll:test2-4 by propagating zero bits through C-X's llvm-svn: 23662	2005-10-07 15:30:32 +00:00
Chris Lattner	b27a4147d3	fix indentation llvm-svn: 23660	2005-10-07 06:37:02 +00:00
Chris Lattner	5bcd0dd811	Turn sdivs into udivs when we can prove the sign bits are clear. This implements CodeGen/PowerPC/div-2.ll llvm-svn: 23659	2005-10-07 06:10:46 +00:00
Chris Lattner	7bf8d06f02	silence a bogus GCC warning llvm-svn: 23646	2005-10-06 17:39:10 +00:00
Chris Lattner	4bbbb9eed7	Make the legalizer completely non-recursive llvm-svn: 23642	2005-10-06 01:20:27 +00:00
Nate Begeman	558beb3729	Let the combiner handle more cases llvm-svn: 23641	2005-10-05 21:44:43 +00:00
Nate Begeman	f8221c5e2c	Remove some bad code from Legalize llvm-svn: 23640	2005-10-05 21:44:10 +00:00
Nate Begeman	bd7df030d2	Check in some more DAGCombiner pieces llvm-svn: 23639	2005-10-05 21:43:42 +00:00
Chris Lattner	a49e16fefa	implement visitBR_CC so that PowerPC/inverted-bool-compares.ll passes with the dag combiner. This speeds up espresso by 8%, reaching performance parity with the dag-combiner-disabled llc. llvm-svn: 23636	2005-10-05 06:47:48 +00:00
Chris Lattner	b11d15637a	fix some pastos llvm-svn: 23635	2005-10-05 06:37:22 +00:00
Chris Lattner	06f1d0f73a	Add a new HandleNode class, which is used to handle (haha) cases in the dead node elim and dag combiner passes where the root is potentially updated. This fixes a fixme in the dag combiner. llvm-svn: 23634	2005-10-05 06:35:28 +00:00
Chris Lattner	a6895d180e	Implement the code for PowerPC/inverted-bool-compares.ll, even though it that testcase still does not pass with the dag combiner. This is because not all forms of br* are folded yet. Also, when we combine a node into another one, delete the node immediately instead of waiting for the node to potentially come up in the future. llvm-svn: 23632	2005-10-05 06:11:08 +00:00
Chris Lattner	6bd8fd09b6	make sure that -view-isel-dags is the input to the isel, not the input to the second phase of dag combining llvm-svn: 23631	2005-10-05 06:09:10 +00:00
Chris Lattner	746d50a01a	Fix a crash compiling Olden/tsp llvm-svn: 23630	2005-10-05 04:45:43 +00:00
Jim Laskey	327d4298e1	Reverting to version - until problem isolated. llvm-svn: 23622	2005-10-04 16:41:51 +00:00
Nate Begeman	5da6908d65	Fix some faulty logic in the libcall inserter. Since calls return more than one value, don't bail if one of their uses happens to be a node that's not an MVT::Other when following the chain from CALLSEQ_START to CALLSEQ_END. Once we've found a CALLSEQ_START, we can just return; there's no need to tail-recurse further up the graph. Most importantly, just because something only has one use doesn't mean we should use it's one use to follow from start to end. This faulty logic caused us to follow a chain of one-use FP operations back to a much earlier call, putting a cycle in the graph from a later start to an earlier end. This is a better fix that reverting to the workaround committed earlier today. llvm-svn: 23620	2005-10-04 02:10:55 +00:00
Nate Begeman	54fb5002e5	Add back a workaround that fixes some breakages from chris's last change. Neither of us have yet figured out why this code is necessary, but stuff breaks if its not there. Still tracking this down... llvm-svn: 23617	2005-10-04 00:37:37 +00:00
Jim Laskey	409a6b204e	Refactor gathering node info and emission. llvm-svn: 23610	2005-10-03 12:30:32 +00:00
Chris Lattner	9cfccfb517	Fix a problem where the legalizer would run out of stack space on extremely large basic blocks because it was purely recursive. This switches it to an iterative/recursive hybrid. llvm-svn: 23596	2005-10-02 17:49:46 +00:00
Chris Lattner	7f718e61e8	silence a bogus warning llvm-svn: 23595	2005-10-02 16:30:51 +00:00
Chris Lattner	704d97f8b2	Add assertions to the trivial scheduler to check that the value types match up between defs and uses. llvm-svn: 23590	2005-10-02 07:10:55 +00:00
Chris Lattner	a038d901fb	Codegen CopyFromReg using the regclass that matches the valuetype of the destination vreg. llvm-svn: 23586	2005-10-02 06:34:16 +00:00
Chris Lattner	5a7bfe0b72	Add some very paranoid checking for operand/result reg class matchup For instructions that define multiple results, use the right regclass to define the result, not always the rc of result #0 llvm-svn: 23580	2005-10-01 07:45:09 +00:00
Jeff Cohen	f8a5e5ae6e	Fix VC++ warnings. llvm-svn: 23579	2005-10-01 03:57:14 +00:00
Chris Lattner	fda6944c5b	add a method llvm-svn: 23575	2005-10-01 00:17:07 +00:00
Jim Laskey	d3850457a1	typo llvm-svn: 23574	2005-10-01 00:08:23 +00:00
Jim Laskey	9d96932879	1. Simplify the gathering of node groups. 2. Printing node groups when displaying nodes. llvm-svn: 23573	2005-10-01 00:03:07 +00:00
Jim Laskey	3fe3841c2a	1. Made things node-centric (from operand). 2. Added node groups to handle flagged nodes. 3. Started weaning simple scheduling off existing emitter. llvm-svn: 23566	2005-09-30 19:15:27 +00:00
Chris Lattner	5b2be1f890	Fix two bugs in my patch earlier today that broke int->fp conversion on X86. llvm-svn: 23522	2005-09-29 06:44:39 +00:00
Jeff Cohen	b01a41a06d	Silence VC++ redeclaration warnings. llvm-svn: 23516	2005-09-29 01:59:49 +00:00
Chris Lattner	6f3b577ee6	Add FP versions of the binary operators, keeping the int and fp worlds seperate. Though I have done extensive testing, it is possible that this will break things in configs I can't test. Please let me know if this causes a problem and I'll fix it ASAP. llvm-svn: 23504	2005-09-28 22:28:18 +00:00
Chris Lattner	0fd8f9fbc9	If the target prefers it, use _setjmp/_longjmp should be used instead of setjmp/longjmp for llvm.setjmp/llvm.longjmp. llvm-svn: 23481	2005-09-27 22:15:53 +00:00
Jim Laskey	63523f98d5	Remove some redundancies. llvm-svn: 23469	2005-09-27 17:32:45 +00:00
Jim Laskey	5f2443c8a3	Addition of a simple two pass scheduler. This version is currently hacked up for testing and will require target machine info to do a proper scheduling. The simple scheduler can be turned on using -sched=simple (defaults to -sched=none) llvm-svn: 23455	2005-09-26 21:57:04 +00:00
Chris Lattner	59a05bdde6	Turn (X^C1) == C2 into X == C1^C2 iff X&~C1 = 0 (and move a function) This happens all the time on PPC for bool values, e.g. eliminating a xori in inverted-bool-compares.ll. This should be added to the dag combiner as well. llvm-svn: 23403	2005-09-23 00:55:52 +00:00
Nate Begeman	c760f80fed	Stub out the rest of the DAG Combiner. Just need to fill in the select_cc bits and then wrap it in a convenience function for use with regular select. llvm-svn: 23389	2005-09-19 22:34:01 +00:00
Nate Begeman	24a7eca282	More DAG combining. Still need the branch instructions, and select_cc llvm-svn: 23371	2005-09-16 00:54:12 +00:00
Chris Lattner	d4382f0afa	If a function has liveins, and if the target requested that they be plopped into particular vregs, emit copies into the entry MBB. llvm-svn: 23331	2005-09-13 19:30:54 +00:00
Chris Lattner	2d454bf5be	Allow targets to say they don't support truncstore i1 (which includes a mask when storing to an 8-bit memory location), as most don't. llvm-svn: 23303	2005-09-10 00:20:18 +00:00
Chris Lattner	bd39c1a4c6	Add a missing #include, patch courtesy of Baptiste Lepilleur. llvm-svn: 23302	2005-09-09 23:53:39 +00:00
Chris Lattner	331b311f7b	Fix a problem duraid encountered on itanium where this folding: select (x < y), 1, 0 -> (x < y) incorrectly: the setcc returns i1 but the select returned i32. Add the zero extend as needed. llvm-svn: 23301	2005-09-09 23:00:07 +00:00
Chris Lattner	16e5cb87ba	Fix a crash viewing dags that have target nodes in them llvm-svn: 23300	2005-09-09 22:35:03 +00:00
Nate Begeman	049b748c76	Last round of 2-node folds from SD.cpp. Will move on to 3 node ops such as setcc and select next. llvm-svn: 23295	2005-09-09 19:49:52 +00:00
Nate Begeman	85c1cc4523	Move yet more folds over to the dag combiner from sd.cpp llvm-svn: 23278	2005-09-08 20:18:10 +00:00
Nate Begeman	2cc2c9a79c	Another round of dag combiner changes. This fixes some missing XOR folds as well as fixing how we replace old values with new values. llvm-svn: 23260	2005-09-07 23:25:52 +00:00
Nate Begeman	6791d63e55	Implement a common missing fold, (add (add x, c1), c2) -> (add x, c1+c2). This restores all of stanford to being identical with and without the dag combiner with the add folding turned off in sd.cpp. llvm-svn: 23258	2005-09-07 16:09:19 +00:00
Chris Lattner	fe883adfd2	Fix a bug nate ran into with replacealluseswith. In the recursive cse case, we were losing a node, causing an assertion to fail. Now we eagerly delete discovered CSE's, and provide an optional vector to keep track of these discovered equivalences. llvm-svn: 23255	2005-09-07 05:37:01 +00:00
Nate Begeman	007c650699	Add an option to the DAG Combiner to enable it for beta runs, and turn on that option for PowerPC's beta. llvm-svn: 23253	2005-09-07 00:15:36 +00:00
Nate Begeman	d23739d020	Next round of DAGCombiner changes. This version now passes all the tests I have run so far when run before Legalize. It still needs to pick up the SetCC folds, and nodes that use SetCC. llvm-svn: 23243	2005-09-06 04:43:02 +00:00
Chris Lattner	821628ff2a	Fix a checking failure in gs llvm-svn: 23235	2005-09-03 01:04:40 +00:00
Nate Begeman	7cea6ef16e	Next round of DAG Combiner changes. Just need to support multiple return values, and then we should be able to hook it up. llvm-svn: 23231	2005-09-02 21:18:40 +00:00
Chris Lattner	1a570f1fe4	Clean up some code from the last checkin llvm-svn: 23229	2005-09-02 20:32:45 +00:00
Chris Lattner	630226697f	Fix a bug in legalize where it would emit two calls to libcalls that return i64 values on targets that need that expanded to 32-bit registers. This fixes PowerPC/2005-09-02-LegalizeDuplicatesCalls.ll and speeds up 189.lucas from taking 122.72s to 81.96s on my desktop. llvm-svn: 23228	2005-09-02 20:26:58 +00:00
Chris Lattner	b95b280bee	Make sure to auto-cse nullary ops llvm-svn: 23224	2005-09-02 19:36:17 +00:00
Chris Lattner	1e89e36dcd	Fix some buggy logic where we would try to remove nodes with two operands from the binary ops map, even if they had multiple results. This latent bug caused a few failures with the dag isel last night. To prevent stuff like this from happening in the future, add some really strict checking to make sure that the CSE maps always match up with reality! llvm-svn: 23221	2005-09-02 19:15:44 +00:00
Chris Lattner	b0b4ec5655	Don't create zero sized stack objects even for array allocas with a zero number of elements. llvm-svn: 23219	2005-09-02 18:41:28 +00:00
Chris Lattner	b6cde17d29	Fix the release build, noticed by Eric van Riet Paap llvm-svn: 23215	2005-09-02 07:09:28 +00:00
Chris Lattner	d9af1aab51	Make sure to legalize assert[zs]ext's operand correctly llvm-svn: 23208	2005-09-02 01:15:01 +00:00
Chris Lattner	a66403dbf7	For values that are live across basic blocks and need promotion, use ANY_EXTEND instead of ZERO_EXTEND to eliminate extraneous extensions. This eliminates dead zero extensions on formal arguments and other cases on PPC, implementing the newly tightened up test/Regression/CodeGen/PowerPC/small-arguments.ll test. llvm-svn: 23205	2005-09-02 00:19:37 +00:00
Chris Lattner	7753f175e6	legalize ANY_EXTEND appropriately llvm-svn: 23204	2005-09-02 00:18:10 +00:00
Chris Lattner	8c393c218b	Add support for ANY_EXTEND and add a few minor folds for it llvm-svn: 23203	2005-09-02 00:17:32 +00:00
Nate Begeman	d78d975437	Fix some code in the current node combining code, spotted when it was moved over to DAGCombiner.cpp 1. Don't assume that SetCC returns i1 when folding (xor (setcc) constant) 2. Don't duplicate code in folding AND with AssertZext that is handled by MaskedValueIsZero llvm-svn: 23196	2005-09-01 23:25:49 +00:00
Nate Begeman	2504fe2613	Implement first round of feedback from chris (there's still a couple things left to do). llvm-svn: 23195	2005-09-01 23:24:04 +00:00
Chris Lattner	975f5c9f46	It is NDEBUG not _NDEBUG llvm-svn: 23186	2005-09-01 18:44:10 +00:00
Nate Begeman	e8f78d1aab	Add the rest of the currently implemented visit routines to the switch statement in visit(). llvm-svn: 23185	2005-09-01 00:33:32 +00:00
Nate Begeman	21158fc485	First pass at the DAG Combiner. It isn't used anywhere yet, but it should be mostly functional. It currently has all folds from SelectionDAG.cpp that do not involve a condition code. llvm-svn: 23184	2005-09-01 00:19:25 +00:00
Chris Lattner	8a1a5f2818	Allow targets to custom expand shifts that are too large for their registers llvm-svn: 23173	2005-08-31 19:01:53 +00:00
Jeff Cohen	d8c84e3c7e	Fix VC++ precedence warnings llvm-svn: 23169	2005-08-31 02:47:06 +00:00
Nate Begeman	539e7c892c	Sigh, not my day. Fix typo. llvm-svn: 23166	2005-08-31 00:43:49 +00:00
Nate Begeman	d513d8a662	Fix a mistake in my previous patch pointed out by sabre; the AssertZext case in MaskedValueIsZero was wrong. llvm-svn: 23165	2005-08-31 00:43:08 +00:00
Nate Begeman	e07bc28cca	Remove some unnecessary casts, and add the AssertZext case to MaskedValueIsZero. llvm-svn: 23164	2005-08-31 00:27:53 +00:00
Chris Lattner	5764da422a	Allow physregs to occur in the dag with multiple types. Though I don't likethis, it is a requirement on PPC, which can have an f32 value in r3 at onepoint in a function and a f64 value in r3 at another point. :( This fixes compilation of mesa llvm-svn: 23161	2005-08-30 22:38:38 +00:00
Chris Lattner	61d21b1f3c	Fix FreeBench/fourinarow with the dag isel, by not adding a bogus result to SHIFT_PARTS nodes llvm-svn: 23151	2005-08-30 17:21:17 +00:00
Chris Lattner	9a4ad487f0	Fix a miscompile of PtrDist/bc. Sign extending bools is not the right thing, at least tends to expose problems elsewhere. llvm-svn: 23149	2005-08-30 16:56:19 +00:00
Nate Begeman	a3da8c4819	Remove a bogus piece of my AssertSext/AssertZext patch. oops. llvm-svn: 23148	2005-08-30 02:54:28 +00:00
Nate Begeman	43144a2fe0	Add support for AssertSext and AssertZext, folding other extensions with them. This allows for elminination of redundant extends in the entry blocks of functions on PowerPC. Add support for i32 x i32 -> i64 multiplies, by recognizing when the inputs to ISD::MUL in ExpandOp are actually just extended i32 values and not real i64 values. this allows us to codegen int mulhs(int a, int b) { return ((long long)a * b) >> 32; } as: _mulhs: mulhw r3, r4, r3 blr instead of: _mulhs: mulhwu r2, r4, r3 srawi r5, r3, 31 mullw r5, r4, r5 add r2, r2, r5 srawi r4, r4, 31 mullw r3, r4, r3 add r3, r2, r3 blr with a similar improvement on x86. llvm-svn: 23147	2005-08-30 02:44:00 +00:00
Chris Lattner	08a1e38730	Name this variable to be what it really is! llvm-svn: 23145	2005-08-30 01:58:51 +00:00
Chris Lattner	04cb82278a	Handle CopyToReg nodes with flag operands correctly llvm-svn: 23144	2005-08-30 01:57:23 +00:00
Chris Lattner	f7e5ec84c6	Add a hack to avoid some horrible code in some cases by always emitting token chains first. For this C function: int test() { int i; for (i = 0; i < 100000; ++i) foo(); } Instead of emitting this (condition before call) .LBB_test_1: ; no_exit addi r30, r30, 1 lis r2, 1 ori r2, r2, 34464 cmpw cr2, r30, r2 bl L_foo$stub bne cr2, .LBB_test_1 ; no_exit Emit this: .LBB_test_1: ; no_exit bl L_foo$stub addi r30, r30, 1 lis r2, 1 ori r2, r2, 34464 cmpw cr0, r30, r2 bne cr0, .LBB_test_1 ; no_exit Which makes it so we don't have to save/restore cr2 in the prolog/epilog of the function. This also makes the code much more similar to what the pattern isel produces. llvm-svn: 23135	2005-08-29 23:21:29 +00:00
Chris Lattner	c738d000d5	Add a new API for Nate llvm-svn: 23131	2005-08-29 21:59:31 +00:00
Andrew Lenharth	835cbb364d	Some of us cared about the the promote path llvm-svn: 23130	2005-08-29 20:46:51 +00:00
Chris Lattner	dcde1b2b6a	Fix an infinite loop on x86 llvm-svn: 23129	2005-08-29 17:30:00 +00:00
Chris Lattner	87421c8658	Fix a bug in ReplaceAllUsesWith llvm-svn: 23122	2005-08-28 23:59:36 +00:00
Chris Lattner	075250bda1	Disable this code, which broke many tests last night llvm-svn: 23114	2005-08-27 16:16:51 +00:00
Chris Lattner	5ee85e89b6	fix PHI node emission for basic blocks that have select_cc's in them on ppc32 llvm-svn: 23113	2005-08-27 00:58:02 +00:00
Chris Lattner	56ca46ee04	Nate noticed that Andrew never did this. This fixes PR600 llvm-svn: 23110	2005-08-26 22:50:40 +00:00
Chris Lattner	e7a2998064	Don't copy regs that are only used in the entry block into a vreg. This changes the code generated for: short %test(short %A) { %B = xor short %A, -32768 ret short %B } to: _test: xori r2, r3, 32768 xoris r2, r2, 65535 extsh r3, r2 blr instead of: _test: rlwinm r2, r3, 0, 16, 31 xori r2, r3, 32768 xoris r2, r2, 65535 extsh r3, r2 blr llvm-svn: 23109	2005-08-26 22:49:59 +00:00
Chris Lattner	4a5ebe94ba	Checking types here is not safe, because multiple types can map to the same register class. llvm-svn: 23103	2005-08-26 21:39:15 +00:00
Chris Lattner	13d7c252e5	Call the InsertAtEndOfBasicBlock hook if the usesCustomDAGSchedInserter flag is set on an instruction. llvm-svn: 23098	2005-08-26 20:54:47 +00:00
Chris Lattner	373f048a79	Revampt ReplaceAllUsesWith to be more efficient and easier to use. llvm-svn: 23087	2005-08-26 18:36:28 +00:00
Chris Lattner	c30405e0ee	Change ConstantPoolSDNode to actually hold the Constant itself instead of putting it into the constant pool. This allows the isel machinery to create constants that it will end up deciding are not needed, without them ending up in the resultant function constant pool. llvm-svn: 23081	2005-08-26 17:15:30 +00:00
Chris Lattner	2091a36631	Fix a huge annoyance: SelectNodeTo took types before the opcode unlike every other SD API. Fix it to take the opcode before the types. llvm-svn: 23079	2005-08-26 16:36:26 +00:00
Chris Lattner	c6d481db7a	the 5th operand is the 4th number llvm-svn: 23074	2005-08-26 00:43:46 +00:00
Chris Lattner	5f573416cd	Add support for targets that want to custom expand select_cc in some cases. llvm-svn: 23071	2005-08-26 00:23:59 +00:00
Chris Lattner	dff50cadaa	Allow LowerOperation to return a null SDOperand in case it wants to lower some things given to it, but not all. llvm-svn: 23070	2005-08-26 00:14:16 +00:00
Chris Lattner	1cb550c603	Fix a nasty bug from a previous patch of mine llvm-svn: 23069	2005-08-26 00:13:12 +00:00
Nate Begeman	33840c3268	New fold for SELECT_CC llvm-svn: 23058	2005-08-25 20:04:38 +00:00
Chris Lattner	f9c19157df	Don't auto-cse nodes that return flags llvm-svn: 23055	2005-08-25 19:12:10 +00:00
Chris Lattner	9d28a56d55	simplify the code a bit using isOperationLegal llvm-svn: 23053	2005-08-25 17:54:58 +00:00
Chris Lattner	8a93f64efa	Add support for flag operands llvm-svn: 23050	2005-08-25 17:48:54 +00:00
Chris Lattner	407c6415b4	ADd support for TargetConstantPool nodes llvm-svn: 23041	2005-08-25 05:03:06 +00:00
Chris Lattner	bbe0e7df2c	add a new TargetFrameIndex node llvm-svn: 23035	2005-08-25 00:43:01 +00:00
Chris Lattner	45e1ce4e28	add a method llvm-svn: 23027	2005-08-24 23:00:29 +00:00
Chris Lattner	d7ee4d8671	Add ReplaceAllUsesWith that can take a vector of replacement values. Add some foldings to hopefully help the illegal setcc issue, and move some code around. llvm-svn: 23025	2005-08-24 22:44:39 +00:00
Chris Lattner	ad9565dfbe	Add support for external symbols, and support for variable arity instructions llvm-svn: 23022	2005-08-24 22:02:41 +00:00
Chris Lattner	bb8cc0acb2	Fix pasto that prevented VT ndoes from showing up in -view-isel-dags correctly llvm-svn: 23021	2005-08-24 18:30:00 +00:00
Chris Lattner	86b1658d58	teach selection dag mask tracking about the fact that select_cc operates like select. Also teach it that the bit count instructions can only set the low bits of the result, depending on the size of the input. This allows us to compile this: int %eq0(int %a) { %tmp.1 = seteq int %a, 0 ; <bool> [#uses=1] %tmp.2 = cast bool %tmp.1 to int ; <int> [#uses=1] ret int %tmp.2 } To this: _eq0: cntlzw r2, r3 srwi r3, r2, 5 blr instead of this: _eq0: cntlzw r2, r3 rlwinm r3, r2, 27, 31, 31 blr when setcc is marked illegal on ppc (which restores parity to non-illegal setcc). Thanks to Nate for pointing this out. llvm-svn: 23013	2005-08-24 16:46:55 +00:00
Chris Lattner	f12eb4d676	Start using isOperationLegal and isTypeLegal to simplify the code llvm-svn: 23012	2005-08-24 16:35:28 +00:00
Nate Begeman	45bbbb3f11	Teach SelectionDAG how to simplify a few more setcc-equivalent select_cc nodes so that backends don't have to. llvm-svn: 22999	2005-08-24 04:57:57 +00:00
Chris Lattner	99282c7b92	Make -view-isel-dags show the dag before instruction selecting, in case the target isel crashes due to unimplemented features like calls :) llvm-svn: 22997	2005-08-24 00:34:29 +00:00
Nate Begeman	72eab5dd5c	Fix optimization of select_cc seteq X, 0, 1, 0 -> srl (ctlz X), log2 X size llvm-svn: 22995	2005-08-24 00:21:28 +00:00
Nate Begeman	bf8c3939d7	Teach the SelectionDAG how to transform select_cc eq, X, 0, 1, 0 into either seteq X, 0 or srl (ctlz X), size(X-1), depending on what's legal for the target. llvm-svn: 22978	2005-08-23 05:41:12 +00:00
Nate Begeman	987121a61a	Teach Legalize how to turn setcc into select_cc llvm-svn: 22977	2005-08-23 04:29:48 +00:00
Chris Lattner	7f9e078d11	Fix a problem where constant expr shifts would not have their shift amount promoted to the right type. This fixes: IA64/2005-08-22-LegalizerCrash.ll llvm-svn: 22969	2005-08-22 17:28:31 +00:00
Chris Lattner	92626b9bc5	Add a fast-path for register values. Add support for constant pool entries, allowing us to compile this: float %test2(float* %P) { %Q = load float* %P %R = add float %Q, 10.1 ret float %R } to this: _test2: lfs r2, 0(r3) lis r3, ha16(.CPI_test2_0) lfs r3, lo16(.CPI_test2_0)(r3) fadds f1, r2, r3 blr llvm-svn: 22962	2005-08-22 01:04:32 +00:00
Chris Lattner	466fecee19	add anew method llvm-svn: 22957	2005-08-21 22:30:30 +00:00
Chris Lattner	4866356907	Add support for frame index nodes llvm-svn: 22956	2005-08-21 19:56:04 +00:00
Chris Lattner	0548f50501	add a method llvm-svn: 22955	2005-08-21 19:48:59 +00:00
Chris Lattner	707b39fb8c	add a method llvm-svn: 22949	2005-08-21 18:49:33 +00:00
Chris Lattner	154b2bc59b	Add support for basic blocks, fix a bug in result # computation llvm-svn: 22948	2005-08-21 18:49:29 +00:00
Chris Lattner	539c3fa863	When legalizing brcond ->brcc or select -> selectcc, make sure to truncate the old condition to a one bit value. The incoming value must have been promoted, and the top bits are undefined. This causes us to generate: _test: rlwinm r2, r3, 0, 31, 31 li r3, 17 cmpwi cr0, r2, 0 bne .LBB_test_2 ; .LBB_test_1: ; li r3, 1 .LBB_test_2: ; blr instead of: _test: rlwinm r2, r3, 0, 31, 31 li r2, 17 cmpwi cr0, r3, 0 bne .LBB_test_2 ; .LBB_test_1: ; li r2, 1 .LBB_test_2: ; or r3, r2, r2 blr for: int %test(bool %c) { %retval = select bool %c, int 17, int 1 ret int %retval } llvm-svn: 22947	2005-08-21 18:03:09 +00:00
Chris Lattner	4b08ba26d8	fix bogus warning llvm-svn: 22943	2005-08-20 18:07:27 +00:00
Chris Lattner	319e65696d	Add support for global address nodes llvm-svn: 22940	2005-08-19 22:38:24 +00:00
Chris Lattner	1be7eddecf	Add support for TargetGlobalAddress nodes llvm-svn: 22938	2005-08-19 22:31:04 +00:00
Chris Lattner	6d7f814b01	Implement CopyFromReg, TokenFactor, and fix a bug in CopyToReg. This allows us to compile stuff like this: double %test(double %A, double %B, double %C, double %E) { %F = mul double %A, %A %G = add double %F, %B %H = sub double -0.0, %G %I = mul double %H, %C %J = add double %I, %E ret double %J } to: _test: fnmadd f0, f1, f1, f2 fmadd f1, f0, f3, f4 blr woot! llvm-svn: 22937	2005-08-19 21:43:53 +00:00
Chris Lattner	0875d1ab89	Fix a bug in previous commit llvm-svn: 22936	2005-08-19 21:34:13 +00:00
Chris Lattner	4990335eb8	Print physreg register nodes with target names (e.g. F1) instead of numbers llvm-svn: 22934	2005-08-19 21:21:16 +00:00
Chris Lattner	78b200eb74	Before implementing copyfromreg, we'll implement copytoreg correctly. This gets us this for the previous testcase: _test: lis r2, 0 ori r3, r2, 65535 blr Note that we actually write to r3 (the return reg) correctly now :) llvm-svn: 22933	2005-08-19 20:50:53 +00:00
Chris Lattner	cc3035e989	Now that we have operand info for machine instructions, use it to create temporary registers for things that define a register. This allows dag->dag isel to compile this: int %test() { ret int 65535 } into: _test: lis r2, 0 ori r2, r2, 65535 blr Next up, getting CopyFromReg to work, allowing arguments and cross-bb values. llvm-svn: 22932	2005-08-19 20:45:43 +00:00
Jeff Cohen	d1f22b1282	Fix VC++ precedence warning. llvm-svn: 22902	2005-08-19 04:39:48 +00:00
Chris Lattner	d18beab94c	Fix computation of # operands, add a temporary hack for CopyToReg llvm-svn: 22896	2005-08-19 01:01:34 +00:00
Chris Lattner	0c8c2c102d	add a new -view-sched-dags option to view dags as they are sent to the scheduler. llvm-svn: 22878	2005-08-18 20:11:49 +00:00
Chris Lattner	d342de9aaa	Implement the first chunk of a code emitter. This is sophisticated enough to codegen: _empty: .LBB_empty_0: ; blr but can't do anything more (yet). :) llvm-svn: 22876	2005-08-18 20:07:59 +00:00
Chris Lattner	1b4727de7d	new file, obviously just a stub llvm-svn: 22868	2005-08-18 18:45:24 +00:00
Chris Lattner	1a908c8920	Enable critical edge splitting by default llvm-svn: 22863	2005-08-18 17:35:14 +00:00
Nate Begeman	19a271a67b	Add support for target DAG nodes that take 4 operands, such as PowerPC's rlwinm. llvm-svn: 22856	2005-08-18 07:30:15 +00:00
Chris Lattner	802080d812	Fix printing of VTSDNodes llvm-svn: 22853	2005-08-18 03:31:02 +00:00
Jim Laskey	d66e616545	Move the code dependency for MathExtras.h from SelectionDAGNodes.h. Added some class dividers in SelectionDAG.cpp. llvm-svn: 22841	2005-08-17 20:08:02 +00:00
Jim Laskey	b74c666186	Culling out use of unions for converting FP to bits and vice versa. llvm-svn: 22838	2005-08-17 19:34:49 +00:00
Chris Lattner	ab0de9d7fc	Fix a bug in RemoveDeadNodes where it would crash when its "optional" argument is not specified. Implement ReplaceAllUsesWith. llvm-svn: 22834	2005-08-17 19:00:20 +00:00
Jim Laskey	686d6a1cb2	Switched to using BitsToDouble for int_to_float to avoid aliasing problem. llvm-svn: 22831	2005-08-17 17:42:52 +00:00
Jim Laskey	898ba557d0	Change hex float constants for the sake of VC++. llvm-svn: 22828	2005-08-17 09:44:59 +00:00
Chris Lattner	c9950c11a9	Add a new beta option for critical edge splitting, to avoid a problem that Nate noticed in yacr2 (and I know occurs in other places as well). This is still rough, as the critical edge blocks are not intelligently placed but is added to get some idea to see if this improves performance. llvm-svn: 22825	2005-08-17 06:37:43 +00:00
Chris Lattner	ba28c2733f	Fix a regression on X86, where FP values can be promoted too. llvm-svn: 22822	2005-08-17 06:06:25 +00:00
Jim Laskey	f2516a9180	Added generic code expansion for [signed\|unsigned] i32 to [f32\|f64] casts in the legalizer. PowerPC now uses this expansion instead of ISel version. Example: // signed integer to double conversion double f1(signed x) { return (double)x; } // unsigned integer to double conversion double f2(unsigned x) { return (double)x; } // signed integer to float conversion float f3(signed x) { return (float)x; } // unsigned integer to float conversion float f4(unsigned x) { return (float)x; } Byte Code: internal fastcc double %_Z2f1i(int %x) { entry: %tmp.1 = cast int %x to double ; <double> [#uses=1] ret double %tmp.1 } internal fastcc double %_Z2f2j(uint %x) { entry: %tmp.1 = cast uint %x to double ; <double> [#uses=1] ret double %tmp.1 } internal fastcc float %_Z2f3i(int %x) { entry: %tmp.1 = cast int %x to float ; <float> [#uses=1] ret float %tmp.1 } internal fastcc float %_Z2f4j(uint %x) { entry: %tmp.1 = cast uint %x to float ; <float> [#uses=1] ret float %tmp.1 } internal fastcc double %_Z2g1i(int %x) { entry: %buffer = alloca [2 x uint] ; <[2 x uint]> [#uses=3] %tmp.0 = getelementptr [2 x uint] %buffer, int 0, int 0 ; <uint> [#uses=1] store uint 1127219200, uint %tmp.0 %tmp.2 = cast int %x to uint ; <uint> [#uses=1] %tmp.3 = xor uint %tmp.2, 2147483648 ; <uint> [#uses=1] %tmp.5 = getelementptr [2 x uint]* %buffer, int 0, int 1 ; <uint> [#uses=1] store uint %tmp.3, uint %tmp.5 %tmp.9 = cast [2 x uint]* %buffer to double* ; <double> [#uses=1] %tmp.10 = load double %tmp.9 ; <double> [#uses=1] %tmp.13 = load double* cast (long* %signed_bias to double) ; <double> [#uses=1] %tmp.14 = sub double %tmp.10, %tmp.13 ; <double> [#uses=1] ret double %tmp.14 } internal fastcc double %_Z2g2j(uint %x) { entry: %buffer = alloca [2 x uint] ; <[2 x uint]> [#uses=3] %tmp.0 = getelementptr [2 x uint]* %buffer, int 0, int 0 ; <uint> [#uses=1] store uint 1127219200, uint %tmp.0 %tmp.1 = getelementptr [2 x uint]* %buffer, int 0, int 1 ; <uint> [#uses=1] store uint %x, uint %tmp.1 %tmp.4 = cast [2 x uint]* %buffer to double* ; <double> [#uses=1] %tmp.5 = load double %tmp.4 ; <double> [#uses=1] %tmp.8 = load double* cast (long* %unsigned_bias to double) ; <double> [#uses=1] %tmp.9 = sub double %tmp.5, %tmp.8 ; <double> [#uses=1] ret double %tmp.9 } internal fastcc float %_Z2g3i(int %x) { entry: %buffer = alloca [2 x uint] ; <[2 x uint]> [#uses=3] %tmp.0 = getelementptr [2 x uint]* %buffer, int 0, int 0 ; <uint> [#uses=1] store uint 1127219200, uint %tmp.0 %tmp.2 = cast int %x to uint ; <uint> [#uses=1] %tmp.3 = xor uint %tmp.2, 2147483648 ; <uint> [#uses=1] %tmp.5 = getelementptr [2 x uint]* %buffer, int 0, int 1 ; <uint> [#uses=1] store uint %tmp.3, uint %tmp.5 %tmp.9 = cast [2 x uint]* %buffer to double* ; <double> [#uses=1] %tmp.10 = load double %tmp.9 ; <double> [#uses=1] %tmp.13 = load double* cast (long* %signed_bias to double) ; <double> [#uses=1] %tmp.14 = sub double %tmp.10, %tmp.13 ; <double> [#uses=1] %tmp.16 = cast double %tmp.14 to float ; <float> [#uses=1] ret float %tmp.16 } internal fastcc float %_Z2g4j(uint %x) { entry: %buffer = alloca [2 x uint] ; <[2 x uint]> [#uses=3] %tmp.0 = getelementptr [2 x uint]* %buffer, int 0, int 0 ; <uint> [#uses=1] store uint 1127219200, uint %tmp.0 %tmp.1 = getelementptr [2 x uint]* %buffer, int 0, int 1 ; <uint> [#uses=1] store uint %x, uint %tmp.1 %tmp.4 = cast [2 x uint]* %buffer to double* ; <double> [#uses=1] %tmp.5 = load double %tmp.4 ; <double> [#uses=1] %tmp.8 = load double* cast (long* %unsigned_bias to double*) ; <double> [#uses=1] %tmp.9 = sub double %tmp.5, %tmp.8 ; <double> [#uses=1] %tmp.11 = cast double %tmp.9 to float ; <float> [#uses=1] ret float %tmp.11 } PowerPC Code: .machine ppc970 .const .align 2 .CPIl1__Z2f1i_0: ; float 0x4330000080000000 .long 1501560836 ; float 4.5036e+15 .text .align 2 .globl l1__Z2f1i l1__Z2f1i: .LBBl1__Z2f1i_0: ; entry xoris r2, r3, 32768 stw r2, -4(r1) lis r2, 17200 stw r2, -8(r1) lfd f0, -8(r1) lis r2, ha16(.CPIl1__Z2f1i_0) lfs f1, lo16(.CPIl1__Z2f1i_0)(r2) fsub f1, f0, f1 blr .const .align 2 .CPIl2__Z2f2j_0: ; float 0x4330000000000000 .long 1501560832 ; float 4.5036e+15 .text .align 2 .globl l2__Z2f2j l2__Z2f2j: .LBBl2__Z2f2j_0: ; entry stw r3, -4(r1) lis r2, 17200 stw r2, -8(r1) lfd f0, -8(r1) lis r2, ha16(.CPIl2__Z2f2j_0) lfs f1, lo16(.CPIl2__Z2f2j_0)(r2) fsub f1, f0, f1 blr .const .align 2 .CPIl3__Z2f3i_0: ; float 0x4330000080000000 .long 1501560836 ; float 4.5036e+15 .text .align 2 .globl l3__Z2f3i l3__Z2f3i: .LBBl3__Z2f3i_0: ; entry xoris r2, r3, 32768 stw r2, -4(r1) lis r2, 17200 stw r2, -8(r1) lfd f0, -8(r1) lis r2, ha16(.CPIl3__Z2f3i_0) lfs f1, lo16(.CPIl3__Z2f3i_0)(r2) fsub f0, f0, f1 frsp f1, f0 blr .const .align 2 .CPIl4__Z2f4j_0: ; float 0x4330000000000000 .long 1501560832 ; float 4.5036e+15 .text .align 2 .globl l4__Z2f4j l4__Z2f4j: .LBBl4__Z2f4j_0: ; entry stw r3, -4(r1) lis r2, 17200 stw r2, -8(r1) lfd f0, -8(r1) lis r2, ha16(.CPIl4__Z2f4j_0) lfs f1, lo16(.CPIl4__Z2f4j_0)(r2) fsub f0, f0, f1 frsp f1, f0 blr llvm-svn: 22814	2005-08-17 00:39:29 +00:00
Chris Lattner	0d2456e1f0	add a new TargetConstant node llvm-svn: 22813	2005-08-17 00:34:06 +00:00
Chris Lattner	33182325f5	Eliminate the RegSDNode class, which 3 nodes (CopyFromReg/CopyToReg/ImplicitDef) used to tack a register number onto the node. Instead of doing this, make a new node, RegisterSDNode, which is a leaf containing a register number. These three operations just become normal DAG nodes now, instead of requiring special handling. Note that with this change, it is no longer correct to make illegal CopyFromReg/CopyToReg nodes. The legalizer will not touch them, and this is bad, so don't do it. :) llvm-svn: 22806	2005-08-16 21:55:35 +00:00
Nate Begeman	371e49515d	Implement BR_CC and BRTWOWAY_CC. This allows the removal of a rather nasty fixme from the PowerPC backend. Emit slightly better code for legalizing select_cc. llvm-svn: 22805	2005-08-16 19:49:35 +00:00
Chris Lattner	bc89226527	Allow passing a dag into dump and getOperationName. If one is available when printing a node, use it to render target operations with their target instruction name instead of "<<unknown>>". llvm-svn: 22804	2005-08-16 18:33:07 +00:00
Chris Lattner	7e57d18b79	Use a extant helper to do this. llvm-svn: 22802	2005-08-16 18:31:23 +00:00
Chris Lattner	1973278b38	Add some methods for dag->dag isel. Split RemoveNodeFromCSEMaps out of DeleteNodesIfDead to do it. llvm-svn: 22801	2005-08-16 18:17:10 +00:00
Nate Begeman	d5e739dcc2	Fix last night's PPC32 regressions by 1. Not selecting the false value of a select_cc in the false arm, which isn't legal for nested selects. 2. Actually returning the node we created and Legalized in the FP_TO_UINT Expander. llvm-svn: 22789	2005-08-14 18:38:32 +00:00
Nate Begeman	36853ee1fd	Teach the legalizer how to legalize FP_TO_UINT. Teach the legalizer to promote FP_TO_UINT to FP_TO_SINT if the wider FP_TO_UINT is also illegal. This allows us on PPC to codegen unsigned short foo(float a) { return a; } as: _foo: .LBB_foo_0: ; entry fctiwz f0, f1 stfd f0, -8(r1) lwz r2, -4(r1) rlwinm r3, r2, 0, 16, 31 blr instead of: _foo: .LBB_foo_0: ; entry fctiwz f0, f1 stfd f0, -8(r1) lwz r2, -4(r1) lis r3, ha16(.CPI_foo_0) lfs f0, lo16(.CPI_foo_0)(r3) fcmpu cr0, f1, f0 blt .LBB_foo_2 ; entry .LBB_foo_1: ; entry fsubs f0, f1, f0 fctiwz f0, f0 stfd f0, -16(r1) lwz r2, -12(r1) xoris r2, r2, 32768 .LBB_foo_2: ; entry rlwinm r3, r2, 0, 16, 31 blr llvm-svn: 22785	2005-08-14 01:20:53 +00:00
Nate Begeman	dc3154ec66	Remove an unncessary argument to SimplifySelectCC and add an additional assert when creating a select_cc node. llvm-svn: 22780	2005-08-13 06:14:17 +00:00
Nate Begeman	b6651e81a0	Fix the fabs regression on x86 by abstracting the select_cc optimization out into SimplifySelectCC. This allows both ISD::SELECT and ISD::SELECT_CC to use the same set of simplifying folds. llvm-svn: 22779	2005-08-13 06:00:21 +00:00
Chris Lattner	21381e8424	implement a couple of simple shift foldings. e.g. (X & 7) >> 3 -> 0 llvm-svn: 22774	2005-08-12 23:54:58 +00:00
Nate Begeman	5c7656fd53	Add a select_cc optimization for recognizing abs(int). This speeds up an integer MPEG encoding loop by a factor of two. llvm-svn: 22758	2005-08-11 02:18:13 +00:00
Nate Begeman	180b08897f	Some SELECT_CC cleanups: 1. move assertions for node creation to getNode() 2. legalize the values returned in ExpandOp immediately 3. Move select_cc optimizations from SELECT's getNode() to SELECT_CC's, allowing them to be cleaned up significantly. This paves the way to pick up additional optimizations on SELECT_CC, such as sum-of-absolute-differences. llvm-svn: 22757	2005-08-11 01:12:20 +00:00
Nate Begeman	e5b86d7442	Add new node, SELECT_CC. This node is for targets that don't natively implement SELECT. llvm-svn: 22755	2005-08-10 20:51:12 +00:00
Chris Lattner	21c0fd9e8f	Fix an oversight that may be causing PR617. llvm-svn: 22753	2005-08-10 17:37:53 +00:00
Chris Lattner	679f5b0b40	Fix spelling, fix some broken canonicalizations by my last patch llvm-svn: 22734	2005-08-09 23:09:05 +00:00
Chris Lattner	14e060f743	add cc nodes to the AllNodes list so they show up in Graphviz output llvm-svn: 22731	2005-08-09 20:40:02 +00:00
Chris Lattner	d47675ed24	Eliminate the SetCCSDNode in favor of a CondCodeSDNode class. This pulls the CC out of the SetCC operation, making SETCC a standard ternary operation and CC's a standard DAG leaf. This will make it possible for other node to use CC's as operands in the future... llvm-svn: 22728	2005-08-09 20:20:18 +00:00

... 29 30 31 32 33 ...

3375 Commits