Commit Graph

1733 Commits

Author SHA1 Message Date
Chris Lattner b7524b6d0e make this code more aggressive about turning store fpimm into store int imm.
This is not sufficient to fix X86/store-fp-constant.ll

llvm-svn: 32465
2006-12-12 04:16:14 +00:00
Reid Spencer 3c49edcaa1 Change inferred cast creation calls to more specific cast creations.
llvm-svn: 32460
2006-12-12 01:17:41 +00:00
Evan Cheng 3432ab97c1 Re-apply changes that were backed out and fix a naughty typo.
llvm-svn: 32442
2006-12-11 19:27:14 +00:00
John Criswell b3b285185f It seems the llvm::OStream class does not handle stream manipulators.
For now, just grab the stream and perform the output on it directly.

llvm-svn: 32441
2006-12-11 19:15:36 +00:00
Chris Lattner e9a203c4e5 Revert changes that broke oggenc on ppc
llvm-svn: 32440
2006-12-11 18:53:38 +00:00
Evan Cheng 218369881f Don't convert store double C, Ptr to store long C, Ptr if i64 is not a legal type.
llvm-svn: 32434
2006-12-11 17:25:19 +00:00
Evan Cheng f4bec95b58 f32 / f64 node is expanded to one i32 / i64 node.
llvm-svn: 32433
2006-12-11 06:50:04 +00:00
Evan Cheng f6b01fdb48 Clean up some bad code.
llvm-svn: 32432
2006-12-11 06:25:26 +00:00
Nate Begeman 8e20c760fa Move something that should be in the dag combiner from the legalizer to the
dag combiner.

llvm-svn: 32431
2006-12-11 02:23:46 +00:00
Anton Korobeynikov 3b7c257cae Cleaned up the setjmp/longjmp lowering interfaces. Now we're producing correct
code (both asm & cbe) for the Mingw32 target.
Removed autoconf checks for underscored versions of setjmp/longjmp.

llvm-svn: 32415
2006-12-10 23:12:42 +00:00
Evan Cheng 4eee72471c Preliminary soft float support.
llvm-svn: 32394
2006-12-09 02:42:38 +00:00
Chris Lattner d9f04e4875 Fix CodeGen/PowerPC/2006-12-07-SelectCrash.ll on PPC64
llvm-svn: 32336
2006-12-07 22:36:47 +00:00
Bill Wendling 355fc5ad50 Removed more <iostream> includes
llvm-svn: 32321
2006-12-07 20:28:15 +00:00
Bill Wendling 22e978a736 Removing even more <iostream> includes.
llvm-svn: 32320
2006-12-07 20:04:42 +00:00
Chris Lattner 700b873130 Detemplatize the Statistic class. The only type it is instantiated with
is 'unsigned'.

llvm-svn: 32279
2006-12-06 17:46:33 +00:00
Jeff Cohen cc08c83186 Unbreak VC++ build.
llvm-svn: 32113
2006-12-02 02:22:01 +00:00
Evan Cheng 67fc141db5 Match TargetInstrInfo changes.
llvm-svn: 32098
2006-12-01 21:52:58 +00:00
Evan Cheng a743fada65 Avoid infinite looping if READCYCLECOUNTER isn't custom lowered.
llvm-svn: 32022
2006-11-29 19:13:47 +00:00
Evan Cheng 6973993e9c Allow target to custom lower READCYCLECOUNTER (when it doesn't have to be expanded).
llvm-svn: 32016
2006-11-29 08:26:18 +00:00
Evan Cheng feba507a97 Fix for PR1023 by Dan Gohman.
llvm-svn: 32003
2006-11-29 01:58:12 +00:00
Evan Cheng 6e12a052ff Fix for PR1022 (folding loads of static initializers) by Dan Gohman.
llvm-svn: 32000
2006-11-29 01:38:07 +00:00
Chris Lattner 90f4238c38 add a hook to allow targets to hack on inline asms to lower them to llvm
when they want to.

llvm-svn: 31997
2006-11-29 01:12:32 +00:00
Chris Lattner 3abb63651b Fix PR1016
llvm-svn: 31950
2006-11-28 01:03:30 +00:00
Evan Cheng 20350c4025 Change MachineInstr ctors to take a TargetInstrDescriptor reference instead
of opcode and number of operands.

llvm-svn: 31947
2006-11-27 23:37:22 +00:00
Chris Lattner 5d5916b4d1 Fix the dag combiner bug corresponding to PR1014.
llvm-svn: 31943
2006-11-27 21:50:02 +00:00
Chris Lattner 3da631f29a For better or worse, load from i1 is assumed to be zero extended. Do not
form a load from i1 from larger loads that may not be zext'd.

llvm-svn: 31933
2006-11-27 04:40:53 +00:00
Chris Lattner db18938355 If a brcond condition is promoted, make sure to zero extend it, even if not
expanded into BR_CC.

llvm-svn: 31932
2006-11-27 04:39:56 +00:00
Reid Spencer 6c38f0bb07 For PR950:
The long awaited CAST patch. This introduces 12 new instructions into LLVM
to replace the cast instruction. Corresponding changes throughout LLVM are
provided. This passes llvm-test, llvm/test, and SPEC CPUINT2000 with the
exception of 175.vpr, which fails only due to a slight floating point output
difference.

llvm-svn: 31931
2006-11-27 01:05:10 +00:00
Chris Lattner 3676a994ca Fix PR1011 and CodeGen/Generic/2006-11-20-DAGCombineCrash.ll
llvm-svn: 31878
2006-11-20 18:05:46 +00:00
Reid Spencer d9436b6837 For PR950:
First in a series of patches to convert SetCondInst into ICmpInst and
FCmpInst using only two opcodes and having the instructions contain their
predicate value. Nothing uses these classes yet. More patches to follow.

llvm-svn: 31867
2006-11-20 01:22:35 +00:00
Jim Laskey da0add3fd0 Fixing the ENABLE_OPTIMIZED=1 DISABLE_ASSERTIONS=1 build.
llvm-svn: 31822
2006-11-17 13:07:55 +00:00
Evan Cheng f64da389f8 Fix an incorrectly inverted condition.
llvm-svn: 31773
2006-11-16 00:08:20 +00:00
Chris Lattner 30d08801ef remove dead #include
llvm-svn: 31753
2006-11-15 17:51:15 +00:00
Evan Cheng dbd3d294e6 Matches MachineInstr changes.
llvm-svn: 31712
2006-11-13 23:36:35 +00:00
Reid Spencer 2230144a75 Make an assert comment match the tested assertion.
llvm-svn: 31686
2006-11-11 20:07:59 +00:00
Evan Cheng 979bbf48d5 Add methods to add implicit def use operands to a MI.
llvm-svn: 31675
2006-11-11 10:20:02 +00:00
Chris Lattner a0a8003f59 disallow preinc of a frameindex. This is not profitable and causes 2-addr
pass to explode.  This fixes a bunch of llc-beta failures on ppc last night.

llvm-svn: 31661
2006-11-11 01:00:15 +00:00
Chris Lattner eabc15c1d8 reduce indentation by using early exits. No functionality change.
llvm-svn: 31660
2006-11-11 00:56:29 +00:00
Chris Lattner ffad2166e1 move big chunks of code out-of-line, no functionality change.
llvm-svn: 31658
2006-11-11 00:39:41 +00:00
Chris Lattner 4eac5f59e6 Fix a dag combiner bug exposed by my recent instcombine patch. This fixes
CodeGen/Generic/2006-11-10-DAGCombineMiscompile.ll and PPC gsm/toast

llvm-svn: 31644
2006-11-10 21:37:15 +00:00
Evan Cheng 8c9c6d71ed Add implicit def / use operands to MachineInstr.
llvm-svn: 31633
2006-11-10 08:43:01 +00:00
Evan Cheng 13440b025c When forming a pre-indexed store, make sure ptr isn't the same as, or a predecessor of, the value being stored; that would cause a cycle.
llvm-svn: 31631
2006-11-10 08:28:11 +00:00
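
The check described above amounts to a reachability walk over operand edges. A rough sketch in plain C++ with a made-up Node type (not the actual SelectionDAG API): if the new base pointer is the stored value or one of its transitive operands, folding the store would make the value depend on the store node that consumes it, i.e. a cycle.

    #include <unordered_set>
    #include <vector>

    // Hypothetical, simplified DAG node: Operands are the nodes this node
    // depends on (its predecessors in dataflow order).
    struct Node {
      std::vector<Node*> Operands;
    };

    // True if N is Of itself or reachable from Of by walking operand edges.
    bool isSameOrPredecessor(const Node* N, const Node* Of) {
      std::unordered_set<const Node*> Visited;
      std::vector<const Node*> Worklist{Of};
      while (!Worklist.empty()) {
        const Node* Cur = Worklist.back();
        Worklist.pop_back();
        if (Cur == N)
          return true;
        if (!Visited.insert(Cur).second)
          continue;                          // already explored
        for (const Node* Op : Cur->Operands)
          Worklist.push_back(Op);
      }
      return false;
    }
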
Chris Lattner d5e604dbb2 commentate
llvm-svn: 31627
2006-11-10 04:41:34 +00:00
Evan Cheng 6878378390 Don't attempt expensive pre-/post- indexed dag combine if target does not support them.
llvm-svn: 31598
2006-11-09 19:10:46 +00:00
Evan Cheng d550248f2c Add a mechanism to specify whether a target supports a particular indexed load / store.
llvm-svn: 31597
2006-11-09 18:56:43 +00:00
Evan Cheng c034f14fbe Rename ISD::MemOpAddrMode to ISD::MemIndexedMode
llvm-svn: 31596
2006-11-09 18:44:21 +00:00
Evan Cheng b15000736c Rename ISD::MemOpAddrMode to ISD::MemIndexedMode
llvm-svn: 31595
2006-11-09 17:55:04 +00:00
Evan Cheng b58e06bc9e getPostIndexedAddressParts change: passes in load/store instead of its loaded / stored VT.
llvm-svn: 31584
2006-11-09 04:29:46 +00:00
Evan Cheng 85e54223cd Match more post-indexed ops.
llvm-svn: 31569
2006-11-08 20:27:27 +00:00
Jim Laskey 61feeb90f9 Remove redundant <cmath>.
llvm-svn: 31561
2006-11-08 19:16:44 +00:00
Evan Cheng 0303cb9b33 - When performing the pre-/post-indexed load/store transformation, do not worry
  about whether the new base ptr would be live below the load/store. Let the
  two-address pass split it back to non-indexed ops.
- Minor tweaks / fixes.

llvm-svn: 31544
2006-11-08 08:30:28 +00:00
Evan Cheng 6072435756 Fixed a minor bug preventing some pre-indexed load / store transformation.
llvm-svn: 31543
2006-11-08 06:56:05 +00:00
Reid Spencer fdff938a7e For PR950:
This patch converts the old SHR instruction into two instructions,
AShr (Arithmetic) and LShr (Logical). The Shr instructions now are not
dependent on the sign of their operands.

llvm-svn: 31542
2006-11-08 06:47:33 +00:00
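
The point of splitting SHR is that the right answer depends on signedness, which the operand types no longer carry by themselves. A small C++ illustration (assuming the usual arithmetic behavior of signed right shift, guaranteed since C++20):

    #include <cstdio>

    int main() {
      int      s = -16;
      unsigned u = 0xFFFFFFF0u;      // same bit pattern as s on a 32-bit int

      std::printf("%d\n", s >> 2);   // AShr: sign bit replicated, prints -4
      std::printf("%u\n", u >> 2);   // LShr: zeros shifted in, prints 1073741820
    }
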
Evan Cheng d48f7dd250 Fix a obscure post-indexed load / store dag combine bug.
llvm-svn: 31537
2006-11-08 02:38:55 +00:00
Evan Cheng 60c6846d21 Add post-indexed load / store transformations.
llvm-svn: 31498
2006-11-07 09:03:05 +00:00
Chris Lattner 94c231f453 Fix PR988 and CodeGen/Generic/2006-11-06-MemIntrinsicExpand.ll.
The low part goes in the first operand of expandop, not the second one.

llvm-svn: 31487
2006-11-07 04:11:44 +00:00
Evan Cheng f24d15f969 Remove dead code; added a missing null ptr check.
llvm-svn: 31478
2006-11-06 21:33:46 +00:00
Evan Cheng eb99bd736a Add comment.
llvm-svn: 31473
2006-11-06 08:14:30 +00:00
Jeff Cohen 7d6f3db3e2 Unbreak VC++ build.
llvm-svn: 31464
2006-11-05 19:31:28 +00:00
Evan Cheng 33157700d9 Added pre-indexed store support.
llvm-svn: 31459
2006-11-05 09:31:14 +00:00
Evan Cheng 1a1e23eff7 Added getIndexedStore.
llvm-svn: 31458
2006-11-05 09:30:09 +00:00
Evan Cheng fd2c5dd806 Changes to use operand constraints to process two-address instructions.
llvm-svn: 31453
2006-11-04 09:44:31 +00:00
Evan Cheng 9456dd8b81 Fix comments.
llvm-svn: 31414
2006-11-03 07:31:32 +00:00
Evan Cheng 1dfd26a151 Rename
llvm-svn: 31413
2006-11-03 07:21:16 +00:00
Reid Spencer 52f958741a Remove dead variable. Fix 80 column violations.
llvm-svn: 31412
2006-11-03 03:30:34 +00:00
Evan Cheng 357017f4a9 Added DAG combiner transformation to generate pre-indexed loads.
llvm-svn: 31410
2006-11-03 03:06:21 +00:00
Evan Cheng c176f038b9 Added isPredecessor.
llvm-svn: 31409
2006-11-03 03:05:24 +00:00
Chris Lattner cd7b92251d silence warning
llvm-svn: 31397
2006-11-03 01:28:29 +00:00
Reid Spencer de46e48420 For PR786:
Turn on -Wunused and -Wno-unused-parameter. Clean up most of the resulting
fallout by removing unused variables. Remaining warnings have to do with
unused functions (I didn't want to delete code without review) and unused
variables in generated code. Maintainers should clean up the remaining
issues when they see them. All changes pass DejaGnu tests and Olden.

llvm-svn: 31380
2006-11-02 20:25:50 +00:00
Reid Spencer 7eb55b395f For PR950:
Replace the REM instruction with UREM, SREM and FREM.

llvm-svn: 31369
2006-11-02 01:53:59 +00:00
Chris Lattner 55402d4403 Allow the getRegForInlineAsmConstraint method to return a register class with
no fixed physreg.  Treat this as permission to use any register in the register
class.  When this happens and it is safe, allow the llvm register allocator to
allocate the register instead of doing it at isel time.  This eliminates a ton
of copies around common inline asms.  For example:

int test2(int Y, int X) {
  asm("foo %0, %1" : "=r"(X): "r"(X));
  return X;
}

now compiles to:

_test2:
        foo r3, r4
        blr

instead of:

_test2:
        mr r2, r4
        foo r2, r2
        mr r3, r2
        blr

GCC produces:

_test2:
        foo r4, r4
        mr r3,r4
        blr

llvm-svn: 31366
2006-11-02 01:41:49 +00:00
Evan Cheng 1359196c4e Clean up.
llvm-svn: 31359
2006-11-01 22:39:30 +00:00
Evan Cheng 47218fab42 CopyFromReg starts a live range so its use should not be considered a floater.
llvm-svn: 31356
2006-11-01 22:17:06 +00:00
Evan Cheng 415f365e5c Print jumptable index.
llvm-svn: 31340
2006-11-01 04:48:30 +00:00
Chris Lattner fe43befeda Compile CodeGen/PowerPC/fp-branch.ll to:
_intcoord_cond_next55:
LBB1_3: ;cond_next55
        lis r2, ha16(LCPI1_0)
        lfs f0, lo16(LCPI1_0)(r2)
        fcmpu cr0, f1, f0
        blt cr0, LBB1_2 ;cond_next62.exitStub
LBB1_1: ;bb72.exitStub
        li r3, 1
        blr
LBB1_2: ;cond_next62.exitStub
        li r3, 0
        blr

instead of:

_intcoord_cond_next55:
LBB1_3: ;cond_next55
        lis r2, ha16(LCPI1_0)
        lfs f0, lo16(LCPI1_0)(r2)
        fcmpu cr0, f1, f0
        bge cr0, LBB1_1 ;bb72.exitStub
LBB1_4: ;cond_next55
        lis r2, ha16(LCPI1_0)
        lfs f0, lo16(LCPI1_0)(r2)
        fcmpu cr0, f1, f0
        bnu cr0, LBB1_2 ;cond_next62.exitStub
LBB1_1: ;bb72.exitStub
        li r3, 1
        blr
LBB1_2: ;cond_next62.exitStub
        li r3, 0
        blr

llvm-svn: 31330
2006-10-31 23:06:00 +00:00
Chris Lattner 427301fdae look through isunordered to inline it into branch blocks.
llvm-svn: 31328
2006-10-31 22:37:42 +00:00
Chris Lattner 1fd360e13a handle global address constant sdnodes
llvm-svn: 31323
2006-10-31 20:01:56 +00:00
Chris Lattner 6f043b90ea TargetLowering::isOperandValidForConstraint
llvm-svn: 31319
2006-10-31 19:41:18 +00:00
Chris Lattner 8c6949e5b2 Change the prototype for TargetLowering::isOperandValidForConstraint
llvm-svn: 31318
2006-10-31 19:40:43 +00:00
Chris Lattner 968f803928 Turn an assert into an error message. This is commonly triggered when
we don't support a specific constraint yet.  When this happens, print the
unsupported constraint.

llvm-svn: 31310
2006-10-31 07:33:13 +00:00
Evan Cheng e6d584765f Fix a typo which can break jumptables.
llvm-svn: 31305
2006-10-31 02:31:00 +00:00
Evan Cheng 84a28d4e76 Lower jumptable to BR_JT. The legalizer can lower it to a BRIND or let the target custom lower it.
llvm-svn: 31293
2006-10-30 08:00:44 +00:00
Evan Cheng c3e695137d Added a new SDNode type: BR_JT for jumptable branch.
llvm-svn: 31292
2006-10-30 07:59:36 +00:00
Chris Lattner e60ae823e8 fix Generic/2006-10-29-Crash.ll
llvm-svn: 31281
2006-10-29 21:01:20 +00:00
Chris Lattner f31b9ef458 Fix a load folding issue that Evan noticed: there is no need to export values
used by comparisons in the main block.

llvm-svn: 31279
2006-10-29 18:23:37 +00:00
Evan Cheng 7ab6123c42 VLOAD is not the LoadSDNode opcode.
llvm-svn: 31276
2006-10-29 06:14:47 +00:00
Nick Lewycky dc146a9fb9 Remove spurious case. EXTLOAD is not one of the node opcodes.
llvm-svn: 31275
2006-10-29 02:26:30 +00:00
Chris Lattner bba52191fa split critical edges more carefully and intelligently. In particular, critical
edges whose destinations contain no phi nodes don't bother us.  Also, share
split edges, since the split edge can't have a phi.  This significantly
reduces the complexity of generated code in some cases.

llvm-svn: 31274
2006-10-28 19:22:10 +00:00
Jim Laskey eef273a16f Loads and stores were not being uniqued properly.
llvm-svn: 31261
2006-10-28 17:25:28 +00:00
Chris Lattner 3e6b1c6157 Split *all* critical edges before isel. This resolves issues with spill code
being inserted on unsplit critical edges, which introduces (sometimes large
amounts of) partially dead spill code.

This also fixes PR925 + CodeGen/Generic/switch-crit-edge-constant.ll

llvm-svn: 31260
2006-10-28 17:04:37 +00:00
Chris Lattner b78eb6c8d1 Fix a serious bug that caused any x86 vector stuff to infinite loop
llvm-svn: 31254
2006-10-28 06:15:26 +00:00
Jim Laskey bd0f088743 Clean up.
llvm-svn: 31243
2006-10-27 23:52:51 +00:00
Chris Lattner 84a035056e Fix a bug in merged condition handling (CodeGen/Generic/2006-10-27-CondFolding.ll).
Add many fewer CFG edges and PHI node entries.  If there is a switch which has
the same block as multiple destinations, only add that block once as a successor/phi
node (in the jumptable case)

llvm-svn: 31242
2006-10-27 23:50:33 +00:00
Jim Laskey f576b42bb2 Switch over from SelectionNodeCSEMap to FoldingSet.
llvm-svn: 31240
2006-10-27 23:46:08 +00:00
Chris Lattner b9392fb635 remove debug code
llvm-svn: 31233
2006-10-27 21:58:03 +00:00
Chris Lattner f1b54fd7a5 Codegen cond&cond with two branches. This compiles (f.e.) PowerPC/and-branch.ll to:
cmpwi cr0, r4, 4
        bgt cr0, LBB1_2 ;UnifiedReturnBlock
LBB1_3: ;entry
        cmplwi cr0, r3, 0
        bne cr0, LBB1_2 ;UnifiedReturnBlock

instead of:

        cmpwi cr7, r4, 4
        mfcr r2
        addic r4, r3, -1
        subfe r3, r4, r3
        rlwinm r2, r2, 30, 31, 31
        or r2, r2, r3
        cmplwi cr0, r2, 0
        bne cr0, LBB1_2 ;UnifiedReturnBlock
LBB1_1: ;cond_true

llvm-svn: 31232
2006-10-27 21:54:23 +00:00
Chris Lattner ed0110b949 Turn conditions like x<Y|z==q into multiple blocks.
This compiles Regression/CodeGen/X86/or-branch.ll into:

_foo:
        subl $12, %esp
        call L_bar$stub
        movl 20(%esp), %eax
        movl 16(%esp), %ecx
        cmpl $5, %eax
        jl LBB1_1       #cond_true
LBB1_3: #entry
        testl %ecx, %ecx
        jne LBB1_2      #UnifiedReturnBlock
LBB1_1: #cond_true
        call L_bar$stub
        addl $12, %esp
        ret
LBB1_2: #UnifiedReturnBlock
        addl $12, %esp
        ret

instead of:

_foo:
        subl $12, %esp
        call L_bar$stub
        movl 20(%esp), %eax
        movl 16(%esp), %ecx
        cmpl $4, %eax
        setg %al
        testl %ecx, %ecx
        setne %cl
        testb %cl, %al
        jne LBB1_2      #UnifiedReturnBlock
LBB1_1: #cond_true
        call L_bar$stub
        addl $12, %esp
        ret
LBB1_2: #UnifiedReturnBlock
        addl $12, %esp
        ret

And on ppc to:

        cmpwi cr0, r29, 5
        blt cr0, LBB1_1 ;cond_true
LBB1_3: ;entry
        cmplwi cr0, r30, 0
        bne cr0, LBB1_2 ;UnifiedReturnBlock

instead of:

        cmpwi cr7, r4, 4
        mfcr r2
        addic r4, r3, -1
        subfe r30, r4, r3
        rlwinm r29, r2, 30, 31, 31
        and r2, r29, r30
        cmplwi cr0, r2, 0
        bne cr0, LBB1_2 ;UnifiedReturnBlock

llvm-svn: 31230
2006-10-27 21:36:01 +00:00
Evan Cheng 96d6bf50ae getPreIndexedLoad -> getIndexedLoad.
llvm-svn: 31209
2006-10-26 21:53:40 +00:00
Reid Spencer 7e80b0b31e For PR950:
Make necessary changes to support DIV -> [SUF]Div. This changes llvm to
have three division instructions: signed, unsigned, floating point. The
bytecode and assembler are backwards compatible, however.

llvm-svn: 31195
2006-10-26 06:15:43 +00:00
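
As with the shift split, a single DIV cannot be lowered correctly without knowing whether the operands are signed, unsigned, or floating point; the same bit pattern divides to three different answers. A quick C++ illustration:

    #include <cstdio>

    int main() {
      int      a  = -7,          b  = 2;
      unsigned ua = 0xFFFFFFF9u, ub = 2;   // ua has the same bits as a (32-bit)

      std::printf("%d\n", a / b);          // signed   (SDiv): -3
      std::printf("%u\n", ua / ub);        // unsigned (UDiv): 2147483644
      std::printf("%f\n", -7.0 / 2.0);     // floating (FDiv): -3.5
    }
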
Chris Lattner 61bcf9154d visitSwitchCase knows how to insert conditional branches well. Change
visitBr to just call visitSwitchCase, eliminating duplicate logic.

llvm-svn: 31167
2006-10-24 18:07:37 +00:00
Chris Lattner 963ddad31a Generalize CaseBlock a bit more:
Rename LHSBB/RHSBB to TrueBB/FalseBB.  Allow the RHS value to be null,
in which case the LHS is treated as a bool.

llvm-svn: 31166
2006-10-24 17:57:59 +00:00
Chris Lattner 3f179d24c6 generalize 'CaseBlock'. It really allows any comparison to be inserted.
llvm-svn: 31161
2006-10-24 17:03:35 +00:00
Chris Lattner 4c931502cc Minor tweak. Instead of generating:
movl 32(%esp), %eax
        cmpl $1, %eax
        je LBB1_1       #bb
LBB1_4: #entry
        cmpl $2, %eax
        je LBB1_2       #bb2
        jmp LBB1_3      #UnifiedReturnBlock
LBB1_1: #bb

notice that we would miss the fall through and emit this instead:

        movl 32(%esp), %eax
        cmpl $2, %eax
        je LBB1_2       #bb2
LBB1_4: #entry
        cmpl $1, %eax
        jne LBB1_3      #UnifiedReturnBlock
LBB1_1: #bb

llvm-svn: 31130
2006-10-23 18:38:22 +00:00
Chris Lattner 76a7bc8c55 Fix phi node updating for switches lowered to linear sequences of branches.
llvm-svn: 31125
2006-10-22 23:00:53 +00:00
Chris Lattner 4c3ef4782d disable this code for now, it's not yet safely updating phi nodes
llvm-svn: 31124
2006-10-22 22:47:10 +00:00
Chris Lattner 6d6fc26257 Implement PR964 and Regression/CodeGen/Generic/SwitchLowering.ll
llvm-svn: 31119
2006-10-22 21:36:53 +00:00
Chris Lattner c5ab6ce613 Make flag and chain edges visually distinguishable from value edges in DOT
output.

llvm-svn: 31067
2006-10-20 18:06:09 +00:00
Reid Spencer e0fc4dfc22 For PR950:
This patch implements the first increment for the Signless Types feature.
All changes pertain to removing the ConstantSInt and ConstantUInt classes
in favor of just using ConstantInt.

llvm-svn: 31063
2006-10-20 07:07:24 +00:00
Bill Wendling be96e1cd09 Partially in response to PR926: insert the newly created machine basic
blocks into the basic block list when lowering the switch inst. into a
binary tree of if-then statements. This allows the "visitSwitchCase" function
to take advantage of fall-through behavior.

llvm-svn: 31057
2006-10-19 21:46:38 +00:00
Jim Laskey 55e4dcad36 Add option for controlling inclusion of global AA.
llvm-svn: 31040
2006-10-18 19:08:31 +00:00
Jim Laskey a15b0ebb5e Use global info for alias analysis.
llvm-svn: 31035
2006-10-18 12:29:57 +00:00
Chris Lattner 78fd0f83ff Trivial patch to speed up legalizing common i64 constants.
llvm-svn: 31020
2006-10-17 21:47:13 +00:00
Chris Lattner 327b88b102 Fix CodeGen/PowerPC/2006-10-17-brcc-miscompile.ll
llvm-svn: 31019
2006-10-17 21:24:15 +00:00
Evan Cheng 2f4ddce75c Fix printer for StoreSDNode.
llvm-svn: 31017
2006-10-17 21:18:26 +00:00
Evan Cheng 1839d76f69 Reflect MemOpAddrMode change; added a helper to create pre-indexed load.
llvm-svn: 31016
2006-10-17 21:14:32 +00:00
Jim Laskey e7d2c24a7d Make it simpler to dump DAGs while in DAGCombiner. Remove a nasty optimization.
llvm-svn: 31009
2006-10-17 19:33:52 +00:00
Evan Cheng 1e3a39cd08 Make sure operand does have size and element type operands.
llvm-svn: 30999
2006-10-17 17:06:35 +00:00
Evan Cheng f3ae00a64a Be careful when looking through a vbit_convert. Optimizing this:
(vector_shuffle
  (vbitconvert (vbuildvector (copyfromreg v4f32), 1, v4f32), 4, f32),
  (undef, undef, undef, undef), (0, 0, 0, 0), 4, f32)
to the
  vbitconvert
is a very bad idea.

llvm-svn: 30989
2006-10-16 22:49:37 +00:00
Jim Laskey dcb2b83886 Pass AliasAnalysis thru to DAGCombiner.
llvm-svn: 30984
2006-10-16 20:52:31 +00:00
Jim Laskey 3bf4f3bd60 Tidy up after truncstore changes.
llvm-svn: 30961
2006-10-14 12:14:27 +00:00
Evan Cheng 47fbeda5ce Debug tweak.
llvm-svn: 30959
2006-10-14 08:34:06 +00:00
Chris Lattner 6a1b2de8c4 Make sure that the node returned by SimplifySetCC is added to the worklist
so that it can be deleted if unused.

llvm-svn: 30955
2006-10-14 03:52:46 +00:00
Chris Lattner 0626bd2fbc fold setcc of a setcc.
llvm-svn: 30953
2006-10-14 01:02:29 +00:00
Chris Lattner bd9acad805 When SimplifySetCC was moved to the DAGCombiner, it was never removed from
SelectionDAG and it has since bitrotted.  Remove the copy from SelectionDAG.
Next, remove the constant folding piece of DAGCombiner::SimplifySetCC into
a new FoldSetCC method which can be used by getNode() and SimplifySetCC.

This fixes obscure bugs.

llvm-svn: 30952
2006-10-14 00:41:01 +00:00
Jim Laskey dcf983ce41 Reduce the workload by not adding chain users to work list.
llvm-svn: 30948
2006-10-13 23:32:28 +00:00
Chris Lattner 45ffb1eb70 Fix a bug where we incorrectly turned '(X & 0) == 0' into '(X & 0) >> -1',
which is undefined.  "0" isn't a power of 2.

llvm-svn: 30947
2006-10-13 22:46:18 +00:00
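
A hedged sketch of the precondition the fix enforces, in plain C++ (not the DAG combiner's actual code): rewriting (X & C) == 0 as a test of a single shifted bit only works when C is a nonzero power of two, since C == 0 has no bit index at all.

    #include <bit>
    #include <cassert>
    #include <cstdint>

    // Valid only when C has exactly one set bit; C == 0 must be rejected
    // (the bug shifted by log2(0), i.e. -1, which is undefined).
    bool bitIsClear(uint32_t X, uint32_t C) {
      assert(std::has_single_bit(C) && "fold precondition");
      unsigned k = std::countr_zero(C);   // position of C's single bit
      return ((X >> k) & 1u) == 0;        // same answer as (X & C) == 0
    }

    int main() {
      for (uint32_t X : {0u, 1u, 0x40u, 0xFFFFFFFFu})
        for (uint32_t C : {1u, 0x40u, 0x80000000u})
          assert(bitIsClear(X, C) == ((X & C) == 0));
    }
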
Evan Cheng ab51cf2e78 Merge ISD::TRUNCSTORE to ISD::STORE. Switch to using StoreSDNode.
llvm-svn: 30945
2006-10-13 21:14:26 +00:00
Chris Lattner d0620d2773 Lower X%C into X/C+stuff. This allows the 'division by a constant' logic to
apply to rems as well as divs.  This fixes PR945 and speeds up ReedSolomon
from 14.57s to 10.90s (which is now faster than gcc).

It compiles CodeGen/X86/rem.ll into:

_test1:
        subl $4, %esp
        movl %esi, (%esp)
        movl $2155905153, %ecx
        movl 8(%esp), %esi
        movl %esi, %eax
        imull %ecx
        addl %esi, %edx
        movl %edx, %eax
        shrl $31, %eax
        sarl $7, %edx
        addl %eax, %edx
        imull $255, %edx, %eax
        subl %eax, %esi
        movl %esi, %eax
        movl (%esp), %esi
        addl $4, %esp
        ret
_test2:
        movl 4(%esp), %eax
        movl %eax, %ecx
        sarl $31, %ecx
        shrl $24, %ecx
        addl %eax, %ecx
        andl $4294967040, %ecx
        subl %ecx, %eax
        ret
_test3:
        subl $4, %esp
        movl %esi, (%esp)
        movl $2155905153, %ecx
        movl 8(%esp), %esi
        movl %esi, %eax
        mull %ecx
        shrl $7, %edx
        imull $255, %edx, %eax
        subl %eax, %esi
        movl %esi, %eax
        movl (%esp), %esi
        addl $4, %esp
        ret

instead of div/idiv instructions.

llvm-svn: 30920
2006-10-12 20:58:32 +00:00
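
The reason the division-by-constant machinery carries over to remainders is the identity X % C == X - (X / C) * C, so any cheap constant divide also yields a cheap rem. A minimal C++ check of the identity:

    #include <cassert>
    #include <cstdint>

    // Stand-in for the lowering: compute the quotient however is cheapest
    // (e.g. multiply by a magic reciprocal), then recover the remainder.
    uint32_t remViaDiv(uint32_t X, uint32_t C) {
      uint32_t Q = X / C;
      return X - Q * C;
    }

    int main() {
      for (uint32_t X : {0u, 1u, 254u, 255u, 256u, 1000u})
        assert(remViaDiv(X, 255) == X % 255);
    }
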
Evan Cheng a731cb674a Add RemoveDeadNode to remove a dead node and its (potentially) dead operands.
llvm-svn: 30916
2006-10-12 20:34:05 +00:00
Chris Lattner 2e33fb453b add a minor dag combine noticed when looking at PR945
llvm-svn: 30915
2006-10-12 20:23:19 +00:00
Jim Laskey df2ccc395e D'oh - need to use the right kind of store.
llvm-svn: 30903
2006-10-12 15:22:24 +00:00
Jim Laskey a13b9c7aa4 Alias analysis of TRUNCSTORE.
llvm-svn: 30889
2006-10-11 18:55:16 +00:00
Jim Laskey 6a4c6d3a7a Typo
llvm-svn: 30884
2006-10-11 17:52:19 +00:00
Jim Laskey 0f7c328ae7 Handle aliasing of loadext.
llvm-svn: 30883
2006-10-11 17:47:52 +00:00
Jim Laskey 08edf332ed Fix regression in combiner alias analysis.
llvm-svn: 30880
2006-10-11 13:47:09 +00:00
Evan Cheng d35734bd1f Naming consistency.
llvm-svn: 30878
2006-10-11 07:10:22 +00:00
Andrew Lenharth a6bbf33cbf Jump tables working again on Alpha.
As a bonus, use the GOT node instead of the AlphaISD::GOT for internal stuff.

llvm-svn: 30873
2006-10-11 04:29:42 +00:00
Chris Lattner 6df349676e add two helper methods.
llvm-svn: 30869
2006-10-11 03:58:02 +00:00
Evan Cheng 2da4671e05 FindModifiedNodeSlot needs to add LoadSDNode ivars to create proper SelectionDAGCSEMap ID.
llvm-svn: 30866
2006-10-11 01:47:58 +00:00
Evan Cheng 7994aec7b5 Also update getNodeLabel for LoadSDNode.
llvm-svn: 30861
2006-10-10 20:11:26 +00:00
Evan Cheng fe858538c0 SDNode::dump should also print out extension type and VT.
llvm-svn: 30860
2006-10-10 20:05:10 +00:00
Chris Lattner 8438429c96 Fix another bug in extload promotion.
llvm-svn: 30857
2006-10-10 18:54:19 +00:00
Evan Cheng dc6a3aab71 Fix a bug introduced by my LOAD/LOADX changes.
llvm-svn: 30853
2006-10-10 07:51:21 +00:00
Evan Cheng e71fe34d75 Reflects ISD::LOAD / ISD::LOADX / LoadSDNode changes.
llvm-svn: 30844
2006-10-09 20:57:25 +00:00
Chris Lattner 5ab6d8b3fc Eliminate more token factors by taking advantage of transitivity:
if TF depends on A and B, and A depends on B, TF just needs to depend on
A.  With Jim's alias-analysis stuff enabled, this compiles the testcase in
PR892 into:

__Z4test3Val:
        subl $44, %esp
        call L__Z3foov$stub
        movl %edx, 28(%esp)
        movl %eax, 32(%esp)
        movl %eax, 24(%esp)
        movl %edx, 36(%esp)
        movl 52(%esp), %ecx
        movl %ecx, 4(%esp)
        movl %eax, 8(%esp)
        movl %edx, 12(%esp)
        movl 48(%esp), %eax
        movl %eax, (%esp)
        call L__Z3bar3ValS_$stub
        addl $44, %esp
        ret

instead of:

__Z4test3Val:
        subl $44, %esp
        call L__Z3foov$stub
        movl %eax, 24(%esp)
        movl %edx, 28(%esp)
        movl 24(%esp), %eax
        movl %eax, 32(%esp)
        movl 28(%esp), %eax
        movl %eax, 36(%esp)
        movl 32(%esp), %eax
        movl 36(%esp), %ecx
        movl 52(%esp), %edx
        movl %edx, 4(%esp)
        movl %eax, 8(%esp)
        movl %ecx, 12(%esp)
        movl 48(%esp), %eax
        movl %eax, (%esp)
        call L__Z3bar3ValS_$stub
        addl $44, %esp
        ret

llvm-svn: 30821
2006-10-08 22:57:01 +00:00
Jim Laskey 0463e08005 Combiner alias analysis passes Multisource (release-asserts.)
llvm-svn: 30818
2006-10-07 23:37:56 +00:00
Chris Lattner f9f90bc239 Fix a bug legalizing zero-extending i64 loads into 32-bit loads. The bottom
part was always forced to be sextload, even when we needed a zextload.

llvm-svn: 30782
2006-10-07 00:58:36 +00:00
Chris Lattner a389a612bb initialize ivar
llvm-svn: 30780
2006-10-06 22:52:08 +00:00
Chris Lattner 9d75324ddf jump tables handle pic
llvm-svn: 30776
2006-10-06 22:32:29 +00:00
Chris Lattner f5839a0816 Fix a miscompilation of:
long long foo(long long X) {
  return (long long)(signed char)(int)X;
}

Instead of:

_foo:
        extsb r2, r4
        srawi r3, r4, 31
        mr r4, r2
        blr

we now produce:

_foo:
        extsb r4, r4
        srawi r3, r4, 31
        blr

This fixes a miscompilation in ConstantFolding.cpp.

llvm-svn: 30768
2006-10-06 17:34:12 +00:00
Evan Cheng df9ac47e5e Make use of getStore().
llvm-svn: 30759
2006-10-05 23:01:46 +00:00
Evan Cheng af309d29b1 Add getStore() helper function to create ISD::STORE nodes.
llvm-svn: 30758
2006-10-05 22:57:11 +00:00
Jim Laskey 6549d22ef9 Alias analysis code clean ups.
llvm-svn: 30753
2006-10-05 15:07:25 +00:00
Evan Cheng f80dfa83a0 Fix some typos that can cause a flag value to have more than one use.
llvm-svn: 30727
2006-10-04 22:23:53 +00:00
Jim Laskey 708d0db2d8 More extensive alias analysis.
llvm-svn: 30721
2006-10-04 16:53:27 +00:00
Evan Cheng 5d9fd977d3 Combine ISD::EXTLOAD, ISD::SEXTLOAD, ISD::ZEXTLOAD into ISD::LOADX. Add an
extra operand to LOADX to specify the exact value extension type.

llvm-svn: 30714
2006-10-04 00:56:09 +00:00
Evan Cheng 91d76cb27f Fix an obvious typo.
llvm-svn: 30711
2006-10-03 23:08:27 +00:00
Jim Laskey e73a22514d Debugging kruft
llvm-svn: 30688
2006-10-02 13:01:17 +00:00
Jim Laskey 1368c265da Add ability to annotate (color) nodes in a viewGraph.
llvm-svn: 30686
2006-10-02 12:26:53 +00:00
Chris Lattner a9caf95591 refactor critical edge breaking out into the SplitCritEdgesForPHIConstants method.
This is a baby step towards fixing PR925.

llvm-svn: 30643
2006-09-28 06:17:10 +00:00
Andrew Lenharth c19ef92403 Comments on JumpTableness
llvm-svn: 30615
2006-09-26 20:02:30 +00:00
Jim Laskey 60832693a7 Load chain check is not needed
llvm-svn: 30613
2006-09-26 17:44:58 +00:00
Jim Laskey dde51671e5 Chain can be any operand
llvm-svn: 30611
2006-09-26 09:32:41 +00:00
Jim Laskey 5f3e0af9d0 Wrong size for load
llvm-svn: 30610
2006-09-26 08:14:06 +00:00
Jim Laskey b4a864d533 Can't move a load node if its chain is not used.
llvm-svn: 30609
2006-09-26 07:37:42 +00:00
Jim Laskey 7aa0638aa9 Accidental enable of bad code
llvm-svn: 30601
2006-09-25 21:11:32 +00:00
Jim Laskey b5534e5c28 Fix chain dropping in load and drop unused stores in ret blocks.
llvm-svn: 30600
2006-09-25 19:32:58 +00:00
Jim Laskey d07be232ba Core antialiasing for load and store.
llvm-svn: 30597
2006-09-25 16:29:54 +00:00
Andrew Lenharth 783a4a9d86 Add support for other relocation bases to jump tables, as well as custom asm directives
llvm-svn: 30593
2006-09-24 19:45:58 +00:00
Evan Cheng 77c0757f8b PIC jump table entries are always 32-bit. This fixes PIC jump table support on X86-64.
llvm-svn: 30590
2006-09-24 05:22:38 +00:00
Evan Cheng 449a0c7e33 Make it work for DAG combine of multi-value nodes.
llvm-svn: 30573
2006-09-21 19:04:05 +00:00
Jim Laskey 35f7eebb49 core corrections
llvm-svn: 30570
2006-09-21 17:35:47 +00:00
Jim Laskey 5d19d59017 Basic "in frame" alias analysis.
llvm-svn: 30568
2006-09-21 16:28:59 +00:00
Chris Lattner 082db3f9aa fold (aext (and (trunc x), cst)) -> (and x, cst).
llvm-svn: 30561
2006-09-21 06:40:43 +00:00
Chris Lattner fa9f92cf65 Check the right value type. This fixes 186.crafty on x86
llvm-svn: 30560
2006-09-21 06:17:39 +00:00
Chris Lattner 8d8a3bf9c9 Compile:
int %test(ulong *%tmp) {
        %tmp = load ulong* %tmp         ; <ulong> [#uses=1]
        %tmp.mask = shr ulong %tmp, ubyte 50            ; <ulong> [#uses=1]
        %tmp.mask = cast ulong %tmp.mask to ubyte
        %tmp2 = and ubyte %tmp.mask, 3          ; <ubyte> [#uses=1]
        %tmp2 = cast ubyte %tmp2 to int         ; <int> [#uses=1]
        ret int %tmp2
}

to:

_test:
        movl 4(%esp), %eax
        movl 4(%eax), %eax
        shrl $18, %eax
        andl $3, %eax
        ret

instead of:

_test:
        movl 4(%esp), %eax
        movl 4(%eax), %eax
        shrl $18, %eax
        # TRUNCATE movb %al, %al
        andb $3, %al
        movzbl %al, %eax
        ret

llvm-svn: 30558
2006-09-21 06:14:31 +00:00
Chris Lattner a31f0a622b Generalize (zext (truncate x)) and (sext (truncate x)) folding to work when
the src/dst are not the same size.  This catches things like "truncate
32-bit X to 8 bits, then zext to 16", which happens a bit on X86.

llvm-svn: 30557
2006-09-21 06:00:20 +00:00
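
The "truncate then zero-extend" chain described above collapses to a mask of the original value's low bits. A small C++ check of that equivalence:

    #include <cassert>
    #include <cstdint>

    int main() {
      for (uint32_t X : {0x12345678u, 0xFFFFFFFFu, 0x80u, 0u}) {
        // "truncate i32 to i8, then zext to i16" needs no intermediate value:
        uint16_t viaTrunc = static_cast<uint16_t>(static_cast<uint8_t>(X));
        uint16_t viaMask  = static_cast<uint16_t>(X & 0xFF);
        assert(viaTrunc == viaMask);
      }
    }
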
Chris Lattner c8cd62d381 Compile:
int test3(int a, int b) { return (a < 0) ? a : 0; }

to:

_test3:
        srawi r2, r3, 31
        and r3, r2, r3
        blr

instead of:

_test3:
        cmpwi cr0, r3, 1
        li r2, 0
        blt cr0, LBB2_2 ;entry
LBB2_1: ;entry
        mr r3, r2
LBB2_2: ;entry
        blr


This implements: PowerPC/select_lt0.ll:seli32_a_a

llvm-svn: 30517
2006-09-20 06:41:35 +00:00
Chris Lattner 8746e2cd57 Fold the full generality of (any_extend (truncate x))
llvm-svn: 30514
2006-09-20 06:29:17 +00:00
Chris Lattner 8b68decb27 Two things:
1. teach SimplifySetCC that '(srl (ctlz x), 5) == 0' is really x != 0.
2. Teach visitSELECT_CC to use SimplifySetCC instead of calling it and
   ignoring the result.  This allows us to compile:

bool %test(ulong %x) {
  %tmp = setlt ulong %x, 4294967296
  ret bool %tmp
}

to:

_test:
        cntlzw r2, r3
        cmplwi cr0, r3, 1
        srwi r2, r2, 5
        li r3, 0
        beq cr0, LBB1_2 ;
LBB1_1: ;
        mr r3, r2
LBB1_2: ;
        blr

instead of:

_test:
        addi r2, r3, -1
        cntlzw r2, r2
        cntlzw r3, r3
        srwi r2, r2, 5
        cmplwi cr0, r2, 0
        srwi r2, r3, 5
        li r3, 0
        bne cr0, LBB1_2 ;
LBB1_1: ;
        mr r3, r2
LBB1_2: ;
        blr

This isn't wonderful, but it's an improvement.

llvm-svn: 30513
2006-09-20 06:19:26 +00:00
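
The ctlz trick works because, for a 32-bit value, the leading-zero count is 32 only when the value is zero, and 32 is the only possible result whose shift right by 5 is nonzero. A quick C++ check (std::countl_zero stands in for the ctlz node):

    #include <bit>
    #include <cassert>
    #include <cstdint>

    int main() {
      for (uint32_t x : {0u, 1u, 2u, 42u, 0x80000000u, 0xFFFFFFFFu}) {
        // countl_zero(x) is in [0, 31] for nonzero x and exactly 32 for x == 0,
        // so (ctlz(x) >> 5) == 0 holds exactly when x != 0.
        bool viaClz = (static_cast<unsigned>(std::countl_zero(x)) >> 5) == 0;
        assert(viaClz == (x != 0));
      }
    }
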
Chris Lattner 875ea0cdbd Expand 64-bit shifts more optimally if we know that the high bit of the
shift amount is one or zero.  For example, for:

long long foo1(long long X, int C) {
  return X << (C|32);
}

long long foo2(long long X, int C) {
  return X << (C&~32);
}

we get:

_foo1:
        movb $31, %cl
        movl 4(%esp), %edx
        andb 12(%esp), %cl
        shll %cl, %edx
        xorl %eax, %eax
        ret
_foo2:
        movb $223, %cl
        movl 4(%esp), %eax
        movl 8(%esp), %edx
        andb 12(%esp), %cl
        shldl %cl, %eax, %edx
        shll %cl, %eax
        ret

instead of:

_foo1:
        subl $4, %esp
        movl %ebx, (%esp)
        movb $32, %bl
        movl 8(%esp), %eax
        movl 12(%esp), %edx
        movb %bl, %cl
        orb 16(%esp), %cl
        shldl %cl, %eax, %edx
        shll %cl, %eax
        xorl %ecx, %ecx
        testb %bl, %bl
        cmovne %eax, %edx
        cmovne %ecx, %eax
        movl (%esp), %ebx
        addl $4, %esp
        ret
_foo2:
        subl $4, %esp
        movl %ebx, (%esp)
        movb $223, %cl
        movl 8(%esp), %eax
        movl 12(%esp), %edx
        andb 16(%esp), %cl
        shldl %cl, %eax, %edx
        shll %cl, %eax
        xorl %ecx, %ecx
        xorb %bl, %bl
        testb %bl, %bl
        cmovne %eax, %edx
        cmovne %ecx, %eax
        movl (%esp), %ebx
        addl $4, %esp
        ret

llvm-svn: 30506
2006-09-20 03:38:48 +00:00
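
Why knowing the high bit of the shift amount helps: on a 32-bit machine, if the amount is known to be at least 32 (as in X << (C|32)), the low result word is always zero and the high word is just the low input word shifted by amt & 31, so no conditional shld/cmov sequence is needed. A sketch of that case:

    #include <cassert>
    #include <cstdint>

    // 64-bit shift-left from 32-bit halves, assuming the amount has bit 5 set.
    uint64_t shl64AmountAtLeast32(uint32_t lo, uint32_t hi, unsigned amt) {
      assert(amt >= 32 && amt < 64);
      (void)hi;                                    // fully shifted out
      uint32_t newHi = lo << (amt & 31);
      return static_cast<uint64_t>(newHi) << 32;   // low word is zero
    }

    int main() {
      uint64_t X = 0x0123456789ABCDEFull;
      for (unsigned amt = 32; amt < 64; ++amt)
        assert(shl64AmountAtLeast32(static_cast<uint32_t>(X),
                                    static_cast<uint32_t>(X >> 32), amt)
               == (X << amt));
    }
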
Chris Lattner 5a42ebcff3 Fold extract_element(cst) to cst
llvm-svn: 30478
2006-09-19 05:02:39 +00:00
Chris Lattner 4c059f4962 Minor speedup for legalize by avoiding some malloc traffic
llvm-svn: 30477
2006-09-19 04:51:23 +00:00
Evan Cheng 1fc7c363e6 Fix a typo.
llvm-svn: 30474
2006-09-18 23:28:33 +00:00
Evan Cheng 4bfaf0bd2c Allow i32 UDIV, SDIV, UREM, SREM to be expanded into libcalls.
llvm-svn: 30470
2006-09-18 21:49:04 +00:00
Andrew Lenharth c50458fb90 absolute addresses must match pointer size
llvm-svn: 30461
2006-09-18 17:59:35 +00:00
Chris Lattner e50f5d1fb1 Oh yeah, this is needed too
llvm-svn: 30407
2006-09-16 05:08:34 +00:00
Chris Lattner 1b63391fdf simplify control flow, no functionality change
llvm-svn: 30403
2006-09-16 00:21:44 +00:00
Chris Lattner fbadbda6ba Allow custom expand of mul
llvm-svn: 30402
2006-09-16 00:09:24 +00:00
Chris Lattner 46d710e6ea Fold (X & C1) | (Y & C2) -> (X|Y) & C3 when possible.
This implements CodeGen/X86/and-or-fold.ll

llvm-svn: 30379
2006-09-14 21:11:37 +00:00
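
One easy instance of the "when possible" condition: when the two masks agree, the combined mask C3 is just that mask, so the two ANDs collapse into one after the OR. A small C++ check of that instance (the general condition in the combiner is stricter):

    #include <cassert>
    #include <cstdint>

    int main() {
      const uint32_t C = 0x00FF00FFu;
      for (uint32_t X : {0u, 0xDEADBEEFu, 0x12345678u})
        for (uint32_t Y : {0u, 0xCAFEBABEu, 0x0F0F0F0Fu})
          assert(((X & C) | (Y & C)) == ((X | Y) & C));   // (X|Y) & C3, C3 == C
    }
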
Chris Lattner 97614c86ce Split rotate matching code out to its own function. Make it stronger, by
matching things like ((x >> c1) & c2) | ((x << c3) & c4) to (rot x, c5) & c6

llvm-svn: 30376
2006-09-14 20:50:57 +00:00
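
The unmasked core of that rotate pattern is the classic shift pair: an OR of a right shift and the complementary left shift of the same value is a rotate. A quick C++ check (std::rotr stands in for the ROTR node; the masked form adds the c2/c4 constants on top of this):

    #include <bit>
    #include <cassert>
    #include <cstdint>

    int main() {
      uint32_t x = 0x12345678u;
      for (unsigned c = 1; c < 32; ++c)
        assert(((x >> c) | (x << (32 - c))) == std::rotr(x, c));
    }
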
Chris Lattner 84cc1f7cb8 If LSR went through a lot of trouble to put a constant (e.g. the addr of a global)
in a specific BB, don't undo this!  This allows us to compile
CodeGen/X86/loop-hoist.ll into:

_foo:
        xorl %eax, %eax
***     movl L_Arr$non_lazy_ptr, %ecx
        movl 4(%esp), %edx
LBB1_1: #cond_true
        movl %eax, (%ecx,%eax,4)
        incl %eax
        cmpl %edx, %eax
        jne LBB1_1      #cond_true
LBB1_2: #return
        ret

instead of:

_foo:
        xorl %eax, %eax
        movl 4(%esp), %ecx
LBB1_1: #cond_true
***     movl L_Arr$non_lazy_ptr, %edx
        movl %eax, (%edx,%eax,4)
        incl %eax
        cmpl %ecx, %eax
        jne LBB1_1      #cond_true
LBB1_2: #return
        ret

This was noticed in 464.h264ref.  This doesn't usually affect PPC,
but strikes X86 all the time.

llvm-svn: 30290
2006-09-13 06:02:42 +00:00
Chris Lattner 72b503bcad Compile X << 1 (where X is a long-long) to:
addl %ecx, %ecx
        adcl %eax, %eax

instead of:

        movl %ecx, %edx
        addl %edx, %edx
        shrl $31, %ecx
        addl %eax, %eax
        orl %ecx, %eax

and to:

        addc r5, r5, r5
        adde r4, r4, r4

instead of:

        slwi r2,r9,1
        srwi r0,r11,31
        slwi r3,r11,1
        or r2,r0,r2

on PPC.

llvm-svn: 30284
2006-09-13 03:50:39 +00:00
Evan Cheng 45fe3bc72c Added support for machine specific constantpool values. These are useful for
representing expressions that can only be resolved at link time, etc.

llvm-svn: 30278
2006-09-12 21:00:35 +00:00
Chris Lattner 2e0dfb0b16 This code was trying too hard. By eliminating redundant edges in the CFG
due to switch cases going to the same place, it made #pred != #phi entries,
breaking live interval analysis.

This fixes 458.sjeng on x86 with llc.

llvm-svn: 30236
2006-09-10 06:36:57 +00:00
Chris Lattner f0359b343a Implement the fpowi now by lowering to a libcall
llvm-svn: 30225
2006-09-09 06:03:30 +00:00
Chris Lattner e4bbb6c341 Allow targets to custom lower expanded BIT_CONVERT's
llvm-svn: 30217
2006-09-09 00:20:27 +00:00
Chris Lattner 707339a57b Fix CodeGen/Generic/2006-09-06-SwitchLowering.ll, a bug where SDIsel inserted
too many phi operands when lowering a switch to branches in some cases.

llvm-svn: 30142
2006-09-07 01:59:34 +00:00
Chris Lattner 0dce3311c4 Change the default to 0, which means 'default'.
llvm-svn: 30114
2006-09-05 17:39:15 +00:00
Chris Lattner af23f9b5f6 Completely eliminate def&use operands. Now a register operand is EITHER a
def operand or a use operand.

llvm-svn: 30109
2006-09-05 02:31:13 +00:00
Duraid Madina 373be1d1a2 forgot this
llvm-svn: 30097
2006-09-04 07:44:11 +00:00
Evan Cheng e93762d36e Allow the legalizer to expand ISD::MUL using only MULHS in the rare case that this is
possible and the target only supports MULHS.

llvm-svn: 30022
2006-09-01 18:17:58 +00:00
Evan Cheng 31305c45da DAG combiner fix for rotates. Previously the outer-most condition checked
for ROTL availability, which prevented it from forming ROTR for targets that
have ROTR only.

llvm-svn: 29997
2006-08-31 07:41:12 +00:00
Evan Cheng e5570a4c3f Move isCommutativeBinOp from SelectionDAG.cpp and DAGCombiner.cpp out. Make it a static method of SelectionDAG.
llvm-svn: 29951
2006-08-29 06:42:35 +00:00
Chris Lattner 3d27be1333 s|llvm/Support/Visibility.h|llvm/Support/Compiler.h|
llvm-svn: 29911
2006-08-27 12:54:02 +00:00
Evan Cheng 849f4bf8dd Eliminate SelectNodeTo() and getTargetNode() variants which take more than
3 SDOperand operands. They are replaced by versions which take an array
of SDOperand and the number of operands.

llvm-svn: 29905
2006-08-27 08:08:54 +00:00
Evan Cheng 34b70eea5c SelectNodeTo now returns a SDNode*.
llvm-svn: 29901
2006-08-26 08:00:10 +00:00
Chris Lattner 451b099113 Fix PR861
llvm-svn: 29796
2006-08-21 20:24:53 +00:00
Chris Lattner d86418ab20 switch the SUnit pred/succ sets from being std::sets to being smallvectors.
This reduces selectiondag time on kc++ from 5.43s to 4.98s (9%).  More
significantly, this speeds up the default ppc scheduler from ~1571ms to 1063ms,
a 33% speedup.

llvm-svn: 29743
2006-08-17 00:09:56 +00:00
Chris Lattner 65879caf07 minor changes.
llvm-svn: 29740
2006-08-16 22:57:46 +00:00
Chris Lattner a4f3625c23 Use the appropriate typedef
llvm-svn: 29730
2006-08-16 20:59:32 +00:00
Chris Lattner a5a3eafbd0 Start using SDVTList more consistently
llvm-svn: 29711
2006-08-15 19:11:05 +00:00
Chris Lattner f98411a220 add a new SDVTList type and new SelectionDAG::getVTList methods to streamline
the creation of canonical VTLists.

llvm-svn: 29709
2006-08-15 17:46:01 +00:00
Chris Lattner bd8877744b eliminate use of getNode that takes vector of valuetypes.
llvm-svn: 29687
2006-08-14 23:53:35 +00:00
Chris Lattner 3bf4be453f Add a new getNode() method that takes a pointer to an already-intern'd list
of value-type nodes.  This avoids having to do mallocs for std::vectors of
valuetypes when a node returns more than one type.

llvm-svn: 29685
2006-08-14 23:31:51 +00:00
Chris Lattner e93a39f2d7 remove SelectionDAG::InsertISelMapEntry, it is dead
llvm-svn: 29677
2006-08-14 22:24:39 +00:00
Chris Lattner 63268f0672 Add code to resize the CSEMap hash table. This doesn't speedup codegen of
kimwitu, but seems like a good idea from a "avoid performance cliffs" standpoint :)

llvm-svn: 29675
2006-08-14 22:19:25 +00:00
Chris Lattner 8e37283d8b Add the actual constant to the hash for ConstantPool nodes. Thanks to
Rafael Espindola for pointing this out.

llvm-svn: 29669
2006-08-14 20:12:44 +00:00
Chris Lattner 0a60294fa0 Switch to using SuperFastHash instead of adding all elements together. This
doesn't significantly improve performance but it helps a small amount.

llvm-svn: 29642
2006-08-12 01:07:10 +00:00
Chris Lattner 04aa034f38 Switch NodeID to track 32-bit chunks instead of 8-bit chunks, for a 2.5%
speedup in isel time.

llvm-svn: 29640
2006-08-11 23:55:53 +00:00
Chris Lattner 0c2e5412bb Remove 8 more std::map's.
llvm-svn: 29631
2006-08-11 21:55:30 +00:00
Chris Lattner 3f16b201e2 Move the BBNodes, GlobalValues, TargetGlobalValues, Constants, TargetConstants,
RegNodes, and ValueNodes maps into the CSEMap.

llvm-svn: 29626
2006-08-11 21:01:22 +00:00
Chris Lattner fcb16470ec eliminate the NullaryOps map, use CSEMap instead.
llvm-svn: 29621
2006-08-11 18:38:11 +00:00
Chris Lattner 6f22ebd8be change internal impl of dag combiner so that calls to CombineTo never have to
make a temporary vector.

llvm-svn: 29618
2006-08-11 17:56:38 +00:00
Chris Lattner a2f4086828 Change one ReplaceAllUsesWith method to take an array of operands to replace
instead of a vector of operands.

llvm-svn: 29616
2006-08-11 17:46:28 +00:00
Chris Lattner c24a1d3093 Start eliminating temporary vectors used to create DAG nodes. Instead, pass
in the start of an array and a count of operands where applicable.  In many
cases, the number of operands is known, so this static array can be allocated
on the stack, avoiding the heap.  In many other cases, a SmallVector can be
used, which has the same benefit in the common cases.

I updated a lot of code calling getNode that takes a vector, but ran out of
time.  The rest of the code should be updated, and these methods should be
removed.

We should also do the same thing to eliminate the methods that take a
vector of MVT::ValueTypes.

It would be extra nice to convert the dagiselemitter to avoid creating vectors
for operands when calling getTargetNode.

llvm-svn: 29566
2006-08-08 02:23:42 +00:00
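
The shape of the interface being moved to, sketched with made-up names (not the real SelectionDAG signatures): take a pointer to the first operand plus a count, so callers with a statically known operand count can pass a stack array instead of building a std::vector on the heap.

    #include <cstdio>

    struct Op { int Id; };                       // stand-in for SDOperand

    void buildNode(unsigned Opcode, const Op* Ops, unsigned NumOps) {
      std::printf("opcode %u, %u operands\n", Opcode, NumOps);
      for (unsigned i = 0; i != NumOps; ++i)
        std::printf("  op %d\n", Ops[i].Id);
    }

    int main() {
      Op Ops[] = {{1}, {2}, {3}};                // stack array, no heap traffic
      buildNode(42, Ops, 3);
    }
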
Chris Lattner 97af9d5d3a Eliminate some malloc traffic by allocating vectors on the stack. Change some
method that took std::vector<SDOperand> to take a pointer to a first operand
and #operands.

This speeds up isel on kc++ by about 3%.

llvm-svn: 29561
2006-08-08 01:09:31 +00:00
Chris Lattner 1ee75ce65d Revamp the "CSEMap" datastructure used in the SelectionDAG class. This
eliminates a bunch of std::map's in the SelectionDAG, replacing them with a
home-grown hashtable.

This is still a work in progress: not all the maps have been moved over and the
hashtable never resizes.  That said, this still speeds up llc 20% on kimwitu++
with -fast -regalloc=local using a release build.

llvm-svn: 29550
2006-08-07 23:03:03 +00:00
Evan Cheng 445b91a041 Clear TopOrder before assigning topological order. Some clean ups.
llvm-svn: 29546
2006-08-07 22:13:29 +00:00
Evan Cheng 1640ae5a84 Reverse the FlaggedNodes after scanning up for flagged preds or else the order would be reversed.
llvm-svn: 29545
2006-08-07 22:12:12 +00:00
Chris Lattner 8927c875bb Make SelectionDAG::RemoveDeadNodes iterative instead of recursive, which
also make it simpler.

llvm-svn: 29524
2006-08-04 17:45:20 +00:00
Jim Laskey a5b707e3ad Copy the liveins for the first block. PR859
llvm-svn: 29511
2006-08-03 20:51:06 +00:00
Chris Lattner 524c1a21f2 Work around a GCC 3.3.5 bug noticed by a user.
llvm-svn: 29490
2006-08-03 00:18:59 +00:00
Evan Cheng bba1ebda32 - Change AssignTopologicalOrder to return vector of SDNode* by reference.
- Tweak implementation to avoid using std::map.

llvm-svn: 29479
2006-08-02 22:00:34 +00:00
Jim Laskey 29e635d3c9 Final polish on machine pass registries.
llvm-svn: 29471
2006-08-02 12:30:23 +00:00
Jim Laskey 17c67efe8a Now that the ISel is available, it's possible to create a default instruction
scheduler creator.

llvm-svn: 29452
2006-08-01 19:14:14 +00:00
Jim Laskey 03593f72db 1. Change use of "Cache" to "Default".
2. Added argument to instruction scheduler creators so the creators can do
special things.
3. Repaired target hazard code.
4. Misc.

More to follow.

llvm-svn: 29450
2006-08-01 18:29:48 +00:00
Jim Laskey 95eda5b1f3 Introducing pluggable register allocators and instruction schedulers.
llvm-svn: 29434
2006-08-01 14:21:23 +00:00
Evan Cheng 9631a60020 Added AssignTopologicalOrder() to assign each node a unique id based on its topological order.
llvm-svn: 29431
2006-08-01 08:20:41 +00:00
Evan Cheng 6ae6ac1216 PIC jump table entries are always 32-bit even in 64-bit mode.
llvm-svn: 29422
2006-08-01 01:03:13 +00:00
Evan Cheng b572401bea Remove InFlightSet hack. No longer needed.
llvm-svn: 29373
2006-07-28 00:47:19 +00:00
Nate Begeman efc312a5c7 Code cleanups, per review
llvm-svn: 29347
2006-07-27 16:46:58 +00:00
Evan Cheng acb606ff33 AssignNodeIds should return unsigned.
llvm-svn: 29343
2006-07-27 07:36:47 +00:00
Evan Cheng 29eefc164c AssignNodeIds assigns each node in the DAG a unique id.
llvm-svn: 29337
2006-07-27 06:39:06 +00:00
Chris Lattner 85ea83e821 Add some advice
llvm-svn: 29324
2006-07-27 04:24:14 +00:00
Nate Begeman 787565024a Support jump tables when in PIC relocation model
llvm-svn: 29318
2006-07-27 01:13:04 +00:00
Chris Lattner 4488f0c303 Fix a case where LegalizeAllNodesNotLeadingTo could take exponential time.
This manifested itself as a really long time to compile
Regression/CodeGen/Generic/2003-05-28-ManyArgs.ll on ppc.
This is PR847.

llvm-svn: 29313
2006-07-26 23:55:56 +00:00
Reid Spencer 421475cd3b For PR780:
1. Move IncludeFile.h to System library
2. Move IncludeFile.cpp to System library
3. #1 and #2 required to prevent cyclic library dependencies for libSystem
4. Convert all existing uses of Support/IncludeFile.h to System/IncludeFile.h
5. Add IncludeFile support to various lib/System classes.
6. Add new lib/System classes to LinkAllVMCore.h
All this in an attempt to pull in lib/System to what's required for VMCore

llvm-svn: 29287
2006-07-26 16:18:00 +00:00
Reid Spencer 658b9476f0 Initialize some variables the compiler warns about.
llvm-svn: 29277
2006-07-25 20:44:41 +00:00
Jim Laskey 4e153f1b91 Use an enumeration to eliminate data relocations.
llvm-svn: 29249
2006-07-21 20:57:35 +00:00
Evan Cheng 7c970b98d0 If a shuffle is a splat, check if the argument is a build_vector with all elements being the same. If so, return the argument.
llvm-svn: 29242
2006-07-21 08:25:53 +00:00
Chris Lattner 55782c6c41 Build more debugger/selectiondag libraries as archives instead of .o files.
This works around bugs in some versions of the cygwin linker.

Patch contributed by Anton Korobeynikov.

llvm-svn: 29239
2006-07-21 00:10:47 +00:00
Evan Cheng 8472e0c4af If a shuffle is unary, i.e. one of the vector arguments is not needed, turn the
operand into an undef and adjust the mask accordingly.

llvm-svn: 29232
2006-07-20 22:44:41 +00:00
Chris Lattner b030532910 Mems can be in the output list also. This is the second half of a fix for
PR833

llvm-svn: 29224
2006-07-20 19:02:21 +00:00
Andrew Lenharth ec104a2b41 80 cols
llvm-svn: 29221
2006-07-20 17:43:27 +00:00
Andrew Lenharth c496b418b5 Reduce number of exported symbols
llvm-svn: 29220
2006-07-20 17:28:38 +00:00
Chris Lattner c0973edc69 Add an out-of-line virtual method for the sdnode class to give it a home.
llvm-svn: 29192
2006-07-19 00:00:37 +00:00
Jim Laskey f7300b2706 It was pointed out that DEBUG() is only available with -debug.
llvm-svn: 29106
2006-07-11 18:25:13 +00:00
Jim Laskey c3d341ea98 Ensure that dump calls that are associated with asserts are removed from
non-debug build.

llvm-svn: 29105
2006-07-11 17:58:07 +00:00
Chris Lattner 1b8ea1f5ba Fix CodeGen/Alpha/2006-07-03-ASMFormalLowering.ll and PR818.
llvm-svn: 29099
2006-07-11 01:40:09 +00:00
Evan Cheng d19938834b Ugly hack! Add helper functions InsertInFlightSetEntry and
RemoveInFlightSetEntry. They are used in place of direct set operators to
reduce instruction selection function stack size.

llvm-svn: 28987
2006-06-29 23:57:05 +00:00
Chris Lattner 996795b0dd Use hidden visibility to make symbols in an anonymous namespace get
dropped.  This shrinks libllvmgcc.dylib another 67K

llvm-svn: 28975
2006-06-28 23:17:24 +00:00
Chris Lattner e097e6f7c7 Shave another 27K off libllvmgcc.dylib with visibility hidden
llvm-svn: 28973
2006-06-28 22:17:39 +00:00
Chris Lattner 54a34cd20b Mark these two classes as hidden, shrinking libllvmgcc.dylib by 25K
llvm-svn: 28970
2006-06-28 21:58:30 +00:00
Chris Lattner 710b3d5ea1 Fix CodeGen/Generic/2006-06-28-SimplifySetCCCrash.ll
llvm-svn: 28965
2006-06-28 18:29:47 +00:00
Reid Spencer ee7eaa25cf For PR801:
Refactor the Graph writing code to use a common implementation which is
now in lib/Support/GraphWriter.cpp. This completes the PR.

Patch by Anton Korobeynikov. Thanks, Anton!

llvm-svn: 28925
2006-06-27 16:49:46 +00:00
Evan Cheng ef9e07d3f0 Consistency. EXTRACT_ELEMENT index operand should have ptr type.
llvm-svn: 28795
2006-06-15 08:11:54 +00:00
Evan Cheng 55772ccfd6 Instructions with variable operands (variable_ops) can have a number of required
operands, e.g.
def CALL32r : I<0xFF, MRM2r, (ops GR32:$dst, variable_ops),
                "call {*}$dst", [(X86call GR32:$dst)]>;
TableGen should emit operand informations for the "required" operands.

Added a target instruction info flag M_VARIABLE_OPS to indicate the target
instruction may have more operands in addition to the minimum required
operands.

llvm-svn: 28791
2006-06-15 07:22:16 +00:00
Chris Lattner 32d92e004d Make sure to update the CFG correctly if a switch only has a default dest.
This fixes CodeGen/Generic/2006-06-12-LowerSwitchCrash.ll

llvm-svn: 28755
2006-06-12 18:25:29 +00:00
Andrew Lenharth 0e57b2cb92 Start on my todo list
llvm-svn: 28752
2006-06-12 16:07:18 +00:00
Chris Lattner c03a9259c0 Fix X86/inline-asm.ll:test2, a case where an input value was implicitly
truncated.

llvm-svn: 28733
2006-06-08 18:27:11 +00:00
Chris Lattner 705948d742 Fix Regression/CodeGen/X86/inline-asm.ll, a case where inline asm requires us to
implement extension of a register.

llvm-svn: 28731
2006-06-08 18:22:48 +00:00
Reid Spencer 614cb2ff82 For PR798:
Provide GraphViz support for MingW32. Patch provided by Anton Korobeynikov

llvm-svn: 28688
2006-06-05 16:26:06 +00:00
Reid Spencer a647c7ff42 Use archive libraries instead of object files for VMCore, BCReader,
BCWriter, and bzip2 libraries. Adjust the various makefiles to accommodate
these changes. This was done to speed up link times.

llvm-svn: 28610
2006-06-01 01:30:27 +00:00
Evan Cheng 0c0996a97b commuteInstruction() does not always create a new MI!
llvm-svn: 28592
2006-05-31 18:03:39 +00:00
Evan Cheng 9d91caa053 Eliminate a memory leak.
llvm-svn: 28585
2006-05-31 07:13:03 +00:00
Evan Cheng 64d2846017 visitVBinOp: Can't fold divide by zero!
llvm-svn: 28584
2006-05-31 06:08:35 +00:00
Evan Cheng d12c97d23a Make sure the register pressure reduction schedulers work for non-uniform
latency targets, e.g. PPC32.

llvm-svn: 28561
2006-05-30 18:05:39 +00:00
Evan Cheng 61e9f0d680 When a priority_queue is empty, the behavior of top() operator is
non-deterministic. Returns NULL when it's empty!

llvm-svn: 28560
2006-05-30 18:04:34 +00:00
Chris Lattner 8f872d2091 Fix a nasty dag combiner bug that caused nondeterministic crashes (MY FAVORITE!):
SimplifySelectOps would eliminate a Select, delete it, then return true.

The clients would see that it did something and return null.

The top level would see a null return, and decide that nothing happened,
proceeding to process the node in other ways: boom.

The fix is simple: clients of SimplifySelectOps should return the select
node itself.

In order to catch really obnoxious bugs like this in the future, add an
assert that nodes are not deleted.  We do this by checking for a sentry node
type that the SDNode dtor sets when a node is destroyed.

llvm-svn: 28514
2006-05-27 00:43:02 +00:00
Evan Cheng 21dee4e0b2 Make CALL node consistent with RET node. Signness of value has type MVT::i32
instead of MVT::i1. Either is fine except MVT::i32 is probably a legal type
for most (if not all) platforms while MVT::i1 is not.

llvm-svn: 28511
2006-05-26 23:13:20 +00:00
Evan Cheng a2e9953c54 Change RET node to include signness information of the return values. e.g.
RET chain, value1, sign1, value2, sign2

llvm-svn: 28509
2006-05-26 23:09:09 +00:00
Evan Cheng 009f5f55f7 Turn on -sched-commute-nodes by default.
llvm-svn: 28465
2006-05-25 08:37:31 +00:00
Evan Cheng 4582771f3f CALL node change: now including signness of every argument.
llvm-svn: 28461
2006-05-25 00:55:32 +00:00
Chris Lattner aa2372562e Patches to make the LLVM sources more -pedantic clean. Patch provided
by Anton Korobeynikov!  This is a step towards closing PR786.

llvm-svn: 28447
2006-05-24 17:04:05 +00:00
Evan Cheng ac4f66ff24 -enable-unsafe-fp-math implies -enable-finite-only-fp-math
llvm-svn: 28437
2006-05-23 18:18:46 +00:00
Vladimir Prus df1d439849 Fix missing include
llvm-svn: 28435
2006-05-23 13:43:15 +00:00
Evan Cheng 1c5b7d12df Incorrect SETCC CondCode used for FP comparisons.
llvm-svn: 28433
2006-05-23 06:40:47 +00:00
Evan Cheng d8e2f6ebc1 lib/Target/Target.td
llvm-svn: 28386
2006-05-18 20:42:07 +00:00
Chris Lattner 7949c2e8b2 Fix the result of the call to use a correct vbitconvert. There is no need to
use getPackedTypeBreakdown at all here.

llvm-svn: 28365
2006-05-17 20:49:36 +00:00
Chris Lattner 938155ca57 Correct a previous patch which broke CodeGen/PowerPC/vec_call.ll
llvm-svn: 28364
2006-05-17 20:43:21 +00:00
Evan Cheng 751cd7653d Fixed a LowerCallTo and LowerArguments bug. They were introducing illegal
VBIT_VECTOR nodes. There was some confusion about the semantics of
getPackedTypeBreakdown(). e.g. for <4 x f32> it returns 1 and v4f32, not 4,
and f32.

llvm-svn: 28352
2006-05-17 18:16:39 +00:00
Chris Lattner 62f1b83c0e When we legalize target nodes, do not use getNode to create a new node,
use UpdateNodeOperands to just update the operands!  This is important because
getNode will allocate a new node if the node returns a flag and this breaks
assumptions in the legalizer that you can legalize some things multiple times
and get exactly the same results.

This latent bug was exposed by my ppc patch last night, and this fixes
gsm/toast.

llvm-svn: 28348
2006-05-17 18:00:08 +00:00
Chris Lattner a1cec0106a Add an assertion, avoid some unneeded work for each call. No functionality
change.

llvm-svn: 28347
2006-05-17 17:55:45 +00:00
Chris Lattner b77ba73a29 Add support for calls that pass and return legal vectors.
llvm-svn: 28340
2006-05-16 23:39:44 +00:00
Chris Lattner aaa23d953f Add a new ISD::CALL node, make the default impl of TargetLowering::LowerCallTo
produce it.

llvm-svn: 28338
2006-05-16 22:53:20 +00:00
Andrew Lenharth 1dc9ec5874 Move this code to a common place
llvm-svn: 28329
2006-05-16 17:42:15 +00:00
Chris Lattner 3d82699605 Add a chain to FORMAL_ARGUMENTS. This is a minimal port of the X86 backend,
it doesn't currently use/maintain the chain properly.  Also, make the
X86ISelLowering.cpp file 80-col clean.

llvm-svn: 28320
2006-05-16 06:45:34 +00:00
Chris Lattner 957cb6733a Move function-live-in-handling code from the sdisel code to the scheduler.
This code should be emitted after legalize, so it can't be in sdisel.

Note that the EmitFunctionEntryCode hook should be updated to operate on the
DAG.  The X86 backend is the only one currently using this hook.

llvm-svn: 28315
2006-05-16 06:10:58 +00:00
Chris Lattner 5f0edfb849 Legalize FORMAL_ARGUMENTS nodes correctly, we don't want to legalize them once
for each argument.

llvm-svn: 28313
2006-05-16 05:49:56 +00:00
Evan Cheng 99f2f79e2f Fixing 2006-05-01-SchedCausingSpills.ll; some clean up
llvm-svn: 28279
2006-05-13 08:22:24 +00:00
Evan Cheng d1915cfa6f Revert an unintended change
llvm-svn: 28278
2006-05-13 05:53:47 +00:00
Chris Lattner 69a0ce6261 Merge identical code.
llvm-svn: 28274
2006-05-13 02:11:14 +00:00
Chris Lattner 53cdb2f2b0 Remove dead vars
llvm-svn: 28255
2006-05-12 18:06:45 +00:00
Chris Lattner da076e41ab remove dead vars
llvm-svn: 28254
2006-05-12 18:04:28 +00:00
Chris Lattner afe72481f6 Comment out dead variables
llvm-svn: 28252
2006-05-12 17:57:54 +00:00
Chris Lattner 8c02c3f41a Compile:
        %tmp152 = setgt uint %tmp144, %tmp149           ; <bool> [#uses=1]
        %tmp159 = setlt uint %tmp144, %tmp149           ; <bool> [#uses=1]
        %bothcond2 = or bool %tmp152, %tmp159           ; <bool> [#uses=1]

To setne, not setune (which caused an assertion failure).

llvm-svn: 28244
2006-05-12 17:03:46 +00:00
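The identity behind that rewrite: for unsigned values, (a > b) || (a < b) holds
exactly when a != b. A minimal C++ check of the identity (an illustration only;
combinedCompare is a made-up name, this is not the DAG combiner code):

#include <cassert>
#include <cstdint>

// (a > b) || (a < b) is equivalent to a != b for unsigned values,
// so the or-of-setcc above can be lowered to a single setne.
bool combinedCompare(uint32_t a, uint32_t b) { return (a > b) || (a < b); }

int main() {
  const uint32_t samples[] = {0u, 1u, 7u, 0xFFFFFFFFu};
  for (uint32_t a : samples)
    for (uint32_t b : samples)
      assert(combinedCompare(a, b) == (a != b));  // identity holds
  return 0;
}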
Owen Anderson 8c2c1e90c4 Refactor a bunch of includes so that TargetMachine.h doesn't have to include
TargetData.h.  This should make recompiles a bit faster with my current
TargetData tinkering.

llvm-svn: 28238
2006-05-12 06:33:49 +00:00
Evan Cheng 095c9d9b7f Duh. That could take a long time.
llvm-svn: 28235
2006-05-12 06:05:18 +00:00
Chris Lattner 66adee93aa Two simplifications for token factor nodes: simplify tf(x,x) -> x.
simplify tf(x,y,y,z) -> tf(x,y,z).

llvm-svn: 28233
2006-05-12 05:01:37 +00:00
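A toy model of those two rewrites (plain C++, not the SelectionDAG code; the
chain operands are stand-in ints and simplifyTokenFactor is a made-up name):
tf(x,y,y,z) -> tf(x,y,z) is order-preserving de-duplication of the operand
list, and tf(x,x) -> x is the special case where a single operand is left.

#include <set>
#include <vector>

// Drop repeated operands, keeping the first occurrence of each.
std::vector<int> simplifyTokenFactor(const std::vector<int> &Ops) {
  std::set<int> Seen;
  std::vector<int> Out;
  for (int Op : Ops)
    if (Seen.insert(Op).second)   // true only for the first occurrence
      Out.push_back(Op);
  return Out;   // if Out.size() == 1, the caller can use Out[0] directly
}

int main() {
  std::vector<int> tf = simplifyTokenFactor({1, 2, 2, 3});  // -> {1, 2, 3}
  return tf.size() == 3 ? 0 : 1;
}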
Evan Cheng afed73eebe Add capability to scheduler to commute nodes for profit.
If a two-address instruction's first operand has uses further down the schedule,
it should be commuted when possible.

llvm-svn: 28230
2006-05-12 01:58:24 +00:00
Evan Cheng d38c22bdd3 Refactor scheduler code. Move register-reduction list scheduler to a
separate file. Added an initial implementation of top-down register pressure
reduction list scheduler.

llvm-svn: 28226
2006-05-11 23:55:42 +00:00
Evan Cheng 9665ba053f Templatify RegReductionPriorityQueue
llvm-svn: 28212
2006-05-10 06:16:44 +00:00
Nate Begeman 1a225d23ae Fix PR773
llvm-svn: 28207
2006-05-09 18:20:51 +00:00
Evan Cheng 7d693898ee Add pseudo dependency to force a def&use operand to be scheduled last (unless
the distance between the def and another use is much longer). For now this is
under option control: -sched-lower-defnuse.

llvm-svn: 28201
2006-05-09 07:13:34 +00:00
Evan Cheng 2c74848af1 Debugging info
llvm-svn: 28200
2006-05-09 06:55:15 +00:00
Chris Lattner 446e1ef26a Make the case I just checked in stronger. Now we compile this:
short test2(short X, short x) {
  int Y = (short)(X+x);
  return Y >> 1;
}

to:

_test2:
        add r2, r3, r4
        extsh r2, r2
        srawi r3, r2, 1
        blr

instead of:

_test2:
        add r2, r3, r4
        extsh r2, r2
        srwi r2, r2, 1
        extsh r3, r2
        blr

llvm-svn: 28175
2006-05-08 21:18:59 +00:00
Chris Lattner 29062da0ac Implement and_sext.ll:test3, generating:
_test4:
        srawi r3, r3, 16
        blr

instead of:

_test4:
        srwi r2, r3, 16
        extsh r3, r2
        blr

for:

short test4(unsigned X) {
  return (X >> 16);
}

llvm-svn: 28174
2006-05-08 20:59:41 +00:00
Chris Lattner 2935d8190c Compile this:
short test4(unsigned X) {
  return (X >> 16);
}

to:

_test4:
        movl 4(%esp), %eax
        sarl $16, %eax
        ret

instead of:

_test4:
        movl $-65536, %eax
        andl 4(%esp), %eax
        sarl $16, %eax
        ret

llvm-svn: 28171
2006-05-08 20:51:54 +00:00
Chris Lattner 78da6792e7 Fold shifts with undef operands.
llvm-svn: 28167
2006-05-08 17:29:49 +00:00
Nate Begeman d7a19102d1 Make emission of jump tables a bit less conservative; they are now required
to be only 31.25% dense, rather than 75% dense.

llvm-svn: 28165
2006-05-08 16:51:36 +00:00
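Density here means the number of cases divided by the size of the table that
spans the smallest to the largest case value. A sketch of that check in C++
(an assumed shape, not the exact SelectionDAGISel code); 31.25% is 5/16, so
the comparison stays in integer arithmetic:

#include <cstdint>

// A switch is lowered to a jump table when its cases are dense enough:
// numCases / tableSize >= 5/16 (31.25%), where tableSize covers the
// range from the smallest to the largest case value.
bool denseEnoughForJumpTable(uint64_t numCases, int64_t minCase,
                             int64_t maxCase) {
  uint64_t tableSize = uint64_t(maxCase - minCase) + 1;
  return numCases * 16 >= tableSize * 5;   // density >= 31.25%
}

int main() {
  // 6 cases spread over values 0..15: 37.5% dense, so a table is used.
  return denseEnoughForJumpTable(6, 0, 15) ? 0 : 1;
}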
Nate Begeman e5ce5bb6da Fix PR772
llvm-svn: 28161
2006-05-08 01:35:01 +00:00
Chris Lattner 7e7bcf3a54 Simplify some code, add a couple minor missed folds
llvm-svn: 28152
2006-05-06 23:06:26 +00:00
Chris Lattner 751817c54f constant fold sign_extend_inreg
llvm-svn: 28151
2006-05-06 23:05:41 +00:00
Chris Lattner 2a4d7b845b remove cases handled elsewhere
llvm-svn: 28150
2006-05-06 22:43:44 +00:00
Chris Lattner 1ecb2a2dac Use the new TargetLowering::ComputeNumSignBits method to eliminate
sign_extend_inreg operations.  Though ComputeNumSignBits is still rudimentary,
this is enough to compile this:

short test(short X, short x) {
  int Y = X+x;
  return (Y >> 1);
}
short test2(short X, short x) {
  int Y = (short)(X+x);
  return Y >> 1;
}

into:

_test:
        add r2, r3, r4
        srawi r3, r2, 1
        blr
_test2:
        add r2, r3, r4
        extsh r2, r2
        srawi r3, r2, 1
        blr

instead of:

_test:
        add r2, r3, r4
        srawi r2, r2, 1
        extsh r3, r2
        blr
_test2:
        add r2, r3, r4
        extsh r2, r2
        srawi r2, r2, 1
        extsh r3, r2
        blr

llvm-svn: 28146
2006-05-06 09:30:03 +00:00
Chris Lattner 21cd99024a When inserting casts, be careful of where we put them. We cannot insert
a cast immediately before a PHI node.

This fixes Regression/CodeGen/Generic/2006-05-06-GEP-Cast-Sink-Crash.ll

llvm-svn: 28143
2006-05-06 09:10:37 +00:00
Chris Lattner 907e392dba Fold trunc(any_ext). This gives stuff like:
27,28c27
<       movzwl %di, %edi
<       movl %edi, %ebx
---
>       movw %di, %bx

llvm-svn: 28137
2006-05-05 22:56:26 +00:00
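The fold is sound because truncating any extension of a narrow value returns
the original bits, whatever the extension happened to put in the upper half.
A small stand-alone C++ check (illustration only):

#include <cassert>
#include <cstdint>

// trunc(any_ext x) -> x: the low bits survive the round trip regardless
// of the ("any") contents of the high bits, so the pair can be dropped.
int main() {
  for (uint32_t upper : {0x0000u, 0xBEEFu}) {        // arbitrary high bits
    uint16_t x = 0x1234;
    uint32_t extended = (upper << 16) | x;           // any_ext of x
    assert(uint16_t(extended) == x);                 // trunc gives x back
  }
  return 0;
}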
Chris Lattner 57f8c5a387 Shrink shifts when possible.
llvm-svn: 28136
2006-05-05 22:53:17 +00:00
Chris Lattner 3d26577396 Fold (fpext (load x)) -> (extload x)
llvm-svn: 28130
2006-05-05 21:34:35 +00:00
Chris Lattner 3e3f2c63c3 More aggressively sink GEP offsets into loops. For example, before we
generated:

        movl 8(%esp), %eax
        movl %eax, %edx
        addl $4316, %edx
        cmpb $1, %cl
        ja LBB1_2       #cond_false
LBB1_1: #cond_true
        movl L_QuantizationTables720$non_lazy_ptr, %ecx
        movl %ecx, (%edx)
        movl L_QNOtoQuantTableShift720$non_lazy_ptr, %edx
        movl %edx, 4460(%eax)
        ret
...

Now we generate:

        movl 8(%esp), %eax
        cmpb $1, %cl
        ja LBB1_2       #cond_false
LBB1_1: #cond_true
        movl L_QuantizationTables720$non_lazy_ptr, %ecx
        movl %ecx, 4316(%eax)
        movl L_QNOtoQuantTableShift720$non_lazy_ptr, %ecx
        movl %ecx, 4460(%eax)
        ret

... which uses one fewer register.

llvm-svn: 28129
2006-05-05 21:17:49 +00:00
Chris Lattner 25a5283a86 Fold some common code.
llvm-svn: 28124
2006-05-05 06:32:04 +00:00
Chris Lattner 002ee91457 Implement:
// fold (and (sext x), (sext y)) -> (sext (and x, y))
// fold (or  (sext x), (sext y)) -> (sext (or  x, y))
// fold (xor (sext x), (sext y)) -> (sext (xor x, y))
// fold (and (aext x), (aext y)) -> (aext (and x, y))
// fold (or  (aext x), (aext y)) -> (aext (or  x, y))
// fold (xor (aext x), (aext y)) -> (aext (xor x, y))

llvm-svn: 28123
2006-05-05 06:31:05 +00:00
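These folds rest on the fact that bitwise operations commute with sign (and
any) extension when both operands are extended from the same narrow type. A
quick C++ check for the i16 -> i32 sext case (illustration only, not the DAG
combiner code):

#include <cassert>
#include <cstdint>

// Checks (sext x) op (sext y) == sext(x op y) for op in {and, or, xor},
// with x and y sign-extended from i16 to i32.
int main() {
  const int16_t vals[] = {0, 1, -1, 0x7FFF, (int16_t)0x8000};
  for (int16_t x : vals)
    for (int16_t y : vals) {
      assert((int32_t(x) & int32_t(y)) == int32_t(int16_t(x & y)));
      assert((int32_t(x) | int32_t(y)) == int32_t(int16_t(x | y)));
      assert((int32_t(x) ^ int32_t(y)) == int32_t(int16_t(x ^ y)));
    }
  return 0;
}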
Chris Lattner 5ac4293606 Pull 'and' through and/or/xor. This compiles some bitfield code to:
        mov EAX, DWORD PTR [ESP + 4]
        mov ECX, DWORD PTR [EAX]
        mov EDX, ECX
        add EDX, EDX
        or EDX, ECX
        and EDX, -2147483648
        and ECX, 2147483647
        or EDX, ECX
        mov DWORD PTR [EAX], EDX
        ret

instead of:

        sub ESP, 4
        mov DWORD PTR [ESP], ESI
        mov EAX, DWORD PTR [ESP + 8]
        mov ECX, DWORD PTR [EAX]
        mov EDX, ECX
        add EDX, EDX
        mov ESI, ECX
        and ESI, -2147483648
        and EDX, -2147483648
        or EDX, ESI
        and ECX, 2147483647
        or EDX, ECX
        mov DWORD PTR [EAX], EDX
        mov ESI, DWORD PTR [ESP]
        add ESP, 4
        ret

llvm-svn: 28122
2006-05-05 06:10:43 +00:00
Chris Lattner 812646aa0c Implement a variety of simplifications for ANY_EXTEND.
llvm-svn: 28121
2006-05-05 05:58:59 +00:00
Chris Lattner 8d6fc20181 Factor some code, add these transformations:
// fold (and (trunc x), (trunc y)) -> (trunc (and x, y))
// fold (or  (trunc x), (trunc y)) -> (trunc (or  x, y))
// fold (xor (trunc x), (trunc y)) -> (trunc (xor x, y))

llvm-svn: 28120
2006-05-05 05:51:50 +00:00
Jeff Cohen 78a7f0e05e Fix VC++ compilation error.
llvm-svn: 28117
2006-05-05 01:47:05 +00:00
Chris Lattner 7a3ecf7993 Sink noop copies into the basic block that uses them. This reduces the number
of cross-block live ranges, and allows the bb-at-a-time selector to always
coalesce these away at isel time.

This reduces the load on the coalescer and register allocator.  For example
on a codec on X86, we went from:

   1643 asm-printer           - Number of machine instrs printed
    419 liveintervals         - Number of loads/stores folded into instructions
   1144 liveintervals         - Number of identity moves eliminated after coalescing
   1022 liveintervals         - Number of interval joins performed
    282 liveintervals         - Number of intervals after coalescing
   1304 liveintervals         - Number of original intervals
     86 regalloc              - Number of times we had to backtrack
1.90232 regalloc              - Ratio of intervals processed over total intervals
     40 spiller               - Number of values reused
    182 spiller               - Number of loads added
    121 spiller               - Number of stores added
    132 spiller               - Number of register spills
      6 twoaddressinstruction - Number of instructions commuted to coalesce
    360 twoaddressinstruction - Number of two-address instructions

to:

   1636 asm-printer           - Number of machine instrs printed
    403 liveintervals         - Number of loads/stores folded into instructions
   1155 liveintervals         - Number of identity moves eliminated after coalescing
   1033 liveintervals         - Number of interval joins performed
    279 liveintervals         - Number of intervals after coalescing
   1312 liveintervals         - Number of original intervals
     76 regalloc              - Number of times we had to backtrack
1.88998 regalloc              - Ratio of intervals processed over total intervals
      1 spiller               - Number of copies elided
     41 spiller               - Number of values reused
    191 spiller               - Number of loads added
    114 spiller               - Number of stores added
    128 spiller               - Number of register spills
      4 twoaddressinstruction - Number of instructions commuted to coalesce
    356 twoaddressinstruction - Number of two-address instructions

On this testcase, this change provides a modest reduction in spill code,
regalloc iterations, and total instructions emitted.  It increases the number
of register coalesces.

llvm-svn: 28115
2006-05-05 01:04:50 +00:00
Evan Cheng 9add880566 Initial support for register pressure aware scheduling. The register reduction
scheduler can go into a "vertical mode" (i.e. traversing up the two-address
chain, etc.) when the register pressure is low.
This does seem to reduce the number of spills in the cases I've looked at, but
on x86 there is no guarantee that the performance of the code improves.
It can be turned on with the -sched-vertically option.

llvm-svn: 28108
2006-05-04 19:16:39 +00:00
Chris Lattner 469647bf38 Remove and simplify some more machineinstr/machineoperand stuff.
llvm-svn: 28105
2006-05-04 18:16:01 +00:00
Chris Lattner 10b71c0d08 Rename MO_VirtualRegister -> MO_Register. Clean up immediate handling.
llvm-svn: 28104
2006-05-04 18:05:43 +00:00
Chris Lattner 940cc978ef Remove a bunch more SparcV9 specific stuff
llvm-svn: 28093
2006-05-04 01:15:02 +00:00
Nate Begeman df4883971e Finish up the initial jump table implementation by allowing jump tables to
not be 100% dense.  Increase the minimum threshold for the number of cases
in a switch statement from 4 to 6 in order to create a jump table.

llvm-svn: 28079
2006-05-03 03:48:02 +00:00
Evan Cheng ffef8b9412 Bottom-up register pressure reduction work: cleaned up some hacks and enhanced
the heuristic to further reduce spills for several test cases. (Note, it may
not necessarily translate to a runtime win!)

llvm-svn: 28076
2006-05-03 02:10:45 +00:00
Owen Anderson 20a631fde7 Refactor TargetMachine, pushing handling of TargetData into the target-specific subclasses. This has one caller-visible change: getTargetData() now returns a pointer instead of a reference.
This fixes PR 759.

llvm-svn: 28074
2006-05-03 01:29:57 +00:00
Evan Cheng 0d084fb9ca Disfavor stores more
llvm-svn: 28035
2006-05-01 09:20:44 +00:00
Evan Cheng 24e795496d Bottom up register-pressure reduction scheduler now pushes store operations
up the schedule. This helps code that looks like this:

loads ...
computations (first set) ...
stores (first set) ...
loads
computations (second set) ...
stores (second set) ...

Without this change, the stores and computations are more likely to
interleave:

loads ...
loads ...
computations (first set) ...
computations (second set) ...
computations (first set) ...
stores (first set) ...
computations (second set) ...
stores (second set) ...

This can increase the number of spills if we are unlucky.

llvm-svn: 28033
2006-05-01 09:14:40 +00:00
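The effect on register pressure can be seen with a toy model (illustration
only; the struct, names, and numbers below are made up, not the scheduler's
heuristic): a value is live from its definition to its last use, and grouping
each load/compute/store set keeps fewer values live at once than interleaving
the sets.

#include <algorithm>
#include <map>
#include <string>
#include <vector>

// Each op optionally defines one value and reads some earlier values.
struct Op { std::string def; std::vector<std::string> uses; };

// Peak number of simultaneously live values for a given schedule.
size_t maxLiveValues(const std::vector<Op> &sched) {
  std::map<std::string, size_t> lastUse;          // value -> last use index
  for (size_t i = 0; i < sched.size(); ++i)
    for (const std::string &u : sched[i].uses)
      lastUse[u] = i;

  size_t live = 0, maxLive = 0;
  for (size_t i = 0; i < sched.size(); ++i) {
    if (!sched[i].def.empty())
      ++live;                                     // value defined here
    maxLive = std::max(maxLive, live);
    for (const std::string &u : sched[i].uses)
      if (lastUse[u] == i)
        --live;                                   // last use: value dies
  }
  return maxLive;
}

int main() {
  // Two independent load/compute/store groups; the stores define nothing.
  std::vector<Op> grouped = {
      {"l1a", {}}, {"l1b", {}}, {"c1", {"l1a", "l1b"}}, {"", {"c1"}},
      {"l2a", {}}, {"l2b", {}}, {"c2", {"l2a", "l2b"}}, {"", {"c2"}}};
  std::vector<Op> interleaved = {
      {"l1a", {}}, {"l1b", {}}, {"l2a", {}}, {"l2b", {}},
      {"c1", {"l1a", "l1b"}}, {"c2", {"l2a", "l2b"}},
      {"", {"c1"}}, {"", {"c2"}}};
  // Grouped peaks at 3 live values, interleaved at 5.
  return maxLiveValues(grouped) < maxLiveValues(interleaved) ? 0 : 1;
}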
Evan Cheng 10ff7b27ce Didn't mean for ScheduleDAGList.cpp to be part of the last checkin.
llvm-svn: 28030
2006-05-01 08:56:34 +00:00
Evan Cheng a656242690 Remove the temporary option -spiller-check-liveout; it didn't cause any failures or performance regressions.
llvm-svn: 28029
2006-05-01 08:54:57 +00:00
Chris Lattner 2b48a94413 Remove a bogus transformation. This fixes SingleSource/UnitTests/2006-01-23-InitializedBitField.c
with some changes I have to the new CFE.

llvm-svn: 28022
2006-04-28 23:33:20 +00:00
Evan Cheng c5e8ce8b8c Remove the temporary option: -no-isel-fold-inflight
llvm-svn: 28012
2006-04-28 18:54:11 +00:00
Evan Cheng d43c5c6046 TargetLowering::LowerArguments should return a VBIT_CONVERT of
FORMAL_ARGUMENTS SDOperand in the return result vector.

llvm-svn: 28009
2006-04-28 05:25:15 +00:00
Evan Cheng 51ab4498e7 Added a temporary option -no-isel-fold-inflight to control whether an "inflight"
node can be folded.

llvm-svn: 28003
2006-04-28 02:09:19 +00:00
Evan Cheng 3784f3c57c Insert a VBIT_CONVERT between a FORMAL_ARGUMENT node and its vector uses
(VAND, VADD, etc.). Legalizer will assert otherwise.

llvm-svn: 27991
2006-04-27 08:29:42 +00:00
Chris Lattner 393d96a56c Fix Regression/CodeGen/Generic/2006-04-26-SetCCAnd.ll and
PR748.

llvm-svn: 27987
2006-04-27 05:01:07 +00:00
Evan Cheng 9618df1190 Don't forget return void.
llvm-svn: 27974
2006-04-25 23:03:35 +00:00
Nate Begeman 866b4b4d45 Fix the updating of the machine CFG when a PHI node was in a successor of
the jump table's range check block.  This re-enables 100% dense jump tables
by default on PPC & x86

llvm-svn: 27952
2006-04-23 06:26:20 +00:00
Nate Begeman ecb1dafd3d Turn off jump tables for a bit; there are still some issues to work out with
updating the machine CFG.

llvm-svn: 27949
2006-04-22 23:51:56 +00:00
Nate Begeman 4ca2ea5b43 JumpTable support! What this represents is working asm and jit support for
x86 and ppc for 100% dense switch statements when relocations are non-PIC.
This support will be extended and enhanced in the coming days to support
PIC, and less dense forms of jump tables.

llvm-svn: 27947
2006-04-22 18:53:45 +00:00
Chris Lattner b21d3bfd1f The BFS scheduler is apparently nondeterministic (it causes many llvmgcc bootstrap
miscompares).  Switch RISC targets to use the list-td scheduler, which isn't.

llvm-svn: 27933
2006-04-21 17:16:16 +00:00
Chris Lattner 662e940f73 Fix a couple more memory issues
llvm-svn: 27930
2006-04-21 15:32:26 +00:00
Chris Lattner cc47ab3305 Fix a really subtle and obnoxious memory bug that caused issues with an
llvm-gcc4 bootstrap.  Whenever a node is deleted by the dag combiner, it
*must* be returned by the visit function, or the dag combiner will not
know that the node has been processed (and will, e.g., send it to the
target dag combine xforms).

llvm-svn: 27922
2006-04-20 23:55:59 +00:00
Evan Cheng a320abc494 Turn a VAND into a VECTOR_SHUFFLE if applicable.
The DAG combiner can turn a VAND V, <-1, 0, -1, -1> (i.e. clearing vector elements)
into a vector shuffle with a zero vector. It only does so when TLI tells it
the xform is profitable.

llvm-svn: 27874
2006-04-20 08:56:16 +00:00
Chris Lattner bc1b262725 Implement folding of a bunch of binops with undef
llvm-svn: 27863
2006-04-20 05:39:12 +00:00
Chris Lattner 73eb58e1a2 Simplify some code
llvm-svn: 27846
2006-04-19 23:17:50 +00:00
Chris Lattner 916ae0775e Fix handling of calls in functions that use vectors. This fixes a crash on
the code in GCC PR26546.

llvm-svn: 27780
2006-04-17 22:10:08 +00:00
Chris Lattner 326870b40b Codegen insertelement with constant insertion points as scalar_to_vector
and a shuffle.  For this:

void %test2(<4 x float>* %F, float %f) {
        %tmp = load <4 x float>* %F             ; <<4 x float>> [#uses=2]
        %tmp3 = add <4 x float> %tmp, %tmp              ; <<4 x float>> [#uses=1]
        %tmp2 = insertelement <4 x float> %tmp3, float %f, uint 2               ; <<4 x float>> [#uses=2]
        %tmp6 = add <4 x float> %tmp2, %tmp2            ; <<4 x float>> [#uses=1]
        store <4 x float> %tmp6, <4 x float>* %F
        ret void
}

we now get this on X86 (which will get better):

_test2:
        movl 4(%esp), %eax
        movaps (%eax), %xmm0
        addps %xmm0, %xmm0
        movaps %xmm0, %xmm1
        shufps $3, %xmm1, %xmm1
        movaps %xmm0, %xmm2
        shufps $1, %xmm2, %xmm2
        unpcklps %xmm1, %xmm2
        movss 8(%esp), %xmm1
        unpcklps %xmm1, %xmm0
        unpcklps %xmm2, %xmm0
        addps %xmm0, %xmm0
        movaps %xmm0, (%eax)
        ret

instead of:

_test2:
        subl $28, %esp
        movl 32(%esp), %eax
        movaps (%eax), %xmm0
        addps %xmm0, %xmm0
        movaps %xmm0, (%esp)
        movss 36(%esp), %xmm0
        movss %xmm0, 8(%esp)
        movaps (%esp), %xmm0
        addps %xmm0, %xmm0
        movaps %xmm0, (%eax)
        addl $28, %esp
        ret

llvm-svn: 27765
2006-04-17 19:21:01 +00:00
Chris Lattner 91226e5799 Add support for promoting stores from one legal type to another, allowing us
to write one pattern for vector stores instead of 4.

llvm-svn: 27730
2006-04-16 01:36:45 +00:00
Chris Lattner 7e7ad593cc Make these predicates return true for bit_convert(buildvector)'s as well as
buildvectors.

llvm-svn: 27723
2006-04-15 23:38:00 +00:00
Chris Lattner 086e986e94 Make this assertion better
llvm-svn: 27695
2006-04-14 06:08:35 +00:00
Evan Cheng 119266ea92 Promote vector AND, OR, and XOR
llvm-svn: 27632
2006-04-12 21:20:24 +00:00
Evan Cheng be8a8933e6 Vector type promotion for ISD::LOAD and ISD::SELECT
llvm-svn: 27606
2006-04-12 16:33:18 +00:00
Chris Lattner d3b504ae10 Implement support for the formal_arguments node. To get this, targets should custom legalize it and remove their XXXTargetLowering::LowerArguments overload.
llvm-svn: 27604
2006-04-12 16:20:43 +00:00
Chris Lattner 417b96b6dd Don't memoize vloads in the load map! Don't memoize them anywhere here; let
getNode do it.  This fixes CodeGen/Generic/2006-04-11-vecload.ll

llvm-svn: 27602
2006-04-12 03:25:41 +00:00
Evan Cheng 7256b0ae05 Only get Tmp2 for cases where the number of operands is > 1. Fixed return void.
llvm-svn: 27586
2006-04-11 06:33:39 +00:00
Chris Lattner 6cf3bbbe17 add some todos
llvm-svn: 27580
2006-04-11 02:00:08 +00:00
Chris Lattner 2eb22eef7d Add basic support for legalizing returns of vectors
llvm-svn: 27578
2006-04-11 01:31:51 +00:00
Evan Cheng cb73b8d419 Missing break
llvm-svn: 27559
2006-04-10 18:54:36 +00:00
Chris Lattner 02274a5265 Add code generator support for VSELECT
llvm-svn: 27542
2006-04-08 22:22:57 +00:00
Chris Lattner e1401e3610 Canonicalize vvector_shuffle(x,x) -> vvector_shuffle(x,undef) to enable patterns
to match again :)

llvm-svn: 27533
2006-04-08 05:34:25 +00:00
Chris Lattner 098c01e94e Codegen shufflevector as VVECTOR_SHUFFLE
llvm-svn: 27529
2006-04-08 04:15:24 +00:00
Chris Lattner 101ea66813 add a sanity check: LegalizeOp should return a value that is the same type
as its input.

llvm-svn: 27528
2006-04-08 04:13:17 +00:00
Evan Cheng 78e3d565af INSERT_VECTOR_ELT lowering bug:
  store vector to $esp
  store element to $esp + sizeof(VT) * index
  load  vector from $esp
The bug is that VT is the type of the vector element, not the type of the vector!

llvm-svn: 27517
2006-04-08 01:46:37 +00:00
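The correct stack-slot sequence scales the element store's offset by the
element size. A C-level picture of that lowering (an illustration; the
insertElement template is a made-up name, not the legalizer code):

#include <cstddef>
#include <cstring>

// Spill the vector to a stack temporary, overwrite one element at
// offset index * sizeof(element), then reload the whole vector.
// Scaling the offset by the wrong type's size writes to the wrong
// address, which is the kind of mix-up described in the commit above.
template <typename Elt, std::size_t N>
void insertElement(Elt (&vec)[N], Elt val, std::size_t index) {
  Elt slot[N];                                   // stack slot ("$esp")
  std::memcpy(slot, vec, sizeof(slot));          // store vector
  std::memcpy(reinterpret_cast<char *>(slot) + index * sizeof(Elt),
              &val, sizeof(Elt));                // store element
  std::memcpy(vec, slot, sizeof(slot));          // load vector back
}

int main() {
  float v[4] = {1, 2, 3, 4};
  insertElement(v, 9.0f, 2);                     // v is now {1, 2, 9, 4}
  return v[2] == 9.0f ? 0 : 1;
}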
Chris Lattner aa3185f12e Stub out shufflevector
llvm-svn: 27514
2006-04-08 01:19:25 +00:00
Evan Cheng 613996c55e 1. If both vector operands of a vector_shuffle are undef, turn it into an undef.
2. A shuffle mask element can also be an undef.

llvm-svn: 27472
2006-04-06 23:20:43 +00:00
Chris Lattner 4a2413a590 Make a vector live across blocks have the correct Vec type. This fixes
CodeGen/X86/2006-04-04-CrossBlockCrash.ll

llvm-svn: 27436
2006-04-05 06:54:42 +00:00
Evan Cheng 9fa8959dce Expand a VECTOR_SHUFFLE to a BUILD_VECTOR if the target asks for it to be expanded
or custom lowering fails.

llvm-svn: 27432
2006-04-05 06:07:11 +00:00
Chris Lattner 4ea52cac01 Do not create ZEXTLOAD's unless we are before legalize or the operation is
legal.

llvm-svn: 27402
2006-04-04 17:39:18 +00:00
Chris Lattner 6be79823e7 * Add support for SCALAR_TO_VECTOR operations where the input needs to be
promoted/expanded (e.g. SCALAR_TO_VECTOR from i8/i16 on PPC).
* Add support for targets to request that VECTOR_SHUFFLE nodes be promoted
  to a canonical type, for example, we only want v16i8 shuffles on PPC.
* Move isShuffleLegal out of TLI into Legalize.
* Teach isShuffleLegal to allow shuffles that need to be promoted.

llvm-svn: 27399
2006-04-04 17:23:26 +00:00
Chris Lattner a9e77d14c7 Constant fold bitconvert(undef)
llvm-svn: 27391
2006-04-04 01:02:22 +00:00
Chris Lattner e1e3adf802 Add a missing check, this fixes UnitTests/Vector/sumarray.c
llvm-svn: 27375
2006-04-03 17:29:28 +00:00
Chris Lattner 04c00fc844 Add a missing check, which broke a bunch of vector tests.
llvm-svn: 27374
2006-04-03 17:21:50 +00:00
Andrew Lenharth 94f012f606 back this out
llvm-svn: 27367
2006-04-03 03:16:50 +00:00
Andrew Lenharth 015eaf5f33 This should be a win on every arch
llvm-svn: 27364
2006-04-02 21:42:45 +00:00
Chris Lattner 4993249a04 Add a little dag combine to compile this:
int %AreSecondAndThirdElementsBothNegative(<4 x float>* %in) {
entry:
        %tmp1 = load <4 x float>* %in           ; <<4 x float>> [#uses=1]
        %tmp = tail call int %llvm.ppc.altivec.vcmpgefp.p( int 1, <4 x float> < float 0x7FF8000000000000, float 0.000000e+00, float 0.000000e+00, float 0x7FF8000000000000 >, <4 x float> %tmp1 )           ; <int> [#uses=1]
        %tmp = seteq int %tmp, 0                ; <bool> [#uses=1]
        %tmp3 = cast bool %tmp to int           ; <int> [#uses=1]
        ret int %tmp3
}

into this:

_AreSecondAndThirdElementsBothNegative:
        mfspr r2, 256
        oris r4, r2, 49152
        mtspr 256, r4
        li r4, lo16(LCPI1_0)
        lis r5, ha16(LCPI1_0)
        lvx v0, 0, r3
        lvx v1, r5, r4
        vcmpgefp. v0, v1, v0
        mfcr r3, 2
        rlwinm r3, r3, 27, 31, 31
        mtspr 256, r2
        blr

instead of this:

_AreSecondAndThirdElementsBothNegative:
        mfspr r2, 256
        oris r4, r2, 49152
        mtspr 256, r4
        li r4, lo16(LCPI1_0)
        lis r5, ha16(LCPI1_0)
        lvx v0, 0, r3
        lvx v1, r5, r4
        vcmpgefp. v0, v1, v0
        mfcr r3, 2
        rlwinm r3, r3, 27, 31, 31
        xori r3, r3, 1
        cntlzw r3, r3
        srwi r3, r3, 5
        mtspr 256, r2
        blr

llvm-svn: 27356
2006-04-02 06:11:11 +00:00
Chris Lattner 42a5fca47e Implement promotion for EXTRACT_VECTOR_ELT, allowing v16i8 multiplies to work with PowerPC.
llvm-svn: 27349
2006-04-02 05:06:04 +00:00
Chris Lattner 87f080949b Implement the Expand action for binary vector operations to break the binop
into elements and operate on each piece.  This allows generic vector integer
multiplies to work on PPC, though the generated code is horrible.

llvm-svn: 27347
2006-04-02 03:57:31 +00:00
Chris Lattner a9c59156be Intrinsics that just load from memory can be treated like loads: they don't
have to serialize against each other.  This allows us to schedule lvx's
across each other, for example.

llvm-svn: 27346
2006-04-02 03:41:14 +00:00
Chris Lattner 0442a18758 Constant fold all of the vector binops. This allows us to compile this:
"vector unsigned char mergeLowHigh = (vector unsigned char)
( 8, 9, 10, 11, 16, 17, 18, 19, 12, 13, 14, 15, 20, 21, 22, 23 );
vector unsigned char mergeHighLow = vec_xor( mergeLowHigh, vec_splat_u8(8));"

aka:

void %test2(<16 x sbyte>* %P) {
  store <16 x sbyte> cast (<4 x int> xor (<4 x int> cast (<16 x ubyte> < ubyte 8, ubyte 9, ubyte 10, ubyte 11, ubyte 16, ubyte 17, ubyte 18, ubyte 19, ubyte 12, ubyte 13, ubyte 14, ubyte 15, ubyte 20, ubyte 21, ubyte 22, ubyte 23 > to <4 x int>), <4 x int> cast (<16 x sbyte> < sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8 > to <4 x int>)) to <16 x sbyte>), <16 x sbyte> * %P
  ret void
}

into this:

_test2:
        mfspr r2, 256
        oris r4, r2, 32768
        mtspr 256, r4
        li r4, lo16(LCPI2_0)
        lis r5, ha16(LCPI2_0)
        lvx v0, r5, r4
        stvx v0, 0, r3
        mtspr 256, r2
        blr

instead of this:

_test2:
        mfspr r2, 256
        oris r4, r2, 49152
        mtspr 256, r4
        li r4, lo16(LCPI2_0)
        lis r5, ha16(LCPI2_0)
        vspltisb v0, 8
        lvx v1, r5, r4
        vxor v0, v1, v0
        stvx v0, 0, r3
        mtspr 256, r2
        blr

... which occurs here:
http://developer.apple.com/hardware/ve/calcspeed.html

llvm-svn: 27343
2006-04-02 03:25:57 +00:00
Chris Lattner ef598059f2 Add a new -view-legalize-dags command line option
llvm-svn: 27342
2006-04-02 03:07:27 +00:00