llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	c6ae56d23f	enhance FindAvailableLoadedValue to make use of AliasAnalysis if it has it. llvm-svn: 60167	2008-11-27 08:18:12 +00:00
Chris Lattner	72f16e70f0	move FindAvailableLoadedValue from JumpThreading to Transforms/Utils. llvm-svn: 60166	2008-11-27 08:10:05 +00:00
Chris Lattner	d6204bed3d	simplify this code a bit. llvm-svn: 60164	2008-11-27 07:54:38 +00:00
Chris Lattner	206250284d	Use the new MergeBasicBlockIntoOnlyPred function. llvm-svn: 60163	2008-11-27 07:54:12 +00:00
Chris Lattner	99d6809ac1	move MergeBasicBlockIntoOnlyPred to Transforms/Utils. llvm-svn: 60162	2008-11-27 07:43:12 +00:00
Chris Lattner	240051aace	rename ThreadBlock to ProcessBlock, since it does other things than just simple threading. llvm-svn: 60157	2008-11-27 07:20:04 +00:00
Chris Lattner	98d89d1b1b	Make jump threading substantially more powerful, in the following ways: 1. Make it fold blocks separated by an unconditional branch. This enables jump threading to see a broader scope. 2. Make jump threading able to eliminate locally redundant loads when they feed the branch condition of a block. This frequently occurs due to reg2mem running. 3. Make jump threading able to eliminate partially redundant loads when they feed the branch condition of a block. This is common in code with lots of loads and stores like C++ code and 255.vortex. This implements thread-loads.ll and rdar://6402033. Per the fixme's, several pieces of this should be moved into Transforms/Utils. llvm-svn: 60148	2008-11-27 05:07:53 +00:00
Chris Lattner	397a11ccd8	Turn on my codegen prepare heuristic by default. It doesn't affect performance in most cases on the Grawp tester, but does speed some things up (like shootout/hash by 15%). This also doesn't impact compile time in a noticeable way on the Grawp tester. It also, of course, gets the testcase it was designed for right :) llvm-svn: 60120	2008-11-26 22:16:44 +00:00
Chris Lattner	fef04acc50	teach the new heuristic how to handle inline asm. llvm-svn: 60088	2008-11-26 04:59:11 +00:00
Chris Lattner	6d71b7fb95	Improve ValueAlreadyLiveAtInst with a cheap and dirty, but effective heuristic: the value is already live at the new memory operation if it is used by some other instruction in the memop's block. This is cheap and simple to compute (more so than full liveness). This improves the new heuristic even more. For example, it cuts two out of three new instructions out of 255.vortex:DbmFileInGrpHdr, which is one of the functions that the heuristic regressed. This overall eliminates another 40 instructions from 403.gcc and visibly reduces register pressure in 255.vortex (though this only actually ends up saving the 2 instructions from the whole program). llvm-svn: 60084	2008-11-26 03:20:37 +00:00
Chris Lattner	e34fe2c52d	Start reworking a subpiece of the profitability heuristic to be phrased in terms of liveness instead of as a horrible hack. :) In practice, this doesn't change the generated code for either 255.vortex or 403.gcc, but it could cause minor code changes in theory. This is framework for coming changes. llvm-svn: 60082	2008-11-26 03:02:41 +00:00
Chris Lattner	383a797f42	add a comment, make save/restore logic more obvious. llvm-svn: 60076	2008-11-26 02:11:11 +00:00
Chris Lattner	eb3e4fb6fb	This adds in some code (currently disabled unless you pass -enable-smarter-addr-folding to llc) that gives CGP a better cost model for when to sink computations into addressing modes. The basic observation is that sinking increases register pressure when part of the addr computation has to be available for other reasons, such as having a use that is a non-memory operation. In cases where it works, it can substantially reduce register pressure. This code is currently an overall win on 403.gcc and 255.vortex (the two things I've been looking at), but there are several things I want to do before enabling it by default: 1. This isn't doing any caching of results, so it is much slower than it could be. It currently slows down release-asserts llc by 1.7% on 176.gcc: 27.12s -> 27.60s. 2. This doesn't think about inline asm memory operands yet. 3. The cost model botches the case when the needed value is live across the computation for other reasons. I'll continue poking at this, and eventually turn it on as llcbeta. llvm-svn: 60074	2008-11-26 02:00:14 +00:00
Evan Cheng	496b042e20	Revert r60042. IndVarSimplify should check if APFloat is PPCDoubleDouble first before trying to convert it to an integer. llvm-svn: 60072	2008-11-26 01:11:57 +00:00
Chris Lattner	a9ab165b08	Teach CodeGenPrepare to look through Bitcast instructions when attempting to optimize addressing modes. This allows us to optimize things like isel-sink2.ll into: movl 4(%esp), %eax cmpb $0, 4(%eax) jne LBB1_2 ## F LBB1_1: ## TB movl $4, %eax ret LBB1_2: ## F movzbl 7(%eax), %eax ret instead of: _test: movl 4(%esp), %eax cmpb $0, 4(%eax) leal 4(%eax), %eax jne LBB1_2 ## F LBB1_1: ## TB movl $4, %eax ret LBB1_2: ## F movzbl 3(%eax), %eax ret This shrinks (e.g.) 403.gcc from 1133510 to 1128345 lines of .s. Note that the 2008-10-16-SpillerBug.ll testcase is dubious at best, I doubt it is really testing what it thinks it is. llvm-svn: 60068	2008-11-26 00:26:16 +00:00
Chris Lattner	f3e95505c5	Teach MatchScaledValue to handle Scales by 1 with MatchAddr (which can recursively match things) and scales by 0 by ignoring them. This triggers once in 403.gcc, saving 1 (!!!!) instruction in the whole huge app. llvm-svn: 60013	2008-11-25 07:25:26 +00:00
Chris Lattner	728f90220a	significantly refactor all the addressing mode matching logic into a new AddressingModeMatcher class. This makes it easier to reason about and reduces passing around of stuff, but has no functionality change. llvm-svn: 60012	2008-11-25 07:09:13 +00:00
Chris Lattner	58f49d2916	refactor all the constantexpr/instruction handling code out into a new FindMaximalLegalAddressingModeForOperation helper method. llvm-svn: 60011	2008-11-25 05:15:49 +00:00
Chris Lattner	a3fbff15b9	another minor tweak llvm-svn: 60010	2008-11-25 04:47:41 +00:00
Chris Lattner	d616ef5683	minor cleanups no functionality change. llvm-svn: 60009	2008-11-25 04:42:10 +00:00
Chris Lattner	6416a6b7a0	rearrange and tidy some code, no functionality change. llvm-svn: 59990	2008-11-24 22:44:16 +00:00
Chris Lattner	d917c8c8fe	minor cleanups to debug code, no functionality change. llvm-svn: 59989	2008-11-24 22:40:05 +00:00
Chris Lattner	d78894197a	reenable the right part of the code. llvm-svn: 59985	2008-11-24 21:26:21 +00:00
Chris Lattner	992a541002	revert an accidental commit, this fixes the regression on test/CodeGen/X86/isel-sink.ll llvm-svn: 59976	2008-11-24 19:40:34 +00:00
Chris Lattner	53d6a07869	Fix 3113: If we have a dead cyclic PHI, replace the whole thing with an undef. llvm-svn: 59972	2008-11-24 19:25:36 +00:00
Devang Patel	702f45df58	Fix build failure. llvm-svn: 59844	2008-11-21 21:00:20 +00:00
Devang Patel	cb181bb203	Silence unused variable warnings. llvm-svn: 59841	2008-11-21 20:00:59 +00:00
Chris Lattner	dd7083452f	reapply Sanjiv's patch to genericize memcpy/memset/memmove to take an arbitrary integer width for the count. llvm-svn: 59823	2008-11-21 16:42:48 +00:00
Bill Wendling	4bce2bff88	Revert r59802. It was breaking the build of llvm-gcc: g++ -m32 -c -g -DIN_GCC -W -Wall -Wwrite-strings -Wmissing-format-attribute -fno-common -mdynamic-no-pic -DHAVE_CONFIG_H -Wno-unused -DTARGET_NAME=\"i386-apple-darwin9.5.0\" -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include -DENABLE_LLVM -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/../llvm.src/include -D_DEBUG -D_GNU_SOURCE -D__STDC_LIMIT_MACROS -D__STDC_CONSTANT_MACROS -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include ../../llvm-gcc.src/gcc/llvm-types.cpp -o llvm-types.o ../../llvm-gcc.src/gcc/llvm-convert.cpp: In member function 'void TreeToLLVM::EmitMemCpy(llvm::Value, llvm::Value, llvm::Value, unsigned int)': ../../llvm-gcc.src/gcc/llvm-convert.cpp:1496: error: 'memcpy_i32' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp:1496: error: 'memcpy_i64' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp: In member function 'void TreeToLLVM::EmitMemMove(llvm::Value, llvm::Value, llvm::Value, unsigned int)': ../../llvm-gcc.src/gcc/llvm-convert.cpp:1512: error: 'memmove_i32' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp:1512: error: 'memmove_i64' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp: In member function 'void TreeToLLVM::EmitMemSet(llvm::Value, llvm::Value, llvm::Value, unsigned int)': ../../llvm-gcc.src/gcc/llvm-convert.cpp:1528: error: 'memset_i32' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp:1528: error: 'memset_i64' is not a member of 'llvm::Intrinsic' make[3]: [llvm-convert.o] Error 1 make[3]: * Waiting for unfinished jobs.... rm fsf-funding.pod gcov.pod gfdl.pod cpp.pod gpl.pod gcc.pod make[2]: * [all-stage1-gcc] Error 2 make[1]: * [stage1-bubble] Error 2 make: *** [all] Error 2 llvm-svn: 59809	2008-11-21 09:09:41 +00:00
Sanjiv Gupta	09a203765a	Make mem[cpy,move,set] intrinsics overloaded. llvm-svn: 59802	2008-11-21 07:49:09 +00:00
Nick Lewycky	07d726ec4d	Optimize (x/y)*y into x-(x%y) in general. Div and rem are about the same, and a subtract is cheaper than a multiply. This generalizes an existing transform. llvm-svn: 59800	2008-11-21 07:33:58 +00:00
Devang Patel	45f1ae028e	Fix unused variable warnings. llvm-svn: 59778	2008-11-21 01:52:59 +00:00
Bill Wendling	f5260d29c2	Fix error where it wasn't getting the correct caller function. llvm-svn: 59758	2008-11-21 00:09:21 +00:00
Bill Wendling	26c6a3e736	If the function being inlined has a higher stack protection level than the inlining function, then increase the stack protection level on the inlining function. llvm-svn: 59757	2008-11-21 00:06:32 +00:00
Devang Patel	38642e598e	Don't forget arguments! llvm-svn: 59745	2008-11-20 19:50:17 +00:00
Devang Patel	c8b2fe1eed	Do not forget llvm.dbg.declare's first argument while removing debugging information. llvm-svn: 59688	2008-11-20 01:20:42 +00:00
Oscar Fuentes	4fb443f81b	CMake: Removed source file. llvm-svn: 59662	2008-11-19 19:32:19 +00:00
Devang Patel	79303b2572	Do not use separate utility to walk all instructions and remove dead dbg intrinsics. Let instcombiner do this job. llvm-svn: 59659	2008-11-19 19:01:37 +00:00
Devang Patel	827bced2b1	Let instcombiner remove redundant dbg intrinsics. llvm-svn: 59658	2008-11-19 18:59:41 +00:00
Devang Patel	7ed6c5317c	If there are two consecutive llvm.dbg.stoppoint calls then it is likely that the optimizer deleted code in between these two intrinsics. Keep only the last llvm.dbg.stoppoint in this case. llvm-svn: 59657	2008-11-19 18:56:50 +00:00
Devang Patel	25662f3e4a	Remove unused variables. llvm-svn: 59570	2008-11-19 00:22:02 +00:00
Devang Patel	ebd2363339	Fix typo. llvm-svn: 59569	2008-11-19 00:19:18 +00:00
Devang Patel	b5e867acff	Add new helper pass that strips all symbol names except debugging information. This pass makes it easier to test whether debugging info influences optimization passes or not. llvm-svn: 59552	2008-11-18 21:34:39 +00:00
Devang Patel	3b7a2be88e	Remove even more llvm.dbg variables. Remove all dead globals from llvm.metadata. Ignore linkonce linkage for selected llvm.dbg values. llvm-svn: 59547	2008-11-18 21:13:41 +00:00
Devang Patel	a13f1f38fa	Initialize MallocFunc and FreeFunc properly. llvm-svn: 59538	2008-11-18 18:43:07 +00:00
Bill Wendling	cf194e9a27	Cast to remove warning about comparing signed and unsigned. llvm-svn: 59518	2008-11-18 10:57:27 +00:00
Devang Patel	f1e9329209	Give SIToFPInst preference over UIToFPInst because it is faster on platforms that are widely used. llvm-svn: 59476	2008-11-18 00:40:02 +00:00
Devang Patel	180afd2c55	While handling floating point IVs lift restrictions on initial value and increment value. llvm-svn: 59471	2008-11-17 23:27:13 +00:00
Devang Patel	aa3d68d301	Handle floating point ivs during doInitialization(). llvm-svn: 59466	2008-11-17 21:32:02 +00:00
Devang Patel	b63c74730c	Let AnalyzeAlloca() remove debug intrinsics. llvm-svn: 59454	2008-11-17 18:37:53 +00:00
Torok Edwin	026259faeb	If SI->size() is 0, we are not allowed to dereference ->begin(). This fixed PR3078. llvm-svn: 59416	2008-11-16 17:21:25 +00:00
Chris Lattner	7917b43a28	eliminate some std::set's. llvm-svn: 59409	2008-11-16 07:17:51 +00:00
Chris Lattner	f8f6270f14	simplify loop llvm-svn: 59406	2008-11-16 06:35:18 +00:00
Chris Lattner	44152742a0	simplify a bunch more instcombines to use m_Specific etc. llvm-svn: 59403	2008-11-16 05:38:51 +00:00
Chris Lattner	d397fef50d	factor the code for simplifying (icmp)|(icmp) into its own function. llvm-svn: 59402	2008-11-16 05:20:07 +00:00
Chris Lattner	909b969b18	do some computation with apints instead of ConstantInts. llvm-svn: 59401	2008-11-16 05:14:43 +00:00
Chris Lattner	feaea9bdf7	merge a check into a place where it is simpler. llvm-svn: 59400	2008-11-16 05:10:52 +00:00
Chris Lattner	269cbd5770	factor a whole bunch of code out into a helper function. llvm-svn: 59398	2008-11-16 05:06:21 +00:00
Chris Lattner	b37b6e7e96	simplify the conditions on two gigantic if's, decreasing indentation a bit. Next step is to factor out into their own helper functions. llvm-svn: 59397	2008-11-16 04:55:20 +00:00
Chris Lattner	f1be285134	simplify some instcombine matches by using m_Specific llvm-svn: 59395	2008-11-16 04:46:19 +00:00
Chris Lattner	fae5e33111	Use new m_SelectCst template to eliminate macros. llvm-svn: 59392	2008-11-16 04:33:38 +00:00
Chris Lattner	569d78cbb5	simplify code. llvm-svn: 59390	2008-11-16 04:26:55 +00:00
Chris Lattner	c3f3b059d0	Handle the case where there is no "not". It is possible it got folded into the select. llvm-svn: 59389	2008-11-16 04:25:26 +00:00
Chris Lattner	5f6d9a313b	factor a bunch of copy/paste code out into a helper function. Eliminate the cases checking for cond?0:-1, since that is already handled by commutative checking. llvm-svn: 59388	2008-11-16 04:24:12 +00:00
Chris Lattner	68d2da2a19	rearrange some code, no functionality change. llvm-svn: 59381	2008-11-16 03:56:24 +00:00
Chris Lattner	e02c7c7ad2	if we're going to use a macro, use it maximally. no functionality change. llvm-svn: 59380	2008-11-16 03:54:57 +00:00
Devang Patel	8ada1d5de5	Refactor code. Strip debug information before stripping symbol names. llvm-svn: 59328	2008-11-14 22:49:37 +00:00
Devang Patel	3dd51c5c62	Really remove all debug information. llvm-svn: 59208	2008-11-13 01:28:40 +00:00
Oscar Fuentes	1b504d5372	CMake: Remove removed source file. llvm-svn: 59098	2008-11-12 00:14:12 +00:00
Devang Patel	4f02a0b740	Remove llvm-svn: 59093	2008-11-11 23:58:15 +00:00
Devang Patel	bf0835706c	Undo previous check-in. llvm-svn: 59092	2008-11-11 23:57:33 +00:00
Oscar Fuentes	2353ef3e91	CMake: Updated list of source files for lib/Transforms/Utils. llvm-svn: 59077	2008-11-11 19:51:36 +00:00
Devang Patel	6096f26bd4	Add utility pass to remove dbg info. llvm-svn: 59068	2008-11-11 19:33:39 +00:00
Devang Patel	95b18126ee	Use actual function name in comments. llvm-svn: 59063	2008-11-11 19:16:41 +00:00
Cedric Venet	8cb2e28e43	Update CMakeLists.txt llvm-svn: 59039	2008-11-11 09:55:48 +00:00
Devang Patel	53b39b5467	Cleanup debug info associated with deleted instructions. llvm-svn: 59012	2008-11-11 00:54:10 +00:00
Devang Patel	dc6699e82f	Add utility routines to remove dead debug info. llvm-svn: 59011	2008-11-11 00:53:02 +00:00
Devang Patel	d0ce981372	If the signs of the exit condition and split condition do not match, then do not split the loop index. llvm-svn: 58995	2008-11-10 19:48:34 +00:00
Bill Wendling	7ef7314d1a	Third time's a charm. The previous patches didn't match correctly. Also, we need to make sure that the conditional is the same before doing the transformation. llvm-svn: 58978	2008-11-10 06:59:06 +00:00
Mon P Wang	25f0106fd9	Added support for the following definition of shufflevector <result> = shufflevector <n x <ty>> <v1>, <n x <ty>> <v2>, <m x i32> <mask> llvm-svn: 58964	2008-11-10 04:46:22 +00:00
Bill Wendling	4fb13c051d	Correction for the last patch. Should match the conditional in the first part of the select match, not the select instruction itself. llvm-svn: 58947	2008-11-09 23:37:53 +00:00
Bill Wendling	1579287550	The method of doing the matching with a 'select' instruction was wrong. The original code was matching like this: if (match(A, m_Not(m_Value(B)))) B was already matched as a 'select' instruction. However, this isn't matching what we think it's matching. It would match B as a 'Value', so basically anything would match to it. In this case, a Constant matched. B was replaced with a constant representation. And then the wrong value would be used in the SelectInst::Create statement, causing a crash. After thinking on this for a moment, and after Nick L. told me how the pattern matching stuff was supposed to work, the solution was to match NOT an m_Value, but an m_Select. llvm-svn: 58946	2008-11-09 23:17:42 +00:00
Nuno Lopes	2e42927e7c	fix leakage of ValueNumbering llvm-svn: 58933	2008-11-09 12:45:23 +00:00
Bill Wendling	3f547be28f	If the LHS of the FCMP is coming from a UIToFP instruction, then we don't want to generate signed ICMP instructions to replace the FCMP. This would violate the following: define i1 @test1(i32 %val) { %1 = uitofp i32 %val to double %2 = fcmp ole double %1, 0.000000e+00 ret i1 %2 } would be transformed into: define i1 @test1(i32 %val) { %1 = icmp slt i32 %val, 1 ret i1 %1 } which is obviously wrong. This patch modifies InstCombiner::FoldFCmp_IntToFP_Cst to handle when the LHS comes from UIToFP. llvm-svn: 58929	2008-11-09 04:26:50 +00:00
Daniel Dunbar	2b9dce2669	Rework r58829, allowing removal of dbg info intrinsics during alloca promotion. - Eliminate uses after free and simplify tests. Devang: Please check that this is still doing what you intended. llvm-svn: 58887	2008-11-08 04:12:17 +00:00
Bill Wendling	b9656df4ac	BCUI + 1 doesn't work. Use next instead. llvm-svn: 58830	2008-11-07 01:59:41 +00:00
Devang Patel	b8e0d59ceb	Handle (delete) dbg intrinsics while promoting alloca. llvm-svn: 58826	2008-11-07 01:30:07 +00:00
Mon P Wang	5ca2ec65bd	Fixed scalarizing an extract subvector and prevent an infinite loop when simplifying a vector. llvm-svn: 58820	2008-11-06 22:52:21 +00:00
Devang Patel	5a5ab730e0	InstructionNamer preserves everything. llvm-svn: 58787	2008-11-06 01:00:16 +00:00
Devang Patel	f0ef35738c	Do not allow InlineAlways pass to remove dead functions. llvm-svn: 58744	2008-11-05 01:39:16 +00:00
Devang Patel	7a848b0ee3	Check Attribute::NoInline. llvm-svn: 58742	2008-11-05 01:37:05 +00:00
Oscar Fuentes	076e048cf7	CMake: updated list of source files. llvm-svn: 58736	2008-11-05 00:11:22 +00:00
Dan Gohman	8cdea717a3	Add a new pass to simplify specific half_powr function calls. This is a specialized pass that is not likely to be generally useful. llvm-svn: 58732	2008-11-04 23:41:45 +00:00
Dale Johannesen	0a7b4f5800	Allow SROA of vectors. Removing this caused a huge performance regression in something we care about. This may not be final fix. llvm-svn: 58718	2008-11-04 20:54:03 +00:00
Devang Patel	f33f8a8606	Fix unused variable warnings. llvm-svn: 58651	2008-11-03 23:14:09 +00:00
Devang Patel	fe57d109b6	Ignore conditions that are outside the loop. llvm-svn: 58631	2008-11-03 19:38:07 +00:00
Andrew Lenharth	348f3fa6a7	add a period at the end of the comment, ignoring the fact that the comment would be hard pressed to be considered a sentence, but if it makes Bill happy... llvm-svn: 58630	2008-11-03 19:29:29 +00:00
Devang Patel	c1631db93b	Turn floating point IVs into integer IVs where possible. This allows SCEV users to effectively calculate trip count. LSR later transforms the integer IVs back to floating point IVs to avoid int-to-float casts inside the loop. llvm-svn: 58625	2008-11-03 18:32:19 +00:00
Andrew Lenharth	45b86322f2	Ensure that we are checking only calls to the function we are interested in specializing llvm-svn: 58615	2008-11-03 16:05:35 +00:00
Nick Lewycky	d73806a9cc	Replace explicit loop with utility function. llvm-svn: 58593	2008-11-03 03:49:14 +00:00
Nick Lewycky	3c6d34a7f0	Changes from Duncan's review: * merge two weak functions by making them both alias a third non-weak fn * don't reimplement CallSite::hasArgument * whitelist the safe linkage types llvm-svn: 58568	2008-11-02 16:46:26 +00:00
Duncan Sands	cede1e035c	Get this building on 64 bit machines (error: cast from ‘const llvm::PointerType*’ to ‘unsigned int’ loses precision). llvm-svn: 58561	2008-11-02 09:00:33 +00:00
Oscar Fuentes	0433be6feb	CMake: added a source file. llvm-svn: 58559	2008-11-02 06:01:39 +00:00
Nick Lewycky	d01d42e76c	Add a new MergeFunctions pass. It finds identical functions and merges them. This triggers only 60 times in llvm-test (look at .llvm.bc, not .linked.rbc) and so it probably won't be turned on by default. Also, many of those are likely to go away when PR2973 is fixed. llvm-svn: 58557	2008-11-02 05:52:50 +00:00
Nick Lewycky	8d8acf327b	Fix demanded bits analysis with srem by negative number. Based on a patch by Richard Osborne. llvm-svn: 58555	2008-11-02 02:41:50 +00:00
Dan Gohman	83eea0b17f	Fix this recently moved code to use the correct type. CI is now a ConstantInt, and SI is the original cast instruction. This fixes PR2996. llvm-svn: 58549	2008-11-02 00:17:33 +00:00
Daniel Dunbar	a1c4fcfc29	Fix warning. llvm-svn: 58486	2008-10-31 01:50:01 +00:00
Dan Gohman	13cbcf1c18	Canonicalize sext(i1) to i1?-1:0, and update various instcombine optimizations accordingly. llvm-svn: 58457	2008-10-30 20:40:10 +00:00
Daniel Dunbar	3933e66a89	Add InlineCost class to represent the estimated cost of inlining a function. - This explicitly models the costs for functions which should "always" or "never" be inlined. This fixes bugs where such costs were not previously respected. llvm-svn: 58450	2008-10-30 19:26:59 +00:00
Chris Lattner	0934c0f35b	Fix PR2967 by not deleting volatile load/stores that occur before unreachable. I don't really see this as being needed, but there is little harm from doing it. llvm-svn: 58385	2008-10-29 17:46:26 +00:00
Daniel Dunbar	e7fbf9f425	Factor shouldInline method out of Inliner. - No functionality change. llvm-svn: 58355	2008-10-29 01:02:02 +00:00
Daniel Dunbar	cc20455346	Assorted comment/naming fixes, 80-col violations, and reindentation. - No functionality change. llvm-svn: 58352	2008-10-28 23:24:26 +00:00
Dan Gohman	2c34c130bf	(A & sext(C)) | (B & ~sext(C)) -> C ? A : B llvm-svn: 58351	2008-10-28 22:38:57 +00:00
Torok Edwin	ca97b42ef7	export an ID for the instructionNamer, allowing analysis/transformation passes that need it to require it by ID. llvm-svn: 58238	2008-10-27 10:16:27 +00:00
Chris Lattner	59b5691388	Rewrite all the 'PromoteLocallyUsedAlloca[s]' logic. With the power of LargeBlockInfo, we can now dramatically simplify their implementation and speed them up at the same time. Now the code has time proportional to the number of uses of the alloca, not the size of the block. This also eliminates code that tried to batch up different allocas which are used in the same blocks, and eliminates the 'retry list' logic which was baroque and now unnecessary. In addition to being a speedup for crazy cases, this is also a nice cleanup: PromoteMemoryToRegister.cpp | 270 +++++++++++++++----------------------------- 1 file changed, 96 insertions(+), 174 deletions(-) llvm-svn: 58229	2008-10-27 07:05:53 +00:00
Chris Lattner	f594ecc453	Add a new LargeBlockInfo helper, which is just a wrapper around a trivial dense map. Use this in RewriteSingleStoreAlloca to avoid aggressively rescanning blocks over and over again. This fixes PR2925, speeding up mem2reg on the testcase in that bug from 4.56s to 0.02s in a debug build on my machine. llvm-svn: 58227	2008-10-27 06:05:26 +00:00
Nick Lewycky	f6e4dca67e	Add value range analyzing of Add and Sub. Understand that mul %x, 1 = %x. llvm-svn: 58069	2008-10-24 04:00:26 +00:00
Daniel Dunbar	7f39e2d85a	Change createPass factory functions to return Pass* instead of LoopPass*. - Although less precise, this means they can be used in clients without RTTI (who would otherwise need to include LoopPass.h, which eventually includes things using dynamic_cast). This was the simplest solution that presented itself, but I am happy to use a better one if available. llvm-svn: 58010	2008-10-22 23:32:42 +00:00
Dan Gohman	72e66eedb8	Use Function::getEntryBlock() instead of Function::front(), for clarity. llvm-svn: 57870	2008-10-21 03:10:28 +00:00
Dan Gohman	fa29b67aee	Fix a bug that prevented llvm-extract -delete from working. llvm-svn: 57864	2008-10-21 01:08:07 +00:00
Dan Gohman	215742a966	Use 0 instead of false to return a null pointer. llvm-svn: 57660	2008-10-17 00:56:52 +00:00
Dan Gohman	bc0278400c	Teach instcombine's visitLoad to scan back several instructions to find opportunities for store-to-load forwarding or load CSE, in the same way that visitStore scans back to do DSE. Also, define a new helper function for testing whether the addresses of two memory accesses are known to have the same value, and use it in both visitStore and visitLoad. These two changes allow instcombine to eliminate loads in code produced by front-ends that frequently emit obviously redundant addressing for memory references. llvm-svn: 57608	2008-10-15 23:19:35 +00:00
Evan Cheng	d885f6e139	Combine (fcmp cc0 x, y) | (fcmp cc1 x, y) into a single fcmp when possible. llvm-svn: 57515	2008-10-14 18:44:08 +00:00
Evan Cheng	ce70752b11	- Somehow I forgot about one / une. - Renumber fcmp predicates to match their icmp counterparts. - Try swapping operands to expose more optimization opportunities. llvm-svn: 57513	2008-10-14 18:13:38 +00:00
Evan Cheng	67786cce66	Optimize anding of two fcmp into a single fcmp if the operands are the same. e.g. uno && ueq -> ueq; ord && olt -> olt; ord && ueq -> oeq llvm-svn: 57507	2008-10-14 17:15:11 +00:00
Matthijs Kooijman	f7d3cb5435	Make InstructionCombining::getBitCastOperand() recognize GEP instructions and constant expression with all zero indices as being the same as a bitcast. llvm-svn: 57442	2008-10-13 15:17:01 +00:00
Chris Lattner	da435910e8	Fix PR2697 by rewriting the '(X / pos) op neg' logic. This also changes a couple other cases for clarity, but shouldn't affect correctness. Patch by Eli Friedman! llvm-svn: 57387	2008-10-11 22:55:00 +00:00
Devang Patel	647a1e532b	Check loop exit predicate properly while eliminating one iteration loop. This patch fixes PR 2869 llvm-svn: 57369	2008-10-10 22:02:57 +00:00
Nuno Lopes	e3127f3f80	fix memleak by cleaning the global sets on pass exit llvm-svn: 57353	2008-10-10 16:25:50 +00:00
Dale Johannesen	4f0bd68cfe	Add a "loses information" return value to APFloat::convert and APFloat::convertToInteger. Restore return value to IEEE754. Adjust all users accordingly. llvm-svn: 57329	2008-10-09 23:00:39 +00:00
Nick Lewycky	03c5fa18f1	Don't drop alignment on globals when cloning. llvm-svn: 57320	2008-10-09 06:27:14 +00:00
Nuno Lopes	06c67f88d7	dont specialize weak functions and the like llvm-svn: 57305	2008-10-08 18:45:59 +00:00
Duncan Sands	26ff6f9c54	Add <cstdio> include where needed by gcc-4.4. Patch by Samuel Tardieu. llvm-svn: 57291	2008-10-08 07:23:46 +00:00
Chris Lattner	42d5785dbd	Add parentheses to avoid warnings in GCC 4.4.0, patch by Samuel Tardieu! llvm-svn: 57288	2008-10-08 06:42:28 +00:00
Andrew Lenharth	5aa1cc4065	Correctly set attributes when removing args during cloning. Fixes PR2765 llvm-svn: 57254	2008-10-07 18:08:38 +00:00
Devang Patel	40aafce00d	Fix typo, fix PR 2865. llvm-svn: 57221	2008-10-06 23:22:54 +00:00
Matthijs Kooijman	cbe5e16eb5	Allow scalarrepl to treat an all-zero GEP just as bitcast. This includes not marking a GEP involving a vector as unsafe, but only when it has all zero indices. This allows scalarrepl to work in a few more cases. llvm-svn: 57177	2008-10-06 16:23:31 +00:00
Chris Lattner	917a6c1343	rewrite bswap matching to be more general, allowing arbitrary shifting and masking inside a bswap expr. This allows it to handle the cases from PR2842, which involve the intermediate 'or' expressions being shifted, not just the input value. llvm-svn: 57095	2008-10-05 02:13:19 +00:00
Chris Lattner	ca91f265c4	fix a bug where the bswap matcher could match a case involving ashr. It should only apply to lshr. llvm-svn: 57089	2008-10-05 00:50:57 +00:00
Duncan Sands	1d35e9aebe	Ignore loads from and stores to local memory (i.e. allocas) when deciding whether to mark a function readnone/readonly. Since the pass is currently run before SROA, this may be quite helpful. Requested by Chris on IRC. llvm-svn: 57050	2008-10-04 13:24:24 +00:00
Dan Gohman	e21903987f	Clean up some multiple-return-value code that is no longer applicable. llvm-svn: 57033	2008-10-03 22:21:24 +00:00
Devang Patel	f963403b58	Nick Lewycky's patch. While hoisting instruction check PHI node. llvm-svn: 57025	2008-10-03 18:57:37 +00:00
Duncan Sands	3a813a5d3f	Teach internalize to preserve the callgraph. Why? Because it was there! llvm-svn: 56996	2008-10-03 07:36:09 +00:00
Owen Anderson	cb4f156b6b	SplitBlock should only attempt to update LoopInfo if it is actually being used. llvm-svn: 56994	2008-10-03 06:55:35 +00:00
Duncan Sands	d65a4daeea	Factorize code: remove variants of "strip off pointer bitcasts and GEP's", and centralize the logic in Value::getUnderlyingObject. The difference with stripPointerCasts is that stripPointerCasts only strips GEPs if all indices are zero, while getUnderlyingObject strips GEPs no matter what the indices are. llvm-svn: 56922	2008-10-01 15:25:41 +00:00
Nuno Lopes	96740aad86	revert the addition of Preserves(CallGraph), per Duncan's comments llvm-svn: 56917	2008-10-01 09:13:40 +00:00
Dan Gohman	67d90de2b0	Call ScalarEvolution's deleteValueFromRecords before deleting an instruction, not after. This fixes some uses of free'd memory. llvm-svn: 56908	2008-10-01 02:02:03 +00:00
Nuno Lopes	5093ab4c76	add preservesCFG() + preserves(CallGraph) llvm-svn: 56887	2008-09-30 22:04:30 +00:00
Nuno Lopes	2bd7b24f1a	add AU.setPreservesCFG() since this pass only adds and removes function attributes llvm-svn: 56868	2008-09-30 18:34:38 +00:00
Nick Lewycky	e8ced3ec19	Fix misoptimization of: xor i1 (icmp eq (X, C1), icmp s[lg]t (X, C2)) llvm-svn: 56834	2008-09-30 06:08:34 +00:00
Duncan Sands	57512a1be4	Speed up these passes when the callgraph has huge simply connected components. Suggested by Chris. llvm-svn: 56787	2008-09-29 14:59:04 +00:00
Nuno Lopes	ffc9da6772	remove redundant test (mayBeOverridden() includes hasLinkOnceLinkage) llvm-svn: 56786	2008-09-29 14:40:32 +00:00
Duncan Sands	e340e18783	Tweak some comments. llvm-svn: 56784	2008-09-29 13:35:31 +00:00
Duncan Sands	08d91178e9	Rename isWeakForLinker to mayBeOverridden. Use it instead of hasWeakLinkage in a bunch of optimization passes. llvm-svn: 56782	2008-09-29 11:25:42 +00:00
Devang Patel	9eb525d4f9	Implement function notes as function attributes. llvm-svn: 56716	2008-09-26 23:51:19 +00:00
Devang Patel	a05633e105	Attributes are now divided into three groups: return attributes (inreg, zext and sext), parameter attributes, and function attributes (nounwind, readonly, readnone, noreturn). Return attributes use 0 as the index. Function attributes use ~0U as the index. This patch requires corresponding changes in llvm-gcc and clang. llvm-svn: 56704	2008-09-26 22:53:05 +00:00
Devang Patel	4c758ea3e0	Large mechanical patch. s/ParamAttr/Attribute/g s/PAList/AttrList/g s/FnAttributeWithIndex/AttributeWithIndex/g s/FnAttr/Attribute/g This sets the stage to implement function notes as function attributes and to distinguish between function attributes and return value attributes. This requires corresponding changes in llvm-gcc and clang. llvm-svn: 56622	2008-09-25 21:00:45 +00:00
Evan Cheng	25dd4a2daf	Commit CodeGenPrepare.cpp changes which were accidentally left out of 56526. llvm-svn: 56549	2008-09-24 06:48:55 +00:00
Eric Christopher	c1ea149dcd	Fix fallout in CodeGenPrepare from 56526. Will likely need more work. llvm-svn: 56546	2008-09-24 05:32:41 +00:00
Devang Patel	6402c7236f	s/ParamAttrsWithIndex/FnAttributeWithIndex/g llvm-svn: 56535	2008-09-24 00:55:02 +00:00
Devang Patel	e15607b7bb	Put FN_NOTE_AlwaysInline and others in FnAttr namespace. llvm-svn: 56527	2008-09-24 00:06:15 +00:00
Devang Patel	e87abd26ba	Move FN_NOTE_AlwaysInline and other out of ParamAttrs namespace. Do not check isDeclaration() in hasNote(). It is clients' responsibility. llvm-svn: 56524	2008-09-23 23:52:03 +00:00
Devang Patel	ba3fa6c6e1	s/ParameterAttributes/Attributes/g llvm-svn: 56513	2008-09-23 23:03:40 +00:00
Devang Patel	82fed6702b	Use parameter attribute store (soon to be renamed) for Function Notes also. Function notes are stored at index ~0. llvm-svn: 56511	2008-09-23 22:35:17 +00:00
Devang Patel	329fe728b5	Add hasNote() to check note associated with a function. llvm-svn: 56477	2008-09-22 22:32:29 +00:00
Oscar Fuentes	a229b3c9a7	Initial support for the CMake build system. llvm-svn: 56419	2008-09-22 01:08:49 +00:00
Duncan Sands	e1dc84be64	Implement review feedback from Devang: make use of mayReadFromMemory and mayWriteToMemory. llvm-svn: 56387	2008-09-20 16:45:58 +00:00
Duncan Sands	310077034a	Remove the MarkModRef pass (use AddReadAttrs instead). Unfortunately this means removing one regression test of GlobalsModRef because I couldn't work out how to perform it without MarkModRef. llvm-svn: 56342	2008-09-19 08:23:44 +00:00
Duncan Sands	af25ee7ffc	Add a new pass AddReadAttrs which works out which functions can get the readnone/readonly attributes, and gives them it. The plan is to remove markmodref (which did the same thing by querying GlobalsModRef) and delete the analogous functionality from GlobalsModRef. llvm-svn: 56341	2008-09-19 08:17:05 +00:00
Devang Patel	c25be3b2de	splitLoop does not handle split condition EQ. Fixes PR 2805 llvm-svn: 56321	2008-09-18 23:45:14 +00:00
Bill Wendling	a00fa322b1	Decrementing the iterator here could be wrong if the worklist is empty after the "erase". Thanks to Ji Young Park for the patch! llvm-svn: 56316	2008-09-18 23:04:18 +00:00
Devang Patel	76b22c1420	Try to place hoisted instructions before icmp instruction. llvm-svn: 56315	2008-09-18 22:50:42 +00:00
Devang Patel	7f9671ba37	Do not hoist instruction above branch condition. The instruction may use branch condition. llvm-svn: 56286	2008-09-17 18:21:49 +00:00
Devang Patel	dca8d3b183	Do not ignore iv uses outside the loop. This one slipped through cracks very well. llvm-svn: 56284	2008-09-17 17:53:47 +00:00
Dan Gohman	dafa9c6e85	Improve instcombine's handling of integer min and max in two ways: - Recognize expressions like "x > -1 ? x : 0" as min/max and turn them into expressions like "x < 0 ? 0 : x", which is easily recognizable as a min/max operation. - Refrain from folding expression like "y/2 < 1" to "y < 2" when the comparison is being used as part of a min or max idiom, like "y/2 < 1 ? 1 : y/2". In that case, the division has another use, so folding doesn't eliminate it, and obfuscates the min/max, making it harder to recognize as a min/max operation. These benefit ScalarEvolution, CodeGen, and anything else that wants to recognize integer min and max. llvm-svn: 56246	2008-09-16 18:46:06 +00:00
Dan Gohman	68e7735a38	Teach LSR to optimize away SMAX operations for tripcounts in common cases. See the comment above OptimizeSMax for the full story, and the testcase for an example. This cancels out a pessimization commonly attributed to indvars, and will allow us to lift some of the artificial throttles in indvars, rather than add new ones. llvm-svn: 56230	2008-09-15 21:22:06 +00:00
Dan Gohman	eff71f2953	On 64-bit targets, change 32-bit getelementptr indices to be 64-bit getelementptr indices, inserting an explicit cast if necessary. This helps expose the sign-extension operation to other optimizations. llvm-svn: 56133	2008-09-11 23:06:38 +00:00
Dan Gohman	7d01c0654c	Fix a vectorshuffle instcombine bug introduced by r55995. Patch by Nicolas Capens! llvm-svn: 56129	2008-09-11 22:47:57 +00:00
Dan Gohman	9b9d547a5c	Fix a copy+paste bug that Duncan spotted. For several cases it was still getting lucky and detecting overflow but it was clearly incorrect. llvm-svn: 56113	2008-09-11 18:53:02 +00:00
Dan Gohman	9d9a4be588	In my analysis for r56076 I missed the case where the original multiplication overflows. llvm-svn: 56082	2008-09-11 00:25:00 +00:00
Dan Gohman	c1ae01688f	Fix an icmp+sdiv optimization to check for and handle an overflow condition. This fixes PR2740. llvm-svn: 56076	2008-09-10 23:30:57 +00:00
Devang Patel	728c44ab56	fix white spaces. llvm-svn: 56056	2008-09-10 14:49:55 +00:00
Dan Gohman	97f0a0f28d	Fix a warning about comparing signed and unsigned values. llvm-svn: 56040	2008-09-10 01:09:32 +00:00
Devang Patel	92b032f3e6	if loop induction variable is always sign or zero extended then extend the type of induction variable. llvm-svn: 56017	2008-09-09 21:41:07 +00:00
Devang Patel	92c5367705	fix overflow check. llvm-svn: 56011	2008-09-09 20:54:34 +00:00
Anton Korobeynikov	1a1140429e	Make safer variant of alias resolution routine to be default llvm-svn: 56005	2008-09-09 20:05:04 +00:00
Anton Korobeynikov	a9b60ee0fc	Resolve aliases, when possible llvm-svn: 56001	2008-09-09 19:04:59 +00:00
Dan Gohman	86fb5b48de	Make SimplifyDemandedVectorElts simplify vectors with multiple users, and teach it about shufflevector instructions. Also, fix a subtle bug in SimplifyDemandedVectorElts' insertelement code. This is a patch that was originally written by Eli Friedman, with some fixes and cleanup by me. llvm-svn: 55995	2008-09-09 18:11:14 +00:00
Devang Patel	0f7a3507cf	Fix simplifycfg crash in handling block merge. llvm-svn: 55971	2008-09-09 01:06:56 +00:00
Devang Patel	3d56051f70	s/RemoveUnreachableBlocks/RemoveUnreachableBlocksFromFn/g llvm-svn: 55965	2008-09-08 22:14:17 +00:00
Devang Patel	7518f250b9	Remove unused counter. llvm-svn: 55924	2008-09-08 17:14:54 +00:00
Devang Patel	538a7f479a	Remove OptimizeIVType() llvm-svn: 55913	2008-09-08 16:13:27 +00:00
Duncan Sands	b9a6f861b4	Update the callgraph correctly. llvm-svn: 55896	2008-09-08 11:08:09 +00:00
Duncan Sands	3cf7d86556	Update the callgraph correctly in ArgumentPromotion. llvm-svn: 55895	2008-09-08 11:07:35 +00:00
Duncan Sands	46911f1271	Reapply 55859. This doesn't change anything as long as the callgraph is correct. It checks for wrong callgraphs more strictly. llvm-svn: 55894	2008-09-08 11:05:51 +00:00
Duncan Sands	1ea0d2e6db	Correct a comment and strip trailing whitespace. llvm-svn: 55883	2008-09-07 09:54:09 +00:00
Nuno Lopes	421f488cb7	fix crash when the malloc/free function is defined or is a declaration with 0 parameters. this pass doesn't seem to be used, but still it's now a little more correct llvm-svn: 55873	2008-09-06 17:44:06 +00:00
Duncan Sands	95c2a7848a	When PruneEH turned an invoke into an ordinary call (thus changing the call site) it didn't inform the callgraph about this. But the call site does matter - as shown by the testcase, the callgraph became invalid after the inliner ran (with an edge between two functions simply missing), resulting in wrong deductions by GlobalsModRef. llvm-svn: 55872	2008-09-06 17:19:29 +00:00
Owen Anderson	1dd2e40521	Revert r55859. This is breaking the build in the absence of its companion commit. llvm-svn: 55865	2008-09-05 23:36:01 +00:00
Devang Patel	d94269f906	Remove unused map. llvm-svn: 55861	2008-09-05 21:55:33 +00:00
Duncan Sands	9e23602849	Delete the removeCallEdgeTo callgraph method, because it does not maintain a correct list of callsites. I discovered (see following commit) that the inliner will create a wrong callgraph if it is fed a callgraph with correct edges but incorrect callsites. These were created by Prune-EH, and while it wasn't done via removeCallEdgeTo, it could have been done via removeCallEdgeTo, which is an accident waiting to happen. Use removeCallEdgeFor instead. llvm-svn: 55859	2008-09-05 21:43:04 +00:00
Duncan Sands	3a52056d4d	Use removeAllCalledFunctions rather than removing edges one by one by hand. llvm-svn: 55836	2008-09-05 14:56:53 +00:00
Duncan Sands	7c8fb1ad93	Remove trailing whitespace. llvm-svn: 55835	2008-09-05 12:37:12 +00:00
Duncan Sands	6dd02b5219	Make this pass return that it made a change if it modifies a function's attributes. llvm-svn: 55831	2008-09-05 09:08:37 +00:00
Devang Patel	40519f0370	A loop may be unswitched multiple times. Reconstruct dom info. at the end. llvm-svn: 55806	2008-09-04 22:43:59 +00:00
Devang Patel	00ec74616b	Initialize loop data first. llvm-svn: 55792	2008-09-04 20:36:36 +00:00
Devang Patel	d52071540c	Do not unswitch if the function notes say we're optimizing this function for size. llvm-svn: 55786	2008-09-04 18:55:13 +00:00
Andrew Lenharth	19fb2aba50	try to separate the mechanism into something others can use llvm-svn: 55785	2008-09-04 18:51:26 +00:00
Dale Johannesen	fe1bb7964c	Add intrinsic forms of pow and exp2. The non-intrinsic forms remain to handle older IR files, but will go away soon. llvm-svn: 55781	2008-09-04 18:30:46 +00:00
Dan Gohman	a79db30d28	Tidy up several unbeseeming casts from pointer to intptr_t. llvm-svn: 55779	2008-09-04 17:05:41 +00:00
Andrew Lenharth	95d573a7f0	cleanup as per Duncan's review llvm-svn: 55766	2008-09-04 14:34:22 +00:00
Devang Patel	a26e2075b8	Update inline threshold for current function if the notes say, optimize for size. llvm-svn: 55745	2008-09-03 23:06:09 +00:00
Owen Anderson	2fbfb70530	Fix a bug that prevented PRE from applying in some cases. llvm-svn: 55744	2008-09-03 23:06:07 +00:00
Andrew Lenharth	9fed8f5b9c	Initial version of a Partial Specialization IPO pass. It triggers a couple hundred times on 176.gcc. I don't know the performance impact yet, the heuristic is quite simple still. llvm-svn: 55734	2008-09-03 21:00:28 +00:00
Devang Patel	a563d24e5d	Fix typo in a comment. llvm-svn: 55720	2008-09-03 20:25:40 +00:00
Devang Patel	a4211876e5	Add parentheses to make code more readable. llvm-svn: 55717	2008-09-03 19:57:15 +00:00
Devang Patel	50c66cdb0d	Fix comments. llvm-svn: 55716	2008-09-03 19:52:17 +00:00
Devang Patel	924d9084d8	Add custom inliner that handles only functions that are marked as always_inline. llvm-svn: 55713	2008-09-03 18:50:53 +00:00
Devang Patel	0d442ffa2b	Handle "always inline" note during inline cost analysis. llvm-svn: 55712	2008-09-03 18:47:45 +00:00
Devang Patel	79661994b1	Check noinline note and ignore other notes. llvm-svn: 55711	2008-09-03 18:46:35 +00:00
Devang Patel	62be9ad270	Handle "noinline" note inside the simple inliner. llvm-svn: 55708	2008-09-03 18:10:21 +00:00
Nick Lewycky	2fcb26cc75	Don't apply this transform to vectors. Fixes PR2756. llvm-svn: 55690	2008-09-03 06:24:21 +00:00
Devang Patel	bcd39345de	Add additional check to ensure that iv is canonicalized. llvm-svn: 55682	2008-09-03 00:29:13 +00:00
Devang Patel	b530f08122	Check iteration count. llvm-svn: 55680	2008-09-03 00:10:56 +00:00
Devang Patel	81fed043c5	While removing PHI, use basicblock to identify incoming value. llvm-svn: 55678	2008-09-03 00:02:42 +00:00
Devang Patel	7e59270272	s/FP_AlwaysInline/FN_NOTE_AlwaysInline/g llvm-svn: 55676	2008-09-02 22:43:57 +00:00
Devang Patel	43c5a52e07	If all IV uses are extending integer IV then change the type of IV itself, if possible. llvm-svn: 55674	2008-09-02 22:18:08 +00:00
Devang Patel	bfa535af9f	respect inline=never and inline=always notes. llvm-svn: 55673	2008-09-02 22:16:13 +00:00
Duncan Sands	130d9efec3	Add a small pass that sets the readnone/readonly attributes on functions, based on the result of alias analysis. It's not hardwired to use GlobalsModRef even though this is the only (AFAIK) alias analysis that results in this pass actually doing something. Enable as follows: opt ... -globalsmodref-aa -markmodref ... Advantages of this pass: (1) records the result of globalsmodref in the bitcode, meaning it is available for use by later passes (currently the pass manager isn't smart enough to magically make an advanced alias analysis available to all later passes), which may expose more optimization opportunities; (2) hopefully speeds up compilation when code is optimized twice, for example when a file is compiled to bitcode, then later LTO is done on it: marking functions readonly/readnone when producing the initial bitcode should speed up alias analysis during LTO; (3) good for discovering that globalsmodref doesn't work very well :) Not currently turned on by default. llvm-svn: 55604	2008-09-01 11:40:11 +00:00
Devang Patel	d6adbb6a0f	Do not apply the transformation if the target does not support DestTy natively. llvm-svn: 55433	2008-08-27 20:55:23 +00:00
Devang Patel	cf7ca5d0ba	Fix typos and whitespaces. Other cosmetic changes based on feedback. llvm-svn: 55424	2008-08-27 17:50:18 +00:00
Owen Anderson	b39e0decf8	Put a heuristic in place to prevent GVN from falling into bad cases with massively complicated CFGs. This speeds up a particular testcase from 12+ hours to 5 seconds with little perceptible loss of quality. llvm-svn: 55391	2008-08-26 22:07:42 +00:00
Devang Patel	4310d39844	If IV is used in an int-to-float cast inside the loop then try to eliminate the cast operation. llvm-svn: 55374	2008-08-26 17:57:54 +00:00
Chris Lattner	add44f3fb7	improve encapsulation of the BBExecutable set. llvm-svn: 55271	2008-08-23 23:39:31 +00:00
Chris Lattner	65938fc69a	Switch an assortment of maps, sets and vectors to more efficient versions, patch contributed by m-s! llvm-svn: 55270	2008-08-23 23:36:38 +00:00
Chris Lattner	0c19df4871	Switch the asmprinter (.ll) and all the stuff it requires over to use raw_ostream instead of std::ostream. Among other goodness, this speeds up llvm-dis of kc++ with a release build from 0.85s to 0.49s (88% faster). Other interesting changes: 1) This makes Value::print be non-virtual. 2) AP[S]Int and ConstantRange can no longer print to ostream directly, use raw_ostream instead. 3) This fixes a bug in raw_os_ostream where it didn't flush itself when destroyed. 4) This adds a new SDNode::print method, instead of only allowing "dump". A lot of APIs have both std::ostream and raw_ostream versions, it would be useful to go through and systematically annihilate the std::ostream versions. This passes dejagnu, but there may be minor fallout, plz let me know if so and I'll fix it. llvm-svn: 55263	2008-08-23 22:23:09 +00:00
Chris Lattner	20abc419e5	Add a new trivial -inst-namer pass which makes it possible to diff the before/after effects of a pass, crazy! llvm-svn: 55230	2008-08-23 06:07:02 +00:00
Chris Lattner	3f972c9150	Fix PR2423 by checking all indices for out of range access, not only indices that start with an array subscript. x->field[10000] is just as bad as (*X)[14][10000]. llvm-svn: 55226	2008-08-23 05:21:06 +00:00
Chris Lattner	5fc8ab6d18	consolidate DenseMapInfo implementations, and add one for std::pair. Patch contributed by m-s. llvm-svn: 55167	2008-08-22 05:08:25 +00:00
Nick Lewycky	99f4558117	Revert r54876 r54877 r54906 and r54907. Evan found that these caused a 20% slowdown in bzip2. llvm-svn: 55113	2008-08-21 05:56:10 +00:00
Evan Cheng	f5a7e51c81	Silence a compiler warning. llvm-svn: 55087	2008-08-20 23:36:48 +00:00
Mon P Wang	1b2c061b73	Fixed shuffle optimizations to handle non power of 2 vectors llvm-svn: 55035	2008-08-20 02:23:25 +00:00
Chris Lattner	57693dda1d	don't use the result of WriteAsOperand llvm-svn: 54979	2008-08-19 04:45:19 +00:00
Nick Lewycky	75d4a83f2f	Make this comment clearer. Instead of using an ambiguous ~ (not) on an icmp predicate, swap the order of the operands. llvm-svn: 54907	2008-08-17 20:02:02 +00:00
Nick Lewycky	53b44029d6	Consider the case where xor by -1 and xor by 128 have been combined already to produce an xor by 127. llvm-svn: 54906	2008-08-17 19:58:24 +00:00
Gordon Henriksen	d930f913e6	Rename some GC classes so that their role will hopefully be clearer. In particular, Collector was confusing to implementors. Several thought that this compile-time class was the place to implement their runtime GC heap. Of course, it doesn't even exist at runtime. Specifically, the renames are: Collector -> GCStrategy CollectorMetadata -> GCFunctionInfo CollectorModuleMetadata -> GCModuleInfo CollectorRegistry -> GCRegistry Function::getCollector -> getGC (setGC, hasGC, clearGC) Several accessors and nested types have also been renamed to be consistent. These changes should be obvious. llvm-svn: 54899	2008-08-17 18:44:35 +00:00
Evan Cheng	5dabe042a6	Revert 54821. It's miscompiling 252.eon and 447.dealII llvm-svn: 54878	2008-08-17 08:07:31 +00:00
Nick Lewycky	18c6f56c76	I found a better place for this optz'n. llvm-svn: 54877	2008-08-17 07:54:14 +00:00
Nick Lewycky	18f50b2637	Xor'ing both sides of icmp by sign-bit is equivalent to swapping signedness of the predicate. Also, make this optz'n apply in more cases where it's safe to do so. llvm-svn: 54876	2008-08-17 07:34:14 +00:00
Chris Lattner	17f7165f84	Rework the routines that convert AP[S]Int into a string. Now, instead of returning an std::string by value, it fills in a SmallString/SmallVector passed in. This significantly reduces string thrashing in some cases. More specifically, this: - Adds an operator<< and a print method for APInt that allows you to directly send them to an ostream. - Reimplements APInt::toString to be much simpler and more efficient algorithmically in addition to not thrashing strings quite as much. This speeds up llvm-dis on kc++ by 7%, and may also slightly speed up the asmprinter. This also fixes a bug I introduced into the asmwriter in a previous patch w.r.t. alias printing. llvm-svn: 54873	2008-08-17 07:19:36 +00:00
Owen Anderson	affe0267f8	Remove GCSE, ValueNumbering, and LoadValueNumbering. These have been deprecated for almost a year; it's finally time for them to go away. llvm-svn: 54822	2008-08-15 21:31:02 +00:00
Devang Patel	f2a03d5a4b	Reapply 54786. Add overflow and number of mantissa bits checks. llvm-svn: 54821	2008-08-15 21:21:34 +00:00
Evan Cheng	86834d29f3	Revert 54786. It's not checking for overflows, etc. llvm-svn: 54813	2008-08-15 08:12:11 +00:00
Chris Lattner	1d23915a8f	use smallvector instead of vector for a couple worklists. This speeds up instcombine by ~10% on some testcases. llvm-svn: 54811	2008-08-15 04:03:01 +00:00
Bill Wendling	861bec78f8	Temporarily revert r54792. It's causing an ICE during bootstrapping. llvm-svn: 54804	2008-08-14 23:05:24 +00:00
Devang Patel	52dc07b01a	Use DenseMap. Patch by Pratik Solanki. llvm-svn: 54792	2008-08-14 21:31:10 +00:00
Devang Patel	054a833dd4	If IV is used in an int-to-float cast inside the loop then try to eliminate the cast operation. llvm-svn: 54786	2008-08-14 20:58:31 +00:00
Dan Gohman	8de6d22392	Use empty() instead of begin() == end(). llvm-svn: 54780	2008-08-14 18:13:49 +00:00
Matthijs Kooijman	4801bd41cf	Replace two for loops with while(!X->use_empty()) loops. This prevents invalidating the iterator by deleting the current use. This fixes a segfault on 64 bit linux reported in PR2675. Also remove an unneeded if. llvm-svn: 54778	2008-08-14 15:03:05 +00:00
Dan Gohman	6134fbccef	Fix a bogus srem rule - a negative value srem'd by a power-of-2 can have a non-negative result; for example, -16%16 is 0. Also, clarify the related comments. This fixes PR2670. llvm-svn: 54767	2008-08-13 23:12:35 +00:00
Dan Gohman	8ded5d5884	Fix SCCP's handling of struct value loads and stores. SCCP doesn't track individual leaf values in such cases, so it needs to treat struct values as normal values in this case. llvm-svn: 54760	2008-08-13 21:22:48 +00:00
Devang Patel	6369a798ba	Rename. s/FindIVForUser/FindIVUserForCond/g llvm-svn: 54754	2008-08-13 20:31:11 +00:00
Devang Patel	97387e6615	Check sign to detect overflow before changing compare stride. llvm-svn: 54710	2008-08-13 02:05:14 +00:00
Bill Wendling	f21a38700f	Remove tabs. llvm-svn: 54707	2008-08-12 23:15:44 +00:00
Chris Lattner	2aa0ff27aa	Implement support for simplifying vector comparisons by 0.0 and 1.0 like we do for scalars. Patch contributed by Nicolas Capens This also generalizes the previous xforms to work on long double, now that isExactlyValue works for long double. llvm-svn: 54653	2008-08-11 22:06:05 +00:00
Eric Christopher	5927883970	Have IRBuilder take a template argument on whether or not to preserve names. This can save a lot of allocations if you aren't going to be looking at the output. llvm-svn: 54546	2008-08-08 19:39:37 +00:00
Matthijs Kooijman	75b4fc2c84	Let SRETPromotion properly preserve the function name instead of (implicitly) postfixing it with a number. llvm-svn: 54468	2008-08-07 16:01:23 +00:00
Matthijs Kooijman	d6c1c8a974	Fix SRETPromotion, it was generating functions without returns statements since r53941 (but this was not noticed due to the lack of a basic test for SRETPromotion). llvm-svn: 54467	2008-08-07 15:58:09 +00:00
Matthijs Kooijman	41536988dd	Add some debug output to SRETPromotion. llvm-svn: 54464	2008-08-07 15:14:04 +00:00
Dan Gohman	ac22cfcae9	Fix a shufflevector instcombine that was emitting invalid masks indices when it meant to be emitting undef indices. llvm-svn: 54417	2008-08-06 18:17:32 +00:00
Evan Cheng	907dc2bc37	Fix PR2355: bug in ChangeCompareStride. When the loop termination compare is the only use of its iv stride, the stride can be eliminated by moving it to another stride. If the scale is negative, swap the predicate instead of using an inverse predicate. llvm-svn: 54415	2008-08-06 18:04:43 +00:00
Chris Lattner	f5b353c1fd	optimize a common idiom generated by clang for bitfield access, PR2638. llvm-svn: 54408	2008-08-06 07:35:52 +00:00
Chris Lattner	7bdaecb7f4	Zap sitofp/fptoui pairs. In all cases when the sign difference matters, the result is undefined anyway. llvm-svn: 54396	2008-08-06 05:13:06 +00:00
Nick Lewycky	bf42893567	Reinstate this optimization, but without the miscompile. Thanks to Bill for tracking down that this was breaking llvm-gcc bootstrap on Linux. llvm-svn: 54394	2008-08-06 04:54:03 +00:00
Dan Gohman	1fcc804cfd	Pass the computed iteration count value to RewriteLoopExitValues instead of having it call getIterationCount again. llvm-svn: 54380	2008-08-05 22:34:21 +00:00
Bill Wendling	ee12a7aeff	Revert r53282. This was causing a miscompile on Linux. Also, the transformation looks bogus. Please see PR2629 for details on why this is breaking things. llvm-svn: 54372	2008-08-05 21:23:45 +00:00
Dan Gohman	3da016d137	Trim #includes. llvm-svn: 54350	2008-08-05 15:32:23 +00:00
Duncan Sands	c1e48b582d	Fix comment typos. llvm-svn: 54266	2008-08-01 12:23:49 +00:00
Nate Begeman	fecbc8cff1	Add vector shifts to the IR, patch by Eli Friedman. CodeGen & Clang work coming next. llvm-svn: 54161	2008-07-29 15:49:41 +00:00
Matthijs Kooijman	98b5c16e3b	Add -unroll-allow-partial command line option that enables the loop unroller to partially unroll a loop when fully unrolling would not fit under the threshold. Patch by Mikael Lepistö. llvm-svn: 54160	2008-07-29 13:21:23 +00:00
Matthijs Kooijman	fd3070459b	Restructure ArgumentPromotion a bit. Instead of just having a single boolean that says "unconditional loads from this argument are safe", we now keep track of the safety per set of indices from which loads happen. This prevents ArgPromotion from promoting loads that aren't really valid. As an added effect, this will now disregard the type of the indices passed to a GEP, so "load GEP %A, i32 1" and "load GEP %A, i64 1" will result in a single argument, not two. This fixes PR2598, for which a testcase has been added as well. llvm-svn: 54159	2008-07-29 10:00:13 +00:00
Owen Anderson	813bf7af7f	Don't remove volatile loads. Thanks to Duncan for noticing this one. llvm-svn: 54144	2008-07-28 20:52:42 +00:00
Owen Anderson	3f3389745d	Add support for eliminating stores that store the same value that was just loaded. This fixes PR2599. llvm-svn: 54133	2008-07-28 16:14:26 +00:00
Dan Gohman	2ce6f2ad5e	Rename SDOperand to SDValue. llvm-svn: 54128	2008-07-27 21:46:04 +00:00
Dan Gohman	5f36a32e7b	Put the LICM of constant GlobalVariables, introduced in r53945, under a command-line option, and disable it by default. It introduced performance regressions because CodeGen is currently not able to remat such loads. llvm-svn: 53997	2008-07-24 23:57:25 +00:00
Chris Lattner	8a8fb908dc	"Allow LICM to sink or lift loads from constant memory. Also add a test case for this. This allows instructions like loads from global variables declared to be constant to be moved out of loops." Patch by Stefanus Du Toit! llvm-svn: 53945	2008-07-23 05:06:28 +00:00
Dan Gohman	fa1211f69b	Enable first-class aggregates support. Remove the GetResultInst instruction. It is still accepted in LLVM assembly and bitcode, where it is now auto-upgraded to ExtractValueInst. Also, remove support for return instructions with multiple values. These are auto-upgraded to use InsertValueInst instructions. The IRBuilder still accepts multiple-value returns, and auto-upgrades them to InsertValueInst instructions. llvm-svn: 53941	2008-07-23 00:34:11 +00:00
Dan Gohman	7ad3cd8c9d	Fix a bug in LSR's dead-PHI cleanup. If a PHI has a def-use chain that leads into a cycle involving a different PHI, LSR got stuck running around that cycle looking for the original PHI. To avoid this, keep track of visited PHIs and stop searching if we see one more than once. This fixes PR2570. llvm-svn: 53879	2008-07-21 21:45:02 +00:00
Duncan Sands	2c741145a7	Suppress a gcc-4.3 warning. llvm-svn: 53771	2008-07-18 21:06:02 +00:00
Owen Anderson	04a6e0ba8c	Make PRE actually handle critical edges (by splitting them). Confirmed that bootstrap passes with this change. llvm-svn: 53762	2008-07-18 18:03:38 +00:00
Owen Anderson	9858691f25	Reapply r53735. My last patch fixed the failures Dan observed. llvm-svn: 53761	2008-07-18 17:49:43 +00:00
Owen Anderson	1468bec06e	Add some checks that got lost in the shuffle. This fixes 464.h264ref. llvm-svn: 53760	2008-07-18 17:46:41 +00:00
Dan Gohman	29c3adaae0	Revert r53735. It broke SPEC 464.h264ref. llvm-svn: 53757	2008-07-18 16:44:49 +00:00
Owen Anderson	fd7102037d	Use MergeBlockIntoPredecessor to simplify some code. llvm-svn: 53735	2008-07-17 20:00:46 +00:00
Owen Anderson	27405efdc0	Make MergeBlockIntoPredecessor more aggressive when the same successor appears more than once. llvm-svn: 53731	2008-07-17 19:42:29 +00:00
Owen Anderson	addbe3eed1	Enable PRE. My last batch of changes fixed the miscompile. llvm-svn: 53730	2008-07-17 19:41:00 +00:00
Matthijs Kooijman	8b69d77a7a	Make GlobalOpt preserve address spaces when scalar replacing aggregate globals. llvm-svn: 53716	2008-07-17 11:59:53 +00:00
Chris Lattner	c600c53d1f	Fix PR2553 llvm-svn: 53715	2008-07-17 06:07:20 +00:00
Evan Cheng	97cd0298cc	Inliner tweak. Function calls should cost more than one instruction! llvm-svn: 53712	2008-07-17 01:31:49 +00:00
Owen Anderson	c062381c7b	Factor MergeBlockIntoPredecessor out into BasicBlockUtils. llvm-svn: 53705	2008-07-17 00:01:40 +00:00
Owen Anderson	ac31096311	There's no need to iterate block merging and PRE. In fact, iterating the latter could cause problems for memdep when it breaks critical edges. llvm-svn: 53691	2008-07-16 17:52:31 +00:00
Matthijs Kooijman	c1d7477ed2	Redo InstCombiner::visitExtractValueInst. Instead of using the (complicated) FindInsertedValue, it now performs a number of simple transformations that should result in the same effect when applied iteratively. llvm-svn: 53673	2008-07-16 12:55:45 +00:00
Evan Cheng	c97094552c	Fix PR2296. Do not transform x86_sse2_storel_dq into a full-width store. llvm-svn: 53666	2008-07-16 07:28:14 +00:00
Owen Anderson	24768e3dc4	Revert this, as it seems to still be broken. llvm-svn: 53627	2008-07-15 17:59:02 +00:00
Owen Anderson	9d1f497a28	Enable local PRE by default. llvm-svn: 53616	2008-07-15 16:28:23 +00:00
Owen Anderson	53d546e40b	Have GVN do a pre-pass over the CFG that folds away unconditional branches where possible. This allows local PRE to be more aggressive. llvm-svn: 53615	2008-07-15 16:28:06 +00:00
Matthijs Kooijman	c893bf472d	Allow deadargelim to change return types even though no values were dead. This again canonicalizes {i32} into i32 and {} into void. llvm-svn: 53610	2008-07-15 14:42:31 +00:00
Matthijs Kooijman	5e8c022e21	Revert r53606. It turns out that explicitly tracking the liveness of the return value as a whole in deadargelim is really not needed now that we simply rebuild the old return value and actually prevents some canonicalization from taking place. This revert stops deadargelim from changing {i32} into i32 for now, but I'll fix that next. llvm-svn: 53609	2008-07-15 14:39:36 +00:00
Matthijs Kooijman	c1da874478	Make deadargelim a bit less smart, so it doesn't choke on nested structs as return values that are still (partially) live. Instead of updating all uses of a call instruction after removing some elements, it now just rebuilds the original struct (With undef gaps where the unused values were) and leaves it to instcombine to clean this up. The added testcase still fails currently, but this is due to instcombine which isn't good enough yet. I will fix that part next. llvm-svn: 53608	2008-07-15 14:03:10 +00:00
Matthijs Kooijman	04d4c328ac	Don't use isa when we can reuse a previous dyn_cast. llvm-svn: 53607	2008-07-15 13:39:08 +00:00
Matthijs Kooijman	84194b6768	Make DeadArgElim keep liveness of the return value as a whole in addition to only the liveness of partial return values (for functions returning a struct). This is more explicit to prevent unwanted changes in the return value. In particular, deadargelim now canonicalizes a function returning {i32} to returning i32 and {} to void, if the struct returned is not used in its entirety, but only the single element is used. llvm-svn: 53606	2008-07-15 13:36:06 +00:00
Matthijs Kooijman	79a8eb547c	Let DAE keep a list of live functions, instead of simply marking all arguments and return values live for those functions. This doesn't change anything yet, but prepares for the coming commits. llvm-svn: 53601	2008-07-15 09:11:16 +00:00
Matthijs Kooijman	e9af814669	Split DAE::MarkLive into MarkLive and PropagateLiveness. llvm-svn: 53600	2008-07-15 09:00:17 +00:00
Matthijs Kooijman	2ce5709e31	Pass around const RetOrArg references instead of copying values. Also, mark RetOrArg::getDescription() as const. llvm-svn: 53599	2008-07-15 08:56:49 +00:00
Matthijs Kooijman	f2860b9fb3	Simplify debug code by using RetOrArg::getDescription(). llvm-svn: 53598	2008-07-15 08:53:36 +00:00
Matthijs Kooijman	90d08addb0	Fix indentation (intentionally left out of the previous commit). llvm-svn: 53592	2008-07-15 08:47:32 +00:00
Matthijs Kooijman	06642d3812	Move the deadargelim code for intrinsically alive functions into its own method, to slightly simplify control flow. llvm-svn: 53591	2008-07-15 08:45:12 +00:00
Dan Gohman	162668fa78	Fix uninitialized use of the Changed variable. llvm-svn: 53564	2008-07-14 17:55:01 +00:00
Chris Lattner	8882b1c41c	Reapply r53540, now with the matching header! llvm-svn: 53557	2008-07-14 17:32:59 +00:00
Duncan Sands	68b0383057	Revert r53540 - it does not compile. llvm-svn: 53549	2008-07-14 07:59:28 +00:00
Chris Lattner	2831ad28be	If a function calls setjmp, never inline it into other functions. This is a hack around the fact that we don't represent the CFG correctly for sj/lj. It fixes PR2486. llvm-svn: 53540	2008-07-14 00:46:56 +00:00
Chris Lattner	6f5ea6e49c	simplify some code, shuffle and insertelt always return a vector. llvm-svn: 53538	2008-07-14 00:32:20 +00:00
Chris Lattner	16395e51f4	Fix PR2506 by being a bit more careful about reverse fact propagation when disproving a condition. This actually compiles the existing testcase (udiv_select_to_select_shift) to: define i64 @test(i64 %X, i1 %Cond) { entry: %divisor1.t = lshr i64 %X, 3 ; <i64> [#uses=1] %quotient2 = lshr i64 %X, 3 ; <i64> [#uses=1] %sum = add i64 %divisor1.t, %quotient2 ; <i64> [#uses=1] ret i64 %sum } instead of: define i64 @test(i64 %X, i1 %Cond) { entry: %quotient1.v = select i1 %Cond, i64 3, i64 4 ; <i64> [#uses=1] %quotient1 = lshr i64 %X, %quotient1.v ; <i64> [#uses=1] %quotient2 = lshr i64 %X, 3 ; <i64> [#uses=1] %sum = add i64 %quotient1, %quotient2 ; <i64> [#uses=1] ret i64 %sum } llvm-svn: 53534	2008-07-14 00:15:52 +00:00
Chris Lattner	80b03a1b49	Fix mishandling of the infinite loop case when merging two blocks. This fixes PR2540. llvm-svn: 53533	2008-07-13 22:23:11 +00:00
Chris Lattner	834ab4ec1b	more refactoring. Use early exits instead of really complex logic. No functionality change. llvm-svn: 53532	2008-07-13 22:04:41 +00:00
Chris Lattner	5eed37224a	improve comments. llvm-svn: 53531	2008-07-13 21:55:46 +00:00
Chris Lattner	9aada1d755	factor another large hunk of code out into its own function. No functionality change. llvm-svn: 53530	2008-07-13 21:53:26 +00:00
Chris Lattner	55eaae1e0c	Final bit of simplification for FoldBranchToCommonDest. llvm-svn: 53528	2008-07-13 21:20:19 +00:00
Chris Lattner	1b317ea48a	simplify logic a bit llvm-svn: 53527	2008-07-13 21:15:11 +00:00
Chris Lattner	2e25b8f444	Refactor some code out into its own helper function, getting rid of crazy multiline conditionals and commenting the code better. No functionality change. llvm-svn: 53526	2008-07-13 21:12:01 +00:00
Nick Lewycky	f76aa23b54	Enhance analysis of srem. Remove dead code analyzing urem. 'urem' of power-of-2 is canonicalized to an 'and' instruction. llvm-svn: 53506	2008-07-12 05:04:38 +00:00
Dan Gohman	3707f1daba	Use find instead of lower_bound. llvm-svn: 53474	2008-07-11 20:58:19 +00:00
Owen Anderson	8e462e9a82	Don't call lookupNumber more than we have to. llvm-svn: 53470	2008-07-11 20:05:13 +00:00
Nick Lewycky	45e127ab20	Document 'mask' in this calculation. llvm-svn: 53454	2008-07-11 08:16:26 +00:00
Nick Lewycky	da405e1155	Remove misleading constant from comment. llvm-svn: 53452	2008-07-11 07:36:19 +00:00
Nick Lewycky	f95b64acaa	Add another optimization from PR2330. Also catch some missing cases that are similar. llvm-svn: 53451	2008-07-11 07:20:53 +00:00
Chris Lattner	3994bed1a9	a missed optimization that Eli spotted llvm-svn: 53449	2008-07-11 06:40:29 +00:00
Chris Lattner	13a6911ea2	another bug in the same line. llvm-svn: 53448	2008-07-11 06:38:16 +00:00
Chris Lattner	de89b507dd	fix a bug spotted by Eli's eagle eyes llvm-svn: 53447	2008-07-11 06:36:01 +00:00
Chris Lattner	bd25b8507c	simplify and merge a bunch of code. Instead of comparing against the min/max values for an integer type, compare against the min/max values we can prove contain the input. This might be a tighter bound, so this is general goodness. llvm-svn: 53446	2008-07-11 05:40:05 +00:00
Chris Lattner	38a50c9528	fold away (x <= cst) earlier, allowing us to not have to handle them in some code. llvm-svn: 53445	2008-07-11 05:08:55 +00:00
Chris Lattner	6af608b8ce	Fix folding of icmp's of i1 where the comparison is signed. The code was using the algorithm for folding unsigned comparisons which is completely wrong. This has been broken since the signless types change. llvm-svn: 53444	2008-07-11 04:20:58 +00:00
Chris Lattner	4fa8bb3430	Fix a bogus optimization: folding (slt (zext i1 A to i32), 1) -> (slt i1 A, true) This caused a regression in InstCombine/JavaCompare, which was doing the right thing by accident. To handle the missed case, generalize the comparisons based on masked bits a little bit to handle comparisons against the max value. For example, we can now xform (slt i32 (and X, 4), 4) -> (setne i32 (and X, 4), 4) llvm-svn: 53443	2008-07-11 04:09:09 +00:00
Matthijs Kooijman	e0f3ab82c4	Restructure dead argument elimination, try #3 :-) Rewrite the DeadArgumentElimination pass, to use a more explicit tracking of dependencies between return values and/or arguments. Also make the handling of arguments and return values the same. The pass now looks properly inside returned structs, but only at the first level (ie, not inside nested structs). This version fixed a few more bugs and was cleaned up a bit. It now passes all of LLVM's testing, and should still pass SPEC2006. There is still a minor bug with regard to returning nested structs. Since there is currently nothing that emits such IR, I will fix that in a separate commit (partly because it requires a non-trivial fix). llvm-svn: 53400	2008-07-10 10:24:08 +00:00
Nick Lewycky	6193a564ab	Fix overzealous optimization. Thanks to Duncan Sands for pointing out my error! llvm-svn: 53393	2008-07-10 05:51:40 +00:00
Nick Lewycky	bb89c2a3f6	Simplify, suggested by Chris Lattner. llvm-svn: 53283	2008-07-09 07:35:26 +00:00
Nick Lewycky	f9c27c343a	Fold (a < 8) && (b < 8) into (a|b) < 8 for unsigned less or greater than. llvm-svn: 53282	2008-07-09 07:29:11 +00:00
Nick Lewycky	364661c43e	Fold ((1 << a) & 1) to (a == 0). llvm-svn: 53276	2008-07-09 05:20:13 +00:00
Nick Lewycky	0d3645e673	Reduce x - y to -y when we know the 'x' part will get masked off anyways. llvm-svn: 53271	2008-07-09 04:32:37 +00:00
Devang Patel	51cbf928ab	If loop induction variable's start value is less than its exit value then do not split the loop. llvm-svn: 53265	2008-07-09 00:12:01 +00:00
Chris Lattner	501d78fdc0	Fix PR2496, a really nasty bug which involved sinking volatile loads into phis. This is actually the same bug as PR2262 / 2008-04-29-VolatileLoadDontMerge.ll, but I missed checking the first predecessor for multiple successors. Testcase here: InstCombine/2008-07-08-VolatileLoadMerge.ll llvm-svn: 53240	2008-07-08 17:18:32 +00:00
Evan Cheng	03001cb820	Fix two serious LSR bugs. 1. LSR runOnLoop is always returning false regardless if any transformation is made. 2. AddUsersIfInteresting can create new instructions that are added to DeadInsts. But there is a later early exit which prevents them from being freed. llvm-svn: 53193	2008-07-07 19:51:32 +00:00
Dan Gohman	38740a98b2	Make DenseMap's insert return a pair, to more closely resemble std::map. llvm-svn: 53177	2008-07-07 17:46:23 +00:00
Nick Lewycky	9f1a4dc672	Fix missed optimization opportunity when analyzing cast of mul and select. llvm-svn: 53151	2008-07-05 21:19:34 +00:00
Owen Anderson	3ea90a7d55	Use information already present in the ValueTable to fast-fail when we know there won't be a value number match. This speeds up GVN on a case where there are very few redundancies by ~25%. llvm-svn: 53108	2008-07-03 17:44:33 +00:00
Devang Patel	eb611ddeb2	Do not try to update dominator info while manipulating CFG. This code does not handle all cases and keeps invalid dom info around in some cases, which misleads other passes down stream. Right now, dom info is recalculated in the end if the loop is switched. llvm-svn: 53106	2008-07-03 17:37:52 +00:00
Owen Anderson	d57cdc3c60	Remove the ability for ADCE to remove unreachable blocks in loop nests, because, as Eli pointed out, SimplifyCFG already does this. llvm-svn: 53104	2008-07-03 17:21:41 +00:00
Bill Wendling	a96eabaab7	Remove unused function. llvm-svn: 53090	2008-07-03 07:10:03 +00:00
Devang Patel	f94b9826b5	Preserve dom info. llvm-svn: 53089	2008-07-03 07:04:22 +00:00
Devang Patel	226edd1826	Remove extra FIXME llvm-svn: 53087	2008-07-03 06:50:04 +00:00
Devang Patel	c4dcf82a16	Reconstruct dom info, if loop is unswitched. llvm-svn: 53086	2008-07-03 06:48:21 +00:00
Devang Patel	e491bb8845	LoopUnswitch does not preserve dominator info in all cases. llvm-svn: 53085	2008-07-03 05:55:03 +00:00
Devang Patel	7dcfff392a	Undo previous patch. It is not that simple to fix dom info here. llvm-svn: 53062	2008-07-03 00:08:13 +00:00
Devang Patel	5adfcb5783	Preserve dom info while simplifying loop after the unswitch. llvm-svn: 53052	2008-07-02 22:58:54 +00:00
Owen Anderson	488b89f608	Use df_ext_iterator to capture the reachable set without allocating an extra set. Also, move large sets and vectors out of instance variables and onto the stack, and give them more reasonable sizes. llvm-svn: 53044	2008-07-02 18:41:09 +00:00
Owen Anderson	6acc782dad	Avoid a redundant call. llvm-svn: 53040	2008-07-02 18:15:31 +00:00
Owen Anderson	323b5755a6	Add support to ADCE for pruning unreachable blocks. This addresses the final part of PR2509. llvm-svn: 53038	2008-07-02 18:05:19 +00:00
Owen Anderson	9edcf24da9	Use DenseSet rather than SmallPtrSet for the alive set. Using SmallPtrSet with a huge "size" parameter is actually quite inefficient. llvm-svn: 53034	2008-07-02 17:32:04 +00:00
Owen Anderson	b22a640fe4	A better fix for PR2503 that doesn't pessimize GVN in the presence of unreachable blocks. llvm-svn: 53032	2008-07-02 17:20:16 +00:00
Devang Patel	ed50fb5b61	reuse vectors. llvm-svn: 53007	2008-07-02 01:44:29 +00:00
Devang Patel	57d94d6304	Fix comment. llvm-svn: 53006	2008-07-02 01:31:19 +00:00
Devang Patel	e149d4ed4d	Preserve loop data so that it is not fetched every time it is needed. Keep track of currentLoop. llvm-svn: 53005	2008-07-02 01:18:13 +00:00
Evan Cheng	da3db11db3	- Re-apply 52748 and friends with fix. GetConstantStringInfo() returns an empty string for ConstantAggregateZero case which surprises selectiondag. - Correctly handle memcpy from constant string which is zero-initialized. llvm-svn: 52891	2008-06-30 07:31:25 +00:00
Anton Korobeynikov	a7c583d584	Revert (52748 and friends): Move GetConstantStringInfo to lib/Analysis. Remove string output routine from Constant. Update all callers. Change debug intrinsic api slightly to accommodate move of routine, these now return values instead of strings. This unbreaks llvm-gcc bootstrap. llvm-svn: 52884	2008-06-29 17:57:03 +00:00
Eric Christopher	3f1c75c4d8	Remove unused function. llvm-svn: 52749	2008-06-26 01:19:35 +00:00
Eric Christopher	d0ab9c47e6	Move GetConstantStringInfo to lib/Analysis. Remove string output routine from Constant. Update all callers. Change debug intrinsic api slightly to accommodate move of routine, these now return values instead of strings. llvm-svn: 52748	2008-06-26 00:31:12 +00:00
Evan Cheng	88ca48b09d	Restore DeadArgElim back to 52570. It's breaking 447.dealII. llvm-svn: 52736	2008-06-25 18:10:09 +00:00
Duncan Sands	1b03c2ac98	Pacify gcc-4.3. llvm-svn: 52723	2008-06-25 16:31:18 +00:00
Matthijs Kooijman	2e2001d8b9	Fix a (false) warning on darwin. llvm-svn: 52705	2008-06-25 08:12:16 +00:00
Matthijs Kooijman	4e1cf1e7d7	Fix some cosmetics in comments. llvm-svn: 52704	2008-06-25 08:10:21 +00:00
Evan Cheng	5fd28b54c7	- Use O(1) check of basic block size limit. - Avoid speculatively executing vector ops. llvm-svn: 52703	2008-06-25 07:50:12 +00:00
Chris Lattner	c9c81fb0df	Fix PR2488, a case where we deleted stack restores too aggressively. llvm-svn: 52702	2008-06-25 05:59:28 +00:00
Dan Gohman	04c8bd7e11	Revert 52645, the loop unroller changes. It caused a regression in 252.eon. llvm-svn: 52688	2008-06-24 20:44:42 +00:00
Dan Gohman	4be44e62b3	Fix a typo in a comment. llvm-svn: 52687	2008-06-24 18:00:21 +00:00
Matthijs Kooijman	c702e1d32f	Commit the new DeadArgElim pass again, this time with the gcc bootstrap failures fixed. Also add a testcase to reproduce the gcc bootstrap failure in very much reduced form. llvm-svn: 52677	2008-06-24 16:30:26 +00:00
Matthijs Kooijman	19a6469e1b	Rename a few variables to be more consistent. llvm-svn: 52672	2008-06-24 09:14:10 +00:00
Dan Gohman	abd8f41c81	Use use_empty() instead of getNumUses(), avoiding a use list traversal. llvm-svn: 52651	2008-06-23 23:23:49 +00:00
Dan Gohman	ac563833ae	Fix spelling and grammar in a comment. llvm-svn: 52648	2008-06-23 22:11:52 +00:00
Dan Gohman	48c5c7e860	Revamp the loop unroller, extending it to correctly update PHI nodes in the presence of out-of-loop users of in-loop values and the trip count is not a known multiple of the unroll count, and to be a bit simpler overall. This fixes PR2253. llvm-svn: 52645	2008-06-23 21:29:41 +00:00
Evan Cheng	403e567043	Disable PRE. It's breaking bootstrapping. llvm-svn: 52643	2008-06-23 21:22:35 +00:00
Owen Anderson	54e02194a1	Tighten the conditions under which we do PRE, remove some unneeded code, and correct our preserved analyses list, since we do now change the CFG by splitting critical edges during PRE. llvm-svn: 52631	2008-06-23 17:49:45 +00:00
Chris Lattner	4d754bc97b	minor tidying of comments. llvm-svn: 52630	2008-06-23 17:11:23 +00:00
Owen Anderson	00fdbd01e5	At Chris' suggestion, move the liveness and worklist datastructures into instance variables so they can be allocated just once, and reuse the worklist as the dead list as well. llvm-svn: 52618	2008-06-23 06:13:12 +00:00
Dan Gohman	5ca5e02480	Improve LSR's dead-phi detection to handle use-def cycles with more than two nodes. llvm-svn: 52617	2008-06-22 20:44:02 +00:00
Dan Gohman	90071075e2	Use Loop::block_iterator. llvm-svn: 52616	2008-06-22 20:18:58 +00:00
Chris Lattner	6ff85681e4	Fix PR2369 by making scalarrepl more careful about promoting structures. Its default threshold is to promote things that are smaller than 128 bytes, which is sane. However, it is not sane to do this for things that turn into 128 registers. Add a cap on the number of registers introduced, defaulting to 128/4=32. llvm-svn: 52611	2008-06-22 17:46:21 +00:00
Eli Friedman	d3449df326	Fix for PR2479: correctly optimize expressions like (a > 13) & (a == 15). See also PR1800, which is about the signed case. llvm-svn: 52608	2008-06-21 23:36:13 +00:00
Dan Gohman	158ff2c4a9	Use Instruction::eraseFromParent(). llvm-svn: 52606	2008-06-21 22:08:46 +00:00
Chris Lattner	8459e0bc59	Fix warning when assertions disabled. llvm-svn: 52590	2008-06-21 19:49:01 +00:00
Evan Cheng	42bbca11cc	Enable PRE. llvm-svn: 52574	2008-06-21 07:26:53 +00:00
Evan Cheng	33067210d1	Back out Matthijs' DAE patches. It's miscompiling gcc driver. llvm-svn: 52570	2008-06-21 00:31:44 +00:00
Dan Gohman	3ada1e118b	Clean up a use of std::distance. llvm-svn: 52544	2008-06-20 17:11:32 +00:00
Dan Gohman	a5dd67f002	Tidy up some comments and use the getAggregateOperand and getInsertedValueOperand accessors. Thanks Matthijs! llvm-svn: 52543	2008-06-20 16:41:17 +00:00
Dan Gohman	b5210efb31	Fix the conditions under which SCCP should examine insertvalue instructions. Thanks to Matthijs Kooijman for pointing this out! llvm-svn: 52542	2008-06-20 16:39:44 +00:00
Matthijs Kooijman	c456f9dfc6	80 column and trailing whitespace fixes. llvm-svn: 52539	2008-06-20 15:34:07 +00:00
Matthijs Kooijman	0c50b953c5	Don't let DeadArgumentElimination attempt to update callers when the return type wasn't changed. llvm-svn: 52538	2008-06-20 15:25:43 +00:00
Matthijs Kooijman	9dc59b7666	Don't let DeadArgElimination change the return type ({} into void and {T} into T) when no return values are actually dead. llvm-svn: 52537	2008-06-20 15:16:45 +00:00
Matthijs Kooijman	013b6a9a42	Explicitly track if any arguments or return values were removed in DeadArgumentElimination and assert that the function type does not change if nothing was changed. This should catch subtle changes in function type that are not intended. llvm-svn: 52536	2008-06-20 14:28:52 +00:00
Matthijs Kooijman	e91aed6ce1	Remove debug output. llvm-svn: 52535	2008-06-20 14:03:35 +00:00
Matthijs Kooijman	8d32dee428	Recommit r52459, rewriting of the dead argument elimination pass. This is a fixed version that no longer uses multimap::equal_range, which resulted in a pointer invalidation problem. Also, DAE::InspectedFunctions was not really necessary, so it got removed. Lastly, this version no longer applies the extra arg hack on functions that did not have any arguments to start with. llvm-svn: 52532	2008-06-20 09:36:16 +00:00
Owen Anderson	78fbcafb53	Really disable PRE. llvm-svn: 52531	2008-06-20 08:59:13 +00:00
Chris Lattner	f3ecd2d290	Fix PR2471, which is a bug involving an invalid promotion from a conditional load. llvm-svn: 52525	2008-06-20 05:12:56 +00:00
Owen Anderson	1b3ea963f7	Change around the data structures used to store availability sets, resulting in a GVN+PRE that is faster than GVN alone was before. llvm-svn: 52521	2008-06-20 01:15:47 +00:00
Dan Gohman	041f9d03ff	Teach SCCP about insertvalue and extractvalue, and about propagating constants across aggregate return values when insertvalue and extractvalue are used. llvm-svn: 52520	2008-06-20 01:15:44 +00:00
Dan Gohman	3b18fd7b02	Teach InlineFunction how to differentiate between multiple-value return statements and aggregate returns so that it handles both correctly. llvm-svn: 52519	2008-06-20 01:03:44 +00:00
Evan Cheng	9598f930f3	Disable PRE for now. It seems to be breaking llvm-gcc bootstrapping. llvm-svn: 52518	2008-06-20 01:01:07 +00:00
Owen Anderson	e780d66657	Add a hidden -disable-pre flag for testing purposes. This should be removed once benchmarking is completed. llvm-svn: 52506	2008-06-19 19:57:25 +00:00
Owen Anderson	fdf9f168b5	PRE requires that critical edges be split. llvm-svn: 52505	2008-06-19 19:54:19 +00:00
Bill Wendling	cd6fb1d0a8	Remove dead code causing a warning. llvm-svn: 52502	2008-06-19 18:00:44 +00:00
Dan Gohman	d6530872f3	Use the common API for adding instructions to basic blocks instead of using BasicBlock::getInstList. llvm-svn: 52500	2008-06-19 17:53:32 +00:00
Owen Anderson	ff21db851d	Be sure to remove values from the value numbering table after we delete them. This fixes a failure on povray. llvm-svn: 52499	2008-06-19 17:53:26 +00:00
Dan Gohman	ed2250990a	Use Instruction::moveBefore instead of manipulating the instruction list directly. llvm-svn: 52498	2008-06-19 17:47:47 +00:00
Dan Gohman	9eea470fcf	Avoid using BasicBlock::getInstList directly in a few places. llvm-svn: 52497	2008-06-19 17:37:25 +00:00
Owen Anderson	45d3701fce	Revert support for insertvalue and extractvalue instructions for the moment. GVN expects that all inputs to an instruction fall somewhere in the value hierarchy, which isn't true for these. llvm-svn: 52496	2008-06-19 17:25:39 +00:00
Dan Gohman	68f539e807	Delete dead code. llvm-svn: 52494	2008-06-19 17:18:39 +00:00
Matthijs Kooijman	0c71732497	Use a CallSite to find the nth argument of a call/invoke instruction instead of using getOperand() directly. This makes things work with invoke instructions as well. llvm-svn: 52489	2008-06-19 08:53:24 +00:00
Owen Anderson	3ea800fbad	Add support for extractvalue and insertvalue instructions in GVN. llvm-svn: 52472	2008-06-18 21:59:00 +00:00
Owen Anderson	6a903bc601	Add local PRE to GVN. This only operates in cases where it would not increase code size, namely when the instantiated expression would only need to be created in one predecessor. llvm-svn: 52471	2008-06-18 21:41:49 +00:00
Chris Lattner	78119b4742	Fix the regressions on sext-misc.ll my patch yesterday caused. llvm-svn: 52466	2008-06-18 18:11:55 +00:00
Owen Anderson	9094cc957e	Revert r52459, which was causing an infinite loop or massive slowdown on MultiSource/Applications/SPASS, and possibly others as well. Please reapply once this is fixed. llvm-svn: 52465	2008-06-18 17:32:16 +00:00
Dan Gohman	be928e3b21	Move LSR's private isZero function to a public SCEV member function, and make use of it in several places. llvm-svn: 52463	2008-06-18 16:23:07 +00:00
Matthijs Kooijman	964557fdf5	Rewrite the DeadArgumentElimination pass, to use a more explicit tracking of dependencies between return values and/or arguments. Also make the handling of arguments and return values the same. The pass now looks properly inside returned structs, but only at the first level (ie, not inside nested structs). Also add a testcase for testing various variations of (multiple) dead return values. llvm-svn: 52459	2008-06-18 11:12:53 +00:00
Matthijs Kooijman	fd17357643	Reapply r52397 (make IPConstProp promote returned arguments), but fixed this time. Sorry for the trouble! This time, also add a testcase, which I should have done in the first place... llvm-svn: 52455	2008-06-18 08:30:37 +00:00
Matthijs Kooijman	97034598b1	Reapply r52396, it was unrelated to the breakage (that was caused by r52397, my commit after this). llvm-svn: 52453	2008-06-18 08:09:27 +00:00
Chris Lattner	ef36dcd10b	implement some simple bswap optimizations, rdar://5992453 llvm-svn: 52442	2008-06-18 04:33:20 +00:00
Chris Lattner	b5ee8b3e89	make truncate/sext elimination capable of changing phi's. This implements rdar://6013816 and the testcase in Transforms/InstCombine/sext-misc.ll. llvm-svn: 52440	2008-06-18 04:00:49 +00:00
Devang Patel	cd6b697945	Preserve dominance frontier while trivially unswitching loop. llvm-svn: 52438	2008-06-18 02:16:38 +00:00
Owen Anderson	75f3732b23	We don't want to find dependencies within the same block in this case. It leads to incorrect results because we're detecting something at or after the call we're querying on. llvm-svn: 52433	2008-06-17 22:27:06 +00:00
Chris Lattner	aecc3750d1	revert recent patch which is causing widespread breakage. llvm-svn: 52415	2008-06-17 17:06:43 +00:00
Duncan Sands	4b50fde2c4	Fix typo that changed the logic to something wrong. Spotted by Nick Lewycky. llvm-svn: 52411	2008-06-17 15:55:30 +00:00
Matthijs Kooijman	332836d68d	Teach IPConstProp to propagate arguments that are directly returned. Strictly speaking these are not constant values. However, when a function always returns one of its arguments, then from the point of view of each caller the return value is constant (or at least a known value) and can be replaced. llvm-svn: 52397	2008-06-17 12:20:24 +00:00
Matthijs Kooijman	f03c1ae407	Teach IPConstProp to look at individual return values and propagate them individually. Also teach IPConstProp how returning first class aggregates works, in addition to old style multiple return instructions. Modify the return-constants testcase to confirm this behaviour. llvm-svn: 52396	2008-06-17 12:02:52 +00:00
Dan Gohman	ab0dccba6b	Refine the change in r52258 for avoiding use-before-def conditions when changing the stride of a comparison so that it's slightly more precise, by having it scan the instruction list to determine if there is a use of the condition after the point where the condition will be inserted. llvm-svn: 52371	2008-06-16 22:34:15 +00:00
Evan Cheng	319e9a4f63	Switch over to SetVector to ensure the order of iteration does not vary across runs. llvm-svn: 52361	2008-06-16 21:08:17 +00:00
Evan Cheng	a72cdcd1a2	Iterating over SmallPtrSet is not deterministic. llvm-svn: 52339	2008-06-16 18:17:09 +00:00
Matthijs Kooijman	86cda9e050	Pass around Instruction* instead of Instruction& in FindInsertedValue and friends. llvm-svn: 52318	2008-06-16 13:13:08 +00:00
Matthijs Kooijman	5cb387735d	80 column fixes. llvm-svn: 52316	2008-06-16 12:57:37 +00:00
Matthijs Kooijman	e92e18be5a	Move FindScalarValue from InstructionCombining.cpp to ValueTracking.cpp. While I'm at it, rename it to FindInsertedValue. The only functional change is that newly created instructions are no longer added to instcombine's worklist, but that is not really necessary anyway (and I'll commit some improvements next that will completely remove the need). llvm-svn: 52315	2008-06-16 12:48:21 +00:00
Chris Lattner	1c9922703f	Fix the crash on SimplifyLibCalls/2005-05-20-sprintf-crash.ll llvm-svn: 52295	2008-06-16 04:10:21 +00:00
Chris Lattner	a88cd4ea2a	Fix a case where tailcallelim wouldn't set the changed bit when it made a change. llvm-svn: 52267	2008-06-14 00:49:48 +00:00
Eli Friedman	5de0a77a9b	Don't skip over instructions other than loads that might read memory when trying to sink stores. llvm-svn: 52259	2008-06-13 22:02:12 +00:00
Dan Gohman	9ad8c54aab	Protect ChangeCompareStride from situations in which it is possible for it to generate use-before-def IR, such as in this testcase. llvm-svn: 52258	2008-06-13 21:43:41 +00:00
Eli Friedman	9833a1b407	Make sure SimplifyStoreAtEndOfBlock doesn't mess with loops; the structure checks are incorrect if the blocks aren't distinct. Fixes PR2435. llvm-svn: 52257	2008-06-13 21:17:49 +00:00
Wojciech Matyjewicz	25a7f5de92	Use recently added getTruncateOrZeroExtend method to make the code shorter. llvm-svn: 52251	2008-06-13 17:02:03 +00:00
Gabor Greif	431e9560b7	fix a minor deviation from the original in my previous commit llvm-svn: 52247	2008-06-12 21:51:29 +00:00
Gabor Greif	f6d8e77027	op_iterator-ify some loops, low hanging fruit only, there is more llvm-svn: 52246	2008-06-12 21:37:33 +00:00
Evan Cheng	89553cc42e	Do not speculatively execute an instruction by hoisting it to its predecessor BB if any of its operands are defined but not used in BB. The transformation will prevent the operand from being sunk into the use block. llvm-svn: 52244	2008-06-12 21:15:59 +00:00
Evan Cheng	70fe16353a	Revert 52223. llvm-svn: 52243	2008-06-12 20:55:39 +00:00
Owen Anderson	accdca1b03	Switch GVN to use ScopedHashTable. llvm-svn: 52242	2008-06-12 19:25:32 +00:00
Gabor Greif	0babc61631	op_iterator-ify some loops, fix 80col violations llvm-svn: 52226	2008-06-11 21:38:51 +00:00
Evan Cheng	933c743042	For now, avoid generating FP select instructions in order to speculatively execute integer arithmetic instructions. FP selects are more likely to be expensive (even compared to branch on fcmp). This is not a wonderful solution but I rather err on the side of conservative. This fixes the heapsort performance regressions. llvm-svn: 52224	2008-06-11 19:18:20 +00:00
Evan Cheng	f3c2902ead	Avoid duplicating loop header which leads to unnatural loops (and just seem like general badness to me, likely to cause code explosion). Patch by Florian Brandner. llvm-svn: 52223	2008-06-11 19:07:54 +00:00
Matthijs Kooijman	b2fc72bfbf	Teach instruction combining about the extractvalue. It can successfully fold useless insert-extract chains, similar to how it folds them for vectors. Add a testcase for this. llvm-svn: 52217	2008-06-11 14:05:05 +00:00
Matthijs Kooijman	3453c7bcb5	Clarify a comment. llvm-svn: 52212	2008-06-11 09:00:12 +00:00
Gabor Greif	945f2f7fed	op_iterator-ify loops llvm-svn: 52191	2008-06-10 22:03:26 +00:00
Chris Lattner	9c9f531a47	lower calls to abs to inline code, PR2337 llvm-svn: 52138	2008-06-09 08:26:51 +00:00
Chris Lattner	dbd595f22d	Fix PR2411, where ip constant prop would propagate the result of a weak function. llvm-svn: 52137	2008-06-09 07:58:07 +00:00
Duncan Sands	11dd424539	Remove comparison methods for MVT. The main cause of apint codegen failure is the DAG combiner doing the wrong thing because it was comparing MVT's using < rather than comparing the number of bits. Removing the < method makes this mistake impossible to commit. Instead, add helper methods for comparing bits and use them. llvm-svn: 52098	2008-06-08 20:54:56 +00:00
Chris Lattner	b4866ef30c	Limit the icmp+phi merging optimization to the cases where it is profitable: don't make i1 phis when it won't be possible to eliminate them. llvm-svn: 52097	2008-06-08 20:52:11 +00:00
Evan Cheng	89200c9177	Speculatively execute a block when the block is the then part of a triangle shape and it contains a single, side effect free, cheap instruction. The branch is eliminated by adding a select instruction. i.e. Turn BB: %t1 = icmp br i1 %t1, label %BB1, label %BB2 BB1: %t3 = add %t2, c br label BB2 BB2: => BB: %t1 = icmp %t4 = add %t2, c %t3 = select i1 %t1, %t2, %t3 llvm-svn: 52073	2008-06-07 08:52:29 +00:00
Devang Patel	8549e4ca07	LoopSimplify preserves AA. llvm-svn: 52053	2008-06-06 17:50:58 +00:00
Duncan Sands	13237ac3b9	Wrap MVT::ValueType in a struct to get type safety and better control the abstraction. Rename the type to MVT. To update out-of-tree patches, the main thing to do is to rename MVT::ValueType to MVT, and rewrite expressions like MVT::getSizeInBits(VT) in the form VT.getSizeInBits(). Use VT.getSimpleVT() to extract a MVT::SimpleValueType for use in switch statements (you will get an assert failure if VT is an extended value type - these shouldn't exist after type legalization). This results in a small speedup of codegen and no new testsuite failures (x86-64 linux). llvm-svn: 52044	2008-06-06 12:08:01 +00:00
Zhou Sheng	1152ca9101	As Chris suggested, handle the situation if ShAmt larger than BitWidth, otherwise, opt might crash. llvm-svn: 52041	2008-06-06 08:32:05 +00:00
Zhou Sheng	fbe1dc240c	If BitWidth equals to ShtAmt, the RHSKnownZero[BitWidth-ShiftAmt-1] will crash the opt. Just fix this. Test case in llvm/test/Transforms/InstCombine/2008-06-05-ashr-crash.ll llvm-svn: 52003	2008-06-05 14:23:44 +00:00
Matthijs Kooijman	812989b147	Teach ScalarReplAggregates how stores and loads of first class aggregates work and how to replace them with individual values. Also, when trying to replace an aggregate that is used by load or store with a single (large) integer, don't crash (but don't replace the aggregate either). Also adds a testcase for both structs and arrays. llvm-svn: 51997	2008-06-05 12:51:53 +00:00
Matthijs Kooijman	e0c5adc158	Let StructRetPromotion check if all of its users are really calls or invokes, not other instructions. This fixes a crash with the added testcase. llvm-svn: 51992	2008-06-05 08:57:20 +00:00
Matthijs Kooijman	463f86639d	Let StructRetPromotion check if its users are really calling it and not passing its pointer. Fixes test with added testcase. llvm-svn: 51991	2008-06-05 08:48:32 +00:00
Matthijs Kooijman	230d6fbfeb	Use use_iterator::getOperandNo instead of CallSite::hasArgument to check if a function is passed as an argument instead of called. Also do this check a bit earlier. llvm-svn: 51990	2008-06-05 08:34:25 +00:00
Matthijs Kooijman	5afc2740b7	Update comments and documentation to reflect that GCSE and ValueNumbering are deprecated by the GVN and GVNPRE passes. llvm-svn: 51983	2008-06-05 07:55:49 +00:00
Owen Anderson	61c7f2a633	Remove unneeded #include. llvm-svn: 51955	2008-06-04 18:28:10 +00:00
Matthijs Kooijman	2353f35989	Replace two manual loops with calls to CallSite::hasArgument (no functional changes). llvm-svn: 51947	2008-06-04 16:57:50 +00:00
Duncan Sands	fc3c489b52	Change packed struct layout so that field sizes are the same as in unpacked structs, only field positions differ. This only matters for structs containing x86 long double or an apint; it may cause backwards compatibility problems if someone has bitcode containing a packed struct with a field of one of those types. The issue is that only 10 bytes are needed to hold an x86 long double: the store size is 10 bytes, but the ABI size is 12 or 16 bytes (linux/ darwin) which comes from rounding the store size up by the alignment. Because it seemed silly not to pack an x86 long double into 10 bytes in a packed struct, this is what was done. I now think this was a mistake. Reserving the ABI size for an x86 long double field even in a packed struct makes things more uniform: the ABI size is now always used when reserving space for a type. This means that developers are less likely to make mistakes. It also makes life easier for the CBE which otherwise could not represent all LLVM packed structs (PR2402). Front-end people might need to adjust the way they create LLVM structs - see following change to llvm-gcc. llvm-svn: 51928	2008-06-04 08:21:45 +00:00
Owen Anderson	2df82e7cec	LoopIndexSplit can sometimes result in cases where a block is in its own domfrontier. Don't crash when we encounter one of these. llvm-svn: 51915	2008-06-03 18:29:48 +00:00
Dan Gohman	2ad7e7341c	Fix whitespace in whitespace-significant pseudocode in a comment. llvm-svn: 51890	2008-06-03 00:57:21 +00:00
Devang Patel	7314d0ee3c	Update dom tree. Fix PR 2372. llvm-svn: 51887	2008-06-02 22:52:56 +00:00
Chris Lattner	a12a6de683	move CannotBeNegativeZero to ValueTracking. Simplify some signbit comparisons. llvm-svn: 51864	2008-06-02 01:29:46 +00:00
Chris Lattner	965c769b3c	move ComputeMaskedBits, MaskedValueIsZero, and ComputeNumSignBits out of instcombine into a new file in libanalysis. This also teaches ComputeNumSignBits about the number of sign bits in a constantint. llvm-svn: 51863	2008-06-02 01:18:21 +00:00
Owen Anderson	38099c1b6e	Fix two issues that Eli Friedman pointed out, where we would misoptimize code like: char a[200]; init(a, a+200); OR int a[200]; char* b = (char)a; char c = (char*)a; foo(b, c); llvm-svn: 51850	2008-06-01 22:26:26 +00:00
Owen Anderson	d071a8708e	Don't remove the memcpy when call slot substitution fails. llvm-svn: 51848	2008-06-01 21:52:16 +00:00
Duncan Sands	0397cd2ec4	When simplifying a call to a bitcast function, tighten up the conditions for performing the transform when only the function declaration is available: no longer allow turning i32 into i64 for example. Only allow changing between pointer types, and between pointer types and integers of the same size. For return values ptr -> intptr was already allowed; I added ptr -> ptr and intptr -> ptr while there. As shown by a recent objc testcase, changing the way parameters/return values are passed can be fatal when calling code written in assembler that directly manipulates call arguments and return values unless the transform has no impact on the way they are passed at the codegen level. While it is possible to imagine an ABI that treats integers of pointer size differently to pointers, I don't think LLVM supports any so the transform should now be safe while still being useful. llvm-svn: 51834	2008-06-01 07:38:42 +00:00
Nick Lewycky	035fe6f716	Peer through sext/zext when looking for not(cmp). llvm-svn: 51819	2008-05-31 19:01:33 +00:00
Nick Lewycky	26b8cd84b3	Add more i1 optimizations. add, sub, mul, s/udiv on i1 are now simplified away. llvm-svn: 51817	2008-05-31 17:59:52 +00:00
Nick Lewycky	df9242a833	Adding i1 is always Xor. llvm-svn: 51816	2008-05-31 17:10:28 +00:00
Gabor Greif	5df4326d78	rewrite operand loops to use iterators llvm-svn: 51789	2008-05-30 21:24:22 +00:00
Owen Anderson	1f59d9937f	Since LCSSA switched over to DenseMap, we have to be more careful to avoid iterator invalidation. Fixes PR2385. llvm-svn: 51777	2008-05-30 17:31:01 +00:00
Matthijs Kooijman	57da7d2308	Use eraseFromParent() instead of doing that manually in two places. llvm-svn: 51770	2008-05-30 12:35:46 +00:00
Dan Gohman	86ff8536f9	const-ify getOpcode. llvm-svn: 51698	2008-05-29 19:53:46 +00:00
Duncan Sands	9e064a2180	Add a newline at the end of this file. llvm-svn: 51680	2008-05-29 14:38:23 +00:00
Owen Anderson	7686b555e2	Replace the old ADCE implementation with a new one that more simply solves the one case that ADCE catches that normal DCE doesn't: non-induction variable loop computations. This implementation handles this problem without using postdominators. llvm-svn: 51668	2008-05-29 08:45:13 +00:00
Owen Anderson	f4aece5976	Remove debugging code. llvm-svn: 51666	2008-05-29 08:15:48 +00:00
Gabor Greif	3a9fba5a72	convert more operand loops to iterator formulation llvm-svn: 51663	2008-05-29 01:59:18 +00:00
Chris Lattner	ecdefb5df7	Implement PR2370: memmove(x,x,size) -> noop. llvm-svn: 51636	2008-05-28 05:30:41 +00:00
Duncan Sands	698348dfac	Fix some constructs that gcc-4.4 warns about. llvm-svn: 51591	2008-05-27 11:50:51 +00:00
Nick Lewycky	3ebe82b57a	InequalityGraph::node() can create new nodes, invalidating iterators across the set of nodes. Fix makeEqual to handle this by creating the new node first then iterating across them second. llvm-svn: 51573	2008-05-27 00:59:05 +00:00
Nick Lewycky	6be65d2a84	Grammaro. llvm-svn: 51572	2008-05-26 22:49:36 +00:00
Duncan Sands	dd7daee850	Factor code to copy global value attributes like the section or the visibility from one global value to another: copyAttributesFrom. This is particularly useful for duplicating functions: previously this was done by explicitly copying each attribute in turn at each place where a new function was created out of an old one, with the result that obscure attributes were regularly forgotten (like the collector or the section). Hopefully now everything is uniform and nothing is forgotten. llvm-svn: 51567	2008-05-26 19:58:59 +00:00
Owen Anderson	d3f21d165f	Use a DenseMap instead of an std::map, speeding up the testcase in PR2368 by about a third. llvm-svn: 51565	2008-05-26 10:07:43 +00:00
Nick Lewycky	f6ccd2580c	"ret (constexpr)" can't be folded into a Constant. Add a method to Analysis/ConstantFolding to fold ConstantExpr's, then make instcombine use it to try to use targetdata to fold constant expressions on void instructions. Also extend the icmp(inttoptr, inttoptr) folding to handle the case where int size != ptr size. llvm-svn: 51559	2008-05-25 20:56:15 +00:00
Chris Lattner	87a099a057	Fix a serious brain-o. Obviously no-one reviewed my patch :( This fixes PR2359 llvm-svn: 51536	2008-05-24 04:06:28 +00:00
Chris Lattner	5c207c83c6	Fix PR2358 by resolving calls with undef arguments to overdefined. llvm-svn: 51535	2008-05-24 03:59:33 +00:00
Evan Cheng	02912418f1	Remove x86.sse2.loadh.pd and x86.sse2.loadl.pd. These will be lowered into load and shuffle instructions. llvm-svn: 51521	2008-05-24 00:07:06 +00:00
Dan Gohman	f96e1371e8	Tidy up BasicBlock::getFirstNonPHI, and change a bunch of places to use it instead of duplicating its functionality. llvm-svn: 51499	2008-05-23 21:05:58 +00:00
Matthijs Kooijman	f52b23c0eb	Replace some weird usage of UserOp1 introduced in r49492 by a plain if. llvm-svn: 51482	2008-05-23 16:17:48 +00:00
Matthijs Kooijman	aef2b8198b	Restructure a part of the SimplifyCFG pass and include a testcase. The SimplifyCFG pass looks at basic blocks that contain only phi nodes, followed by an unconditional branch. In a lot of cases, such a block (BB) can be merged into its successor (Succ). This merging is performed by TryToSimplifyUncondBranchFromEmptyBlock. It does this by taking all phi nodes in the successor block Succ and expanding them to include the predecessors of BB. Furthermore, any phi nodes in BB are moved to Succ and expanded to include the predecessors of Succ as well. Before attempting this merge, CanPropagatePredecessorsForPHIs checks to see if all phi nodes can be properly merged. All functional changes are made to this function, only comments were updated in TryToSimplifyUncondBranchFromEmptyBlock. In the original code, CanPropagatePredecessorsForPHIs looks quite convoluted and more like a stack of checks added to handle different kinds of situations than a comprehensive check. In particular the first check in the function did some value checking for the case that BB and Succ have a common predecessor, while the last check in the function simply rejected all cases where BB and Succ have a common predecessor. The first check was still useful in the case that BB did not contain any phi nodes at all, though, so it was not completely useless. Now, CanPropagatePredecessorsForPHIs is restructured to look a lot more similar to the code that actually performs the merge. Both functions now look at the same phi nodes in about the same order. Any conflicts (phi nodes with different values for the same source) that could arise from merging or moving phi nodes are detected. If no conflicts are found, the merge can happen. Apart from only restructuring the checks, two main changes in functionality happened. Firstly, the old code rejected blocks with common predecessors in most cases.
The new code performs some extra checks so common predecessors can be handled in a lot of cases. Wherever common predecessors still pose problems, the blocks are left untouched. Secondly, the old code rejected the merge when values (phi nodes) from BB were used in any other place than Succ. However, it does not seem that there is any situation that would require this check. Even more, this can be proven. Consider that BB is a block consisting of a single phi node "%a" and a branch to Succ. Now, since the definition of %a will dominate all of its uses, BB will dominate all blocks that use %a. Furthermore, since the branch from BB to Succ is unconditional, Succ will also dominate all uses of %a. Now, assume that one predecessor X of Succ is not dominated by BB (and thus not dominated by Succ). Since at least one use of %a (but in reality all of them) is reachable from Succ, you could end up at a use of %a without passing through its definition in BB (by coming from X through Succ). This is a contradiction, meaning that our original assumption is wrong. Thus, all predecessors of Succ must also be dominated by BB (and thus also by Succ). This means that moving the phi node %a from BB to Succ does not pose any problems when the two blocks are merged, and any use checks are not needed. llvm-svn: 51478	2008-05-23 09:09:41 +00:00
Matthijs Kooijman	f399bbf980	Indent fix. llvm-svn: 51477	2008-05-23 07:57:02 +00:00
Nick Lewycky	3bf5512d87	Constant integer vectors may also be negated. llvm-svn: 51476	2008-05-23 04:54:45 +00:00
Nick Lewycky	8f3127c5b5	Typo. llvm-svn: 51475	2008-05-23 04:39:38 +00:00
Nick Lewycky	4f3d878507	Revert X + X --> X * 2 optz'n which pessimizes heavily on x86. llvm-svn: 51474	2008-05-23 04:34:58 +00:00
Nick Lewycky	452fb32927	Implement X + X for vectors. llvm-svn: 51472	2008-05-23 04:14:51 +00:00
Nick Lewycky	2ec9a01173	Fix a recently added optimization to not crash on vectors. llvm-svn: 51471	2008-05-23 03:26:47 +00:00
Dan Gohman	6d5f120c5c	Generalize the new code in instcombine's ComputeNumSignBits for handling and/or to handle more cases (such as this add-sitofp.ll testcase), and port it to selectiondag's ComputeNumSignBits. llvm-svn: 51469	2008-05-23 02:28:01 +00:00
Dan Gohman	53b2698531	Use isSingleValueType instead of isFirstClassType to exclude struct and array types. llvm-svn: 51467	2008-05-23 01:52:21 +00:00
Dale Johannesen	fecb88249f	Allow for switch with no cases. Was causing fault in gcc.dg/pr27531-1.c. llvm-svn: 51464	2008-05-23 01:01:31 +00:00
Dan Gohman	30ab45d01e	Use isSingleValueType instead of isFirstClassType to exclude struct and array types. llvm-svn: 51459	2008-05-23 00:17:26 +00:00
Dan Gohman	7a0566b9cd	Use isSingleValueType instead of isFirstClassType to exclude struct and array types. llvm-svn: 51456	2008-05-23 00:12:03 +00:00
Chris Lattner	c5ec1e19eb	rewrite the validity checking for memory promotion to be simpler, more aggressive, and more correct. Verify that we only attempt to promote loads and stores. llvm-svn: 51406	2008-05-22 03:22:42 +00:00
Chris Lattner	f12c08dcd8	Use 'continue' to reduce nesting in this loop. No functionality change. llvm-svn: 51399	2008-05-22 00:53:38 +00:00
Dan Gohman	e62632e0bb	When LSR is replacing an instruction, call ScalarEvolution::deleteValueFromRecords on it before doing the replaceAllUsesWith, because ScalarEvolution looks at the instruction's users to find SCEV references to the instruction's SCEV object in its internal maps. Move all of LSR's loop-related state clearing after processing the loop and before cleaning up dead PHI nodes. This eliminates all of LSR's SCEV references just before the calls to ScalarEvolution::deleteValueFromRecords so that when ScalarEvolution drops its own SCEV references, the reference counts will reach zero and the SCEVs will be deleted immediately. These changes fix some compiler aborts involving ScalarEvolution holding onto and reusing SCEV objects for instructions that have been deleted. No regression test unfortunately; because the symptoms were due to dangling pointers, reduced testcases ended up being fairly arbitrary. llvm-svn: 51359	2008-05-21 00:54:12 +00:00
Dan Gohman	81ab753b14	Port SelectionDAG's ComputeNumSignBits-using code to instcombine, now that instcombine also has ComputeNumSignBits. llvm-svn: 51350	2008-05-20 21:01:12 +00:00
Matthijs Kooijman	5148a4ba66	Fix typo. llvm-svn: 51303	2008-05-20 07:26:45 +00:00
Chris Lattner	7ac943fffd	Teach instcombine 4 new xforms: (add (sext x), cst) --> (sext (add x, cst')) (add (sext x), (sext y)) --> (sext (add int x, y)) (add double (sitofp x), fpcst) --> (sitofp (add int x, intcst)) (add double (sitofp x), (sitofp y)) --> (sitofp (add int x, y)) This generally reduces conversions. For example MiBench/telecomm-gsm gets these simplifications: HACK2: %tmp67.i142.i.i = sext i16 %tmp6.i141.i.i to i32 ; <i32> [#uses=1] %tmp23.i139.i.i = sext i16 %tmp2.i138.i.i to i32 ; <i32> [#uses=1] %tmp8.i143.i.i = add i32 %tmp67.i142.i.i, %tmp23.i139.i.i ; <i32> [#uses=3] HACK2: %tmp67.i121.i.i = sext i16 %tmp6.i120.i.i to i32 ; <i32> [#uses=1] %tmp23.i118.i.i = sext i16 %tmp2.i117.i.i to i32 ; <i32> [#uses=1] %tmp8.i122.i.i = add i32 %tmp67.i121.i.i, %tmp23.i118.i.i ; <i32> [#uses=3] HACK2: %tmp67.i.i190.i = sext i16 %tmp6.i.i189.i to i32 ; <i32> [#uses=1] %tmp23.i.i187.i = sext i16 %tmp2.i.i186.i to i32 ; <i32> [#uses=1] %tmp8.i.i191.i = add i32 %tmp67.i.i190.i, %tmp23.i.i187.i ; <i32> [#uses=3] HACK2: %tmp67.i173.i.i.i = sext i16 %tmp6.i172.i.i.i to i32 ; <i32> [#uses=1] %tmp23.i170.i.i.i = sext i16 %tmp2.i169.i.i.i to i32 ; <i32> [#uses=1] %tmp8.i174.i.i.i = add i32 %tmp67.i173.i.i.i, %tmp23.i170.i.i.i ; <i32> [#uses=3] HACK2: %tmp67.i152.i.i.i = sext i16 %tmp6.i151.i.i.i to i32 ; <i32> [#uses=1] %tmp23.i149.i.i.i = sext i16 %tmp2.i148.i.i.i to i32 ; <i32> [#uses=1] %tmp8.i153.i.i.i = add i32 %tmp67.i152.i.i.i, %tmp23.i149.i.i.i ; <i32> [#uses=3] HACK2: %tmp67.i.i.i.i = sext i16 %tmp6.i.i.i.i to i32 ; <i32> [#uses=1] %tmp23.i.i5.i.i = sext i16 %tmp2.i.i.i.i to i32 ; <i32> [#uses=1] %tmp8.i.i7.i.i = add i32 %tmp67.i.i.i.i, %tmp23.i.i5.i.i ; <i32> [#uses=3] This also fixes a bug in ComputeNumSignBits handling select and makes it more aggressive with and/or. llvm-svn: 51302	2008-05-20 05:46:13 +00:00
Chris Lattner	9c27f96d04	fix two issues Neil noticed, thanks! llvm-svn: 51296	2008-05-20 03:50:52 +00:00
Dan Gohman	e5572706e8	Refine the fix in r51169 to only apply when the operand val being replaced is a PHI. This prevents it from inserting uses before defs in the case that it isn't a PHI and it depends on other instructions later in the block. This fixes the 447.dealII regression on x86-64. llvm-svn: 51292	2008-05-20 03:01:48 +00:00
Dan Gohman	d717761a2b	Make AssociativeOpt static. llvm-svn: 51290	2008-05-20 01:14:05 +00:00
Devang Patel	ee7bf41c06	Do not erase induction variable increment if it is used outside the loop. llvm-svn: 51280	2008-05-19 22:23:55 +00:00
Dan Gohman	123438cc05	Add a ComputeNumSignBits function for use by instcombine, based on the code in SelectionDAG. llvm-svn: 51279	2008-05-19 22:14:15 +00:00
Chris Lattner	b42712288e	switch to Type::getFPMantissaWidth instead of reinventing it. llvm-svn: 51275	2008-05-19 21:17:23 +00:00
Chris Lattner	ba9acbe6dc	minor cleanups, teach instcombine that sitofp/uitofp cannot produce a negative zero. llvm-svn: 51272	2008-05-19 20:27:56 +00:00
Chris Lattner	e35fe0f1c6	convert fptosi(sitofp x) -> x if the fp value has enough bits in its mantissa to accurately represent the integer. This triggers 9 times in 471.omnetpp, though 8 of those seem to be inlined from the same place. llvm-svn: 51271	2008-05-19 20:25:04 +00:00
Chris Lattner	5920a78034	Fold FP comparisons where one operand is converted from an integer type and the other operand is a constant into integer comparisons. This happens surprisingly frequently (e.g. 10 times in 471.omnetpp), which are things like this: %tmp8283 = sitofp i32 %tmp82 to double %tmp1013 = fcmp ult double %tmp8283, 0.0 Clearly comparing tmp82 against i32 0 is cheaper here. this also triggers 8 times in gobmk, including this one: %tmp375376 = sitofp i32 %tmp375 to double %tmp377 = fcmp ogt double %tmp375376, 8.150000e+01 which is comparing an integer against 81.5 :). llvm-svn: 51268	2008-05-19 20:18:56 +00:00
Chris Lattner	6e70830af9	remove debug output llvm-svn: 51264	2008-05-19 20:03:53 +00:00
Chris Lattner	fc365b60dc	be more aggressive about transforming add -> or when the operands have no intersecting bits. This triggers all over the place, for example in lencode, with adds of stuff like: %tmp580 = mul i32 %tmp579, 2 %tmp582 = and i32 %b8, 1 and %tmp28 = shl i32 %abs.i, 1 %sign.0 = select i1 %tmp23, i32 1, i32 0 and %tmp344 = shl i32 %tmp343, 2 %tmp346 = and i32 %tmp96, 3 etc. llvm-svn: 51263	2008-05-19 20:01:56 +00:00
Duncan Sands	eec7a3c071	Fix PR2341 - when the length is 4 use an i32 not an i16! Cleaned up trailing whitespace while there. llvm-svn: 51240	2008-05-19 09:27:24 +00:00
Nate Begeman	65720c968c	Teach GVN to not assert on vector comparisons llvm-svn: 51230	2008-05-18 19:49:05 +00:00
Chris Lattner	4b2a724fb8	Fix PR2339 llvm-svn: 51226	2008-05-18 04:11:26 +00:00
Nick Lewycky	79376f4e02	Move isTrueWhenEqual to ICmpInst. llvm-svn: 51215	2008-05-17 07:33:39 +00:00
Dale Johannesen	5610dabac9	Less conservative version of previous patch, suggested by Duncan. llvm-svn: 51211	2008-05-16 23:18:52 +00:00
Dale Johannesen	e7f5bc2c3b	Weak functions not declared non-throwing might be replaced at linktime with a body that throws, even if the body in this file does not. Make PruneEH be more conservative in this case. g++.dg/eh/weak1.C llvm-svn: 51207	2008-05-16 21:31:48 +00:00
Gabor Greif	e1f6e4b21d	API change for {BinaryOperator|CmpInst|CastInst}::create*() --> Create. Legacy interfaces will be in place for some time. (Merge from use-diet branch.) llvm-svn: 51200	2008-05-16 19:29:10 +00:00
Duncan Sands	67933e6692	Bill pointed out that system headers should be included after local headers. llvm-svn: 51187	2008-05-16 09:30:00 +00:00

5144 Commits