llvm-project

Author	SHA1	Message	Date
Dale Johannesen	2b3389a626	Revert previous change; even this mild and clearly more accurate change loses more than it gains on benchmarks. llvm-svn: 62938	2009-01-24 21:49:34 +00:00
Dale Johannesen	899ecdbbba	Improve the inlining cost function a bit. Little practical effect. llvm-svn: 62908	2009-01-24 01:27:33 +00:00
Gabor Greif	eb61fcf2a1	Simplify the logic of getting hold of a PHI predecessor block. There is now a direct way from value-use-iterator to incoming block in PHINode's API. This way we avoid the iterator->index->iterator trip, and especially the costly getOperandNo() invocation. Additionally there is now an assertion that the iterator really refers to one of the PHI's Uses. llvm-svn: 62869	2009-01-23 19:40:15 +00:00
Chris Lattner	c59945b4bd	another fix for PR3354 llvm-svn: 62561	2009-01-20 01:15:41 +00:00
Bill Wendling	caf1d22243	Doxygen-ify comments. llvm-svn: 62546	2009-01-19 23:43:56 +00:00
Chris Lattner	ea9f1d3c47	Fix a problem exposed by PR3354: simplifycfg was making a potentially trapping instruction be executed unconditionally. llvm-svn: 62541	2009-01-19 23:03:13 +00:00
Bill Wendling	534d2e0bae	Temporarily revert r62487. It's causing this error during a release bootstrap of llvm-gcc. Most likely, it's miscompiling one of the "gen*" programs: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./prev-gcc/xgcc -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./prev-gcc/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.6.0/bin/ -c -g -O2 -mdynamic-no-pic -DIN_GCC -W -Wall -Wwrite-strings -Wstrict-prototypes -Wmissing-prototypes -mdynamic-no-pic -DHAVE_CONFIG_H -DGENERATOR_FILE -I. -Ibuild -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/build -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include -DENABLE_LLVM -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/../llvm.src/include -D_DEBUG -D_GNU_SOURCE -D__STDC_LIMIT_MACROS -D__STDC_CONSTANT_MACROS -o build/gencondmd.o build/gencondmd.c ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: error: expected '}' before ')' token ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: warning: excess elements in struct initializer ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: warning: (near initialization for 'insn_conditions[4]') ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: error: expected '}' before ')' token ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: error: expected ',' or ';' before ')' token ../../llvm-gcc.src/gcc/config/i386/mmx.md:927: error: expected identifier or '(' before ',' token ../../llvm-gcc.src/gcc/config/i386/sse.md:3458: error: expected identifier or '(' before ',' token ... llvm-svn: 62506	2009-01-19 08:46:20 +00:00
Chris Lattner	f2bb4ea39c	Fix PR3016, a bug which can occur due to an invalid assumption: we assumed a CFG structure that would be valid when all code in the function is reachable, but not all code is necessarily reachable. Do a simple, but horrible, CFG walk to check for this case. llvm-svn: 62487	2009-01-19 02:46:28 +00:00
Chris Lattner	e381d7026f	reduce indentation by using 'continue', no functionality change. llvm-svn: 62477	2009-01-19 02:07:32 +00:00
Chris Lattner	54f0c61d71	Fix some problems in SpeculativelyExecuteBB. Basically, because of dead code, a phi could use the speculated instruction that was not in "BB2". Make this check explicit and tighten up some other corners. This fixes PR3292. No testcase because this depends entirely on visitation order of blocks and requires a sequence of 8 passes to repro. llvm-svn: 62476	2009-01-19 00:36:37 +00:00
Chris Lattner	e1c01e4e2b	Make this a bit more explicit about which cases need the check. No functionality change. llvm-svn: 62474	2009-01-18 23:22:07 +00:00
Gabor Greif	f1abfdccdc	introduce typedef for complicated vector, and use it too llvm-svn: 62384	2009-01-17 00:09:08 +00:00
Gabor Greif	8c573f7e49	typo llvm-svn: 62377	2009-01-16 23:08:50 +00:00
Rafael Espindola	6de96a1b5d	Add the private linkage. llvm-svn: 62279	2009-01-15 20:18:42 +00:00
Gabor Greif	5aa1922614	avoid using iterators when they get invalidated potentially this fixes PR3332 llvm-svn: 62271	2009-01-15 18:40:09 +00:00
Dale Johannesen	0aeabdff57	Fix testsuite regressions from recursive inlining. llvm-svn: 62189	2009-01-13 22:43:37 +00:00
Dale Johannesen	433a9086c0	Enable recursive inlining. Reduce inlining threshold back to 200; 400 seems to be too high, loses more than it gains. llvm-svn: 62107	2009-01-12 22:11:50 +00:00
Duncan Sands	dc020f9c3c	Rename getABITypeSize to getTypePaddedSize, as suggested by Chris. llvm-svn: 62099	2009-01-12 20:38:59 +00:00
Misha Brukman	5cbf223916	Removed trailing whitespace from Makefiles. llvm-svn: 61991	2009-01-09 16:44:42 +00:00
Dale Johannesen	4755d9df78	Adjustments to last patch based on review. llvm-svn: 61969	2009-01-09 01:30:11 +00:00
Dale Johannesen	b48fc71fc6	Do not inline functions with (dynamic) alloca into functions that don't already have a (dynamic) alloca. Dynamic allocas cause inefficient codegen and we shouldn't propagate this (behavior follows gcc). Two existing tests assumed such inlining would be done; they are hacked by adding an alloca in the caller, preserving the point of the tests. llvm-svn: 61946	2009-01-08 21:45:23 +00:00
Dan Gohman	906152a20f	Tidy up #includes, deleting a bunch of unnecessary #includes. llvm-svn: 61715	2009-01-05 17:59:02 +00:00
Chris Lattner	4caf5eb70c	Fix PR2929 by making bugpoint/code extract propagate the nothrow bit from the original function to the cloned one. llvm-svn: 61194	2008-12-18 05:52:56 +00:00
Chris Lattner	c1c6404bba	make instnamer name unnamed blocks as well as instructions and args. llvm-svn: 61175	2008-12-18 00:33:11 +00:00
Eli Friedman	cb61afb546	Add a helper to remove a branch and DCE the condition, and use it consistently for deleting branches. In addition to being slightly more readable, this makes SimplifyCFG a bit better about cleaning up after itself when it makes conditions unused. llvm-svn: 61100	2008-12-16 20:54:32 +00:00
Misha Brukman	234b44add2	Fix spelling. llvm-svn: 60971	2008-12-13 05:21:37 +00:00
Chris Lattner	f50d7f76c6	fix a bug I introduced in simplifycfg handling single entry phi nodes. FoldSingleEntryPHINodes deletes the PHI, so there is no need to delete it afterward. llvm-svn: 60653	2008-12-07 07:22:45 +00:00
Chris Lattner	dc3f6f2c12	Factor some code into a new FoldSingleEntryPHINodes method. llvm-svn: 60501	2008-12-03 19:44:02 +00:00
Chris Lattner	37e0136fef	third time is the charm. llvm-svn: 60469	2008-12-03 07:45:15 +00:00
Chris Lattner	c04a1ffa9a	fix assertion. llvm-svn: 60468	2008-12-03 07:43:05 +00:00
Chris Lattner	7eb270ed03	Rename DeleteBlockIfDead to DeleteDeadBlock and make it unconditionally delete the block. All likely clients will do the checking anyway. llvm-svn: 60464	2008-12-03 06:40:52 +00:00
Chris Lattner	bcc904a67c	Factor some code out of SimplifyCFG, forming a new DeleteBlockIfDead method. llvm-svn: 60463	2008-12-03 06:37:44 +00:00
Chris Lattner	e9f6c355bf	rewrite RecursivelyDeleteTriviallyDeadInstructions to use a more efficient formulation that doesn't require set lookups or scanning a set. llvm-svn: 60203	2008-11-28 01:20:46 +00:00
Chris Lattner	d4b5ba615e	remove some weirdness that came from the LSR code that has nothing to do with dead instruction elimination. No tests in dejagnu depend on this, so I don't know what it was needed for. llvm-svn: 60202	2008-11-28 00:58:15 +00:00
Chris Lattner	8e84c129ce	delete ErasePossiblyDeadInstructionTree, replacing uses of it with RecursivelyDeleteTriviallyDeadInstructions. llvm-svn: 60196	2008-11-27 23:25:44 +00:00
Chris Lattner	a1bbdff933	enhance RecursivelyDeleteTriviallyDeadInstructions to make PHIs dead if they are single-value. llvm-svn: 60194	2008-11-27 23:18:11 +00:00
Chris Lattner	1cb4f72706	Enhance RecursivelyDeleteTriviallyDeadInstructions to optionally return a list of deleted instructions. llvm-svn: 60193	2008-11-27 23:14:34 +00:00
Chris Lattner	c6c481cdfc	remove doConstantPropagation and dceInstruction, they are just wrappers around the interesting code and use an obscure iterator abstraction that dates back many many years. Move EraseDeadInstructions to Transforms/Utils and name it RecursivelyDeleteTriviallyDeadInstructions. llvm-svn: 60191	2008-11-27 22:57:53 +00:00
Chris Lattner	e0d019def6	switch InstCombine::visitLoadInst to use FindAvailableLoadedValue llvm-svn: 60169	2008-11-27 08:56:30 +00:00
Chris Lattner	c6ae56d23f	enhance FindAvailableLoadedValue to make use of AliasAnalysis if it has it. llvm-svn: 60167	2008-11-27 08:18:12 +00:00
Chris Lattner	72f16e70f0	move FindAvailableLoadedValue from JumpThreading to Transforms/Utils. llvm-svn: 60166	2008-11-27 08:10:05 +00:00
Chris Lattner	d6204bed3d	simplify this code a bit. llvm-svn: 60164	2008-11-27 07:54:38 +00:00
Chris Lattner	99d6809ac1	move MergeBasicBlockIntoOnlyPred to Transforms/Utils. llvm-svn: 60162	2008-11-27 07:43:12 +00:00
Chris Lattner	dd7083452f	reapply Sanjiv's patch to genericize memcpy/memset/memmove to take an arbitrary integer width for the count. llvm-svn: 59823	2008-11-21 16:42:48 +00:00
Bill Wendling	4bce2bff88	Revert r59802. It was breaking the build of llvm-gcc: g++ -m32 -c -g -DIN_GCC -W -Wall -Wwrite-strings -Wmissing-format-attribute -fno-common -mdynamic-no-pic -DHAVE_CONFIG_H -Wno-unused -DTARGET_NAME=\"i386-apple-darwin9.5.0\" -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include -DENABLE_LLVM -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/../llvm.src/include -D_DEBUG -D_GNU_SOURCE -D__STDC_LIMIT_MACROS -D__STDC_CONSTANT_MACROS -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include ../../llvm-gcc.src/gcc/llvm-types.cpp -o llvm-types.o ../../llvm-gcc.src/gcc/llvm-convert.cpp: In member function 'void TreeToLLVM::EmitMemCpy(llvm::Value, llvm::Value, llvm::Value, unsigned int)': ../../llvm-gcc.src/gcc/llvm-convert.cpp:1496: error: 'memcpy_i32' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp:1496: error: 'memcpy_i64' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp: In member function 'void TreeToLLVM::EmitMemMove(llvm::Value, llvm::Value, llvm::Value, unsigned int)': ../../llvm-gcc.src/gcc/llvm-convert.cpp:1512: error: 'memmove_i32' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp:1512: error: 'memmove_i64' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp: In member function 'void TreeToLLVM::EmitMemSet(llvm::Value, llvm::Value, llvm::Value, unsigned int)': ../../llvm-gcc.src/gcc/llvm-convert.cpp:1528: error: 'memset_i32' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp:1528: error: 'memset_i64' is not a member of 'llvm::Intrinsic' make[3]: [llvm-convert.o] Error 1 make[3]: * Waiting for unfinished jobs.... rm fsf-funding.pod gcov.pod gfdl.pod cpp.pod gpl.pod gcc.pod make[2]: * [all-stage1-gcc] Error 2 make[1]: * [stage1-bubble] Error 2 make: *** [all] Error 2 llvm-svn: 59809	2008-11-21 09:09:41 +00:00
Sanjiv Gupta	09a203765a	Make mem[cpy,move,set] intrinsics overloaded. llvm-svn: 59802	2008-11-21 07:49:09 +00:00
Devang Patel	38642e598e	Don't forget arguments! llvm-svn: 59745	2008-11-20 19:50:17 +00:00
Oscar Fuentes	4fb443f81b	CMake: Removed source file. llvm-svn: 59662	2008-11-19 19:32:19 +00:00
Devang Patel	79303b2572	Do not use separate utility to walk all instructions and remove dead dbg intrinsics. Let instcombiner do this job. llvm-svn: 59659	2008-11-19 19:01:37 +00:00
Devang Patel	a13f1f38fa	Initialize MallocFunc and FreeFunc properly. llvm-svn: 59538	2008-11-18 18:43:07 +00:00
Devang Patel	b63c74730c	Let AnalyzeAlloca() remove debug intrinsics. llvm-svn: 59454	2008-11-17 18:37:53 +00:00
Oscar Fuentes	1b504d5372	CMake: Remove removed source file. llvm-svn: 59098	2008-11-12 00:14:12 +00:00
Devang Patel	4f02a0b740	Remove llvm-svn: 59093	2008-11-11 23:58:15 +00:00
Devang Patel	bf0835706c	Undo previous check-in. llvm-svn: 59092	2008-11-11 23:57:33 +00:00
Oscar Fuentes	2353ef3e91	CMake: Updated list of source files for lib/Transforms/Utils. llvm-svn: 59077	2008-11-11 19:51:36 +00:00
Devang Patel	6096f26bd4	Add utility pass to remove dbg info. llvm-svn: 59068	2008-11-11 19:33:39 +00:00
Cedric Venet	8cb2e28e43	Update CMakeLists.txt llvm-svn: 59039	2008-11-11 09:55:48 +00:00
Devang Patel	dc6699e82f	Add utility routines to remove dead debug info. llvm-svn: 59011	2008-11-11 00:53:02 +00:00
Daniel Dunbar	2b9dce2669	Rework r58829, allowing removal of dbg info intrinsics during alloca promotion. - Eliminate uses after free and simplify tests. Devang: Please check that this is still doing what you intended. llvm-svn: 58887	2008-11-08 04:12:17 +00:00
Bill Wendling	b9656df4ac	BCUI + 1 doesn't work. Use next instead. llvm-svn: 58830	2008-11-07 01:59:41 +00:00
Devang Patel	b8e0d59ceb	Handle (delete) dbg intrinsics while promoting alloca. llvm-svn: 58826	2008-11-07 01:30:07 +00:00
Devang Patel	5a5ab730e0	InstructionNamer preserves everything. llvm-svn: 58787	2008-11-06 01:00:16 +00:00
Devang Patel	7a848b0ee3	Check Attribute::NoInline. llvm-svn: 58742	2008-11-05 01:37:05 +00:00
Devang Patel	f33f8a8606	Fix unused variable warnings. llvm-svn: 58651	2008-11-03 23:14:09 +00:00
Daniel Dunbar	a1c4fcfc29	Fix warning. llvm-svn: 58486	2008-10-31 01:50:01 +00:00
Daniel Dunbar	3933e66a89	Add InlineCost class for represent the estimated cost of inlining a function. - This explicitly models the costs for functions which should "always" or "never" be inlined. This fixes bugs where such costs were not previously respected. llvm-svn: 58450	2008-10-30 19:26:59 +00:00
Chris Lattner	0934c0f35b	Fix PR2967 by not deleting volatile load/stores that occur before unreachable. I don't really see this as being needed, but there is little harm from doing it. llvm-svn: 58385	2008-10-29 17:46:26 +00:00
Daniel Dunbar	cc20455346	Assorted comment/naming fixes, 80-col violations, and reindentation. - No functionality change. llvm-svn: 58352	2008-10-28 23:24:26 +00:00
Torok Edwin	ca97b42ef7	export an ID for the instructionNamer, allowing analysis/transformation passes that need it to require it by ID. llvm-svn: 58238	2008-10-27 10:16:27 +00:00
Chris Lattner	59b5691388	Rewrite all the 'PromoteLocallyUsedAlloca[s]' logic. With the power of LargeBlockInfo, we can now dramatically simplify their implementation and speed them up at the same time. Now the code has time proportional to the number of uses of the alloca, not the size of the block. This also eliminates code that tried to batch up different allocas which are used in the same blocks, and eliminates the 'retry list' logic which was baroque and unnecessary. In addition to being a speedup for crazy cases, this is also a nice cleanup: PromoteMemoryToRegister.cpp \| 270 +++++++++++++++----------------------------- 1 file changed, 96 insertions(+), 174 deletions(-) llvm-svn: 58229	2008-10-27 07:05:53 +00:00
Chris Lattner	f594ecc453	Add a new LargeBlockInfo helper, which is just a wrapper around a trivial dense map. Use this in RewriteSingleStoreAlloca to avoid aggressively rescanning blocks over and over again. This fixes PR2925, speeding up mem2reg on the testcase in that bug from 4.56s to 0.02s in a debug build on my machine. llvm-svn: 58227	2008-10-27 06:05:26 +00:00
Daniel Dunbar	7f39e2d85a	Change create*Pass factory functions to return Pass* instead of LoopPass*. - Although less precise, this means they can be used in clients without RTTI (who would otherwise need to include LoopPass.h, which eventually includes things using dynamic_cast). This was the simplest solution that presented itself, but I am happy to use a better one if available. llvm-svn: 58010	2008-10-22 23:32:42 +00:00
Nick Lewycky	03c5fa18f1	Don't drop alignment on globals when cloning. llvm-svn: 57320	2008-10-09 06:27:14 +00:00
Duncan Sands	26ff6f9c54	Add <cstdio> include where needed by gcc-4.4. Patch by Samuel Tardieu. llvm-svn: 57291	2008-10-08 07:23:46 +00:00
Andrew Lenharth	5aa1cc4065	Correctly set attributes when removing args during cloning. Fixes PR2765 llvm-svn: 57254	2008-10-07 18:08:38 +00:00
Devang Patel	f963403b58	Nick Lewycky's patch. While hoisting instruction check PHI node. llvm-svn: 57025	2008-10-03 18:57:37 +00:00
Owen Anderson	cb4f156b6b	SplitBlock should only attempt to update LoopInfo if it is actually being used. llvm-svn: 56994	2008-10-03 06:55:35 +00:00
Duncan Sands	08d91178e9	Rename isWeakForLinker to mayBeOverridden. Use it instead of hasWeakLinkage in a bunch of optimization passes. llvm-svn: 56782	2008-09-29 11:25:42 +00:00
Devang Patel	9eb525d4f9	Implement function notes as function attributes. llvm-svn: 56716	2008-09-26 23:51:19 +00:00
Devang Patel	4c758ea3e0	Large mechanical patch. s/ParamAttr/Attribute/g s/PAList/AttrList/g s/FnAttributeWithIndex/AttributeWithIndex/g s/FnAttr/Attribute/g This sets the stage - to implement function notes as function attributes and - to distinguish between function attributes and return value attributes. This requires corresponding changes in llvm-gcc and clang. llvm-svn: 56622	2008-09-25 21:00:45 +00:00
Devang Patel	e15607b7bb	Put FN_NOTE_AlwaysInline and others in FnAttr namespace. llvm-svn: 56527	2008-09-24 00:06:15 +00:00
Devang Patel	e87abd26ba	Move FN_NOTE_AlwaysInline and other out of ParamAttrs namespace. Do not check isDeclaration() in hasNote(). It is clients' responsibility. llvm-svn: 56524	2008-09-23 23:52:03 +00:00
Devang Patel	ba3fa6c6e1	s/ParameterAttributes/Attributes/g llvm-svn: 56513	2008-09-23 23:03:40 +00:00
Devang Patel	82fed6702b	Use parameter attribute store (soon to be renamed) for Function Notes also. Function notes are stored at index ~0. llvm-svn: 56511	2008-09-23 22:35:17 +00:00
Devang Patel	329fe728b5	Add hasNote() to check note associated with a function. llvm-svn: 56477	2008-09-22 22:32:29 +00:00
Oscar Fuentes	a229b3c9a7	Initial support for the CMake build system. llvm-svn: 56419	2008-09-22 01:08:49 +00:00
Devang Patel	76b22c1420	Try to place hoisted instructions before icmp instruction. llvm-svn: 56315	2008-09-18 22:50:42 +00:00
Devang Patel	7f9671ba37	Do not hoist instruction above branch condition. The instruction may use branch condition. llvm-svn: 56286	2008-09-17 18:21:49 +00:00
Devang Patel	0f7a3507cf	Fix simplifycfg crash in handling block merge. llvm-svn: 55971	2008-09-09 01:06:56 +00:00
Duncan Sands	46911f1271	Reapply 55859. This doesn't change anything as long as the callgraph is correct. It checks for wrong callgraphs more strictly. llvm-svn: 55894	2008-09-08 11:05:51 +00:00
Owen Anderson	1dd2e40521	Revert r55859. This is breaking the build in the absence of its companion commit. llvm-svn: 55865	2008-09-05 23:36:01 +00:00
Duncan Sands	9e23602849	Delete the removeCallEdgeTo callgraph method, because it does not maintain a correct list of callsites. I discovered (see following commit) that the inliner will create a wrong callgraph if it is fed a callgraph with correct edges but incorrect callsites. These were created by Prune-EH, and while it wasn't done via removeCallEdgeTo, it could have been done via removeCallEdgeTo, which is an accident waiting to happen. Use removeCallEdgeFor instead. llvm-svn: 55859	2008-09-05 21:43:04 +00:00
Duncan Sands	7c8fb1ad93	Remove trailing whitespace. llvm-svn: 55835	2008-09-05 12:37:12 +00:00
Dan Gohman	a79db30d28	Tidy up several unbeseeming casts from pointer to intptr_t. llvm-svn: 55779	2008-09-04 17:05:41 +00:00
Devang Patel	0d442ffa2b	Handle "always inline" note during inline cost analysis. llvm-svn: 55712	2008-09-03 18:47:45 +00:00
Chris Lattner	0c19df4871	Switch the asmprinter (.ll) and all the stuff it requires over to use raw_ostream instead of std::ostream. Among other goodness, this speeds up llvm-dis of kc++ with a release build from 0.85s to 0.49s (88% faster). Other interesting changes: 1) This makes Value::print be non-virtual. 2) AP[S]Int and ConstantRange can no longer print to ostream directly, use raw_ostream instead. 3) This fixes a bug in raw_os_ostream where it didn't flush itself when destroyed. 4) This adds a new SDNode::print method, instead of only allowing "dump". A lot of APIs have both std::ostream and raw_ostream versions, it would be useful to go through and systematically annihilate the std::ostream versions. This passes dejagnu, but there may be minor fallout, plz let me know if so and I'll fix it. llvm-svn: 55263	2008-08-23 22:23:09 +00:00
Chris Lattner	20abc419e5	Add a new trivial -inst-namer pass which makes it possible to diff the before/after effects of a pass, crazy! llvm-svn: 55230	2008-08-23 06:07:02 +00:00
Gordon Henriksen	d930f913e6	Rename some GC classes so that their role will hopefully be clearer. In particular, Collector was confusing to implementors. Several thought that this compile-time class was the place to implement their runtime GC heap. Of course, it doesn't even exist at runtime. Specifically, the renames are: Collector -> GCStrategy CollectorMetadata -> GCFunctionInfo CollectorModuleMetadata -> GCModuleInfo CollectorRegistry -> GCRegistry Function::getCollector -> getGC (setGC, hasGC, clearGC) Several accessors and nested types have also been renamed to be consistent. These changes should be obvious. llvm-svn: 54899	2008-08-17 18:44:35 +00:00
Chris Lattner	17f7165f84	Rework the routines that convert AP[S]Int into a string. Now, instead of returning an std::string by value, it fills in a SmallString/SmallVector passed in. This significantly reduces string thrashing in some cases. More specifically, this: - Adds an operator<< and a print method for APInt that allows you to directly send them to an ostream. - Reimplements APInt::toString to be much simpler and more efficient algorithmically in addition to not thrashing strings quite as much. This speeds up llvm-dis on kc++ by 7%, and may also slightly speed up the asmprinter. This also fixes a bug I introduced into the asmwriter in a previous patch w.r.t. alias printing. llvm-svn: 54873	2008-08-17 07:19:36 +00:00
Dan Gohman	8de6d22392	Use empty() instead of begin() == end(). llvm-svn: 54780	2008-08-14 18:13:49 +00:00
Dan Gohman	fa1211f69b	Enable first-class aggregates support. Remove the GetResultInst instruction. It is still accepted in LLVM assembly and bitcode, where it is now auto-upgraded to ExtractValueInst. Also, remove support for return instructions with multiple values. These are auto-upgraded to use InsertValueInst instructions. The IRBuilder still accepts multiple-value returns, and auto-upgrades them to InsertValueInst instructions. llvm-svn: 53941	2008-07-23 00:34:11 +00:00
Owen Anderson	9858691f25	Reapply r53735. My last patch fixed the failures Dan observed. llvm-svn: 53761	2008-07-18 17:49:43 +00:00
Owen Anderson	1468bec06e	Add some checks that got lost in the shuffle. This fixes 464.h264ref. llvm-svn: 53760	2008-07-18 17:46:41 +00:00
Dan Gohman	29c3adaae0	Revert r53735. It broke SPEC 464.h264ref. llvm-svn: 53757	2008-07-18 16:44:49 +00:00
Owen Anderson	fd7102037d	Use MergeBlockIntoPredecessor to simplify some code. llvm-svn: 53735	2008-07-17 20:00:46 +00:00
Owen Anderson	27405efdc0	Make MergeBlockIntoPredecessor more aggressive when the same successor appears more than once. llvm-svn: 53731	2008-07-17 19:42:29 +00:00
Evan Cheng	97cd0298cc	Inliner tweak. Function calls should cost more than one instruction! llvm-svn: 53712	2008-07-17 01:31:49 +00:00
Owen Anderson	c062381c7b	Factor MergeBlockIntoPredecessor out into BasicBlockUtils. llvm-svn: 53705	2008-07-17 00:01:40 +00:00
Chris Lattner	8882b1c41c	Reapply r53540, now with the matching header! llvm-svn: 53557	2008-07-14 17:32:59 +00:00
Duncan Sands	68b0383057	Revert r53540 - it does not compile. llvm-svn: 53549	2008-07-14 07:59:28 +00:00
Chris Lattner	2831ad28be	If a function calls setjmp, never inline it into other functions. This is a hack around the fact that we don't represent the CFG correctly for sj/lj. It fixes PR2486. llvm-svn: 53540	2008-07-14 00:46:56 +00:00
Chris Lattner	6f5ea6e49c	simplify some code, shuffle and insertelt always return a vector. llvm-svn: 53538	2008-07-14 00:32:20 +00:00
Chris Lattner	80b03a1b49	Fix mishandling of the infinite loop case when merging two blocks. This fixes PR2540. llvm-svn: 53533	2008-07-13 22:23:11 +00:00
Chris Lattner	834ab4ec1b	more refactoring. Use early exits instead of really complex logic. No functionality change. llvm-svn: 53532	2008-07-13 22:04:41 +00:00
Chris Lattner	5eed37224a	improve comments. llvm-svn: 53531	2008-07-13 21:55:46 +00:00
Chris Lattner	9aada1d755	factor another large hunk of code out into its own function. No functionality change. llvm-svn: 53530	2008-07-13 21:53:26 +00:00
Chris Lattner	55eaae1e0c	Final bit of simplification for FoldBranchToCommonDest. llvm-svn: 53528	2008-07-13 21:20:19 +00:00
Chris Lattner	1b317ea48a	simplify logic a bit llvm-svn: 53527	2008-07-13 21:15:11 +00:00
Chris Lattner	2e25b8f444	Refactor some code out into its own helper function, getting rid of crazy multiline conditionals and commenting the code better. No functionality change. llvm-svn: 53526	2008-07-13 21:12:01 +00:00
Evan Cheng	5fd28b54c7	- Use O(1) check of basic block size limit. - Avoid speculatively execute vector ops. llvm-svn: 52703	2008-06-25 07:50:12 +00:00
Dan Gohman	04c8bd7e11	Revert 52645, the loop unroller changes. It caused a regression in 252.eon. llvm-svn: 52688	2008-06-24 20:44:42 +00:00
Dan Gohman	48c5c7e860	Revamp the loop unroller, extending it to correctly update PHI nodes in the presence of out-of-loop users of in-loop values and the trip count is not a known multiple of the unroll count, and to be a bit simpler overall. This fixes PR2253. llvm-svn: 52645	2008-06-23 21:29:41 +00:00
Dan Gohman	90071075e2	Use Loop::block_iterator. llvm-svn: 52616	2008-06-22 20:18:58 +00:00
Dan Gohman	158ff2c4a9	Use Instruction::eraseFromParent(). llvm-svn: 52606	2008-06-21 22:08:46 +00:00
Chris Lattner	8459e0bc59	Fix warning when assertions disabled. llvm-svn: 52590	2008-06-21 19:49:01 +00:00
Dan Gohman	3ada1e118b	Clean up a use of std::distance. llvm-svn: 52544	2008-06-20 17:11:32 +00:00
Dan Gohman	3b18fd7b02	Teach InlineFunction how to differentiate between multiple-value return statements and aggregate returns so that it handles both correctly. llvm-svn: 52519	2008-06-20 01:03:44 +00:00
Dan Gohman	68f539e807	Delete dead code. llvm-svn: 52494	2008-06-19 17:18:39 +00:00
Evan Cheng	89553cc42e	Do not speculatively execute an instruction by hoisting it to its predecessor BB if any of its operands are defined but not used in BB. The transformation will prevent the operand from being sunk into the use block. llvm-svn: 52244	2008-06-12 21:15:59 +00:00
Evan Cheng	933c743042	For now, avoid generating FP select instructions in order to speculatively execute integer arithmetic instructions. FP selects are more likely to be expensive (even compared to branch on fcmp). This is not a wonderful solution but I rather err on the side of conservative. This fixes the heapsort performance regressions. llvm-svn: 52224	2008-06-11 19:18:20 +00:00
Gabor Greif	945f2f7fed	op_iterator-ify loops llvm-svn: 52191	2008-06-10 22:03:26 +00:00
Evan Cheng	89200c9177	Speculatively execute a block when the block is the then part of a triangle shape and it contains a single, side effect free, cheap instruction. The branch is eliminated by adding a select instruction. i.e. Turn BB: %t1 = icmp br i1 %t1, label %BB1, label %BB2 BB1: %t3 = add %t2, c br label BB2 BB2: => BB: %t1 = icmp %t4 = add %t2, c %t3 = select i1 %t1, %t2, %t3 llvm-svn: 52073	2008-06-07 08:52:29 +00:00
Devang Patel	8549e4ca07	LoopSimplify preserves AA. llvm-svn: 52053	2008-06-06 17:50:58 +00:00
Owen Anderson	2df82e7cec	LoopIndexSplit can sometimes result in cases where a block is in its own domfrontier. Don't crash when we encounter one of these. llvm-svn: 51915	2008-06-03 18:29:48 +00:00
Dan Gohman	2ad7e7341c	Fix whitespace in whitespace-significant pseudocode in a comment. llvm-svn: 51890	2008-06-03 00:57:21 +00:00
Gabor Greif	5df4326d78	rewrite operand loops to use iterators llvm-svn: 51789	2008-05-30 21:24:22 +00:00
Owen Anderson	1f59d9937f	Since LCSSA switched over to DenseMap, we have to be more careful to avoid iterator invalidation. Fixes PR2385. llvm-svn: 51777	2008-05-30 17:31:01 +00:00
Duncan Sands	dd7daee850	Factor code to copy global value attributes like the section or the visibility from one global value to another: copyAttributesFrom. This is particularly useful for duplicating functions: previously this was done by explicitly copying each attribute in turn at each place where a new function was created out of an old one, with the result that obscure attributes were regularly forgotten (like the collector or the section). Hopefully now everything is uniform and nothing is forgotten. llvm-svn: 51567	2008-05-26 19:58:59 +00:00
Owen Anderson	d3f21d165f	Use a DenseMap instead of an std::map, speeding up the testcase in PR2368 by about a third. llvm-svn: 51565	2008-05-26 10:07:43 +00:00
Dan Gohman	f96e1371e8	Tidy up BasicBlock::getFirstNonPHI, and change a bunch of places to use it instead of duplicating its functionality. llvm-svn: 51499	2008-05-23 21:05:58 +00:00
Matthijs Kooijman	aef2b8198b	Restructure a part of the SimplifyCFG pass and include a testcase. The SimplifyCFG pass looks at basic blocks that contain only phi nodes, followed by an unconditional branch. In a lot of cases, such a block (BB) can be merged into their successor (Succ). This merging is performed by TryToSimplifyUncondBranchFromEmptyBlock. It does this by taking all phi nodes in the successor block Succ and expanding them to include the predecessors of BB. Furthermore, any phi nodes in BB are moved to Succ and expanded to include the predecessors of Succ as well. Before attempting this merge, CanPropagatePredecessorsForPHIs checks to see if all phi nodes can be properly merged. All functional changes are made to this function, only comments were updated in TryToSimplifyUncondBranchFromEmptyBlock. In the original code, CanPropagatePredecessorsForPHIs looks quite convoluted and more like stack of checks added to handle different kinds of situations than a comprehensive check. In particular the first check in the function did some value checking for the case that BB and Succ have a common predecessor, while the last check in the function simply rejected all cases where BB and Succ have a common predecessor. The first check was still useful in the case that BB did not contain any phi nodes at all, though, so it was not completely useless. Now, CanPropagatePredecessorsForPHIs is restructured to look a lot more similar to the code that actually performs the merge. Both functions now look at the same phi nodes in about the same order. Any conflicts (phi nodes with different values for the same source) that could arise from merging or moving phi nodes are detected. If no conflicts are found, the merge can happen. Apart from only restructuring the checks, two main changes in functionality happened. Firstly, the old code rejected blocks with common predecessors in most cases. The new code performs some extra checks so common predecessors can be handled in a lot of cases. Wherever common predecessors still pose problems, the blocks are left untouched. Secondly, the old code rejected the merge when values (phi nodes) from BB were used in any other place than Succ. However, it does not seem that there is any situation that would require this check. Even more, this can be proven. Consider that BB is a block consisting of a single phi node "%a" and a branch to Succ. Now, since the definition of %a will dominate all of its uses, BB will dominate all blocks that use %a. Furthermore, since the branch from BB to Succ is unconditional, Succ will also dominate all uses of %a. Now, assume that one predecessor of Succ is not dominated by BB (and thus not dominated by Succ). Since at least one use of %a (but in reality all of them) is reachable from Succ, you could end up at a use of %a without passing through its definition in BB (by coming from X through Succ). This is a contradiction, meaning that our original assumption is wrong. Thus, all predecessors of Succ must also be dominated by BB (and thus also by Succ). This means that moving the phi node %a from BB to Succ does not pose any problems when the two blocks are merged, and any use checks are not needed. llvm-svn: 51478	2008-05-23 09:09:41 +00:00
Gabor Greif	e1f6e4b21d	API change for {BinaryOperator|CmpInst|CastInst}::create*() --> Create. Legacy interfaces will be in place for some time. (Merge from use-diet branch.) llvm-svn: 51200	2008-05-16 19:29:10 +00:00
Gabor Greif	697e94cc22	Fix a bunch of 80col violations that arose from the Create API change. Tweak makefile targets to find these better. llvm-svn: 51143	2008-05-15 10:04:30 +00:00
Dan Gohman	3dc2d92ebd	Split the loop unroll mechanism logic out into a utility function. Patch by Matthijs Kooijman! llvm-svn: 51083	2008-05-14 00:24:14 +00:00
Dan Gohman	0479aa5c0b	Change class' public PassInfo variables to be initialized with the address of the PassInfo directly instead of calling getPassInfo. This eliminates a bunch of dynamic initializations of static data. Also, fold RegisterPassBase into PassInfo, make a bunch of its data members const, and rearrange some code to initialize data members in constructors instead of using setter member functions. llvm-svn: 51022	2008-05-13 02:05:11 +00:00
Dan Gohman	d78c400b5b	Clean up the use of static and anonymous namespaces. This turned up several things that were neither in an anonymous namespace nor static but not intended to be global. llvm-svn: 51017	2008-05-13 00:00:25 +00:00
Dan Gohman	6a2da37c0e	Make several variable declarations static. llvm-svn: 50696	2008-05-06 01:53:16 +00:00
Dan Gohman	a8b7e78f54	Remove uses of llvm/System/IncludeFile.h that are no longer needed. llvm-svn: 50695	2008-05-06 01:32:53 +00:00
Devang Patel	fa0e3c4a92	Handle multiple return values. llvm-svn: 50604	2008-05-03 01:12:15 +00:00
Chris Lattner	8be72700b8	Fix PR2256, yet another miscompilation in simplifycfg of multiple return values. Bill, please pull this into Tak. llvm-svn: 50332	2008-04-28 00:19:07 +00:00
Nate Begeman	ca270ad96f	Feedback from chris llvm-svn: 50271	2008-04-25 17:45:52 +00:00
Nick Lewycky	4d43d3c72c	Remove 'unwinds to' support from mainline. This patch undoes r47802 r47989 r48047 r48084 r48085 r48086 r48088 r48096 r48099 r48109 and r48123. llvm-svn: 50265	2008-04-25 16:53:59 +00:00
Nate Begeman	6fed3b2038	Teach the PruningFunctionCloner how to look through loads with ConstantExpression GEPs pointing into constant globals. llvm-svn: 50256	2008-04-25 06:37:06 +00:00
Evan Cheng	608eeef5ce	Adjust inline cost computation to be less aggressive. llvm-svn: 50222	2008-04-24 18:42:47 +00:00
Chris Lattner	86bbf338e5	Split some code out of the main SimplifyCFG loop into its own function. Fix said code to handle merging return instructions together correctly when handling multiple return values. llvm-svn: 50199	2008-04-24 00:01:19 +00:00
Devang Patel	8f83081fea	Check type instead of no. of operands. llvm-svn: 50179	2008-04-23 20:18:29 +00:00
Chris Lattner	a5b11705b6	Move SplitBlockPredecessors out of loopsimplify into BasicBlockUtils.h as a global helper function. At the same time, switch it from taking a vector of predecessors to an arbitrary sequential input. This allows us to switch LoopSimplify to use a SmallVector for various temporary vectors that it passed into SplitBlockPredecessors. llvm-svn: 50020	2008-04-21 01:28:02 +00:00
Chris Lattner	d418b06abf	Move domtree/frontier updating earlier, allowing us to use it to update phi nodes, removing a hack. llvm-svn: 50019	2008-04-21 01:05:08 +00:00
Chris Lattner	96e9e22269	Factor dominator tree and frontier updating into SplitBlockPredecessors instead of doing it after every call. llvm-svn: 50018	2008-04-21 00:54:38 +00:00
Chris Lattner	aca912d793	simplify code, fit in 80 cols. llvm-svn: 50015	2008-04-21 00:23:14 +00:00
Chris Lattner	38806c3e9c	fit in 80 cols llvm-svn: 50014	2008-04-21 00:19:16 +00:00
Scott Michel	376acf4aaa	Remove unused variable llvm-svn: 49838	2008-04-17 01:30:44 +00:00
Scott Michel	f66cb3696a	Workaround for PR2207, in which pred_iterator assert gets triggered due to a wee problem in Xcode 2.[45]/gcc 4.0.1. llvm-svn: 49831	2008-04-16 23:46:39 +00:00
Chuck Rose III	c6a47e8a79	VisualStudio project files updated. #include <algorithm> added to make VisualStudio happy. Also had to undefine setjmp because of #include <csetjmp> turning setjmp into _setjmp in VisualStudio. llvm-svn: 49743	2008-04-15 21:27:11 +00:00
Owen Anderson	7629b71dd4	Revert r49614. As Dan pointed out, some of these aren't correct. llvm-svn: 49657	2008-04-14 17:38:21 +00:00
Owen Anderson	1f6fbc4bc3	Replace calls of the form V1->setName(V2->getName()) with V1->takeName(V2), which is significantly more efficient. llvm-svn: 49614	2008-04-13 19:15:17 +00:00
Devang Patel	8cd2a3ae2a	Fix insert point handling for multiple return values. llvm-svn: 49367	2008-04-08 02:24:08 +00:00
Duncan Sands	1416ebf1fe	The "stacksave is not nounwind problem" no longer needs to be fixed here - a previous commit made sure that intrinsics always get the right attributes. So remove no-longer needed code, and while there use Intrinsic::getDeclaration rather than getOrInsertFunction. llvm-svn: 49337	2008-04-07 13:43:58 +00:00
Duncan Sands	fbc6adcc59	Use Intrinsic::getDeclaration to get hold of intrinsics. Fix up the argument type (should be i8, was an array). llvm-svn: 49336	2008-04-07 13:41:19 +00:00
Dale Johannesen	87e484f08b	Mark calls to llvm.stacksave, llvm.stackrestore as nounwind. When such calls are inlined into something else that is invoked, they were getting changed to invokes, which is badness. llvm-svn: 49299	2008-04-07 00:08:48 +00:00
Gabor Greif	e9ecc68d8f	API changes for class Use size reduction, wave 1. Specifically, introduction of XXX::Create methods for Users that have a potentially variable number of Uses. llvm-svn: 49277	2008-04-06 20:25:17 +00:00
Evan Cheng	ac38d444e2	1. Drop default inline threshold back down to 200. 2. Do not use # of basic blocks as part of the cost computation since it doesn't really figure into function size. 3. More aggressively inline function with vector code. llvm-svn: 49061	2008-04-01 23:59:29 +00:00
Dale Johannesen	5e4e051c2a	Revert 49006 for the moment. llvm-svn: 49046	2008-04-01 20:00:57 +00:00
Dale Johannesen	7d02cf3c9c	Emit exception handling info for functions which are not marked nounwind, or for all functions when -enable-eh is set, provided the target supports Dwarf EH. llvm-gcc generates nounwind in the right places; other FEs will need to do so also. Given such a FE, -enable-eh should no longer be needed. llvm-svn: 49006	2008-03-31 23:40:23 +00:00
Evan Cheng	3471ae8c5d	Increasing the inline limit from (overly conservative) 200 to 300. Given each BB costs 20 and each instruction costs 5, 200 means a 4 BB function + 24 instructions (actually less because caller's size also contributes to it). Furthermore, double the limit when more than 10% of the callee instructions are vector instructions. Multimedia kernels tend to love inlining. llvm-svn: 48725	2008-03-24 06:37:48 +00:00
Anton Korobeynikov	d38b3fb127	Preserve calling convention during function cloning llvm-svn: 48708	2008-03-23 16:03:00 +00:00
Evan Cheng	5daf090a1a	80 col violation. llvm-svn: 48573	2008-03-20 00:20:23 +00:00
Nick Lewycky	7698bfbe16	Update -mem2reg to use succ_iterator instead of iterating across TerminatorInst successors. This makes it support nounwind. llvm-svn: 48320	2008-03-13 02:42:41 +00:00
Dan Gohman	20af5a0fe7	Check to see if a two-entry PHI block can be simplified before trying to merge the block into its predecessors. This allows two-entry-phi-return.ll to be simplified into a single basic block. llvm-svn: 48252	2008-03-11 21:53:06 +00:00
Devang Patel	64d0f07085	Restore optimization that merges blocks when inline function has single return value. llvm-svn: 48162	2008-03-10 18:34:00 +00:00
Devang Patel	72ea2dc9a9	Simplify llvm-svn: 48161	2008-03-10 18:22:16 +00:00
Devang Patel	c0325b2040	simplify llvm-svn: 48160	2008-03-10 18:11:41 +00:00
Nick Lewycky	fb2c1a999a	Turn unwind_to into "unwinds to". llvm-svn: 48123	2008-03-10 02:20:00 +00:00
Nick Lewycky	42445be0df	Firstly, having a BranchInst isn't exclusive with having an unwind_to. Secondly, we have to check whether the branch is actually pointing to the block with the unwind in it. We could have gotten here because of the unwind_to alone. llvm-svn: 48099	2008-03-09 07:50:37 +00:00
Nick Lewycky	f3d637fa14	A BB that unwind_to an "unwind" inst is the same as one that doesn't unwind_to at all. llvm-svn: 48096	2008-03-09 07:36:38 +00:00
Nick Lewycky	11fc6f8765	Update the block cloner which fixes bugpoint on code using unwind_to (phew!) and also update the cloning interface's major user, the loop optimizations. llvm-svn: 48088	2008-03-09 05:24:34 +00:00
Nick Lewycky	5ce9b521d7	Update the inliner and simplifycfg to handle unwind_to. llvm-svn: 48086	2008-03-09 05:10:13 +00:00
Nick Lewycky	cc24104703	Two things. Preserve the unwind_to when splitting a BB. Add the ability to remove just one instance of a BB from a phi node. This fixes the compile error in the tree now. llvm-svn: 48085	2008-03-09 05:04:48 +00:00
Devang Patel	780b3ca64b	Update inliner to handle functions that return multiple values. llvm-svn: 48020	2008-03-07 20:06:16 +00:00
Devang Patel	3b1c95f885	Handle 'ret' with multiple values. llvm-svn: 47965	2008-03-05 21:50:24 +00:00
Devang Patel	e516aa1127	Skip functions that return multiple values. llvm-svn: 47924	2008-03-05 00:36:59 +00:00
Devang Patel	4566d885dd	Use while loop. llvm-svn: 47909	2008-03-04 21:59:49 +00:00
Devang Patel	941ab37ea8	Use cast instead of dyn_cast. Update test to use multiple return value directly, instead of relying on -sretpromotion. llvm-svn: 47907	2008-03-04 21:45:28 +00:00
Devang Patel	841322b32a	Handle multiple return values. llvm-svn: 47904	2008-03-04 21:15:15 +00:00
Anton Korobeynikov	18991d78fa	Fix newly-introduced 4.3 warnings llvm-svn: 47375	2008-02-20 12:07:57 +00:00
Anton Korobeynikov	1bfd121321	Make Transforms to be 4.3 warnings-clean llvm-svn: 47371	2008-02-20 11:26:25 +00:00
Chris Lattner	c3591a0d48	remove the LowerSelect pass. The last client was the old Sparc backend, which is long dead by now. llvm-svn: 47323	2008-02-19 07:49:17 +00:00
Chris Lattner	6b39cb907b	switch simplifycfg from using vectors for most things to smallvectors, this speeds it up 2.3% on eon. llvm-svn: 47261	2008-02-18 07:42:56 +00:00
Chris Lattner	70e294660a	Fix PR2029 llvm-svn: 47129	2008-02-14 19:18:13 +00:00
Chris Lattner	a838141957	Make RenamePass faster by making the 'is this a new phi node' check more intelligent. This speeds up mem2reg from 5.29s to 0.79s on a synthetic testcase with tons of predecessors and phi nodes. llvm-svn: 46767	2008-02-05 21:26:23 +00:00
Duncan Sands	053c9871cd	Revert r46393: readonly/readnone functions are no longer allowed to write through byval arguments. llvm-svn: 46416	2008-01-27 18:12:58 +00:00
Duncan Sands	c4dc3dc3a2	Create an explicit copy for byval parameters even when inlining a readonly function. llvm-svn: 46393	2008-01-26 06:41:49 +00:00
Duncan Sands	f52faf9a64	Do this more neatly. llvm-svn: 46369	2008-01-25 22:06:51 +00:00
Chris Lattner	4f6c81ac68	we don't have to make an explicit copy of a byval argument when inlining a function if we know that the function does not write to any memory. This implements test/Transforms/Inline/byval2.ll llvm-svn: 45912	2008-01-12 18:54:29 +00:00
Chris Lattner	908117bf69	When inlining a function with a byval argument, make an explicit copy of it in case the callee modifies the struct. llvm-svn: 45853	2008-01-11 06:09:30 +00:00
Chris Lattner	f391883670	don't hoist FP additions into unconditional adds + selects. This could theoretically introduce a trap, but is also a performance issue. This speeds up ptrdist/ks by 8%. llvm-svn: 45533	2008-01-03 07:25:26 +00:00
Chris Lattner	f3ebc3f3d2	Remove attribution from file headers, per discussion on llvmdev. llvm-svn: 45418	2007-12-29 20:36:04 +00:00
Chris Lattner	a087a8d2ce	remove attribution from lib Makefiles. llvm-svn: 45415	2007-12-29 20:09:26 +00:00
Chris Lattner	e96658392d	dead calls to llvm.stacksave can be deleted, even though they have potential side-effects. llvm-svn: 45392	2007-12-29 00:59:12 +00:00
Gordon Henriksen	b969c5981b	GC poses hazards to the inliner. Consider: define void @f() { ... call i32 @g() ... } define void @g() { ... } The hazards are: - @f and @g have GC, but they differ GC. Inlining is invalid. This may never occur. - @f has no GC, but @g does. g's GC must be propagated to @f. The other scenarios are safe: - @f and @g have the same GC. - @f and @g have no GC. - @g has no GC. This patch adds inliner checks for the former two scenarios. llvm-svn: 45351	2007-12-25 03:10:07 +00:00
Devang Patel	7a2c66b11e	If succ has succ itself as one of the predecessors then do not merge current bb and succ even if bb's terminator is unconditional branch to succ. llvm-svn: 45305	2007-12-22 01:32:53 +00:00
Duncan Sands	aa31b92508	When inlining through an 'nounwind' call, mark inlined calls 'nounwind'. It is important for correct C++ exception handling that nounwind markings do not get lost, so this transformation is actually needed for correctness. llvm-svn: 45218	2007-12-19 21:13:37 +00:00
Duncan Sands	3353ed09ac	Rename isNoReturn to doesNotReturn, and isNoUnwind to doesNotThrow. llvm-svn: 45160	2007-12-18 09:59:50 +00:00
Duncan Sands	b5a79d0eaa	Make invokes of inline asm legal. Teach codegen how to lower them (with no attempt made to be efficient, since they should only occur for unoptimized code). llvm-svn: 45108	2007-12-17 18:08:19 +00:00
David Greene	71eae8a5ee	GLIBCXX_DEBUG fix. std::vector<>::end() is invalidated by erase. llvm-svn: 45101	2007-12-17 17:42:03 +00:00
Christopher Lamb	edf0788758	Change the PointerType api for creating pointer types. The old functionality of PointerType::get() has become PointerType::getUnqual(), which returns a pointer in the generic address space. The new prototype of PointerType::get() requires both a type and an address space. llvm-svn: 45082	2007-12-17 01:12:55 +00:00
Duncan Sands	56ed48036b	Revert this part of r45073 until the verifier is changed not to reject invoke of inline asm. llvm-svn: 45077	2007-12-16 21:01:21 +00:00
Duncan Sands	8e4847ee95	Make instcombine promote inline asm calls to 'nounwind' calls. Remove special casing of inline asm from the inliner. There is a potential problem: the verifier rejects invokes of inline asm (not sure why). If an asm call is not marked "nounwind" in some .ll, and instcombine is not run, but the inliner is run, then an illegal module will be created. This is bad but I'm not sure what the best approach is. I'm tempted to remove the check in the verifier... llvm-svn: 45073	2007-12-16 15:51:49 +00:00
Chris Lattner	d2265b45ae	Fix PR1850 by removing an unsafe transformation from VMCore/ConstantFold.cpp. Reimplement the xform in Analysis/ConstantFolding.cpp where we can use targetdata to validate that it is safe. While I'm in there, fix some const correctness issues and generalize the interface to the "operand folder". llvm-svn: 44817	2007-12-10 22:53:04 +00:00
Gordon Henriksen	71183b6739	Adding a collector name attribute to Function in the IR. These methods are new to Function: bool hasCollector() const; const std::string &getCollector() const; void setCollector(const std::string &); void clearCollector(); The assembly representation is as such: define void @f() gc "shadow-stack" { ... The implementation uses an on-the-side table to map Functions to collector names, such that there is no overhead. A StringPool is further used to unique collector names, which are extremely likely to be unique per process. llvm-svn: 44769	2007-12-10 03:18:06 +00:00
Duncan Sands	38ef3a8ec7	Rather than having special rules like "intrinsics cannot throw exceptions", just mark intrinsics with the nounwind attribute. Likewise, mark intrinsics as readnone/readonly and get rid of special aliasing logic (which didn't use anything more than this anyway). llvm-svn: 44544	2007-12-03 20:06:50 +00:00
Duncan Sands	ad0ea2d430	Fix PR1146: parameter attributes are no longer part of the function type, instead they belong to functions and function calls. This is an updated and slightly corrected version of Reid Spencer's original patch. The only known problem is that auto-upgrading of bitcode files doesn't seem to work properly (see test/Bitcode/AutoUpgradeIntrinsics.ll). Hopefully a bitcode guru (who might that be? :) ) will fix it. llvm-svn: 44359	2007-11-27 13:23:08 +00:00
Owen Anderson	b0dd27ee91	Make LoopInfoBase more generic, in preparation for having MachineLoopInfo. This involves a small interface change. llvm-svn: 44348	2007-11-27 03:43:35 +00:00
Anton Korobeynikov	550b98e147	Fix indent llvm-svn: 43941	2007-11-09 12:34:20 +00:00
Anton Korobeynikov	98638aede6	Forgot to commit users part of value mapper interface llvm-svn: 43940	2007-11-09 12:27:04 +00:00
Anton Korobeynikov	8eeca1c252	And delete this one llvm-svn: 43939	2007-11-09 12:22:04 +00:00
Gordon Henriksen	d568767ecb	Finishing initial docs for all transformations in Passes.html. Also cleaned up some comments in source files. llvm-svn: 43674	2007-11-04 16:15:04 +00:00
Dan Gohman	d7917b6248	Add std:: to sort calls. llvm-svn: 43652	2007-11-02 22:24:01 +00:00
Dan Gohman	c981d72d1a	Change illegal uses of ++ to uses of STLExtra.h's next function. llvm-svn: 43651	2007-11-02 22:22:02 +00:00
Duncan Sands	44b8721de8	Executive summary: getTypeSize -> getTypeStoreSize / getABITypeSize. The meaning of getTypeSize was not clear - clarifying it is important now that we have x86 long double and arbitrary precision integers. The issue with long double is that it requires 80 bits, and this is not a multiple of its alignment. This gives a primitive type for which getTypeSize differed from getABITypeSize. For arbitrary precision integers it is even worse: there is the minimum number of bits needed to hold the type (eg: 36 for an i36), the maximum number of bits that will be overwritten when storing the type (40 bits for i36) and the ABI size (i.e. the storage size rounded up to a multiple of the alignment; 64 bits for i36). This patch removes getTypeSize (not really - it is still there but deprecated to allow for a gradual transition). Instead there is: (1) getTypeSizeInBits - a number of bits that suffices to hold all values of the type. For a primitive type, this is the minimum number of bits. For an i36 this is 36 bits. For x86 long double it is 80. This corresponds to gcc's TYPE_PRECISION. (2) getTypeStoreSizeInBits - the maximum number of bits that is written when storing the type (or read when reading it). For an i36 this is 40 bits, for an x86 long double it is 80 bits. This is the size alias analysis is interested in (getTypeStoreSize returns the number of bytes). There doesn't seem to be anything corresponding to this in gcc. (3) getABITypeSizeInBits - this is getTypeStoreSizeInBits rounded up to a multiple of the alignment. For an i36 this is 64, for an x86 long double this is 96 or 128 depending on the OS. This is the spacing between consecutive elements when you form an array out of this type (getABITypeSize returns the number of bytes). This is TYPE_SIZE in gcc. Since successive elements in a SequentialType (arrays, pointers and vectors) need to be aligned, the spacing between them will be given by getABITypeSize.
This means that the size of an array is the length times the getABITypeSize. It also means that GEP computations need to use getABITypeSize when computing offsets. Furthermore, if an alloca allocates several elements at once then these too need to be aligned, so the size of the alloca has to be the number of elements multiplied by getABITypeSize. Logically speaking this doesn't have to be the case when allocating just one element, but it is simpler to also use getABITypeSize in this case. So allocas and mallocs should use getABITypeSize. Finally, since gcc's only notion of size is that given by getABITypeSize, if you want to output assembler etc the same as gcc then getABITypeSize is the size you want. Since a store will overwrite no more than getTypeStoreSize bytes, and a read will read no more than that many bytes, this is the notion of size appropriate for alias analysis calculations. In this patch I have corrected all type size uses except some of those in ScalarReplAggregates, lib/Codegen, lib/Target (the hard cases). I will get around to auditing these too at some point, but I could do with some help. Finally, I made one change which I think wise but others might consider pointless and suboptimal: in an unpacked struct the amount of space allocated for a field is now given by the ABI size rather than getTypeStoreSize. I did this because every other place that reserves memory for a type (eg: alloca) now uses getABITypeSize, and I didn't want to make an exception for unpacked structs, i.e. I did it to make things more uniform. This only affects structs containing long doubles and arbitrary precision integers. If someone wants to pack these types more tightly they can always use a packed struct. llvm-svn: 43620	2007-11-01 20:53:16 +00:00
Chris Lattner	4a15e04aee	Fix PR1752 and LoopSimplify/2007-10-28-InvokeCrash.ll: terminators can have uses too. Wouldn't it be nice if invoke didn't exist? :) llvm-svn: 43426	2007-10-29 02:30:37 +00:00
Anton Korobeynikov	7499a3b092	Reg2Mem cleanup and optimizations: - enable phi instructions demotion to stack - create alloca instructions in the entry block llvm-svn: 43208	2007-10-21 23:05:16 +00:00
Owen Anderson	ca831a829d	Move Split<...>() into DomTreeBase. This should make the #include's of DominatorInternals.h in CodeExtractor and LoopSimplify unnecessary. Hartmut, could you confirm that this fixes the issues you were seeing? llvm-svn: 43115	2007-10-18 05:13:52 +00:00
Hartmut Kaiser	2f842e613f	Fixed linker errors (unresolved externals: split<>(...)) when compiling with VC++. Please review. llvm-svn: 43081	2007-10-17 18:37:09 +00:00
Devang Patel	9d1af9b63d	Fix comment. llvm-svn: 42048	2007-09-17 20:07:40 +00:00
Chris Lattner	0625bd6472	Merge DenseMapKeyInfo & DenseMapValueInfo into DenseMapInfo Add a new DenseMapInfo::isEqual method to allow clients to redefine the equality predicate used when probing the hash table. llvm-svn: 42042	2007-09-17 18:34:04 +00:00
Devang Patel	f6ef552f3d	Insert cloned loop basic blocks before original loop header. llvm-svn: 41713	2007-09-04 20:46:35 +00:00
David Greene	c656cbb8c2	Update GEP constructors to use an iterator interface to fix GLIBCXX_DEBUG issues. llvm-svn: 41697	2007-09-04 15:46:09 +00:00
Anton Korobeynikov	35322d745c	Silence warning while compiling with gcc 4.2 llvm-svn: 41676	2007-09-02 22:11:14 +00:00
David Greene	703623d571	Update InvokeInst to work like CallInst llvm-svn: 41506	2007-08-27 19:04:21 +00:00
Anton Korobeynikov	24fb6b2f8c	Don't promote volatile loads/stores. This is needed (for example) to handle setjmp/longjmp properly. This fixes PR1520. llvm-svn: 41461	2007-08-26 21:43:30 +00:00
Devang Patel	b5933bbbd5	Use SmallVector instead of std::vector. llvm-svn: 41207	2007-08-21 00:31:24 +00:00
Devang Patel	d1fcfcc76c	When one branch of a condition is eliminated, the head of the other branch is not necessarily the immediate dominator of the merge block in all cases. llvm-svn: 41144	2007-08-17 21:59:16 +00:00
Devang Patel	22c7993ecf	Break infinite loop. llvm-svn: 41091	2007-08-14 23:59:17 +00:00
Devang Patel	da48cf40db	If NewBB dominates DestBB then DestBB is not part of NewBB's dominance frontier. llvm-svn: 41051	2007-08-13 21:59:17 +00:00
Devang Patel	aa36a43908	Add utility to clone loops. llvm-svn: 40997	2007-08-10 17:59:47 +00:00
Chris Lattner	c7ba225705	remove some dead lines llvm-svn: 40859	2007-08-06 06:21:06 +00:00
Chris Lattner	edce70d2fe	rewrite the code used to construct pruned SSA form with the IDF method. In the old way, we computed and inserted phi nodes for the whole IDF of the definitions of the alloca, then computed which ones were dead and removed them. In the new method, we first compute the region where the value is live, and use that information to only insert phi nodes that are live. This eliminates the need to compute liveness later, and stops the algorithm from inserting a bunch of phis which it then later removes. This speeds up the testcase in PR1432 from 2.00s to 0.15s (14x) in a release build and 6.84s->0.50s (14x) in a debug build. llvm-svn: 40825	2007-08-04 22:50:14 +00:00
Chris Lattner	d91576b01e	Factor out a whole bunch of code into its own method. llvm-svn: 40824	2007-08-04 21:14:29 +00:00
Chris Lattner	4e1b4140eb	Use getNumPreds(BB) instead of computing them manually. This is a very small but measurable speedup. llvm-svn: 40823	2007-08-04 21:06:15 +00:00
Chris Lattner	b6a4ba808b	Change the rename pass to be "tail recursive", only adding N-1 successors to the worklist, and handling the last one with a 'tail call'. This speeds up PR1432 from 2.0578s to 2.0012s (2.8%) llvm-svn: 40822	2007-08-04 20:40:27 +00:00
Chris Lattner	840259c8d3	cache computation of #preds for a BB. This speeds up mem2reg from 2.0742->2.0522s on PR1432. llvm-svn: 40821	2007-08-04 20:24:50 +00:00
Chris Lattner	050bac4bed	reserve operand space for phi nodes when we insert them. llvm-svn: 40820	2007-08-04 20:14:34 +00:00
Chris Lattner	9318785df5	use continue to avoid nesting, no functionality change. llvm-svn: 40819	2007-08-04 20:07:06 +00:00
Chris Lattner	6b04ecbaf9	Promoting allocas with the 'single store' fastpath is faster than with the 'local to a block' fastpath. This speeds up PR1432 from 2.1232 to 2.0686s (2.6%) llvm-svn: 40818	2007-08-04 20:03:23 +00:00
Chris Lattner	4a930f9444	When PromoteLocallyUsedAllocas promoted allocas, it didn't remember to increment NumLocalPromoted, and didn't actually delete the dead alloca, leading to an extra iteration of mem2reg. llvm-svn: 40817	2007-08-04 20:01:43 +00:00
Chris Lattner	63c039780c	std::map -> DenseMap llvm-svn: 40816	2007-08-04 19:52:20 +00:00
Chris Lattner	7d382f7680	fix a logic bug where we wouldn't promote single store allocas if the stored value was a non-instruction value. Doh. This increases the # single store allocas from 8982 to 9026, and speeds up mem2reg on the testcase in PR1432 from 2.17 to 2.13s. llvm-svn: 40813	2007-08-04 02:45:02 +00:00
Chris Lattner	1b215f0661	When we do the single-store optimization, delete both the store and the alloca so they don't get reprocessed. This speeds up PR1432 from 2.20s to 2.17s. llvm-svn: 40812	2007-08-04 02:38:38 +00:00
Chris Lattner	862f125457	Three improvements: 1. Check for revisiting a block before checking domination, which is faster. 2. If the stored value isn't an instruction, we don't have to check for domination. 3. If we have a value used in the same block more than once, make sure to remove the block from the UsingBlocks vector. Not doing so forces us to go through the slow path for the alloca. The combination of these improvements increases the number of allocas on the fastpath from 8935 to 8982 on PR1432. This speeds it up from 2.90s to 2.20s (31%) llvm-svn: 40811	2007-08-04 02:32:22 +00:00
Chris Lattner	ae1e00eb36	switch from using a std::set to using a SmallPtrSet. This speeds up the testcase in PR1432 from 6.33s to 2.90s (2.22x) llvm-svn: 40810	2007-08-04 02:21:22 +00:00
Chris Lattner	9181801bb7	In mem2reg, when handling the single-store case, make sure to remove a using block from the list if we handle it. Not doing this caused us to not be able to promote (with the fast path) allocas which have uses (whoops). This increases the # allocas hitting this fastpath from 4042 to 8935 on the testcase in PR1432, speeding up mem2reg by 2.6x llvm-svn: 40809	2007-08-04 02:15:24 +00:00
Chris Lattner	886a41a007	split rewriting of single-store allocas into its own method. llvm-svn: 40806	2007-08-04 01:47:41 +00:00
Chris Lattner	3cede09c67	refactor some code to shrink PromoteMem2Reg::run a bit llvm-svn: 40805	2007-08-04 01:41:18 +00:00
Chris Lattner	d524537fe9	add a typedef, no other change. llvm-svn: 40804	2007-08-04 01:19:38 +00:00
Chris Lattner	df138be527	avoid an unneeded vector copy. This speeds up mem2reg on the testcase in PR1432 by 6% llvm-svn: 40803	2007-08-04 01:07:49 +00:00
Chris Lattner	fd838f0770	make RenamePassWorkList a local var instead of an ivar. llvm-svn: 40802	2007-08-04 01:04:40 +00:00
Dan Gohman	34d442f274	More explicit keywords. llvm-svn: 40673	2007-08-01 15:32:29 +00:00
David Greene	17a5dfe6f7	New CallInst interface to address GLIBCXX_DEBUG errors caused by indexing an empty std::vector. Updates to all clients. llvm-svn: 40660	2007-08-01 03:43:44 +00:00
Devang Patel	c5e340eded	LCSSA preserves dom info. llvm-svn: 40604	2007-07-30 20:23:45 +00:00
Devang Patel	e3206cb425	Use SmallPtrSet. llvm-svn: 40560	2007-07-27 18:34:27 +00:00
Dan Gohman	6e853bc73f	Move the GET_SIDE_EFFECT_INFO logic from isInstructionTriviallyDead to Instruction::mayWriteToMemory, fixing a FIXME, and helping various places that call mayWriteToMemory directly. llvm-svn: 40533	2007-07-26 16:06:08 +00:00
Devang Patel	33227115b9	Add BasicInliner interface. This interface allows clients to inline a bunch of functions with module level call graph information. llvm-svn: 40486	2007-07-25 18:00:25 +00:00
Devang Patel	a273d1cd3a	Verify loop info. llvm-svn: 40062	2007-07-19 18:02:32 +00:00
Devang Patel	186e0d8b0a	After a basic block is split into two parts, second part dominates all the blocks dominated by original basic block. And first part dominates second part. llvm-svn: 40035	2007-07-19 02:29:24 +00:00
Devang Patel	de5901523c	Now this temp. fix is not required. llvm-svn: 40034	2007-07-19 02:22:21 +00:00
Reid Spencer	3363f4ad96	Return Undef if the block has no dominator. This was required to allow llvm-gcc build to succeed. Without this change it fails in libstdc++ compilation. This causes no regressions in dejagnu tests. However, someone who knows this code better might want to review it. llvm-svn: 39924	2007-07-16 21:03:44 +00:00
Dan Gohman	06c60b6032	Fix comments about vectors to use the current wording. llvm-svn: 39921	2007-07-16 14:29:03 +00:00
Devang Patel	4cd1413f15	Make LCSSA a loop pass. llvm-svn: 39844	2007-07-13 23:57:11 +00:00
Tanya Lattner	ccecbcd779	Adding ability to demote phi to stack. llvm-svn: 39744	2007-07-11 18:41:34 +00:00
Anton Korobeynikov	76547349c1	During module cloning copy aliases too. This fixes PR1544 llvm-svn: 38505	2007-07-10 19:07:35 +00:00
Devang Patel	d7767cc2a7	Add SplitEdge and SplitBlock utility routines. llvm-svn: 37952	2007-07-06 21:39:20 +00:00
David Greene	1e2a12019f	Fix reference to iterator invalidated by an erase operation. Uncovered by _GLIBCXX_DEBUG. llvm-svn: 37796	2007-06-29 02:53:16 +00:00
Devang Patel	d5258a23a5	Move code to update dominator information after basic block is split from LoopSimplify.cpp to Dominator.cpp llvm-svn: 37689	2007-06-21 17:23:45 +00:00
Devang Patel	78b9c68164	Add and use DominatorTreeBase::findNearestCommonDominator(). llvm-svn: 37545	2007-06-11 23:31:22 +00:00
Devang Patel	536ac4dca7	Simplify. llvm-svn: 37542	2007-06-11 21:45:31 +00:00
Devang Patel	d18054afcf	simplify llvm-svn: 37541	2007-06-11 21:25:31 +00:00
Devang Patel	ab2eee89a4	Simplify. Dominator Tree is required so always available. llvm-svn: 37540	2007-06-11 21:18:00 +00:00
Devang Patel	becc466451	Update LoopSimplify to require and preserve DominatorTree only. Now LoopSimplify does not require nor preserve ETForest. llvm-svn: 37512	2007-06-08 01:50:32 +00:00
Devang Patel	8ecffa996a	Do not preserve ETForest. llvm-svn: 37506	2007-06-08 00:02:08 +00:00
Devang Patel	cf470e5255	Do not use ETForest as well as DominatorTree. DominatorTree is sufficient. llvm-svn: 37501	2007-06-07 22:17:16 +00:00
Devang Patel	fc7fdef7d2	Use DominatorTree instead of ETForest. This allows faster immediate dominator walk. llvm-svn: 37500	2007-06-07 21:57:03 +00:00
Devang Patel	af41e4a192	Maintain ETNode as part of DomTreeNode. This adds redundancy for now. llvm-svn: 37492	2007-06-07 17:47:21 +00:00
Devang Patel	ebc5b96735	s/DominatorTree::createNewNode/DominatorTree::addNewBlock/g llvm-svn: 37415	2007-06-04 16:43:25 +00:00
Devang Patel	a89566aefd	Add basic block level interface to change immediate dominator and create new node. llvm-svn: 37414	2007-06-04 16:22:33 +00:00
Devang Patel	bdd1aaef10	s/llvm::DominatorTreeBase::DomTreeNode/llvm::DomTreeNode/g llvm-svn: 37407	2007-06-04 00:32:22 +00:00
Devang Patel	0e8aa7b69a	s/DominatorTreeBase::Node/DominatorTreeBase:DomTreeNode/g llvm-svn: 37403	2007-06-03 06:26:14 +00:00
Dan Gohman	30978078bf	Minor comment cleanups. llvm-svn: 37321	2007-05-24 14:36:04 +00:00
Dan Gohman	b5650ebd6a	Fix typos. llvm-svn: 36994	2007-05-11 21:10:54 +00:00
Nick Lewycky	e7da2d6ac3	Fix typo in comment. llvm-svn: 36873	2007-05-06 13:37:16 +00:00
Devang Patel	8c78a0bff0	Drop 'const' llvm-svn: 36662	2007-05-03 01:11:54 +00:00
Devang Patel	e95c6ad802	Use 'static const char' instead of 'static const int'. Due to darwin gcc bug, one version of darwin linker coalesces static const int, which defeats PassID based pass identification. llvm-svn: 36652	2007-05-02 21:39:20 +00:00
Devang Patel	09f162ca6a	Do not use typeinfo to identify pass in pass manager. llvm-svn: 36632	2007-05-01 21:15:47 +00:00
Devang Patel	d3ccc073a2	Mem2Reg does not need TargetData. llvm-svn: 36444	2007-04-25 18:32:35 +00:00
Devang Patel	073be55d8e	Remove unused function argument. llvm-svn: 36441	2007-04-25 17:15:20 +00:00
Owen Anderson	2965adb849	Fix a comment. llvm-svn: 36299	2007-04-21 07:12:44 +00:00
Jeff Cohen	5959f42498	Comment out usage of write() for now. llvm-svn: 36287	2007-04-20 22:40:10 +00:00
Devang Patel	83a3adcc3f	Avoid recursion. llvm-svn: 36272	2007-04-20 20:04:37 +00:00
Owen Anderson	2da606c757	Move more passes to using ETForest instead of DominatorTree. llvm-svn: 36271	2007-04-20 06:27:13 +00:00
Evan Cheng	db9b65d67a	Revert Owen's last check-in. This is breaking Mac OS X / PPC llvm-gcc bootstrap. llvm-svn: 36258	2007-04-18 22:39:00 +00:00
Owen Anderson	08293fd6d1	Use new ETForest accessor. llvm-svn: 36248	2007-04-18 04:46:35 +00:00
Owen Anderson	f38f2f2394	Use ETForest instead of DominatorTree. llvm-svn: 36247	2007-04-18 04:39:32 +00:00
Chris Lattner	233f97ac6a	remove use of BasicBlock::getNext llvm-svn: 36205	2007-04-17 18:09:47 +00:00
Chris Lattner	77a3edcb92	remove use of Instruction::getNext llvm-svn: 36199	2007-04-17 17:47:54 +00:00
Anton Korobeynikov	fb80151c42	Removed tabs everywhere except autogenerated & external files. Add make target for tabs checking. llvm-svn: 36146	2007-04-16 18:10:23 +00:00
Chris Lattner	343c88cdb9	Fix PR1335 and Transforms/Inline/2007-04-15-InlineEH.ll llvm-svn: 36090	2007-04-15 21:38:06 +00:00
Owen Anderson	f35a1dbc7a	Remove ImmediateDominator analysis. The same information can be obtained from DomTree. A lot of code for constructing ImmediateDominator is now folded into DomTree construction. This is part of the ongoing work for PR217. llvm-svn: 36063	2007-04-15 08:47:27 +00:00
Chris Lattner	a6b5660209	avoid copying sets and vectors around. llvm-svn: 36017	2007-04-14 22:10:17 +00:00
Lauro Ramos Venancio	749e4668e7	Implement the "thread_local" keyword. llvm-svn: 35950	2007-04-12 18:32:50 +00:00
Owen Anderson	3c7867935e	Re-constify things that don't break the build. Last patch in this series, I promise. llvm-svn: 35848	2007-04-09 23:38:18 +00:00
Owen Anderson	f1ca1376d3	Unconst-ify stuff that broke the build. llvm-svn: 35843	2007-04-09 23:08:26 +00:00
Owen Anderson	5917716146	Const-ify some parameters, and some cosmetic cleanups. No functionality change. llvm-svn: 35842	2007-04-09 22:54:50 +00:00
Owen Anderson	e0ef5ac6bd	Tabs -> Spaces llvm-svn: 35841	2007-04-09 22:31:43 +00:00
Owen Anderson	83efbc84f7	Improve some _slow_ behavior introduced in my patches the last few days. llvm-svn: 35839	2007-04-09 22:25:09 +00:00
Owen Anderson	ae39ca037a	Cleanup some from my DomSet-removal changes. Add a new isReachableFromEntry test to ETForest to factor a common test out of code. llvm-svn: 35786	2007-04-09 00:52:49 +00:00
Nick Lewycky	e6c64466c7	Remove DominatorSet usage from LoopSimplify. Patch from Owen Anderson. llvm-svn: 35757	2007-04-08 01:04:30 +00:00
Owen Anderson	f7ebea1b9f	Add DomSet back, and revert the changes to LoopSimplify. Apparently the ETForest updating mechanisms don't work as I thought they did. These changes will be reapplied once the issue is worked out. llvm-svn: 35741	2007-04-07 18:23:27 +00:00
Owen Anderson	706e97049d	Completely purge DomSet from LoopSimplify. This is part of the continuing work on PR1171. llvm-svn: 35730	2007-04-07 06:56:47 +00:00
Owen Anderson	d03a646f06	BreakCriticalEdges does still preserve DominatorTree. llvm-svn: 35729	2007-04-07 05:57:09 +00:00
Owen Anderson	b39d9ca902	Expunge DomSet from BreakCriticalEdges. This is part of the continuing work for PR 1171. llvm-svn: 35728	2007-04-07 05:49:29 +00:00
Owen Anderson	f095bf3ac4	Expunge DomSet from CodeExtractor. This is part of the continuing work on PR1171. llvm-svn: 35726	2007-04-07 05:31:27 +00:00
Owen Anderson	910419596e	Expunge a bunch of uses of DomSet from LoopSimplify. Many more remain. This is the beginning of work for PR1171. llvm-svn: 35720	2007-04-07 04:37:14 +00:00
Chris Lattner	b7b75145f1	reduce use of std::set llvm-svn: 35576	2007-04-02 01:44:59 +00:00
Devang Patel	4398e242dd	Reduce malloc/free traffic. llvm-svn: 35370	2007-03-26 23:19:29 +00:00
Dan Gohman	dcb291faa4	Change uses of Function::front to Function::getEntryBlock for readability. llvm-svn: 35265	2007-03-22 16:38:57 +00:00
Devang Patel	1758cb50de	LoopSimplify::FindPHIToPartitionLoops() Use ETForest instead of DominatorSet. llvm-svn: 35221	2007-03-20 20:18:12 +00:00
Jeff Cohen	00227417d2	Unbreak VC++ build. Do not use identifiers starting with _ as they are reserved and can collide with system defined names. Windows defines _BB, for example. llvm-svn: 35066	2007-03-12 17:56:27 +00:00
Anton Korobeynikov	8a6dc102d3	Use range tests in LowerSwitch, where possible llvm-svn: 35057	2007-03-10 16:46:28 +00:00
Devang Patel	5f50e61d52	Remove dead comments. llvm-svn: 35053	2007-03-09 23:41:03 +00:00
Devang Patel	bda1250624	Avoid recursion. Use iterative algorithm for RenamePass(). llvm-svn: 35052	2007-03-09 23:39:14 +00:00
Reid Spencer	dec03a08d6	Make sure debug code is not evaluated in non-debug case. llvm-svn: 34856	2007-03-02 23:15:21 +00:00
Reid Spencer	1e102971d2	1. Sort switch cases using APInt safe comparison. 2. Make sure debug output of APInt values is safe for all bit widths. llvm-svn: 34855	2007-03-02 23:05:28 +00:00
Reid Spencer	43376a74af	Use APInt safe isOne() method on ConstantInt instead of getZExtValue()==1 llvm-svn: 34854	2007-03-02 23:03:17 +00:00
Reid Spencer	bb38d79ad6	Make sorting of ConstantInt be APInt clean through use of ult function. llvm-svn: 34853	2007-03-02 23:01:14 +00:00
Chris Lattner	4bd8cda3f0	switch the inliner from being recursive to being iterative. llvm-svn: 34832	2007-03-02 03:11:20 +00:00
Chris Lattner	1e48acb858	fix an obscure and tricky bug the inliner can hit sometimes. llvm-svn: 34531	2007-02-23 19:54:30 +00:00
Jim Laskey	d879dfbf1c	Revert changes for a simpler solution. llvm-svn: 34495	2007-02-22 16:21:18 +00:00
Jim Laskey	e4ccf22c34	Itanium ABI exception handling support. llvm-svn: 34480	2007-02-21 22:49:50 +00:00
Dan Gohman	8c8597c4d9	Fix typos in comments. llvm-svn: 34456	2007-02-20 20:52:03 +00:00
Chris Lattner	b5f6d0c15a	eliminate use of deprecated apis llvm-svn: 34417	2007-02-19 07:34:47 +00:00
Reid Spencer	d84d35ba70	For PR1195: Rename PackedType -> VectorType, ConstantPacked -> ConstantVector, and PackedTyID -> VectorTyID. No functional changes. llvm-svn: 34293	2007-02-15 02:26:10 +00:00
Chris Lattner	a06a8fd2d7	Eliminate use of ctors that take vectors. llvm-svn: 34219	2007-02-13 02:10:56 +00:00
Chris Lattner	a731513406	stop using methods that take vectors. llvm-svn: 34205	2007-02-12 22:56:41 +00:00
Chris Lattner	8dd4cae4f8	simplify code by using Value::takeName llvm-svn: 34177	2007-02-11 01:37:51 +00:00
Chris Lattner	430c9217f0	redesign the primary datastructure used by mem2reg to eliminate an std::map of std::vector's (ouch!). This speeds up mem2reg by 10% on 176.gcc. llvm-svn: 33974	2007-02-07 01:15:04 +00:00
Chris Lattner	c85e79f3e0	With the last change, we no longer need both directions of mapping from BBNumbers. Instead of using a bi-directional mapping, just use a single densemap. This speeds up mem2reg on 176.gcc by 8%, from 1.3489 to 1.2485s. llvm-svn: 33940	2007-02-05 23:37:20 +00:00
Reid Spencer	557ab15e71	Apply the VISIBILITY_HIDDEN field to the remaining anonymous classes in the Transforms library. This reduces debug library size by 132 KB, debug binary size by 376 KB, and reduces link time for llvm tools slightly. llvm-svn: 33939	2007-02-05 23:32:05 +00:00
Chris Lattner	52da61fb5c	Simplify use of DFBlocks, this makes no noticeable performance difference, but paves the way to eliminate BBNumbers. llvm-svn: 33938	2007-02-05 23:31:26 +00:00
Chris Lattner	bf67b1229b	Switch InsertedPHINodes back to SmallPtrSet now that the SmallPtrSet::erase bug is fixed. llvm-svn: 33932	2007-02-05 23:11:37 +00:00
Chris Lattner	606dde0093	switch a SmallPtrSet back to an std::set for now, this caused problems. llvm-svn: 33930	2007-02-05 22:28:52 +00:00
Chris Lattner	1ed84bbd2d	switch an std::set over to a SmallPtrSet, speeding up mem2reg 6% on 176.gcc. llvm-svn: 33929	2007-02-05 22:15:21 +00:00
Chris Lattner	70fbb9de4c	switch an std::set over to SmallPtrSet, speeding up mem2reg 3.4% on 176.gcc. llvm-svn: 33928	2007-02-05 22:13:11 +00:00
Chris Lattner	8fbc888d91	eliminate some malloc traffic, this speeds up mem2reg by 3.4%. llvm-svn: 33927	2007-02-05 21:58:48 +00:00
Reid Spencer	3aaaa0b2bd	For PR411: This patch replaces the SymbolTable class with ValueSymbolTable which does not support types planes. This means that all symbol names in LLVM must now be unique. The patch addresses the necessary changes to deal with this and removes code no longer needed as a result. This completes the bulk of the changes for this PR. Some cleanup patches will follow. llvm-svn: 33918	2007-02-05 20:47:22 +00:00
Reid Spencer	a1d35926b7	For PR1177: Revert last patch which caused iteration invalidation. llvm-svn: 33901	2007-02-05 05:23:32 +00:00
Owen Anderson	f6fa108993	Use DenseMap for pointer->pointer maps. llvm-svn: 33897	2007-02-05 02:39:47 +00:00
Reid Spencer	3f4e6e84dc	For PR1163: Make the Module's dependent library use a std::vector instead of SetVector adjust #includes in .cpp files because SetVector.h is no longer included. llvm-svn: 33855	2007-02-04 00:40:42 +00:00
Chris Lattner	1bfc7ab6a7	Switch inliner over to use DenseMap instead of std::map for ValueMap. This speeds up the inliner 16%. llvm-svn: 33801	2007-02-03 00:08:31 +00:00
Chris Lattner	ce494229a1	Fix bugs in the inliner having to do with single-entry phi nodes and valuemap updating. These were exposed by Devang's recent passmgr changes (with non-default passorderings) because now the inliner can be interleaved with the LCSSA pass. llvm-svn: 33760	2007-02-01 18:48:38 +00:00
Chris Lattner	7a63e7a7ad	eliminate temporary vectors llvm-svn: 33713	2007-01-31 20:07:32 +00:00
Chris Lattner	024f4ab383	Adjust #includes to match movement of constant folding code from transformutils to libanalysis. llvm-svn: 33680	2007-01-30 23:46:24 +00:00
Chris Lattner	2ae054adb0	move a bunch of constant folding code from Transforms/Utils/Local.cpp into libanalysis/ConstantFolding.cpp. llvm-svn: 33679	2007-01-30 23:45:45 +00:00
Chris Lattner	14789a92e1	remove now-dead code. llvm-svn: 33678	2007-01-30 23:29:47 +00:00
Chris Lattner	ad84a730ba	The inliner/cloner can now optionally take TargetData info, which can be used by constant folding. llvm-svn: 33676	2007-01-30 23:22:39 +00:00
Chris Lattner	2c4610e4ca	Change constant folding APIs to take an optional TargetData, and change ConstantFoldInstOperands/ConstantFoldCall to take a pointer to an array of operands + size, instead of an std::vector. In some cases, switch to using a SmallVector instead of a vector. This allows us to get rid of some special case gross code that was there to avoid the cost of constructing a vector. llvm-svn: 33670	2007-01-30 23:13:49 +00:00
Reid Spencer	5301e7c605	For PR1136: Rename GlobalVariable::isExternal as isDeclaration to avoid confusion with external linkage types. llvm-svn: 33663	2007-01-30 20:08:39 +00:00
Reid Spencer	3ac38e99b9	For PR761: The Module::setEndianness and Module::setPointerSize methods have been removed. Instead you can get/set the DataLayout. Adjust these accordingly. llvm-svn: 33530	2007-01-26 08:11:39 +00:00
Devang Patel	5292e65791	Inherit BasicBlockPass directly from Pass. llvm-svn: 33511	2007-01-25 23:23:25 +00:00
Reid Spencer	a94d394ad2	For PR1043: This is the final patch for this PR. It implements some minor cleanup in the use of IntegerType, to wit: 1. Type::getIntegerTypeMask -> IntegerType::getBitMask 2. Type::IntTy changed to IntegerType from Type* 3. ConstantInt::getType() returns IntegerType* now, not Type* This also fixes PR1120. Patch by Sheng Zhou. llvm-svn: 33370	2007-01-19 21:13:56 +00:00
Chris Lattner	03c4953cdd	rename Type::isIntegral to Type::isInteger, eliminating the old Type::isInteger. rename Type::getIntegralTypeMask to Type::getIntegerTypeMask. This makes naming much more consistent. For example, there are now no longer any instances of IntegerType that are not considered isInteger! :) llvm-svn: 33225	2007-01-15 02:27:26 +00:00
Chris Lattner	1942249c5b	Eliminate calls to isInteger, generalizing code and tightening checks as needed. llvm-svn: 33218	2007-01-15 01:55:30 +00:00
Chris Lattner	f739d01059	Fix Analysis/Dominators/2006-10-02-BreakCritEdges.ll llvm-svn: 33210	2007-01-15 00:15:09 +00:00
Chris Lattner	9818a6fd76	Fix PR1110 and Analysis/Dominators/2007-01-14-BreakCritEdges.ll by being more careful about unreachable code when updating dominator info. llvm-svn: 33204	2007-01-14 18:33:35 +00:00
Reid Spencer	cddc9dfe97	Implement review feedback for the ConstantBool->ConstantInt merge. Chris recommended that getBoolValue be replaced with getZExtValue and that get(bool) be replaced by get(const Type*, uint64_t). This implements those changes. llvm-svn: 33110	2007-01-12 04:24:46 +00:00
Reid Spencer	542964f55b	Rename BoolTy as Int1Ty. Patch by Sheng Zhou. llvm-svn: 33076	2007-01-11 18:21:29 +00:00
Zhou Sheng	75b871fb1e	For PR1043: Merge ConstantIntegral and ConstantBool into ConstantInt. Remove ConstantIntegral and ConstantBool from LLVM. llvm-svn: 33073	2007-01-11 12:24:14 +00:00
Chris Lattner	34acba48cc	Change the interface to Module::getOrInsertFunction to be easier to use, to resolve PR1088, and to help PR411. This simplifies many clients also llvm-svn: 32989	2007-01-07 08:12:01 +00:00
Chris Lattner	d97f1936bb	prepare for adjustment to getOrInsertFunction method llvm-svn: 32985	2007-01-07 07:54:34 +00:00
Reid Spencer	32af9e8cc5	For PR411: Take an incremental step towards type plane elimination. This change separates types from values in the symbol tables by finally making use of the TypeSymbolTable class. This yields more natural interfaces for dealing with types and unclutters the SymbolTable class. llvm-svn: 32956	2007-01-06 07:24:44 +00:00
Reid Spencer	c635f47d9a	For PR950: This patch replaces signed integer types with signless ones: 1. [US]Byte -> Int8 2. [U]Short -> Int16 3. [U]Int -> Int32 4. [U]Long -> Int64. 5. Removal of isSigned, isUnsigned, getSignedVersion, getUnsignedVersion and other methods related to signedness. In a few places this warranted identifying the signedness information from other sources. llvm-svn: 32785	2006-12-31 05:48:39 +00:00
Reid Spencer	266e42b312	For PR950: This patch removes the SetCC instructions and replaces them with the ICmp and FCmp instructions. The SetCondInst instruction has been removed and been replaced with ICmpInst and FCmpInst. llvm-svn: 32751	2006-12-23 06:05:41 +00:00
Chris Lattner	45f966d80f	switch more statistics over to STATISTIC, eliminating static ctors. Also, delete some dead ones. llvm-svn: 32694	2006-12-19 22:17:40 +00:00
Bill Wendling	a77f14265b	Added an automatic cast to "std::ostream" etc. from OStream. We then can rework the hacks that had us passing OStream in. We pass in std::ostream instead, check for null, and then dispatch to the correct print() method. llvm-svn: 32636	2006-12-17 05:15:13 +00:00
Reid Spencer	bfe26ffcfc	Replace CastInst::createInferredCast calls with more accurate cast creation calls. llvm-svn: 32521	2006-12-13 00:50:17 +00:00
Reid Spencer	41cb269a2b	Fix the casting for the computation of the Malloc size. llvm-svn: 32477	2006-12-12 09:17:08 +00:00
Reid Spencer	b341b0861d	Change inferred getCast into specific getCast. Passes all tests. llvm-svn: 32469	2006-12-12 05:05:00 +00:00
Bill Wendling	f3baad3ee1	Changed llvm_ostream et al. to OStream. llvm_cerr, llvm_cout, llvm_null, are now cerr, cout, and NullStream resp. llvm-svn: 32298	2006-12-07 01:30:32 +00:00
Chris Lattner	700b873130	Detemplatize the Statistic class. The only type it is instantiated with is 'unsigned'. llvm-svn: 32279	2006-12-06 17:46:33 +00:00
Reid Spencer	6c38f0bb07	For PR950: The long awaited CAST patch. This introduces 12 new instructions into LLVM to replace the cast instruction. Corresponding changes throughout LLVM are provided. This passes llvm-test, llvm/test, and SPEC CPUINT2000 with the exception of 175.vpr which fails only on a slight floating point output difference. llvm-svn: 31931	2006-11-27 01:05:10 +00:00
Bill Wendling	4ae401074c	Remove #include <iostream> and use llvm_* streams instead. llvm-svn: 31925	2006-11-26 10:17:54 +00:00
Chris Lattner	95adf8f1da	Do not convert massive blocks on phi nodes into select statements. Instead only do these transformations if there are a small number of phi's. This speeds up Ptrdist/ks from 2.35s to 2.19s on my mac pro. llvm-svn: 31853	2006-11-18 19:19:36 +00:00
Jim Laskey	61feeb90f9	Remove redundant <cmath>. llvm-svn: 31561	2006-11-08 19:16:44 +00:00
Reid Spencer	fdff938a7e	For PR950: This patch converts the old SHR instruction into two instructions, AShr (Arithmetic) and LShr (Logical). The Shr instructions now are not dependent on the sign of their operands. llvm-svn: 31542	2006-11-08 06:47:33 +00:00
Jeff Cohen	7d6f3db3e2	Unbreak VC++ build. llvm-svn: 31464	2006-11-05 19:31:28 +00:00
Reid Spencer	de46e48420	For PR786: Turn on -Wunused and -Wno-unused-parameter. Clean up most of the resulting fall out by removing unused variables. Remaining warnings have to do with unused functions (I didn't want to delete code without review) and unused variables in generated code. Maintainers should clean up the remaining issues when they see them. All changes pass DejaGnu tests and Olden. llvm-svn: 31380	2006-11-02 20:25:50 +00:00
Chris Lattner	984d6e1669	generalize the fix for PR977 to also fix Transforms/LCSSA/2006-10-31-UnreachableBlock-2.ll llvm-svn: 31317	2006-10-31 18:56:48 +00:00
Chris Lattner	eb68f080ef	Fix PR977 and Transforms/LCSSA/2006-10-31-UnreachableBlock.ll llvm-svn: 31315	2006-10-31 17:52:18 +00:00
Chris Lattner	fc519cd2d1	Fix SimplifyCFG/2006-10-29-InvokeCrash.ll, a crash compiling QT. llvm-svn: 31284	2006-10-29 21:21:20 +00:00
Chris Lattner	3e763f5708	add option to isCriticalEdge llvm-svn: 31258	2006-10-28 06:58:17 +00:00
Chris Lattner	80ea207bfa	Expose a smarter way to break critical edges. llvm-svn: 31256	2006-10-28 06:44:56 +00:00
Reid Spencer	e0fc4dfc22	For PR950: This patch implements the first increment for the Signless Types feature. All changes pertain to removing the ConstantSInt and ConstantUInt classes in favor of just using ConstantInt. llvm-svn: 31063	2006-10-20 07:07:24 +00:00
Chris Lattner	b8b11599dd	Fix SimplifyCFG/2006-10-19-UncondDiv.ll by disabling a bad xform. llvm-svn: 31061	2006-10-20 00:42:07 +00:00
Chris Lattner	52886e72d7	This case isn't implemented yet. It seems unlikely to be needed, but if it ever is, we want to get an assert instead of silent bad codegen. llvm-svn: 30716	2006-10-04 04:58:58 +00:00
Chris Lattner	8aca0ee8c3	Fix PR932 and Analysis/Dominators/2006-10-02-BreakCritEdges.ll: The critical edge block dominates the dest block if the destblock dominates all edges other than the one incoming from the critical edge. llvm-svn: 30696	2006-10-03 07:02:02 +00:00
Chris Lattner	525804f31e	simplify code llvm-svn: 30656	2006-09-28 22:58:25 +00:00
Chris Lattner	6bd6da4097	Be far more careful when splitting a loop header, either to form a preheader or when splitting loops with a common header into multiple loops. In particular the old code would always insert the preheader before the old loop header. This is disastrous in cases where the loop hasn't been rotated. For example, it can produce code like: .. outside the loop... jmp LBB1_2 #bb13.outer LBB1_1: #bb1 movsd 8(%esp,%esi,8), %xmm1 mulsd (%edi), %xmm1 addsd %xmm0, %xmm1 addl $24, %edi incl %esi jmp LBB1_3 #bb13 LBB1_2: #bb13.outer leal (%edx,%eax,8), %edi pxor %xmm1, %xmm1 xorl %esi, %esi LBB1_3: #bb13 movapd %xmm1, %xmm0 cmpl $4, %esi jl LBB1_1 #bb1 Note that the loop body is actually LBB1_1 + LBB1_3, which means that the loop now contains an uncond branch WITHIN it to jump around the inserted loop header (LBB1_2). Doh. This patch changes the preheader insertion code to insert it in the right spot, producing this code: ... outside the loop, fall into the header ... LBB1_1: #bb13.outer leal (%edx,%eax,8), %esi pxor %xmm0, %xmm0 xorl %edi, %edi jmp LBB1_3 #bb13 LBB1_2: #bb1 movsd 8(%esp,%edi,8), %xmm0 mulsd (%esi), %xmm0 addsd %xmm1, %xmm0 addl $24, %esi incl %edi LBB1_3: #bb13 movapd %xmm0, %xmm1 cmpl $4, %edi jl LBB1_2 #bb1 Totally crazy, no branch in the loop! :) llvm-svn: 30587	2006-09-23 08:19:21 +00:00
Chris Lattner	608cd05e3f	Teach UpdateDomInfoForRevectoredPreds to handle revectored preds that are not reachable, making it general purpose enough for use by InsertPreheaderForLoop. Eliminate custom dominfo updating code in InsertPreheaderForLoop, using UpdateDomInfoForRevectoredPreds instead. llvm-svn: 30586	2006-09-23 07:40:52 +00:00
Chris Lattner	237ccf2a51	Second half of the fix for Transforms/Inline/inline_cleanup.ll This folds unconditional branches that are often produced by code specialization. llvm-svn: 30307	2006-09-13 21:27:00 +00:00
Chris Lattner	6ef6d06d21	Implement the first half of Transforms/Inline/inline_cleanup.ll llvm-svn: 30303	2006-09-13 19:23:57 +00:00
Chris Lattner	845b223da4	Fix Duraid's changes to work when TLI is null. This fixes the failing lowerinvoke regtests. llvm-svn: 30115	2006-09-05 17:48:07 +00:00
Duraid Madina	cf6749e4c0	add setJumpBufSize() and setJumpBufAlignment() to target-lowering. Call these from your backend to enjoy setjmp/longjmp goodness, see lib/Target/IA64/IA64ISelLowering.cpp for an example llvm-svn: 30095	2006-09-04 06:21:35 +00:00
Chris Lattner	c2d3d3112e	eliminate RegisterOpt. It does the same thing as RegisterPass. llvm-svn: 29925	2006-08-27 22:42:52 +00:00
Chris Lattner	3d27be1333	s\|llvm/Support/Visibility.h\|llvm/Support/Compiler.h\| llvm-svn: 29911	2006-08-27 12:54:02 +00:00
Chris Lattner	f18b396cc2	Don't attempt to split subloops out of a loop with a huge number of backedges. Not only will this take huge amounts of compile time, the resultant loop nests won't be useful for optimization. This reduces loopsimplify time on Transforms/LoopSimplify/2006-08-11-LoopSimplifyLongTime.ll from ~32s to ~0.4s with a debug build of llvm on a 2.7Ghz G5. llvm-svn: 29647	2006-08-12 05:25:00 +00:00
Chris Lattner	85d9944f9a	Reimplement the loopsimplify code which deletes edges from unreachable blocks that target loop blocks. Before, the code was run once per loop, and depended on the number of predecessors each block in the loop had. Unfortunately, scanning preds can be really slow when huge numbers of phis exist or when phis with huge numbers of inputs exist. Now, the code is run once per function and scans successors instead of preds, which is far faster. In addition, the new code is simpler and is goto free, woo. This change speeds up a nasty testcase Duraid provided me from taking hours to taking ~72s with a debug build. The functionality this implements is already tested in the testsuite as Transforms/CodeExtractor/2004-03-13-LoopExtractorCrash.ll. llvm-svn: 29644	2006-08-12 04:51:20 +00:00
Chris Lattner	c9009d917d	Fix PR867 (and maybe 868) and testcase: Transforms/SimplifyCFG/2006-08-03-Crash.ll llvm-svn: 29515	2006-08-03 21:40:24 +00:00
Chris Lattner	38b6e8382a	Add special check to avoid isLoop call. Simple, but doesn't seem to speed up lcssa much in practice. llvm-svn: 29465	2006-08-02 00:16:47 +00:00
Chris Lattner	5a2bc786be	Replace the SSA update code in LCSSA with a bottom-up approach instead of a top down approach, inspired by discussions with Tanya. This approach is significantly faster, because it does not need dominator frontiers and it does not insert extraneous unused PHI nodes. For example, on 252.eon, in a release-asserts build, this speeds up LCSSA (which is the slowest pass in gccas) from 9.14s to 0.74s on my G5. This code is also slightly smaller and significantly simpler than the old code. Amusingly, in a normal Release build (which includes the "assert(L->isLCSSAForm());" assertion), asserting that the result of LCSSA is in LCSSA form is actually slower than the LCSSA transformation pass itself on 252.eon. I will see if Loop::isLCSSAForm can be sped up next. llvm-svn: 29463	2006-08-02 00:06:09 +00:00
Chris Lattner	85ea83e821	Add some advice llvm-svn: 29324	2006-07-27 04:24:14 +00:00
Chris Lattner	fea3974133	silence warnings in a release build llvm-svn: 29189	2006-07-18 21:48:57 +00:00
Chris Lattner	19247f36ea	eliminate some ugly code, using ConstantExpr::getWithOperands instead. llvm-svn: 29149	2006-07-14 22:21:31 +00:00
Chris Lattner	b3c64f7ab3	Handle instructions in the map, but that map to a null pointer. This unbreaks smg2000. llvm-svn: 29127	2006-07-12 21:37:11 +00:00
Chris Lattner	6148456ec2	In addition to deleting calls, the inliner can constant fold them as well. Handle this case, which doesn't require a new callgraph edge. This fixes a crash compiling MallocBench/gs. llvm-svn: 29121	2006-07-12 18:37:18 +00:00
Chris Lattner	5de3b8b262	Change the callgraph representation to store the callsite along with the target CG node. This allows the inliner to properly update the callgraph when using the pruning inliner. The pruning inliner may not copy over all call sites from a callee to a caller, so the edges corresponding to those call sites should not be copied over either. This fixes PR827 and Transforms/Inline/2006-07-12-InlinePruneCGUpdate.ll llvm-svn: 29120	2006-07-12 18:29:36 +00:00
Owen Anderson	fe6e97d275	Fix typo in the comment. llvm-svn: 29078	2006-07-09 21:35:40 +00:00
Owen Anderson	aecaabb6e1	Add a fix for an issue where LCSSA would fail to insert undef's in some corner cases. Ideally, this issue will go away in the future as LCSSA gets smarter about which Phi nodes it inserts. llvm-svn: 29076	2006-07-09 08:14:06 +00:00
Chris Lattner	996795b0dd	Use hidden visibility to make symbols in an anonymous namespace get dropped. This shrinks libllvmgcc.dylib another 67K llvm-svn: 28975	2006-06-28 23:17:24 +00:00
Chris Lattner	e3abb14503	Use the PotDoms map to memoize 'dominating value' lookup. With this patch, LCSSA is still the slowest pass when gccas'ing 252.eon, but now it only takes 39s instead of 289s. :) llvm-svn: 28776	2006-06-14 01:13:57 +00:00
Owen Anderson	e714a5c549	Fix another instance where PHI nodes need special treatment. llvm-svn: 28774	2006-06-13 20:50:09 +00:00
Owen Anderson	3f8ff0449a	Fix a bug that was causing major slowdowns in povray. This was due to LCSSA not handling PHI nodes correctly when determining if a value was live-out. This patch reduces the number of detected live-out variables in the testcase from 6565 to 485. llvm-svn: 28771	2006-06-13 19:37:18 +00:00
Chris Lattner	b5c9d7a0af	Fix an infinite loop on Transforms/SimplifyCFG/2006-06-12-InfLoop.ll llvm-svn: 28758	2006-06-12 20:18:01 +00:00
Owen Anderson	0ac336965e	Fix for 2006-06-26-MultipleExitsSingleBlock. If a single exit block has multiple predecessors within the loop, it will appear in the exit blocks list more than once. LCSSA needs to take that into account so that it doesn't double process that exit block. llvm-svn: 28750	2006-06-12 07:10:16 +00:00
Owen Anderson	b538f14d2a	Re-commit the safe parts of my 6/9 patch. Still working on fixing the unsafe parts. llvm-svn: 28748	2006-06-11 19:22:28 +00:00
Evan Cheng	1b6e310e6f	Back out Owen's 6/9 changes. They broke MultiSource/Benchmarks/Prolangs-C/bison (and perhaps others). llvm-svn: 28747	2006-06-11 09:32:57 +00:00
Owen Anderson	505adff3f0	Make Loop able to verify that it is in LCSSA-form, and have the LCSSA pass assert on this. llvm-svn: 28738	2006-06-09 18:33:30 +00:00
Owen Anderson	5d029264ec	Update some comments, and expose LCSSAID in preparation for having other passes require LCSSA. llvm-svn: 28734	2006-06-08 20:02:53 +00:00
Owen Anderson	ac601b4c4b	Fix some formatting, and use inLoop() when appropriate. llvm-svn: 28694	2006-06-06 04:36:36 +00:00
Owen Anderson	9e81c1bb03	Stop a memory leak, and update some comments. llvm-svn: 28693	2006-06-06 04:28:30 +00:00
Owen Anderson	766f90b08e	Some more clean-up, and squash an IDF-Phi related bug. llvm-svn: 28680	2006-06-04 00:55:19 +00:00
Owen Anderson	eb33815f1b	Various clean-ups suggested by Chris. llvm-svn: 28678	2006-06-04 00:02:23 +00:00
Owen Anderson	d00eacc4f9	Fix a bug in Phi-node insertion. Also, update some comments to reflect what's actually going on. llvm-svn: 28677	2006-06-03 23:22:50 +00:00
Chris Lattner	02e0b4ddb7	Force anything that #includes llvm/Transforms/Utils/UnifyFunctionExitNodes.h to link in the implementation. Thanks to Anton Korobeynikov for figuring out what was going on here. llvm-svn: 28660	2006-06-02 18:40:06 +00:00
Chris Lattner	cdf2b1fc30	Remove dead #include llvm-svn: 28642	2006-06-01 20:02:28 +00:00
Chris Lattner	cc340c02a4	Make the "pruning cloner" smarter. As it propagates constants through the code (while cloning) it often gets the branch/switch instructions. Since it knows that edges of the CFG are dead, it need not clone (or even look) at the obviously dead blocks. This should speed up the inliner substantially on code where there are lots of inlinable calls to functions with constant arguments. On C++ code in particular, this kicks in. llvm-svn: 28641	2006-06-01 19:19:23 +00:00
Owen Anderson	619e4ba57f	Remove a FIXME that was fixed with my last patch. llvm-svn: 28619	2006-06-01 06:07:40 +00:00
Owen Anderson	cd76fa04a1	More cleanups. Also, add a special case for updating PHI nodes, and reimplement getValueDominatingFunction to walk the DominanceTree rather than just searching blindly. llvm-svn: 28618	2006-06-01 06:05:47 +00:00
Owen Anderson	dad8c57340	Extract a huge loop into a helper method. Fix a few iterator-invalidation bugs. llvm-svn: 28599	2006-05-31 20:55:06 +00:00
Owen Anderson	8a8f278f15	Add Use replacement. Assuming there is nothing horribly wrong with this, LCSSA is now theoretically feature-complete. It has not, however, been thoroughly tested, and is still considered experimental. llvm-svn: 28529	2006-05-29 01:00:00 +00:00
Owen Anderson	152d063ccb	Major think-o. Iterate over all live out-of-loop values, and perform the other calculations on each individually, rather than trying to delay it and do them all at the end. llvm-svn: 28527	2006-05-28 19:33:28 +00:00
Owen Anderson	1310e42803	Make LCSSA insert proper Phi nodes throughout the rest of the CFG by computing the iterated Dominance Frontier of the loop-closure Phi's. This is the second phase of the LCSSA pass. The third phase (coming soon) will be to update all uses of loop variables to use the loop-closure Phi's instead. llvm-svn: 28524	2006-05-27 18:47:11 +00:00
Chris Lattner	67c424e010	Fix some regression from the inliner patch I committed last night. This fixes ldecod, lencod, and SPASS. llvm-svn: 28523	2006-05-27 17:28:13 +00:00
Chris Lattner	be853d77e9	Switch the inliner over to using CloneAndPruneFunctionInto. This effectively makes it so that it constant folds instructions on the fly. This is good for several reasons: 0. Many instructions are constant foldable after inlining, particularly if inlining a call with constant arguments. 1. Without this, the inliner has to allocate memory for all of the instructions that can be constant folded, then a subsequent pass has to delete them. This gets the job done without this extra work. 2. This makes the inliner pass a bit more aggressive: in particular, it partially solves a phase order issue where the inliner would inline lots of code that folds away to nothing, but think that the resultant function is big because of this code that will be gone. Now the code never exists. This is the first part of a 2-step process. The second part will be smart enough to see when this implicit constant folding propagates a constant into a branch or switch instruction, making CFG edges dead. This implements Transforms/Inline/inline_constprop.ll llvm-svn: 28521	2006-05-27 01:28:04 +00:00
Chris Lattner	3df13f4f22	Implement a new method, CloneAndPruneFunctionInto, as documented. llvm-svn: 28519	2006-05-27 01:22:24 +00:00
Chris Lattner	bc3c879fcf	Refactor some code to expose an interface to constant fold an instruction given its opcode, type, and operands. llvm-svn: 28517	2006-05-27 01:18:04 +00:00
Owen Anderson	b4e16996f1	A few small clean-ups, and the addition of an LCSSA statistic. llvm-svn: 28512	2006-05-27 00:31:37 +00:00
Owen Anderson	6e047ab8fc	Fix a copy-and-paste-o that would break some compilers. llvm-svn: 28507	2006-05-26 21:19:17 +00:00
Owen Anderson	f3dd3e2bfd	Clean up and refactor LCSSA a bunch. It should also run faster now, though there's still a lot of work to be done on it. llvm-svn: 28506	2006-05-26 21:11:53 +00:00
Owen Anderson	8eca8910b6	Skeletal LCSSA pass. This is currently non-functional. Expect functionality and documentation updates soon. llvm-svn: 28495	2006-05-26 13:58:26 +00:00
Chris Lattner	0853700582	Revert a patch that is unsafe, due to out of range array accesses in inner array scopes possibly accessing valid memory in outer subscripts. llvm-svn: 28478	2006-05-25 21:25:12 +00:00
Chris Lattner	a643d528bd	Patch for a new instcombine xform, patch contributed by Nick Lewycky! This implements Transforms/InstCombine/2006-05-10-InvalidIndexUndef.ll llvm-svn: 28450	2006-05-24 17:34:30 +00:00
Reid Spencer	2452c94df4	Fix a doxygen problem and break lines at 80 columns llvm-svn: 28395	2006-05-19 19:09:46 +00:00
Chris Lattner	2e266807c3	Add a CloneModule call that exposes the mapping of values from the old module to the new module. Patch provided by Nick Lewycky! llvm-svn: 28349	2006-05-17 18:05:35 +00:00
Chris Lattner	35515557c7	remove some dead code identified by coverity llvm-svn: 28289	2006-05-14 18:45:44 +00:00
Chris Lattner	3237da073e	remove dead variables llvm-svn: 28286	2006-05-14 18:33:57 +00:00
Chris Lattner	4fe87d67c4	Patch to make some xforms preserve each other. Patch contributed by Domagoj Babic! llvm-svn: 28181	2006-05-09 04:13:41 +00:00
Chris Lattner	f98b4aa2e7	Fix some nondeterministic behavior in the mem2reg pass that (in addition to nondeterminism being bad) could cause some trivial missed optimizations (dead phi nodes being left around for later passes to clean up). With this, llvm-gcc4 now bootstraps and correctly compares. I don't know why I never tried to do it before... :) llvm-svn: 27984	2006-04-27 01:14:43 +00:00
Chris Lattner	17bd60588c	Add support for shufflevector llvm-svn: 27513	2006-04-08 01:19:12 +00:00
Chris Lattner	8ec0205de4	Fix inlining of insert/extract element constantexprs llvm-svn: 27478	2006-04-07 04:41:03 +00:00
Chris Lattner	70ec96fa32	Adjust to change in Intrinsics.gen interface. llvm-svn: 27344	2006-04-02 03:35:01 +00:00
Chris Lattner	1b2436a624	add valuemapper support for inline asm llvm-svn: 27332	2006-04-01 23:17:11 +00:00
Chris Lattner	42e0ba09aa	teach the inliner to work with packed constants llvm-svn: 27161	2006-03-27 05:50:18 +00:00
Chris Lattner	60f6833376	use autogenerated side-effect information llvm-svn: 26673	2006-03-09 22:38:10 +00:00
Chris Lattner	d95665188b	Fix Transforms/SimplifyCFG/2006-02-17-InfiniteUnroll.ll llvm-svn: 26275	2006-02-18 00:33:17 +00:00
Chris Lattner	9c5693fb2a	Canonicalize inner loops before outer loops. Inner loop canonicalization can provide work for the outer loop to canonicalize. This fixes a case that breaks unswitching. llvm-svn: 26189	2006-02-14 23:06:02 +00:00
Chris Lattner	cffbbee8d1	When splitting exit edges to canonicalize loops, make sure to put the new block in the appropriate loop nest. Third time is the charm, right? llvm-svn: 26187	2006-02-14 22:34:08 +00:00
Chris Lattner	02f53ad3a2	Revert my last patch. It too breaks stuff llvm-svn: 26128	2006-02-12 01:59:10 +00:00
Chris Lattner	35248e06bc	Fix for my previously reverted patch llvm-svn: 26126	2006-02-11 21:24:54 +00:00
Chris Lattner	b24ce3a2a8	revert my previous change, it exposed other problems. llvm-svn: 26121	2006-02-11 08:47:47 +00:00
Chris Lattner	05bf90dddf	Make this check stricter. Disallow loop exit blocks from being shared by loops and their subloops. llvm-svn: 26118	2006-02-11 02:13:17 +00:00
Chris Lattner	a6ae101afa	remove dead expr llvm-svn: 26116	2006-02-11 01:43:37 +00:00
Chris Lattner	120f31b1fd	teach the cloner to handle inline asms llvm-svn: 25633	2006-01-26 01:55:22 +00:00
Chris Lattner	00fcdfef0d	rename method llvm-svn: 25572	2006-01-24 04:16:34 +00:00
Chris Lattner	37992b34c2	When cloning a module, clone the inline asm. llvm-svn: 25559	2006-01-23 23:06:28 +00:00
Chris Lattner	469640e506	Add explicit #includes of <iostream> llvm-svn: 25509	2006-01-22 22:53:01 +00:00
Robert Bocchino	027c18da98	ConstantFoldLoadThroughGEPConstantExpr wasn't handling pointers to packed types correctly. llvm-svn: 25470	2006-01-19 23:53:23 +00:00
Chris Lattner	b98282d2d6	Make sure that cloning a module clones its target triple and dependent library list as well. This should help bugpoint. llvm-svn: 25424	2006-01-18 21:32:45 +00:00
Robert Bocchino	e6336a9b69	Constant folding support for the insertelement operation. llvm-svn: 25407	2006-01-17 20:07:07 +00:00
Reid Spencer	b4f9a6f110	For PR411: This patch is an incremental step towards supporting a flat symbol table. It de-overloads the intrinsic functions by providing type-specific intrinsics and arranging for automatically upgrading from the old overloaded name to the new non-overloaded name. Specifically: llvm.isunordered -> llvm.isunordered.f32, llvm.isunordered.f64 llvm.sqrt -> llvm.sqrt.f32, llvm.sqrt.f64 llvm.ctpop -> llvm.ctpop.i8, llvm.ctpop.i16, llvm.ctpop.i32, llvm.ctpop.i64 llvm.ctlz -> llvm.ctlz.i8, llvm.ctlz.i16, llvm.ctlz.i32, llvm.ctlz.i64 llvm.cttz -> llvm.cttz.i8, llvm.cttz.i16, llvm.cttz.i32, llvm.cttz.i64 New code should not use the overloaded intrinsic names. Warnings will be emitted if they are used. llvm-svn: 25366	2006-01-16 21:12:35 +00:00
Chris Lattner	0841fb1d4c	Teach the inliner to update the CallGraph itself, and have it add edges to llvm.stacksave/restore when it inserts calls to them. llvm-svn: 25320	2006-01-14 20:07:50 +00:00
Nate Begeman	82049eba2c	Add bswap intrinsics as documented in the Language Reference llvm-svn: 25309	2006-01-14 01:25:24 +00:00
Chris Lattner	5fba6e6696	it is ok to dce stacksave. llvm-svn: 25295	2006-01-13 21:31:54 +00:00
Chris Lattner	2be0607a8d	If inlining a call to a function that contains dynamic allocas, wrap the resultant code with llvm.stacksave/llvm.stackrestore intrinsics. llvm-svn: 25286	2006-01-13 19:34:14 +00:00
Chris Lattner	e24f79a032	Use ClonedCodeInfo to avoid another walk over the inlined code, this time in common C cases. llvm-svn: 25285	2006-01-13 19:18:11 +00:00
Chris Lattner	19e6a08d78	Use the ClonedCodeInfo object to avoid scans of the inlined code when it doesn't contain any calls. This is a fairly common case for C++ code, so it will probably speed up the inliner marginally in these cases. llvm-svn: 25284	2006-01-13 19:15:15 +00:00
Chris Lattner	908d79556d	Refactor a bunch of invoke handling stuff out into a new function "HandleInlinedInvoke". No functionality change. llvm-svn: 25283	2006-01-13 19:05:59 +00:00
Chris Lattner	edad1288fd	Allow the code cloning interfaces to capture some important info about the code being cloned if the client wants. llvm-svn: 25281	2006-01-13 18:39:17 +00:00
Chris Lattner	257492c0ab	Fix a bug I noticed by inspection: if the first instruction in the inlined function was not an alloca, we wouldn't check the entry block for any allocas, leading to increased stack space in some cases. In practice, allocas are almost always at the top of the block, so this was never noticed. llvm-svn: 25280	2006-01-13 18:16:48 +00:00
Chris Lattner	0770d8e326	Preserve and update ETForest. Patch by Daniel Berlin llvm-svn: 25203	2006-01-11 05:11:13 +00:00
Robert Bocchino	230044839d	Added support for the extractelement operation. llvm-svn: 25181	2006-01-10 19:05:34 +00:00
Chris Lattner	cda4aa6eb4	Teach loopsimplify to update et-forest. Patch contributed by Daniel Berlin! llvm-svn: 25153	2006-01-09 08:03:08 +00:00
Chris Lattner	2820b8c855	Fix SimplifyCFG/2005-12-03-IncorrectPHIFold.ll llvm-svn: 24581	2005-12-03 18:25:58 +00:00
Chris Lattner	3e9e8bd25c	Implement a refinement to the mem2reg algorithm for cases where an alloca has a single def. In this case, look for uses that are dominated by the def and attempt to rewrite them to directly use the stored value. This speeds up mem2reg on these values and reduces the number of phi nodes inserted. This should address PR665. llvm-svn: 24411	2005-11-18 07:31:42 +00:00
Chris Lattner	31dc3827d3	This needs proper dominance llvm-svn: 24410	2005-11-18 07:29:44 +00:00
Chris Lattner	479911f971	Fix #include order llvm-svn: 24044	2005-10-27 16:34:00 +00:00
John Criswell	fe5f33b120	Move some constant folding code shared by Analysis and Transform passes into the LLVMAnalysis library. This allows LLVMTranform and LLVMTransformUtils to be archives and linked with LLVMAnalysis.a, which provides any missing definitions. llvm-svn: 24036	2005-10-27 15:54:34 +00:00
John Criswell	94b7bea733	1. Remove libraries no longer created from the list of libraries linked into the SparcV9 JIT. 2. Make LLVMTransformUtils a relinked object file and always link it before LLVMAnalysis.a. These two libraries have circular dependencies on each other, which creates problems when building the SparcV9 JIT. This change fixes the dependency problems on all platforms with a minimum of fuss. llvm-svn: 24023	2005-10-26 20:35:13 +00:00
Jeff Cohen	2b8cbf319c	Update Visual Studio projects to reflect moved file. llvm-svn: 23998	2005-10-26 05:36:51 +00:00
Chris Lattner	bde3845548	DONT_BUILD_RELINKED is gone and implied by BUILD_ARCHIVE now llvm-svn: 23940	2005-10-24 02:26:13 +00:00
Chris Lattner	8c087e962c	Only build .a file versions of these libraries, instead of .a and .o versions. This should speed up build times. llvm-svn: 23933	2005-10-24 01:59:48 +00:00
Chris Lattner	20b0754c41	Fix DemoteRegToStack on an invoke. This fixes PR634. llvm-svn: 23618	2005-10-04 00:44:01 +00:00
Chris Lattner	4c3b2b536c	Clean up the code a bit. Use isInstructionTriviallyDead to be more aggressive and more correct than use_empty(). This fixes PR635 and SimplifyCFG/2005-10-02-InvokeSimplify.ll llvm-svn: 23616	2005-10-03 23:43:43 +00:00
Chris Lattner	ea7214b23d	Constant fold llvm.sqrt llvm-svn: 23487	2005-09-28 01:34:32 +00:00
Chris Lattner	16cd356fb2	allow demotion to volatile values, add support for invoke llvm-svn: 23473	2005-09-27 19:39:00 +00:00
Chris Lattner	c13c7b9376	Move the ConstantFoldLoadThroughGEPConstantExpr function out of the InstCombine pass. llvm-svn: 23444	2005-09-26 05:27:10 +00:00
Chris Lattner	499e33646e	remove some debugging code llvm-svn: 23411	2005-09-23 18:49:09 +00:00
Chris Lattner	c59a371d45	Fold two consecutive branches that share a common destination between them. This implements SimplifyCFG/branch-fold.ll, and is useful on ?:/min/max heavy code llvm-svn: 23410	2005-09-23 18:47:20 +00:00
Chris Lattner	3a978bf66d	simplify some logic further llvm-svn: 23408	2005-09-23 07:23:18 +00:00
Chris Lattner	cc14ebc17b	pull a bunch of logic out of SimplifyCFG into a helper fn llvm-svn: 23407	2005-09-23 06:39:30 +00:00
Chris Lattner	6c70106053	Start threading across blocks with code in them, so long as the code does not define a value that is used outside of its block. This catches many more simplifications, e.g. 854 in 176.gcc, 137 in vpr, etc. This implements branch-phi-thread.ll:test3.ll llvm-svn: 23397	2005-09-20 01:48:40 +00:00
Chris Lattner	f0bd8d0107	Implement merging of blocks with the same condition if the block has multiple predecessors. This implements branch-phi-thread.ll::test1 llvm-svn: 23395	2005-09-20 00:43:16 +00:00
Chris Lattner	049cb4482f	Reject a case we don't handle yet llvm-svn: 23393	2005-09-19 23:57:04 +00:00
Chris Lattner	a160924d57	remove debugging code :-/ llvm-svn: 23392	2005-09-19 23:50:15 +00:00
Chris Lattner	748f903046	Implement SimplifyCFG/branch-phi-thread.ll, the most trivial case of threading control across branches with determined outcomes. More generality to follow. This triggers a couple thousand times in specint. llvm-svn: 23391	2005-09-19 23:49:37 +00:00
Chris Lattner	89c1dfc733	Teach SplitCriticalEdge to update LoopInfo if it is alive. This fixes a problem in LoopStrengthReduction, where it would split critical edges then confuse itself with outdated loop information. llvm-svn: 22776	2005-08-13 01:38:43 +00:00
Chris Lattner	b7ebe65c56	Change break critical edges to not remove, then insert, PHI node entries. Instead, just update the BB in-place. This is both faster, and it prevents split-critical-edges from shuffling the PHI argument list unnecessarily. llvm-svn: 22765	2005-08-12 21:58:07 +00:00
Chris Lattner	257efb2ad3	This code can handle non-dominating instructions llvm-svn: 22667	2005-08-05 00:57:45 +00:00
Nate Begeman	b392321cae	Fix a fixme in CondPropagate.cpp by moving a PhiNode optimization into BasicBlock's removePredecessor routine. This requires shuffling around the definition and implementation of hasContantValue from Utils.h,cpp into Instructions.h,cpp llvm-svn: 22664	2005-08-04 23:24:19 +00:00
Chris Lattner	d683bdd0f8	Fix Transforms/SimplifyCFG/2005-08-03-PHIFactorCrash.ll, a problem that occurred while bugpointing another testcase llvm-svn: 22621	2005-08-03 17:59:45 +00:00
Chris Lattner	2dbf1960ff	Finally, add the required constraint checks to fix Transforms/SimplifyCFG/2005-08-01-PHIUpdateFail.ll the right way llvm-svn: 22615	2005-08-03 00:59:12 +00:00
Chris Lattner	908036942c	Simplify some code, add the correct pred checks llvm-svn: 22613	2005-08-03 00:38:27 +00:00
Chris Lattner	982b75c061	Refactor code out of PropagatePredecessorsForPHIs, turning it into a pure function with no side-effects llvm-svn: 22612	2005-08-03 00:29:26 +00:00
Chris Lattner	1f047fd513	use splice instead of remove/insert to avoid some symtab operations llvm-svn: 22611	2005-08-03 00:23:42 +00:00
Chris Lattner	76dc204488	move two functions up in the file, use SafeToMergeTerminators to eliminate some duplicated code llvm-svn: 22610	2005-08-03 00:19:45 +00:00
Chris Lattner	733d6704ce	Rip some code out of the main SimplifyCFG function into a subfunction and call it from the only place it is live. No functionality changes. llvm-svn: 22609	2005-08-03 00:11:16 +00:00
Chris Lattner	ac594de8dc	Disable this patch: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20050801/027345.html This breaks real programs and only fixes an obscure regression testcase. A real fix is in development. llvm-svn: 22606	2005-08-02 23:31:38 +00:00
Chris Lattner	eee90f7eb4	Change a place to use an arbitrary value instead of null, when possible llvm-svn: 22605	2005-08-02 23:29:23 +00:00
Chris Lattner	4fd3e16cbd	This code was very close, but not quite right. It did not take into consideration the case where a reference in an unreachable block could occur. This fixes Transforms/SimplifyCFG/2005-08-01-PHIUpdateFail.ll, something I ran into while bugpoint'ing another pass. llvm-svn: 22584	2005-08-02 03:24:05 +00:00
Jeff Cohen	5f4ef3c5a8	Eliminate all remaining tabs and trailing spaces. llvm-svn: 22523	2005-07-27 06:12:32 +00:00
Chris Lattner	937c71f2b3	Fix PR590 and Transforms/Mem2Reg/2005-06-30-ReadBeforeWrite.ll. The optimization for locally used allocas was not safe for allocas that were read before they were written. This change disables that optimization in that case. llvm-svn: 22318	2005-06-30 07:29:44 +00:00
Andrew Lenharth	d4b103107e	prevent DCE of vaarg intrinsics. This should take care of most regressions llvm-svn: 22263	2005-06-19 14:41:20 +00:00

1444 Commits