llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	b62f5082c5	implement and.ll:test33 llvm-svn: 21809	2005-05-09 04:58:36 +00:00
Chris Lattner	d0525a29d1	Preserve calling conventions when doing IPO llvm-svn: 21798	2005-05-09 01:05:50 +00:00
Chris Lattner	21d1dde72a	wrap long lines, preserve calling conventions when cloning functions and turning calls into invokes llvm-svn: 21797	2005-05-09 01:04:34 +00:00
Chris Lattner	a4c8022caf	Convert non-address taken functions with C calling conventions to fastcc. llvm-svn: 21791	2005-05-08 22:18:06 +00:00
Chris Lattner	df3332660f	Implement Reassociate/mul-neg-add.ll llvm-svn: 21788	2005-05-08 21:41:35 +00:00
Chris Lattner	c4f8e2b0ed	Bail out earlier llvm-svn: 21786	2005-05-08 21:33:47 +00:00
Chris Lattner	877b114037	Teach reassociate that 0-X === X*-1 llvm-svn: 21785	2005-05-08 21:28:52 +00:00
Chris Lattner	9f284e0a3c	Fix PR557 and basictest[34].ll. This makes reassociate realize that loads should be treated as unmovable, and gives distinct ranks to distinct values defined in the same basic block, allowing reassociate to do its thing. llvm-svn: 21783	2005-05-08 20:57:04 +00:00
Chris Lattner	9187f3905e	Add debugging information llvm-svn: 21781	2005-05-08 20:09:57 +00:00
Chris Lattner	08582be283	eliminate gotos llvm-svn: 21780	2005-05-08 19:48:43 +00:00
Chris Lattner	5847e5e10c	Improve reassociation handling of inverses, implementing inverses.ll. llvm-svn: 21778	2005-05-08 18:59:37 +00:00
Chris Lattner	4922118dc4	clean up and modernize this pass. llvm-svn: 21776	2005-05-08 18:45:26 +00:00
Chris Lattner	b18dbbfff5	Strength reduce SAR into SHR if there is no way sign bits could be shifted in. This tends to get cases like this: X = cast ubyte to int Y = shr int X, ... Tested by: shift.ll:test24 llvm-svn: 21775	2005-05-08 17:34:56 +00:00
Chris Lattner	e1850b86b6	Refactor some code llvm-svn: 21772	2005-05-08 00:19:31 +00:00
Chris Lattner	6e2086d7e4	Handle some simple cases where we can see that values get annihilated. llvm-svn: 21771	2005-05-08 00:08:33 +00:00
Chris Lattner	4294cec0f1	Fix a miscompilation of crafty by clobbering the "A" variable. llvm-svn: 21770	2005-05-07 23:49:08 +00:00
Chris Lattner	1e5065052a	Rewrite the guts of the reassociate pass to be more efficient and logical. Instead of trying to do local reassociation tweaks at each level, only process an expression tree once (at its root). This does not improve the reassociation pass in any real way. llvm-svn: 21768	2005-05-07 21:59:39 +00:00
Reid Spencer	170ae7ff70	* Add two strlen optimizations: strlen(x) != 0 -> x != 0 strlen(x) == 0 -> x == 0 * Change nested statistics to use style of other LLVM statistics so that only the name of the optimization (simplify-libcalls) is used as the statistic name, and the description indicates which specific all is optimized. Cuts down on some redundancy and saves a few bytes of space. * Make note of stpcpy optimization that could be done. llvm-svn: 21766	2005-05-07 20:15:59 +00:00
Reid Spencer	4f01a822b4	Don't increment the counter unless the debug flag is set. llvm-svn: 21762	2005-05-07 04:59:45 +00:00
Chris Lattner	cea579932d	Convert shifts to muls to assist reassociation. This implements Reassociate/shifttest.ll llvm-svn: 21761	2005-05-07 04:24:13 +00:00
Chris Lattner	f43e974abd	Simplify the code and rearrange it. No major functionality changes here. llvm-svn: 21759	2005-05-07 04:08:02 +00:00
Chris Lattner	7effa0ed06	BAD typeo which caused many testsuite failures last night. Note to self, do not change code after testing it without retesting! llvm-svn: 21741	2005-05-06 17:13:16 +00:00
Chris Lattner	6aacb0f9da	Preserve tail marker llvm-svn: 21737	2005-05-06 06:48:21 +00:00
Chris Lattner	9f3dced2c7	Implement Transforms/Inline/inline-tail.ll llvm-svn: 21736	2005-05-06 06:47:52 +00:00
Chris Lattner	324d2eedb2	preserve the tail marker llvm-svn: 21734	2005-05-06 06:46:58 +00:00
Chris Lattner	53db546b97	Wrap long lines llvm-svn: 21720	2005-05-06 05:34:40 +00:00
Chris Lattner	a36d525741	DCE intrinsic instructions without side effects. llvm-svn: 21719	2005-05-06 05:27:34 +00:00
Chris Lattner	ef298a3b8a	Teach instcombine propagate zeroness through shl instructions, implementing and.ll:test31 llvm-svn: 21717	2005-05-06 04:53:20 +00:00
Chris Lattner	873804168e	Implement shift.ll:test23. If we are shifting right then immediately truncating the result, turn signed shift rights into unsigned shift rights if possible. This leads to later simplification and happens often in 176.gcc. For example, this testcase: struct xxx { unsigned int code : 8; }; enum codes { A, B, C, D, E, F }; int foo(struct xxx P) { if ((enum codes)P->code == A) bar(); } used to be compiled to: int %foo(%struct.xxx %P) { %tmp.1 = getelementptr %struct.xxx* %P, int 0, uint 0 ; <uint> [#uses=1] %tmp.2 = load uint %tmp.1 ; <uint> [#uses=1] %tmp.3 = cast uint %tmp.2 to int ; <int> [#uses=1] %tmp.4 = shl int %tmp.3, ubyte 24 ; <int> [#uses=1] %tmp.5 = shr int %tmp.4, ubyte 24 ; <int> [#uses=1] %tmp.6 = cast int %tmp.5 to sbyte ; <sbyte> [#uses=1] %tmp.8 = seteq sbyte %tmp.6, 0 ; <bool> [#uses=1] br bool %tmp.8, label %then, label %UnifiedReturnBlock Now it is compiled to: %tmp.1 = getelementptr %struct.xxx* %P, int 0, uint 0 ; <uint> [#uses=1] %tmp.2 = load uint %tmp.1 ; <uint> [#uses=1] %tmp.2 = cast uint %tmp.2 to sbyte ; <sbyte> [#uses=1] %tmp.8 = seteq sbyte %tmp.2, 0 ; <bool> [#uses=1] br bool %tmp.8, label %then, label %UnifiedReturnBlock which is the difference between this: foo: subl $4, %esp movl 8(%esp), %eax movl (%eax), %eax shll $24, %eax sarl $24, %eax testb %al, %al jne .LBBfoo_2 and this: foo: subl $4, %esp movl 8(%esp), %eax movl (%eax), %eax testb %al, %al jne .LBBfoo_2 This occurs 3243 times total in the External tests, 215x in povray, 6x in each f2c'd program, 1451x in 176.gcc, 7x in crafty, 20x in perl, 25x in gap, 3x in m88ksim, 25x in ijpeg. Maybe this will cause a little jump on gcc tommorow :) llvm-svn: 21715	2005-05-06 04:18:52 +00:00
Chris Lattner	7208616ec0	Implement xor.ll:test22 llvm-svn: 21713	2005-05-06 02:07:39 +00:00
Chris Lattner	4c2d3781aa	implement and.ll:test30 and set.ll:test21 llvm-svn: 21712	2005-05-06 01:53:19 +00:00
Chris Lattner	dd1e562ec3	implement or.ll:test20 llvm-svn: 21709	2005-05-06 00:58:50 +00:00
Chris Lattner	807aa20f67	Fix a bug compimling Ruby, fixing this testcase: LowerSetJmp/2005-05-05-OldUses.ll llvm-svn: 21696	2005-05-05 15:47:43 +00:00
Chris Lattner	809dfac421	Instcombine: cast (X != 0) to int, cast (X == 1) to int -> X iff X has only the low bit set. This implements set.ll:test20. This triggers 2x on povray, 9x on mesa, 11x on gcc, 2x on crafty, 1x on eon, 6x on perlbmk and 11x on m88ksim. It allows us to compile these two functions into the same code: struct s { unsigned int bit : 1; }; unsigned foo(struct s p) { if (p->bit) return 1; else return 0; } unsigned bar(struct s p) { return p->bit; } llvm-svn: 21690	2005-05-04 19:10:26 +00:00
Reid Spencer	282d057485	Implement the IsDigitOptimization for simplifying calls to the isdigit library function: isdigit(chr) -> 0 or 1 if chr is constant isdigit(chr) -> chr - '0' <= 9 otherwise Although there are many calls to isdigit in llvm-test, most of them are compiled away by macros leaving only this: 2 MultiSource/Applications/hexxagon llvm-svn: 21688	2005-05-04 18:58:28 +00:00
Reid Spencer	1e520fd661	* Correct the function prototypes for some of the functions to match the actual spec (int -> uint) * Add the ability to get/cache the strlen function prototype. * Make sure generated values are appropriately named for debugging purposes * Add the SPrintFOptimiation for 4 casts of sprintf optimization: sprintf(str,cstr) -> llvm.memcpy(str,cstr) (if cstr has no %) sprintf(str,"") -> store sbyte 0, str sprintf(str,"%s",src) -> llvm.memcpy(str,src) (if src is constant) sprintf(str,"%c",chr) -> store chr, str ; store sbyte 0, str+1 The sprintf optimization didn't fire as much as I had hoped: 2 MultiSource/Applications/SPASS 5 MultiSource/Benchmarks/McCat/18-imp 22 MultiSource/Benchmarks/Prolangs-C/TimberWolfMC 1 MultiSource/Benchmarks/Prolangs-C/assembler 6 MultiSource/Benchmarks/Prolangs-C/unix-smail 2 MultiSource/Benchmarks/mediabench/mpeg2/mpeg2dec llvm-svn: 21679	2005-05-04 03:20:21 +00:00
Reid Spencer	38cabd7265	Implement optimizations for the strchr and llvm.memset library calls. Neither of these activated as many times as was hoped: strchr: 9 MultiSource/Applications/siod 1 MultiSource/Applications/d 2 MultiSource/Prolangs-C/archie-client 1 External/SPEC/CINT2000/176.gcc/176.gcc llvm.memset: no hits llvm-svn: 21669	2005-05-03 07:23:44 +00:00
Reid Spencer	95d8efdfcf	Avoid garbage output in the statistics display by ensuring that the strings passed to Statistic's constructor are not destructable. The stats are printed during static destruction and the SimplifyLibCalls module was getting destructed before the statistics. llvm-svn: 21661	2005-05-03 02:54:54 +00:00
Reid Spencer	49fa070401	Add the StrNCmpOptimization which is similar to strcmp. Unfortunately, this optimization didn't trigger on any llvm-test tests. llvm-svn: 21660	2005-05-03 01:43:45 +00:00
Reid Spencer	2d5c7beebd	Implement the fprintf optimization which converts calls like this: fprintf(F,"hello") -> fwrite("hello",strlen("hello"),1,F) fprintf(F,"%s","hello") -> fwrite("hello",strlen("hello"),1,F) fprintf(F,"%c",'x') -> fputc('c',F) This optimization fires severals times in llvm-test: 313 MultiSource/Applications/Burg 302 MultiSource/Benchmarks/Prolangs-C/TimberWolfMC 189 MultiSource/Benchmarks/Prolangs-C/mybison 175 MultiSource/Benchmarks/Prolangs-C/football 130 MultiSource/Benchmarks/Prolangs-C/unix-tbl llvm-svn: 21657	2005-05-02 23:59:26 +00:00
John Criswell	f42ed7bdaf	Fixed a comment. llvm-svn: 21653	2005-05-02 14:47:42 +00:00
Chris Lattner	a816eee427	Implement getelementptr.ll:test11 llvm-svn: 21647	2005-05-01 04:42:15 +00:00
Chris Lattner	a9d84e3388	Check for volatile loads only once. Implement load.ll:test7 llvm-svn: 21645	2005-05-01 04:24:53 +00:00
Reid Spencer	16449a9eb0	Fix a comment that stated the wrong thing. llvm-svn: 21638	2005-04-30 06:45:47 +00:00
Reid Spencer	4c444fe007	* Don't depend on "guessing" what a FILE* is, just require that the actual type be obtained from a CallInst we're optimizing. * Make it possible for getConstantStringLength to return the ConstantArray that it extracts in case the content is needed by an Optimization. * Implement the strcmp optimization * Implement the toascii optimization This pass is now firing several to many times in the following MultiSource tests: Applications/Burg - 7 (strcat,strcpy) Applications/siod - 13 (strcat,strcpy,strlen) Applications/spiff - 120 (exit,fputs,strcat,strcpy,strlen) Applications/treecc - 66 (exit,fputs,strcat,strcpy) Applications/kimwitu++ - 34 (strcmp,strcpy,strlen) Applications/SPASS - 588 (exit,fputs,strcat,strcpy,strlen) llvm-svn: 21626	2005-04-30 03:17:54 +00:00
Reid Spencer	9361697f93	Implement the optimizations for "pow" and "fputs" library calls. llvm-svn: 21618	2005-04-29 09:39:47 +00:00
Reid Spencer	c968ea0495	Remove optimizations that don't require both operands to be constant. These are moved to simplify-libcalls pass. llvm-svn: 21614	2005-04-29 05:55:35 +00:00
Jeff Cohen	4bc952f703	Consistently use 'class' to silence VC++ llvm-svn: 21612	2005-04-29 03:05:44 +00:00
Reid Spencer	ed55a6b5e0	* Add constant folding for additional floating point library calls such as sinh, cosh, etc. * Make the name comparisons for the fp libcalls a little more efficient by switching on the first character of the name before doing comparisons. llvm-svn: 21611	2005-04-28 23:01:59 +00:00
Reid Spencer	16983ca865	Remove from the TODO list those optimizations that are already handled by constant folding implemented in lib/Transforms/Utils/Local.cpp. llvm-svn: 21604	2005-04-28 18:05:16 +00:00
Reid Spencer	649ac283e4	Document additional libcall transformations that need to be written. Help Wanted! There's a lot of them to write. llvm-svn: 21603	2005-04-28 04:40:06 +00:00
Reid Spencer	7ddcfb3375	Doxygenate. llvm-svn: 21602	2005-04-27 21:29:20 +00:00
Chris Lattner	36ffb1ff37	remove 'statement with no effect' warning llvm-svn: 21600	2005-04-27 20:12:17 +00:00
Reid Spencer	08b4940509	More Cleanup: * Name the instructions by appending to name of original * Factor common part out of a switch statement. llvm-svn: 21597	2005-04-27 17:46:54 +00:00
Reid Spencer	e249a82e73	This is a cleanup commit: * Correct stale documentation in a few places * Re-order the file to better associate things and reduce line count * Make the pass thread safe by caching the Function* objects needed by the optimizers in the pass object instead of globally. * Provide the SimplifyLibCalls pass object to the optimizer classes so they can access cached Function* objects and TargetData info * Make sure the pass resets its cache if the Module passed to runOnModule changes * Rename CallOptimizer LibCallOptimization. All the classes are named Optimization while the objects are Optimizer. * Don't cache Function* in the optimizer objects because they could be used by multiple PassManager's running in multiple threads * Add an optimization for strcpy which is similar to strcat * Add a "TODO" list at the end of the file for ideas on additional libcall optimizations that could be added (get ideas from other compilers). Sorry for the huge diff. Its mostly reorganization of code. That won't happen again as I believe the design and infrastructure for this pass is now done or close to it. llvm-svn: 21589	2005-04-27 07:54:40 +00:00
Chris Lattner	93f4e9dd26	detect functions that never return, and turn the instruction following a call to them into an 'unreachable' instruction. This triggers a bunch of times, particularly on gcc: gzip: 36 gcc: 601 eon: 12 bzip: 38 llvm-svn: 21587	2005-04-27 04:52:23 +00:00
Reid Spencer	dc11db68b6	Prefix the debug statistics so they group together. llvm-svn: 21583	2005-04-27 00:20:23 +00:00
Reid Spencer	e95a647b2a	In debug builds, make a statistic for each kind of call optimization. This helps track down what gets triggered in the pass so its easier to identify good test cases. llvm-svn: 21582	2005-04-27 00:05:45 +00:00
Chris Lattner	7f4f773e9f	This analysis doesn't take 'throwing' into consideration, it looks at 'unwinding' llvm-svn: 21581	2005-04-26 23:53:25 +00:00
Reid Spencer	f9d4be187f	Fix up the debug statement to actually use a newline .. radical concept. llvm-svn: 21580	2005-04-26 23:07:08 +00:00
Reid Spencer	18b998192f	Uh, this isn't argpromotion. llvm-svn: 21579	2005-04-26 23:05:17 +00:00
Reid Spencer	2bc7a4f82a	Add some debugging output so we can tell which calls are getting triggered llvm-svn: 21578	2005-04-26 23:02:16 +00:00
Reid Spencer	f8c03d9db6	No, seriously folks, memcpy really does return void. llvm-svn: 21575	2005-04-26 22:49:48 +00:00
Reid Spencer	aaca170867	memcpy returns void!!!!! llvm-svn: 21574	2005-04-26 22:46:23 +00:00
Reid Spencer	4855ebf622	Fix some bugs found by running on llvm-test: * MemCpyOptimization can only be optimized if the 3rd and 4th arguments are constants and we weren't checking for that. * The result of llvm.memcpy (and llvm.memmove) is void* not sbyte*, put in a cast. llvm-svn: 21570	2005-04-26 19:55:57 +00:00
Reid Spencer	bb92b4fdfb	Changes From Review Feedback: * Have the SimplifyLibCalls pass acquire the TargetData and pass it down to the optimization classes so they can use it to make better choices for the signatures of functions, etc. * Rearrange the code a little so the utility functions are closer to their usage and keep the core of the pass near the top of the files. * Adjust the StrLen pass to get/use the correct prototype depending on the TargetData::getIntPtrType() result. The result of strlen is size_t which could be either uint or ulong depending on the platform. * Clean up some coding nits (cast vs. dyn_cast, remove redundant items from a switch, etc.) * Implement the MemMoveOptimization as a twin of MemCpyOptimization (they only differ in name). llvm-svn: 21569	2005-04-26 19:13:17 +00:00
Chris Lattner	bd43b9db9d	Fix the compile failures from last night. llvm-svn: 21565	2005-04-26 14:40:41 +00:00
Reid Spencer	b4f7b83dce	* Merge get_GVInitializer and getCharArrayLength into a single function named getConstantStringLength. This is the common part of StrCpy and StrLen optimizations and probably several others, yet to be written. It performs all the validity checks for looking at constant arrays that are supposed to be null-terminated strings and then computes the actual length of the string. * Implement the MemCpyOptimization class. This just turns memcpy of 1, 2, 4 and 8 byte data blocks that are properly aligned on those boundaries into a load and a store. Much more could be done here but alignment restrictions and lack of knowledge of the target instruction set prevent use from doing significantly more. That will have to be delegated to the code generators as they lower llvm.memcpy calls. llvm-svn: 21562	2005-04-26 07:45:18 +00:00
Reid Spencer	76dab9a523	* Implement StrLenOptimization * Factor out commonalities between StrLenOptimization and StrCatOptimization * Make sure that signatures return sbyte* not void* llvm-svn: 21559	2005-04-26 05:24:00 +00:00
Reid Spencer	8ee5aacc38	Incorporate feedback from Chris: * Change signatures of OptimizeCall and ValidateCalledFunction so they are non-const, allowing the optimization object to be modified. This is in support of caching things used across multiple calls. * Provide two functions for constructing and caching function types * Modify the StrCatOptimization to cache Function objects for strlen and llvm.memcpy so it doesn't regenerate them on each call site. Make sure these are invalidated each time we start the pass. * Handle both a GEP Instruction and a GEP ConstantExpr * Add additional checks to make sure we really are dealing with an arary of sbyte and that all the element initializers are ConstantInt or ConstantExpr that reduce to ConstantInt. * Make sure the GlobalVariable is constant! * Don't use ConstantArray::getString as it can fail and it doesn't give us the right thing. We must check for null bytes in the middle of the array. * Use llvm.memcpy instead of memcpy so we can factor alignment into it. * Don't use void* types in signatures, replace with sbyte* instead. llvm-svn: 21555	2005-04-26 03:26:15 +00:00
Reid Spencer	fe91dfec91	Changes due to code review and new implementation: * Don't use std::string for the function names, const char* will suffice * Allow each CallOptimizer to validate the function signature before doing anything * Repeatedly loop over the functions until an iteration produces no more optimizations. This allows one optimization to insert a call that is optimized by another optimization. * Implement the ConstantArray portion of the StrCatOptimization * Provide a template for the MemCpyOptimization * Make ExitInMainOptimization split the block, not delete everything after the return instruction. (This covers revision 1.3 and 1.4, as the 1.3 comments were botched) llvm-svn: 21548	2005-04-25 21:20:38 +00:00
Reid Spencer	f2534c7291	Lots of changes based on review and new functionality: * Use a llvm-svn: 21546	2005-04-25 21:11:48 +00:00
Chris Lattner	a21bf8d1be	implement getelementptr.ll:test10 llvm-svn: 21541	2005-04-25 20:17:30 +00:00
Reid Spencer	9bbaa2ab7f	Post-Review Cleanup: * Fix comments at top of file * Change algorithm for running the call optimizations from nn to something closer to n. Use a hash_map to store and lookup the optimizations since there will eventually (or potentially) be a large number of them. This gets lookup based on the name of the function to O(1). Each CallOptimizer now has a std::string member named func_name that tracks the name of the function that it applies to. It is this string that is entered into the hash_map for fast comparison against the function names encountered in the module. * Cleanup some style issues pertaining to iterator invalidation * Don't pass the Function pointer to the OptimizeCall function because if the optimization needs it, it can get it from the CallInst passed in. * Add the skeleton for a new CallOptimizer, StrCatOptimizer which will eventually replace strcat's of constant strings with direct copies. llvm-svn: 21526	2005-04-25 03:59:26 +00:00
Reid Spencer	39a762d149	A new pass to provide specific optimizations for certain well-known library calls. The pass visits all external functions in the module and determines if such function calls can be optimized. The optimizations are specific to the library calls involved. This initial version only optimizes calls to exit(3) when they occur in main(): it changes them to ret instructions. llvm-svn: 21522	2005-04-25 02:53:12 +00:00
Chris Lattner	2f1457fd83	Eliminate cases where we could << by 64, which is undefined in C. llvm-svn: 21500	2005-04-24 17:46:05 +00:00
Chris Lattner	d6f636a340	Implement xor.ll:test21: select (not C), A, B -> select C, B, A llvm-svn: 21495	2005-04-24 07:30:14 +00:00
Chris Lattner	d1f46d3bf9	Use getPrimitiveSizeInBits() instead of getPrimitiveSize()*8 Completely rework the 'setcc (cast x to larger), y' code. This code has the advantage of implementing setcc.ll:test19 (being more general than the previous code) and being correct in all cases. This allows us to unxfail 2004-11-27-SetCCForCastLargerAndConstant.ll, and close PR454. llvm-svn: 21491	2005-04-24 06:59:08 +00:00
Jeff Cohen	82639853c0	Eliminate tabs and trailing spaces llvm-svn: 21480	2005-04-23 21:38:35 +00:00
Chris Lattner	77c32c34d7	Generalize the setcc -> PHI and Select folding optimizations to work with any constant RHS, not just a constant integer RHS. This implements select.ll:test17 llvm-svn: 21470	2005-04-23 15:31:55 +00:00
Misha Brukman	b1c9317bb4	Remove trailing whitespace llvm-svn: 21427	2005-04-21 23:48:37 +00:00
Chris Lattner	a3159af703	Fix a bug where we would not promote calls to invokes if they occured in the same block as the setjmp. Thanks to Greg Pettyjohn for noticing this! llvm-svn: 21403	2005-04-21 16:46:46 +00:00
Chris Lattner	7ceb081f3f	Improve doxygen documentation, patch contributed by Evan Jones! llvm-svn: 21393	2005-04-21 16:04:49 +00:00
Chris Lattner	374e659466	Instcombine this: %shortcirc_val = select bool %tmp.1, bool true, bool %tmp.4 ; <bool> [#uses=1] %tmp.6 = cast bool %shortcirc_val to int ; <int> [#uses=1] into this: %shortcirc_val = or bool %tmp.1, %tmp.4 ; <bool> [#uses=1] %tmp.6 = cast bool %shortcirc_val to int ; <int> [#uses=1] not this: %tmp.4.cast = cast bool %tmp.4 to int ; <int> [#uses=1] %tmp.6 = select bool %tmp.1, int 1, int %tmp.4.cast ; <int> [#uses=1] llvm-svn: 21389	2005-04-21 05:43:13 +00:00
Chris Lattner	b38b443b15	Teach simplifycfg that setcc is cheap and non-trapping, so that it can convert this: %tmp.1 = seteq int %i, 0 ; <bool> [#uses=1] br bool %tmp.1, label %shortcirc_done, label %shortcirc_next shortcirc_next: ; preds = %entry %tmp.4 = seteq int %j, 0 ; <bool> [#uses=1] br label %shortcirc_done shortcirc_done: ; preds = %shortcirc_next, %entry %shortcirc_val = phi bool [ %tmp.4, %shortcirc_next ], [ true, %entry ] ; <bool> [#uses=1] to this: %tmp.1 = seteq int %i, 0 ; <bool> [#uses=1] %tmp.4 = seteq int %j, 0 ; <bool> [#uses=1] %shortcirc_val = select bool %tmp.1, bool true, bool %tmp.4 ; <bool> [#uses=1] ... which is later simplified by instcombine into an or. llvm-svn: 21388	2005-04-21 05:31:13 +00:00
Chris Lattner	8cb10a1775	Wrap some long lines. Make IPSCCP strip off dead constant exprs that are using functions, making them appear as though their address is taken. This allows us to propagate some more pool descriptors, lowering the overhead of pool alloc. llvm-svn: 21363	2005-04-19 19:16:19 +00:00
Chris Lattner	5c219469a0	Eliminate a broken transformation, fixing PR548 llvm-svn: 21354	2005-04-19 06:04:18 +00:00
Chris Lattner	ee84413730	silence a bogus warning llvm-svn: 21320	2005-04-18 05:26:21 +00:00
Chris Lattner	16a50fd0a0	a new simple pass, which will be extended to be more useful in the future. This pass forward branches through conditions when it can show that the conditions is either always true or false for a predecessor. This currently only handles the most simple cases of this, but is successful at threading across 2489 branches and 65 switch instructions in 176.gcc, which isn't bad. llvm-svn: 21306	2005-04-15 19:28:32 +00:00
Chris Lattner	95f16a3ac4	Get rid of this for_each loop llvm-svn: 21253	2005-04-12 18:51:33 +00:00
Chris Lattner	4236261930	Fix bug: InstCombine/2005-05-07-UDivSelectCrash.ll llvm-svn: 21152	2005-04-08 04:03:26 +00:00
Chris Lattner	4706046e68	Implement the following xforms: (X-Y)-X --> -Y A + (B - A) --> B (B - A) + A --> B llvm-svn: 21138	2005-04-07 17:14:51 +00:00
Chris Lattner	c7f3c1a00e	Implement InstCombine/add.ll:test28, transforming C1-(X+C2) --> (C1-C2)-X. This occurs several dozen times in specint2k, particularly in crafty and gcc apparently. llvm-svn: 21136	2005-04-07 16:28:01 +00:00
Chris Lattner	a9be4490d8	Transform X-(X+Y) == -Y and X-(Y+X) == -Y llvm-svn: 21134	2005-04-07 16:15:25 +00:00
Chris Lattner	ecfa9b5810	disable this transformation in the one obscure case that really pessimizes pointer analysis. llvm-svn: 20916	2005-03-29 06:37:47 +00:00
Alkis Evlogimenos	9ead0d7b4c	Rename createPromoteMemoryToRegister() to createPromoteMemoryToRegisterPass() to be consistent with other pass creation functions. llvm-svn: 20885	2005-03-28 02:01:12 +00:00
Chris Lattner	514e843e89	Enhance loopsimplify to preserve alias analysis instead of clobbering it. This prevents crashes on some programs when using -ds-aa -licm. llvm-svn: 20831	2005-03-25 06:37:22 +00:00
Chris Lattner	faf7791fea	Fix a bug where LICM was not updating AA information properly when sinking a pointer value out of a loop causing it to be duplicated. llvm-svn: 20828	2005-03-25 00:22:36 +00:00
Chris Lattner	1c790bf656	enable -debug-only=licm llvm-svn: 20788	2005-03-23 21:00:12 +00:00
Chris Lattner	7b9020a059	Fix the missing symbols problem Bill was hitting. Patch contributed by Bill Wendling!! llvm-svn: 20649	2005-03-17 15:38:16 +00:00
Chris Lattner	6cb4559369	stop using method. llvm-svn: 20603	2005-03-15 05:19:49 +00:00
Chris Lattner	531f9e92d4	This mega patch converts us from using Function::a{iterator\|begin\|end} to using Function::arg_{iterator\|begin\|end}. Likewise Module::g* -> Module::global_*. This patch is contributed by Gabor Greif, thanks! llvm-svn: 20597	2005-03-15 04:54:21 +00:00
Chris Lattner	8c79559443	fix a bug where we thought arguments were constants :( llvm-svn: 20506	2005-03-06 22:52:29 +00:00
Chris Lattner	2ce303b406	Fix Regression/Transforms/LoopStrengthReduce/dont_insert_redundant_ops.ll, hopefully not breaking too many other things. llvm-svn: 20505	2005-03-06 22:36:12 +00:00
Chris Lattner	45403e5052	implement Transforms/LoopStrengthReduce/invariant_value_first_arg.ll llvm-svn: 20501	2005-03-06 22:06:22 +00:00
Chris Lattner	d3874fad44	minor simplifications of the code. llvm-svn: 20497	2005-03-06 21:58:22 +00:00
Chris Lattner	dd3ec92085	trivial simplification llvm-svn: 20494	2005-03-06 21:35:38 +00:00
Chris Lattner	238f6df546	Fix a bug where we could corrupt a parent loop's header info if we unrolled a nested loop. This fixes Transforms/LoopUnroll/2005-03-06-BadLoopInfoUpdate.ll and PR532 llvm-svn: 20493	2005-03-06 20:57:32 +00:00
Chris Lattner	1b032f59e7	Make this MUCH faster by avoiding a linear search in the symbol table code. llvm-svn: 20479	2005-03-06 05:42:36 +00:00
Jeff Cohen	4abcea3a69	Reformat comments to fix 80 columns. llvm-svn: 20467	2005-03-05 22:45:40 +00:00
Jeff Cohen	be37fa07fd	Reuse induction variables created for strength-reduced GEPs by other similar GEPs. llvm-svn: 20466	2005-03-05 22:40:34 +00:00
Chris Lattner	6d0a24c608	second argument to Value::setName is now gone. llvm-svn: 20463	2005-03-05 19:05:20 +00:00
Chris Lattner	cfe2822cdf	Do not compute 1ULL << 64, which is undefined. This fixes Ptrdist/ks on the sparc, and testcase Regression/Transforms/InstCombine/2005-03-04-ShiftOverflow.ll llvm-svn: 20445	2005-03-04 23:21:33 +00:00
Jeff Cohen	a2c59b7423	Add support for not strength reducing GEPs where the element size is a small power of two. This emphatically includes the zeroeth power of two. llvm-svn: 20429	2005-03-04 04:04:26 +00:00
Chris Lattner	ef1e989e4f	Add an optional argument to lower to a specific constant value instead of to a "sizeof" expression. llvm-svn: 20414	2005-03-03 01:03:43 +00:00
Jeff Cohen	8ea6f9e821	Fixed the following LSR bugs: * Loop invariant code does not dominate the loop header, but rather the end of the loop preheader. * The base for a reduced GEP isn't a constant unless all of its operands (preceding the induction variable) are constant. * Allow induction variable elimination for the simple case after all. Also made changes recommended by Chris for properly deleting instructions. llvm-svn: 20383	2005-03-01 03:46:11 +00:00
Jeff Cohen	dcaa48b5c4	Fix crash in LSR due to attempt to remove original induction variable. However, for reasons explained in the comments, I also deactivated this code as it needs more thought. llvm-svn: 20367	2005-02-28 00:08:56 +00:00
Jeff Cohen	fd63d3af0d	PHI nodes were incorrectly placed when more than one GEP is reduced in a loop. llvm-svn: 20360	2005-02-27 21:08:04 +00:00
Jeff Cohen	39751c3b7c	First pass at improved Loop Strength Reduction. Still not yet ready for prime time. llvm-svn: 20358	2005-02-27 19:37:07 +00:00
Chris Lattner	7561ca1d15	Teach globalopt how memset/cpy/move affect memory, to allow better optimization. llvm-svn: 20352	2005-02-27 18:58:52 +00:00
Chris Lattner	0ce80cd542	Fix spelling, patch contributed by Gabor Greif! llvm-svn: 20343	2005-02-27 06:18:25 +00:00
Chris Lattner	cc6d75fddf	remove extraneous cast llvm-svn: 20334	2005-02-26 18:33:28 +00:00
Chris Lattner	1cca959e5d	Implement Transforms/SimplifyCFG/switch_thread.ll This does a simple form of "jump threading", which eliminates CFG edges that are provably dead. This triggers 90 times in the external tests, and eliminating CFG edges is always always a good thing! :) llvm-svn: 20300	2005-02-24 06:17:52 +00:00
Chris Lattner	25169caa80	make this more efficient. Scan up to 16 nodes, not the whole list. llvm-svn: 20289	2005-02-23 16:53:04 +00:00
Chris Lattner	52e931b37d	Remove use of bind_obj llvm-svn: 20276	2005-02-22 23:22:58 +00:00
Chris Lattner	7b5d9e2217	Do not mark obviously unreachable blocks live when processing PHI nodes, and handle incomplete control dependences correctly. This fixes: Regression/Transforms/ADCE/dead-phi-edge.ll -> a missed optimization Regression/Transforms/ADCE/dead-phi-edge.ll -> a compiler crash distilled from QT4 llvm-svn: 20227	2005-02-17 19:28:49 +00:00
Chris Lattner	31f3382b3b	Fix the second bug attached to PR504. llvm-svn: 20181	2005-02-14 20:11:45 +00:00
Chris Lattner	e616fea3bc	Fix for testcase Transforms/IndVarsSimplify/2005-02-11-InvokeCrash.ll and PR504. llvm-svn: 20129	2005-02-12 03:26:49 +00:00
Alkis Evlogimenos	c4a44c6b3d	Localize globals if they are only used in main(). This replaces the global with an alloca, which eventually gets promoted into a register. This enables a lot of other optimizations later on. llvm-svn: 20109	2005-02-10 18:36:30 +00:00
Alkis Evlogimenos	346bb20409	Fix crash on MallocInsts of unsized types. llvm-svn: 19988	2005-02-02 04:43:37 +00:00
Chris Lattner	82b42c5d85	API change. llvm-svn: 19959	2005-02-01 01:23:49 +00:00
Chris Lattner	d6a4492f81	Adjust to changes in APIs llvm-svn: 19958	2005-02-01 01:23:31 +00:00
Chris Lattner	f98a7bffb3	Hacks to make this ugly ugly code work with the new use lists. llvm-svn: 19957	2005-02-01 01:22:56 +00:00
Chris Lattner	72684fecf8	Implement InstCombine/cast.ll:test25, a case that occurs many times in spec llvm-svn: 19953	2005-01-31 05:51:45 +00:00
Chris Lattner	31f486c775	Implement the trivial cases in InstCombine/store.ll llvm-svn: 19950	2005-01-31 05:36:43 +00:00
Chris Lattner	fe1b0b8b24	Implement Transforms/InstCombine/cast-load-gep.ll, which allows us to devirtualize 11 indirect calls in perlbmk. llvm-svn: 19947	2005-01-31 04:50:46 +00:00
Chris Lattner	d8e20188c6	Adjust to changes in instruction interfaces. llvm-svn: 19900	2005-01-29 00:39:08 +00:00
Chris Lattner	a3f06fa2dd	Switchinst takes a hint for the number of cases it will have. llvm-svn: 19899	2005-01-29 00:38:45 +00:00
Chris Lattner	a35dfcedd3	switchinst ctor now takes a hint for the number of cases that it will have. llvm-svn: 19898	2005-01-29 00:38:26 +00:00
Chris Lattner	84d3137da7	Adjust Valuehandle to hold its operand directly in it. llvm-svn: 19897	2005-01-29 00:37:36 +00:00
Chris Lattner	cd517ff0c7	* add some DEBUG statements * Properly compile this: struct a {}; int test() { struct a b[2]; if (&b[0] != &b[1]) abort (); return 0; } to 'return 0', not abort(). llvm-svn: 19875	2005-01-28 19:32:01 +00:00
Alkis Evlogimenos	fbd921987f	Add a dependency to the trace library so that it gets pulled in automatically. llvm-svn: 19828	2005-01-25 16:23:57 +00:00
Chris Lattner	9e2c7facb2	Get rid of a several dozen more and instructions in specint. llvm-svn: 19786	2005-01-23 20:26:55 +00:00
Chris Lattner	fc4429e7c1	Handle comparisons of gep instructions that have different typed indices as long as they are the same size. llvm-svn: 19734	2005-01-21 23:06:49 +00:00
Chris Lattner	411336fe04	Add two optimizations. The first folds (X+Y)-X -> Y The second folds operations into selects, e.g. (select C, (X+Y), (Y+Z)) -> (Y+(select C, X, Z) This occurs a few times across spec, e.g. select add/sub mesa: 83 0 povray: 5 2 gcc 4 2 parser 0 22 perlbmk 13 30 twolf 0 3 llvm-svn: 19706	2005-01-19 21:50:18 +00:00
Chris Lattner	a3cc1835ad	Fix 'raise' to work with packed types. Patch by Morten Ofstad. llvm-svn: 19693	2005-01-19 16:16:35 +00:00
Chris Lattner	715364364b	Delete PHI nodes that are not dead but are locked in a cycle of single useness. llvm-svn: 19629	2005-01-17 05:10:15 +00:00
Chris Lattner	03f06f11aa	Move code out of indentation one level to make it easier to read. Disable the xform for < > cases. It turns out that the following is being miscompiled: bool %test(sbyte %S) { %T = cast sbyte %S to uint %V = setgt uint %T, 255 ret bool %V } llvm-svn: 19628	2005-01-17 03:20:02 +00:00
Chris Lattner	51726c47fe	Fix some bugs in an xform added yesterday. This fixes Prolangs-C/allroots. llvm-svn: 19553	2005-01-14 17:35:12 +00:00
Chris Lattner	7aa41cfa88	Fix a compile crash on spiff llvm-svn: 19552	2005-01-14 17:17:59 +00:00
Chris Lattner	4fa89827e2	if two gep comparisons only differ by one index, compare that index directly. This allows us to better optimize begin() -> end() comparisons in common cases. llvm-svn: 19542	2005-01-14 00:20:05 +00:00
Chris Lattner	d35d210ea0	Do not overrun iterators. This fixes a 176.gcc crash llvm-svn: 19541	2005-01-13 23:26:48 +00:00
Chris Lattner	a04c904c4c	Turn select C, (X+Y), (X-Y) --> (X+(select C, Y, (-Y))). This occurs in the 'sim' program and probably elsewhere. In sim, it comes up for cases like this: #define round(x) ((x)>0.0 ? (x)+0.5 : (x)-0.5) double G; void T(double X) { G = round(X); } (it uses the round macro a lot). This changes the LLVM code from: %tmp.1 = setgt double %X, 0.000000e+00 ; <bool> [#uses=1] %tmp.4 = add double %X, 5.000000e-01 ; <double> [#uses=1] %tmp.6 = sub double %X, 5.000000e-01 ; <double> [#uses=1] %mem_tmp.0 = select bool %tmp.1, double %tmp.4, double %tmp.6 store double %mem_tmp.0, double* %G to: %tmp.1 = setgt double %X, 0.000000e+00 ; <bool> [#uses=1] %mem_tmp.0.p = select bool %tmp.1, double 5.000000e-01, double -5.000000e-01 %mem_tmp.0 = add double %mem_tmp.0.p, %X store double %mem_tmp.0, double* %G ret void llvm-svn: 19537	2005-01-13 22:52:24 +00:00
Chris Lattner	81e8417614	Implement an optimization for == and != comparisons like this: _Bool test2(int X, int Y) { return &arr[X][Y] == arr; } instead of generating this: bool %test2(int %X, int %Y) { %tmp.3.idx = mul int %X, 160 ; <int> [#uses=1] %tmp.3.idx1 = shl int %Y, ubyte 2 ; <int> [#uses=1] %tmp.3.offs2 = sub int 0, %tmp.3.idx ; <int> [#uses=1] %tmp.7 = seteq int %tmp.3.idx1, %tmp.3.offs2 ; <bool> [#uses=1] ret bool %tmp.7 } generate this: bool %test2(int %X, int %Y) { seteq int %X, 0 ; <bool>:0 [#uses=1] seteq int %Y, 0 ; <bool>:1 [#uses=1] %tmp.7 = and bool %0, %1 ; <bool> [#uses=1] ret bool %tmp.7 } This idiom occurs in C++ programs when iterating from begin() to end(), in a vector or array. For example, we now compile this: void test(int X, int Y) { for (int i = arr; i != arr+100; ++i) foo(i); } to this: no_exit: ; preds = %entry, %no_exit ... %exitcond = seteq uint %indvar.next, 100 ; <bool> [#uses=1] br bool %exitcond, label %return, label %no_exit instead of this: no_exit: ; preds = %entry, %no_exit ... %inc5 = getelementptr [100 x [40 x int]]* %arr, int 0, int 0, int %inc.rec ; <int> [#uses=1] %tmp.8 = seteq int %inc5, getelementptr ([100 x [40 x int]]* %arr, int 0, int 100, int 0) ; <bool> [#uses=1] %indvar.next = add uint %indvar, 1 ; <uint> [#uses=1] br bool %tmp.8, label %return, label %no_exit llvm-svn: 19536	2005-01-13 22:25:21 +00:00
Chris Lattner	4cb9fa373b	Fix some bugs in code I didn't mean to check in. llvm-svn: 19534	2005-01-13 20:40:58 +00:00
Chris Lattner	0798af33a5	Fix a crash compiling 129.compress llvm-svn: 19533	2005-01-13 20:14:25 +00:00
Reid Spencer	134f02d0c7	Add the LOADABLE_MODULE=1 directive to indicate that this shared library is intended to be a dlopenable module and not a "plain" shared library. llvm-svn: 19456	2005-01-11 04:33:32 +00:00
Jeff Cohen	3e62e7c68b	Apply feedback from Chris. llvm-svn: 19432	2005-01-10 04:23:32 +00:00
Chris Lattner	798e84f59e	Fix VS warnings llvm-svn: 19383	2005-01-08 19:48:40 +00:00
Chris Lattner	46fa04b531	Fix VS warnings. llvm-svn: 19382	2005-01-08 19:45:31 +00:00
Chris Lattner	fdfe3e49fe	Fix uint64_t -> unsigned VS warnings. llvm-svn: 19381	2005-01-08 19:42:22 +00:00
Chris Lattner	47f395cd85	Silence VS warnings. llvm-svn: 19380	2005-01-08 19:37:20 +00:00
Chris Lattner	ce274ce93d	Silence warnings llvm-svn: 19379	2005-01-08 19:34:41 +00:00
Jeff Cohen	677babc4d4	Add more missing createXxxPass functions. llvm-svn: 19370	2005-01-08 17:21:40 +00:00
Misha Brukman	417ca179a9	Convert tabs to spaces llvm-svn: 19320	2005-01-07 07:05:34 +00:00
Jeff Cohen	9a7ac16214	Add missing createXxxPass functions llvm-svn: 19319	2005-01-07 06:57:28 +00:00
Jeff Cohen	844410b48e	Add missing include llvm-svn: 19315	2005-01-07 05:42:13 +00:00
Jeff Cohen	eca0d0f2da	Put createLoopUnswitchPass() into proper namespace llvm-svn: 19306	2005-01-06 05:47:18 +00:00
Jeff Cohen	27595a4aec	Add missing include llvm-svn: 19305	2005-01-06 05:46:44 +00:00
Chris Lattner	86102b8ad5	This is a bulk commit that implements the following primary improvements: * We can now fold cast instructions into select instructions that have at least one constant operand. * We now optimize expressions more aggressively based on bits that are known to be zero. These optimizations occur a lot in code that uses bitfields even in simple ways. * We now turn more cast-cast sequences into AND instructions. Before we would only do this if it if all types were unsigned. Now only the middle type needs to be unsigned (guaranteeing a zero extend). * We transform sign extensions into zero extensions in several cases. This corresponds to these test/Regression/Transforms/InstCombine testcases: 2004-11-22-Missed-and-fold.ll and.ll: test28-29 cast.ll: test21-24 and-or-and.ll cast-cast-to-and.ll zeroext-and-reduce.ll llvm-svn: 19220	2005-01-01 16:22:27 +00:00
Chris Lattner	3215bb6049	Implement SimplifyCFG/DeadSetCC.ll SimplifyCFG is one of those passes that we use for final cleanup: it should not rely on other passes to clean up its garbage. This fixes the "why are trivially dead setcc's in the output of gccas" problem. llvm-svn: 19212	2005-01-01 16:02:12 +00:00
Chris Lattner	13516fe2e7	Fix PR491 and testcase Transforms/DeadStoreElimination/2004-12-28-PartialStore.ll llvm-svn: 19180	2004-12-29 04:36:02 +00:00
Chris Lattner	b17f3e13ec	Adjust to new interfaces llvm-svn: 18958	2004-12-15 07:22:25 +00:00
Chris Lattner	9ad0d55025	Constant exprs are not efficiently negatable in practice. This disables turning X - (constantexpr) into X + (-constantexpr) among other things. llvm-svn: 18935	2004-12-14 20:08:06 +00:00
Brian Gaeke	f9639d2a74	Fix link error in PPC optimized build of 'opt'. llvm-svn: 18913	2004-12-13 21:28:39 +00:00
Chris Lattner	8f430a3b59	Get rid of getSizeOf, using ConstantExpr::getSizeOf instead. do not insert a prototype for malloc of: void* malloc(uint): on 64-bit u targets this is not correct. Instead of prototype it as void *malloc(...), and pass the correct intptr_t through the "...". Finally, fix Regression/CodeGen/SparcV9/2004-12-13-MallocCrash.ll, by not forming constantexpr casts from pointer to uint. llvm-svn: 18908	2004-12-13 20:00:02 +00:00
Chris Lattner	a199e3c1e2	Change indentation of a whole bunch of code, no real changes here. llvm-svn: 18843	2004-12-12 23:49:37 +00:00
Chris Lattner	14d07db44d	More substantial simplifications and speedups. This makes ADCE about 20% faster in some cases. llvm-svn: 18842	2004-12-12 23:40:17 +00:00
Chris Lattner	9115eb3024	More minor microoptimizations llvm-svn: 18841	2004-12-12 22:44:30 +00:00
Chris Lattner	d4298781c1	Remove some more set operations llvm-svn: 18840	2004-12-12 22:22:18 +00:00
Chris Lattner	a538439bf0	Reduce number of set operations. llvm-svn: 18839	2004-12-12 22:16:13 +00:00
Chris Lattner	bf5b7cf638	Optimize div/rem + select combinations more. In particular, implement div.ll:test10 and rem.ll:test4. llvm-svn: 18838	2004-12-12 21:48:58 +00:00
Chris Lattner	745196a5fc	Properly implement copying of a global, fixing the 255.vortex & povray failures from last night. llvm-svn: 18832	2004-12-12 19:34:41 +00:00
Chris Lattner	88deefa303	Simplify code and do not invalidate iterators. This fixes a crash compiling TimberWolfMC that was exposed due to recent optimizer changes. llvm-svn: 18831	2004-12-12 18:23:20 +00:00
Chris Lattner	1cbd5be7a1	Though the previous xform applies to literally dozens (hundreds?) of variables in SPEC, the subsequent optimziations that we are after don't play with with FP values, so disable this xform for them. Really we just don't want stuff like: double G; (always 0 or 412312.312) = G; turning into: bool G_b; = G_b ? 412312.312 : 0; We'd rather just do the load. -Chris llvm-svn: 18819	2004-12-12 06:03:06 +00:00
Chris Lattner	40e4cec9ee	If a variable can only hold two values, and is not already a bool, shrink it down to actually BE a bool. This allows simple value range propagation stuff work harder, deleting comparisons in bzip2 in some hot loops. This implements GlobalOpt/integer-bool.ll, which is the essence of the loop condition distilled into a testcase. llvm-svn: 18817	2004-12-12 05:53:50 +00:00
Chris Lattner	cbc0161d1f	If one side of and/or is known to be 0/-1, it doesn't matter if the other side is overdefined. This allows us to fold conditions like: if (X < Y \|\| Y > Z) in some cases. llvm-svn: 18807	2004-12-11 23:15:19 +00:00
Chris Lattner	263b0a1669	Only cound if we actually made a change. llvm-svn: 18800	2004-12-11 17:00:14 +00:00
Chris Lattner	ffefea0772	The split bb is really the exit of the old function llvm-svn: 18799	2004-12-11 16:59:54 +00:00
Chris Lattner	2f687fd9d6	Two bug fixes: 1. Actually increment the Statistic for the GV elim optzn 2. When resolving undef branches, only resolve branches in executable blocks, avoiding marking a bunch of completely dead blocks live. This has a big impact on the quality of the generated code. With this patch, we positively rip up vortex, compiling Ut_MoveBytes to a single memcpy call. In vortex we get this: 12 ipsccp - Number of globals found to be constant 986 ipsccp - Number of arguments constant propagated 1378 ipsccp - Number of basic blocks unreachable 8919 ipsccp - Number of instructions removed llvm-svn: 18796	2004-12-11 06:05:53 +00:00
Chris Lattner	8525ebe465	Do not delete the entry block to a function. llvm-svn: 18795	2004-12-11 05:32:19 +00:00
Chris Lattner	91dbae6fee	Implement Transforms/SCCP/ipsccp-gvar.ll, by tracking values stored to non-address-taken global variables. llvm-svn: 18790	2004-12-11 05:15:59 +00:00
Chris Lattner	99e1295645	Fix a bug where we could delete dead invoke instructions with uses. In functions where we fully constant prop the return value, replace all ret instructions with 'ret undef'. llvm-svn: 18786	2004-12-11 02:53:57 +00:00
Chris Lattner	bae4b64553	Implement SCCP/ipsccp-conditional.ll, by totally deleting dead blocks. llvm-svn: 18781	2004-12-10 22:29:08 +00:00
Chris Lattner	7285f43836	Fix SCCP/2004-12-10-UndefBranchBug.ll llvm-svn: 18776	2004-12-10 20:41:50 +00:00
Chris Lattner	4fc998da2e	Fix Regression/Transforms/SimplifyCFG/2004-12-10-SimplifyCFGCrash.ll, and the failure on make_dparser last night. llvm-svn: 18766	2004-12-10 17:42:31 +00:00
Chris Lattner	b439464c61	This is the initial implementation of IPSCCP, as requested by Brian. This implements SCCP/ipsccp-basic.ll, rips apart Olden/mst (as described in PR415), and does other nice things. There is still more to come with this, but it's a start. llvm-svn: 18752	2004-12-10 08:02:06 +00:00
Chris Lattner	36d39cecb4	note to self: Do not check in debugging code! llvm-svn: 18693	2004-12-09 07:15:52 +00:00
Chris Lattner	f17a2fb849	Implement trivial sinking for load instructions. This causes us to sink 567 loads in spec llvm-svn: 18692	2004-12-09 07:14:34 +00:00
Chris Lattner	39c98bb31c	Do extremely simple sinking of instructions when they are only used in a successor block. This turns cases like this: x = a op b if (c) { use x } into: if (c) { x = a op b use x } This triggers 3965 times in spec, and is tested by Regression/Transforms/InstCombine/sink_instruction.ll This appears to expose a bug in the X86 backend for 177.mesa, which I'm looking in to. llvm-svn: 18677	2004-12-08 23:43:58 +00:00
Alkis Evlogimenos	a1291a0679	Fix this regression and remove the XFAIL from this test. llvm-svn: 18674	2004-12-08 23:10:30 +00:00
Chris Lattner	8f30caf549	Fix Transforms/InstCombine/2004-12-08-RemInfiniteLoop.ll llvm-svn: 18670	2004-12-08 22:20:34 +00:00
Chris Lattner	674ce86cd0	Add support for compilers without argument dependent name lookup, contributed by Bjørn Wennberg llvm-svn: 18627	2004-12-08 16:12:20 +00:00
Chris Lattner	407000c497	Remove unneeded class qualifier, contributed by Bjørn Wennberg llvm-svn: 18625	2004-12-08 16:05:02 +00:00
Reid Spencer	9273d480ad	For PR387:\ Add doInitialization method to avoid overloaded virtuals llvm-svn: 18602	2004-12-07 08:11:36 +00:00
Chris Lattner	9019e5cfa0	Implement stripping of debug symbols, making the --strip-debug options in gccas/gccld more than just a noop. llvm-svn: 18456	2004-12-03 16:22:08 +00:00
Chris Lattner	e8ebcb3300	Initial reimplementation of the -strip pass, with a stub for implementing -S llvm-svn: 18440	2004-12-02 21:25:03 +00:00
Chris Lattner	a4c9808603	This pass is moving to lib IPO llvm-svn: 18439	2004-12-02 21:24:40 +00:00
Chris Lattner	c0677c081d	Implement a FIXME by checking to make sure that a malloc is not being used in scary and unknown ways before we promote it. This fixes the miscompilation of 188.ammp that has been plauging us since a globalopt patch went in. Thanks a ton to Tanya for helping me diagnose the problem! llvm-svn: 18418	2004-12-02 07:11:07 +00:00
Chris Lattner	3b18139b3c	Fix a minor bug where we set a var to initialized on malloc, not on store. This doesn't fix anything that I'm aware of, just noticed it by inspection llvm-svn: 18417	2004-12-02 06:25:58 +00:00
Chris Lattner	951673a94c	This pass is completely broken. llvm-svn: 18387	2004-11-30 17:09:06 +00:00
Chris Lattner	019445715e	Squelch warning llvm-svn: 18381	2004-11-30 07:47:34 +00:00
Chris Lattner	868ae13dc0	Fix test/Regression/Transforms/LICM/2004-09-14-AliasAnalysisInvalidate.llx This only fails on darwin or on X86 under valgrind. llvm-svn: 18377	2004-11-30 07:01:15 +00:00
Chris Lattner	fd8cbc257e	Alkis noticed that this variable is dead. Thanks! llvm-svn: 18369	2004-11-30 04:01:44 +00:00
Chris Lattner	389cfac0d1	If we have something like this: if (x) { code ... } else { code ... } Turn it into: code if (x) { ... } else { ... } This reduces code size and in some common cases allows us to completely eliminate the conditional. This turns several if/then/else blocks in loops into straightline code in 179.art, turning the loops into single basic blocks (good for modsched even!). Maybe now brg will leave me alone ;-) llvm-svn: 18366	2004-11-30 00:29:14 +00:00
Chris Lattner	6e455608e2	Allow hoisting loads of globals and alloca's in conditionals. llvm-svn: 18363	2004-11-29 21:26:12 +00:00
Reid Spencer	279fa256a2	Fix for PR454: * Make sure we handle signed to unsigned conversion correctly * Move this visitSetCondInst case to its own method. llvm-svn: 18312	2004-11-28 21:31:15 +00:00
Chris Lattner	6ea2888832	Make DSE potentially more aggressive by being more specific about alloca sizes. llvm-svn: 18309	2004-11-28 20:44:37 +00:00
Chris Lattner	14f3cdc227	Implement Regression/Transforms/InstCombine/getelementptr_cast.ll, which occurs many times in crafty llvm-svn: 18273	2004-11-27 17:55:46 +00:00
Chris Lattner	b137409926	Provide size information when checking to see if we can LICM a load, this allows us to hoist more loads in some cases. llvm-svn: 18265	2004-11-26 21:20:09 +00:00
Chris Lattner	540e5f92b4	Do not count debugger intrinsics in size estimation. llvm-svn: 18110	2004-11-22 17:23:57 +00:00
Chris Lattner	79e87e39eb	Ignore debugger intrinsics when doing inlining size computations. llvm-svn: 18109	2004-11-22 17:21:44 +00:00
Chris Lattner	6d048a0d32	Do not consider debug intrinsics in the size computations for loop unrolling. Patch contributed by Michael McCracken! llvm-svn: 18108	2004-11-22 17:18:36 +00:00
Misha Brukman	72a57c3259	Allow constructor parameter to override aggregating args; fix spacing llvm-svn: 18028	2004-11-20 02:20:27 +00:00
Chris Lattner	446948e094	Fix the exposed prototype for the lower packed pass, thanks to Morten Ofstad. llvm-svn: 17996	2004-11-19 16:49:34 +00:00
Chris Lattner	d137be2d0d	CPR is dead. llvm-svn: 17992	2004-11-19 16:24:57 +00:00
Chris Lattner	953075442d	Delete stoppoints that occur for the same source line. llvm-svn: 17970	2004-11-18 21:41:39 +00:00
Chris Lattner	c08ac110df	Check in hook that I forgot llvm-svn: 17956	2004-11-18 17:24:20 +00:00
Chris Lattner	27af257ea0	Do not delete dead invoke instructions! llvm-svn: 17897	2004-11-16 16:32:28 +00:00
Reid Spencer	9339638e9c	Remove unused variable for compilation by VC++. Patch contributed by Morten Ofstad. llvm-svn: 17830	2004-11-15 17:29:41 +00:00
Chris Lattner	1890f94413	Minor cleanups. There is no reason for SCCP to derive from instvisitor anymore. llvm-svn: 17825	2004-11-15 07:15:04 +00:00
Chris Lattner	9a038a3a5e	Count more accurately llvm-svn: 17824	2004-11-15 07:02:42 +00:00
Chris Lattner	97013636cd	Quiet warnings on the persephone tester llvm-svn: 17821	2004-11-15 05:54:07 +00:00
Chris Lattner	d18c16b842	Two minor improvements: 1. Speedup getValueState by having it not consider Arguments. It's better to just add them before we start SCCP'ing. 2. SCCP can delete the contents of dead blocks. No really, it's ok! This reduces the size of the IR for subsequent passes, even though simplifycfg would do the same job. In practice, simplifycfg does not run until much later than sccp in gccas llvm-svn: 17820	2004-11-15 05:45:33 +00:00
Chris Lattner	4f0316229c	rename InstValue to LatticeValue, as it holds for more than instructions. llvm-svn: 17818	2004-11-15 05:03:30 +00:00
Chris Lattner	074be1f6e4	Substantially refactor the SCCP class into an SCCP pass and an SCCPSolver class. The only changes are minor: * Do not try to SCCP instructions that return void in the rewrite loop. This is silly and fool hardy, wasting a map lookup and adding an entry to the map which is never used. * If we decide something has an undefined value, rewrite it to undef, potentially leading to further simplications. llvm-svn: 17816	2004-11-15 04:44:20 +00:00
Chris Lattner	28eeb73f2f	If a global is just loaded and restored, realize that it is not changing value. This allows us to turn more globals into constants and eliminate them. This patch implements GlobalOpt/load-store-global.llx. Note that this patch speeds up 255.vortex from: Output/255.vortex.out-cbe.time:program 7.640000 Output/255.vortex.out-llc.time:program 9.810000 to: Output/255.vortex.out-cbe.time:program 7.250000 Output/255.vortex.out-llc.time:program 9.490000 Which isn't bad at all! llvm-svn: 17746	2004-11-14 20:50:30 +00:00
Chris Lattner	46dd5a6304	This optimization makes MANY phi nodes that all have the same incoming value. If this happens, detect it early instead of relying on instcombine to notice it later. This can be a big speedup, because PHI nodes can have many incoming values. llvm-svn: 17741	2004-11-14 19:29:34 +00:00
Chris Lattner	7515cabe2a	Implement instcombine/phi.ll:test6 - pulling operations through PHI nodes. This exposes subsequent optimization possiblities and reduces code size. This triggers 1423 times in spec. llvm-svn: 17740	2004-11-14 19:13:23 +00:00
Chris Lattner	15ff1e1885	Transform this: %X = alloca ... %Y = alloca ... X == Y into false. This allows us to simplify some stuff in eon (and probably many other C++ programs) where operator= was checking for self assignment. Folding this allows us to SROA several additional structs. llvm-svn: 17735	2004-11-14 07:33:16 +00:00
Chris Lattner	5a8b003a09	Remove note to self llvm-svn: 17734	2004-11-14 06:57:47 +00:00
Chris Lattner	af555adc15	If a function always returns a constant, replace all calls sites with that constant value. This makes the return value dead and allows for simplification in the caller. This implements IPConstantProp/return-constant.ll This triggers several dozen times throughout SPEC. llvm-svn: 17730	2004-11-14 06:10:11 +00:00
Chris Lattner	fe3f4e6ebd	Teach SROA how to promote an array index that is variable, if the dimension of the array is just two. This occurs 8 times in gcc, 6 times in crafty, and 12 times in 099.go. This implements ScalarRepl/sroa_two.ll llvm-svn: 17727	2004-11-14 05:00:19 +00:00
Chris Lattner	8881912d71	Rearrange some code, no functionality changes. llvm-svn: 17724	2004-11-14 04:24:28 +00:00
Chris Lattner	9fa7f0ae0a	Remove debugging code llvm-svn: 17719	2004-11-13 23:32:53 +00:00
Chris Lattner	244031d306	Argument promotion transforms functions to unconditionally load their argument pointers. This is only valid to do if the function already unconditionally loaded an argument or if the pointer passed in is known to be valid. Make sure to do the required checks. This fixed ArgumentPromotion/control-flow.ll and the Burg program. llvm-svn: 17718	2004-11-13 23:31:34 +00:00
Chris Lattner	8c3e7b92af	Simplify handling of shifts to be the same as we do for adds. Add support for (X * C1) + (X * C2) (where * can be mul or shl), allowing us to fold: Y+Y+Y+Y+Y+Y+Y+Y into %tmp.8 = shl long %Y, ubyte 3 ; <long> [#uses=1] instead of %tmp.4 = shl long %Y, ubyte 2 ; <long> [#uses=1] %tmp.12 = shl long %Y, ubyte 2 ; <long> [#uses=1] %tmp.8 = add long %tmp.4, %tmp.12 ; <long> [#uses=1] This implements add.ll:test25 Also add support for (XC1)-(XC2) -> X*(C1-C2), implementing sub.ll:test18 llvm-svn: 17704	2004-11-13 19:50:12 +00:00
Chris Lattner	4efe20a103	Fold: (X + (X << C2)) --> X * ((1 << C2) + 1) ((X << C2) + X) --> X * ((1 << C2) + 1) This means that we now canonicalize "Y+Y+Y" into: %tmp.2 = mul long %Y, 3 ; <long> [#uses=1] instead of: %tmp.10 = shl long %Y, ubyte 1 ; <long> [#uses=1] %tmp.6 = add long %Y, %tmp.10 ; <long> [#uses=1] llvm-svn: 17701	2004-11-13 19:31:40 +00:00
Chris Lattner	2858e17538	Lazily create the abort message, so only translation units that use unwind will actually get it. llvm-svn: 17700	2004-11-13 19:07:32 +00:00
Chris Lattner	9b0291b18d	Fix: CodeExtractor/2004-11-12-InvokeExtract.ll llvm-svn: 17699	2004-11-13 00:06:45 +00:00
Chris Lattner	5bcca6058a	Fix a bug where the code extractor would get a bit confused handling invoke instructions, setting DefBlock to a block it did not have dom info for. llvm-svn: 17697	2004-11-12 23:50:44 +00:00
Chris Lattner	5c1d84c769	Simplify handling of constant initializers llvm-svn: 17696	2004-11-12 22:42:57 +00:00
Chris Lattner	9621dfab3f	Actually, leave the check in. This prevents us from counting dead arguments as IPCP opportunities. llvm-svn: 17680	2004-11-11 07:47:54 +00:00
Chris Lattner	5fa696f8e4	Fix bug: IPConstantProp/deadarg.ll llvm-svn: 17679	2004-11-11 07:46:29 +00:00
Chris Lattner	c1d24cd859	Make IP Constant prop more aggressive about handling self recursive calls. This implements IPConstantProp/recursion.ll llvm-svn: 17666	2004-11-10 19:43:59 +00:00
Chris Lattner	0d3773d8b1	Do not let dead constant expressions hanging off of functions prevent IPCP. This allows to elimination of a bunch of global pool descriptor args from programs being pool allocated (and is also generally useful!) llvm-svn: 17657	2004-11-09 20:47:30 +00:00
Chris Lattner	436285e75d	Change this back so that I get stable numbers to reflect the change from the nightly testers llvm-svn: 17646	2004-11-09 08:05:23 +00:00
Chris Lattner	1f0a97c6cb	Fix bug: 2004-11-08-FreeUseCrash.ll llvm-svn: 17642	2004-11-09 05:10:56 +00:00
Chris Lattner	49fa1ecd04	VERY large functions that are only called from one place are not really exciting to inline. Only inline medium or small sized functions with a single call site. llvm-svn: 17588	2004-11-07 21:46:47 +00:00
Chris Lattner	595016d090	This is V9 specific, move it there. llvm-svn: 17545	2004-11-07 00:39:26 +00:00
Chris Lattner	3c670cb65a	Remove dead vars llvm-svn: 17482	2004-11-05 04:46:22 +00:00
Chris Lattner	33eb909939	Fix some warnings on VC++ llvm-svn: 17481	2004-11-05 04:45:43 +00:00
Chris Lattner	96f6616479	* Rearrange code slightly * Disable broken transforms for simplifying (setcc (cast X to larger), CI) where CC is not != or == llvm-svn: 17422	2004-11-02 03:50:32 +00:00
Chris Lattner	8af7424920	Speed up the tail duplication pass on the testcase below from 68.2s to 1.23s: #define CL0(a) case a: f(); goto c; #define CL1(a) CL0(a##0) CL0(a##1) CL0(a##2) CL0(a##3) CL0(a##4) CL0(a##5) \ CL0(a##6) CL0(a##7) CL0(a##8) CL0(a##9) #define CL2(a) CL1(a##0) CL1(a##1) CL1(a##2) CL1(a##3) CL1(a##4) CL1(a##5) \ CL1(a##6) CL1(a##7) CL1(a##8) CL1(a##9) #define CL3(a) CL2(a##0) CL2(a##1) CL2(a##2) CL2(a##3) CL2(a##4) CL2(a##5) \ CL2(a##6) CL2(a##7) CL2(a##8) CL2(a##9) #define CL4(a) CL3(a##0) CL3(a##1) CL3(a##2) CL3(a##3) CL3(a##4) CL3(a##5) \ CL3(a##6) CL3(a##7) CL3(a##8) CL3(a##9) void f(); void a() { int b; c: switch (b) { CL4(1) } } This comes from GCC PR 15524 llvm-svn: 17390	2004-11-01 07:05:07 +00:00
Chris Lattner	93d1e39f3e	Do not compute the predecessor list for a block unless we need it. This speeds up simplifycfg on this program, from 44.87s to 0.29s (with a profiled build): #define CL0(a) case a: goto c; #define CL1(a) CL0(a##0) CL0(a##1) CL0(a##2) CL0(a##3) CL0(a##4) CL0(a##5) \ CL0(a##6) CL0(a##7) CL0(a##8) CL0(a##9) #define CL2(a) CL1(a##0) CL1(a##1) CL1(a##2) CL1(a##3) CL1(a##4) CL1(a##5) \ CL1(a##6) CL1(a##7) CL1(a##8) CL1(a##9) #define CL3(a) CL2(a##0) CL2(a##1) CL2(a##2) CL2(a##3) CL2(a##4) CL2(a##5) \ CL2(a##6) CL2(a##7) CL2(a##8) CL2(a##9) #define CL4(a) CL3(a##0) CL3(a##1) CL3(a##2) CL3(a##3) CL3(a##4) CL3(a##5) \ CL3(a##6) CL3(a##7) CL3(a##8) CL3(a##9) void f(); void a() { int b; c: switch (b) { CL4(1) } } This testcase is contrived to expose N^2 behavior, but this patch should speedup simplifycfg on any programs that use large switch statements. This testcase comes from GCC PR17895. llvm-svn: 17389	2004-11-01 06:53:58 +00:00
Reid Spencer	57cbe39d1e	Change Library Names Not To Conflict With Others When Installed llvm-svn: 17286	2004-10-27 23:18:45 +00:00
Chris Lattner	7dfc2d29ac	Convert 'struct' to 'class' in various places to adhere to the coding standards and work better with VC++. Patch contributed by Morten Ofstad! llvm-svn: 17281	2004-10-27 16:14:51 +00:00
Chris Lattner	70c2039b39	Hrm, this code was severely botched. As it turns out, this patch: http://mail.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20041018/019708.html exposed ANOTHER latent bug in this xform, which caused Prolangs-C/bison to fill the zion nightly tester disk up and make the tester barf. This is obviously not a good thing, so lets fix this bug shall we? :) llvm-svn: 17276	2004-10-27 05:57:15 +00:00
Chris Lattner	845afe9b20	Initialize with the correct constant type llvm-svn: 17270	2004-10-27 03:55:24 +00:00
Chris Lattner	d57638c4a7	Fix compatibility with MSVC, patch by Morten Ofstad llvm-svn: 17218	2004-10-25 18:45:16 +00:00
Reid Spencer	fad217c847	Eliminate compilation warning on uninitialized variable. llvm-svn: 17163	2004-10-22 16:10:39 +00:00
Chris Lattner	fe9abf92de	* empty log message * llvm-svn: 17161	2004-10-22 06:43:28 +00:00
Chris Lattner	5c3c21e10a	Fix a bug Nate noticed, where we miscompiled a simple testcase llvm-svn: 17157	2004-10-22 04:53:16 +00:00
Reid Spencer	c1c320c335	We won't use automake llvm-svn: 17155	2004-10-22 03:35:04 +00:00
Brian Gaeke	c9d8b4d45c	Explain what this pass does. llvm-svn: 17146	2004-10-20 19:38:58 +00:00
Chris Lattner	257b284038	Hrm, some people complain when the compiler cheerfully tells them what it's doing... I guess they're right. llvm-svn: 17142	2004-10-19 06:33:16 +00:00
Reid Spencer	6a11a75f31	Initial automake generated Makefile template llvm-svn: 17136	2004-10-18 23:55:41 +00:00
Nate Begeman	b18121e6a9	Initial implementation of the strength reduction for GEP instructions in loops. This optimization is not turned on by default yet, but may be run with the opt tool's -loop-reduce flag. There are many FIXMEs listed in the code that will make it far more applicable to a wide range of code, but you have to start somewhere :) This limited version currently triggers on the following tests in the MultiSource directory: pcompress2: 7 times cfrac: 5 times anagram: 2 times ks: 6 times yacr2: 2 times llvm-svn: 17134	2004-10-18 21:08:22 +00:00
Chris Lattner	88a8a329c3	Get this file compiling with VC++, patch contributed by Morten Ofstad. Thanks Morten! llvm-svn: 17125	2004-10-18 15:43:46 +00:00
Reid Spencer	ce0783318b	Correction to allow compilation with Visual C++. Patch contributed by Morten Ofstad. Thanks Morten! llvm-svn: 17123	2004-10-18 14:38:48 +00:00
Chris Lattner	5edb2f32d0	Simplify code by deleting instructions that preceed unreachable instructions. Simplify code by simplifying terminators that branch to blocks that start with an unreachable instruction. llvm-svn: 17116	2004-10-18 04:07:22 +00:00
Chris Lattner	a67dd32004	Turn store -> null/undef into the LLVM unreachable instruction! This simple change hacks off 10K of bytecode from perlbmk (.5%) even though the front-end is not generating them yet and we are not optimizing the resultant code. This isn't too bad. llvm-svn: 17111	2004-10-18 03:00:50 +00:00
Chris Lattner	8ba9ec9bbb	Turn things with obviously undefined semantics into 'store -> null' llvm-svn: 17110	2004-10-18 02:59:09 +00:00
Chris Lattner	3b92f17165	My friend the invoke instruction does not dominate all basic blocks if it occurs in the entry node of a function llvm-svn: 17109	2004-10-18 01:48:31 +00:00
Chris Lattner	34ae670706	Fix a bug that occurs when the constant value is the result of an invoke. In particular, invoke ret values are only live in the normal dest of the invoke not in the unwind dest. llvm-svn: 17108	2004-10-18 01:21:17 +00:00
Chris Lattner	6a792feb02	Getting ADCE to interact well with unreachable instructions seems like a nontrivial exercise that I'm not interested in tackling right now. Just punt and treat them like unwind's. This 'fixes' test/Regression/Transforms/ADCE/unreachable-function.ll llvm-svn: 17106	2004-10-17 23:45:06 +00:00
Chris Lattner	6e79e55aea	Fix Regression/Transforms/Inline/2004-10-17-InlineFunctionWithoutReturn.ll If a function had no return instruction in it, and the result of the inlined call instruction was used, we would crash. llvm-svn: 17104	2004-10-17 23:21:07 +00:00
Chris Lattner	107c15c33d	Remove printout, realize that instructions in the entry block dominate all other blocks. llvm-svn: 17099	2004-10-17 21:31:34 +00:00
Chris Lattner	215c7ebaa6	When inserting PHI nodes, don't insert any phi nodes that are obviously unneccesary. This allows us to delete several hundred phi nodes of the form PHI(x,x,x,undef) from 253.perlbmk and probably other programs as well. This implements Mem2Reg/UndefValuesMerge.ll llvm-svn: 17098	2004-10-17 21:25:56 +00:00
Chris Lattner	96db59e48a	Enhance hasConstantValue to ignore undef values in phi nodes. This allows it to think that PHI[4, undef] == 4. llvm-svn: 17096	2004-10-17 21:23:26 +00:00
Chris Lattner	e29d634a94	hasConstantValue will soon return instructions that don't dominate the PHI node, so prepare for this. llvm-svn: 17095	2004-10-17 21:22:38 +00:00
Chris Lattner	67f0545daf	Fix a type violation llvm-svn: 17069	2004-10-16 23:28:04 +00:00
Chris Lattner	684c5c6587	Kill the bogon that slipped into my buffer before I committed. llvm-svn: 17067	2004-10-16 19:46:33 +00:00
Chris Lattner	6580e09fef	Implement InstCombine/getelementptr.ll:test9, which is the source of many ugly and giant constnat exprs in some programs. llvm-svn: 17066	2004-10-16 19:44:59 +00:00
Chris Lattner	98e541457b	Add support for unreachable llvm-svn: 17056	2004-10-16 18:21:33 +00:00
Chris Lattner	81a7a23494	Optimize instructions involving undef values. For example X+undef == undef. llvm-svn: 17047	2004-10-16 18:11:37 +00:00
Chris Lattner	7e6d4a12b5	Add support for UndefValue llvm-svn: 17046	2004-10-16 18:10:31 +00:00
Chris Lattner	c0e2e82477	When promoting mem2reg, make uninitialized values become undef isntead of 0. llvm-svn: 17045	2004-10-16 18:10:06 +00:00
Chris Lattner	646354bae1	Handle undef values as undefined on the constant lattice ignore unreachable instructions llvm-svn: 17044	2004-10-16 18:09:41 +00:00
Chris Lattner	6ac3ef950d	Add note llvm-svn: 17043	2004-10-16 18:09:25 +00:00
Chris Lattner	8e71c6a33d	Add support for the undef value. Implement a new optimization based on globals that are initialized with undef. When promoting malloc to a global, start out initialized to undef llvm-svn: 17042	2004-10-16 18:09:00 +00:00
Chris Lattner	5d33e8e73a	Fix a bug John tracked down in libstdc++ where we were incorrectly deleting weak functions. Thanks for finding this John! llvm-svn: 16997	2004-10-14 19:53:50 +00:00
Chris Lattner	45c35b1d1f	When converting phi nodes into select instructions, we shouldn't promote PHI nodes unless we KNOW that we are able to promote all of them. This fixes: test/Regression/Transforms/SimplifyCFG/PhiNoEliminate.ll llvm-svn: 16973	2004-10-14 05:13:36 +00:00
Reid Spencer	ace94df71f	Update to reflect changes in Makefile rules. llvm-svn: 16950	2004-10-13 11:46:52 +00:00
Chris Lattner	00648e1f86	Transform memmove -> memcpy when the source is obviously constant memory. llvm-svn: 16932	2004-10-12 04:52:52 +00:00
Chris Lattner	7cabf6f87a	Fix a REALLY obscure bug in my previous checkin, which was splicing the END marker from one ilist into the middle of another basic block! llvm-svn: 16925	2004-10-12 01:02:29 +00:00
Chris Lattner	9776f7259b	Handle a common case more carefully. In particular, instead of transforming pointer recurrences into expressions from this: %P_addr.0.i.0 = phi sbyte* [ getelementptr ([8 x sbyte]* %.str_1, int 0, int 0), %entry ], [ %inc.0.i, %no_exit.i ] %inc.0.i = getelementptr sbyte* %P_addr.0.i.0, int 1 ; <sbyte> [#uses=2] into this: %inc.0.i = getelementptr sbyte getelementptr ([8 x sbyte]* %.str_1, int 0, int 0), int %inc.0.i.rec Actually create something nice, like this: %inc.0.i = getelementptr [8 x sbyte]* %.str_1, int 0, int %inc.0.i.rec llvm-svn: 16924	2004-10-11 23:06:50 +00:00
Chris Lattner	a92af96c56	Reenable the transform, turning X/-10 < 1 into X > -10 llvm-svn: 16918	2004-10-11 19:40:04 +00:00
Chris Lattner	004e250cd2	This patch implements two things (sorry). First, it allows SRA of globals that have embedded arrays, implementing GlobalOpt/globalsra-partial.llx. This comes up infrequently, but does allow, for example, deleting several stores to dead parts of globals in dhrystone. Second, this implements GlobalOpt/malloc-promote-.llx, which is the following nifty transformation: Basically if a global pointer is initialized with malloc, and we can tell that the program won't notice, we transform this: struct foo FooPtr; ... FooPtr = malloc(sizeof(struct foo)); ... FooPtr->A FooPtr->B Into: struct foo FooPtrBody; ... FooPtrBody.A FooPtrBody.B This comes up occasionally, for example, the 'disp' global in 183.equake (where the xform speeds the CBE version of the program up from 56.16s to 52.40s (7%) on apoc), and the 'desired_accept', 'fixLRBT', 'macroArray', & 'key_queue' globals in 300.twolf (speeding it up from 22.29s to 21.55s (3.4%)). The nice thing about this xform is that it exposes the resulting global to global variable optimization and makes alias analysis easier in addition to eliminating a few loads. llvm-svn: 16916	2004-10-11 05:54:41 +00:00
Chris Lattner	e42eb31f7d	Just because we cannot completely eliminate all uses of a global, we can still optimize away all of the indirect calls and loads, etc from it. This turns code like this: if (G != 0) G(); into if (G != 0) ActualCallee(); This triggers a couple of times in gcc and libstdc++. llvm-svn: 16901	2004-10-10 23:14:11 +00:00
Reid Spencer	97327f05fc	Initial version of automake Makefile.am file. llvm-svn: 16893	2004-10-10 22:20:40 +00:00
Chris Lattner	604ed7aae8	Fix 2004-10-10-CastStoreOnce.llx, by adjusting types back if we strip off a cast llvm-svn: 16878	2004-10-10 17:07:12 +00:00
Chris Lattner	a0e769cc81	Implement GlobalOpt/deadglobal-2.llx, deletion of globals that are only stored to, but are stored at variable indexes. This occurs at least in 176.gcc, but probably others, and we should handle it for completeness. llvm-svn: 16876	2004-10-10 16:47:33 +00:00
Chris Lattner	cb9f152d8c	Avoid calling use_size() which could (in theory) be expensive if the global has a large number of users. Instead, just keep track of whether we're making changes as we do so. This patch has no functionlity changes. llvm-svn: 16874	2004-10-10 16:43:46 +00:00
Chris Lattner	09a527290d	Eliminate global pointers that are only stored a single value and null if we know that all uses of the global will trap if the pointer contained is null. In this case, we forward substitute the stored value to any uses. This has the effect of devirtualizing trivial globals in trivial cases. For example, 164.gzip contains this: gzip.h:extern int (read_buf) OF((char buf, unsigned size)); bits.c: read_buf = file_read; deflate.c: lookahead = read_buf((char)window, deflate.c: n = read_buf((char)window+strstart+lookahead, more); Since read_buf has to point to file_read at every use, we just replace the calls through read_buf with a direct call to file_read. This occurs in several benchmarks, including 176.gcc and 164.gzip. Direct calls are good and stuff. llvm-svn: 16871	2004-10-09 21:48:45 +00:00
Chris Lattner	5c91c8f18b	Use DEBUG instead of DebugFlag directly, as DebugFlag does not respect -debug-only! llvm-svn: 16868	2004-10-09 19:30:36 +00:00
Chris Lattner	f369b38d55	Fix infinite loop due to iteration llvm-svn: 16864	2004-10-09 03:32:52 +00:00
Chris Lattner	4ad08352b4	Implement sub.ll:test17, -X/C -> X/-C llvm-svn: 16863	2004-10-09 02:50:40 +00:00
Chris Lattner	1b8d2957d3	If we found a dead global, we should at least delete it... llvm-svn: 16858	2004-10-08 22:05:31 +00:00
Chris Lattner	1c4bddc50d	* Pull out the meat of runOnModule into another function for clarity. * Do not lead dangling dead constants prevent optimization * Iterate global optimization while we're making progress. These changes allow us to be more aggressive, handling cases like GlobalOpt/iterate.llx without a problem (turning it into 'ret int 0'). llvm-svn: 16857	2004-10-08 20:59:28 +00:00
Chris Lattner	73ad73e2d8	We might as well delete the known-dead global sooner rather than later since we know it is dead. llvm-svn: 16855	2004-10-08 20:25:55 +00:00
Chris Lattner	0b41e861b6	Temporarily disable a buggy transformation until it can be fixed. This fixes 254.gap. llvm-svn: 16853	2004-10-08 19:15:44 +00:00
Chris Lattner	abab0719af	Implement SRA for global variables. This allows the other global variable optimizations to trigger much more often. This allows the elimination of several dozen more global variables in Programs/External. Note that we only do this for non-constant globals: constant globals will already be optimized out if the accesses to them permit it. This implements Transforms/GlobalOpt/globalsra.llx llvm-svn: 16842	2004-10-08 17:32:09 +00:00
Chris Lattner	bff91d9a2e	Instcombine (X & FF00) + xx00 -> (X+xx00) & FF00, implementing and.ll:test27 This comes up when doing adds to bitfield elements. llvm-svn: 16836	2004-10-08 05:07:56 +00:00
Chris Lattner	44bd392cbf	Little patch to turn (shl (add X, 123), 4) -> (add (shl X, 4), 123 << 4) This triggers in cases of bitfield additions, opening opportunities for future improvements. llvm-svn: 16834	2004-10-08 03:46:20 +00:00
Chris Lattner	617f1a34f1	Improve comments, no functionality changes llvm-svn: 16814	2004-10-07 21:30:30 +00:00
Chris Lattner	02b6c918b7	Fix a bug in the safety analysis routine llvm-svn: 16804	2004-10-07 06:01:25 +00:00
Chris Lattner	f64799683e	Comment cleanups llvm-svn: 16803	2004-10-07 06:00:24 +00:00
Chris Lattner	25db58032d	* Rename pass to globalopt, since we do more than just constify * Instead of handling dead functions specially, just nuke them. * Be more aggressive about cleaning up after constification, in particular, handle getelementptr instructions and constantexprs. * Be a little bit more structured about how we process globals. *** Delete globals that are only stored to, and never read. These are clearly not useful, so they should go. This implements deadglobal.llx This last one triggers quite a few times. In particular, 2208 in the external tests, 1865 of which are in 252.eon. This shrinks eon from 1995094 to 1732341 bytes of bytecode. llvm-svn: 16802	2004-10-07 04:16:33 +00:00
Chris Lattner	1f849a08a3	Implement GlobalConstifier/trivialstore.llx, and also do some simplifications of the resultant program to avoid making later passes do it all. This allows us to constify globals that just have the same constant that they are initialized stored into them. Suprisingly this comes up ALL of the freaking time, dozens of times in SPEC, 30 times in vortex alone. For example, on 256.bzip2, it allows us to constify these two globals: %smallMode = internal global ubyte 0 ; <ubyte> [#uses=8] %verbosity = internal global int 0 ; <int> [#uses=49] Which (with later optimizations) results in the bytecode file shrinking from 82286 to 69686 bytes! Lets hear it for IPO :) For the record, it's nuking lots of "if (verbosity > 2) { do lots of stuff }" code. llvm-svn: 16793	2004-10-06 20:57:02 +00:00
Chris Lattner	0aee4b7947	Instcombine: -(X sdiv C) -> (X sdiv -C), tested by sub.ll:test16 llvm-svn: 16769	2004-10-06 15:08:25 +00:00
Chris Lattner	2ce32df8b0	Reduce code growth implied by the tail duplication pass by not duplicating an instruction if it can be hoisted to a common dominator of the block. This implements: test/Regression/Transforms/TailDup/MergeTest.ll llvm-svn: 16758	2004-10-06 03:27:37 +00:00
Brian Gaeke	33e834ebb0	Add accessor function. llvm-svn: 16622	2004-09-30 20:14:29 +00:00
Brian Gaeke	5a89bde564	Correct type of accessor functions. llvm-svn: 16621	2004-09-30 20:14:18 +00:00
Brian Gaeke	e80d4cd66b	Namespacify. Add accessor function. llvm-svn: 16620	2004-09-30 20:14:07 +00:00
Chris Lattner	9af8efddd3	Disable the 'WARNING: Found global types that are not compatible' warning that always prints when linking programs to libstdc++ :( llvm-svn: 16603	2004-09-30 00:12:29 +00:00
Chris Lattner	abae776b18	Hrm, debugging printouts do not need to be in here llvm-svn: 16598	2004-09-29 21:21:14 +00:00
Chris Lattner	6862fbd2cf	* Pull range optimization code out into new InsertRangeTest function. * SubOne/AddOne functions always return ConstantInt, declare them as such * Pull code for handling setcc X, cst, where cst is at the end of the range, or cc is LE or GE up earlier in visitSetCondInst. This reduces #iterations in some cases. * Fold: (div X, C1) op C2 -> range check, implementing div.ll:test6 - test9. llvm-svn: 16588	2004-09-29 17:40:11 +00:00
Chris Lattner	879ce7894c	Do not insert trivially dead select instructions, which allows us to potentially fold more in one pass. llvm-svn: 16583	2004-09-29 05:43:32 +00:00
Chris Lattner	6a4adcda4c	Fold binary expressions and casts into PHI nodes that have all constant inputs. This takes something like this: %A = phi int [ 3, %cond_false.0 ], [ 2, %endif.0.i ], [ 2, %endif.1.i ] %B = div int %tmp.243, 4 and turns it into: %A = phi int [ 3/4, %cond_false.0 ], [ 2/4, %endif.0.i ], [ 2/4, %endif.1.i ] which is later simplified (in this case) into %A = 0. This triggers thousands of times in spec, for example, 269 times in 176.gcc. This is tested by InstCombine/add.ll:test23 and set.ll:test18. llvm-svn: 16582	2004-09-29 05:07:12 +00:00
Chris Lattner	c949128b2f	Hrm, really, all tests passed without this, but it is scary to think how... llvm-svn: 16568	2004-09-29 03:16:24 +00:00
Chris Lattner	be7a69ebd8	Remove debugging printout Instcombine (setcc (truncate X), C1). This occurs THOUSANDS of times in many benchmarks. Particularlly common seem to be things like (seteq (cast bool X to int), int 0) This turns it into (seteq bool %X, false), which then becomes (not %X). llvm-svn: 16567	2004-09-29 03:09:18 +00:00
Chris Lattner	dcf756ec22	Fold (X setcc C1) \| (X setcc C2) This implements or.ll:test1[89] llvm-svn: 16561	2004-09-28 22:33:08 +00:00
Chris Lattner	623826c888	Fold (and (setcc X, C1), (setcc X, C2)) This is important for several reasons: 1. Benchmarks have lots of code that looks like this (perlbmk in particular): %tmp.2.i = setne int %tmp.0.i, 128 ; <bool> [#uses=1] %tmp.6343 = seteq int %tmp.0.i, 1 ; <bool> [#uses=1] %tmp.63 = and bool %tmp.2.i, %tmp.6343 ; <bool> [#uses=1] we now fold away the setne, a clear improvement. 2. In the more important cases, such as (X >= 10) & (X < 20), we now produce smaller code: (X-10) < 10. 3. Perhaps the nicest effect of this patch is that it really helps out the code generators. In particular, for a 'range test' like the above, instead of generating this on X86 (the difference on PPC is even more pronounced): cmp %EAX, 50 setge %CL cmp %EAX, 100 setl %AL and %CL, %AL cmp %CL, 0 we now generate this: add %EAX, -50 cmp %EAX, 50 Furthermore, this causes setcc's to be folded into branches more often. These combinations trigger dozens of times in the spec benchmarks, particularly in 176.gcc, 186.crafty, 253.perlbmk, 254.gap, & 099.go. llvm-svn: 16559	2004-09-28 21:48:02 +00:00
Chris Lattner	272d5ca9e0	Implement X / C1 / C2 folding Implement (setcc (shl X, C1), C2) folding. The second one occurs several dozen times in spec. The first was added just in case. :) These are tested by shift.ll:test2[12], and div.ll:test5 llvm-svn: 16549	2004-09-28 18:22:15 +00:00
Chris Lattner	6afc02f816	shl is always zero extending, so always use a zero extending shift right. This latent bug was exposed by recent changes, and is tested as: llvm/test/Regression/Transforms/InstCombine/2004-09-28-BadShiftAndSetCC.llx llvm-svn: 16546	2004-09-28 17:54:07 +00:00
Alkis Evlogimenos	20f1b0bafb	Add includes and use std:: for standard library calls to make code compile on windows. This patch was contributed by Paolo Invernizzi. llvm-svn: 16539	2004-09-28 14:42:44 +00:00
Alkis Evlogimenos	3ce42ec7ee	Pull assignment out of for loop conditional in order for this to compile under windows. Patch contributed by Paolo Invernizzi! llvm-svn: 16534	2004-09-28 02:40:37 +00:00
Chris Lattner	bfff18a869	Fix two bugs: one where a condition was mistakenly swapped, and another where we folded (X & 254) -> X < 1 instead of X < 2. These problems were latent problems exposed by the latest patch. llvm-svn: 16528	2004-09-27 19:29:18 +00:00
Chris Lattner	1023b8726e	Fold: (setcc (shr X, ShAmt), CI), where 'cc' is eq or ne. This xform triggers often, for example: 6x in povray, 1x in gzip, 279x in gcc, 1x in crafty, 8x in eon, 11x in perlbmk, 362x in gap, 4x in vortex, 14 in m88ksim, 211x in 126.gcc, 1x in compress, 11x in ijpeg, and 4x in 147.vortex. llvm-svn: 16521	2004-09-27 16:18:50 +00:00
Chris Lattner	7e794273f5	Implement shift-and combinations, implementing InstCombine/and.ll:test19-21 These combinations trigger 4 times in povray, 7x in gcc, 4x in gap, and 2x in bzip2. llvm-svn: 16508	2004-09-24 15:21:34 +00:00
Chris Lattner	e1b4d2a470	Move LHSI->hasOneUse() into the arms of the conditional, reindenting code. No functionality changes here. llvm-svn: 16505	2004-09-23 21:52:49 +00:00
Chris Lattner	8fc5af4da9	Implement Transforms/InstCombine/and.ll:test18, a case that occurs 20 times in perlbmk llvm-svn: 16504	2004-09-23 21:46:38 +00:00
Chris Lattner	bdcf41a8a2	Implement select.ll:test16: fold load (select C, X, null) -> load X llvm-svn: 16499	2004-09-23 15:46:00 +00:00
Chris Lattner	b121ae1cec	Do not fold (X + C1 != C2) if there are other users of the add. Doing this transformation used to take a loop like this: int Array[1000]; void test(int X) { int i; for (i = 0; i < 1000; ++i) Array[i] += X; } Compiled to LLVM is: no_exit: ; preds = %entry, %no_exit %indvar = phi uint [ 0, %entry ], [ %indvar.next, %no_exit ] ; <uint> [#uses=2] %tmp.4 = getelementptr [1000 x int]* %Array, int 0, uint %indvar ; <int> [#uses=2] %tmp.7 = load int %tmp.4 ; <int> [#uses=1] %tmp.9 = add int %tmp.7, %X ; <int> [#uses=1] store int %tmp.9, int* %tmp.4 * %indvar.next = add uint %indvar, 1 ; <uint> [#uses=2] * %exitcond = seteq uint %indvar.next, 1000 ; <bool> [#uses=1] br bool %exitcond, label %return, label %no_exit and turn it into a loop like this: no_exit: ; preds = %entry, %no_exit %indvar = phi uint [ 0, %entry ], [ %indvar.next, %no_exit ] ; <uint> [#uses=3] %tmp.4 = getelementptr [1000 x int]* %Array, int 0, uint %indvar ; <int> [#uses=2] %tmp.7 = load int %tmp.4 ; <int> [#uses=1] %tmp.9 = add int %tmp.7, %X ; <int> [#uses=1] store int %tmp.9, int* %tmp.4 * %indvar.next = add uint %indvar, 1 ; <uint> [#uses=1] * %exitcond = seteq uint %indvar, 999 ; <bool> [#uses=1] br bool %exitcond, label %return, label %no_exit Note that indvar.next and indvar can no longer be coallesced. In machine code terms, this patch changes this code: .LBBtest_1: # no_exit mov %EDX, OFFSET Array mov %ESI, %EAX add %ESI, DWORD PTR [%EDX + 4%ECX] mov %EDX, OFFSET Array mov DWORD PTR [%EDX + 4%ECX], %ESI mov %EDX, %ECX inc %EDX cmp %ECX, 999 mov %ECX, %EDX jne .LBBtest_1 # no_exit into this: .LBBtest_1: # no_exit mov %EDX, OFFSET Array mov %ESI, %EAX add %ESI, DWORD PTR [%EDX + 4%ECX] mov %EDX, OFFSET Array mov DWORD PTR [%EDX + 4%ECX], %ESI inc %ECX cmp %ECX, 1000 jne .LBBtest_1 # no_exit We need better instruction selection to get this: .LBBtest_1: # no_exit add DWORD PTR [Array + 4*%ECX], EAX inc %ECX cmp %ECX, 1000 jne .LBBtest_1 # no_exit ... but at least there is less register juggling llvm-svn: 16473	2004-09-21 21:35:23 +00:00
Chris Lattner	42618551d5	Fix potential miscompilations: InstCombine/2004-09-20-BadLoadCombine*.llx llvm-svn: 16447	2004-09-20 10:15:10 +00:00
Alkis Evlogimenos	d59cebf87a	Fix loop condition so that we don't decrement off the beginning of the list. llvm-svn: 16440	2004-09-20 06:42:58 +00:00
Chris Lattner	4f2cf030e8	'Pass' should now not be derived from by clients. Instead, they should derive from ModulePass. Instead of implementing Pass::run, then should implement ModulePass::runOnModule. llvm-svn: 16436	2004-09-20 04:48:05 +00:00
Chris Lattner	cd671065be	Prototype more accurately llvm-svn: 16433	2004-09-20 04:43:57 +00:00
Chris Lattner	3e86084641	Prototype these functions more accurately llvm-svn: 16432	2004-09-20 04:43:15 +00:00
Chris Lattner	e6f13093e6	Make isSafeToLoadUnconditionally a bit smarter, implementing PR362 and Regression/Transforms/InstCombine/CPP_min_max.llx llvm-svn: 16409	2004-09-19 19:18:10 +00:00
Chris Lattner	855a4ff4dd	Remove a whole bunch of horrible hacky code that was used to promote allocas whose addresses where used by trivial phi nodes and select instructions. This is now performed by the instcombine pass, which is more powerful, is much simpler, and is faster. This allows the deletion of a bunch of code, two FIXME's and two gotos. llvm-svn: 16406	2004-09-19 18:51:51 +00:00
Chris Lattner	f62ea8ef4b	Make instruction combining a bit more aggressive in the face of volatile loads, and implement two new transforms: InstCombine/load.ll:test[56]. llvm-svn: 16404	2004-09-19 18:43:46 +00:00
Chris Lattner	9864df96ba	Add comment llvm-svn: 16400	2004-09-19 01:05:16 +00:00
Chris Lattner	6455c51ab6	Fix the inliner to always delete any edges from the external call node to a function being deleted. Due to optimizations done while inlining, there can be edges from the external call node to a function node that were not apparent any longer. This fixes the compiler crash while compiling 175.vpr llvm-svn: 16399	2004-09-18 21:37:03 +00:00
Chris Lattner	37b6c4f2d2	Convert this pass to be a CallGraphSCCPass instead of a Pass, which eliminates the worklist and makes it more efficient. This does not change functionality at all. llvm-svn: 16390	2004-09-18 00:34:13 +00:00
Chris Lattner	475dc2c93d	Make sure to remove the Select instruction as well llvm-svn: 16389	2004-09-18 00:32:40 +00:00
Chris Lattner	5065b240c8	Fix typo in comment llvm-svn: 16384	2004-09-17 03:58:39 +00:00
Chris Lattner	9face5eb1f	Add a newline llvm-svn: 16369	2004-09-15 17:53:52 +00:00
Reid Spencer	6614946443	Convert code to compile with vc7.1. Patch contributed by Paolo Invernizzi. Thanks Paolo! llvm-svn: 16368	2004-09-15 17:06:42 +00:00
Chris Lattner	f11216d24f	Fix a bug in the previous checkin that broke 255.vortex llvm-svn: 16355	2004-09-15 02:34:40 +00:00
Chris Lattner	a346578d92	Make sure to update alias analysis information as we transform the function. This fixes PR420 and Regression/Transforms/LICM/2004-09-14-AliasAnalysisInvalidate.llx llvm-svn: 16348	2004-09-15 01:04:07 +00:00
Chris Lattner	9b9932bd94	If given an AliasSetTracker object to update, update it. llvm-svn: 16347	2004-09-15 01:02:54 +00:00
Chris Lattner	f41b80a05f	Remove a long-dead pass. Actually, this pass was never used at all. llvm-svn: 16337	2004-09-14 16:33:01 +00:00
Alkis Evlogimenos	a5c04ee50f	Fixes to make LLVM compile with vc7.1. Patch contributed by Paolo Invernizzi! llvm-svn: 16152	2004-09-03 18:19:51 +00:00
Reid Spencer	7c16caa336	Changes For Bug 352 Move include/Config and include/Support into include/llvm/Config, include/llvm/ADT and include/llvm/Support. From here on out, all LLVM public header files must be under include/llvm/. llvm-svn: 16137	2004-09-01 22:55:40 +00:00
Reid Spencer	f39f66e3ef	Initial checkin of a pass to lower packed operations to scalars operations. This also registers the pass with opt with a -lower-packed command line option. Patch contributed by Brad Jones. llvm-svn: 15987	2004-08-21 21:39:24 +00:00
Chris Lattner	14c198d09a	If we are linking two global variables and they have the same size, do not spew warnings, even if the types don't match. llvm-svn: 15933	2004-08-20 00:30:39 +00:00
Chris Lattner	6139134715	Implement test/Regression/Transforms/GlobalConstifier/phi-select.llx This allows more globals to be marked constant, particularly global arrays. llvm-svn: 15735	2004-08-14 20:57:17 +00:00
Chris Lattner	56273827b1	If we are extracting a block that has multiple successors that are the same block (common in a switch), make sure to remove extra edges in successor blocks. This fixes CodeExtractor/2004-08-12-BlockExtractPHI.ll and should be pulled into LLVM 1.3 (though the regression test need not be, as that would require pulling in the LoopExtract.cpp changes). llvm-svn: 15717	2004-08-13 03:27:07 +00:00
Chris Lattner	f06b043204	When we code extract some stuff, leave the codeRepl block in the place where the extracted code was, instead of putting it at the end of the function llvm-svn: 15716	2004-08-13 03:17:39 +00:00
Chris Lattner	7386e6333d	"extract" the block extractor pass from bugpoint (haha) llvm-svn: 15714	2004-08-13 03:05:17 +00:00
Chris Lattner	889d346e6e	Add value mapper support for select constant exprs. This should fix a bug Nate ran into when bugpointing siod. This fix should go into LLVM 1.3 llvm-svn: 15712	2004-08-13 02:43:19 +00:00
Chris Lattner	cde351ee30	This patch makes the inliner refuse to inline functions that have alloca instructions in the body of the function (not the entry block). This fixes test/Programs/SingleSource/Regression/C/2004-08-12-InlinerAndAllocas.c and test/Programs/External/SPEC/CINT2000/176.gcc on zion. This should obviously be pulled into 1.3. llvm-svn: 15684	2004-08-12 05:45:09 +00:00
Chris Lattner	7f1c7ede5b	Fix code extraction of unwind blocks. This fixed bugs that bugpoint can run into. This should go into 1.3 llvm-svn: 15679	2004-08-12 03:17:02 +00:00
Chris Lattner	a7ba90e672	Hrm, this pass didn't compile. This bugfix should go into 1.3! llvm-svn: 15676	2004-08-12 02:44:23 +00:00
Chris Lattner	4456da6a4c	Fix InstCombine/2004-08-10-BoolSetCC.ll, a bug that is miscompiling 176.gcc. Note that this is apparently not the only bug miscompiling gcc though. :( llvm-svn: 15639	2004-08-11 00:50:51 +00:00
Chris Lattner	8e7260652b	Fix InstCombine/2004-08-09-RemInfLoop.llx This should go into the 1.3 branch llvm-svn: 15593	2004-08-09 21:05:48 +00:00
Chris Lattner	4956a32c9e	Fix another really nasty regression that Anshu pointed out. In cases where dangling constant users were removed from a function, causing it to be dead, we never removed the call graph edge from the external node to the function. In most cases, this didn't cause a problem (by luck). This should definitely go into 1.3 llvm-svn: 15570	2004-08-08 03:29:50 +00:00
Chris Lattner	92b9906199	Two fixes: 1. Fix a REALLY nasty cyclic replacement issue that Anshu discovered, causing nondeterminstic crashes and memory corruption. 2. For performance, don't go inserting constantexpr casts of GV pointers. This should definitely go into 1.3 llvm-svn: 15568	2004-08-08 01:30:07 +00:00
Chris Lattner	6a93462144	This DEBUG is buggy. comment it out because it's not worth fixing. This should go into 1.3 llvm-svn: 15567	2004-08-08 01:27:56 +00:00
Alkis Evlogimenos	832437255d	Stop using getValues(). llvm-svn: 15487	2004-08-04 08:44:43 +00:00
Chris Lattner	7aa2d4747a	Fix a regression in InstCombine/xor.ll llvm-svn: 15410	2004-08-01 19:42:59 +00:00
Chris Lattner	7471b96a05	Expose this as a functionpass llvm-svn: 15369	2004-07-31 10:01:58 +00:00
Misha Brukman	9c003d8f65	Fix De Morgan's name. llvm-svn: 15343	2004-07-30 12:50:08 +00:00
Chris Lattner	d4252a7c64	Start using the PatternMatcher a bit. llvm-svn: 15342	2004-07-30 07:50:03 +00:00
Misha Brukman	f4a410f907	Fix #includes of i*.h => Instructions.h as per PR403. llvm-svn: 15337	2004-07-29 17:30:57 +00:00
Misha Brukman	63b38bd2ed	Fix #includes of i*.h => Instructions.h as per PR403. llvm-svn: 15334	2004-07-29 17:30:56 +00:00
Misha Brukman	2b3387a6d9	Fix #includes of i*.h => Instructions.h as per PR403. llvm-svn: 15328	2004-07-29 17:05:13 +00:00
Alkis Evlogimenos	fd7a2d4477	Merge i*.h headers into Instructions.h as part of bug403. llvm-svn: 15325	2004-07-29 12:17:34 +00:00
Robert Bocchino	7b5b86cd0f	This change fixed a bug in the function visitMul. The prior version assumed that a constant on the RHS of a multiplication was either an IntConstant or an FPConstant. It checked for an IntConstant and then, if it did not find one, did a hard cast to an FPConstant. That code would crash if the RHS were a ConstantExpr that was neither an IntConstant nor an FPConstant. This version replaces the hard cast with a dyn_cast. It performs the same way for IntConstants and FPConstants but does nothing, instead of crashing, for constant expressions. The regression test for this change is 2004-07-27-ConstantExprMul.ll. llvm-svn: 15291	2004-07-27 21:02:21 +00:00
Brian Gaeke	38b79e8fbc	Make the create...() functions for some of these passes return a FunctionPass *. llvm-svn: 15276	2004-07-27 17:43:21 +00:00
Chris Lattner	50eb771d37	Fix hoisting of void typed values, e.g. calls llvm-svn: 15263	2004-07-27 07:38:32 +00:00
Chris Lattner	f29807169a	Implement DeadStoreElim/alloca.llx by observing that allocas are dead at the end of the function (either return or unwind) llvm-svn: 15232	2004-07-26 06:14:11 +00:00
Chris Lattner	e5ad26dbb3	Throttle back indvar substitution from creating multiplies in loops. This is bad bad bad. llvm-svn: 15227	2004-07-26 02:47:12 +00:00
Chris Lattner	7b25bcdf52	* Substantially simplify how free instructions are handled (potentially fixing a bug in DSE). * Delete dead operand uses iteratively instead of recursively, using a SetVector. * Defer deletion of dead operand uses until the end of processing, which means we don't have to bother with updating the AliasSetTracker. This speeds up DSE substantially. llvm-svn: 15204	2004-07-25 11:09:56 +00:00
Chris Lattner	4c1c1ac7e4	Free instructions kill values too. This implements DeadStoreElim/free.llx llvm-svn: 15199	2004-07-25 07:58:38 +00:00
Chris Lattner	bad6478b00	obvious fix llvm-svn: 15162	2004-07-24 07:51:27 +00:00
Chris Lattner	3844c300de	This is a trivial dead store elimination pass. It very very simple and can be improved in many ways. But: stop laughing, even with -basicaa it deletes 15% of the stores in 252.eon :) llvm-svn: 15101	2004-07-22 08:00:28 +00:00
Chris Lattner	51f7c9e56d	Update GC intrinsics to take a pointer to the object as well as a pointer to the field being updated. Patch contributed by Tobias Nurmiranta llvm-svn: 15097	2004-07-22 05:51:13 +00:00
Brian Gaeke	902dcf0729	These files don't need to include <iostream> since they include "Support/Debug.h". llvm-svn: 15089	2004-07-21 20:50:33 +00:00
Chris Lattner	d8f5e2ccac	* Further cleanup. * Test for whether bits are shifted out during the optzn. If so, the fold is illegal, though it can be handled explicitly for setne/seteq This fixes the miscompilation of 254.gap last night, which was a latent bug exposed by other optimizer improvements. llvm-svn: 15085	2004-07-21 20:14:10 +00:00
Chris Lattner	1638de4499	Make cast-cast code a bit more defensive "simplify" a bit of code for comparison/and folding llvm-svn: 15082	2004-07-21 19:50:44 +00:00
Chris Lattner	4fbad968f8	Remove special casing of pointers and treat them generically as integers of the appopriate size. This gives us the ability to eliminate int -> ptr -> int llvm-svn: 15063	2004-07-21 04:27:24 +00:00
Chris Lattner	45b50d14c9	Fix a serious code pessimization problem. If an inlined function has a single return, clone the 'ret' BB code into the block AFTER the inlined call, not the other way around. llvm-svn: 15030	2004-07-20 05:45:24 +00:00
Chris Lattner	11ffd59e37	Implement Transforms/InstCombine/IntPtrCast.ll llvm-svn: 15029	2004-07-20 05:21:00 +00:00
Chris Lattner	ec67df0ed1	Ignore instructions that are in trivially dead functions. This allows us to constify 14 globals instead of 4 in a trivial C++ testcase. llvm-svn: 15027	2004-07-20 03:58:07 +00:00
Chris Lattner	44d0b9502a	Implement InstCombine/GEPIdxCanon.ll llvm-svn: 15024	2004-07-20 01:48:15 +00:00
Chris Lattner	5823ac1c21	Implement SimplifyCFG/BrUnwind.ll llvm-svn: 15022	2004-07-20 01:17:38 +00:00
Chris Lattner	4e2dbc6b4a	Rewrite cast->cast elimination code completely based on the information we actually care about. Someday when the cast instruction is gone, we can do better here, but this will do for now. This implements instcombine/cast.ll:test17/18 as well. llvm-svn: 15018	2004-07-20 00:59:32 +00:00
Chris Lattner	e2774757fe	Fix a performance regression from the CPR patch, simplify code llvm-svn: 14974	2004-07-18 21:34:16 +00:00
Chris Lattner	d47504d9db	Strip out and simplify some code. This also fixes the regression last night compiling cfrac. It did not realize that code like this: int G; int *H = &G; takes the address of G. llvm-svn: 14973	2004-07-18 19:56:20 +00:00
Chris Lattner	f3edc49ae2	Minor cleanup, no functionality change llvm-svn: 14972	2004-07-18 18:59:44 +00:00
Reid Spencer	3b4e83ec83	Remove an if statement that would never be reached. llvm-svn: 14968	2004-07-18 08:41:47 +00:00
Reid Spencer	f0a5bcaae4	Delete a redundant if branch. llvm-svn: 14967	2004-07-18 08:34:52 +00:00
Reid Spencer	c44cb6bd9f	Expand the coercion of constants to include the newly constant Globals. llvm-svn: 14966	2004-07-18 08:34:19 +00:00
Reid Spencer	539429d9b5	Delete a no-op loop. llvm-svn: 14965	2004-07-18 08:32:43 +00:00
Reid Spencer	6c2b627e23	Expand the scope to include global values because they are now constants too. llvm-svn: 14964	2004-07-18 08:32:10 +00:00
Reid Spencer	199aeb7f59	Avoid an unnecessary isa<Constant>. llvm-svn: 14963	2004-07-18 08:31:18 +00:00
Chris Lattner	9238d78dc3	Remove useless statistic, fix some slightly broken logic llvm-svn: 14958	2004-07-18 07:22:58 +00:00
Chris Lattner	2da5eee33c	Fix a rather serious bug in previous checkin llvm-svn: 14957	2004-07-18 06:56:58 +00:00
Reid Spencer	cb3fb5d4f5	bug 122: - Replace ConstantPointerRef usage with GlobalValue usage llvm-svn: 14953	2004-07-18 00:44:37 +00:00
Reid Spencer	874368790f	bug 122: - Replace ConstantPointerRef usage with GlobalValue usage - Minimize redundant isa<GlobalValue> usage - Correct isa<Constant> for GlobalValue subclass llvm-svn: 14950	2004-07-18 00:38:32 +00:00
Reid Spencer	ef784f01dd	bug 122: - Minimize redundant isa<GlobalValue> usage llvm-svn: 14948	2004-07-18 00:32:14 +00:00
Reid Spencer	c5afc9512b	bug 122: - Replace ConstantPointerRef usage with GlobalValue usage - Correct isa<Constant> for GlobalValue subclass llvm-svn: 14947	2004-07-18 00:31:05 +00:00
Reid Spencer	9e855c6832	bug 122: - Minimize redundant isa<GlobalValue> usage - Correct isa<Constant> for GlobalValue subclass llvm-svn: 14946	2004-07-18 00:29:57 +00:00
Reid Spencer	5f6815980b	bug 122: - Replace ConstantPointerRef usage with GlobalValue usage - Rename methods to get ride of ConstantPointerRef usage llvm-svn: 14945	2004-07-18 00:25:04 +00:00
Reid Spencer	83cae64faf	bug 122: - Excise dead CPR procesing. llvm-svn: 14944	2004-07-18 00:23:51 +00:00
Reid Spencer	e4de22874e	bug 122: - Replace ConstantPointerRef usage with GlobalValue usage - Correct test ordering for GlobalValue subclass llvm-svn: 14943	2004-07-18 00:19:45 +00:00
Chris Lattner	d79334df33	This patch was contributed by Daniel Berlin! Speed up SCCP substantially by processing overdefined values quickly. This patch speeds up SCCP by about 30-40% on large testcases. llvm-svn: 14861	2004-07-15 23:36:43 +00:00
Chris Lattner	f2c018c0c1	Fix PR404 try #2 This version takes about 1s longer than the previous one (down to 2.35s), but on the positive side, it actually works :) llvm-svn: 14856	2004-07-15 08:20:22 +00:00
Chris Lattner	daa12135da	Revert previous patch until I get a bug fixed llvm-svn: 14853	2004-07-15 05:36:31 +00:00
Chris Lattner	70177e402d	Fix PR404: Loop simplify is really slow on 252.eon This eliminates an NNlogN algorithm from the loop simplify pass, replacing it with a much simpler and faster alternative. In a debug build, this reduces gccas time on eon from 85s to 42s. llvm-svn: 14851	2004-07-15 04:27:04 +00:00
Chris Lattner	32c518e526	Progress on PR341 llvm-svn: 14840	2004-07-15 02:06:12 +00:00
Chris Lattner	9a63520b1a	Fixes working towards PR341 llvm-svn: 14839	2004-07-15 01:50:47 +00:00
Chris Lattner	ba7aef39fd	Now that we codegen the portable "sizeof" efficiently, we can use it for malloc lowering. This means that lowerallocations doesn't need targetdata anymore. yaay. llvm-svn: 14835	2004-07-15 01:08:08 +00:00
Chris Lattner	35e24774eb	Factor some code to handle "load (constantexpr cast foo)" just like "load (cast foo)". This allows us to compile C++ code like this: class Bclass { public: virtual int operator()() { return 666; } }; class Dclass: public Bclass { public: virtual int operator()() { return 667; } } ; int main(int argc, char argv) { Dclass x; return x(); } Into this: int %main(int %argc, sbyte %argv) { entry: call void %__main( ) ret int 667 } Instead of this: int %main(int %argc, sbyte** %argv) { entry: %x = alloca "struct.std::bad_typeid" ; <"struct.std::bad_typeid"> [#uses=3] call void %__main( ) %tmp.1.i.i = getelementptr "struct.std::bad_typeid" %x, uint 0, uint 0, uint 0 ; <int (...)*> [#uses=1] store int (...) getelementptr ([3 x int (...)] %vtable for Bclass, int 0, long 2), int (...)*** %tmp.1.i.i %tmp.3.i = getelementptr "struct.std::bad_typeid"* %x, int 0, uint 0, uint 0 ; <int (...)*> [#uses=1] store int (...) getelementptr ([3 x int (...)] %vtable for Dclass, int 0, long 2), int (...)*** %tmp.3.i %tmp.5 = load int ("struct.std::bad_typeid")* cast (int (...)** getelementptr ([3 x int (...)] %vtable for Dclass, int 0, long 2) to int ("struct.std::bad_typeid")) ; <int ("struct.std::bad_typeid")> [#uses=1] %tmp.6 = call int %tmp.5( "struct.std::bad_typeid" %x ) ; <int> [#uses=1] ret int %tmp.6 ret int 0 } In order words, we now resolve the virtual function call. llvm-svn: 14783	2004-07-13 01:49:43 +00:00
Chris Lattner	9eb9ccd9f6	Check to make sure types are sized before calling getTypeSize on them. llvm-svn: 14649	2004-07-06 19:28:42 +00:00
Brian Gaeke	a501be556f	It doesn't matter what the 2nd operand is; if the GEP has 2 operands and the first is a zero, we should leave it alone. llvm-svn: 14648	2004-07-06 19:24:47 +00:00
Brian Gaeke	0e0fe8a2e9	Add helper function. Don't touch GEPs for which DecomposeArrayRef is not going to do anything special (e.g., < 2 indices, or 2 indices and the last one is a constant.) llvm-svn: 14647	2004-07-06 18:15:39 +00:00
Chris Lattner	23b47b6af9	Implement rem.ll:test3 llvm-svn: 14640	2004-07-06 07:38:18 +00:00
Chris Lattner	98c6bdf251	Fix a minor bug where we would go into infinite loops on some constants llvm-svn: 14638	2004-07-06 07:11:42 +00:00
Chris Lattner	7fd5f0745a	Implement InstCombine/sub.ll:test15: X % -Y === X % Y Also, remove X % -1 = 0, because it's not true for unsigneds, and the signed case is superceeded by this new handling. llvm-svn: 14637	2004-07-06 07:01:22 +00:00
Reid Spencer	eb04d9bcb4	Add #include <iostream> since Value.h does not #include it any more. llvm-svn: 14622	2004-07-04 12:19:56 +00:00
Chris Lattner	4c9c20af28	Implement add.ll:test22, a common case in MSIL files llvm-svn: 14587	2004-07-03 00:26:11 +00:00
Chris Lattner	49df6cefa5	Do not call getTypeSize on a type that has no size llvm-svn: 14584	2004-07-02 22:55:47 +00:00
Brian Gaeke	e1a136fb4b	Get rid of a dead variable, and fix a typo in a comment. llvm-svn: 14560	2004-07-02 05:30:01 +00:00
Brian Gaeke	163c87fc32	Make this pass use a more specific debug message than "Processing:". llvm-svn: 14541	2004-07-01 19:27:10 +00:00
Vikram S. Adve	1097ed8467	Restoring this file. llvm-svn: 14478	2004-06-29 14:20:27 +00:00
Chris Lattner	3b11d3b294	Remove unused file llvm-svn: 14460	2004-06-28 00:46:58 +00:00
Chris Lattner	924882f775	These passes are long dead/obsolete. They never worked in the first place and are a maintenence burden. Nuke nuke nuke llvm-svn: 14457	2004-06-28 00:44:18 +00:00
Chris Lattner	6e07936ed2	Implement InstCombine/add.ll:test21 llvm-svn: 14443	2004-06-27 22:51:36 +00:00
Chris Lattner	7f4222237d	New constant expression lowering pass to simplify your instruction selection needs. Contributed by Vladimir Prus! llvm-svn: 14399	2004-06-25 07:48:09 +00:00
Vikram S. Adve	463556f889	This file is unused, and duplicates functionality in TraceValues.cpp. llvm-svn: 14369	2004-06-24 20:16:22 +00:00
Chris Lattner	7a002d6010	Two fixes. First, stop using the ugly shouldSubstituteIndVar method. Second, disable substitution of quadratic addrec expressions to avoid putting multiplies in loops! llvm-svn: 14358	2004-06-24 06:49:18 +00:00
Misha Brukman	49bb82a4b8	Moved to lib/VMCore llvm-svn: 14348	2004-06-23 17:21:17 +00:00
Brian Gaeke	1ea8447089	Use new IsNAN() wrapper. llvm-svn: 14340	2004-06-23 00:25:35 +00:00
Misha Brukman	ddc90adca3	File depends on DSA, moved to lib/Analysis/DataStructure llvm-svn: 14325	2004-06-22 18:11:38 +00:00
Chris Lattner	f12c4a3d37	FINALLY Fix a really nasty nondeterministic bug that has been haunting us since May 1st. In this code, the pred iterator was being invalidated sometimes causing the wrong entries to be added to PHI nodes. The fix for this is to defererence and safe the *PI value before we hack on branch instructions, which changes use/def chains, which SOMETIMES invalidates the iterator. llvm-svn: 14278	2004-06-21 07:19:01 +00:00
Chris Lattner	46f60890a3	Comment out the isnan stuff until we get a proper autoconf test for it breaking the build on sparc is not acceptable. llvm-svn: 14277	2004-06-21 06:17:21 +00:00
Chris Lattner	1c676f76b6	Make order of argument addition deterministic. In particular, the layout of ConstantInt objects in memory used to determine which order arguments were added in in some cases. llvm-svn: 14276	2004-06-21 00:07:58 +00:00
Chris Lattner	c9e06336ab	Make use of BinaryOperator::create* methods to shrinkify code. llvm-svn: 14262	2004-06-20 05:04:01 +00:00
Chris Lattner	7d30a6c145	Fix the inliner to be deterministic, not letting its output depend on the relative location of Function objects in memory. llvm-svn: 14260	2004-06-20 04:11:48 +00:00
Chris Lattner	9734fd0980	Add some DEBUG output to the simplifycfg routines Fix another non-deterministic behavior, this one should actually speed up the code though as it was doing silly things. llvm-svn: 14258	2004-06-20 01:13:18 +00:00
Chris Lattner	42ad646104	Now that dominator tree children are built in determinstic order, this horrible code can go away llvm-svn: 14254	2004-06-19 20:23:35 +00:00
Chris Lattner	940b7ba5ad	This will hopefully fix a heisenbug that Vladimir Merzliakov is running into valiantly trying to compile stuff on freebsd. llvm-svn: 14251	2004-06-19 19:01:26 +00:00
Chris Lattner	4027500e1c	Fix a nasty bug, noticed by Reid llvm-svn: 14249	2004-06-19 18:15:50 +00:00
Chris Lattner	ec2d34cc19	Fix one source of nondeterminism in the -licm pass: the hoist pass was processing blocks in whatever order they happened to end up in the dominator tree data structure. Force an ordering. llvm-svn: 14248	2004-06-19 08:56:43 +00:00
Chris Lattner	4db0f8260a	Change to use the StableBasicBlockNumbering class llvm-svn: 14247	2004-06-19 08:42:40 +00:00
Chris Lattner	a52ab6f57f	Do not let the numbering of PHI nodes placed in the function depend on non-deterministic things like the ordering of blocks in the dominance frontier of a BB. Unfortunately, I don't know of a better way to solve this problem than to explicitly sort the BB's in function-order before processing them. This is guaranteed to slow the pass down a bit, but is absolutely necessary to get usable diffs between two different tools executing the mem2reg or scalarrepl pass. Before this, bazillions of spurious diff failures occurred all over the place due to the different order of processing PHIs: - %tmp.111 = getelementptr %struct.Connector_struct* %upcon.0.0, uint 0, uint 0 + %tmp.111 = getelementptr %struct.Connector_struct* %upcon.0.1, uint 0, uint 0 Now, the diffs match. llvm-svn: 14244	2004-06-19 07:40:14 +00:00
Chris Lattner	b2b151d297	Do not sort by the address of LLVM ConstantInt* objects. This produces nondeterministic results that depend on where these objects land in memory. Instead, sort by the value of the constant, which is stable. Before this patch, the -simplifycfg pass run from two different compilers could cause different code to be generated, though it was semantically the same: @@ -12258,8 +12258,8 @@ %s_addr.1 = phi sbyte* [ %s, %entry ], [ %inc.0, %no_exit ] ; <sbyte> [#uses=5] %tmp.1 = load sbyte %s_addr.1 ; <sbyte> [#uses=1] switch sbyte %tmp.1, label %no_exit [ - sbyte 0, label %loopexit sbyte 46, label %loopexit + sbyte 0, label %loopexit ] We need to stomp all of this stuff out. llvm-svn: 14243	2004-06-19 07:02:14 +00:00
Chris Lattner	b5f8eb8315	Do not loop over uses as we delete them. This causes iterators to be invalidated out from under us. This bug goes back to revision 1.1: scary. llvm-svn: 14242	2004-06-19 02:02:22 +00:00
Chris Lattner	023a483c76	Implement Transforms/InstCombine/and.ll:test17, a common case that occurs due to unordered comparison macros in math.h llvm-svn: 14221	2004-06-18 06:07:51 +00:00
Chris Lattner	1e1abdd6ed	Do not function resolve intrinsics. This prevents warnings and possible bad things from happening due to declare bool %llvm.isunordered(double, double) declare bool %llvm.isunordered(float, float) llvm-svn: 14219	2004-06-18 05:50:48 +00:00
Brian Gaeke	27b13253d9	I love the smell of a freshly broken PowerPC build in the morning. llvm-svn: 14206	2004-06-17 22:27:04 +00:00
Chris Lattner	f03f320b79	Fix compilation problem on freebsd. Problem noted by Vladimir Merzliakov in PR371 llvm-svn: 14203	2004-06-17 21:20:52 +00:00
Chris Lattner	6b7275996c	Rename Type::PrimitiveID to TypeId and ::getPrimitiveID() to ::getTypeID() llvm-svn: 14201	2004-06-17 18:19:28 +00:00
Chris Lattner	97bfcea262	Rename Type::PrimitiveID to TypeId and ::getPrimitiveID() to ::getTypeID() Delete two functions that are now methods on the Type class llvm-svn: 14200	2004-06-17 18:16:02 +00:00
Brian Gaeke	661963c63f	Fix typo in DEBUG printout. llvm-svn: 14196	2004-06-17 07:26:52 +00:00
Brian Gaeke	20e09e5c7b	Um, did someone make a typo or something? llvm-svn: 14192	2004-06-15 23:09:50 +00:00
Chris Lattner	5a542aadc8	Remove support for the isnan intrinsic llvm-svn: 14186	2004-06-15 21:37:54 +00:00
Brian Gaeke	21370771ba	Quick hack to get this file compiling again on Mac OS X. The right thing to do is write an autoconf macro that checks whether __isnan or isnan actually works using the C++ compiler after #include <cmath>, instead of doing it the easy way with AC_CHECK_FUNCS(). llvm-svn: 14171	2004-06-14 06:33:19 +00:00
Alkis Evlogimenos	e395468ae5	Add constant folding capabilities to the isunordered intrinsic. llvm-svn: 14168	2004-06-13 01:23:56 +00:00
Chris Lattner	ec941f7abb	Constant fold the isnan intrinsic llvm-svn: 14150	2004-06-11 06:16:23 +00:00
Chris Lattner	ee59d4bf04	Fix a bug in my checkin from last night that caused miscompilations of 186.crafty, fhourstones and 132.ijpeg. Bugpoint makes really nasty miscompilations embarassingly easy to find. It narrowed it down to the instcombiner and this testcase (from fhourstones): bool %l7153_l4706_htstat_loopentry_2E_4_no_exit_2E_4(int* %i, [32 x int]* %works, int* %tmp.98.out) { newFuncRoot: %tmp.96 = load int* %i ; <int> [#uses=1] %tmp.97 = getelementptr [32 x int]* %works, long 0, int %tmp.96 ; <int> [#uses=1] %tmp.98 = load int %tmp.97 ; <int> [#uses=2] %tmp.99 = load int* %i ; <int> [#uses=1] %tmp.100 = and int %tmp.99, 7 ; <int> [#uses=1] %tmp.101 = seteq int %tmp.100, 7 ; <bool> [#uses=2] %tmp.102 = cast bool %tmp.101 to int ; <int> [#uses=0] br bool %tmp.101, label %codeRepl4.exitStub, label %codeRepl3.exitStub codeRepl4.exitStub: ; preds = %newFuncRoot store int %tmp.98, int* %tmp.98.out ret bool true codeRepl3.exitStub: ; preds = %newFuncRoot store int %tmp.98, int* %tmp.98.out ret bool false } ... which only has one combination performed on it: $ llvm-as < t.ll \| opt -instcombine -debug \| llvm-dis IC: Old = %tmp.101 = seteq int %tmp.100, 7 ; <bool> [#uses=1] New = setne int %tmp.100, 0 ; <bool>:<badref> [#uses=0] IC: MOD = br bool %tmp.101, label %codeRepl3.exitStub, label %codeRepl4.exitStub IC: MOD = %tmp.97 = getelementptr [32 x int]* %works, uint 0, int %tmp.96 ; <int*> [#uses=1] It doesn't get much better than this. :) llvm-svn: 14109	2004-06-10 02:33:20 +00:00
Chris Lattner	c8e7e298c1	More minor cleanups llvm-svn: 14108	2004-06-10 02:12:35 +00:00
Chris Lattner	df20a4d589	Eliminate many occurrances of Instruction:: llvm-svn: 14107	2004-06-10 02:07:29 +00:00
Chris Lattner	35167c3087	Implement InstCombine/select.ll:test15* llvm-svn: 14095	2004-06-09 07:59:58 +00:00
Chris Lattner	396dbfe327	Be more careful about the order we put stuff onto the worklist. This allow us to collapse this: bool %le(int %A, int %B) { %c1 = setgt int %A, %B %tmp = select bool %c1, int 1, int 0 %c2 = setlt int %A, %B %result = select bool %c2, int -1, int %tmp %c3 = setle int %result, 0 ret bool %c3 } into: bool %le(int %A, int %B) { %c3 = setle int %A, %B ; <bool> [#uses=1] ret bool %c3 } which is handy, because the Java FE makes these sequences all over the place. This is tested as: test/Regression/Transforms/InstCombine/JavaCompare.ll llvm-svn: 14086	2004-06-09 05:08:07 +00:00
Chris Lattner	2dd017402b	Implement select.ll:test14* llvm-svn: 14083	2004-06-09 04:24:29 +00:00
Brian Gaeke	a9c5779a86	Expand head-of-file comment. llvm-svn: 13982	2004-06-03 05:03:02 +00:00
Brian Gaeke	c0b9b83450	Use new form of unconditional branch constructor. llvm-svn: 13930	2004-06-01 20:06:10 +00:00
Chris Lattner	523d3e6674	Fix one of the major things that is causing the C Backend to infinite loop llvm-svn: 13872	2004-05-28 05:02:13 +00:00
John Criswell	37d2ae92a7	Fix a bug in the -deadtypeelim pass. The SymbolTable re-write changed it to eliminate the wrong type. llvm-svn: 13855	2004-05-27 21:16:46 +00:00
Chris Lattner	ed79d8af53	Fix InstCombine/load.ll & PR347. This code hadn't been updated after the "structs with more than 256 elements" related changes to the GEP instruction. Also it was not handling the ConstantAggregateZero class. Now it does! llvm-svn: 13834	2004-05-27 17:30:27 +00:00
Chris Lattner	c6e21fbd5c	Implement constant folding of fmod, which is used a lot in povray llvm-svn: 13823	2004-05-27 07:25:00 +00:00
Chris Lattner	06158d140c	Restructure call constant folding code a bit to make it simpler Add support for acos/asin/atan. 188.ammp contains three calls to acos with constant arguments. Constant folding it allows elimination of those 3 calls and three FP divisions of the results. llvm-svn: 13821	2004-05-27 06:26:28 +00:00
Alkis Evlogimenos	0eefdcd73f	Do not pass a null pointer if this instruction is not prepended or appended anywhere. llvm-svn: 13798	2004-05-26 22:50:28 +00:00
Alkis Evlogimenos	9e84b503f0	Use one destination constructor for the unconditional branch. llvm-svn: 13792	2004-05-26 21:38:14 +00:00
Reid Spencer	e7e9671cad	Convert to SymbolTable's new iteration interface. llvm-svn: 13754	2004-05-25 08:53:40 +00:00
Reid Spencer	abb6f008ca	Convert to SymbolTable's new lookup and iteration interfaces. llvm-svn: 13751	2004-05-25 08:52:20 +00:00
Reid Spencer	297d7fe7e6	Remove unused header file. llvm-svn: 13750	2004-05-25 08:51:36 +00:00
Reid Spencer	1cc31f264f	Make this pass simply invoke SymbolTable::strip(). llvm-svn: 13749	2004-05-25 08:51:25 +00:00
Chris Lattner	e1e10e1883	Implement InstCombine:shift.ll:test16, which turns (X >> C1) & C2 != C3 into (X & (C2 << C1)) != (C3 << C1), where the shift may be either left or right and the compare may be any one. This triggers 1546 times in 176.gcc alone, as it is a common pattern that occurs for bitfield accesses. llvm-svn: 13740	2004-05-25 06:32:08 +00:00
Chris Lattner	03841659a4	Implement instcombine/cast.ll:test16: Canonicalize cast X to bool into a setne instruction llvm-svn: 13736	2004-05-25 04:29:21 +00:00
Chris Lattner	6f02714a10	Fix a bug in my previous checkin llvm-svn: 13717	2004-05-24 06:24:46 +00:00
Chris Lattner	99173879ad	Spelling people's names right is kinda important llvm-svn: 13702	2004-05-23 21:27:29 +00:00
Chris Lattner	6754b827c6	Fix cases where we missed inlining some more obvious candidates because the caller was in an SCC. llvm-svn: 13693	2004-05-23 21:22:17 +00:00
Chris Lattner	8d7ff5e3dd	Simplify the interface and remove an unneeded #include llvm-svn: 13692	2004-05-23 21:21:35 +00:00
Chris Lattner	254f8f8ad5	Fairly substantial changes to update the alias analysis we are querying as we make the transformation. This allows us to use interprocedural alias analyses successfully. llvm-svn: 13691	2004-05-23 21:21:17 +00:00
Chris Lattner	289ba2ac4d	Adjust to the changes in the AliasSetTracker interface llvm-svn: 13690	2004-05-23 21:20:19 +00:00
Chris Lattner	e67dbc2ae2	Add support for replacement of formal arguments with simpler expressions. llvm-svn: 13689	2004-05-23 21:19:55 +00:00
Chris Lattner	099c8cfe90	Implement the -lowergc pass which is used by code generators (like the CBE) that do not have builtin support for garbage collection. llvm-svn: 13688	2004-05-23 21:19:22 +00:00
Brian Gaeke	72185765bc	Add CloneTraceInto(), which is based on (and has mostly the same effects as) CloneFunctionInto(). llvm-svn: 13601	2004-05-19 09:08:14 +00:00
Brian Gaeke	6182acf92a	Move RemapInstruction() to ValueMapper, so that it can be shared with CloneTrace, and because it is primarily an operation on ValueMaps. It is now a global (non-static) function which can be pulled in using ValueMapper.h. llvm-svn: 13600	2004-05-19 09:08:12 +00:00
Brian Gaeke	27e4943516	Clean up this pass somewhat: Add better comments, including a better head-of-file comment. Prune #includes. Fix a FIXME that Chris put here by using doInitialization(). Use DEBUG() to print out debug msgs. Give names to basic blocks inserted by this pass. Expand tabs. Use InsertProfilingInitCall() from ProfilingUtils to insert the initialize call. llvm-svn: 13581	2004-05-14 21:21:52 +00:00
Chris Lattner	0026512bac	This was not meant to be committed llvm-svn: 13565	2004-05-13 20:56:34 +00:00
Chris Lattner	c12c945cc4	Fix a nasty bug that caused us to unroll EXTREMELY large loops due to overflow in the size calculation. This is not something you want to see: Loop Unroll: F[main] Loop %no_exit Loop Size = 2 Trip Count = 2147483648 - UNROLLING! The problem was that 2*2147483648 == 0. Now we get: Loop Unroll: F[main] Loop %no_exit Loop Size = 2 Trip Count = 2147483648 - TOO LARGE: 4294967296>100 Thanks to some anonymous person playing with the demo page that repeatedly caused zion to go into swapping land. That's one way to ensure you'll get a quick bugfix. :) Testcase here: Transforms/LoopUnroll/2004-05-13-DontUnrollTooMuch.ll llvm-svn: 13564	2004-05-13 20:43:31 +00:00
Chris Lattner	66219abac7	Do not pass in the same argument to the extracted function more than once, and give the extracted function a more useful name than just foo_code. llvm-svn: 13493	2004-05-12 16:26:18 +00:00
Chris Lattner	13d2ddfe9c	Implement support for code extracting basic blocks that have a return instruction in them. llvm-svn: 13490	2004-05-12 16:07:41 +00:00
Chris Lattner	795c9933e2	Implement splitting of PHI nodes, allowing block extraction of BB's that have PHI node entries from multiple outside-the-region blocks. This also fixes extraction of the entry block in a function. Yaay. This has successfully block extracted all (but one) block from the score_move function in obsequi (out of 33). Hrm, I wonder which block the bug is in. :) llvm-svn: 13489	2004-05-12 15:29:13 +00:00
Chris Lattner	3b2917bfcf	* Pull some code out into the definedInRegion/definedInCaller methods * Add a stub for the severSplitPHINodes which will allow us to bbextract bb's with PHI nodes in them soon. * Remove unused arguments from findInputsOutputs * Dramatically simplify the code in findInputsOutputs. In particular, nothing really cares whether or not a PHI node is using something. * Move moveCodeToFunction to after emitCallAndSwitchStatement as that's the order they get called. * Fix a bug where we would code extract a region that included a call to vastart. Like 'alloca', calls to vastart must stay in the function that they are defined in. * Add some comments. llvm-svn: 13482	2004-05-12 06:01:40 +00:00
Chris Lattner	ffc4926263	Generate substantially better code when there are a limited number of exits from the extracted region. If the return has 0 or 1 exit blocks, the new function returns void. If it has 2 exits, it returns bool, otherwise it returns a ushort as before. This allows us to use a conditional branch instruction when there are two exit blocks, as often happens during block extraction. llvm-svn: 13481	2004-05-12 04:14:24 +00:00
Chris Lattner	3d1ca67fdd	Two minor improvements: 1. Get rid of the silly abort block. When doing bb extraction, we get one abort block for every block extracted, which is kinda annoying. 2. If the switch ends up having a single destination, turn it into an unconditional branch. I would like to add support for conditional branches, but to do this we will want to have the function return a bool instead of a ushort. llvm-svn: 13478	2004-05-12 03:22:33 +00:00
Chris Lattner	8ec5f88c79	Fix stupid bug in my checkin yesterday llvm-svn: 13429	2004-05-08 22:41:42 +00:00
Chris Lattner	5f667a6f58	Implement folding of GEP's like: %tmp.0 = getelementptr [50 x sbyte]* %ar, uint 0, int 5 ; <sbyte> [#uses=2] %tmp.7 = getelementptr sbyte %tmp.0, int 8 ; <sbyte*> [#uses=1] together. This patch actually allows us to simplify and generalize the code. llvm-svn: 13415	2004-05-07 22:09:22 +00:00
Chris Lattner	d9e5813821	Fix PR336: The instcombine pass asserts when visiting load instruction llvm-svn: 13400	2004-05-07 15:35:56 +00:00
Chris Lattner	9490849028	Do not mark instructions in unreachable sections of the function as live. This fixes PR332 and ADCE/2004-05-04-UnreachableBlock.llx llvm-svn: 13349	2004-05-04 17:00:46 +00:00
Chris Lattner	dd1a86d858	Minor efficiency tweak, suggested by Patrick Meredith llvm-svn: 13341	2004-05-04 15:19:33 +00:00
Brian Gaeke	5237476f75	Fix typo llvm-svn: 13340	2004-05-03 23:52:07 +00:00
Brian Gaeke	e96196081e	In InsertProfilingInitCall(), make it legal to pass in a null array, in which case you'll get a null array and zero passed to the profiling function. llvm-svn: 13336	2004-05-03 22:06:33 +00:00
Brian Gaeke	088dd3e121	Add initial implementation of basic-block tracing instrumentation pass. llvm-svn: 13335	2004-05-03 22:06:32 +00:00
Chris Lattner	be6f06818c	Do not clone arbitrary condition instructions. llvm-svn: 13316	2004-05-02 05:19:36 +00:00
Chris Lattner	51a6dbcb65	Do not infinitely "unroll" single BB loops. llvm-svn: 13315	2004-05-02 05:02:03 +00:00
Chris Lattner	1e94ed606e	Dont' merge terminators that are needed to select PHI node values. llvm-svn: 13312	2004-05-02 01:00:44 +00:00
Chris Lattner	2e93c4275e	Implement SimplifyCFG/branch-cond-merge.ll Turning "if (A < B && B < C)" into "if (A < B & B < C)" llvm-svn: 13311	2004-05-01 23:35:43 +00:00
Chris Lattner	63d75af920	Make sure to reprocess instructions used by deleted instructions to avoid missing opportunities for combination. llvm-svn: 13309	2004-05-01 23:27:23 +00:00
Chris Lattner	b643a9e675	Make sure the instruction combiner doesn't lose track of instructions when replacing them, missing the opportunity to do simplifications llvm-svn: 13308	2004-05-01 23:19:52 +00:00
Chris Lattner	4cbd160b45	Fix my missing parens llvm-svn: 13307	2004-05-01 22:41:51 +00:00
Chris Lattner	88da6f7b52	Implement SimplifyCFG/branch-cond-prop.ll llvm-svn: 13306	2004-05-01 22:36:37 +00:00
Chris Lattner	652064e3b8	Fix a major pessimization in the instcombiner. If an allocation instruction is only used by a cast, and the casted type is the same size as the original allocation, it would eliminate the cast by folding it into the allocation. Unfortunately, it was placing the new allocation instruction right before the cast, which could pull (for example) alloca instructions into the body of a function. This turns statically allocatable allocas into expensive dynamically allocated allocas, which is bad bad bad. This fixes the problem by placing the new allocation instruction at the same place the old one was, duh. :) llvm-svn: 13289	2004-04-30 04:37:52 +00:00
Chris Lattner	2d3a7a6ff0	Changes to fix up the inst_iterator to pass to boost iterator checks. This patch was graciously contributed by Vladimir Prus. llvm-svn: 13185	2004-04-27 15:13:33 +00:00
Chris Lattner	e20c334e65	Instcombine X/-1 --> 0-X llvm-svn: 13172	2004-04-26 14:01:59 +00:00
Misha Brukman	3596f0a180	* Allow aggregating extracted function arguments (controlled by flag) * Commandline option (for now) controls that flag that is passed in llvm-svn: 13141	2004-04-23 23:54:17 +00:00
Chris Lattner	83cd87efcd	Move the scev expansion code into this pass, where it belongs. There is still room for cleanup, but at least the code modification is out of the analysis now. llvm-svn: 13135	2004-04-23 21:29:48 +00:00
Misha Brukman	98aa516a9c	Clarify the logic: the flag is renamed to `deleteFn' to signify it will delete the function instead of isolating it. This also means the condition is reversed. llvm-svn: 13112	2004-04-22 23:00:51 +00:00
Misha Brukman	e0682426f0	Add a flag to choose between isolating a function or deleting the function from the Module. The default behavior keeps functionality as before: the chosen function is the one that remains. llvm-svn: 13111	2004-04-22 22:52:22 +00:00
Chris Lattner	c27302c79f	Disable a previous patch that was causing indvars to loop infinitely :( llvm-svn: 13108	2004-04-22 15:12:36 +00:00
Chris Lattner	c1a682dda0	Fix an extremely serious thinko I made in revision 1.60 of this file. llvm-svn: 13106	2004-04-22 14:59:40 +00:00
Chris Lattner	af532f27e7	Implement a todo, rewriting all possible scev expressions inside of the loop. This eliminates the extra add from the previous case, but it's not clear that this will be a performance win overall. Tommorows test results will tell. :) llvm-svn: 13103	2004-04-21 23:36:08 +00:00
Chris Lattner	fb9a299f68	This code really wants to iterate over the OPERANDS of an instruction, not over its USES. If it's dead it doesn't have any uses! :) Thanks to the fabulous and mysterious Bill Wendling for pointing this out. :) llvm-svn: 13102	2004-04-21 22:29:37 +00:00
Chris Lattner	dc7cc35088	Implement a fixme. The helps loops that have induction variables of different types in them. Instead of creating an induction variable for all types, it creates a single induction variable and casts to the other sizes. This generates this code: no_exit: ; preds = %entry, %no_exit %indvar = phi uint [ %indvar.next, %no_exit ], [ 0, %entry ] ; <uint> [#uses=4] *** %j.0.0 = cast uint %indvar to short ; <short> [#uses=1] %indvar = cast uint %indvar to int ; <int> [#uses=1] %tmp.7 = getelementptr short* %P, uint %indvar ; <short> [#uses=1] store short %j.0.0, short %tmp.7 %inc.0 = add int %indvar, 1 ; <int> [#uses=2] %tmp.2 = setlt int %inc.0, %N ; <bool> [#uses=1] %indvar.next = add uint %indvar, 1 ; <uint> [#uses=1] br bool %tmp.2, label %no_exit, label %loopexit instead of: no_exit: ; preds = %entry, %no_exit %indvar = phi ushort [ %indvar.next, %no_exit ], [ 0, %entry ] ; <ushort> [#uses=2] *** %indvar = phi uint [ %indvar.next, %no_exit ], [ 0, %entry ] ; <uint> [#uses=3] %indvar = cast uint %indvar to int ; <int> [#uses=1] %indvar = cast ushort %indvar to short ; <short> [#uses=1] %tmp.7 = getelementptr short* %P, uint %indvar ; <short> [#uses=1] store short %indvar, short %tmp.7 %inc.0 = add int %indvar, 1 ; <int> [#uses=2] %tmp.2 = setlt int %inc.0, %N ; <bool> [#uses=1] %indvar.next = add uint %indvar, 1 *** %indvar.next = add ushort %indvar, 1 br bool %tmp.2, label %no_exit, label %loopexit This is an improvement in register pressure, but probably doesn't happen that often. The more important fix will be to get rid of the redundant add. llvm-svn: 13101	2004-04-21 22:22:01 +00:00
Chris Lattner	be8bb804c5	Fix an incredibly nasty iterator invalidation problem. I am too spoiled by ilists :) Eventually it would be nice if CallGraph maintained an ilist of CallGraphNode's instead of a vector of pointers to them, but today is not that day. llvm-svn: 13100	2004-04-21 20:44:33 +00:00
Alkis Evlogimenos	f68f40ea42	Include cerrno (gcc-3.4 fix) llvm-svn: 13091	2004-04-21 16:11:40 +00:00
Chris Lattner	a9691fe70d	Fix typeo llvm-svn: 13089	2004-04-21 14:23:18 +00:00
Chris Lattner	c87784f1fc	REALLY fix PR324: don't delete linkonce functions until after the SCC traversal is done, which avoids invalidating iterators in the SCC traversal routines llvm-svn: 13088	2004-04-20 22:06:53 +00:00
Chris Lattner	c1aa21f5a7	Fix PR325 llvm-svn: 13081	2004-04-20 20:26:03 +00:00
Chris Lattner	514934051a	Fix PR324 and testcase: Inline/2004-04-20-InlineLinkOnce.llx llvm-svn: 13080	2004-04-20 20:20:59 +00:00
Chris Lattner	f48f777d4c	Initial checkin of a simple loop unswitching pass. It still needs work, but it's a start, and seems to do it's basic job. llvm-svn: 13068	2004-04-19 18:07:02 +00:00
Chris Lattner	bc02177fdc	Add #include llvm-svn: 13057	2004-04-19 03:01:23 +00:00
Chris Lattner	fc44a25bcb	Move isLoopInvariant to the Loop class llvm-svn: 13051	2004-04-18 22:46:08 +00:00
Chris Lattner	827826320d	Correct rewriting of exit blocks after my last patch llvm-svn: 13048	2004-04-18 22:27:10 +00:00
Chris Lattner	35eaa55cfc	Loop exit sets are no longer explicitly held, they are dynamically computed on demand. llvm-svn: 13046	2004-04-18 22:15:13 +00:00
Chris Lattner	d72c3eb54e	Change the ExitBlocks list from being explicitly contained in the Loop structure to being dynamically computed on demand. This makes updating loop information MUCH easier. llvm-svn: 13045	2004-04-18 22:14:10 +00:00
Chris Lattner	d15250240c	Reduce the unrolling limit llvm-svn: 13040	2004-04-18 18:06:14 +00:00
Chris Lattner	30ae18155d	If the preheader of the loop was the entry block of the function, make sure that the exit block of the loop becomes the new entry block of the function. This was causing a verifier assertion on 252.eon. llvm-svn: 13039	2004-04-18 17:38:42 +00:00
Chris Lattner	230bcb6b35	Be much more careful about how we update instructions outside of the loop using instructions inside of the loop. This should fix the MishaTest failure from last night. llvm-svn: 13038	2004-04-18 17:32:39 +00:00
Chris Lattner	4d52e1e401	After unrolling our single basic block loop, fold it into the preheader and exit block. The primary motivation for doing this is that we can now unroll nested loops. This makes a pretty big difference in some cases. For example, in 183.equake, we are now beating the native compiler with the CBE, and we are a lot closer with LLC. I'm now going to play around a bit with the unroll factor and see what effect it really has. llvm-svn: 13034	2004-04-18 06:27:43 +00:00
Chris Lattner	f2cc841619	Fix a bug: this does not preserve the CFG! While we're at it, add support for updating loop information correctly. llvm-svn: 13033	2004-04-18 05:38:37 +00:00
Chris Lattner	946b255977	Initial checkin of a simple loop unroller. This pass is extremely basic and limited. Even in it's extremely simple state (it can only fully unroll single basic block loops that execute a constant number of times), it already helps improve performance a LOT on some benchmarks, particularly with the native code generators. llvm-svn: 13028	2004-04-18 05:20:17 +00:00
Chris Lattner	c14da9600b	Make the tail duplication threshold accessible from the command line instead of hardcoded llvm-svn: 13025	2004-04-18 00:52:43 +00:00
Chris Lattner	a814080025	If the loop executes a constant number of times, try a bit harder to replace exit values. llvm-svn: 13018	2004-04-17 18:44:09 +00:00
Chris Lattner	1e9ac1a45e	Fix a HUGE pessimization on X86. The indvars pass was taking this (familiar) function: int _strlen(const char str) { int len = 0; while (str++) len++; return len; } And transforming it to use a ulong induction variable, because the type of the pointer index was left as a constant long. This is obviously very bad. The fix is to shrink long constants in getelementptr instructions to intptr_t, making the indvars pass insert a uint induction variable, which is much more efficient. Here's the before code for this function: int %_strlen(sbyte* %str) { entry: %tmp.13 = load sbyte* %str ; <sbyte> [#uses=1] %tmp.24 = seteq sbyte %tmp.13, 0 ; <bool> [#uses=1] br bool %tmp.24, label %loopexit, label %no_exit no_exit: ; preds = %entry, %no_exit * %indvar = phi uint [ %indvar.next, %no_exit ], [ 0, %entry ] ; <uint> [#uses=2] * %indvar = phi ulong [ %indvar.next, %no_exit ], [ 0, %entry ] ; <ulong> [#uses=2] %indvar1 = cast ulong %indvar to uint ; <uint> [#uses=1] %inc.02.sum = add uint %indvar1, 1 ; <uint> [#uses=1] %inc.0.0 = getelementptr sbyte* %str, uint %inc.02.sum ; <sbyte> [#uses=1] %tmp.1 = load sbyte %inc.0.0 ; <sbyte> [#uses=1] %tmp.2 = seteq sbyte %tmp.1, 0 ; <bool> [#uses=1] %indvar.next = add ulong %indvar, 1 ; <ulong> [#uses=1] %indvar.next = add uint %indvar, 1 ; <uint> [#uses=1] br bool %tmp.2, label %loopexit.loopexit, label %no_exit loopexit.loopexit: ; preds = %no_exit %indvar = cast uint %indvar to int ; <int> [#uses=1] %inc.1 = add int %indvar, 1 ; <int> [#uses=1] ret int %inc.1 loopexit: ; preds = %entry ret int 0 } Here's the after code: int %_strlen(sbyte* %str) { entry: %inc.02 = getelementptr sbyte* %str, uint 1 ; <sbyte> [#uses=1] %tmp.13 = load sbyte %str ; <sbyte> [#uses=1] %tmp.24 = seteq sbyte %tmp.13, 0 ; <bool> [#uses=1] br bool %tmp.24, label %loopexit, label %no_exit no_exit: ; preds = %entry, %no_exit *** %indvar = phi uint [ %indvar.next, %no_exit ], [ 0, %entry ] ; <uint> [#uses=3] %indvar = cast uint %indvar to int ; <int> [#uses=1] %inc.0.0 = getelementptr sbyte* %inc.02, uint %indvar ; <sbyte> [#uses=1] %inc.1 = add int %indvar, 1 ; <int> [#uses=1] %tmp.1 = load sbyte %inc.0.0 ; <sbyte> [#uses=1] %tmp.2 = seteq sbyte %tmp.1, 0 ; <bool> [#uses=1] %indvar.next = add uint %indvar, 1 ; <uint> [#uses=1] br bool %tmp.2, label %loopexit, label %no_exit loopexit: ; preds = %entry, %no_exit %len.0.1 = phi int [ 0, %entry ], [ %inc.1, %no_exit ] ; <int> [#uses=1] ret int %len.0.1 } llvm-svn: 13016	2004-04-17 18:16:10 +00:00
Chris Lattner	885a6eb74d	Even if there are not any induction variables in the loop, if we can compute the trip count for the loop, insert one so that we can canonicalize the exit condition. llvm-svn: 13015	2004-04-17 18:08:33 +00:00
Chris Lattner	a43312d30b	Add support for evaluation of exp/log/log10/pow llvm-svn: 13011	2004-04-16 22:35:33 +00:00
Chris Lattner	284d3b0311	Fix some really nasty dominance bugs that were exposed by my patch to make the verifier more strict. This fixes building zlib llvm-svn: 13002	2004-04-16 18:08:07 +00:00
Brian Gaeke	174633b078	Include <cmath> for compatibility with gcc 3.0.x (the system compiler on Debian.) llvm-svn: 12986	2004-04-16 15:57:32 +00:00
Chris Lattner	9e9b2b7474	Fix some of the strange CBE-only failures that happened last night. llvm-svn: 12980	2004-04-16 06:03:17 +00:00
Chris Lattner	0328d75c83	Fix Inline/2004-04-15-InlineDeletesCall.ll Basically we were using SimplifyCFG as a huge sledgehammer for a simple optimization. Because simplifycfg does so many things, we can't use it for this purpose. llvm-svn: 12977	2004-04-16 05:17:59 +00:00
Chris Lattner	d7a559e353	Fix a bug in the previous checkin: if the exit block is not the same as the back-edge block, we must check the preincremented value. llvm-svn: 12968	2004-04-15 20:26:22 +00:00
Chris Lattner	0cec5cb92c	Change the canonical induction variable that we insert. Instead of producing code like this: Loop: X = phi 0, X2 ... X2 = X + 1 if (X != N-1) goto Loop We now generate code that looks like this: Loop: X = phi 0, X2 ... X2 = X + 1 if (X2 != N) goto Loop This has two big advantages: 1. The trip count of the loop is now explicit in the code, allowing the direct implementation of Loop::getTripCount() 2. This reduces register pressure in the loop, and allows X and X2 to be put into the same register. As a consequence of the second point, the code we generate for loops went from: .LBB2: # no_exit.1 ... mov %EDI, %ESI inc %EDI cmp %ESI, 2 mov %ESI, %EDI jne .LBB2 # PC rel: no_exit.1 To: .LBB2: # no_exit.1 ... inc %ESI cmp %ESI, 3 jne .LBB2 # PC rel: no_exit.1 ... which has two fewer moves, and uses one less register. llvm-svn: 12961	2004-04-15 15:21:43 +00:00
Chris Lattner	6679e46b59	ADd a trivial instcombine: load null -> null llvm-svn: 12940	2004-04-14 03:28:36 +00:00
Chris Lattner	ff9362a8da	Add SCCP support for constant folding calls, implementing: test/Regression/Transforms/SCCP/calltest.ll llvm-svn: 12921	2004-04-13 19:43:54 +00:00
Chris Lattner	ca52d0468e	Add a simple call constant propagation interface. llvm-svn: 12919	2004-04-13 19:28:52 +00:00
Chris Lattner	d0dc6d5295	Constant propagation should remove the dead instructions llvm-svn: 12917	2004-04-13 19:28:20 +00:00
Chris Lattner	89e959bb1f	Fix LoopSimplify/2004-04-13-LoopSimplifyUpdateDomFrontier.ll LoopSimplify was not updating dominator frontiers correctly in some cases. llvm-svn: 12890	2004-04-13 16:23:25 +00:00
Chris Lattner	a6e22814ab	Refactor code a bit to make it simpler and eliminate the goto llvm-svn: 12888	2004-04-13 15:21:18 +00:00
Chris Lattner	8417052938	This patch addresses PR35: Loop simplify should reconstruct nested loops. This is fairly straight-forward, but was a real nightmare to get just perfect. aarg. :) llvm-svn: 12884	2004-04-13 05:05:33 +00:00
Chris Lattner	be43544429	Actually update the call graph as the inliner changes it. This allows us to execute other CallGraphSCCPasses after the inliner without crashing. llvm-svn: 12861	2004-04-12 05:37:29 +00:00
Chris Lattner	494a685449	Add support for removing invoke instructions llvm-svn: 12858	2004-04-12 05:15:13 +00:00

... 10 11 12 13 14 ...

2481 Commits