llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	c1cb75eb72	add a statistic for # times fastisel fails. llvm-svn: 97738	2010-03-04 19:46:56 +00:00
Dan Gohman	9cc886b9f1	Fix a typo Duncan noticed. llvm-svn: 97735	2010-03-04 19:11:28 +00:00
Chris Lattner	0acbb71bad	change the new isel matcher to emit ComplexPattern matches as the very last thing before node emission. This should dramatically reduce the number of times we do 'MatchAddress' on X86, speeding up compile time. This also improves comments in the tables and shrinks the table a bit, now down to 80506 bytes for x86. llvm-svn: 97703	2010-03-04 01:23:08 +00:00
Dan Gohman	e14c4087a3	Fix more code to work properly with vector operands. Based on a patch my Micah Villmow for PR6465. llvm-svn: 97692	2010-03-04 00:23:16 +00:00
Chris Lattner	878b3e46fb	inline CannotYetSelectIntrinsic into CannotYetSelect and simplify. llvm-svn: 97690	2010-03-04 00:21:16 +00:00
Dan Gohman	7d099f7e89	Fix a bug in SelectionDAG's ReplaceAllUsesWith in the case where CSE and recursive RAUW calls delete a node from the use list, invalidating the use list iterator. There's currently no known way to reproduce this in an unmodified LLVM, however there's no fundamental reason why a SelectionDAG couldn't be formed which would trigger this case. llvm-svn: 97665	2010-03-03 21:33:37 +00:00
Chris Lattner	dc1b6f79da	add some of the more obscure predicate types to the Scope accelerator. llvm-svn: 97652	2010-03-03 07:46:25 +00:00
Chris Lattner	796f1da479	speed up scope node processing: if the first element of a scope entry we're about to process is obviously going to fail, don't bother pushing a scope only to have it immediately be popped. This avoids a lot of scope stack traffic in common cases. Unfortunately, this requires duplicating some of the predicate dispatch. To avoid duplicating the actual logic I pulled each predicate out to its own static function which gets used in both places. llvm-svn: 97651	2010-03-03 07:31:15 +00:00
Chris Lattner	3e1ffd06fc	introduce a new SwitchTypeMatcher node (which is analogous to SwitchOpcodeMatcher) and have DAGISelMatcherOpt form it. This speeds up selection, particularly for X86 which has lots of variants of instructions with only type differences. llvm-svn: 97645	2010-03-03 06:28:15 +00:00
Bill Wendling	c8d3add052	Use APInt instead of zext value. llvm-svn: 97631	2010-03-03 01:58:01 +00:00
Bill Wendling	af13d82945	This test case: long test(long x) { return (x & 123124) \| 3; } Currently compiles to: _test: orl $3, %edi movq %rdi, %rax andq $123127, %rax ret This is because instruction and DAG combiners canonicalize (or (and x, C), D) -> (and (or, D), (C \| D)) However, this is only profitable if (C & D) != 0. It gets in the way of the 3-addressification because the input bits are known to be zero. llvm-svn: 97616	2010-03-03 00:35:56 +00:00
Chris Lattner	dd030701bd	Fix some issues in WalkChainUsers dealing with CopyToReg/CopyFromReg/INLINEASM. These are annoying because they have the same opcode before an after isel. Fix this by setting their NodeID to -1 to indicate that they are selected, just like what automatically happens when selecting things that end up being machine nodes. With that done, give IsLegalToFold a new flag that causes it to ignore chains. This lets the HandleMergeInputChains routine be the one place that validates chains after a match is successful, enabling the new hotness in chain processing. This smarter chain processing eliminates the need for "PreprocessRMW" in the X86 and MSP430 backends and enables MSP to start matching it's multiple mem operand instructions more aggressively. I currently #if out the dead code in the X86 backend and MSP backend, I'll remove it for real in a follow-on patch. The testcase changes are: test/CodeGen/X86/sse3.ll: we generate better code test/CodeGen/X86/store_op_load_fold2.ll: PreprocessRMW was miscompiling this before, we now generate correct code Convert it to filecheck while I'm at it. test/CodeGen/MSP430/Inst16mm.ll: Add a testcase for mem/mem folding to make anton happy. :) llvm-svn: 97596	2010-03-02 22:20:06 +00:00
Chris Lattner	27a184b851	run HandleMergeInputChains even if we only have one input chain. llvm-svn: 97581	2010-03-02 19:34:59 +00:00
Chris Lattner	925ac71f26	Fix the xfail I added a couple of patches back. The issue was that we weren't properly handling the case when interior nodes of a matched pattern become dead after updating chain and flag uses. Now we handle this explicitly in UpdateChainsAndFlags. llvm-svn: 97561	2010-03-02 07:50:03 +00:00
Chris Lattner	350bb062b2	I was confused about this, it turns out that MorphNodeTo does delete ex-operands that become dead. llvm-svn: 97559	2010-03-02 07:14:49 +00:00
Chris Lattner	9732ab6d86	factor node morphing out to its own helper method. llvm-svn: 97558	2010-03-02 06:55:04 +00:00
Chris Lattner	f98f124a73	Sink InstructionSelect() out of each target into SDISel, and rename it DoInstructionSelection. Inline "SelectRoot" into it from DAGISelHeader. Sink some other stuff out of DAGISelHeader into SDISel. Eliminate the various 'Indent' stuff from various targets, which dates to when isel was recursive. 17 files changed, 114 insertions(+), 430 deletions(-) llvm-svn: 97555	2010-03-02 06:34:30 +00:00
Chris Lattner	2f846eeaca	Use the right induction variable. llvm-svn: 97541	2010-03-02 02:37:23 +00:00
Chris Lattner	b884fe867e	Rewrite chain handling validation and input TokenFactor handling stuff now that we don't care about emulating the old broken behavior of the old isel. This eliminates the 'CheckChainCompatible' check (along with IsChainCompatible) which did an incorrect and inefficient scan up the chain nodes which happened as the pattern was being formed and does the validation at the end in HandleMergeInputChains when it forms a structural pattern. This scans "down" the graph, which means that it is quickly bounded by nodes already selected. This also handles token factors that get "trapped" in the dag. Removing the CheckChainCompatible nodes also shrinks the generated tables by about 6K for X86 (down to 83K). There are two pieces remaining before I can nuke PreprocessRMW: 1. I xfailed a test because we're now producing worse code in a case that has nothing to do with the change: it turns out that our use of MorphNodeTo will leave dead nodes in the graph which (depending on how the graph is walked) end up causing bogus uses of chains and blocking matches. This is really bad for other reasons, so I'll fix this in a follow-up patch. 2. CheckFoldableChainNode needs to be improved to handle the TF. llvm-svn: 97539	2010-03-02 02:22:10 +00:00
Dan Gohman	4cec543952	Fix several places to handle vector operands properly. Based on a patch by Micah Villmow for PR6438. llvm-svn: 97538	2010-03-02 02:14:38 +00:00
Bill Wendling	78c5b7a76d	Remove dead parameter passing. llvm-svn: 97536	2010-03-02 01:55:18 +00:00
Chris Lattner	7894ab3a99	remove dead code. llvm-svn: 97529	2010-03-02 00:40:26 +00:00
Chris Lattner	c1f2e15332	refactor some code out of OPC_EmitMergeInputChains into a new helper function. llvm-svn: 97525	2010-03-02 00:00:03 +00:00
Chris Lattner	19c92aea01	remove all but one version of SelectionDAG::MorphNodeTo (the most general) the others are dead. llvm-svn: 97511	2010-03-01 22:20:05 +00:00
Chris Lattner	c1a3190870	Accelerate isel dispatch for tables that start with a top-level OPC_SwitchOpcode to use a table lookup instead of having to go through the interpreter for this. llvm-svn: 97469	2010-03-01 18:47:11 +00:00
Dan Gohman	c3c3c6829f	Fix optimization of ISD::TRUNCATE on vector operands. Based on a patch by Micah Villmow for PR6335. llvm-svn: 97461	2010-03-01 17:59:21 +00:00
Chris Lattner	e89ca7c146	some trivial microoptimizations. llvm-svn: 97441	2010-03-01 07:43:08 +00:00
Chris Lattner	053a28a397	eliminate the CheckMultiOpcodeMatcher code and have each ComplexPattern at the root be generated multiple times, once for each opcode they are part of. This encourages factoring because the opcode checks get treated just like everything else in the matcher. llvm-svn: 97439	2010-03-01 07:17:40 +00:00
Chris Lattner	f4d1775263	add a new OPC_SwitchOpcode which is semantically equivalent to a scope where every child starts with a CheckOpcode, but executes more efficiently. Enhance DAGISelMatcherOpt to form it. This also fixes a bug in CheckOpcode: apparently the SDNodeInfo objects are not pointer comparable, we have to compare the enum name. llvm-svn: 97438	2010-03-01 06:59:22 +00:00
Chris Lattner	53cf6b8444	eliminate GetInt1/2 llvm-svn: 97426	2010-02-28 22:38:43 +00:00
Chris Lattner	5ef43cec36	hoist the new isel interpreter out of DAGISelHeader.h (which gets #included into the middle of each target's DAGISel class) into a .cpp file where it is only compiled once. llvm-svn: 97425	2010-02-28 22:37:22 +00:00
Chris Lattner	af197502d6	enhance the new isel to handle the 'node already exists' case of MorphNodeTo directly. llvm-svn: 97417	2010-02-28 21:36:14 +00:00
Chris Lattner	b1af865aa6	simplify this code, return only ever has zero or one operands. llvm-svn: 97408	2010-02-28 18:53:13 +00:00
Evan Cheng	228c31f045	Re-apply 97040 with fix. This survives a ppc self-host llvm-gcc bootstrap. llvm-svn: 97310	2010-02-27 07:36:59 +00:00
Dale Johannesen	dd33104203	Move dbg_value generation to target-independent FastISel, as X86 is currently the only FastISel target. Per review. llvm-svn: 97255	2010-02-26 20:01:55 +00:00
Dan Gohman	2a8e3777b4	Fix ExpandVectorBuildThroughStack for the case where the operands are themselves vectors. Based on a patch by Micah Villmow for PR6338. llvm-svn: 97165	2010-02-25 20:30:49 +00:00
Dan Gohman	9b80f86e5b	Revert r97064. Duncan pointed out that bitcasts are defined in terms of store and load, which means bitcasting between scalar integer and vector has endian-specific results, which undermines this whole approach. llvm-svn: 97137	2010-02-25 15:20:39 +00:00
Chris Lattner	d3aa3aa0ec	clean up various VT manipulations, patch by Micah Villmow! PR6337 llvm-svn: 97072	2010-02-24 22:44:06 +00:00
Dan Gohman	4b2b48daba	Make getTypeSizeInBits work correctly for array types; it should return the number of value bits, not the number of bits of allocation for in-memory storage. Make getTypeStoreSize and getTypeAllocSize work consistently for arrays and vectors. Fix several places in CodeGen which compute offsets into in-memory vectors to use TargetData information. This fixes PR1784. llvm-svn: 97064	2010-02-24 22:05:23 +00:00
Chris Lattner	9b7cfd39b2	convert cycle checker to smallptrset, add comments and make it more elegant. llvm-svn: 97059	2010-02-24 21:34:04 +00:00
Chris Lattner	02ec121de8	revert david's patch which does not even build. llvm-svn: 97057	2010-02-24 21:25:08 +00:00
David Greene	8328341d9c	Use a SmallPtrSet as suggested by Chris. llvm-svn: 97056	2010-02-24 20:59:49 +00:00
Daniel Dunbar	4811d004be	Speculatively revert r97011, "Re-apply 96540 and 96556 with fixes.", again in the hopes of fixing PPC bootstrap. llvm-svn: 97040	2010-02-24 17:05:47 +00:00
Dan Gohman	3860521406	When forming SSE min and max nodes for UGE and ULE comparisons, it's necessary to swap the operands to handle NaN and negative zero properly. Also, reintroduce logic for checking for NaN conditions when forming SSE min and max instructions, fixed to take into consideration NaNs and negative zeros. This allows forming min and max instructions in more cases. llvm-svn: 97025	2010-02-24 06:52:40 +00:00
Chris Lattner	df8a8a8c6f	Change the scheduler from adding nodes in allnodes order to adding them in a determinstic order (bottom up from the root) based on the structure of the graph itself. This updates tests for some random changes, interesting bits: CodeGen/Blackfin/promote-logic.ll no longer crashes. I have no idea why, but that's good right? CodeGen/X86/2009-07-16-LoadFoldingBug.ll also fails, but now compiles to have one fewer constant pool entry, making the expected load that was being folded disappear. Since it is an unreduced mass of gnast, I just removed it. This fixes PR6370 llvm-svn: 97023	2010-02-24 06:11:37 +00:00
Chris Lattner	3ea9066bb4	add node #'s to debug dumps. llvm-svn: 97019	2010-02-24 04:24:44 +00:00
Evan Cheng	328a607490	Re-apply 96540 and 96556 with fixes. llvm-svn: 97011	2010-02-24 01:42:31 +00:00
Chris Lattner	625916df32	make selectnodeto set the nodeid to -1. This makes it more akin to creating a new node then replacing uses. llvm-svn: 97000	2010-02-23 23:01:35 +00:00
Chris Lattner	8585850e94	fix a bug in findNonImmUse (used by IsLegalToFold) where nodes with no id's would cause early exit allowing IsLegalToFold to return true instead of false, producing a cyclic dag. This was striking the new isel because it isn't using SelectNodeTo yet, which theoretically is just an optimization. llvm-svn: 96972	2010-02-23 19:32:27 +00:00
Chris Lattner	1738d49b74	Print node ID's in dumps and views if set. llvm-svn: 96971	2010-02-23 19:31:18 +00:00

1 2 3 4 5 ...

4056 Commits