llvm-project


Author	SHA1	Message	Date
Eli Friedman	cfd2c5ce58	Untangle the mess which is MachineBasicBlock::hasAddressTaken(). There are two different senses in which a block can be "address-taken". There can be a BlockAddress involved, which means we need to map the IR-level value to some specific block of machine code. Or there can be constructs inside a function which involve using the address of a basic block to implement certain kinds of control flow. Mixing these together causes a problem: if target-specific passes are marking random blocks "address-taken", if we have a BlockAddress, we can't actually tell which MachineBasicBlock corresponds to the BlockAddress. So split this into two separate bits: one for BlockAddress, and one for the machine-specific bits. Discovered while trying to sort out related stuff on D102817. Differential Revision: https://reviews.llvm.org/D124697	2022-08-16 16:15:44 -07:00
Arthur Eubanks	5a1f864e89	[test][llvm-reduce] Fix simplify-cfg.ll D131920 broke some Windows bots with "x6" in the buildbot paths. https://lab.llvm.org/buildbot#builders/123/builds/12276	2022-08-15 16:21:39 -07:00
John Regehr	2f1fa6242a	this pass calls simplifyCFG on individual basic blocks; we want this so that we can reduce away incidental parts of the CFG in cases where the full simplifyCFG pass makes the test case uninteresting Differential Revision: https://reviews.llvm.org/D131920	2022-08-15 15:45:20 -06:00
John Regehr	df308cab28	fix some bad logic that was removing all successor phi nodes, not just out of chunk ones. the non-default second argument to removePredecessor() is necessary to avoid creating invalid IR on examples like the one in the provided test case Differential Revision: https://reviews.llvm.org/D131843	2022-08-13 19:15:26 -06:00
Arthur Eubanks	195087d815	[llvm-reduce] Try harder to not create invalid aliases This was done by adding --abort-on-invalid-reduction to remove-function-bodies-used-in-globals.ll and fixing the fallout. Aliases must have a GlobalValue or ConstantExpr aliasee and the aliasee must be a definition if it's a GlobalValue. Don't RAUW functions with null if there's an alias pointing to it, and similarly don't delete the body of a function. Don't delete the entire body of a function when reducing blocks, preserve at least one block. Also make debugging these sorts of things easier by dumping the module when --abort-on-invalid-reduction triggers. Reviewed By: regehr Differential Revision: https://reviews.llvm.org/D131505	2022-08-12 10:39:05 -07:00
Arthur Eubanks	bd1f80f54e	[llvm-reduce] Add delta pass to run IR passes The exact IR passes run is customizable via `-ir-passes`. Reviewed By: regehr Differential Revision: https://reviews.llvm.org/D123749	2022-08-12 10:38:19 -07:00
Arthur Eubanks	4982d8ac76	[test][llvm-reduce] Use opaque pointers in tests	2022-08-04 16:47:50 -07:00
John Regehr	213c21fe10	earlier I fixed a bug where the BB removal pass sometimes created invalid IR. the fix was incomplete, this one is better and is believed to be complete Differential Revision: https://reviews.llvm.org/D131132	2022-08-04 10:21:20 -06:00
David Spickett	7ce321e5b0	[llvm-reduce] Split operands-skip.ll into serial and parallel parts This fixes a test failure when building with LLVM_ENABLE_THREADS=OFF. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D130707	2022-08-04 08:51:47 +00:00
John Regehr	5b4f6d8b4b	prevent llvm-reduce from duplicating values in switch cases when turning operands into zero or one	2022-08-03 10:06:45 -06:00
John Regehr	d469f136be	oops-- I pushed previous commit from a fresh checkout and forgot to git add the new test case, here it is Differential Revision: https://reviews.llvm.org/D131026	2022-08-02 22:27:28 -06:00
Matt Arsenault	fe1678d1b2	llvm-reduce: Fix register mask test This was sometimes failing with "input module no longer interesting after counting chunks" assert.	2022-07-20 18:19:14 -04:00
Jon Chesterfield	3a20597776	[amdgpu] Implement lds kernel id intrinsic Implement an intrinsic for use lowering LDS variables to different addresses from different kernels. This will allow kernels that cannot reach an LDS variable to avoid wasting space for it. There are a number of implicit arguments accessed by intrinsic already so this implementation closely follows the existing handling. It is slightly novel in that this SGPR is written by the kernel prologue. It is necessary in the general case to put variables at different addresses such that they can be compactly allocated and thus necessary for an indirect function call to have some means of determining where a given variable was allocated. Claiming an arbitrary SGPR into which an integer can be written by the kernel, in this implementation based on metadata associated with that kernel, which is then passed on to indirect call sites is sufficient to determine the variable address. The intent is to emit a __const array of LDS addresses and index into it. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D125060	2022-07-19 17:46:19 +01:00
Matt Arsenault	e24b390dbc	llvm-reduce: Add reduction for instruction defs Try to insert an implicit_def to replace the instruction's value, replacing the original instruction's def with a dead register. If all defs are dead, delete the instruction entirely. This is pretty similar to the instruction reduction, but leaves the new defs in the same place as the original instruction. This could possibly replace it. I'm not sure if we should directly delete the instructions here, or leave dead ones behind. This could also further work to replace physical register defs.	2022-07-18 13:41:08 -04:00
Matt Arsenault	0f9d9edd24	llvm-reduce: Add reduction for custom register masks I have a register allocator failure that only reproduces with IPRA enabled, and requires the specific regmask if I want to only run the one relevant pass. The printed custom regmask is enormous and I would like to reduce it. This reduces each individual bit in the mask, but it would probably be better to start at register units and clear all aliasing fields at a time. This would require stricter verification that all aliasing bits are set in regmasks (although I would prefer to switch regmasks to use register units in the first place).	2022-07-18 13:41:08 -04:00
Nikita Popov	2a721374ae	[IR] Don't use blockaddresses as callbr arguments Following some recent discussions, this changes the representation of callbrs in IR. The current blockaddress arguments are replaced with `!` label constraints that refer directly to callbr indirect destinations: ; Before: %res = callbr i8* asm "", "=r,r,i"(i8* %x, i8* blockaddress(@test8, %foo)) to label %asm.fallthrough [label %foo] ; After: %res = callbr i8* asm "", "=r,r,!i"(i8* %x) to label %asm.fallthrough [label %foo] The benefit of this is that we can easily update the successors of a callbr, without having to worry about also updating blockaddress references. This should allow us to remove some limitations: * Allow unrolling/peeling/rotation of callbr, or any other clone-based optimizations (https://github.com/llvm/llvm-project/issues/41834) * Allow duplicate successors (https://github.com/llvm/llvm-project/issues/45248) This is just the IR representation change though, I will follow up with patches to remove limitations in various transformation passes that are no longer needed. Differential Revision: https://reviews.llvm.org/D129288	2022-07-15 10:18:17 +02:00
Fraser Cormack	bb3f99cd85	[llvm-reduce] Fix crash when reducing integer vectors to 1 Integer vectors were previously ignored when reducing operands. When `6b8bd0f72` introduced support for reducing floating-point scalars/vectors, the vector case was written to only handle floating-point values. It would crash when creating an invalid ConstantFP from the integer element type. Instead of reinstating the old integer vector behaviour, we might as well reduce integer vectors to all-one splats. A couple of existing tests have also been renamed from "remove" to "reduce" to better reflect the deltas they test. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D129629	2022-07-13 16:56:55 +01:00
Matthew Voss	6b3956e123	[llvm-reduce] Add support for LTO bitcode files Adds support for reading and writing LTO bitcode files. - Emit a summary if the original bitcode file had a summary - Use split LTO units if the original bitcode file used them. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D127168	2022-06-30 08:58:24 -07:00
Matt Arsenault	261075590b	llvm-reduce: Handle reducing FP values to nan Prefer 0/1 over NaN, but it may make more sense to invert this as FP operations with nan inputs can universally be folded into something else.	2022-06-27 19:55:38 -04:00
John Regehr	2962f9df7c	stop llvm-reduce from introducing undefs Differential Revision: https://reviews.llvm.org/D128317	2022-06-22 20:41:23 -06:00
John Regehr	8771023543	in the absence of the -max-pass-iterations command line option, make llvm-reduce run its full pass sequence up to 5 times, instead of just once Differential Revision: https://reviews.llvm.org/D128284	2022-06-21 10:47:42 -06:00
Matt Arsenault	eea11e7369	llvm-reduce: Add reduction pass to simplify instructions	2022-06-16 20:39:27 -04:00
Matt Arsenault	6b8bd0f72d	llvm-reduce: Support replacing FP values with 1.0	2022-06-16 20:13:17 -04:00
Matt Arsenault	59328ab0ce	llvm-reduce: Add -abort-on-invalid-reduction to MIR tests Ideally reductions would never produce invalid IR, and we shouldn't regress on cases that already avoid doing so.	2022-06-07 10:28:23 -04:00
Matt Arsenault	cbbc7e4a75	llvm-reduce: Don't set generic instruction operands to undef The intention is that these should never have undef operands. It turns out the restriction the verifier enforces is too lax. The verifier enforces that registers without a register class cannot be undef, but it's valid to use a register with a register class and type. The verifier needs to change to be based on the opcode.	2022-06-07 10:28:23 -04:00
Matt Arsenault	47c8ec811f	llvm-reduce: Add pass to remove register uses Try to delete implicit uses, and add undef flags to explicit ones.	2022-06-07 10:28:23 -04:00
Matt Arsenault	cc5a1b3dd9	llvm-reduce: Add cloning of target MachineFunctionInfo MIR support is totally unusable for AMDGPU without this, since the set of reserved registers is set from fields here. Add a clone method to MachineFunctionInfo. This is a subtle variant of the copy constructor that is required if there are any MIR constructs that use pointers. Specifically, at minimum fields that reference MachineBasicBlocks or the MachineFunction need to be adjusted to the values in the new function.	2022-06-07 10:14:48 -04:00
Matt Arsenault	e6723d80c7	llvm-reduce: Fix crashes on unreachable blocks for MIR instructions	2022-06-07 10:00:26 -04:00
Matt Arsenault	56303223ac	llvm-reduce: Don't assert on functions which don't track liveness Use the query that doesn't assert if TracksLiveness isn't set, which needs to always be available. We also need to start printing liveins regardless of TracksLiveness.	2022-06-07 10:00:25 -04:00
Matt Arsenault	a0dcbe45bd	llvm-reduce: Add reduction pass to remove regalloc hints I'm a bit confused by what's actually stored for the allocation hints. The MIR parser only handles the "simple" case where there's a single hint. I don't really understand the assertion in clearSimpleHint, or under what circumstances there are multiple hint registers.	2022-06-01 09:15:41 -04:00
Matt Arsenault	2011052150	llvm-reduce: Add pass to reduce MIR instruction flags	2022-06-01 08:58:34 -04:00
Markus Lavin	bb8e02325f	llvm-reduce: improve basic-blocks removal pass When the single branch target of a block has been removed try updating it to target a block that is kept (by scanning forward in the sequence) instead of replacing the branch with a return instruction. Doing so reduces the risk of breaking loop structures meaning that when the loop is 'interesting' these reductions should have more blocks eliminated. Differential Revision: https://reviews.llvm.org/D125766	2022-05-24 09:51:25 +02:00
Matt Arsenault	aabea3b2ea	llvm-reduce: Fix not removing first instruction in MachineBasicBlock This had the surprising behavior of using whatever instruction happened to be first in the block as an anchor point to stick random implicit defs on. Use a real implicit_def instead.	2022-05-01 18:26:45 -04:00
Matt Arsenault	0b896b754e	llvm-reduce: Do not try to delete frame instructions The verifier enforces these appearing as balanced pairs, so just deleting one has no real chance of producing something valid.	2022-05-01 18:21:52 -04:00
Matt Arsenault	3939e99aae	llvm-reduce: Add pass to reduce IR references from MIR This is typically the first thing I do when reducing a new testcase until the IR section can be deleted.	2022-05-01 17:40:53 -04:00
Nico Weber	ddfffbeb31	try to fix check-llvm on windows after `e39e9d33`	2022-04-28 09:16:15 -04:00
Matt Arsenault	cf68b31f14	llvm-reduce: Don't check tool name in error message check Windows is being difficult and I don't know how to check the program name here	2022-04-28 09:10:19 -04:00
Douglas Yung	6adb8c2208	Fix test fail-file-open.test on Windows to hopefully fix the Windows buildbots.	2022-04-27 17:28:57 -07:00
Matt Arsenault	717209763e	llvm-reduce: Fix incorrect cloning of MachineMemOperands There were two problems with directly copying the MMOs from the old function. The MMOs are owned by the function's Allocator, so need to be reallocated anyways (surprisingly I didn't notice breakage on this). Second, the PseudoSourceValues are also allocated per function and need to be reallocated.	2022-04-27 18:51:38 -04:00
Matt Arsenault	e39e9d339c	llvm-reduce: Fix crashing on file opening error for mir path	2022-04-27 18:15:12 -04:00
Matt Arsenault	7c2db66632	llvm-reduce: Support multiple MachineFunctions The current testcase I'm trying to reduce only reproduces with IPRA enabled and requires handling multiple functions. The only real difference vs. the IR is the extra indirect to look for the underlying MachineFunction, so treat the ReduceWorkItem as the module instead of the function. The ugliest piece of this is really the ugliness of MachineModuleInfo. It not only tracks actual module state, but has a number of transient fields used for isel and/or the asm printer. These shouldn't do any harm for the use here, though they should be separated out.	2022-04-27 18:11:59 -04:00
Matt Arsenault	49c7534587	llvm-reduce: Try to fix test on windows buildbots	2022-04-27 18:00:18 -04:00
Matt Arsenault	1747a93b28	llvm-reduce: Try to parse triple/datalayout from module This saves needing to specify -mtriple on nearly every use for MIR reduction.	2022-04-27 17:47:46 -04:00
Matt Arsenault	e617d1a1d7	llvm-reduce: Fix mangling types of generic registers	2022-04-27 14:27:36 -04:00
Matt Arsenault	6d6288f2be	llvm-reduce: Preserve subregisters and other fields for top block def	2022-04-27 14:21:43 -04:00
Matt Arsenault	49aeeafda3	llvm-reduce: Don't delete triple/datalayout Removing these is extremely unhelpful and just adds extra hassle. This is really finding out whether your test script uses -mtriple or not. You can't meaningfully delete these fields, and the resulting module defaults to the host.	2022-04-24 11:01:31 -04:00
Matt Arsenault	debfb96be6	llvm-reduce: Fix cloning unset maxCallFrameSize This was promoting an unset max call frame size to a max call frame size of 0.	2022-04-22 18:28:45 -04:00
Matt Arsenault	53d88581f1	llvm-reduce: Clone properties of blocks getSuccProbability was private for some reason, saying to go through MachineBranchProbabilityInfo. There doesn't seem to be much point to that, as that wrapper directly calls this. Like other areas, some of these fields aren't handled by the MIR printer/parser so aren't tested.	2022-04-20 09:47:45 -04:00
Matt Arsenault	193fde7509	llvm-reduce: Clone some of the easy function properties Error on some of these other fields, since tracking down test cases for all of these at once is exhausting.	2022-04-15 20:31:07 -04:00
Matt Arsenault	f163106f39	llvm-reduce: Handle cloning MachineFrameInfo and stack objects This didn't work at all before, and would assert on any frame index. Also copy the other fields, which I believe should cover everything. There are a few that are untested since MIR serialization is apparently still missing them (isStatepointSpillSlot, ObjectSSPLayout, and ObjectSExt/ObjectZExt).	2022-04-14 21:25:06 -04:00
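
Commit cfd2c5ce58 at the top of this list describes splitting one "address-taken" flag into two separate bits, one for BlockAddress and one for machine-specific uses. The sketch below only illustrates that idea. `ToyBlock`, its field names, and `correspondsToBlockAddress` are hypothetical names chosen for this example; they are not LLVM's `MachineBasicBlock` API, and the real change may store more than a boolean.

```cpp
#include <cassert>

// Toy stand-in for a machine basic block (hypothetical, not LLVM's class).
struct ToyBlock {
  // Set when an IR-level BlockAddress must map to this block.
  bool IRBlockAddressTaken = false;
  // Set when target-specific code uses this block's address for control flow.
  bool MachineBlockAddressTaken = false;

  // Combined query for callers that only care whether any address is taken.
  bool hasAddressTaken() const {
    return IRBlockAddressTaken || MachineBlockAddressTaken;
  }
};

// With separate bits, a block marked only by a target-specific pass is never
// mistaken for the block that a BlockAddress refers to.
inline bool correspondsToBlockAddress(const ToyBlock &B) {
  return B.IRBlockAddressTaken;
}
```

With a single shared flag, both kinds of block would look the same to a caller. Keeping two bits lets the combined query stay available while the BlockAddress lookup consults only its own bit.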


83 Commits