llvm-project

Commit Graph

Author	SHA1	Message	Date
Duncan P. N. Exon Smith	b7668d5164	ADT: Guarantee transferNodesFromList is only called on transfers Guarantee that ilist_traits<T>::transferNodesFromList is only called when nodes are actually changing lists. I also moved all the callbacks to occur first, before the operation. This is the only choice for iplist<T>::merge, so we might as well be consistent. I expect this to have no effect in practice, although it simplifies the logic in both iplist<T>::transfer and iplist<T>::insert. llvm-svn: 280122	2016-08-30 18:00:45 +00:00
Sanjoy Das	6d3c9132e3	Fix coding style; NFC Avoid variables starting with lowercase. llvm-svn: 280048	2016-08-30 01:38:59 +00:00
Duncan P. N. Exon Smith	5c001c367f	ADT: Give ilist<T>::reverse_iterator a handle to the current node Reverse iterators to doubly-linked lists can be simpler (and cheaper) than std::reverse_iterator. Make it so. In particular, change ilist<T>::reverse_iterator so that it is never invalidated unless the node it references is deleted. This matches the guarantees of ilist<T>::iterator. (Note: MachineBasicBlock::iterator is not an ilist iterator, but a MachineInstrBundleIterator<MachineInstr>. This commit does not change MachineBasicBlock::reverse_iterator, but it does update MachineBasicBlock::reverse_instr_iterator. See note at end of commit message for details on bundle iterators.) Given the list (with the Sentinel showing twice for simplicity): [Sentinel] <-> A <-> B <-> [Sentinel] the following is now true: 1. begin() represents A. 2. begin() holds the pointer for A. 3. end() represents [Sentinel]. 4. end() holds the poitner for [Sentinel]. 5. rbegin() represents B. 6. rbegin() holds the pointer for B. 7. rend() represents [Sentinel]. 8. rend() holds the pointer for [Sentinel]. The changes are #6 and #8. Here are some properties from the old scheme (which used std::reverse_iterator): - rbegin() held the pointer for [Sentinel] and rend() held the pointer for A; - operator() cost two dereferences instead of one; - converting from a valid iterator to its valid reverse_iterator involved a confusing increment; and - "RI++->erase()" left RI invalid. The unintuitive replacement was "RI->erase(), RE = end()". With vector-like data structures these properties are hard to avoid (since past-the-beginning is not a valid pointer), and don't impose a real cost (since there's still only one dereference, and all iterators are invalidated on erase). But with lists, this was a poor design. Specifically, the following code (which obviously works with normal iterators) now works with ilist::reverse_iterator as well: for (auto RI = L.rbegin(), RE = L.rend(); RI != RE;) fooThatMightRemoveArgFromList(RI++); Converting between iterator and reverse_iterator for the same node uses the getReverse() function. reverse_iterator iterator::getReverse(); iterator reverse_iterator::getReverse(); Why doesn't iterator <=> reverse_iterator conversion use constructors? In order to catch and update old code, reverse_iterator does not even have an explicit conversion from iterator. It wouldn't be safe because there would be no reasonable way to catch all the bugs from the changed semantic (see the changes at call sites that are part of this patch). Old code used this API: std::reverse_iterator::reverse_iterator(iterator); iterator std::reverse_iterator::base(); Here's how to update from old code to new (that incorporates the semantic change), assuming I is an ilist<>::iterator and RI is an ilist<>::reverse_iterator: [Old] ==> [New] reverse_iterator(I) (--I).getReverse() reverse_iterator(I) ++I.getReverse() --reverse_iterator(I) I.getReverse() reverse_iterator(++I) I.getReverse() RI.base() (--RI).getReverse() RI.base() ++RI.getReverse() --RI.base() RI.getReverse() (++RI).base() RI.getReverse() delete &RI, RE = end() delete &RI++ RI->erase(), RE = end() RI++->erase() ======================================= Note: bundle iterators are out of scope ======================================= MachineBasicBlock::iterator, also known as MachineInstrBundleIterator<MachineInstr>, is a wrapper to represent MachineInstr bundles. The idea is that each operator++ takes you to the beginning of the next bundle. Implementing a sane reverse iterator for this is harder than ilist. Here are the options: - Use std::reverse_iterator<MBB::i>. Store a handle to the beginning of the next bundle. A call to operator() runs a loop (usually operator--() will be called 1 time, for unbundled instructions). Increment/decrement just works. This is the status quo. - Store a handle to the final node in the bundle. A call to operator() still runs a loop, but it iterates one time fewer (usually operator--() will be called 0 times, for unbundled instructions). Increment/decrement just works. - Make the ilist_sentinel<MachineInstr> always store that it's the sentinel (instead of just in asserts mode). Then the bundle iterator can sniff the sentinel bit in operator++(). I initially tried implementing the end() option as part of this commit, but updating iterator/reverse_iterator conversion call sites was error-prone. I have a WIP series of patches that implements the final option. llvm-svn: 280032	2016-08-30 00:13:12 +00:00
Tim Northover	f8bab1ce0c	GlobalISel: use multi-dimensional arrays for legalize actions. Instead of putting all possible requests into a single table, we can perform the extremely dense lookup based on opcode and type-index in constant time using multi-dimensional array-like things. This roughly halves the time spent doing legalization, which was dominated by queries against the Actions table. llvm-svn: 280011	2016-08-29 21:00:00 +00:00
Krzysztof Parzyszek	354832e585	Propagate TBAA info in SelectionDAG::getIndexedLoad Patch by Pranav Bhandarkar. llvm-svn: 279998	2016-08-29 19:50:15 +00:00
Tim Northover	ac5148ef41	GlobalISel: switch to SmallVector for pending legalizations. std::queue was doing far to many heap allocations to be healthy. llvm-svn: 279992	2016-08-29 19:27:20 +00:00
Tim Northover	edb3c8ccb8	GlobalISel: legalize frem to a libcall on AArch64. llvm-svn: 279988	2016-08-29 19:07:16 +00:00
Tim Northover	fe5f89ba14	GlobalISel: rework CallLowering so that it can be used for libcalls too. There should be no functional change here, I'm just making the implementation of "frem" (to libcall) legalization easier for a followup. llvm-svn: 279987	2016-08-29 19:07:08 +00:00
Kyle Butt	092c4dd5b6	IfConversion: Fix branch predication bug. This bug shows up with diamonds that share unpredicable, unanalyzable branches. There's an included test case from Hexagon. What was happening was that we were attempting to predicate the branch instruction despite the fact that it was checked to be the same. Now for unanalyzable branches we skip over the branch instructions when predicating the block. Differential Revision: https://reviews.llvm.org/D23939 llvm-svn: 279985	2016-08-29 18:27:12 +00:00
Sanjay Patel	b57d0a2fda	[TargetLowering] remove fdiv and frem from canOpTrap() (PR29114) Assuming the default FP env, we should not treat fdiv and frem any differently in terms of trapping behavior than any other FP op. Ie, FP ops do not trap with the default FP env. This matches how we treat these ops in IR with isSafeToSpeculativelyExecute(). There's a similar bug in Constant::canTrap(). This bug manifests in PR29114: https://llvm.org/bugs/show_bug.cgi?id=29114 ...as a sequence of scalar divisions instead of a vector division on x86 for a <3 x float> type. Differential Revision: https://reviews.llvm.org/D23974 llvm-svn: 279970	2016-08-29 13:32:41 +00:00
Krzysztof Parzyszek	0a955d6dcb	Do not use MRI::getMaxLaneMaskForVReg as a mask covering whole register MRI::getMaxLaneMaskForVReg does not always cover the whole register. For example, on X86 the upper 16 bits of EAX cannot be accessed via any subregister. Consequently, there is no lane mask that only covers that part of EAX. The getMaxLaneMaskForVReg will return the union of the lane masks for all subregisters, and in case of EAX, that union will not cover the upper 16 bits. This fixes https://llvm.org/bugs/show_bug.cgi?id=29132 llvm-svn: 279969	2016-08-29 13:15:35 +00:00
Rafael Espindola	412a529551	Use the correct ctor/dtor section for dynamic-no-pic. llvm-svn: 279967	2016-08-29 12:47:22 +00:00
Rafael Espindola	46fa231c52	Move code only used by codegen out of MC. NFC. MC itself never needs to know about these sections. llvm-svn: 279965	2016-08-29 12:33:42 +00:00
Igor Breger	24281b4740	Fixed a bug in type legalizer for masked gather. The problem occurs when the Node doesn't updated in place , UpdateNodeOperation() return the node that already exist. In this case assert fail in PromoteIntegerOperand() , N have 2 results ( val + chain). Differential Revision: http://reviews.llvm.org/D23756 llvm-svn: 279961	2016-08-29 09:12:31 +00:00
Haojian Wu	407f275894	[InstructionSelect] NumBlocks isn't defined in DEBUG build. Summary: A follow-up fixing on http://llvm.org/viewvc/llvm-project?view=revision&revision=279905. Reviewers: bkramer Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D23985 llvm-svn: 279959	2016-08-29 08:48:15 +00:00
Quentin Colombet	acb857b831	[RegBankSelect] Do not abort when the target wants to fall back. llvm-svn: 279906	2016-08-27 02:38:27 +00:00
Quentin Colombet	948abf0a0f	[InstructionSelect] Do not abort when the target wants to fall back. llvm-svn: 279905	2016-08-27 02:38:24 +00:00
Quentin Colombet	5e60bcdeaf	[MachineLegalize] Do not abort when the target wants to fall back. llvm-svn: 279904	2016-08-27 02:38:21 +00:00
Quentin Colombet	374796d678	[GlobalISel] Add a fallback path to SDISel. When global-isel fails on a MachineFunction MF, MF will be cleaned up and given to SDISel. Thanks to this fallback, we can already perform correctness test even if we support only a small portion of the functions in a test. llvm-svn: 279891	2016-08-27 00:18:31 +00:00
Quentin Colombet	6049524d37	[GlobalISel] Teach the core pipeline not to run if ISel failed. llvm-svn: 279889	2016-08-27 00:18:24 +00:00
Quentin Colombet	3bb32cc79c	[IRTranslator] Do not abort when the target wants to fall back. Every pass in the GlobalISel pipeline will need to do something similar. llvm-svn: 279886	2016-08-26 23:49:05 +00:00
Quentin Colombet	e076d3094c	[MFProperties] Introduce a FailedISel property. This is used to communicate that the instruction selection pipeline failed at some point. Another way to achieve that would be to have some kind of conditional scheduling in the PassManager, such that we only schedule a pass based on the success/failure of another one. The property approach has the advantage of being lightweight and solve the problem at stake. llvm-svn: 279885	2016-08-26 23:49:01 +00:00
Quentin Colombet	0de43b225f	[TargetPassConfig] Add a target hook to know what GlobalISel should do on error. By default, this hook tells GlobalISel to abort (report a fatal error) when it encounters an error. The alternative will be to fall back on SDISel. This fall back will be removed when the bring-up of GlobalISel is over. llvm-svn: 279879	2016-08-26 22:32:59 +00:00
Quentin Colombet	1d0cb6f107	[IRTranslator][NFC] Use DEBUG_TYPE instead of repeating the name. llvm-svn: 279878	2016-08-26 22:32:57 +00:00
Quentin Colombet	e063e1f68a	[SelectionDAG] Do not run the ISel process on already selected code. Right now, this cannot happen, but with the fall back path of GlobalISel it will show up eventually. llvm-svn: 279877	2016-08-26 22:32:55 +00:00
Quentin Colombet	380cd3eb23	[MachineFunction] Introduce a reset method. This method allows to reset the state of a MachineFunction as if it was just created. This will be used during the bring-up of GlobalISel to provide a way to fallback on SelectionDAG. That way, we can start doing correctness testing even if we are not able to select all functions via the global instruction selector. llvm-svn: 279876	2016-08-26 22:32:53 +00:00
Quentin Colombet	e609a9a80a	[MFProperties] Introduce a reset method with no argument. This method allows to reset all the properties in one go. llvm-svn: 279874	2016-08-26 22:09:11 +00:00
Quentin Colombet	c437aa9c26	[MFProperties][NFC] Rename clear into reset to match BitVector naming. The name clear is used to reset all the bit in bitvectors and using it to reset just properties was confusing. llvm-svn: 279873	2016-08-26 22:09:08 +00:00
Kyle Butt	723aa1327c	TailDuplication: Record blocks that received the duplicated block. NFC. This will allow tail duplication during layout to handle the cfg changes more cleanly. llvm-svn: 279858	2016-08-26 20:12:40 +00:00
Reid Kleckner	a5b1eef846	[MC] Move .cv_loc management logic out of MCContext MCContext already has many tasks, and separating CodeView out from it is probably a good idea. The .cv_loc tracking was modelled on the DWARF tracking which lived directly in MCContext. Removes the inclusion of MCCodeView.h from MCContext.h, so now there are only 10 build actions while I hack on CodeView support instead of 265. llvm-svn: 279847	2016-08-26 17:58:37 +00:00
Tim Northover	051b8ad3d9	GlobalISel: simplify G_ICMP legalization regime. It's unclear how the old %res(32) = G_ICMP { s32, s32 } intpred(eq), %0, %1 is actually different from an s1 verison %res(1) = G_ICMP { s1, s32 } intpred(eq), %0, %1 so we'll remove it for now. llvm-svn: 279843	2016-08-26 17:46:17 +00:00
Tim Northover	cecee56abb	GlobalISel: legalize sdiv and srem operations. llvm-svn: 279842	2016-08-26 17:46:13 +00:00
Tim Northover	7a753d9bec	GlobalISel: legalize under-width divisions. llvm-svn: 279841	2016-08-26 17:46:06 +00:00
Krzysztof Parzyszek	fb18d1e381	Missed a semicolon in r279835 llvm-svn: 279836	2016-08-26 16:50:57 +00:00
Krzysztof Parzyszek	eb34b71f0a	Add some more detailed debugging information in RegisterCoalescer llvm-svn: 279835	2016-08-26 16:46:14 +00:00
Matt Arsenault	f403df38eb	Replace subregister uses when processing tied operands This was for some reason skipping operands that are subregisters instead of keeping the same subregister index. v_movreld_b32 expects src0 to be the subregister of the tied super register use/def. e.g. v_movreld_b32 v0, v9, <imp-def, tied3> v[0:3], <imp-use, tied2> v[0:3] was being replaced with v[4:7] = copy v[0:3] v_movreld_b32 v0, v9, <imp-def, tied3> v[4:7], <imp-use, tied2> v[4:7], which really writes to v[0:3] llvm-svn: 279804	2016-08-26 06:31:32 +00:00
Michael Kuperstein	260daed147	Reuse an SDLoc throughout a function. NFC. llvm-svn: 279767	2016-08-25 18:50:56 +00:00
Tim Northover	6c43b850b7	GlobalISel: add missing type to G_UADDE instructions llvm-svn: 279762	2016-08-25 17:37:44 +00:00
Tim Northover	438c77ca1a	GlobalISel: perform multi-step legalization llvm-svn: 279758	2016-08-25 17:37:32 +00:00
George Burgess IV	b42e0e7fa3	Make buildbots happy. "warning: extra ‘;’ [-Wpedantic]" llvm-svn: 279703	2016-08-25 02:15:54 +00:00
Kyle Butt	c7f1eac514	TailDuplication: Don't pass MMI separately from MF. NFC MMI must match the function passed, and MF has a handle on MMI. Use that instead of accepting it as separate argument. No Functional Change. llvm-svn: 279701	2016-08-25 01:37:07 +00:00
Kyle Butt	3ed4273d33	TailDuplication: Save MF and reduce number of parameters. NFC Save the function in the class, and then don't pass it around. This reduces the number of parameters and makes calls to member functions simpler. No Functional Change. llvm-svn: 279700	2016-08-25 01:37:03 +00:00
Matthias Braun	1eb473680a	MachineFunctionProperties/MIRParser: Rename AllVRegsAllocated->NoVRegs, compute it Rename AllVRegsAllocated to NoVRegs. This avoids the connotation of running after register and simply describes that no vregs are used in a machine function. With that we can simply compute the property and do not need to dump/parse it in .mir files. Differential Revision: http://reviews.llvm.org/D23850 llvm-svn: 279698	2016-08-25 01:27:13 +00:00
George Burgess IV	381fc0ee3c	Make some LLVM_CONSTEXPR variables const. NFC. This patch changes LLVM_CONSTEXPR variable declarations to const variable declarations, since LLVM_CONSTEXPR expands to nothing if the current compiler doesn't support constexpr. In all of the changed cases, it looks like the code intended the variable to be const instead of sometimes-constexpr sometimes-not. llvm-svn: 279696	2016-08-25 01:05:08 +00:00
Eugene Zelenko	1804a77b2a	Fix some Clang-tidy modernize-use-using and Include What You Use warnings; other minor fixes. Differential revision: https://reviews.llvm.org/D23861 llvm-svn: 279695	2016-08-25 00:45:04 +00:00
Matthias Braun	a319e2cae0	MIRParser/MIRPrinter: Compute HasInlineAsm instead of printing/parsing it llvm-svn: 279680	2016-08-24 22:34:06 +00:00
Matthias Braun	f1b20c5225	MachineRegisterInfo/MIR: Initialize tracksSubRegLiveness early, do not print/parser it tracksSubRegLiveness only depends on the Subtarget and a cl::opt, there is not need to change it or save/parse it in a .mir file. Make the field const and move the initialization LiveIntervalAnalysis to the MachineRegisterInfo constructor. Also cleanup some code and fix some instances which better use MachineRegisterInfo::subRegLivenessEnabled() instead of TargetSubtargetInfo::enableSubRegLiveness(). llvm-svn: 279676	2016-08-24 22:17:45 +00:00
Kyle Butt	a8c7371d16	CodeGen: If Convert blocks that would form a diamond when tail-merged. The following function currently relies on tail-merging for if conversion to succeed. The common tail of cond_true and cond_false is extracted, and this then forms a diamond pattern that can be successfully if converted. If this block does not get extracted, either because tail-merging is disabled or the threshold is higher, we should still recognize this pattern and if-convert it. Fixed a regression in the original commit. Need to un-reverse branches after reversing them, or other conversions go awry. define i32 @t2(i32 %a, i32 %b) nounwind { entry: %tmp1434 = icmp eq i32 %a, %b ; <i1> [#uses=1] br i1 %tmp1434, label %bb17, label %bb.outer bb.outer: ; preds = %cond_false, %entry %b_addr.021.0.ph = phi i32 [ %b, %entry ], [ %tmp10, %cond_false ] %a_addr.026.0.ph = phi i32 [ %a, %entry ], [ %a_addr.026.0, %cond_false ] br label %bb bb: ; preds = %cond_true, %bb.outer %indvar = phi i32 [ 0, %bb.outer ], [ %indvar.next, %cond_true ] %tmp. = sub i32 0, %b_addr.021.0.ph %tmp.40 = mul i32 %indvar, %tmp. %a_addr.026.0 = add i32 %tmp.40, %a_addr.026.0.ph %tmp3 = icmp sgt i32 %a_addr.026.0, %b_addr.021.0.ph br i1 %tmp3, label %cond_true, label %cond_false cond_true: ; preds = %bb %tmp7 = sub i32 %a_addr.026.0, %b_addr.021.0.ph %tmp1437 = icmp eq i32 %tmp7, %b_addr.021.0.ph %indvar.next = add i32 %indvar, 1 br i1 %tmp1437, label %bb17, label %bb cond_false: ; preds = %bb %tmp10 = sub i32 %b_addr.021.0.ph, %a_addr.026.0 %tmp14 = icmp eq i32 %a_addr.026.0, %tmp10 br i1 %tmp14, label %bb17, label %bb.outer bb17: ; preds = %cond_false, %cond_true, %entry %a_addr.026.1 = phi i32 [ %a, %entry ], [ %tmp7, %cond_true ], [ %a_addr.026.0, %cond_false ] ret i32 %a_addr.026.1 } Without tail-merging or diamond-tail if conversion: LBB1_1: @ %bb @ =>This Inner Loop Header: Depth=1 cmp r0, r1 ble LBB1_3 @ BB#2: @ %cond_true @ in Loop: Header=BB1_1 Depth=1 subs r0, r0, r1 cmp r1, r0 it ne cmpne r0, r1 bgt LBB1_4 LBB1_3: @ %cond_false @ in Loop: Header=BB1_1 Depth=1 subs r1, r1, r0 cmp r1, r0 bne LBB1_1 LBB1_4: @ %bb17 bx lr With diamond-tail if conversion, but without tail-merging: @ BB#0: @ %entry cmp r0, r1 it eq bxeq lr LBB1_1: @ %bb @ =>This Inner Loop Header: Depth=1 cmp r0, r1 ite le suble r1, r1, r0 subgt r0, r0, r1 cmp r1, r0 bne LBB1_1 @ BB#2: @ %bb17 bx lr llvm-svn: 279671	2016-08-24 21:34:27 +00:00
Kyle Butt	6262ca3448	IfConversion: Rescan diamonds. The cost of predicating a diamond is only the instructions that are not shared between the two branches. Additionally If a predicate clobbering instruction occurs in the shared portion of the branches (e.g. a cond move), it may still be possible to if convert the sub-cfg. This change handles these two facts by rescanning the non-shared portion of a diamond sub-cfg to recalculate both the predication cost and whether both blocks are pred-clobbering. Fixed 2 bugs before recommitting. Branch instructions must be compared and found identical before diamond conversion. Also, predicate-clobbering instructions in the shared prefix disqualifies a potential diamond conversion. Includes tests for both. llvm-svn: 279670	2016-08-24 21:34:24 +00:00
David Blaikie	a01f295322	DebugInfo: Add flag to CU to disable emission of inline debug info into the skeleton CU In cases where .dwo/.dwp files are guaranteed to be available, skipping the extra online (in the .o file) inline info can save a substantial amount of space - see the original r221306 for more details there. llvm-svn: 279650	2016-08-24 18:29:49 +00:00
Krzysztof Parzyszek	a7ed090bba	Create subranges for new intervals resulting from live interval splitting The register allocator can split a live interval of a register into a set of smaller intervals. After the allocation of registers is complete, the rewriter will modify the IR to replace virtual registers with the corres- ponding physical registers. At this stage, if a register corresponding to a subregister of a virtual register is used, the rewriter will check if that subregister is undefined, and if so, it will add the <undef> flag to the machine operand. The function verifying liveness of the subregis- ter would assume that it is undefined, unless any of the subranges of the live interval proves otherwise. The problem is that the live intervals created during splitting do not have any subranges, even if the original parent interval did. This could result in the <undef> flag placed on a register that is actually defined. Differential Revision: http://reviews.llvm.org/D21189 llvm-svn: 279625	2016-08-24 13:37:55 +00:00
Matthias Braun	3a133159cc	TargetSchedule: Do not consider subregister definitions as reads. We should not consider subregister definitions as reads for schedule model purposes (they are just modeled as reads of the overal vreg for liveness calculation purposes, the CPU instructions are not actually reading). Unfortunately I cannot submit a test for this as it requires a target which uses ReadAdvance annotation in the scheduling model and has subregister liveness enabled at the same time, which is only the case on an out of tree target. llvm-svn: 279604	2016-08-24 02:32:29 +00:00
Matthias Braun	733fe3676c	CodeGen: Remove MachineFunctionAnalysis => Enable (Machine)ModulePasses Re-apply this patch, hopefully I will get away without any warnings in the constructor now. This patch removes the MachineFunctionAnalysis. Instead we keep a map from IR Function to MachineFunction in the MachineModuleInfo. This allows the insertion of ModulePasses into the codegen pipeline without breaking it because the MachineFunctionAnalysis gets dropped before a module pass. Peak memory should stay unchanged without a ModulePass in the codegen pipeline: Previously the MachineFunction was freed at the end of a codegen function pipeline because the MachineFunctionAnalysis was dropped; With this patch the MachineFunction is freed after the AsmPrinter has finished. Differential Revision: http://reviews.llvm.org/D23736 llvm-svn: 279602	2016-08-24 01:52:46 +00:00
Matthias Braun	79f85b3b8f	MIRParser/MIRPrinter: Compute isSSA instead of printing/parsing it. Specifying isSSA is an extra line at best and results in invalid MI at worst. Compute the value instead. Differential Revision: http://reviews.llvm.org/D22722 llvm-svn: 279600	2016-08-24 01:32:41 +00:00
Matthias Braun	c3b2e80b9d	MachineModuleInfo: Avoid dummy constructor, use INITIALIZE_TM_PASS Change this pass constructor to just accept a const TargetMachine * and use INITIALIZE_TM_PASS, that way we can get rid of the dummy constructor. The pass will still fail when calling the default constructor leading to TM == nullptr, this is no different than before but is more in line what other codegen passes are doing and avoids the dummy constructor. llvm-svn: 279598	2016-08-24 00:42:05 +00:00
Philip Reames	d06a1b4cdc	[stackmaps] Remove an unneeded member variable [NFC] llvm-svn: 279590	2016-08-23 23:58:08 +00:00
Philip Reames	e83c4b30ca	[stackmaps] More extraction of common code [NFCI] General cleanup before starting to work on the part I want to actually change. llvm-svn: 279586	2016-08-23 23:33:29 +00:00
Richard Smith	84c4cc47f5	Don't use "return {...}" to initialize a std::tuple. This has only been valid since 2015 (n4387), though it's allowed by a library DR so new implementations accept it in their C++11 modes... This should unbreak the build with libstdc++ 4.9. llvm-svn: 279583	2016-08-23 22:21:58 +00:00
Richard Smith	418237bed8	#ifdef out validation code when asserts are disabled to remove unused variable warnings. llvm-svn: 279582	2016-08-23 22:14:15 +00:00
Richard Smith	eae6138936	Remove unused data member to unbreak -Werror builds. llvm-svn: 279581	2016-08-23 22:10:46 +00:00
Richard Smith	8c3fbdc6c4	Revert r279564. It introduces undefined behavior (binding a reference to a dereferenced null pointer) in MachineModuleInfo::MachineModuleInfo that causes -Werror builds (including several buildbots) to fail. llvm-svn: 279580	2016-08-23 22:08:27 +00:00
Philip Reames	570dd009c3	[stackmaps] Extract out magic constants [NFCI] This is a first step towards clarifying the exact MI semantics of stackmap's "live values". llvm-svn: 279574	2016-08-23 21:21:43 +00:00
Matthias Braun	90799ce8b2	MachineFunction: Introduce NoPHIs property I want to compute the SSA property of .mir files automatically in upcoming patches. The problem with this is that some inputs will be reported as static single assignment with some passes claiming not to support SSA form. In reality though those passes do not support PHI instructions => Track the presence of PHI instructions separate from the SSA property. Differential Revision: https://reviews.llvm.org/D22719 llvm-svn: 279573	2016-08-23 21:19:49 +00:00
Tim Northover	bdf67c9a00	GlobalISel: make truncate/extend casts uniform They really should have both types represented, but early variants were created before MachineInstrs could have multiple types so they're rather ambiguous. llvm-svn: 279567	2016-08-23 21:01:33 +00:00
Tim Northover	6cd4b23a0f	GlobalISel: legalize integer comparisons on AArch64. Next step is doing both legalizations at the same time! Marvel at GlobalISel's cunning. llvm-svn: 279566	2016-08-23 21:01:26 +00:00
Tim Northover	b3a0be4d38	GlobalISel: legalize conditional branches on AArch64. llvm-svn: 279565	2016-08-23 21:01:20 +00:00
Matthias Braun	4c1f1f120c	CodeGen: Remove MachineFunctionAnalysis => Enable (Machine)ModulePasses Re-apply this commit with the deletion of a MachineFunction delegated to a separate pass to avoid use after free when doing this directly in AsmPrinter. This patch removes the MachineFunctionAnalysis. Instead we keep a map from IR Function to MachineFunction in the MachineModuleInfo. This allows the insertion of ModulePasses into the codegen pipeline without breaking it because the MachineFunctionAnalysis gets dropped before a module pass. Peak memory should stay unchanged without a ModulePass in the codegen pipeline: Previously the MachineFunction was freed at the end of a codegen function pipeline because the MachineFunctionAnalysis was dropped; With this patch the MachineFunction is freed after the AsmPrinter has finished. Differential Revision: http://reviews.llvm.org/D23736 llvm-svn: 279564	2016-08-23 20:58:29 +00:00
Tim Northover	a01bece1dc	GlobalISel: extend legalizer interface to handle multiple types. Instructions like G_ICMP have multiple types that may need to be legalized (the boolean output and nearly arbitrary inputs in this case). So the legalizer must be capable of deciding what to do for each of them separately. llvm-svn: 279554	2016-08-23 19:30:42 +00:00
Tim Northover	3c73e367c0	GlobalISel: legalize 1-bit load/store and mark 8/16 bit variants legal on AArch64. llvm-svn: 279548	2016-08-23 18:20:09 +00:00
Justin Lebar	1972e222ea	[SelectionDAG] Use a union of bitfield structs for SDNode::SubclassData. Summary: This greatly simplifies our handling of SDNode::SubclassData. NFC, hopefully. :) See discussion in D23035 for discussion about the design API of these bitfields. Reviewers: chandlerc Subscribers: llvm-commits, rnk Differential Revision: https://reviews.llvm.org/D23036 llvm-svn: 279537	2016-08-23 17:18:11 +00:00
Justin Lebar	0a33a7aefa	[CodeGen] Convert a loop to a for-each loop. NFC llvm-svn: 279536	2016-08-23 17:18:07 +00:00
Pete Cooper	036b94dad3	Fix some more asserts after r279466. That commit added a new version of Intrinsic::getName which should only be called when the intrinsic has no overloaded types. There are several debugging paths, such as SDNode::dump which are printing the name of the intrinsic but don't have the overloaded types. These paths should be ok to just print the name instead of crashing. The fix here is ultimately to just add a 'None' second argument as that calls the overload capable getName, which is less efficient, but this is a debugging path anyway, and not perf critical. Thanks to Björn Pettersson for pointing out that there were more crashes. llvm-svn: 279528	2016-08-23 16:23:45 +00:00
Matthias Braun	7f66202d38	Revert "(HEAD -> master, origin/master, origin/HEAD) CodeGen: Remove MachineFunctionAnalysis => Enable (Machine)ModulePasses" Reverting while tracking down a use after free. This reverts commit r279502. llvm-svn: 279503	2016-08-23 05:17:11 +00:00
Matthias Braun	fd936841eb	CodeGen: Remove MachineFunctionAnalysis => Enable (Machine)ModulePasses This patch removes the MachineFunctionAnalysis. Instead we keep a map from IR Function to MachineFunction in the MachineModuleInfo. This allows the insertion of ModulePasses into the codegen pipeline without breaking it because the MachineFunctionAnalysis gets dropped before a module pass. Peak memory should stay unchanged without a ModulePass in the codegen pipeline: Previously the MachineFunction was freed at the end of a codegen function pipeline because the MachineFunctionAnalysis was dropped; With this patch the MachineFunction is freed after the AsmPrinter has finished. Differential Revision: http://reviews.llvm.org/D23736 llvm-svn: 279502	2016-08-23 03:20:09 +00:00
Pete Cooper	1523925daa	Fix crash from assert in r279466. The assert in r279466 checks that we call the correct version of Intrinsic::getName. The version which accepts only an ID should not be used for intrinsics with overloaded types. The global-isel code was calling the wrong version. The test CodeGen/AArch64/GlobalISel/arm64-irtranslator.ll will ensure that we call the correct version from now on. llvm-svn: 279487	2016-08-22 22:27:05 +00:00
Tim Shen	f2187ed321	[GraphTraits] Replace all NodeType usage with NodeRef This should finish the GraphTraits migration. Differential Revision: http://reviews.llvm.org/D23730 llvm-svn: 279475	2016-08-22 21:09:30 +00:00
Tim Shen	a5cc25e50f	[SSP] Do not set __guard_local to hidden for OpenBSD SSP __guard_local is defined as long on OpenBSD. If the source file contains a definition of __guard_local, it mismatches with the int8 pointer type used in LLVM. In that case, Module::getOrInsertGlobal() returns a cast operation instead of a GlobalVariable. Trying to set the visibility on the cast operation leads to random segfaults (seen when compiling the OpenBSD kernel, which also runs with stack protection). In the kernel, the hidden attribute does not matter. For userspace code, __guard_local is defined as hidden in the startup code. If a program re-defines __guard_local, the definition from the startup code will either win or the linker complains about multiple definitions (depending on whether the re-defined __guard_local is placed in the common segment or not). It also matches what gcc on OpenBSD does. Thanks Stefan Kempf <sisnkemp@gmail.com> for the patch! Differential Revision: http://reviews.llvm.org/D23674 llvm-svn: 279449	2016-08-22 18:26:27 +00:00
Krzysztof Parzyszek	673b347e5a	Reset isUndef when removing subreg from a def operand llvm-svn: 279437	2016-08-22 14:50:12 +00:00
Simon Pilgrim	02b13d4d3c	Use SDValue::getOpcode() helper instead of via SDValue::getNode() llvm-svn: 279381	2016-08-20 20:04:18 +00:00
Matthias Braun	367d853042	MachineFunction: Add llvm_unreachable for missing properties Most compilers should give you a warning anyway though. llvm-svn: 279346	2016-08-19 23:03:28 +00:00
Krzysztof Parzyszek	d95d100c28	Reset "undef" flag when coalescing subregister into whole register llvm-svn: 279344	2016-08-19 22:57:23 +00:00
Tim Northover	a11be04769	GlobalISel: support legalization of G_FCONSTANTs llvm-svn: 279341	2016-08-19 22:40:08 +00:00
Tim Northover	ea904f9424	GlobalISel: teach legalizer how to handle integer constants. llvm-svn: 279340	2016-08-19 22:40:00 +00:00
Matthias Braun	a7d6fc9618	MachineFunction: Cleanup/simplify MachineFunctionProperties::print() - Always compile print() regardless of LLVM_ENABLE_DUMP. (We usually only gard dump() functions with that). - Only show the set properties to reduce output clutter. - Remove the unused variant that even shows the unset properties. - Fix comments llvm-svn: 279338	2016-08-19 22:31:45 +00:00
Matthias Braun	a3b983aa5e	MachineFunction: Make LastProperty an alias of the last property This avoids unnecessary cases in switch statements covering all properties. llvm-svn: 279337	2016-08-19 22:31:42 +00:00
Tim Shen	b5e0f5ac95	[GraphTraits] Make nodes_iterator dereference to NodeType/NodeRef Currently nodes_iterator may dereference to a NodeType or a NodeType&. Make them all dereference to NodeType*, which is NodeRef later. Differential Revision: https://reviews.llvm.org/D23704 Differential Revision: https://reviews.llvm.org/D23705 llvm-svn: 279326	2016-08-19 21:20:13 +00:00
Krzysztof Parzyszek	e4582d4a2e	[Packetizer] Add debugging code to stop packetization after N instructions llvm-svn: 279325	2016-08-19 21:12:52 +00:00
Tim Northover	d5c23bcfc9	GlobalISel: translate floating-point comparisons llvm-svn: 279319	2016-08-19 20:48:16 +00:00
Tim Northover	b16734fbaa	GlobalISel: translate floating-point constants llvm-svn: 279311	2016-08-19 20:09:15 +00:00
Tim Northover	5a28c3642f	GlobalISel: support translating select instructions. llvm-svn: 279309	2016-08-19 20:09:07 +00:00
Tim Northover	b604622bba	GlobalISel: fix insert/extract to work on ConstantExprs too. No tests yet unfortunately (ConstantFolding reduces all supported constants to ConstantInts before we get to translation). Soon. llvm-svn: 279308	2016-08-19 20:09:03 +00:00
Tim Northover	bbbfb1cfb8	GlobalISel: translate insertvalue instructions. This adds a G_INSERT instruction, which technically makes G_SEQUENCE redundant (it's equivalent to a G_INSERT into an IMPLICIT_DEF). We'll leave G_SEQUENCE for now though: it's likely to be far more common as it's a fundamental part of legalization, so avoiding the mess and bloat of the extra IMPLICIT_DEFs is probably worthwhile. llvm-svn: 279306	2016-08-19 20:08:55 +00:00
Tom Stellard	68726a5359	MachineScheduler: Add constructor functions for the DAGMutations Summary: This way they can be re-used by target-specific schedulers. Reviewers: atrick, MatzeB, kparzysz Subscribers: kparzysz, llvm-commits, MatzeB Differential Revision: https://reviews.llvm.org/D23678 llvm-svn: 279305	2016-08-19 19:59:18 +00:00
Tim Northover	26b76f2c59	GlobalISel: improve representation of G_SEQUENCE and G_EXTRACT First, make sure all types involved are represented, rather than being implicit from the register width. Second, canonicalize all types to scalar. These operations just act in bits and don't care about vectors. Also standardize spelling of Indices in the MachineIRBuilder (NFC here). llvm-svn: 279294	2016-08-19 18:32:14 +00:00
Kyle Butt	5b10483618	Revert "IfConversion: Rescan diamonds." This reverts commit bfd62a4b4465dd21811bf615c3b04c30ddb09f7b. llvm-svn: 279289	2016-08-19 18:17:06 +00:00
Kyle Butt	ce0196de3f	Revert "CodeGen: If Convert blocks that would form a diamond when tail-merged." This reverts commit 0fda93481c4231c06b838ef476c0c404c51ff875. llvm-svn: 279288	2016-08-19 18:17:04 +00:00
Tim Northover	2fa5fa391f	GlobalISel: allow extractvalue to extract an aggregate. llvm-svn: 279287	2016-08-19 18:09:41 +00:00
Tim Northover	6f80b08c64	GlobalISel: support translation of extractvalue instructions. llvm-svn: 279285	2016-08-19 17:47:05 +00:00
Tim Northover	91c8173093	GlobalISel: support overflow arithmetic intrinsics. Unsigned addition and subtraction can reuse the instructions created to legalize large width operations (i.e. both produce and consume a carry flag). Signed operations and multiplies get a dedicated op-with-overflow instruction. Once this is produced the two values are combined into a struct register (which will almost always be merged with a corresponding G_EXTRACT as part of legalization). llvm-svn: 279278	2016-08-19 17:17:06 +00:00
James Molloy	7ee640f9b6	[CodeGen] Fix a trivial type conversion bug dating back to pre-2008 The heuristic above this code is incredibly suspect, but disregarding that it mutates the cast opcode so we need to check the mutated opcode later to see if we need to emit an AssertSext or AssertZext node. Fixes PR29041. llvm-svn: 279223	2016-08-19 08:38:50 +00:00

1 2 3 4 5 ...

21190 Commits