llvm-project

Commit Graph

Author	SHA1	Message	Date
Peter Collingbourne	7a1e5bbe4e	Make WholeProgramDevirt understand ConstStruct vtables. Based on a patch by LemonBoy! Differential Revision: https://reviews.llvm.org/D26581 llvm-svn: 289162	2016-12-09 00:33:27 +00:00
Chris Bieneman	313b326bb6	[ObjectYAML] Support for DWARF debug_aranges This patch adds support for round tripping DWARF debug_aranges in and out of YAML. llvm-svn: 289161	2016-12-09 00:26:44 +00:00
Zia Ansari	394cef803a	[InstSimplify] Add "X / 1.0" to SimplifyFDivInst. Differential Revision: https://reviews.llvm.org/D27587 llvm-svn: 289153	2016-12-08 23:27:40 +00:00
Tim Northover	b58346f2f2	GlobalISel: fall back gracefully for debug intrinsics. Supporting them properly is a reasonably complex chunk of work, so to allow bot testing before then we should at least be able to fall back to DAG ISel. llvm-svn: 289150	2016-12-08 22:44:13 +00:00
Tim Northover	1e656ec137	GlobalISel: factor overflow handling into separate function. NFC. llvm-svn: 289149	2016-12-08 22:44:00 +00:00
Davide Italiano	54c683f9e7	[SCCP] Make sure SCCP and ConstantFolding agree on undef >> a. Currently SCCP folds the value to -1, while ConstantProp folds to 0. This changes SCCP to do what ConstantFolding does. llvm-svn: 289147	2016-12-08 22:28:53 +00:00
Reid Kleckner	785e7d282c	Don't emit .seh_handler directives for any cleanup funclets We were falsely claiming that we had an LSDA for the relevant EH personality before this change, which could lead to the EH machinery interpreting random adjacent data as an LSDA. Fixes PR31317 This change is safe because cleanups can't contain exception handlers today. We do these things to maintain that invariant: - C++ destructors are naturally out-of-line - __finally blocks are outlined in clang - LLVM's inliner will not inline EH constructs into cleanups llvm-svn: 289101	2016-12-08 20:38:46 +00:00
Krzysztof Parzyszek	77a45576ef	[RDF] Fix incorrect lane mask calculation This was exposed by some code that used more than one level of sub- registers. There is no testcase, because there is no such code in the Hexagon backend. llvm-svn: 289099	2016-12-08 20:33:45 +00:00
Matt Arsenault	e96d03745d	AMDGPU: Make f16 ConstantFP legal Not having this legal led to combine failures, resulting in dumb things like bitcasts of constants not being folded away. The only reason I'm leaving the v_mov_b32 hack that f32 already uses is to avoid madak formation test regressions. PeepholeOptimizer has an ordering issue where the immediate fold attempt is into the sgpr->vgpr copy instead of the actual use. Running it twice avoids that problem. llvm-svn: 289096	2016-12-08 20:14:46 +00:00
Stanislav Mekhanoshin	73b54f4134	[AMDGPU] Fix number of reserved SGPRs on CI to reflect flat scratch use Differential Revision: https://reviews.llvm.org/D27225 llvm-svn: 289095	2016-12-08 20:07:23 +00:00
Matt Arsenault	6c06a6f48a	AMDGPU: Fix commuting v_sub_u16 The correct commutable opcode was set to itself, so this was simply swapping the operands to commute instead of also changing the opcode to v_subrev_u16. llvm-svn: 289093	2016-12-08 19:52:38 +00:00
Stanislav Mekhanoshin	50ea93a2bd	[AMDGPU] Add amdgpu-unify-metadata pass Multiple metadata values for records such as opencl.ocl.version, llvm.ident and similar are created after linking several modules. For some of them, notably opencl.ocl.version, this creates semantic problem because we cannot tell which version of OpenCL the composite module conforms. Moreover, such repetitions of identical values often create a huge list of unneeded metadata, which grows bitcode size both in memory and stored on disk. It can go up to several Mb when linked against our OpenCL library. Lastly, such long lists obscure reading of dumped IR. The pass unifies metadata after linking. Differential Revision: https://reviews.llvm.org/D25381 llvm-svn: 289092	2016-12-08 19:46:04 +00:00
Peter Collingbourne	235c275b20	IR, X86: Understand !absolute_symbol metadata on global variables. Summary: Attaching !absolute_symbol to a global variable does two things: 1) Marks it as an absolute symbol reference. 2) Specifies the value range of that symbol's address. Teach the X86 backend to allow absolute symbols to appear in place of immediates by extending the relocImm and mov64imm32 matchers. Start using relocImm in more places where it is legal. As previously proposed on llvm-dev: http://lists.llvm.org/pipermail/llvm-dev/2016-October/105800.html Differential Revision: https://reviews.llvm.org/D25878 llvm-svn: 289087	2016-12-08 19:01:00 +00:00
Chris Bieneman	fbf7dfe1ba	[ObjectYAML] Remove DWARF from class names Since all the DWARF classes are in a DWARFYAML namespace having every class start with DWARF seems like a bit of overkill. llvm-svn: 289080	2016-12-08 17:46:57 +00:00
Alexander Timofeev	18009560c5	[AMDGPU] Scalarization of global uniform loads. Summary: LC can currently select scalar load for uniform memory access basing on readonly memory address space only. This restriction originated from the fact that in HW prior to VI vector and scalar caches are not coherent. With MemoryDependenceAnalysis we can check that the memory location corresponding to the memory operand of the LOAD is not clobbered along the all paths from the function entry. Reviewers: rampitec, tstellarAMD, arsenm Subscribers: wdng, arsenm, nhaehnle Differential Revision: https://reviews.llvm.org/D26917 llvm-svn: 289076	2016-12-08 17:28:47 +00:00
Keno Fischer	dc09119776	ConstantFolding: Don't crash when encountering vector GEP ConstantFolding tried to cast one of the scalar indices to a vector type. Instead, use the vector type only for the first index (which is the only one allowed to be a vector) and use its scalar type otherwise. Fixes PR31250. Reviewers: majnemer Differential Revision: https://reviews.llvm.org/D27389 llvm-svn: 289073	2016-12-08 17:22:35 +00:00
NAKAMURA Takumi	689493bb12	Prune unused libdeps. llvm-svn: 289060	2016-12-08 15:28:02 +00:00
NAKAMURA Takumi	9ccd966612	LanaiInstPrinter: Prune unused libdeps. llvm-svn: 289054	2016-12-08 14:26:30 +00:00
Nicolai Haehnle	f08dc90253	[SelectionDAG] Add expansion and promotion of [US]MUL_LOHI Summary: Most targets set the action for these nodes to Expand even though there isn't actually any code for them in ExpandNode. Instead, targets simply relied on the fact that no code generates these nodes as long as the nodes aren't legal or custom. However, generating these nodes can be useful e.g. for divide-by-constant in wider integer types. Expand of [US]MUL_LOHI will use MULH[US] when legal or custom, and a sequence of half-width multiplications otherwise. Promote uses a wider multiply. This patch intends to not change the generated code, but indirect effects are possible since expansions/promotions that were previously done in DAGCombine may now be done in LegalizeDAG. See D24822 for a change that actually uses the new expansion. Reviewers: spatel, bkramer, venkatra, efriedma, hfinkel, ast, nadav, tstellarAMD Subscribers: arsenm, jyknight, nemanjai, wdng, nhaehnle, llvm-commits Differential Revision: https://reviews.llvm.org/D24956 llvm-svn: 289050	2016-12-08 14:08:14 +00:00
Nicolai Haehnle	2857dc3893	AMDGPU: Properly implement SIRegisterInfo::isFrameOffsetLegal and needsFrameBaseReg Summary: Without the fix to isFrameOffsetLegal to consider the instruction's immediate offset, the new test case hits the corresponding assertion in resolveFrameIndex, because the LocalStackSlotAllocation pass re-uses a different base register. With only the fix to isFrameOffsetLegal, code quality reduces in a bunch of places because frame base registers are added where they're not needed. This is addressed by properly implementing needsFrameBaseReg, which also helps to avoid unnecessary zero frame indices in a bunch of other places. Fixes piglit glsl-1.50/execution/variable-indexing/gs-output-array-vec4-index-wr.shader_test Reviewers: arsenm, tstellarAMD Subscribers: qcolombet, kzhuravl, wdng, yaxunl, tony-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D27344 llvm-svn: 289048	2016-12-08 14:08:02 +00:00
Daniel Jasper	0f77869d58	Move DwarfGenerator.cpp to unittests So far it creates a test helper and so it should be moved there. It also create a layering cycle between CodeGen and CodeGen/AsmPrinter, which should be avoided. Review: https://reviews.llvm.org/D27570 llvm-svn: 289044	2016-12-08 12:45:29 +00:00
Alexey Bataev	4f0d469d45	[SLP] Fix for PR6246: vectorization for scalar ops on vector elements. When trying to vectorize trees that start at insertelement instructions function tryToVectorizeList() uses vectorization factor calculated as MinVecRegSize/ScalarTypeSize. But sometimes it does not work as tree cost for this fixed vectorization factor is too high. Patch tries to improve the situation. It tries different vectorization factors from max(PowerOf2Floor(NumberOfVectorizedValues), MinVecRegSize/ScalarTypeSize) to MinVecRegSize/ScalarTypeSize and tries to choose the best one. Differential Revision: https://reviews.llvm.org/D27215 llvm-svn: 289043	2016-12-08 11:57:51 +00:00
Simon Pilgrim	413c8e217f	Wdocumentation fix llvm-svn: 289038	2016-12-08 10:41:41 +00:00
Oliver Stannard	68e7c21ca0	Add a comment consumer mechanism to MCAsmLexer This allows clients to register an AsmCommentConsumer with the MCAsmLexer, which receives a callback each time a comment is parsed. Differential Revision: https://reviews.llvm.org/D27511 llvm-svn: 289036	2016-12-08 10:31:21 +00:00
Dylan McKay	fac9ce5413	[AVR] Add an assertion to ensure we don't emit LPM when it's unsupported llvm-svn: 289030	2016-12-08 08:34:13 +00:00
Peter Collingbourne	f4257528e9	LTO: Hash the parts of the LTO configuration that affect code generation. Most importantly, we need to hash the relocation model, otherwise we can end up trying to link non-PIC object files into PIEs or DSOs. Differential Revision: https://reviews.llvm.org/D27556 llvm-svn: 289024	2016-12-08 05:28:30 +00:00
Keno Fischer	d4ea4c18f1	Revert "[CodeGen] Fix invalid DWARF info on Win64" Appears to break on build bots. Reverting pending investigation. llvm-svn: 289014	2016-12-08 01:56:23 +00:00
Keno Fischer	460218fb7d	[CodeGen] Fix invalid DWARF info on Win64 The relocations for `DIEEntry::EmitValue` were wrong for Win64 (emitting FK_Data_4 instead of FK_SecRel_4). This corrects that oversight so that the DWARF data is correct in Win64 COFF files. Fixes PR15393. Patch by Jameson Nash <jameson@juliacomputing.com> based on a patch by David Majnemer. Differential Revision: https://reviews.llvm.org/D21731 llvm-svn: 289013	2016-12-08 01:40:21 +00:00
Greg Clayton	3462a420d1	Make a DWARF generator so we can unit test DWARF APIs with gtest. The only tests we have for the DWARF parser are the tests that use llvm-dwarfdump and expect output from textual dumps. More DWARF parser modification are coming in the next few weeks and I wanted to add tests that can verify that we can encode and decode all form types, as well as test some other basic DWARF APIs where we ask DIE objects for their children and siblings. DwarfGenerator.cpp was added in the lib/CodeGen directory. This file contains the code necessary to easily create DWARF for tests: dwarfgen::Generator DG; Triple Triple("x86_64--"); bool success = DG.init(Triple, Version); if (!success) return; dwarfgen::CompileUnit &CU = DG.addCompileUnit(); dwarfgen::DIE CUDie = CU.getUnitDIE(); CUDie.addAttribute(DW_AT_name, DW_FORM_strp, "/tmp/main.c"); CUDie.addAttribute(DW_AT_language, DW_FORM_data2, DW_LANG_C); dwarfgen::DIE SubprogramDie = CUDie.addChild(DW_TAG_subprogram); SubprogramDie.addAttribute(DW_AT_name, DW_FORM_strp, "main"); SubprogramDie.addAttribute(DW_AT_low_pc, DW_FORM_addr, 0x1000U); SubprogramDie.addAttribute(DW_AT_high_pc, DW_FORM_addr, 0x2000U); dwarfgen::DIE IntDie = CUDie.addChild(DW_TAG_base_type); IntDie.addAttribute(DW_AT_name, DW_FORM_strp, "int"); IntDie.addAttribute(DW_AT_encoding, DW_FORM_data1, DW_ATE_signed); IntDie.addAttribute(DW_AT_byte_size, DW_FORM_data1, 4); dwarfgen::DIE ArgcDie = SubprogramDie.addChild(DW_TAG_formal_parameter); ArgcDie.addAttribute(DW_AT_name, DW_FORM_strp, "argc"); // ArgcDie.addAttribute(DW_AT_type, DW_FORM_ref4, IntDie); ArgcDie.addAttribute(DW_AT_type, DW_FORM_ref_addr, IntDie); StringRef FileBytes = DG.generate(); MemoryBufferRef FileBuffer(FileBytes, "dwarf"); auto Obj = object::ObjectFile::createObjectFile(FileBuffer); EXPECT_TRUE((bool)Obj); DWARFContextInMemory DwarfContext(*Obj.get()); This code is backed by the AsmPrinter code that emits DWARF for the actual compiler. While adding unit tests it was discovered that DIEValue that used DIEEntry as their values had bugs where DW_FORM_ref1, DW_FORM_ref2, DW_FORM_ref8, and DW_FORM_ref_udata forms were not supported. These are all now supported. Added support for DW_FORM_string so we can emit inlined C strings. Centralized the code to unique abbreviations into a new DIEAbbrevSet class and made both the dwarfgen::Generator and the llvm::DwarfFile classes use the new class. Fixed comments in the llvm::DIE class so that the Offset is known to be the compile/type unit offset. DIEInteger now supports more DW_FORM values. There are also unit tests that cover: Encoding and decoding all form types and values Encoding and decoding all reference types (DW_FORM_ref1, DW_FORM_ref2, DW_FORM_ref4, DW_FORM_ref8, DW_FORM_ref_udata, DW_FORM_ref_addr) including cross compile unit references with that go forward one compile unit and backward on compile unit. Differential Revision: https://reviews.llvm.org/D27326 llvm-svn: 289010	2016-12-08 01:03:48 +00:00
Evgeniy Stepanov	0c8957c198	CFI-icall on Thumb Replace @progbits in the section directive with %progbits, because "@" starts a comment on arm/thumb. Use b.w branch instruction. Use .thumb_function and .thumb_set for proper arm/thumb interwork. This way jumptable entry addresses on thumb have bit 0 set (correctly). This does not affect CFI check math, because the address of the jumptable start also has that bit set. This does not work on thumbv5, because it does not support b.w, and the linker would not insert a veneer (trampoline?) to extend the range of b.n. We may need to do full-range plt-style jumptables on thumbv54, which are 12 bytes per entry. Another option is "push lr; bl; pop pc" (4 bytes) but that needs unwinding instructions, etc. Differential Revision: https://reviews.llvm.org/D27499 llvm-svn: 289008	2016-12-08 00:32:26 +00:00
Matthias Braun	e2d2ead661	TargetPassConfig: Rename DisablePostRA -> DisablePostRASched; NFC llvm-svn: 289003	2016-12-08 00:16:08 +00:00
Matthias Braun	0c989a893b	LivePhysReg: Use reference instead of pointer in init(); NFC llvm-svn: 289002	2016-12-08 00:15:51 +00:00
Quentin Colombet	ae3168da3f	[InlineSpiller] Don't call TargetInstrInfo::foldMemoryOperand with an empty list. Since r287792 if we try to do that we will hit an assert. llvm-svn: 289001	2016-12-08 00:06:51 +00:00
Eugene Zelenko	9408c61830	[ADT, IR] Fix some Clang-tidy modernize-use-equals-delete and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 288989	2016-12-07 22:06:02 +00:00
Davide Italiano	1ed5396304	[BDCE] Skip metadata while replacing uses. The fix committed in r288851 doesn't cover all the cases. In particular, if we have an instruction with side effects which has a no non-dbg use not depending on the bits, we still perform RAUW destroying the dbg.value's first argument. Prevent metadata from being replaced here to avoid the issue. Differential Revision: https://reviews.llvm.org/D27534 llvm-svn: 288987	2016-12-07 21:47:32 +00:00
Tim Northover	c53606ef02	GlobalISel: use correct builder for ConstantExprs. ConstantExpr instances were emitting code into the current block rather than the entry block. This meant they didn't necessarily dominate all uses, which is clearly wrong. llvm-svn: 288985	2016-12-07 21:29:15 +00:00
Chris Bieneman	79e60eb948	[ObjectYAML] Pull DWARF support into DWARFYAML namespace Since DWARF formatting is agnostic to the object file it is stored in, it doesn't make sense for this to be in the MachOYAML implementation. Pulling it into its own namespace means we could modify the ELF and COFF YAML tools to emit DWARF as well. In a follow-up patch I will better abstract this in obj2yaml and yaml2obj so that the DWARF bits in the tools can be re-used too. llvm-svn: 288984	2016-12-07 21:26:32 +00:00
Tim Northover	50db7f416c	GlobalISel: store the current MachineFunction as direct state. NFC. Having to ask the MIRBuilder for the current function is a little awkward, and I'm intending to improve how that's threaded through anyway. llvm-svn: 288983	2016-12-07 21:17:47 +00:00
Chris Bieneman	25ec226dfc	[ObjectYAML] Rename DWARF entries to match section names This change makes the yaml tags for the members of the DWARF data match the names of the DWARF sections. llvm-svn: 288981	2016-12-07 21:09:37 +00:00
Tim Northover	05cc4859ad	GlobalISel: simplify MachineIRBuilder interface. MachineIRBuilder had weird before/after and beginning/end flags for the insert point. Unfortunately the non-default means that instructions will be inserted in reverse order which is almost never what anyone wants. Really, I think we just want (like IRBuilder has) the ability to insert at any C++ iterator-style point (i.e. before any instruction or before MBB.end()). So this fixes MIRBuilders to behave like IRBuilders in this respect. llvm-svn: 288980	2016-12-07 21:05:38 +00:00
Kostya Serebryany	64a055549a	[libFuzzer] include FuzzerIO.h and hopefully fix the Mac build. reported by Dejan Mircevski llvm-svn: 288979	2016-12-07 21:02:48 +00:00
Matt Arsenault	624e1b348c	InstCombine: Fold bitcast of vector to FP scalar llvm-svn: 288978	2016-12-07 20:56:11 +00:00
Eli Friedman	c6885fc369	[GVNHoist] Invalidate MemDep when an instruction is moved. See also r279907. Fixes https://llvm.org/bugs/show_bug.cgi?id=30991 . Differential Revision: https://reviews.llvm.org/D27493 llvm-svn: 288968	2016-12-07 19:55:59 +00:00
Michael Kuperstein	5842b20633	[X86] Skip over DEBUG_VALUE while looking for start of call sequence If we don't skip over DEBUG_VALUEs, we get differences between -g and non-g code. This fixes PR31242. Differential Revision: https://reviews.llvm.org/D27485 llvm-svn: 288965	2016-12-07 19:31:08 +00:00
Michael Kuperstein	18092cf2c3	[X86] Do not assume "ri" instructions always have an immediate operand The second operand of an "ri" instruction may be an immediate, but it may also be a globalvariable, so we should make any assumptions. This fixes PR31271. Differential Revision: https://reviews.llvm.org/D27481 llvm-svn: 288964	2016-12-07 19:29:18 +00:00
Chris Bieneman	bfff254a10	Fix the apple build issue caused by r288956 Should be checking if HAVE_CRASHREPORTERCLIENT_H is defined not relying on it having a value. llvm-svn: 288963	2016-12-07 19:28:22 +00:00
Chris Bieneman	c6c0e54d3d	[ObjectYAML] Support for DWARF __debug_abbrev section This patch adds support for round-tripping DWARF debug abbreviations through the obj<->yaml tools. llvm-svn: 288955	2016-12-07 18:52:59 +00:00
Simon Pilgrim	ba05d41095	[SelectionDAG] Add knownbits support for vector demandedelts in SMAX/SMIN/UMAX/UMIN opcodes llvm-svn: 288926	2016-12-07 17:54:00 +00:00
Simon Pilgrim	c3c6463ce0	[X86][SSE] Remove AND -> VZEXT combine This is now performed more generally by the target shuffle combine code. Already covered by tests that were originally added in D7666/rL229480 to support combineVectorZext (or VectorZextCombine as it was known then....). Differential Revision: https://reviews.llvm.org/D27510 llvm-svn: 288918	2016-12-07 17:02:41 +00:00
Simon Pilgrim	967325b373	[SelectionDAG] Add knownbits support for EXTRACT_VECTOR_ELT opcodes llvm-svn: 288916	2016-12-07 16:28:21 +00:00
Simon Pilgrim	ff79f31328	[SelectionDAG] Removed old knownbits TODO comment. NFCI. EXTRACT_VECTOR_ELT does support demanded elts if the element index is known and in range. llvm-svn: 288913	2016-12-07 15:31:12 +00:00
Matthew Simpson	364da7e527	[LV] Scalarize operands of predicated instructions This patch attempts to scalarize the operand expressions of predicated instructions if they were conditionally executed in the original loop. After scalarization, the expressions will be sunk inside the blocks created for the predicated instructions. The transformation essentially performs un-if-conversion on the operands. The cost model has been updated to determine if scalarization is profitable. It compares the cost of a vectorized instruction, assuming it will be if-converted, to the cost of the scalarized instruction, assuming that the instructions corresponding to each vector lane will be sunk inside a predicated block, possibly avoiding execution. If it's more profitable to scalarize the entire expression tree feeding the predicated instruction, the expression will be scalarized; otherwise, it will be vectorized. We only consider the cost of the entire expression to accurately estimate the cost of the required insertelement and extractelement instructions. Differential Revision: https://reviews.llvm.org/D26083 llvm-svn: 288909	2016-12-07 15:03:32 +00:00
Benjamin Kramer	b1332d8bf6	Try unbreaking the MSVC build. llvm-svn: 288907	2016-12-07 13:35:11 +00:00
Dylan McKay	99b756eb40	[AVR] Expand 'SELECT_CC' nodes whereever possible llvm-svn: 288905	2016-12-07 12:34:47 +00:00
Benjamin Kramer	926ab5b00b	[LowerTypeTests] Use the TrailingObjects infrastructure for trailing objects. Also avoid allocating ~3x as much memory as needed. llvm-svn: 288904	2016-12-07 12:31:45 +00:00
Andrea Di Biagio	ae5780104f	When GVN removes a redundant load, it should not modify the debug location of the dominating load. In the case of a fully redundant load LI dominated by an equivalent load V, GVN should always preserve the original debug location of V. Otherwise, we risk to introduce an incorrect stepping. If V has debug info, then clearly it should not be modified. If V has a null debugloc, then it is still potentially incorrect to propagate LI's debugloc because LI may not post-dominate V. Differential Revision: https://reviews.llvm.org/D27468 llvm-svn: 288903	2016-12-07 12:31:36 +00:00
Simon Pilgrim	8893bd95f0	[X86][SSE] Consistently set MOVD/MOVQ load/store/move instructions to integer domain We are being inconsistent with these instructions (and all their variants.....) with a random mix of them using the default float domain. Differential Revision: https://reviews.llvm.org/D27419 llvm-svn: 288902	2016-12-07 12:10:49 +00:00
Andrea Di Biagio	eff22832c0	[InlineFunction] Refactor code in function `fixupLineNumbers' as suggested by David in D27462. NFC llvm-svn: 288901	2016-12-07 12:01:45 +00:00
Simon Dardis	615bac37cd	[mips][rtdyld] Merge code to write relocated values to the section. NFC Preparation work for implementing N32 support. Patch By: Daniel Sanders Reviewers: vkalintiris, atanasyan Differential Revision: https://reviews.llvm.org/D27460 llvm-svn: 288900	2016-12-07 11:41:23 +00:00
Simon Pilgrim	d5bc5c16b2	[X86][XOP] Fix VPERMIL2 non-constant pool shuffle decoding (PR31296) The non-constant pool version of DecodeVPERMIL2PMask was not offsetting correctly for the second input. I've updated the code to match the implementation in the constant-pool version. Annoyingly this bug was hidden for so long as it's tricky to combine to useful variable shuffle masks that don't become constant-pool entries. llvm-svn: 288898	2016-12-07 11:19:00 +00:00
Dylan McKay	8cec7eb6dd	[AVR] Allow loading from stack slots where src and dest registers are identical Fixes PR 31256 llvm-svn: 288897	2016-12-07 11:08:56 +00:00
Andrea Di Biagio	32d5aedd5b	[InlineFunction] Do not propagate the callsite debug location to instructions inlined from functions with debug info. When a function F is inlined, InlineFunction extends the debug location of every instruction inlined from F by adding an InlinedAt. However, if an instruction has a 'null' debug location, InlineFunction would propagate the callsite debug location to it. This behavior existed since revision 210459. Revision 210459 was originally committed specifically to workaround the lack of debug information for instructions inlined from intrinsic functions (which are usually declared with attributes `__always_inline__, __nodebug__`). The problem with revision 210459 is that it doesn't make any sort of distinction between instructions inlined from a 'nodebug' function and instructions which are inlined from a function built with debug info. This issue may lead to incorrect stepping in the debugger. This patch works under the assumption that a nodebug function does not have a DISubprogram. When a function F is inlined into another function G, InlineFunction checks if F has debug info associated with it. For nodebug functions, the InlineFunction logic is unchanged (i.e. it would still propagate the callsite debugloc to the inlined instructions). Otherwise, InlineFunction no longer propagates the callsite debug location. Differential Revision: https://reviews.llvm.org/D27462 llvm-svn: 288895	2016-12-07 10:37:26 +00:00
Philip Reames	02bb6a6b0b	Reintroduce a check accidentally removed in 288873 to fix clang bots I believe this is the cause of the failure, but have not been able to confirm. Note that this is a speculative fix; I'm still waiting for a full build to finish as I synced and ended up doing a clean build which takes 20+ minutes on my machine. llvm-svn: 288886	2016-12-07 04:48:50 +00:00
Philip Reames	29b19f0e9e	Fix a warning introduced in r288874 llvm-svn: 288884	2016-12-07 04:11:22 +00:00
Tom Stellard	8485fa096e	AMDGPU : Add S_SETREG instructions to fix fdiv precision issues. Patch By: Wei Ding Summary: This patch fixes the fdiv precision issues. Reviewers: b-sumner, cfang, wdng, arsenm Subscribers: kzhuravl, nhaehnle, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D26424 llvm-svn: 288879	2016-12-07 02:42:15 +00:00
Haicheng Wu	f8b834049a	[AArch64] Correct the check of signed 9-bit imm in isLegalAddressingMode() In the addressing mode, signed 9-bit imm is [-256, 255], not [-512, 511]. Differential Revision: https://reviews.llvm.org/D27480 llvm-svn: 288876	2016-12-07 01:45:04 +00:00
Chandler Carruth	5205c35075	[LCG] Add basic verification of the parent set and fix bugs it uncovers. The existing unittests actually cover this now that we verify things. llvm-svn: 288875	2016-12-07 01:42:40 +00:00
Philip Reames	71a496777c	[LVI] Remove used return value from markX functions llvm-svn: 288874	2016-12-07 01:03:56 +00:00
Philip Reames	b47a719ac0	[LVI] Simplify mergeIn code Remove the unused return type, use early return, use assignment operator. llvm-svn: 288873	2016-12-07 00:54:21 +00:00
Philip Reames	864ab5c516	[LVI] Simplify obfuscated code It doesn't matter why something is overdefined if it is... llvm-svn: 288871	2016-12-07 00:28:28 +00:00
Peter Collingbourne	6f0b4f2e89	IR: Reduce the amount of boilerplate required for a metadata kind. NFCI. llvm-svn: 288867	2016-12-06 23:53:01 +00:00
Tom Stellard	2187bb8a89	AMDGPU: Add llvm.amdgcn.interp.mov intrinsic Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, yaxunl, tony-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D26725 llvm-svn: 288865	2016-12-06 23:52:13 +00:00
Matt Arsenault	269ffdac4e	AMDGPU: Fix crash on i16 constant expression llvm-svn: 288861	2016-12-06 23:18:06 +00:00
Peter Collingbourne	7357b2ad62	LowerTypeTests: Improve performance by optimising type metadata queries. Requesting metadata for a global is a relatively expensive operation as it involves a map lookup, but it's one that we need to do relatively frequently in this pass to collect the list of type metadata nodes associated with a global. This change improves the performance of type metadata queries by prebuilding data structures that keep the global together with its list of type metadata, and changing the pass to use that data structure wherever we were previously passing global references around. This change also eliminates some O(N^2) behavior by collecting the list of globals associated with each type identifier during the first pass over the list of globals rather than visiting each global to compute that list every time we add a new type identifier. Reduces pass runtime on a module containing Chrome's vtables from over 60s to 0.9s. Differential Revision: https://reviews.llvm.org/D27484 llvm-svn: 288859	2016-12-06 23:02:13 +00:00
Eli Friedman	0a76e3241f	[CodeGen] Fix result type for SMULO/UMULO legalization On some platforms (like MSP430) the second element of the result structure for SMULO/UMULO may have a shorter type than the one returned by SetCC. We need to truncate it to the right type, or else some incorrect code may be generated later on. This fixes issue https://github.com/rust-lang/rust/issues/37829 Patch by Vadzim Dambrouski! Differential Revision: https://reviews.llvm.org/D27154 llvm-svn: 288857	2016-12-06 22:49:36 +00:00
Matt Arsenault	ac066f354a	AMDGPU: Fix operand name for v_interp_* Other VOP instructions call the output vdst llvm-svn: 288856	2016-12-06 22:29:43 +00:00
Sanjay Patel	5369775a84	[InstSimplify] fixed (?) to not mutate icmps As Eli noted in the post-commit thread for r288833, the use of swapOperands() may not be allowed in InstSimplify, so I'm removing those calls here pending further review. The swap mutates the icmp, and there doesn't appear to be precedent for instruction mutation in InstSimplify. I didn't actually have any tests for those cases, so I'm adding a few here. llvm-svn: 288855	2016-12-06 22:09:52 +00:00
Tom Stellard	175959e350	AMDGPU/SI: Set correct value for amd_kernel_code_t::kernarg_segment_alignment Reviewers: arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, llvm-commits, tony-tye Differential Revision: https://reviews.llvm.org/D27416 llvm-svn: 288852	2016-12-06 21:53:10 +00:00
Davide Italiano	043e66137c	[BDCE/DebugInfo] Preserve llvm.dbg.value's argument. BDCE has two phases: 1. It asks SimplifyDemandedBits if all the bits of an instruction are dead, and if so, replaces all its uses with the constant zero. 2. Then, it asks SimplifyDemandedBits again if the instruction is really dead (no side effects etc..) and if so, eliminates it. Now, in 1) if all the bits of an instruction are dead, we may end up replacing a dbg use: %call = tail call i32 (...) @g() #4, !dbg !15 tail call void @llvm.dbg.value(metadata i32 %call, i64 0, metadata !8, metadata !16), !dbg !17 -> %call = tail call i32 (...) @g() #4, !dbg !15 tail call void @llvm.dbg.value(metadata i32 0, i64 0, metadata !8, metadata !16), !dbg !17 but not eliminating the call because it may have arbitrary side effects. In other words, we lose some debug informations. This patch fixes the problem making sure that BDCE does nothing with the instruction if it has side effects and no non-dbg uses. Differential Revision: https://reviews.llvm.org/D27471 llvm-svn: 288851	2016-12-06 21:52:47 +00:00
Tom Stellard	00cfa74715	AMDGPU/SI: Don't move copies of immediates to the VALU Summary: If we write an immediate to a VGPR and then copy the VGPR to an SGPR, we can replace the copy with a S_MOV_B32 sgpr, imm, rather than moving the copy to the SALU. Reviewers: arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, llvm-commits, tony-tye Differential Revision: https://reviews.llvm.org/D27272 llvm-svn: 288849	2016-12-06 21:13:30 +00:00
Tim Northover	14ceb45fb4	GlobalISel: correctly handle small args via memory. We were rounding size in bits down rather than up, leading to 0-sized slots for i1 (assert!) and bugs for other types not byte-aligned. llvm-svn: 288848	2016-12-06 21:02:19 +00:00
Zvi Rackover	8bc7e4da51	[X86] Prefer reduced width multiplication over pmulld on Silvermont Summary: Prefer expansions such as: pmullw,pmulhw,unpacklwd,unpackhwd over pmulld. On Silvermont [source: Optimization Reference Manual]: PMULLD has a throughput of 1/11 [instruction/cycles]. PMULHUW/PMULHW/PMULLW have a throughput of 1/2 [instruction/cycles]. Fixes pr31202. Analysis of this issue was done by Fahana Aleen. Reviewers: wmi, delena, mkuper Subscribers: RKSimon, llvm-commits Differential Revision: https://reviews.llvm.org/D27203 llvm-svn: 288844	2016-12-06 19:35:20 +00:00
Simon Pilgrim	dd6ca639d5	[DAGCombine] Add (sext_in_reg (zext x)) -> (sext x) combine Handle the case where a sign extension has ended up being split into separate stages (typically to get around vector legal ops) and a zext + sext_in_reg gets inserted. Differential Revision: https://reviews.llvm.org/D27461 llvm-svn: 288842	2016-12-06 19:09:37 +00:00
Sanjay Patel	9b1b2de348	[InstSimplify] add folds for and-of-icmps with same operands All of these (and a few more) are already handled by InstCombine, but we shouldn't have to wait until then to simplify these because they're cheap to deal with here in InstSimplify. This is the 'and' sibling of the earlier 'or' patch: https://reviews.llvm.org/rL288833 llvm-svn: 288841	2016-12-06 19:05:46 +00:00
Tim Northover	0a683e7bfd	GlobalISel: fall back gracefully when we hit unhandled legalizer default. llvm-svn: 288840	2016-12-06 19:02:15 +00:00
Simon Pilgrim	1577b39f51	[SelectionDAG] We can ignore knownbits from an undef shuffle vector index if we don't actually demand that element llvm-svn: 288839	2016-12-06 18:58:25 +00:00
Tim Northover	c1a23854f3	GlobalISel: handle G_SEQUENCE fallbacks gracefully. There were two problems: + AArch64 was reusing random data from its binary op tables, which is complete nonsense for G_SEQUENCE. + Even when AArch64 gave up and said it couldn't handle G_SEQUENCE, the generic code asserted. llvm-svn: 288836	2016-12-06 18:38:38 +00:00
Tim Northover	f50f2f3d32	GlobalISel: allow G_SELECT instructions for pointers. llvm-svn: 288835	2016-12-06 18:38:34 +00:00
Tim Northover	405e25cd6a	GlobalISel: stop the legalizer from trying to handle oddly-sized types. It'll almost immediately fail because it always tries to half/double the size until it finds a legal one. Unfortunately, this triggers an assertion preventing the DAG fallback from being possible. llvm-svn: 288834	2016-12-06 18:38:29 +00:00
Sanjay Patel	d0ccdb46b9	[InstSimplify] add folds for or-of-icmps with same operands All of these (and a few more) are already handled by InstCombine, but we shouldn't have to wait until then to simplify these because they're cheap to deal with here in InstSimplify. llvm-svn: 288833	2016-12-06 18:09:37 +00:00
Simon Pilgrim	29c17f3f58	Avoid repeated calls to Op.getOpcode(). NFCI. llvm-svn: 288814	2016-12-06 14:50:09 +00:00
Daniel Sanders	4fd1e7c628	[globalisel][aarch64] Fix unintended assumptions about PartialMappingIdx. NFC. Summary: This is NFC but prevents assertions when PartialMappingIdx is tablegen-erated. The assumptions were: 1) FirstGPR is 0 2) FirstGPR is the first of the First* enumerators. GPR32 is changed to 1 to demonstrate that assumption #1 is fixed. #2 will be covered by a subsequent patch that tablegen-erates information and swaps the order of GPR and FPR as a side effect. Depends on D27336 Reviewers: ab, t.p.northover, qcolombet Subscribers: aemerson, rengolin, vkalintiris, dberris, rovka, llvm-commits Differential Revision: https://reviews.llvm.org/D27337 llvm-svn: 288812	2016-12-06 14:39:57 +00:00
Daniel Sanders	21765cb15e	[globalisel][aarch64] Replace magic numbers with corresponding enumerators in ValMappings. NFC Reviewers: ab, t.p.northover, qcolombet Subscribers: aemerson, rengolin, vkalintiris, dberris, llvm-commits, rovka Differential Revision: https://reviews.llvm.org/D27336 llvm-svn: 288810	2016-12-06 13:55:01 +00:00
Daniel Sanders	605f8cd30d	[globalisel][aarch64] Correct argument names in comments. llvm-svn: 288809	2016-12-06 13:48:58 +00:00
Oliver Stannard	870b5cad45	[ARM] Better error message for invalid flag-preserving Thumb1 insts When we see a non flag-setting instruction for which only the flag-setting version is available in Thumb1, we should give a better error message than "invalid instruction". Differential Revision: https://reviews.llvm.org/D27414 llvm-svn: 288805	2016-12-06 12:59:08 +00:00
Ayman Musa	86c00b799f	[X86][AVX512] Detect repeated constant patterns in BUILD_VECTOR suitable for broadcasting. Check if a build_vector node includes a repeated constant pattern and replace it with a broadcast of that pattern. For example: "build_vector <0, 1, 2, 3, 0, 1, 2, 3>" would be replaced by "broadcast <0, 1, 2, 3>" Differential Revision: https://reviews.llvm.org/D26802 llvm-svn: 288804	2016-12-06 12:24:14 +00:00
Nemanja Ivanovic	15748f4921	[PowerPC] Improvements for BUILD_VECTOR Vol. 4 This is the final patch in the series of patches that improves BUILD_VECTOR handling on PowerPC. This adds a few peephole optimizations to remove redundant instructions. It also adds a large test case which encompasses a large set of code patterns that build vectors - this test case was the motivator for this series of patches. Differential Revision: https://reviews.llvm.org/D26066 llvm-svn: 288800	2016-12-06 11:47:14 +00:00
Daniel Sanders	bfd5ff155a	[globalisel][aarch64] Prefix PartialMappingIdx enumerators with 'PMI_' to fit coding standards. This also stops things like 'None' polluting the llvm::AArch64 namespace. llvm-svn: 288799	2016-12-06 11:33:04 +00:00
Simon Pilgrim	9335c020c6	Fix MSVC bool to uint64_t promotion warning llvm-svn: 288796	2016-12-06 11:12:53 +00:00
Chandler Carruth	23a6c3f746	[LCG] Add some much needed asserts and verify runs to uncover a hilarious bug and fix it. We somehow were never verifying the RefSCCs newly formed when splitting an existing one apart, and when verifying them we weren't really checking the SCC indices mapping effectively. If we had been, it would have been blindingly obvious that right after putting something int `RC.SCCs` we should update `RC.SCCIndices` instead of `SCCIndices` which we were about to clear and rebuild anyways. =[ Anyways, this is thoroughly covered by existing tests now that we actually verify things properly. llvm-svn: 288795	2016-12-06 10:29:23 +00:00
Florian Hahn	7582c669bd	[framelowering] Improve tracking of first CS pop instruction. Summary: This patch makes sure FirstCSPop and MBBI never point to DBG_VALUE instructions, which affected the code generated. Reviewers: mkuper, aprantl, MatzeB Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D27343 llvm-svn: 288794	2016-12-06 10:24:55 +00:00
Sam McCall	03435f57aa	Add missing parens in assert. Summary: Add missing parens in assert, which warn in GCC. Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D27448 llvm-svn: 288792	2016-12-06 10:14:36 +00:00
Chandler Carruth	8977223e55	[PM] Basic cleanups to CGSCC update code, NFC. Just using InstIterator, simpler loop structures, and making better use of the visit callback infrastructure. llvm-svn: 288790	2016-12-06 10:06:06 +00:00
Craig Topper	b34eef7b41	[X86] Remove another weird scalar sqrt/rcp/rsqrt pattern. This pattern turned a vector sqrt/rcp/rsqrt operation of sse_load_f32/f64 into the the scalar instruction for the operation and put undef into the upper bits. For correctness, the resulting code should still perform the sqrt/rcp/rsqrt on the upper bits after the load is extended since that's what the operation asked for. Particularly in the case where the upper bits are 0, in that case we need calculate the sqrt/rcp/rsqrt of the zeroes and keep the result in the upper-bits. This implies we should be using the packed instruction still. The only test case for this pattern is one I just added so there was no coverage of this. llvm-svn: 288784	2016-12-06 08:08:12 +00:00
Craig Topper	683470bf1b	[X86] Remove bad pattern that caused 128-bit loads being used by scalar sqrt/rcp/rsqrt intrinsics to select the memory form of the corresponding instruction and violate the semantics of the intrinsic. The intrinsics are supposed to pass the upper bits straight through to their output register. This means we need to make sure we still perform the 128-bit load to get those upper bits to pass to give to the instruction since the memory form of the instruction only reads 32 or 64 bits. llvm-svn: 288781	2016-12-06 08:08:04 +00:00
Craig Topper	5fc7bc91f9	[X86] Correct pattern for VSQRTSSr_Int, VSQRTSDr_Int, VRCPSSr_Int, and VRSQRTSSr_Int to not have an IMPLICIT_DEF on the first input. The semantics of the intrinsic are clear and not undefined. The intrinsic takes one argument, the lower bits are affected by the operation and the upper bits should be passed through. The instruction itself takes two operands, the high bits of the first operand are passed through and the low bits of the second operand are modified by the operation. To match this to the intrinsic we should pass the single intrinsic input to both operands. I had to remove the stack folding test for these instructions since they depended on the incorrect behavior. The same register is now used for both inputs so the load can't be folded. llvm-svn: 288779	2016-12-06 08:07:58 +00:00
Chris Bieneman	8b058aec1d	[ObjectYAML] First bit of support for encoding DWARF in MachO This patch adds the starting support for encoding data from the MachO __DWARF segment. The first section supported is the __debug_str section because it is the simplest. llvm-svn: 288774	2016-12-06 06:00:49 +00:00
Craig Topper	6413f8a8f2	[X86] Remove scalar logical op alias instructions. Just use COPY_FROM/TO_REGCLASS and the normal packed instructions instead Summary: This patch removes the scalar logical operation alias instructions. We can just use reg class copies and use the normal packed instructions instead. This removes the need for putting these instructions in the execution domain fixing tables as was done recently. I removed the loadf64_128 and loadf32_128 patterns as DAG combine creates a narrower load for (extractelt (loadv4f32)) before we ever get to isel. I plan to add similar patterns for AVX512DQ in a future commit to allow use of the larger register class when available. Reviewers: spatel, delena, zvi, RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D27401 llvm-svn: 288771	2016-12-06 04:58:39 +00:00
Chris Bieneman	1b5f563a61	[CMake] Cleanup TableGen include flags It is kinda crazy to have llvm/include and llvm/lib/Target in the include path for every tablegen invocation for every tablegen-like tool. This patch removes those flags from the tablgen function that is called everywhere by instead creating a variable LLVM_TABLEGEN_FLAGS which is setup in the LLVM source directories. This removes TableGen.cmake's dependency on LLVM_MAIN_SRC_DIR, and LLVM_MAIN_INCLUDE_DIR. llvm-svn: 288770	2016-12-06 04:45:11 +00:00
Philip Reames	b486c49ff5	[LVI] Remove dead code in mergeIn Integers are expressed in the lattice via constant ranges. They can never be represented by constants or not-constants; those are reserved for non-integer types. This code has been dead for literaly years. llvm-svn: 288767	2016-12-06 03:34:33 +00:00
Philip Reames	05c435e3a4	[LVI] Extract a helper function Extracting a helper function out of solveBlockValue makes the contract around the cache much easier to understand. llvm-svn: 288766	2016-12-06 03:22:03 +00:00
Philip Reames	1baaef138d	[LVI] Hide the last markX function on LVILatticeVal This completes a small series of patches to hide the stateful updates of LVILatticeVal from the consuming code. The only remaining stateful API is mergeIn. llvm-svn: 288765	2016-12-06 03:01:08 +00:00
Philip Reames	b29496246f	[LVI] Hide a confusing internal interface llvm-svn: 288764	2016-12-06 02:54:16 +00:00
Mandeep Singh Grang	9a561aa34c	[llvm] Fix D26214: Move error handling out of MC and to the callers. Summary: Related clang patch; https://reviews.llvm.org/D27360 Reviewers: t.p.northover, grosbach, compnerd, echristo Subscribers: compnerd, mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D27359 llvm-svn: 288763	2016-12-06 02:49:17 +00:00
Philip Reames	0e613f7c65	[LVI] Remove duplicate code using existing helper function llvm-svn: 288761	2016-12-06 02:36:58 +00:00
Davide Italiano	df670a1984	Revert "[SCCP] Remove manual folding of terminator instructions." This reverts commit r288725 as it broke a bot. llvm-svn: 288759	2016-12-06 02:26:50 +00:00
Matt Arsenault	ad55ee5869	AMDGPU: Don't required structured CFG The structured CFG is just an aid to inserting exec mask modification instructions, once that is done we don't really need it anymore. We also do not analyze blocks with terminators that modify exec, so this should only be impacting true branches. llvm-svn: 288744	2016-12-06 01:02:51 +00:00
Weiming Zhao	b38cfced8d	Summary: Currently there is no way to disable deprecated warning from asm like this clang -target arm deprecated-asm.s -c deprecated-asm.s:30:9: warning: use of SP or PC in the list is deprecated stmia r4!, {r12-r14} We have to have an option what can disable it. Patched by Yin Ma! Reviewers: joey, echristo, weimingz Subscribers: llvm-commits, aemerson Differential Revision: https://reviews.llvm.org/D27219 llvm-svn: 288734	2016-12-05 23:55:13 +00:00
Kostya Serebryany	fe1094b811	[libFuzzer] refactor the code to allow collecting features in different ways. Also initialize a couple of Fuzzer:: members that might have been used uninitialized :( llvm-svn: 288731	2016-12-05 23:35:22 +00:00
Tim Northover	800638fd67	GlobalISel: avoid looking too closely at PHIs when we bail. The function used to finish off PHIs by adding the relevant basic blocks can fail if we're aborting and still don't actually have the needed MachineBasicBlocks. So avoid trying in that case. llvm-svn: 288727	2016-12-05 23:10:19 +00:00
Davide Italiano	3dad93d9ef	[SCCP] Remove manual folding of terminator instructions. There are two cases handled here: 1) a branch on undef 2) a switch with an undef condition. Both cases are currently handled by ResolvedUndefsIn. If we have a branch on undef, we force its value to false (which is trivially foldable). If we have a switch on undef, we force to the first constant (which is also foldable). llvm-svn: 288725	2016-12-05 23:04:21 +00:00
Davide Italiano	ebd5350d85	[TableGen] Centralize/Unify error handling. llvm-svn: 288724	2016-12-05 22:58:01 +00:00
Bob Haarman	a5b4358956	[pdb] handle missing pdb streams more gracefully Summary: The code we use to read PDBs assumed that streams we ask it to read exist, and would read memory outside a vector and crash if this wasn't the case. This would, for example, cause llvm-pdbdump to crash on PDBs generated by lld. This patch handles such cases more gracefully: the PDB reading code in LLVM now reports errors when asked to get a stream that is not present, and llvm-pdbdump will report missing streams and continue processing streams that are present. Reviewers: ruiu, zturner Subscribers: thakis, amccarth Differential Revision: https://reviews.llvm.org/D27325 llvm-svn: 288722	2016-12-05 22:44:00 +00:00
Tim Northover	b566848d68	GlobalISel: place constants correctly in the entry block. When the entry block was empty after arg lowering, we were always placing constants at the end. This is probably hamrless while translating the same block, but horribly wrong once its terminator has been translated. So switch to inserting at the beginning. llvm-svn: 288720	2016-12-05 22:40:13 +00:00
Matt Arsenault	26faed3960	AMDGPU: Consolidate inline immediate predicate functions llvm-svn: 288718	2016-12-05 22:26:17 +00:00
Tim Northover	c0bd197c6b	GlobalISel: handle pointer arguments that get assigned to the stack. llvm-svn: 288717	2016-12-05 22:20:32 +00:00
Matt Arsenault	c7f28a5d95	AMDGPU: Minor assembler refactoring Fix return before else, check types for selecting fltSemantics, refactor immediate checks. llvm-svn: 288715	2016-12-05 22:07:21 +00:00
Tim Northover	cc35f90492	GlobalISel: translate constants larger than 64 bits. llvm-svn: 288713	2016-12-05 21:54:17 +00:00
Tim Northover	9267ac5d47	GlobalISel: make G_CONSTANT take a ConstantInt rather than int64_t. This makes it more similar to the floating-point constant, and also allows for larger constants to be translated later. There's no real functional change in this patch though, just syntax updates. llvm-svn: 288712	2016-12-05 21:47:07 +00:00
Tim Northover	6ad7b9f837	GlobalISel: improve translation fallback for constants. Returning 0 (NoReg) from getOrCreateVReg leads to unexpected situations later in the translation. It's better to return a valid (if undefined) register and let the rest of the instruction carry on as planned. llvm-svn: 288709	2016-12-05 21:40:33 +00:00
Tim Northover	d1fd383b28	GlobalISel: handle 1-element aggregates during ABI lowering. llvm-svn: 288706	2016-12-05 21:25:33 +00:00
Keno Fischer	92f377bd74	[LAA] Prevent invalid IR for loop-invariant bound in loop body Summary: If LAA expands a bound that is loop invariant, but not hoisted out of the loop body, it used to use that value anyway, causing a non-domination error, because the memcheck block is of course not dominated by the scalar loop body. Detect this situation and expand the SCEV expression instead. Fixes PR31251 Reviewers: anemet Subscribers: mzolotukhin, llvm-commits Differential Revision: https://reviews.llvm.org/D27397 llvm-svn: 288705	2016-12-05 21:25:03 +00:00
Michael Kuperstein	e3036abcf9	[X86] Fix non-intrinsic roundss/roundsd to not read the destination register This changes the scalar non-intrinsic non-avx roundss/sd instruction definitions not to read their destination register - allowing partial dependency breaking. This fixes PR31143. Differential Revision: https://reviews.llvm.org/D27323 llvm-svn: 288703	2016-12-05 20:57:37 +00:00
Matt Arsenault	bf6bdac1ad	AMDGPU: Assembler support for exp compr is not currently parsed (or printed) correctly, but that should probably be fixed along with intrinsic changes. llvm-svn: 288698	2016-12-05 20:42:41 +00:00
Matt Arsenault	8a63cb9044	AMDGPU: Change how exp is printed This is an improvement over a long list of unreadable numbers. A follow up patch will try to match how sc formats these. llvm-svn: 288697	2016-12-05 20:31:49 +00:00
Matt Arsenault	7bee6ac798	AMDGPU: Refactor exp instructions Structure the definitions a bit more like the other classes. The main change here is to split EXP with the done bit set to a separate opcode, so we can set mayLoad = 1 so that it won't be reordered before the other exp stores, since this has the special constraint that if the done bit is set then this should be the last exp in she shader. Previously all exp instructions were inferred to have unmodeled side effects. llvm-svn: 288695	2016-12-05 20:23:10 +00:00
Quentin Colombet	0e6cccfb53	[AArch64][RegisterBankInfo] Fix typo in the logic used in assert. Thanks to David Binderman <dcb314@hotmail.com> for bringing it to my attention. llvm-svn: 288688	2016-12-05 19:02:37 +00:00
Adrian Prantl	941fa7588b	[DIExpression] Introduce a dedicated DW_OP_LLVM_fragment operation so we can stop using DW_OP_bit_piece with the wrong semantics. The entire back story can be found here: http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20161114/405934.html The gist is that in LLVM we've been misinterpreting DW_OP_bit_piece's offset field to mean the offset into the source variable rather than the offset into the location at the top the DWARF expression stack. In order to be able to fix this in a subsequent patch, this patch introduces a dedicated DW_OP_LLVM_fragment operation with the semantics that we used to apply to DW_OP_bit_piece, which is what we actually need while inside of LLVM. This patch is complete with a bitcode upgrade for expressions using the old format. It does not yet fix the DWARF backend to use DW_OP_bit_piece correctly. Implementation note: We discussed several options for implementing this, including reserving a dedicated field in DIExpression for the fragment size and offset, but using an custom operator at the end of the expression works just fine and is more efficient because we then only pay for it when we need it. Differential Revision: https://reviews.llvm.org/D27361 rdar://problem/29335809 llvm-svn: 288683	2016-12-05 18:04:47 +00:00
Sanjay Patel	1f158d6955	[TargetLowering] add special-case for demanded bits analysis of 'not' We treat bitwise 'not' as a special operation and try not to reduce its all-ones mask. Presumably, this is because a 'not' may be cheaper than a generic 'xor' or it may get folded into another logic op if the target has those. However, if we can remove a logic instruction by changing the xor's constant mask value, that should always be a win. Note that the IR version of SimplifyDemandedBits() does not treat 'not' as a special-case currently (although that's marked with a FIXME). So if you run this IR through -instcombine, you should get the same end result. I'm hoping to add a different backend transform that will expose this problem though, so I need to solve this first. Differential Revision: https://reviews.llvm.org/D27356 llvm-svn: 288676	2016-12-05 15:58:21 +00:00
Sanjay Patel	f807f6a05f	[x86] fold fand (fxor X, -1) Y --> fandn X, Y I noticed this gap in the scalar FP-logic matching with: D26712 and: rL287171 Differential Revision: https://reviews.llvm.org/D27385 llvm-svn: 288675	2016-12-05 15:45:27 +00:00
Simon Pilgrim	5e922eb0a3	Use range based for loop. NFCI. llvm-svn: 288671	2016-12-05 14:25:04 +00:00
Nirav Dave	d6642c1163	[PPC] Slightly Improve Assembly Parsing errors and add EOL comment parsing tests. NFC intended. llvm-svn: 288667	2016-12-05 14:11:03 +00:00
Simon Dardis	8fe36cd77c	[mips][ias] N32/N64 must not sort the relocation table. Doing so changes the evaluation order for relocation composition. Patch By: Daniel Sanders Reviewers: vkalintiris, atanasyan Differential Revision: https://reviews.llvm.org/D26401 llvm-svn: 288666	2016-12-05 12:55:19 +00:00
Simon Pilgrim	b08c98f125	[X86][SSE] Add support for combining target shuffles to UNPCKL/UNPCKH. llvm-svn: 288663	2016-12-05 11:25:13 +00:00
Simon Pilgrim	20b1409f35	[X86][SSE] Add helper function to create UNPCKL/UNPCKH shuffle masks. NFCI. llvm-svn: 288659	2016-12-05 11:00:25 +00:00
Diana Picus	f11f042ecb	[GlobalISel] Extract handleAssignments out of AArch64CallLowering This function seems target-independent so far: all the target-specific behaviour is isolated in the CCAssignFn and the ValueHandler (which we're also extracting into the generic CallLowering). The intention is to use this in the ARM backend. Differential Revision: https://reviews.llvm.org/D27045 llvm-svn: 288658	2016-12-05 10:40:33 +00:00
Sam Kolton	83102d99ce	[AMDGPU] Disassembler: fix s_buffer_store_dword instructions Summary: s_buffer_store_dword instructions sdata operand was called sdst in encoding. This caused disassembler to fail. Reviewers: tstellarAMD, vpykhtin, artem.tamazov Subscribers: arsenm, nhaehnle, rampitec Differential Revision: https://reviews.llvm.org/D27100 llvm-svn: 288657	2016-12-05 09:58:51 +00:00
Matthias Braun	215ff84b40	TableGen: Some more std::string->StringInit* replacements llvm-svn: 288653	2016-12-05 07:35:13 +00:00
Matthias Braun	99f8937029	TableGen: TableGenStringKey is no longer necessary as of r288642 llvm-svn: 288651	2016-12-05 07:04:19 +00:00
Matthias Braun	ca151317e8	TableGen: Use range based for; reserve vectors where possible llvm-svn: 288650	2016-12-05 07:00:44 +00:00

1 2 3 4 5 ...

97500 Commits