llvm-project

Commit Graph

Author	SHA1	Message	Date
Sanjoy Das	8082592ac9	[RuntimeDyld] Add bounds checking to SectionEntry::advanceStubOffset Summary: Change SectionEntry to keep track of the size of its underlying allocation, and use that to bounds check advanceStubOffset. Reviewers: lhames, andrew.w.kaylor, reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14675 llvm-svn: 253919	2015-11-23 21:47:46 +00:00
Sanjoy Das	277776a520	[RuntimeDyld] Add accessors to `SectionEntry`; NFC Summary: Remove naked access to the data members in `SectionEntry` and route accesses through accessor functions. This makes it obvious how the instances of the class are used, and will also facilitate adding bounds checking to `advanceStubOffset` in a later change. Reviewers: lhames, loladiro, andrew.w.kaylor Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14674 llvm-svn: 253918	2015-11-23 21:47:41 +00:00
Dan Gohman	7054ac1b8b	[WebAssembly] Model the return value of store instructions in wasm. llvm-svn: 253916	2015-11-23 21:16:35 +00:00
Chad Rosier	a15b4b6af2	[LIR] Put includes in correct order. NFC. llvm-svn: 253915	2015-11-23 21:09:13 +00:00
Xinliang David Li	6f7c19a494	[PGO] Add --text option for llvm-profdata show\|merge commands The new option is similar to the SampleProfile dump option. - dump raw/indexed format into text profile format - merge the profile and output into text profile format. Note that Value Profiling data text format is not yet designed. That functionality will be added later. Differential Revision: http://reviews.llvm.org/D14894 llvm-svn: 253913	2015-11-23 20:47:38 +00:00
Diego Novillo	243ea6a7d6	SamplePGO - Add coverage tracking for samples. The existing coverage tracker counts the number of records that were used from the input profile. An alternative view of coverage is to check how many available samples were applied. This way, if the profile contains several records with few samples, it doesn't really matter much that they were not applied. The more interesting records to apply are the ones that contribute many samples. llvm-svn: 253912	2015-11-23 20:12:21 +00:00
Andrew Kaylor	0615a0e65d	[WinEH] Fix a case where GVN could incorrectly PRE a load into an EH pad. Differential Revision: http://reviews.llvm.org/D14842 llvm-svn: 253908	2015-11-23 19:51:41 +00:00
Dan Gohman	aa0a4bd05b	[WebAssembly] Don't use set_local instructions explicitly. The current approach to using get_local and set_local is to use them implicitly, as register uses and defs. Introduce new copy instructions which are themselves no-ops except for the get_local and set_local that they imply, so that we use get_local and set_local consistently. llvm-svn: 253905	2015-11-23 19:30:43 +00:00
Teresa Johnson	6b92316811	[ThinLTO] Deduplicate function index loading into shared helper (NFC) Add a shared helper routine to read the function index from a file and create/return the function index object. Use it in llvm-link and llvm-lto. llvm-svn: 253903	2015-11-23 19:19:11 +00:00
Andrew Kaylor	d0430e8580	[WinEH] Fix problem where CodeGenPrepare incorrectly sinks a bitcast into an EH pad. Differential Revision: http://reviews.llvm.org/D14842 llvm-svn: 253902	2015-11-23 19:16:15 +00:00
Dan Gohman	f6857223c9	[WebAssembly] Always print loop end labels WebAssembly is currently using labels to end scopes, so for example a loop scope looks like this: BB0_0: loop BB0_1 ... BB0_1: with BB0_0 being the label of the first block not in the loop. This requires that the label be printed even when it's only reachable via fallthrough. To arrange this, insert a no-op LOOP_END instruction in such cases at the end of the loop. llvm-svn: 253901	2015-11-23 19:12:37 +00:00
Xinliang David Li	c7c1f8581a	[PGO] Introduce alignment macro for instr-prof control data(NFC) llvm-svn: 253893	2015-11-23 18:02:59 +00:00
Dan Gohman	e425c32224	[WebAssembly] Remove incomplete MCCodeEmitter bits. These are parts of a separate patch that I accidentally included in r253878. llvm-svn: 253892	2015-11-23 18:00:04 +00:00
Paul Robinson	af19bc3a9c	Add Windows error code and tidy formatting for system errors. Differential Revision: http://reviews.llvm.org/D14892 llvm-svn: 253888	2015-11-23 17:34:20 +00:00
Dan Gohman	53828fd777	[WebAssembly] Emit .param, .result, and .local through MC. This eliminates one of the main remaining uses of EmitRawText. llvm-svn: 253878	2015-11-23 16:50:18 +00:00
Diego Novillo	1ca881c4bb	SamplePGO - Clear coverage tracking when clearing per-function data. llvm-svn: 253877	2015-11-23 16:30:17 +00:00
Dan Gohman	3280793234	[WebAssembly] Use dominator information to improve BLOCK placement Always starting blocks at the top of their containing loops works, but creates unnecessarily deep nesting because it makes all blocks in a loop overlap. Refine the BLOCK placement algorithm to start blocks at nearest common dominating points instead, which significantly shrinks them and reduces overlapping. llvm-svn: 253876	2015-11-23 16:19:56 +00:00
Daniel Sanders	2b561336d9	[mips] .ent and .end should also set the type and size of the symbol respectively. Reviewers: vkalintiris Subscribers: llvm-commits, seanbruno, emaste, vkalintiris, dsanders Differential Revision: http://reviews.llvm.org/D14221 llvm-svn: 253875	2015-11-23 16:08:03 +00:00
Diego Novillo	39ab68f39b	SamplePGO - Use newly introduced local variable. NFC. llvm-svn: 253868	2015-11-23 15:24:13 +00:00
Krzysztof Parzyszek	29d23f9f4c	[Hexagon] Update instruction formats llvm-svn: 253867	2015-11-23 14:09:26 +00:00
Martell Malone	a6b867eb0d	ARM: address WoA division overflow crash Disable custom handling of signed 32-bit and 64-bit integer divide. Add test cases for both 32-bit and 64-bit integer overflow crashes. llvm-svn: 253865	2015-11-23 13:11:39 +00:00
Craig Topper	2241dfd2dc	[Mips] Remove an unnecessary wrapping of a predicate with std::ptr_fun. NFC llvm-svn: 253855	2015-11-23 07:19:06 +00:00
Davide Italiano	6f93df8105	[Analysis/CallGraph] Switch dump() definitions over to LLVM_DUMP_METHOD. llvm-svn: 253842	2015-11-23 02:58:42 +00:00
Davide Italiano	945d05f6a0	[LoopStrengthReduce] Mark dump() definitions as LLVM_DUMP_METHOD. llvm-svn: 253841	2015-11-23 02:47:30 +00:00
Mehdi Amini	8220e8a830	Add const qualifier for FunctionInfoIndex in ModuleLinker and linkInModule() (NFC) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 253840	2015-11-23 01:59:16 +00:00
Sanjoy Das	0194743fad	[SCEV] Use C++11'isms llvm-svn: 253837	2015-11-22 21:20:13 +00:00
Benjamin Kramer	0969a2a74c	[MDBuilder] Simplify code using initializer lists. NFC. llvm-svn: 253826	2015-11-22 18:03:17 +00:00
Simon Pilgrim	1dfe53e180	Remove duplicate getValueType() calls. NFCI. llvm-svn: 253823	2015-11-22 16:49:38 +00:00
Krzysztof Parzyszek	6753f33388	Avoid dependency between TableGen and CodeGen Duplicate a few common definitions between DFAPacketizer.cpp and DFAPacketizerEmitter.cpp to avoid including files from CodeGen in TableGen. llvm-svn: 253820	2015-11-22 15:20:19 +00:00
Elena Demikhovsky	0fd11526e2	AVX-512: Optimized INSERT_SUBVECTOR for i1 vector types ISERT_SUBVECTOR for i1 vectors may be done with shifts, when we insert into the lower part, or into the upper part, on into all-zero vector. CONCAT_VECTORS uses ISERT_SUBVECTOR. Differential Revision: http://reviews.llvm.org/D14815 llvm-svn: 253819	2015-11-22 13:57:38 +00:00
Xinliang David Li	924e05843d	[PGO] move names of runtime sections definitions to InstrProfData.inc In profile runtime implementation for Darwin, Linux and FreeBSD, the names of sections holding profile control/counter/naming data need to be known by the runtime in order to locate the start/end of the data. Moving the name definitions to the common file to specify the connection. llvm-svn: 253814	2015-11-22 05:42:31 +00:00
Xinliang David Li	c76732396b	[PGO] Define value profiling updater API signature in InstrProfData.inc (NFC) llvm-svn: 253805	2015-11-22 00:22:07 +00:00
Rafael Espindola	d1beb07d39	Have a single way for creating unique value names. We had two code paths. One would create names like "foo.1" and the other names like "foo1". For globals it is important to use "foo.1" to help C++ name demangling. For locals there is no strong reason to go one way or the other so I kept the most common mangling (foo1). llvm-svn: 253804	2015-11-22 00:16:24 +00:00
Sanjay Patel	8066d906f1	fix formatting; NFC llvm-svn: 253802	2015-11-22 00:03:16 +00:00
Sanjoy Das	b37c4c414b	[SCEVExpander] Use C++isms; NFC llvm-svn: 253801	2015-11-21 23:20:10 +00:00
Teresa Johnson	6290dbc0f7	[ThinLTO] Handle bitcode without function summary sections gracefully Summary: Several fixes to the handling of bitcode files without function summary sections so that they are skipped during ThinLTO processing in llvm-lto and the gold plugin when appropriate instead of aborting. 1 Don't assert when trying to add a FunctionInfo that doesn't have a summary attached. 2 Skip FunctionInfo structures that don't have attached function summary sections when trying to create the combined function summary. 3 In both llvm-lto and gold-plugin, check whether a bitcode file has a function summary section before trying to parse the index, and skip the bitcode file if it does not. 4 Fix hasFunctionSummaryInMemBuffer in BitcodeReader, which had a bug where we returned to early while looking for the summary section. Also added llvm-lto and gold-plugin based tests for cases where we don't have function summaries in the bitcode file. I verified that either the first couple fixes described above are enough to avoid the crashes, or fixes 1,3,4. But have combined them all here for added robustness. Reviewers: joker.eph Subscribers: llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D14903 llvm-svn: 253796	2015-11-21 21:55:48 +00:00
Krzysztof Parzyszek	b46557292c	Hexagon V60/HVX DFA scheduler support Extended DFA tablegen to: - added "-debug-only dfa-emitter" support to llvm-tblgen - defined CVI_PIPE* resources for the V60 vector coprocessor - allow specification of multiple required resources - supports ANDs of ORs - e.g. [SLOT2, SLOT3], [CVI_MPY0, CVI_MPY1] means: (SLOT2 OR SLOT3) AND (CVI_MPY0 OR CVI_MPY1) - added support for combo resources - allows specifying ORs of ANDs - e.g. [CVI_XLSHF, CVI_MPY01] means: (CVI_XLANE AND CVI_SHIFT) OR (CVI_MPY0 AND CVI_MPY1) - increased DFA input size from 32-bit to 64-bit - allows for a maximum of 4 AND'ed terms of 16 resources - supported expressions now include: expression => term [AND term] [AND term] [AND term] term => resource [OR resource]* resource => one_resource \| combo_resource combo_resource => (one_resource [AND one_resource]*) Author: Dan Palermo <dpalermo@codeaurora.org> kparzysz: Verified AMDGPU codegen to be unchanged on all llc tests, except those dealing with instruction encodings. Reapply the previous patch, this time without circular dependencies. llvm-svn: 253793	2015-11-21 20:00:45 +00:00
Craig Topper	a5ea5289ff	Use modulo operator instead of multiplying result of a divide and subtracting from the original dividend. NFC. llvm-svn: 253792	2015-11-21 17:44:42 +00:00
Krzysztof Parzyszek	4ca21fc1aa	Revert r253790: it breaks all builds for some reason. llvm-svn: 253791	2015-11-21 17:38:33 +00:00
Krzysztof Parzyszek	220a9bc018	Hexagon V60/HVX DFA scheduler support Extended DFA tablegen to: - added "-debug-only dfa-emitter" support to llvm-tblgen - defined CVI_PIPE* resources for the V60 vector coprocessor - allow specification of multiple required resources - supports ANDs of ORs - e.g. [SLOT2, SLOT3], [CVI_MPY0, CVI_MPY1] means: (SLOT2 OR SLOT3) AND (CVI_MPY0 OR CVI_MPY1) - added support for combo resources - allows specifying ORs of ANDs - e.g. [CVI_XLSHF, CVI_MPY01] means: (CVI_XLANE AND CVI_SHIFT) OR (CVI_MPY0 AND CVI_MPY1) - increased DFA input size from 32-bit to 64-bit - allows for a maximum of 4 AND'ed terms of 16 resources - supported expressions now include: expression => term [AND term] [AND term] [AND term] term => resource [OR resource]* resource => one_resource \| combo_resource combo_resource => (one_resource [AND one_resource]*) Author: Dan Palermo <dpalermo@codeaurora.org> kparzysz: Verified AMDGPU codegen to be unchanged on all llc tests, except those dealing with instruction encodings. llvm-svn: 253790	2015-11-21 17:23:52 +00:00
Sanjay Patel	04df583a42	use ternary ops; NFC llvm-svn: 253787	2015-11-21 16:51:19 +00:00
Sanjay Patel	1f3fa2133a	remove unnecessary temp variables; NFC llvm-svn: 253786	2015-11-21 16:37:09 +00:00
Sanjay Patel	5a7bdc9632	fix typo; NFC llvm-svn: 253785	2015-11-21 16:16:29 +00:00
Jonas Paulsson	8f0d2b7f1f	[DAGCombiner] Bugfix for lost chain depenedency. When MergeConsecutiveStores() combines two loads and two stores into wider loads and stores, the chain users of both of the original loads must be transfered to the new load, because it may be that a chain user only depends on one of the loads. New test case: test/CodeGen/SystemZ/dag-combine-01.ll Reviewed by James Y Knight. Bugzilla: https://llvm.org/bugs/show_bug.cgi?id=25310#c6 llvm-svn: 253779	2015-11-21 13:25:07 +00:00
Simon Pilgrim	d5a154424b	[X86][AVX512] Added AVX512 VMOVLHPS/VMOVHLPS shuffle decode comments. llvm-svn: 253777	2015-11-21 13:04:42 +00:00
Simon Pilgrim	96cbce61b2	[X86][SSE] Legal XMM Register Class ordering for SSE1 It turns out we have a number of places that just grab the first type attached to a register class for various reasons. This is fine unless for some reason that type isn't legal on the current target, such as for SSE1 which doesn't support v16i8/v8i16/v4i32/v2i64 - all of which were included before 4f32 in the class. Given that this is such a rare situation I've just re-ordered the types and placed the float types first. Fix for PR16133 Differential Revision: http://reviews.llvm.org/D14787 llvm-svn: 253773	2015-11-21 12:38:34 +00:00
Weiming Zhao	8d5c08f591	[SimplifyLibCalls] Removed some TODOs which are already implemented. NFC. Summary: D14302 implements tan(atan(x)) -> x D14045 implements pow(exp(x), y) -> exp(x*y) Patch by Mandeep Singh Grang <mgrang@codeaurora.org> Reviewers: majnemer, davide Differential Revision: http://reviews.llvm.org/D14882 llvm-svn: 253768	2015-11-21 06:10:20 +00:00
Teresa Johnson	16e2a9eeb6	Move new assert to correct location This assert was meant to execute at the end of parseMetadata, but we return early and never reach the end of the function. Caught by a compile-time warning since the function doesn't return a value from that location. llvm-svn: 253762	2015-11-21 03:51:23 +00:00
Kostya Serebryany	b569368a5a	[libFuzzer] don't crash when reporting a leak in test_single_input mode llvm-svn: 253761	2015-11-21 03:46:43 +00:00
Matthias Braun	5a1857b6eb	ARMLoadStoreOptimizer: Cleanup isMemoryOp(); NFC llvm-svn: 253757	2015-11-21 02:09:49 +00:00
Vinicius Tinti	67cf33d9ab	Test commit llvm-svn: 253737	2015-11-20 23:20:12 +00:00
Rong Xu	a1f61fe841	Add some constantness to GetSuccessorNumber(). llvm-svn: 253733	2015-11-20 23:02:06 +00:00
Eric Christopher	25bf4a8617	Power8 and later support fusing addis/addi and addis/ld instruction pairs that use the same register to execute as a single instruction. No Functional Change Patch by Kyle Butt! llvm-svn: 253724	2015-11-20 22:38:20 +00:00
Owen Anderson	8e85130bb9	Fix another infinite loop in Reassociate caused by Constant::isZero(). Not all zero vectors are ConstantDataVector's. llvm-svn: 253723	2015-11-20 22:34:48 +00:00
Geoff Berry	5256fcada0	[CodeGenPrepare] Create more extloads and fewer ands Summary: Add and instructions immediately after loads that only have their low bits used, assuming that the (and (load x) c) will be matched as a extload and the ands/truncs fed by the extload will be removed by isel. Reviewers: mcrosier, qcolombet, ab Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14584 llvm-svn: 253722	2015-11-20 22:34:39 +00:00
Arnaud A. de Grandmaison	4e89e9f846	[ShrinkWrap] Teach ShrinkWrap to handle targets requiring a register scavenger. The included test only checks for a compiler crash for now. Several people are facing this issue, so we first resolve the crash, and will increase shrinkwrap's coverage later in a follow-up patch. llvm-svn: 253718	2015-11-20 21:54:27 +00:00
Diego Novillo	5fb49e5c5f	SamplePGO - Do not count never-executed inlined functions when computing coverage. If a function was originally inlined but not actually hot at runtime, its samples will not be counted inside the parent function. This throws off the coverage calculation because it expects to find more used records than it should. Fixed by ignoring functions that will not be inlined into the parent. Currently, this is inlined functions with 0 samples. In subsequent patches, I'll change this to mean "cold" functions. llvm-svn: 253716	2015-11-20 21:46:38 +00:00
Jun Bum Lim	80ec0d3f5a	[AArch64]Merge narrow zero stores to a wider store This change merges adjacent zero stores into a wider single store. For example : strh wzr, [x0] strh wzr, [x0, #2] becomes str wzr, [x0] This will fix PR25410. llvm-svn: 253711	2015-11-20 21:14:07 +00:00
Eric Christopher	c180836722	Weak non-function symbols were being accessed directly, which is incorrect, as the chosen representative of the weak symbol may not live with the code in question. Always indirect the access through the TOC instead. Patch by Kyle Butt! llvm-svn: 253708	2015-11-20 20:51:31 +00:00
Krzysztof Parzyszek	6c5ca95814	[Hexagon] Fix the return value from HexagonGenInsert::runOnMachineFunction llvm-svn: 253705	2015-11-20 20:46:23 +00:00
Reid Kleckner	437b1b3ea5	Fix the Windows build, include <tuple> for std::tie llvm-svn: 253698	2015-11-20 19:29:40 +00:00
Tilmann Scheller	925b193eed	Revert "[FunctionAttrs] Remove redundant assignment." This reverts r253661. Turns out that the assignment is not redundant (despite the Clang static analyzer claiming the opposite). The variable is being used by the lambda function AddUsersToWorklistIfCapturing(). llvm-svn: 253696	2015-11-20 19:17:10 +00:00
Nathan Slingerland	a731829788	[llvm-profdata] Add merge() to InstrProfRecord Summary: This change refactors two aspects of InstrProfRecord: 1) Add a merge() method to InstrProfRecord (previously InstrProfWriter combineInstrProfRecords()) in order to better encapsulate this functionality and to make the InstrProfRecord and SampleRecord APIs more consistent. 2) Make InstrProfRecord mergeValueProfData() a private method since it is only ever called internally by merge(). Reviewers: dnovillo, bogner, davidxl Subscribers: silvas, vsk, llvm-commits Differential Revision: http://reviews.llvm.org/D14786 llvm-svn: 253695	2015-11-20 19:12:43 +00:00
Artyom Skrobov	7f0fc9ccb7	Avoid duplicate entry for cortex-a7 in the TargetParser (NFC) Reviewers: t.p.northover, rengolin Subscribers: aemerson, rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D14757 llvm-svn: 253676	2015-11-20 16:46:14 +00:00
Artyom Skrobov	91f339ab3f	Handle ARMv6-J as an alias, instead of fake architecture Summary: This follows D14577 to treat ARMv6-J as an alias for ARMv6, instead of an architecture in its own right. The functional change is that the default CPU when targeting ARMv6-J changes from arm1136j-s to arm1136jf-s, which is currently used as the default CPU for ARMv6; both are, in fact, ARMv6-J CPUs. The J-bit (Jazelle support) is irrelevant to LLVM, and it doesn't affect code generation, attributes, optimizations, or anything else, apart from selecting the default CPU. Reviewers: rengolin, logan, compnerd Subscribers: aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D14755 llvm-svn: 253675	2015-11-20 16:46:09 +00:00
Diego Novillo	df544a098a	SamplePGO - Add line offset and discriminator information to sample reports. While debugging some sampling coverage problems, I found this useful: When applying samples from a profile, it helps to also know what line offset and discriminator the sample belongs to. This makes it easy to correlate against the input profile. llvm-svn: 253670	2015-11-20 15:39:42 +00:00
Teresa Johnson	d4d3dfd8ef	[ThinLTO] Add MODULE_CODE_METADATA_VALUES record Summary: This is split out from the ThinLTO metadata mapping patch http://reviews.llvm.org/D14752. To avoid needing to parse the module level metadata during function importing, a new module-level record is added which holds the number of module-level metadata values. This is required because metadata value ids are assigned implicitly during parsing, and the function-level metadata ids start after the module-level metadata ids. I made a change to this version of the code compared to D14752 in order to add more consistent and thorough assertion checking of the new record value. We now unconditionally use the record value to initialize the MDValueList size, and handle it the same in parseMetadata for all module level metadata cases (lazy loading or not). Reviewers: dexonsmith, joker.eph Subscribers: davidxl, llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D14825 llvm-svn: 253668	2015-11-20 14:51:27 +00:00
Tilmann Scheller	4cd1d51a4d	[Hexagon] Remove redundant assignment. Identified by the Clang static analyzer. llvm-svn: 253664	2015-11-20 13:27:30 +00:00
Daniel Sanders	b700203c8b	Partially revert r253662: some unrelated work was accidentally committed with it. Sorry. llvm-svn: 253663	2015-11-20 13:16:35 +00:00
Daniel Sanders	be9db3c00a	Revert the revert 253497 and 253539 - These commits aren't the cause of the clang-cmake-mips failures. Sorry for the noise. llvm-svn: 253662	2015-11-20 13:13:53 +00:00
Tilmann Scheller	1e929f97f6	[FunctionAttrs] Remove redundant assignment. Identified by the Clang static analyzer. llvm-svn: 253661	2015-11-20 12:51:58 +00:00
Tilmann Scheller	bfd7ce01ea	[Hexagon] Remove redundant local variable. Identified by the Clang static analyzer. llvm-svn: 253660	2015-11-20 12:10:17 +00:00
Owen Anderson	630077ef55	Fix a pair of issues that caused an infinite loop in reassociate. Terrifyingly, one of them is a mishandling of floating point vectors in Constant::isZero(). How exactly this issue survived this long is beyond me. llvm-svn: 253655	2015-11-20 08:16:13 +00:00
Craig Topper	e325e3806f	Use range-based for loops. NFC llvm-svn: 253652	2015-11-20 07:18:48 +00:00
Hrvoje Varga	b65518c15c	[mips][microMIPS] Implement MUL[_S].PH, MULEQ_S.W.PHL, MULEQ_S.W.PHR, MULEU_S.PH.QBL, MULEU_S.PH.QBR, MULQ_RS.PH, MULQ_RS.W, MULQ_S.PH and MULQ_S.W instructions Differential Revision: http://reviews.llvm.org/D14280 llvm-svn: 253651	2015-11-20 07:14:52 +00:00
Dan Gohman	d9625276a7	[WebAssembly] Remove the AsmPrinter code for printing physical registers. WebAssembly does not have physical registers, so even if LLVM uses physical registers like SP, they'll need to be lowered to virtual registers before AsmPrinter time. llvm-svn: 253644	2015-11-20 03:13:31 +00:00
Dan Gohman	dfa81d8e22	[WebAssembly] Add a few open tasks to the target README.txt. llvm-svn: 253643	2015-11-20 03:08:27 +00:00
Dan Gohman	bb7ce8e408	[WebAssembly] Rename SWITCH to TABLESWITCH to match the current wording in the spec. llvm-svn: 253642	2015-11-20 03:02:49 +00:00
Dan Gohman	2dfc3b8be5	[WebAssembly] Remove done items from the README.txt. llvm-svn: 253640	2015-11-20 02:51:38 +00:00
Dan Gohman	7bafa0eaef	[WebAssembly] Add asserts that the expression stack is used in stack order. llvm-svn: 253638	2015-11-20 02:33:24 +00:00
Dan Gohman	b0992dafb3	[WebAssemby] Enforce FIFO ordering for instructions using stackified registers. llvm-svn: 253634	2015-11-20 02:19:12 +00:00
Peter Collingbourne	c85f4ced4d	ScalarEvolution: do not set nuw when creating exprs of form <expr> + <all-ones>. The nuw constraint will not be satisfied unless <expr> == 0. This bug has been around since r102234 (in 2010!), but was uncovered by r251052, which introduced more aggressive optimization of nuw scev expressions. Differential Revision: http://reviews.llvm.org/D14850 llvm-svn: 253627	2015-11-20 01:26:13 +00:00
Eric Christopher	eb027124af	Split the argument unscheduling loop in the WebAssembly register coloring pass. Turn the logic into "look for an insert point and then move things past the insert point". No functional change intended. llvm-svn: 253626	2015-11-20 00:34:54 +00:00
Tobias Edler von Koch	4d45090659	[LTO] Add option to emit assembly from LTOCodeGenerator This adds a new API, LTOCodeGenerator::setFileType, to choose the output file format for LTO CodeGen. A corresponding change to use this new API from llvm-lto and a test case is coming in a separate commit. Differential Revision: http://reviews.llvm.org/D14554 llvm-svn: 253622	2015-11-19 23:59:24 +00:00
Eric Christopher	8c3dbcab1d	Fix a [-Werror,-Wcovered-switch-default] warning by removing the unnecessary default case. llvm-svn: 253621	2015-11-19 23:45:42 +00:00
Reid Kleckner	cc2f6c35a3	[WinEH] Disable most forms of demotion Now that the register allocator knows about the barriers on funclet entry and exit, testing has shown that this is unnecessary. We still demote PHIs on unsplittable blocks due to the differences between the IR CFG and the Machine CFG. llvm-svn: 253619	2015-11-19 23:23:33 +00:00
Dan Gohman	3192ddfeba	[WebAssembly] Implement isCheapToSpeculateCtlz and isCheapToSpeculateCttz. This unbreaks test/CodeGen/WebAssembly/i32.ll and test/CodeGen/WebAssembly/i64.ll after r224899. llvm-svn: 253617	2015-11-19 23:04:59 +00:00
Diego Novillo	379cc5e71b	SamplePGO - Tweak debugging output for function samples. NFC. llvm-svn: 253612	2015-11-19 22:18:30 +00:00
Simon Pilgrim	a9912617c8	[X86][SSE4A] Fix issue with EXTRQI shuffles not starting at the correct start index. Found during stress testing. llvm-svn: 253611	2015-11-19 22:13:56 +00:00
Reid Kleckner	ebee6129cd	Fix UMRs in Mips disassembler on invalid instruction streams The Insn and Size local variables were used without initialization. llvm-svn: 253607	2015-11-19 21:51:55 +00:00
Simon Pilgrim	ae0140d6ec	[X86] Use existing MachineInstrBuilder::addDisp to create offseted pointer. NFC. Minor code duplication tidyup to D13988 llvm-svn: 253606	2015-11-19 21:50:57 +00:00
Davide Italiano	c807f487f7	Follow up to r253591. Turn into an assertion. Reported by: David Blaikie. llvm-svn: 253605	2015-11-19 21:50:08 +00:00
Chad Rosier	1cd3da15e8	[LIR] Update some comments. NFC. llvm-svn: 253603	2015-11-19 21:33:07 +00:00
Krzysztof Parzyszek	df537b97b1	Expand subregisters in MachineFrameInfo::getPristineRegs http://reviews.llvm.org/D14719 llvm-svn: 253600	2015-11-19 21:18:52 +00:00
Dehao Chen	014fb55711	Fix the debug build breakage that getDiscriminator is called by mistake. llvm-svn: 253597	2015-11-19 20:29:27 +00:00
Michael Zolotukhin	6c11c04db3	Revert r253253 and r253126: "Don't recompute LCSSA after loop-unrolling when possible." The change exposed a bug in IndVarSimplify (PR25578), which led to a failure (PR25538). When the bug is fixed, this patch can be reapplied. The tests are kept in tree, as they're useful anyway, and will not break with this revert. llvm-svn: 253596	2015-11-19 20:28:32 +00:00
Dehao Chen	23e2278e27	Reimplement discriminator assignment algorithm. Summary: The new algorithm is more efficient (O(n), n is number of basic blocks). And it is guaranteed to cover all cases of multiple BB mapped to same line. Reviewers: dblaikie, davidxl, dnovillo Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14738 llvm-svn: 253594	2015-11-19 19:53:05 +00:00
Davide Italiano	193c4edffb	[AddressSanitizer] assert(false) -> llvm_unreachable and remove return. llvm-svn: 253591	2015-11-19 19:28:23 +00:00
Jun Bum Lim	c12c2790e1	[AArch64] Refactoring aarch64-ldst-opt. NCF. Summary : * Rename isSmallTypeLdMerge() to isNarrowLoad(). * Rename NumSmallTypeMerged to NumNarrowTypePromoted. * Use Subtarget defined as a member variable. llvm-svn: 253587	2015-11-19 18:41:27 +00:00
Chad Rosier	3ecc8d8d83	[LIR] Fix 80-column from previous commit. llvm-svn: 253586	2015-11-19 18:25:11 +00:00
Chad Rosier	fddc01f393	[LIR] Sink checks into function to enable future refactoring. NFC. The purpose of this change is help delineate the memset and memcpy optimizations with the overall goal of resolving PR25520. llvm-svn: 253585	2015-11-19 18:22:21 +00:00
James Molloy	1d695a09dd	[GlobalOpt] Localize some globals that have non-instruction users We currently bail out of global localization if the global has non-instruction users. However, often these can be simple bitcasts or constant-GEPs, which we can easily turn into instructions before localizing. Be a bit more aggressive. llvm-svn: 253584	2015-11-19 18:04:33 +00:00
Sanjay Patel	2fe7728233	update comment and error message; NFC 'notail' was added in: http://reviews.llvm.org/rL252368 llvm-svn: 253580	2015-11-19 17:35:55 +00:00
Chad Rosier	85c21f0a6e	[LIR] Use the more appropriate method. NFC. llvm-svn: 253578	2015-11-19 17:27:28 +00:00
Jun Bum Lim	4c35ccac91	[AArch64]Extend merging narrow loads into a wider load This change extends r251438 to handle more narrow load promotions including byte type, unscaled, and signed. For example, this change will convert : ldursh w1, [x0, #-2] ldurh w2, [x0, #-4] into ldur w2, [x0, #-4] asr w1, w2, #16 and w2, w2, #0xffff llvm-svn: 253577	2015-11-19 17:21:41 +00:00
Sanjay Patel	4699b8ab6a	[CGP] despeculate expensive cttz/ctlz intrinsics This is another step towards allowing SimplifyCFG to speculate harder, but then have CGP clean things up if the target doesn't like it. Previous patches in this series: http://reviews.llvm.org/D12882 http://reviews.llvm.org/D13297 D13297 should catch most expensive ops, but speculation of cttz/ctlz requires special handling because of weirdness in the intrinsic definition for handling a zero input (that definition can probably be blamed on x86). For example, if we have the usual speculated-by-select expensive op pattern like this: %tobool = icmp eq i64 %A, 0 %0 = tail call i64 @llvm.cttz.i64(i64 %A, i1 true) ; is_zero_undef == true %cond = select i1 %tobool, i64 64, i64 %0 ret i64 %cond There's an instcombine that will turn it into: %0 = tail call i64 @llvm.cttz.i64(i64 %A, i1 false) ; is_zero_undef == false This CGP patch is looking for that case and despeculating it back into: entry: %tobool = icmp eq i64 %A, 0 br i1 %tobool, label %cond.end, label %cond.true cond.true: %0 = tail call i64 @llvm.cttz.i64(i64 %A, i1 true) ; is_zero_undef == true br label %cond.end cond.end: %cond = phi i64 [ %0, %cond.true ], [ 64, %entry ] ret i64 %cond This unfortunately may lead to poorer codegen (see the changes in the existing x86 test), but if we increase speculation in SimplifyCFG (the next step in this patch series), then we should avoid those kinds of cases in the first place. The need for this patch was originally mentioned here: http://reviews.llvm.org/D7506 with follow-up here: http://reviews.llvm.org/D7554 Differential Revision: http://reviews.llvm.org/D14630 llvm-svn: 253573	2015-11-19 16:37:10 +00:00
Hans Wennborg	dcc2500452	X86: More efficient legalization of wide integer compares In particular, this makes the code for 64-bit compares on 32-bit targets much more efficient. Example: define i32 @test_slt(i64 %a, i64 %b) { entry: %cmp = icmp slt i64 %a, %b br i1 %cmp, label %bb1, label %bb2 bb1: ret i32 1 bb2: ret i32 2 } Before this patch: test_slt: movl 4(%esp), %eax movl 8(%esp), %ecx cmpl 12(%esp), %eax setae %al cmpl 16(%esp), %ecx setge %cl je .LBB2_2 movb %cl, %al .LBB2_2: testb %al, %al jne .LBB2_4 movl $1, %eax retl .LBB2_4: movl $2, %eax retl After this patch: test_slt: movl 4(%esp), %eax movl 8(%esp), %ecx cmpl 12(%esp), %eax sbbl 16(%esp), %ecx jge .LBB1_2 movl $1, %eax retl .LBB1_2: movl $2, %eax retl Differential Revision: http://reviews.llvm.org/D14496 llvm-svn: 253572	2015-11-19 16:35:08 +00:00
NAKAMURA Takumi	768579c409	TargetParser.cpp: Fixup -- StringRef::startswith() is better here. NFC. llvm-svn: 253570	2015-11-19 15:42:52 +00:00
Diego Novillo	ef548d2918	SamplePGO - Sort samples by source location when emitting as text. When dumping function samples or writing them out as text format, it helps if the samples are emitted sorted by source location. The sorting of the maps is a bit slow, so we only do it on demand. llvm-svn: 253568	2015-11-19 15:33:08 +00:00
NAKAMURA Takumi	b6b254582f	llvm/lib/Support/TargetParser.cpp: Rework llvm::ARM::getArchExtFeature() to avoid abuse of Twine in r253470. llvm-svn: 253566	2015-11-19 15:03:11 +00:00
Chad Rosier	33efdf810f	[LV] Add a helper function, isReductionVariable. NFC. llvm-svn: 253565	2015-11-19 14:19:06 +00:00
Zoran Jovanovic	00f998b440	[mips] Expansion of ROL and ROR macros Author: obucina Reviewers: dsanders Subscribers: dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D10611 llvm-svn: 253564	2015-11-19 14:15:03 +00:00
Elena Demikhovsky	7c2c9fd243	AVX-512: Fixed COPY_TO_REGCLASS for mask registers Copying one mask register to another under BW should be done with kmovq instruction, otherwise we can loose some bits. Copying 8 bits under DQ may be done with kmovb. Differential Revision: http://reviews.llvm.org/D14812 llvm-svn: 253563	2015-11-19 13:13:00 +00:00
Simon Pilgrim	846b64e17a	[X86][AVX] Fix lowering of X86ISD::VZEXT_MOVL for 128-bit -> 256-bit extension The lowering patterns for X86ISD::VZEXT_MOVL for 128-bit to 256-bit vectors were just copying the lower xmm instead of actually masking off the first scalar using a blend. Fix for PR25320. Differential Revision: http://reviews.llvm.org/D14151 llvm-svn: 253561	2015-11-19 12:18:37 +00:00
Alexey Bataev	b7b82bf33e	Alternative to long nops for X86 CPUs, by Andrey Turetsky Make X86AsmBackend generate smarter nops instead of a bunch of 0x90 for code alignment for CPUs which don't support long nop instructions. Differential Revision: http://reviews.llvm.org/D14178 llvm-svn: 253557	2015-11-19 11:44:35 +00:00
James Molloy	0ecdbe7d6b	[FunctionAttrs] Provide a mechanism for adding function attributes from the command line This provides a way to force a function to have certain attributes from the command line. This can be useful when debugging or doing workload exploration, where manually editing IR is tedious or not possible (due to build systems etc). The syntax is -force-attribute=function_name:attribute_name All function attributes are parsed except alignstack as it requires an argument. llvm-svn: 253550	2015-11-19 08:49:57 +00:00
Igor Breger	1f78296869	AVX512: Implemented encoding, intrinsics and DAG lowering for VMOVDDUP instructions. Differential Revision: http://reviews.llvm.org/D14702 llvm-svn: 253548	2015-11-19 08:26:56 +00:00
Igor Breger	4424aaa28e	AVX512: Implemented encoding for the vmovss.s and vmovsd.s instructions. Differential Revision: http://reviews.llvm.org/D14771 llvm-svn: 253547	2015-11-19 07:58:33 +00:00
Igor Breger	81b79de54c	AVX512: Implemented encoding for the follow instructions. vmovapd.s, vmovaps.s, vmovdqa32.s, vmovdqa64.s, vmovdqu16.s, vmovdqu32.s, vmovdqu64.s, vmovdqu8.s, vmovupd.s, vmovups.s Differential Revision: http://reviews.llvm.org/D14768 llvm-svn: 253546	2015-11-19 07:43:43 +00:00
Elena Demikhovsky	1ca72e1846	Pointers in Masked Load, Store, Gather, Scatter intrinsics The masked intrinsics support all integer and floating point data types. I added the pointer type to this list. Added tests for CodeGen and for Loop Vectorizer. Updated the Language Reference. Differential Revision: http://reviews.llvm.org/D14150 llvm-svn: 253544	2015-11-19 07:17:16 +00:00
Pete Cooper	67cf9a723b	Revert "Change memcpy/memset/memmove to have dest and source alignments." This reverts commit r253511. This likely broke the bots in http://lab.llvm.org:8011/builders/clang-ppc64-elf-linux2/builds/20202 http://bb.pgr.jp/builders/clang-3stage-i686-linux/builds/3787 llvm-svn: 253543	2015-11-19 05:56:52 +00:00
Mehdi Amini	354f520fbc	Do not require a Context to extract the FunctionIndex from Bitcode (NFC) The LLVMContext was only used for Diagnostic. Pass a DiagnosticHandler instead. Differential Revision: http://reviews.llvm.org/D14794 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 253540	2015-11-19 05:52:29 +00:00
Weiming Zhao	b69babd01e	Fix bug 25440: GVN assertion after coercing loads Optimizations like LoadPRE in GVN will insert new instructions. If the insertion point is in a already processed BB, they should get a value number explicitly. If the insertion point is after current instruction, then just leave it. However, current GVN framework has no support for it. In this patch, we just bail out if a VN can't be found. Dfferential Revision: http://reviews.llvm.org/D14670 A test/Transforms/GVN/pr25440.ll M lib/Transforms/Scalar/GVN.cpp llvm-svn: 253536	2015-11-19 02:45:18 +00:00
Quentin Colombet	46d5c71135	[X86] Enable shrink-wrapping by default. Differential Revision: http://reviews.llvm.org/D14156 rdar://problem/21118279 llvm-svn: 253528	2015-11-19 00:38:00 +00:00
Cong Hou	7b2ae9abba	Fix several long lines (>80) in LoopVectorize.cpp. NFC. llvm-svn: 253527	2015-11-19 00:32:30 +00:00
Davide Italiano	c5cedd195a	[SimplifyLibCalls] New trick: pow(x, 0.5) -> sqrt(x) under -ffast-math. Differential Revision: http://reviews.llvm.org/D14466 llvm-svn: 253521	2015-11-18 23:21:32 +00:00
Quentin Colombet	f6645cce91	[AArch64] Enable shrink-wrapping by default. Differential Revision: http://reviews.llvm.org/D14360 rdar://problem/20820748 llvm-svn: 253520	2015-11-18 23:12:20 +00:00
Mehdi Amini	adb4057a15	Fix returned value for GVN: could return "false" even after modifying the IR This bug would manifest in some very specific cases where all the following conditions are fullfilled: - GVN didn't remove block - The regular GVN iteration didn't change the IR - PRE is enabled - PRE will not split critical edge - The last instruction processed by PRE didn't change the IR Because the CallGraph PassManager relies on this returned value to decide if it needs to recompute a node after the execution of Function passes, not returning the right value can lead to unexpected results. Fix for: https://llvm.org/bugs/show_bug.cgi?id=24715 Patch by Wenxiang Qiu <vincentqiuuu@gmail.com> From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 253518	2015-11-18 22:49:49 +00:00
Xinliang David Li	cfb1456572	Minor cleanups (from review feedback) 1. remove uneeded header inclusion 2. use reinterpret_cast instead of c ctyle 3. other format change llvm-svn: 253515	2015-11-18 22:42:27 +00:00
Davide Italiano	455ea11d13	[BuildLibCalls] EmitStrNLen() is dead code. Garbage collect. llvm-svn: 253514	2015-11-18 22:29:38 +00:00
Pete Cooper	72bc23ef02	Change memcpy/memset/memmove to have dest and source alignments. Note, this was reviewed (and more details are in) http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20151109/312083.html These intrinsics currently have an explicit alignment argument which is required to be a constant integer. It represents the alignment of the source and dest, and so must be the minimum of those. This change allows source and dest to each have their own alignments by using the alignment attribute on their arguments. The alignment argument itself is removed. There are a few places in the code for which the code needs to be checked by an expert as to whether using only src/dest alignment is safe. For those places, they currently take the minimum of src/dest alignments which matches the current behaviour. For example, code which used to read: call void @llvm.memcpy.p0i8.p0i8.i32(i8* %dest, i8* %src, i32 500, i32 8, i1 false) will now read: call void @llvm.memcpy.p0i8.p0i8.i32(i8* align 8 %dest, i8* align 8 %src, i32 500, i1 false) For out of tree owners, I was able to strip alignment from calls using sed by replacing: (call.llvm\.memset.)i32\ [0-9]\,\ i1 false\) with: $1i1 false) and similarly for memmove and memcpy. I then added back in alignment to test cases which needed it. A similar commit will be made to clang which actually has many differences in alignment as now IRBuilder can generate different source/dest alignments on calls. In IRBuilder itself, a new argument was added. Instead of calling: CreateMemCpy(Dst, Src, getInt64(Size), DstAlign, / isVolatile / false) you now call CreateMemCpy(Dst, Src, getInt64(Size), DstAlign, SrcAlign, / isVolatile */ false) There is a temporary class (IntegerAlignment) which takes the source alignment and rejects implicit conversion from bool. This is to prevent isVolatile here from passing its default parameter to the source alignment. Note, changes in future can now be made to codegen. I didn't change anything here, but this change should enable better memcpy code sequences. Reviewed by Hal Finkel. llvm-svn: 253511	2015-11-18 22:17:24 +00:00
Simon Pilgrim	c1a46b729b	[DAGCombiner] Vector constant folding for comparisons This patch adds support for vector constant folding of integer/float comparisons. This requires FoldConstantVectorArithmetic to support scalar constant operands (in this case ISD::CONDCASE). In future we should be able to support other scalar constant types as necessary (and possibly start calling FoldConstantVectorArithmetic for all node creations) Differential Revision: http://reviews.llvm.org/D14683 llvm-svn: 253504	2015-11-18 21:17:19 +00:00
Tim Northover	747ae9a7de	ARM: make sure backend is consistent about exception handling method. It turns out we decide whether to use SjLj exceptions or some alternative in two separate places in the backend, and they disagreed with each other. This led to inconsistent code and is generally a terrible idea. So make them consistent and add an assert that they do match (unfortunately MCAsmInfo isn't available in opt, so it can't be used to initialise the CodeGen version directly). llvm-svn: 253502	2015-11-18 21:10:39 +00:00
Mike Aizatsky	c7810baaa6	Disable gvn non-local speculative loads under asan. Summary: Fix for https://llvm.org/bugs/show_bug.cgi?id=25550 Differential Revision: http://reviews.llvm.org/D14763 llvm-svn: 253498	2015-11-18 20:43:00 +00:00
Betul Buyukkurt	6fac1741c9	[PGO] Value profiling support This change introduces an instrumentation intrinsic instruction for value profiling purposes, the lowering of the instrumentation intrinsic and raw reader updates. The raw profile data files for llvm-profdata testing are updated. llvm-svn: 253484	2015-11-18 18:14:55 +00:00
Matthew Simpson	343af07aa9	[Aarch64] Add cost for missing extensions. This patch adds a cost estimate for some missing sign and zero extensions. The costs were determined by counting the number of shift instructions generated without context for each new extension. Differential Revision: http://reviews.llvm.org/D14730 llvm-svn: 253482	2015-11-18 18:03:06 +00:00
Dan Gohman	94ef41ff1d	[WebAssembly] Add more whitespace characters to prettify the assembly output. llvm-svn: 253472	2015-11-18 17:05:35 +00:00
Bradley Smith	7b0a7d8d1e	[ARM] Add +feature names to TargetParser extensions table llvm-svn: 253470	2015-11-18 16:32:12 +00:00
Dan Gohman	1f29c68042	[WebAssembly] Add some spaces to the assembly output to vertically align operands. llvm-svn: 253468	2015-11-18 16:25:38 +00:00
Dan Gohman	4ba4816b97	[WebAssembly] Enable register coloring and register stackifying. This also takes the push/pop syntax another step forward, introducing stack slot numbers to make it easier to see how expressions are connected. For example, the value pushed in $push7 is popped in $pop7. And, this begins an experiment with making get_local and set_local implicit when an operation directly uses or defines a register. This greatly reduces clutter. If this experiment succeeds, it may make sense to do this for const instructions as well. And, this introduces more special code for ARGUMENTS; hopefully this code will soon be obviated by proper support for live-in virtual registers. llvm-svn: 253465	2015-11-18 16:12:01 +00:00
Manuel Klimek	272d3f17fc	Fix bug where WinCOFFObjectWriter would assume starting from an empty output. Starting on an input stream that is not at offset 0 would trigger the assert in WinCOFFObjectWriter.cpp:1065: assert(getStream().tell() <= (*i)->Header.PointerToRawData && "Section::PointerToRawData is insane!"); llvm-svn: 253464	2015-11-18 15:24:17 +00:00
Jonas Paulsson	af722f8287	[SelectionDAGBuilder] Make sure DemoteReg ends up in right reg-class. The virtual register containing the address for returned value on stack should in the DAG be represented with a CopyFromReg node and not a Register node. Otherwise, InstrEmitter will not make sure that it ends up in the right register class for the target instruction. SystemZ needs this, becuause the reg class for address registers is a subset of the general 64 bit register class. test/SystemZ/CodeGen/args-07.ll and args-04.ll updated to run with -verify-machineinstrs. Reviewed by Hal Finkel. llvm-svn: 253461	2015-11-18 14:59:00 +00:00
Igor Laevsky	7310c68e85	Revert "Revert "Strip metadata when speculatively hoisting instructions (r252604)" Failing clang test is now fixed by the r253458. llvm-svn: 253459	2015-11-18 14:50:18 +00:00
James Molloy	9ad4f22538	[LTO] Add an early run of functionattrs Because we internalize early, we can potentially mark a bunch of functions as norecurse. Do this before globalopt. llvm-svn: 253451	2015-11-18 11:24:42 +00:00
Asaf Badouh	0d957b8b09	[X86][AVX512CD] add mask broadcast intrinsics Differential Revision: http://reviews.llvm.org/D14573 llvm-svn: 253450	2015-11-18 09:42:45 +00:00
Igor Breger	5574730454	AVX512: Implemented encoding for vpextrw.s instruction. Differential Revision: http://reviews.llvm.org/D14766 llvm-svn: 253447	2015-11-18 08:46:16 +00:00
Sanjoy Das	f79d3449c5	[OperandBundles] Tighten OperandBundleDef's interface; NFC llvm-svn: 253446	2015-11-18 08:30:07 +00:00
Hrvoje Varga	78409019d9	[mips][microMIPS] Implement DPS.W.PH, DPSQ_S.W.PH, DPSQ_SA.L.W, DPSQX_S.W.PH, DPSQX_SA.W.PH, DPSU.H.QBL, DPSU.H.QBR and DPSX.W.PH instructions Differential Revision: http://reviews.llvm.org/D14058 llvm-svn: 253443	2015-11-18 07:41:35 +00:00
Craig Topper	66059c9f4d	Replace dyn_cast with isa in places that weren't using the returned value for more than a boolean check. NFC. llvm-svn: 253441	2015-11-18 07:07:59 +00:00
Rafael Espindola	55512f9b25	Default SetVector to use a DenseSet. We use to have an odd difference among MapVector and SetVector. The map used a DenseMop, but the set used a SmallSet, which in turn uses a std::set. I have changed SetVector to use a DenseSet. If you were depending on the old behaviour you can pass an explicit set type or use SmallSetVector. The common cases for needing to do it are: * Optimizing for small sets. * Sets for types not supported by DenseSet. llvm-svn: 253439	2015-11-18 06:52:18 +00:00
Sanjoy Das	2d16145acf	Teach the inliner to track deoptimization state Summary: This change teaches LLVM's inliner to track and suitably adjust deoptimization state (tracked via deoptimization operand bundles) as it inlines through call sites. The operation is described in more detail in the LangRef changes. Reviewers: reames, majnemer, chandlerc, dexonsmith Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14552 llvm-svn: 253438	2015-11-18 06:23:38 +00:00
Rafael Espindola	449711cb36	Stop producing .data.rel sections. If a section is rw, it is irrelevant if the dynamic linker will write to it or not. It looks like llvm implemented this because gcc was doing it. It looks like gcc implemented this in the hope that it would put all the relocated items close together and speed up the dynamic linker. There are two problem with this: * It doesn't work. Both bfd and gold will map .data.rel to .data and concatenate the input sections in the order they are seen. * If we want a feature like that, it can be implemented directly in the linker since it knowns where the dynamic relocations are. llvm-svn: 253436	2015-11-18 06:02:15 +00:00
Cong Hou	136bc65ec8	Remove a redundant assertion in MachineBasicBlock.cpp. NFC. llvm-svn: 253426	2015-11-18 01:55:56 +00:00
Cong Hou	11c1420173	Remove redundant code in MachineBasicBlock.cpp. NFC. llvm-svn: 253425	2015-11-18 01:45:10 +00:00
Kostya Serebryany	4d62322213	[libFuzzer] remove default initializer as a workaround for https://gcc.gnu.org/bugzilla/show_bug.cgi?id=68399 . Don't need it anyway. llvm-svn: 253419	2015-11-18 01:08:30 +00:00
Cong Hou	41cf1a5dfb	Improving edge probabilities computation when choosing the best successor in machine block placement. When looking for the best successor from the outer loop for a block belonging to an inner loop, the edge probability computation can be improved so that edges in the inner loop are ignored. For example, suppose we are building chains for the non-loop part of the following code, and looking for B1's best successor. Assume the true body is very hot, then B3 should be the best candidate. However, because of the existence of the back edge from B1 to B0, the probability from B1 to B3 can be very small, preventing B3 to be its successor. In this patch, when computing the probability of the edge from B1 to B3, the weight on the back edge B1->B0 is ignored, so that B1->B3 will have 100% probability. if (...) do { B0; ... // some branches B1; } while(...); else B2; B3; Differential revision: http://reviews.llvm.org/D10825 llvm-svn: 253414	2015-11-18 00:52:52 +00:00
Quentin Colombet	8cb95b8e51	[ARM] Enable shrink-wrapping by default. Differential Revision: http://reviews.llvm.org/D14357 rdar://problem/21942589 llvm-svn: 253411	2015-11-18 00:40:54 +00:00
David Blaikie	6196aa06c9	Generalize ownership/passing semantics to allow dsymutil to own abbreviations via unique_ptr While still allowing CodeGen/AsmPrinter in llvm to own them using a bump ptr allocator. (might be nice to replace the pointers there with something that at least automatically calls their dtors, if that's necessary/useful, rather than having it done explicitly (I think a typed BumpPtrAllocator already does this, or maybe a unique_ptr with a custom deleter, etc)) llvm-svn: 253409	2015-11-18 00:34:10 +00:00
Sanjay Patel	77f4486950	[InstCombine] refactor optimizeIntToFloatBitCast() ; NFCI The logic for handling the pattern without a shift is identical to the logic for handling the pattern with a shift if you set the shift amount to zero for the former. This should make it easier to see that we probably don't even need optimizeIntToFloatBitCast(). If we call something like foldVecTruncToExtElt() from visitTrunc(), we'll solve PR25543: https://llvm.org/bugs/show_bug.cgi?id=25543 llvm-svn: 253403	2015-11-18 00:00:04 +00:00
Simon Pilgrim	2da4178737	[X86][AVX512] Added AVX512 SHUFP/VPERMILP shuffle decode comments. llvm-svn: 253396	2015-11-17 23:29:49 +00:00
Xinliang David Li	99556877ae	[PGO] Move value profile data definitions out of IndexedInstrProf Move the data structure defintions out of the namespace. The defs will be shared by raw format. [NFC] llvm-svn: 253394	2015-11-17 23:00:40 +00:00
David Blaikie	4689ef5943	Fix null dereference committed in r253277 llvm-svn: 253393	2015-11-17 22:39:26 +00:00
David Blaikie	35c2eebfe4	dwarfdump: support indexed string dumping in dwp based on the STR_OFFSETS component of the index llvm-svn: 253392	2015-11-17 22:39:23 +00:00
Simon Pilgrim	8483df6e24	[X86][AVX512] Added support for AVX512 UNPCK shuffle decode comments. llvm-svn: 253391	2015-11-17 22:35:45 +00:00
Nathan Slingerland	e6e30d5e88	[llvm-profdata] Improve error messaging when merging mismatched profile data Summary: This change tries to make the root cause of instrumented profile data merge failures clearer. Previous: $ llvm-profdata merge test_0.profraw test_1.profraw -o test_merged.profdata test_1.profraw: foo: Function count mismatch test_1.profraw: bar: Function count mismatch test_1.profraw: baz: Function count mismatch ... Changed: $ llvm-profdata merge test_0.profraw test_1.profraw -o test_merged.profdata test_1.profraw: foo: Function basic block count change detected (counter mismatch) Make sure that all profile data to be merged is generated from the same binary. test_1.profraw: bar: Function basic block count change detected (counter mismatch) test_1.profraw: baz: Function basic block count change detected (counter mismatch) ... Reviewers: dnovillo, davidxl, bogner Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14739 llvm-svn: 253384	2015-11-17 22:08:53 +00:00
Reid Kleckner	c20276d0b2	[WinEH] Move WinEHFuncInfo from MachineModuleInfo to MachineFunction Summary: Now that there is a one-to-one mapping from MachineFunction to WinEHFuncInfo, we don't need to use a DenseMap to select the right WinEHFuncInfo for the current funclet. The main challenge here is that X86WinEHStatePass is an IR pass that doesn't have access to the MachineFunction. I gave it its own WinEHFuncInfo object that it uses to calculate state numbers, which it then throws away. As long as nobody creates or removes EH pads between this pass and SDAG construction, we will get the same state numbers. The other thing X86WinEHStatePass does is to mark the EH registration node. Instead of communicating which alloca was the registration through WinEHFuncInfo, I added the llvm.x86.seh.ehregnode intrinsic. This intrinsic generates no code and simply marks the alloca in use. Reviewers: JCTremoulet Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14668 llvm-svn: 253378	2015-11-17 21:10:25 +00:00
David Blaikie	c4e2bed738	dwarfdump: Reference the appropriate line table segment when dumping dwp files Also improves .dwo type unit dumping which didn't handle this either. llvm-svn: 253377	2015-11-17 21:08:05 +00:00
Andrew Kaylor	de642cef2c	[EH] Keep filter clauses for types that have been caught. The instruction combiner previously removed types from filter clauses in Landing Pad instructions if the type had previously been seen in a catch clause. This is incorrect and prevents unexpected exception handlers from rethrowing the caught type. Differential Revision: http://reviews.llvm.org/D14669 llvm-svn: 253370	2015-11-17 20:13:04 +00:00
Ulrich Weigand	36b8626b00	[RuntimeDyld] Fix resolving R_PPC64_REL24 relocations When resolving R_PPC64_REL24, code used to check for an address delta that fits in 24 bits, while the instructions that take this relocation actually can process address deltas that fit into 26 bits (as those instructions have a 24 bit field, but implicitly append two zero bits at the end since all instruction addresses are a multiple of 4). This means that code would signal overflow once a single object's text section exceeds 8 MB, while we can actually support up to 32 MB. Partially fixes PR25540. llvm-svn: 253369	2015-11-17 20:08:31 +00:00
Yunzhong Gao	8e348cc732	Switch lto codegen to using diagnostic handlers. This patch removes the std::string& argument from a number of C++ LTO API calls and instead makes them use the installed diagnostic handler. This would also improve consistency of diagnostic handling infrastructure: if an LTO client used lto_codegen_set_diagnostic_handler() to install a custom error handler, we do not want some error messages to go through the custom error handler, and some other error messages to go into sLastErrorString. llvm-svn: 253367	2015-11-17 19:48:12 +00:00
George Burgess IV	2ae15e0609	Specify explicit storage type for AllocType. NFC. llvm-svn: 253366	2015-11-17 19:48:06 +00:00
Elena Demikhovsky	3ec9e15ad4	Vector of pointers in function attributes calculation While setting function attributes we check all instructions that may access memory. For a call instruction we check all arguments. The special check is required for pointers. I added vector-of-pointers to the call arguments types that should be checked. Differential Revision: http://reviews.llvm.org/D14693 llvm-svn: 253363	2015-11-17 19:30:51 +00:00
Diego Novillo	ba920be4a2	SamplePGO - Move debug/dump function bodies out of header files. NFC. No point polluting the header declarations with debugging code. llvm-svn: 253361	2015-11-17 19:04:46 +00:00
David Blaikie	ff43d69ddf	StringRef-ify some Option APIs Patch by Eugene Kosov! Differential Revision: http://reviews.llvm.org/D14711 llvm-svn: 253360	2015-11-17 19:00:52 +00:00
Sanjay Patel	1de794aa3a	fix typos; NFC llvm-svn: 253359	2015-11-17 18:46:56 +00:00
Sanjay Patel	f09d1bfced	use local variables; NFCI llvm-svn: 253356	2015-11-17 18:37:23 +00:00
Charlie Turner	7968b981bf	[ARM] Don't pessimize i32 vselect. The underlying issues surrounding codegen for 32-bit vselects have been resolved. The pessimistic costs for 64-bit vselects remain due to the bad scalarization that is still happening there. I tested this on A57 in T32, A32 and A64 modes. I saw no regressions, and some improvements. From my benchmarks, I saw these improvements in A57 (T32) spec.cpu2000.ref.177_mesa 5.95% lnt.SingleSource/Benchmarks/Shootout/strcat 12.93% lnt.MultiSource/Benchmarks/MiBench/telecomm-CRC32/telecomm-CRC32 11.89% I also measured A57 A32, A53 T32 and A9 T32 and found no performance regressions. I see much bigger wins in third-party benchmarks with this change Differential Revision: http://reviews.llvm.org/D14743 llvm-svn: 253349	2015-11-17 17:25:15 +00:00
Sanjay Patel	431e1143ec	function names start with a lower case letter; NFC llvm-svn: 253348	2015-11-17 17:24:08 +00:00
Pawel Bylica	a90e745109	[Support] Tweak path::system_temp_directory() on Windows. Summary: This patch changes the behavior of path::system_temp_directory() on Windows to be closer to GetTempPath Windows API call. Enforces path separator to be the native one, makes path absolute, etc. GetTempPath is not used directly because of limitations/implementation bugs on Windows 7. Windows specific unit tests are added. Most of them runs in separated process with modified environment variables. This change fixes FileSystemTest.CreateDir unittest that had been failing when run from Unix-like shell on Windows (Unix-like path separator (/) used in env variables). Reviewers: chapuni, rafael, aaron.ballman Subscribers: rafael, llvm-commits Differential Revision: http://reviews.llvm.org/D14231 llvm-svn: 253345	2015-11-17 16:54:32 +00:00
Ahmed Bougacha	88ddeae8bd	[AArch64] Promote f16 SELECT_CC CC operands when op is legal. SELECT_CC has the nasty property of having operands with unrelated types. So if you do something like: f32 = select_cc f16, f16, f32, f32, cc You'd only look for the action for <select_cc, f32>, but never f16. If the types are all legal, but the op isn't (as for f16 on AArch64, or for f128 on x86_64/AArch64?), then you get into trouble. For f128, we have softenSetCCOperands to handle this case. Similarly, for f16, we can directly promote the CC operands. llvm-svn: 253344	2015-11-17 16:45:40 +00:00
Davide Italiano	7f9f835cfb	[JIT/Memory] Fix up semantic of setExecutable(). setExecutable() should do everything that's needed to make the memory executable on host, i.e. unconditionally set permissions + invalidate instruction cache. llvm-rtdyld will be updated in my next commit. Discusseed with: Lang Hames (as part of D13631). llvm-svn: 253341	2015-11-17 16:34:28 +00:00
Pat Gavlin	c8ea157811	Lower statepoints with multi-def targets. Statepoint lowering currently expects that the target method of a statepoint only defines a single value. This precludes using statepoints with ABIs that return values in multiple registers (e.g. the SysV AMD64 ABI). This change adds support for lowering statepoints with mutli-def targets. llvm-svn: 253339	2015-11-17 16:04:21 +00:00
Dan Gohman	7aa4abac24	Use TargetRegisterInfo for printing MachineOperand register comments Several places in AsmPrinter.cpp print comments describing MachineOperand registers using MCRegisterInfo, which uses MCOperand-oriented names. This doesn't work for targets that use virtual registers exclusively, as WebAssembly does, since virtual registers are represented and printed differently. This patch preserves what seems to be the spirit of r229978, avoiding the use of TM.getSubtargetImpl(), while still using MachineOperand-oriented printing for MachineOperands. Differential Revision: http://reviews.llvm.org/D14709 llvm-svn: 253338	2015-11-17 16:01:28 +00:00
Chad Rosier	6066dc69f1	Typo. llvm-svn: 253336	2015-11-17 13:58:10 +00:00
Bradley Smith	982a8888b8	[ARM] Default to ARMv4t in favour of adding Other to ARMArch llvm-svn: 253335	2015-11-17 13:38:29 +00:00
Charlie Turner	b4613c6973	[ARM] Match VABDL from log2 shuffles. Differential Revision: http://reviews.llvm.org/D14664 llvm-svn: 253334	2015-11-17 13:21:35 +00:00
Zlatko Buljan	72a7f9c1f5	[mips][microMIPS] Implement EXTP, EXTPDP, EXTPDPV, EXTPV, EXTR[_RS].W, EXTR_S.H, EXTRV[_RS].W and EXTRV_S.H instructions Differential Revision: http://reviews.llvm.org/D14174 llvm-svn: 253332	2015-11-17 12:54:15 +00:00
Bradley Smith	4320205484	[ARM] Properly initialize ARMArch in the ARM subtarget llvm-svn: 253331	2015-11-17 11:57:33 +00:00
Zlatko Buljan	246b21f66a	[mips][microMIPS] Implement SUBQ[_S].PH, SUBQ_S.W, SUBQH[_R].PH, SUBQH[_R].W, SUBU[_S].PH, SUBU[_S].QB and SUBUH[_R].QB instructions Differential Revision: http://reviews.llvm.org/D14114 llvm-svn: 253329	2015-11-17 10:11:22 +00:00
Oliver Stannard	9be59af3ab	[Assembler] Make fatal assembler errors non-fatal Currently, if the assembler encounters an error after parsing (such as an out-of-range fixup), it reports this as a fatal error, and so stops after the first error. However, for most of these there is an obvious way to recover after emitting the error, such as emitting the fixup with a value of zero. This means that we can report on all of the errors in a file, not just the first one. MCContext::reportError records the fact that an error was encountered, so we won't actually emit an object file with the incorrect contents. Differential Revision: http://reviews.llvm.org/D14717 llvm-svn: 253328	2015-11-17 10:00:43 +00:00
Oliver Stannard	07b43d39a8	[Assembler] Allow non-fatal errors after parsing This adds reportError to MCContext, which can be used as an alternative to reportFatalError when the assembler wants to try to continue processing the rest of the file after the error is reported, so that all of the errors ina file can be reported. It records the fact that an error was encountered, so we can avoid emitting an object file if any errors occurred. This patch doesn't add any uses of this function (a later patch will convert most uses of reportFatalError to use it), but there is a small functional change: we use the SourceManager to print the error message, even if we have a null SMLoc. This means that we get a SourceManager-style message, with the file and line information shown as <unknown>, rather than the "LLVM ERROR" style used by report_fatal_error. llvm-svn: 253327	2015-11-17 09:58:07 +00:00
Zlatko Buljan	3e0588d033	[mips][microMIPS] Implement PRECEQ.W.PHL, PRECEQ.W.PHR, PRECEQU.PH.QBL, PRECEQU.PH.QBLA, PRECEQU.PH.QBR, PRECEQU.PH.QBRA, PRECEU.PH.QBL, PRECEU.PH.QBLA, PRECEU.PH.QBR and PRECEU.PH.QBRA instructions Differential Revision: http://reviews.llvm.org/D14279 llvm-svn: 253326	2015-11-17 09:43:29 +00:00
Jay Foad	b64f0a5a1a	Fix typos in comments. llvm-svn: 253324	2015-11-17 08:54:53 +00:00
David Majnemer	6727c015dc	[AliasAnalysis] CatchPad and CatchRet can modify escaped memory CatchPad and CatchRet behave a lot like function calls: they can potentially modify any memory which has been escaped. llvm-svn: 253323	2015-11-17 08:15:14 +00:00
David Majnemer	0345b0fa9e	Fix a typo in BasicAliasAnalysis llvm-svn: 253322	2015-11-17 08:15:08 +00:00
Xinliang David Li	b8c3ad1d05	Fix unaligned memory read issue exposed by ubsan Indexed profile data as designed today does not guarantee counter data to be well aligned, so reading needs to use the slower form (with memcpy). This is less than ideal and should be improved in the future (i.e., with fixed length function key instead of variable length name key). llvm-svn: 253309	2015-11-17 03:47:21 +00:00
Rafael Espindola	65e4902156	Drop prelink support. The way prelink used to work was * The compiler decides if a given section only has relocations that are know to point to the same DSO. If so, it names it .data.rel.ro.local<something>. * The static linker puts all of these together. * The prelinker program assigns addresses to each library and resolves the local relocations. There are many problems with this: * It is incompatible with address space randomization. * The information passed by the compiler is redundant. The linker knows if a given relocation is in the same DSO or not. If could sort by that if so desired. * There are newer ways of speeding up DSO (gnu hash for example). * Even if we want to implement this again in the compiler, the previous implementation is pretty broken. It talks about relocations that are "resolved by the static linker". If they are resolved, there are none left for the prelinker. What one needs to track is if an expression will require only dynamic relocations that point to the same DSO. At this point it looks like the prelinker is an historical curiosity. For example, fedora has retired it because it failed to build for two releases (http://pkgs.fedoraproject.org/cgit/prelink.git/commit/?id=eb43100a8331d91c801ee3dcdb0a0bb9babfdc1f) This patch removes support for it. That is, it stops printing the ".local" sections. llvm-svn: 253280	2015-11-17 00:51:23 +00:00
Matthias Braun	fe9d6f211f	Assume lane masks are always precise Allowing imprecise lane masks in case of more than 32 sub register lanes lead to some tricky corner cases, and I need another bugfix for another one. Instead I rather declare lane masks as precise and let tablegen abort if we do not have enough bits. This does not affect any in-tree target, even AMDGPU only needs 16 lanes at the moment. If the 32 lanes turn out to be a problem in the future, then we can easily change the LaneBitmask typedef to uint64_t. Differential Revision: http://reviews.llvm.org/D14557 llvm-svn: 253279	2015-11-17 00:50:55 +00:00
David Blaikie	cdec7ee565	Fix indentation llvm-svn: 253278	2015-11-17 00:41:02 +00:00
David Blaikie	82641be467	dwarfdump: Use the index to find the right abbrev offset in DWP files llvm-svn: 253277	2015-11-17 00:39:55 +00:00
Derek Schuff	71e8169ea8	[WebAssembly] Fix printing of global operands This was regressed in r252656 which wasn't quite NFC. Instead of using a custom instruction as before, use a pattern to select CONST_I32 for the global addrs. Differential Revision: http://reviews.llvm.org/D14587 llvm-svn: 253276	2015-11-17 00:20:44 +00:00
Philip Reames	b6e8fe3dac	[PRE] Preserve !invariant.load metadata Spoted via inspection. Test case included. llvm-svn: 253275	2015-11-17 00:15:09 +00:00
Simon Pilgrim	13d3a20ad7	[X86][SSE] Merged BLEND shuffle decode comments. NFC. Now that we can recognise different vector sizes. llvm-svn: 253268	2015-11-16 23:03:18 +00:00
Simon Pilgrim	b9ada27052	[X86][SSE] Merged ALIGNR/SLLDQ/SRLDQ shuffle decode comments. NFC. Now that we can recognise different vector sizes - will make future AVX512 additions easier. llvm-svn: 253266	2015-11-16 22:54:41 +00:00
Simon Pilgrim	5883a73f18	[X86][SSE] Merged SHUF/PERM shuffle decode comments. NFC. Now that we can recognise different vector sizes - will make future AVX512 additions easier. llvm-svn: 253260	2015-11-16 22:39:27 +00:00
Simon Pilgrim	66e43ee289	[X86][SSE] Merged UNPCK shuffle decode comments. NFC. Now that we can recognise different vector sizes - will make future AVX512 additions easier. llvm-svn: 253258	2015-11-16 22:21:10 +00:00
Sanjay Patel	4e28753140	use range-based for loop; NFCI llvm-svn: 253256	2015-11-16 22:16:52 +00:00
Stephen Canon	1bfc89baac	Add isInteger() to APFloat. Useful utility function; this wasn't too hard to do before, but also wasn't obviously discoverable. Make it explicit. Reviewed offline by Michael Gottesman. llvm-svn: 253254	2015-11-16 21:52:48 +00:00
Michael Zolotukhin	927bdba29d	[PR25538]: Fix a failure caused by r253126. In r253126 we stopped to recompute LCSSA after loop unrolling in all cases, except the unrolling is full and at least one of the loop exits is outside the parent loop. In other cases the transformation should not break LCSSA, but it turned out, that we also call SimplifyLoop on the parent loop, which might break LCSSA by itself. This fix just triggers LCSSA recomputation in this case as well. I'm committing it without a test case for now, but I'll try to invent one. It's a bit tricky because in an isolated test LoopSimplify would be scheduled before LoopUnroll, and thus will change the test and hide the problem. llvm-svn: 253253	2015-11-16 21:17:26 +00:00
Derek Schuff	46e3316888	[WebAssembly] Fix function return type printing Summary: Previously return type information for a function was derived from return dag nodes. But this didn't work for dags with != return node. So instead compute it directly from the LLVM function as is done for imports. Differential Revision: http://reviews.llvm.org/D14593 llvm-svn: 253251	2015-11-16 21:12:41 +00:00
Derek Schuff	4ed4778419	[WebAssembly] Reverse the order of operands for br_if Summary: This is to match the new version in the spec Reviewers: sunfish Subscribers: jfb, llvm-commits, dschuff Differential Revision: http://reviews.llvm.org/D14519 llvm-svn: 253249	2015-11-16 21:04:51 +00:00
David Majnemer	2dd41c5d42	[IR] Manage TheNoneToken with a std::unique_ptr Hopefully, this will make the sanitizer build bots happy. llvm-svn: 253248	2015-11-16 20:55:57 +00:00
Kit Barton	9c432ae111	Find available scratch register to use in function prologue and epilogue as part of shrink wrapping. Phabricator: http://reviews.llvm.org/D13955 llvm-svn: 253247	2015-11-16 20:22:15 +00:00
Reid Kleckner	c397b26790	[WinEH] Don't let UnwindHelp alias the return address On top of that, don't bother allocating and initializing UnwindHelp if we don't have any funclets. Currently we always use RBP as our frame pointer when funclets are present, so this change makes it impossible to come here without any fixed stack objects. Fixes PR25533. llvm-svn: 253245	2015-11-16 18:47:25 +00:00
Reid Kleckner	4255b04e7b	Use the subtarget reference that we already have llvm-svn: 253244	2015-11-16 18:47:12 +00:00
Owen Anderson	2de9f545aa	Add intermediate subtract instructions to reassociation worklist. We sometimes create intermediate subtract instructions during reassociation. Adding these to the worklist to revisit exposes many additional reassociation opportunities. Patch by Aditya Nandakumar. llvm-svn: 253240	2015-11-16 18:07:30 +00:00
David Majnemer	7378e7a333	[LoopStrengthReduce] Don't increment iterator past the end of the BB We tried to move the insertion point beyond instructions like landingpad and cleanuppad. However, we also tried to move past catchpad. This is problematic because catchpad is also a terminator. This fixes PR25541. llvm-svn: 253238	2015-11-16 17:37:58 +00:00
Vasileios Kalintiris	88faf6d697	[mips] Disable code generation through FastISel for MIPS32R6. Reviewers: dsanders Subscribers: llvm-commits, dsanders Differential Revision: http://reviews.llvm.org/D14708 llvm-svn: 253225	2015-11-16 17:05:01 +00:00
Davide Italiano	ed5cc95d22	[SimplifyLibCalls] Generalize a comment. This doesn't apply only to sqrt. llvm-svn: 253224	2015-11-16 16:54:28 +00:00
Petr Pavlu	a770379524	[ARM] Prevent use of a value pointed by end() iterator when placing a jump table Function ARMConstantIslands::doInitialJumpTablePlacement() iterates over all basic blocks in a machine function. It calls `MI = MBB.getLastNonDebugInstr()` to get the last instruction in each block and then uses MI->getOpcode() to decide what to do. If getLastNonDebugInstr() returns MBB.end() (for example, when the block does not contain any instructions) then calling getOpcode() on this value is incorrect. Avoid this problem by checking the result of getLastNonDebugInstr(). Differential Revision: http://reviews.llvm.org/D14694 llvm-svn: 253222	2015-11-16 16:41:13 +00:00
Oliver Stannard	9327a7575b	[ARM,AArch64] Store source location of asm constant pool entries Storing the source location of the expression that created a constant pool entry allows us to emit better error messages if we later discover that the expression cannot be represented by a relocation. Differential Revision: http://reviews.llvm.org/D14646 llvm-svn: 253220	2015-11-16 16:25:47 +00:00
Oliver Stannard	09be060606	[ARM,AArch64] Store source location for values in assembly files The MCValue class can store a SMLoc to allow better error messages to be emitted if an error is detected after parsing. The ARM and AArch64 assembly parsers were not setting this, so error messages did not have source information. Differential Revision: http://reviews.llvm.org/D14645 llvm-svn: 253219	2015-11-16 16:22:47 +00:00
Dan Gohman	1462faad35	[WebAssembly] Prototype passes for register coloring and register stackifying. These passes are not yet enabled by default. llvm-svn: 253217	2015-11-16 16:18:28 +00:00
Artyom Skrobov	f187a65f99	Handle ARMv6KZ naming Summary: * ARMv6KZ is the "canonical" name, given in the ARMARM * ARMv6Z is an "official abbreviation" for it, mentioned in the ARMARM * ARMv6ZK is a popular misspelling, which we should support as an alias. The patch corrects the handling of the names. Functional changes: * ARMv6Z no longer treated as an architecture in its own right * ARMv6ZK renamed to ARMv6KZ, accepting ARMv6ZK as an alias * arm1176jz-s and arm1176jzf-s recognized as ARMv6ZK, instead of ARMv6K * default ARMv6K CPU changed to arm1176j-s Reviewers: rengolin, logan, compnerd Subscribers: aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D14568 llvm-svn: 253206	2015-11-16 14:05:32 +00:00
Artyom Skrobov	bc09f39476	NFC refactorings in lib/Support/TargetParser.cpp Summary: * declare FPUNames, ARCHNames, ARCHExtNames, HWDivNames, CPUNames as static const * implement getDefaultExtensions with a StringSwitch, in the same way getDefaultFPU is implemented Reviewers: rengolin Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14648 llvm-svn: 253201	2015-11-16 12:08:05 +00:00
Bradley Smith	4adcb73933	[ARM] Allow TargetParser to accurately target architectures Instead of defaulting to an empty string, we want to default to the CPU 'generic' in the case of no valid default CPU being found, (as long as the architecture is actually valid). In order to do this we add a default FPU for each architecture, as well as falling back to architecture defaults for extensions and FPU in the case of a generic CPU is specified. llvm-svn: 253198	2015-11-16 11:15:22 +00:00
Bradley Smith	323fee105d	[ARM] Introduce subtarget features per ARM architecture. This allows for accurate architecture targeting as well as removing duplicate information (hardcoded feature strings) from MCTargetDesc. llvm-svn: 253196	2015-11-16 11:10:19 +00:00
James Molloy	2018091e87	Properly check if a CMPZ node is in fact comparing against zero This was left implicit and never ever checked, which means we could have a CMPZ against some non-zero value and we were carrying on with BFI conversion regardless. Caught by Oliver Stannard using csmith; regression test added. llvm-svn: 253195	2015-11-16 10:49:25 +00:00
Pavel Labath	978060ce2f	Don't generate discriminators for calls to debug intrinsics Summary: This fails a check in Verifier.cpp, which checks for location matches between the declared variable and the !dbg attachments. Reviewers: dnovillo, dblaikie, danielcdh Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14657 llvm-svn: 253194	2015-11-16 10:40:38 +00:00
Oliver Stannard	db9081bf89	[AArch64] ldr= pseudo-instruction silently ignored if register invalid The AArch64 assembler was silently ignoring instructions like this: ldr foo, =bar AArch64AsmParser::parseOperand was returning true as the parse failed, but was not calling AArch64AsmParser::Error to report this to the user, so the instruction was ignored without printing an error message. Differential Revision: http://reviews.llvm.org/D14651 llvm-svn: 253193	2015-11-16 10:25:19 +00:00
James Molloy	d4d2357f26	[GlobalOpt] Address post-commit review comments on r253168 Address Duncan Exon Smith's comments on D14148, which was added after the patch had been LGTM'd and committed: * clang-format one area where whitespace diffs occurred. * Add a threshold to limit the store/load dominance checks as they are quadratic. llvm-svn: 253192	2015-11-16 10:16:22 +00:00
Benjamin Kramer	83709b1c1e	Move helper classes into anonymous namespaces. NFC. llvm-svn: 253189	2015-11-16 09:01:28 +00:00
Keno Fischer	b011c63d19	[DIBuilder] Make createReferenceType take size and align Summary: Since we're passing references to dbg.value as pointers, we need to have the frontend properly declare their sizes and alignments (as it already does for regular pointers) in preparation for my upcoming patch to have the verifer check that the sizes agree. Also augment the backend logic that skips actually emitting this information into DWARF such that it also handles reference types. Reviewers: aprantl, dexonsmith, dblaikie Subscribers: dblaikie, llvm-commits Differential Revision: http://reviews.llvm.org/D14275 llvm-svn: 253186	2015-11-16 07:57:32 +00:00
Igor Breger	24cab0fa06	AVX512: Implemented encoding and intrinsics for VMOVSHDUP/VMOVSLDUP instructions. Differential Revision: http://reviews.llvm.org/D14322 llvm-svn: 253185	2015-11-16 07:22:00 +00:00
Keno Fischer	2ac0c27001	Also map the personality function in CloneFunctionInto Summary: The Old personality function gets copied over, but the Materializer didn't have a chance to inspect it (e.g. to fix up references to the correct module for the target function). Also add a verifier check that makes sure the personality routine is in the same module as the function whose personality it is. Reviewers: majnemer Subscribers: jevinskie, llvm-commits Differential Revision: http://reviews.llvm.org/D14474 llvm-svn: 253183	2015-11-16 05:13:30 +00:00
Keno Fischer	86c95b5642	[Sink] Don't move landingpads Summary: Moving landingpads into successor basic blocks makes the verifier sad. Teach Sink that much like PHI nodes and terminator instructions, landingpads (and cleanuppads, etc.) may not be moved between basic blocks. Reviewers: majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14475 llvm-svn: 253182	2015-11-16 04:47:58 +00:00
Dan Gohman	1031d4a8c3	[WebAssembly] Use tabs instead of spaces in assembly output. This seems to be the most popular convention among the other backends. llvm-svn: 253172	2015-11-15 15:34:19 +00:00
Simon Pilgrim	cbba348ae7	[X86][SSE] Tidyup with implicit SDValue bool check. NFC. llvm-svn: 253171	2015-11-15 14:57:07 +00:00
Teresa Johnson	83d03ddbf6	Fix mapping of unmaterialized global values during metadata linking Summary: The patch to move metadata linking after global value linking didn't correctly map unmaterialized global values to null as desired. They were in fact mapped to the source copy. It largely worked by accident since most module linker clients destroyed the source module which caused the source GVs to be replaced by null, but caused a failure with LTO linking on Windows: http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20151109/312869.html The problem is that a null return value from materializeValueFor is handled by mapping the value to self. This is the desired behavior when materializeValueFor is passed a non-GlobalValue. The problem is how to distinguish that case from the case where we really do want to map to null. This patch addresses this by passing in a new flag to the value mapper indicating that unmapped global values should be mapped to null. Other Value types are handled as before. Note that the documented behavior of asserting on unmapped values when the flag RF_IgnoreMissingValues isn't set is currently disabled with FIXME notes due to bootstrap failures. I modified these disabled asserts so when they are eventually enabled again it won't assert for the unmapped values when the new RF_NullMapMissingGlobalValues flag is set. I also considered using a callback into the value materializer, but a flag seemed cleaner given that there are already existing flags. I also considered modifying materializeValueFor to return the input value when we want to map to source and then treat a null return to mean map to null. However, there are other value materializer subclasses that implement materializeValueFor, and they would all need to be audited and the return values possibly changed, which seemed error-prone. Reviewers: dexonsmith, joker.eph Subscribers: pcc, llvm-commits Differential Revision: http://reviews.llvm.org/D14682 llvm-svn: 253170	2015-11-15 14:50:14 +00:00
James Molloy	9c7d4d8855	[GlobalOpt] Demote globals to locals more aggressively Global to local demotion can speed up programs that use globals a lot. It is particularly useful with LTO, when the entire call graph is known and most functions have been internalized. For a global to be demoted, it must only be accessed by one function and that function: 1. Must never recurse directly or indirectly, else the GV would be clobbered. 2. Must never rely on the value in GV at the start of the function (apart from the initializer). GlobalOpt can already do this, but it is hamstrung and only ever tries to demote globals inside "main", because C++ gives extra guarantees about how main is called - once and only once. In LTO mode, we can often prove the first property (if the function is internal by this point, we know enough about the callgraph to determine if it could possibly recurse). FunctionAttrs now infers the "norecurse" attribute for this reason. The second property can be proven for a subset of functions by proving that all loads from GV are dominated by a store to GV. This is conservative in the name of compile time - this only requires a DominatorTree which is fairly cheap in the grand scheme of things. We could do more fancy stuff with MemoryDependenceAnalysis too to catch more cases but this appears to catch most of the useful ones in my testing. llvm-svn: 253168	2015-11-15 14:21:37 +00:00
Igor Breger	3ff8ef9eb7	Revert r253160. It broke layering violation. Reproducible with BUILD_SHARED_LIBS=ON. llvm-svn: 253163	2015-11-15 12:19:11 +00:00
Elena Demikhovsky	121d49b640	Fixed GEP visitor in the InstCombine pass. The current implementation of GEP visitor in InstCombine fails with assertion on Vector GEP with mix of scalar and vector types, like this: getelementptr double, double* %a, <8 x i32> %i (It fails to create a "sext" from <8 x i32> to <8 x i64>) I fixed it and added some tests. Differential Revision: http://reviews.llvm.org/D14485 llvm-svn: 253162	2015-11-15 08:19:35 +00:00
Igor Breger	aa40ddd3ba	AVX512: Implemented encoding and intrinsics for VMOVSHDUP/VMOVSLDUP instructions. Differential Revision: http://reviews.llvm.org/D14322 llvm-svn: 253160	2015-11-15 07:23:13 +00:00
Teresa Johnson	12545075f0	Use a different block id for block of metadata kind records Summary: There are currently two blocks with the METADATA_BLOCK id at module scope. The first has the module-level metadata values (consisting of some combination of METADATA_* record codes except for METADATA_KIND). The second consists only of METADATA_KIND records. The latter is used only in the METADATA_ATTACHMENT block within function blocks (for metadata attached to instructions). For ThinLTO we want to delay the parsing of module level metadata until all functions have been imported from that module (there is some bookkeeping used to suture it up when we read it during a post-pass). However, we do need the METADATA_KIND records when parsing the function body during importing, since those kinds are used as described above. To simplify identification and parsing of just the block containing the metadata kinds, use a different block id (METADATA_KIND_BLOCK_ID). Support older bitcode without the new block id as well. Reviewers: dexonsmith, joker.eph Subscribers: davidxl, llvm-commits Differential Revision: http://reviews.llvm.org/D14654 llvm-svn: 253154	2015-11-15 02:00:09 +00:00
Dan Gohman	5219ecf068	[WebAssembly] Minor code simplification. NFC. llvm-svn: 253150	2015-11-14 23:28:15 +00:00
Dan Gohman	8ad045c1d1	[WebAssembly] Support signext, zeroext, and several other function attributes. llvm-svn: 253148	2015-11-14 23:15:41 +00:00
Akira Hatanaka	b11ef0897c	Reduce the size of MCRelaxableFragment. MCRelaxableFragment previously kept a copy of MCSubtargetInfo and MCInst to enable re-encoding the MCInst later during relaxation. A copy of MCSubtargetInfo (instead of a reference or pointer) was needed because the feature bits could be modified by the parser. This commit replaces the MCSubtargetInfo copy in MCRelaxableFragment with a constant reference to MCSubtargetInfo. The copies of MCSubtargetInfo are kept in MCContext, and the target parsers are now responsible for asking MCContext to provide a copy whenever the feature bits of MCSubtargetInfo have to be toggled. With this patch, I saw a 4% reduction in peak memory usage when I compiled verify-uselistorder.lto.bc using llc. rdar://problem/21736951 Differential Revision: http://reviews.llvm.org/D14346 llvm-svn: 253127	2015-11-14 06:35:56 +00:00
Michael Zolotukhin	8ef44f93ca	Don't recompute LCSSA after loop-unrolling when possible. Summary: Currently we always recompute LCSSA for outer loops after unrolling an inner loop. That leads to compile time problem when we have big loop nests, and we can solve it by avoiding unnecessary work. For instance, if w eonly do partial unrolling, we don't break LCSSA, so we don't need to rebuild it. Also, if all exits from the inner loop are inside the enclosing loop, then complete unrolling won't break LCSSA either. I replaced unconditional LCSSA recomputation with conditional recomputation + unconditional assert and added several tests, which were failing when I experimented with it. Soon I plan to follow up with a similar patch for recalculation of dominators tree. Reviewers: hfinkel, dexonsmith, bogner, joker.eph, chandlerc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14526 llvm-svn: 253126	2015-11-14 05:51:41 +00:00
Akira Hatanaka	bd9fc28444	[MCTargetAsmParser] Move the member varialbes that reference MCSubtargetInfo in the subclasses into MCTargetAsmParser and define a member function getSTI. This is done in preparation for making changes to shrink the size of MCRelaxableFragment. (see http://reviews.llvm.org/D14346). llvm-svn: 253124	2015-11-14 05:20:05 +00:00
Eric Christopher	57a6e1321f	Add MMX to the 3dnow enum and propagate changes around. This makes it somewhat more consistent with how the feature is used. llvm-svn: 253122	2015-11-14 03:04:00 +00:00
Quentin Colombet	2cdcfd23cd	[ShrinkWrapping] Disable the optimization for functions with sanitize like attribute. Even if the target supports shrink-wrapping, the prologue and epilogue must not move because a crash can happen anywhere and sanitizers need to be able to unwind from the PC of the crash. llvm-svn: 253116	2015-11-14 01:55:17 +00:00
Sanjoy Das	06f9a27a51	[RuntimeDyld] Fix indentation and whitespace; NFC Whitespace-only change. llvm-svn: 253105	2015-11-14 00:16:15 +00:00
Justin Bogner	fff708db92	AArch64: Default AArch64Subtarget::ReserveX18 to true on darwin Darwin reserves x18, so it's never ABI compliant to generate code that uses it. Set the default value based on the OS part of the triple rather than forcing front-ends to set the +reserve-x18 target feature in order to build correct code for Darwin. This will make r243310 redundant, so I'll revert that shortly. llvm-svn: 253102	2015-11-13 23:05:46 +00:00
Matthias Braun	e6edd48d69	MachineScheduler: Print initial pressure in debug dump llvm-svn: 253097	2015-11-13 22:30:31 +00:00
Matthias Braun	3b099db61d	MachineScheduler: Improve debug output for "only one node in readyset" When there is only 1 node left in the ready queue and it is picked call the reason "ONLY1" instead of "NOCAND". llvm-svn: 253096	2015-11-13 22:30:29 +00:00
Chad Rosier	cc299b627d	[LIR] Add support for creating memcpys from loops with a negative stride. This allows us to transform the below loop into a memcpy. void test(unsigned __restrict__ a, unsigned __restrict__ b) { for (int i = 2047; i >= 0; --i) { a[i] = b[i]; } } This is the memcpy version of r251518, which added support for memset with negative strided loops. llvm-svn: 253091	2015-11-13 21:51:02 +00:00
Colin LeMahieu	655489433c	[Hexagon] Fixing memory leak during relaxation by allocating MCInst in MCContext. llvm-svn: 253090	2015-11-13 21:45:50 +00:00
Reid Kleckner	75b4be9a11	[WinEH] Fix ESP management with 32-bit __CxxFrameHandler3 The C++ EH personality automatically restores ESP from the C++ EH registration node after a catchret. I mistakenly thought it was like SEH, which does not restore ESP. It makes sense for C++ EH to differ from SEH here because SEH does not use funclets for catches, and does not allow catching inside of finally. C++ EH may need to unwind through multiple catch funclets and eventually catchret to some outer funclet. Therefore, the runtime has to keep track of which ESP to use with catchret, rather than having the compiler reload it manually. llvm-svn: 253084	2015-11-13 21:27:00 +00:00
Evgeniy Stepanov	447bbdb171	[safestack] Rewrite isAllocaSafe using SCEV. Use ScalarEvolution to calculate memory access bounds. Handle function calls based on readnone/nocapture attributes. Handle memory intrinsics with constant size. This change improves both recall and precision of IsAllocaSafe. See the new tests (ex. BitCastWide) for the kind of code that was wrongly classified as safe. SCEV efficiency seems to be limited by the fact the SafeStack runs late (in CodeGenPrepare), and many loops are unrolled or otherwise not in LCSSA. llvm-svn: 253083	2015-11-13 21:21:42 +00:00
Dan Gohman	dd0071f440	[WebAssembly] Rename the Const instructions to be upper-case too. llvm-svn: 253072	2015-11-13 20:27:45 +00:00
Diego Novillo	8e415a821f	SamplePGO - Add dump routines for LineLocation, SampleRecord and FunctionSamples llvm-svn: 253071	2015-11-13 20:24:28 +00:00
Dan Gohman	f433324290	[WebAssembly] Rename memory intrinsics to be upper-case, following convention. NFC. llvm-svn: 253070	2015-11-13 20:19:11 +00:00
Cong Hou	ef4074bac2	[X86][SSE] Combine UNPCKL with vector_shuffle into UNPCKH to save one instruction for sext from v16i8 to v16i16 and v8i16 to v8i32. This patch is enabling combining UNPCKL with vector_shuffle that moves the upper half of a vector into the lower half, into a UNPCKH instruction. For example: t2: v16i8 = vector_shuffle<8,9,10,11,12,13,14,15,u,u,u,u,u,u,u,u> t1, undef:v16i8 t3: v16i8 = X86ISD::UNPCKL undef:v16i8, t2 will be combined to: t3: v16i8 = X86ISD::UNPCKH undef:v16i8, t1 Differential revision: http://reviews.llvm.org/D14399 llvm-svn: 253067	2015-11-13 19:47:43 +00:00
David Blaikie	8e8dd57e0b	dwarfdump: Add support for dumping the table contents of DWP indexes This is a recommit of 252842 which was reverted in 252859. The issue was using %s format specifier for a StringRef - used Format's left_justify(StringRef, int) instead. It'd be nice to have __attribute__((format(..))) on llvm::format, but apparently it's only implemented for c-style variadics, not C++ variadic templates. Perhaps we could fix that & conditionalize the attribute on such... llvm-svn: 253065	2015-11-13 19:18:49 +00:00
Chad Rosier	2fa50a7a05	Add a comment that should have made my last commit. llvm-svn: 253063	2015-11-13 19:13:40 +00:00
Chad Rosier	ed0c7d1316	[LIR] Factor out the code to compute base ptr for negative strided loops. This will allow for the code to be reused in the memcpy optimization. llvm-svn: 253061	2015-11-13 19:11:07 +00:00
Reid Kleckner	94b57065c6	[WinEH] Make UnwindHelp a fixed stack object allocated after XMM CSRs Now the offset of UnwindHelp in our EH tables and the offset that we store to in the prologue agree. llvm-svn: 253059	2015-11-13 19:06:01 +00:00
Colin LeMahieu	f0af6e5243	[Hexagon] Factoring bundle creation in to a utility function. llvm-svn: 253056	2015-11-13 17:42:46 +00:00
Tom Stellard	afd6e2f3c3	AMDGPU: Add stony support Patch by: Alex Deucher llvm-svn: 253053	2015-11-13 17:06:32 +00:00
Tom Stellard	f9f5f12ce7	ELFYAML: Add support for parsing AMDGPU section attribute flags Reviewers: silvas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14444 llvm-svn: 253052	2015-11-13 17:06:29 +00:00
Reid Kleckner	c038e2db4d	[Symbolizer] Don't use PE symbol tables to override PDB symbols Summary: PE files are stripped by default, and only contain the names of exported symbols. The actual reason that we bother to do this override by default is actually due to a quirk of the way -gline-tables-only is implemented, so I phrased the check as "if we are symbolizing from dwarf, do the symtab override". This fixes lots of Windows ASan tests that I broke in r250582. Reviewers: samsonov Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14594 llvm-svn: 253051	2015-11-13 17:00:36 +00:00
Sanjay Patel	225d65f1e6	use range-based for loop; NFCI llvm-svn: 253048	2015-11-13 16:21:23 +00:00
James Molloy	b564098c62	[ARM] Replace ARMISD::RBIT with ISD::BITREVERSE ISD::BITREVERSE matches "rbit" completely, so remove ARMISD::RBIT and mark ISD::BITREVERSE as legal, adding a test for lowering. llvm-svn: 253047	2015-11-13 16:05:22 +00:00
Zlatko Buljan	32fb5c40d2	[mips][microMIPS] Implement SHRA[_R].PH, SHRAV[_R].PH, SHRAV[_R].QB, SHRAV_R.W, SHRA_R.W, SHRL.PH, SHRL.QB, SHRLV.PH and SHRLV.QB instructions Differential Revision: http://reviews.llvm.org/D14010 llvm-svn: 253041	2015-11-13 13:14:25 +00:00
Ulrich Weigand	19d24d2699	[SystemZ] Simplify boolean conditional return statements Use clang-tidy to simplify conditonal return statements. Author: LegalizeAdulthood Differential Revision: http://reviews.llvm.org/D9986 llvm-svn: 253038	2015-11-13 13:00:27 +00:00
James Molloy	33e7345886	[GlobalOpt] Make sure all debug lines end with '\n' GlobalVariable::print() used to emit a newline. It hasn't for a while now, but these debug lines weren't updated. llvm-svn: 253030	2015-11-13 11:05:13 +00:00
James Molloy	ea31ad3b27	[GlobalOpt] Coding style - remove function names from doxygen comments Suggested by Mehdi in the review of D14148. llvm-svn: 253029	2015-11-13 11:05:07 +00:00
James Molloy	bb1dbf530a	[SDAG] Fix expansion of BITREVERSE Richard Trieu noted that UBSan detected an overflowing shift, and the obvious fix caused a crash. What was happening was that the shiftee (1U) was indeed too small for the possible range of shifts it had to handle, but also we were using "VT.getSizeInBits()" to get the maximum type bitwidth, but we wanted "VT.getScalarSizeInBits()" to get the vector lane size instead of the entire vector size. Use an APInt for the shift and VT.getScalarSizeInBits(). llvm-svn: 253023	2015-11-13 10:02:36 +00:00
Sanjoy Das	ac9c5b1901	[ImplicitNulls] Add some clarifying comments; NFC llvm-svn: 253020	2015-11-13 08:14:00 +00:00
Colin LeMahieu	b3c97271e3	[Hexagon] Fixing leak in padEndloop by allocating in MCContext. llvm-svn: 253019	2015-11-13 07:58:06 +00:00
Nathan Slingerland	4f82366759	[llvm-profdata] Add check for text profile formats and improve error reporting (2nd try) Summary: This change addresses two possible instances of user error / confusion when merging sampled profile data. Previously any input that didn't match the raw or processed instrumented format would automatically be interpreted as instrumented profile text format data. No error would be reported during the merge. Example: If foo-sampled.profdata and bar-sampled.profdata are binary sampled profiles: Old behavior: $ llvm-profdata merge foo-sampled.profdata bar-sampled.profdata -output foobar-sampled.profdata $ llvm-profdata show -sample foobar-sampled.profdata error: foobar-sampled.profdata:1: Expected 'mangled_name:NUM:NUM', found lprofi This change adds basic checks for valid input data when assuming text input. It also makes error messages related to file format validity more specific about the assumbed profile data type. New behavior: $ llvm-profdata merge foo-sampled.profdata bar-sampled.profdata -o foobar-sampled.profdata error: foo.profdata: Unrecognized instrumentation profile encoding format Perhaps you forgot to use the -sample option? Reviewers: bogner, davidxl, dnovillo Subscribers: davidxl, llvm-commits Differential Revision: http://reviews.llvm.org/D14558 llvm-svn: 253009	2015-11-13 03:47:58 +00:00
Davide Italiano	f3d2329da6	[lib/Linker] Convert assert(false) to llvm_unreachable(). llvm-svn: 253005	2015-11-13 02:16:51 +00:00
Kostya Serebryany	2a48c24d77	[libFuzzer] make libFuzzer build even with a compiler that does not have sanitizer headers llvm-svn: 253003	2015-11-13 01:54:40 +00:00
Akira Hatanaka	5af7ace4ee	Revert r252990. Some of the buildbots are still failing. llvm-svn: 252999	2015-11-13 01:44:32 +00:00
Dan Gohman	f19ed56288	[WebAssembly] Inline asm support. llvm-svn: 252997	2015-11-13 01:42:29 +00:00
Akira Hatanaka	c7dfb76fe7	Provide a way to specify inliner's attribute compatibility and merging. This reapplies r252949. I've changed the type of FuncName to be std::string instead of StringRef in emitFnAttrCompatCheck. Original commit message for r252949: Provide a way to specify inliner's attribute compatibility and merging rules using table-gen. NFC. This commit adds new classes CompatRule and MergeRule to Attributes.td, which are used to generate code to check attribute compatibility and merge attributes of the caller and callee. rdar://problem/19836465 llvm-svn: 252990	2015-11-13 01:23:11 +00:00
Colin LeMahieu	8bb168b160	[Hexagon] Adding relaxation functionality to backend and test. llvm-svn: 252989	2015-11-13 01:12:25 +00:00
Dan Gohman	bc58a7bad0	[WebAssembly] Un-mangle the conversion instruction names. This arranges the types in the LLVM instruction names in the same order that they appear in the WebAssembly opcode names, and eliminates double-underscores. llvm-svn: 252988	2015-11-13 00:50:04 +00:00
Dan Gohman	231244c304	[WebAssembly] Rename BR_IF_ to BR_IF With MC-based instruction printing, we no longer need instruction names to mangle in hints about how they should be printed. llvm-svn: 252987	2015-11-13 00:46:31 +00:00
Dan Gohman	c9dd057e3c	[WebAssembly] Remove unneeded TODO items. NFC. llvm-svn: 252985	2015-11-13 00:41:25 +00:00
Dan Gohman	b1daa3aec7	[WebAssembly] Tidy up and update a TODO item. NFC. llvm-svn: 252984	2015-11-13 00:40:37 +00:00
Joseph Tremoulet	149c433bcc	[WinEH] Find root frame correctly in CLR funclets Summary: The value that the CoreCLR personality passes to a funclet for the establisher frame may be the root function's frame or may be the parent funclet's (mostly empty) frame in the case of nested funclets. Each funclet stores a pointer to the root frame in its own (mostly empty) frame, as does the root function itself. All frames allocate this slot at the same offset, measured from the post-prolog stack pointer, so that the same sequence can accept any ancestor as an establisher frame parameter value, and so that a single offset can be reported to the GC, which also looks at this slot. This change allocate the slot when processing function entry, and records its frame index on the WinEHFuncInfo object, then inserts the code to set/copy it during prolog emission. Reviewers: majnemer, AndyAyers, pgavlin, rnk Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14614 llvm-svn: 252983	2015-11-13 00:39:23 +00:00
Dan Gohman	058fce5435	[WebAssembly] Introduce a new pseudo-operand for unused expression results. llvm-svn: 252975	2015-11-13 00:21:05 +00:00
Vyacheslav Klochkov	cbc56baae6	X86-FMA3: Implemented commute transformations FMA_Int instructions. It made it possible to apply the memory folding optimization for the 2nd operand of FMA_Int instructions. Reviewer: Quentin Colombet Differential Revision: http://reviews.llvm.org/D14550 llvm-svn: 252973	2015-11-13 00:07:35 +00:00
Davide Italiano	b883b01a8e	[SimplifyLibCalls] Make a function shorter. NFC. llvm-svn: 252970	2015-11-12 23:39:00 +00:00
Tom Stellard	0967c91e0c	Revert "Remove unnecessary call to getAllocatableRegClass" This reverts commit r252565. This also includes the revert of the commit mentioned below in order to avoid breaking tests in AMDGPU: Revert "AMDGPU: Set isAllocatable = 0 on VS_32/VS_64" This reverts commit r252674. llvm-svn: 252956	2015-11-12 21:43:25 +00:00
Akira Hatanaka	f3aa82f666	Revert r252949. It broke some of the bots including clang-x64-ninja-win7. llvm-svn: 252951	2015-11-12 21:19:18 +00:00
Akira Hatanaka	61b81a563a	Provide a way to specify inliner's attribute compatibility and merging rules using table-gen. NFC. This commit adds new classes CompatRule and MergeRule to Attributes.td, which are used to generate code to check attribute compatibility and merge attributes of the caller and callee. rdar://problem/19836465 llvm-svn: 252949	2015-11-12 20:59:43 +00:00
Sanjoy Das	53da2fe729	Revert r243347 "Add TargetTransformInfo::isZExtFree." r243347 was intended to support a change to LSR (r243348). That change to LSR has since had to be reverted (r243939) because it was buggy, and now the code added in r243347 is untested and unexercised. Given that, I think it is appropriate to revert r243347 for now, with the intent of adding it back in later if I get around to checking in a fixed version of r243348. llvm-svn: 252948	2015-11-12 20:51:52 +00:00
Sanjoy Das	e8b81649cf	[ImplicitNulls] Fix wrapping by breaking up a condition, NFC llvm-svn: 252947	2015-11-12 20:51:49 +00:00
Sanjoy Das	edc394f1ed	[ImplicitNull] Extract out a HazardDetector class, NFC This will make later functional changes easier to follow. llvm-svn: 252946	2015-11-12 20:51:44 +00:00
Vyacheslav Klochkov	1ff9cbdfc0	My first/test commit. Removed a trailing whitespace. llvm-svn: 252940	2015-11-12 20:11:57 +00:00
Tobias Grosser	8241795d20	Revert "Fix bug 25440: GVN assertion after coercing loads" This reverts 252919 which broke LNT: MultiSource/Applications/SPASS llvm-svn: 252936	2015-11-12 20:04:21 +00:00
Benjamin Kramer	7c576d8bcf	[Hexagon] Allocate MCInst in the MCContext to avoid leaking it. Found by leaksanitizer. llvm-svn: 252931	2015-11-12 19:30:40 +00:00
Chad Rosier	a548fe569b	[LIR] Minor refactoring. NFCI. This change prevents uninteresting stores from being inserted into the list of candidate stores for memset/memcpy conversion. llvm-svn: 252926	2015-11-12 19:09:16 +00:00
David Blaikie	b0311c590d	Roll an expression into an assert to fix -Wunused-variable in a -Asserts build llvm-svn: 252925	2015-11-12 19:07:43 +00:00
Nathan Slingerland	911ced6bf3	reverting r252916 to investigate test failure llvm-svn: 252921	2015-11-12 18:39:26 +00:00
Weiming Zhao	eed0145dd2	Fix bug 25440: GVN assertion after coercing loads Summary: when coercing loads, it inserts some instructions, which have no GV assigned. https://llvm.org/bugs/show_bug.cgi?id=25440 Reviewers: hfinkel, dberlin Subscribers: dberlin, llvm-commits Differential Revision: http://reviews.llvm.org/D14479 llvm-svn: 252919	2015-11-12 18:19:59 +00:00
Quentin Colombet	aeb85934b6	[ShrinkWrap] Fix a typo in a comment. llvm-svn: 252918	2015-11-12 18:16:27 +00:00
Quentin Colombet	94dc1e0d34	[ShrinkWrap] Make sure we do not mess up with EH funclet lowering. ShrinkWrapping does not understand exception handling constraints for now, so make sure we do not mess with them by aborting on functions that use EH funclets. llvm-svn: 252917	2015-11-12 18:13:42 +00:00
Nathan Slingerland	f0e107e38a	[llvm-profdata] Add check for text profile formats and improve error reporting Summary: This change addresses two possible instances of user error / confusion when merging sampled profile data. Previously any input that didn't match the raw or processed instrumented format would automatically be interpreted as instrumented profile text format data. No error would be reported during the merge. Example: If foo-sampled.profdata and bar-sampled.profdata are binary sampled profiles: Old behavior: $ llvm-profdata merge foo-sampled.profdata bar-sampled.profdata -output foobar-sampled.profdata $ llvm-profdata show -sample foobar-sampled.profdata error: foobar-sampled.profdata:1: Expected 'mangled_name:NUM:NUM', found lprofi This change adds basic checks for valid input data when assuming text input. It also makes error messages related to file format validity more specific about the assumbed profile data type. New behavior: $ llvm-profdata merge foo-sampled.profdata bar-sampled.profdata -o foobar-sampled.profdata error: foo.profdata: Unrecognized instrumentation profile encoding format Perhaps you forgot to use the -sample option? Reviewers: bogner, davidxl, dnovillo Subscribers: davidxl, llvm-commits Differential Revision: http://reviews.llvm.org/D14558 llvm-svn: 252916	2015-11-12 18:06:18 +00:00
Diego Novillo	4b6bdb538e	SamplePGO - Move FunctionSamples::print() to a better location. NFC. The class is declared in SampleProf.h, so a better home for this is SampleProf.cpp. llvm-svn: 252915	2015-11-12 17:58:14 +00:00
Andrew Kaylor	fb16a3ac9a	[WinEH] Fix problem with removing an element from a SetVector while iterating. Patch provided by Yaron Keren. (Thanks!) llvm-svn: 252913	2015-11-12 17:36:03 +00:00
Rafael Espindola	2aebdda81e	Comment update. NFC. Fix the library name. Don't duplicate the comment in the .cpp file. Don't repeat the name in the comment. llvm-svn: 252911	2015-11-12 17:13:45 +00:00
Dan Gohman	cf4748f180	[WebAssembly] Reapply r252858, with svn add for the new file. Switch to MC for instruction printing. This encompasses several changes which are all interconnected: - Use the MC framework for printing almost all instructions. - AsmStrings are now live. - This introduces an indirection between LLVM vregs and WebAssembly registers, and a new pass, WebAssemblyRegNumbering, for computing a basic the mapping. This addresses some basic issues with argument registers and unused registers. - The way ARGUMENT instructions are handled no longer generates redundant get_local+set_local for every argument. This also changes the assembly syntax somewhat; most notably, MC's printing does not use sigils on label names, so those are no longer present, and push/pop now have a sigil to keep them unambiguous. The usage of set_local/get_local/$push/$pop will continue to evolve significantly. This patch is just one step of a larger change. llvm-svn: 252910	2015-11-12 17:04:33 +00:00
Michael Zuckerman	fd3fe9e45a	[x86] translating "fp" (floating point) instructions from {fadd,fdiv,fmul,fsub,fsubr,fdivr} to {faddp,fdivp,fmulp,fsubp,fsubrp,fdivrp} LLVM Missing the following instructions: fadd\fdiv\fmul\fsub\fsubr\fdivr. GAS and MS supporting this instruction and lowering them in to a faddp\fdivp\fmulp\fsubp\fsubrp\fdivrp instructions. Differential Revision: http://reviews.llvm.org/D14217 llvm-svn: 252908	2015-11-12 16:58:51 +00:00
Artyom Skrobov	2c2f378f8a	Cull non-standard variants of ARM architectures (NFC) Summary: This patch changes ARMV5, ARMV5E, ARMV6SM, ARMV6HL, ARMV7, ARMV7L, ARMV7HL, ARMV7EM to be treated as aliases for the corresponding standard architectures, instead of as actual architectures. Reviewers: rengolin Subscribers: aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D14577 llvm-svn: 252903	2015-11-12 15:51:41 +00:00
Hans Wennborg	7384a2de02	Revert r252858: "[WebAssembly] Switch to MC for instruction printing." It broke the CMake build: "Cannot find source file: WebAssemblyRegNumbering.cpp" llvm-svn: 252897	2015-11-12 14:37:56 +00:00
Vasileios Kalintiris	48e0256ed6	Re-apply "[mips] Use correct frame register for DWARF info when dynamically realigning the stack."" r252219 reversed the direction of subprogram -> function edge. Fixed the IR to account for this. llvm-svn: 252895	2015-11-12 14:11:43 +00:00
James Molloy	8e99e97f2a	[ARM] CMOV->BFI combining: handle both senses of CMPZ I completely misunderstood what ARMISD::CMPZ means. It's not "compare equal to zero", it's "compare, only setting the zero/Z flag". It can either be equal-to-zero or not-equal-to-zero, and we weren't checking what sense it was. If it's equal-to-zero, we can swap the operands around and pretend like it is not-equal-to-zero, which is both a bug fix and lets us handle more cases. llvm-svn: 252891	2015-11-12 13:49:17 +00:00
Renato Golin	93064025bd	Revert "[ARM] Enable shrink-wrapping by default." This reverts commit r252825, as it broke ASAN on ARM. Investigating... llvm-svn: 252889	2015-11-12 13:34:50 +00:00
Daniel Sanders	9f6ad49740	Implement .reloc (constant offset only) with support for R_MIPS_NONE and R_MIPS_32. Summary: Support for R_MIPS_NONE allows us to parse MIPS16's usage of .reloc. R_MIPS_32 was included to be able to better test the directive. Targets can add their relocations by overriding MCAsmBackend::getFixupKind(). Subscribers: grosbach, rafael, majnemer, dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D13659 llvm-svn: 252888	2015-11-12 13:33:00 +00:00
Zlatko Buljan	797c2aec6b	[mips][microMIPS] Implement LWM16, SB16, SH16, SW16, SWSP and SWM16 instructions Differential Revision: http://reviews.llvm.org/D11406 llvm-svn: 252885	2015-11-12 13:21:33 +00:00
Vasileios Kalintiris	d38860610d	Revert "[mips] Use correct frame register for DWARF info when dynamically realigning the stack." This reverts commit r252882. LLParser complains for invalid field 'function' in DISubprogram. llvm-svn: 252884	2015-11-12 13:19:11 +00:00
Vasileios Kalintiris	352eb55baf	[mips] Use correct frame register for DWARF info when dynamically realigning the stack. Summary: This patch overrides TargetFrameLowering::getFrameIndexReference() in order to specify the correct register when the function needs dynamic stack realignment. The values returned from this function are used in order to create DW_AT_locations for DWARF info. These locations would use the wrong registers as it's been reported in PR25028. Reviewers: dsanders Subscribers: dean, llvm-commits Differential Revision: http://reviews.llvm.org/D13511 llvm-svn: 252882	2015-11-12 13:04:16 +00:00
James Molloy	2d09c00b91	[InstCombine] Add trivial folding (bitreverse (bitreverse x)) -> x There are plenty more instcombines we could probably do with bitreverse, but this seems like a very obvious and trivial starting point and was brought up by Hal in his review. llvm-svn: 252879	2015-11-12 12:39:41 +00:00
James Molloy	90111f79f9	[SDAG] Introduce a new BITREVERSE node along with a corresponding LLVM intrinsic Several backends have instructions to reverse the order of bits in an integer. Conceptually matching such patterns is similar to @llvm.bswap, and it was mentioned in http://reviews.llvm.org/D14234 that it would be best if these patterns were matched in InstCombine instead of reimplemented in every different target. This patch introduces an intrinsic @llvm.bitreverse.i* that operates similarly to @llvm.bswap. For plumbing purposes there is also a new ISD node ISD::BITREVERSE, with simple expansion and promotion support. The intention is that InstCombine's BSWAP detection logic will be extended to support BITREVERSE too, and @llvm.bitreverse intrinsics emitted (if the backend supports lowering it efficiently). llvm-svn: 252878	2015-11-12 12:29:09 +00:00
James Molloy	7e9bdd5d01	Revert "Revert "[FunctionAttrs] Identify norecurse functions"" This reapplies this patch, with test fixes. llvm-svn: 252871	2015-11-12 10:55:20 +00:00
Kuba Brecka	de8332257b	[Object, MachO] Mark symbols from DATA and BSS sections as ST_Data In `MachOObjectFile::getSymbolType` we currently always return `SymbolRef::ST_Function` for symbols from any section. In order for llvm-symbolizer to correctly symbolize Mach-O globals, symbols from data and BSS sections should return `SymbolRef::ST_Data`. Differential Revision: http://reviews.llvm.org/D14576 llvm-svn: 252867	2015-11-12 09:40:29 +00:00
Amjad Aboud	e59cc3e540	dwarfdump: Added macro support to llvm-dwarfdump tool. Added "macro" option to "-debug-dump" flag, which trigger parsing and dumping of the ".debug_macinfo" section. Differential Revision: http://reviews.llvm.org/D14294 llvm-svn: 252866	2015-11-12 09:38:54 +00:00
Dylan McKay	c498ba3a3e	Add AVR backend skeleton This adds part of the target info code, and adds modifications to the build scripts so that AVR is recognized a supported, experimental backend. It does not include any AVR-specific code, just the bare sources required for a backend to exist. From D14039. llvm-svn: 252865	2015-11-12 09:26:44 +00:00
James Molloy	9a32da74f7	Revert "[FunctionAttrs] Identify norecurse functions" This reverts commit r252862. This introduced test failures and I'm reverting while I investigate how this happened. llvm-svn: 252863	2015-11-12 09:05:43 +00:00
James Molloy	b14994e752	[FunctionAttrs] Identify norecurse functions A function can be marked as norecurse if: * The SCC to which it belongs has cardinality 1; and either a) It does not call any non-norecurse function. This includes self-recursion; or b) It only has one callsite and the function that callsite is within is marked norecurse. a) is best propagated bottom-up and b) is best propagated top-down. We build up the norecurse attributes bottom-up using the existing SCC pass, and mark functions with no obvious recursion (but not provably norecurse) to sweep later, top-down. llvm-svn: 252862	2015-11-12 08:53:04 +00:00
David Blaikie	6400fc146e	Mostly revert 252842 due to failures on some buildbots. I imagine there's some UB in here somewhere, though Valgrind doesn't seem to have picked it up (not sure if I have a working asan build right now to test there). GDB bot seems to be crashing: http://lab.llvm.org:8011/builders/clang-x86_64-ubuntu-gdb-75/builds/26267/steps/check-all/logs/FAIL%3A%20LLVM%3A%3Adwarfdump-dwp.test Hexagon ELF bot is, presumably, just getting different output: http://lab.llvm.org:8011/builders/clang-hexagon-elf/builds/32927/steps/check-all/logs/FAIL%3A%20LLVM%3A%3Adwarfdump-dwp.test llvm-svn: 252859	2015-11-12 06:33:14 +00:00
Dan Gohman	9dd55a8065	[WebAssembly] Switch to MC for instruction printing. This encompasses several changes which are all interconnected: - Use the MC framework for printing almost all instructions. - AsmStrings are now live. - This introduces an indirection between LLVM vregs and WebAssembly registers, and a new pass, WebAssemblyRegNumbering, for computing a basic the mapping. This addresses some basic issues with argument registers and unused registers. - The way ARGUMENT instructions are handled no longer generates redundant get_local+set_local for every argument. This also changes the assembly syntax somewhat; most notably, MC's printing use sigils on label names, so those are no longer present, and push/pop now have a sigil to keep them unambiguous. The usage of set_local/get_local/$push/$pop will continue to evolve significantly. This patch is just one step of a larger change. llvm-svn: 252858	2015-11-12 06:10:03 +00:00
Mike Aizatsky	a9c2387192	output_csv libfuzzer option Summary: The option outputs statistics in CSV format preceded by 1 header line. This is intended for machine processing of the output. -verbosity=0 should likely be set. Differential Revision: http://reviews.llvm.org/D14600 llvm-svn: 252856	2015-11-12 04:38:40 +00:00
David Blaikie	ee293c0aac	dwarfdump: Add error checking to fix the buildbots/correctness llvm-svn: 252845	2015-11-12 01:57:33 +00:00
David Blaikie	6e9c4f7f0d	dwarfdump: Add some error handling for DWP index sections of the wrong size llvm-svn: 252843	2015-11-12 01:41:59 +00:00
David Blaikie	5b9bf49c6f	dwarfdump: Dump the contents of DWP indexes llvm-svn: 252842	2015-11-12 01:41:52 +00:00
Matthias Braun	b9610a6bc2	LegalizeDAG: Fix and improve FCOPYSIGN/FABS legalization - Factor out code to query and modify the sign bit of a floatingpoint value as an integer. This also works if none of the targets integer types is big enough to hold all bits of the floatingpoint value. - Legalize FABS(x) as FCOPYSIGN(x, 0.0) if FCOPYSIGN is available, otherwise perform bit manipulation on the sign bit. The previous code used "x >u 0 ? x : -x" which is incorrect for x being -0.0! It also takes 34 instructions on ARM Cortex-M4. With this patch we only require 5: vldr d0, LCPI0_0 vmov r2, r3, d0 lsrs r2, r3, #31 bfi r1, r2, #31, #1 bx lr (This could be further improved if the compiler would recognize that r2, r3 is zero). - Only lower FCOPYSIGN(x, y) = sign(x) ? -FABS(x) : FABS(x) if FABS is available otherwise perform bit manipulation on the sign bit. - Perform the sign(x) test by masking out the sign bit and comparing with 0 rather than shifting the sign bit to the highest position and testing for "<s 0". For x86 copysignl (on 80bit values) this gets us: testl $32768, %eax rather than: shlq $48, %rax sets %al testb %al, %al Differential Revision: http://reviews.llvm.org/D11172 llvm-svn: 252839	2015-11-12 01:02:47 +00:00
Kostya Serebryany	dc3135db05	[libFuzzer] experimental flag -drill (another search heuristic; Mike Aizatsky's idea) llvm-svn: 252838	2015-11-12 01:02:01 +00:00
Manman Ren	3f2b9c18e2	[TLS on Darwin] use a different mask for tls calls on x86-64. Calls involved in thread-local variable lookup save more registers than normal calls. rdar://problem/23073171 llvm-svn: 252837	2015-11-12 00:54:04 +00:00
Xinliang David Li	4c3ab815ea	Fix problems in coding style llvm-svn: 252829	2015-11-12 00:32:17 +00:00
Quentin Colombet	10f9813528	[ARM] Enable shrink-wrapping by default. Differential Revision: http://reviews.llvm.org/D14357 rdar://problem/21942589 llvm-svn: 252825	2015-11-11 23:31:46 +00:00
Reid Kleckner	b9204a584c	[WinEH] Don't forward branches across empty EH pad BBs For really simple SEH catchpads, we tried to forward the invoke unwind edge across the empty block. llvm-svn: 252822	2015-11-11 23:09:31 +00:00
Chad Rosier	cc9030b60a	[LIR] General refactor to improve compile-time and simplify code. First create a list of candidates, then transform. This simplifies the code in that you have don't have to worry that you may be using an invalidated iterator. Previously, each time we created a memset/memcpy we would reevaluate the entire loop potentially resulting in lots of redundant work for large basic blocks. llvm-svn: 252817	2015-11-11 23:00:59 +00:00
David Majnemer	f0f224d12d	[IR] Add support for empty tokens When working with tokens, it is often the case that one has instructions which consume a token and produce a new token. Currently, we have no mechanism to represent an initial token state. Instead, we can create a notional "empty token" by inventing a new constant which captures the semantics we would like. This new constant is called ConstantTokenNone and is written textually as "token none". Differential Revision: http://reviews.llvm.org/D14581 llvm-svn: 252811	2015-11-11 21:57:16 +00:00
Sanjoy Das	cdafd8490a	Introduce deoptimization operand bundles Summary: This change introduces the notion of "deoptimization" operand bundles. LLVM can recognize and optimize these in more precise ways than it can a generic "unknown" operand bundles. The current form of this special recognition / optimization is an enum entry in LLVMContext, a LangRef blurb and a verifier rule. Over time we will teach LLVM to do more aggressive optimization around deoptimization operand bundles, exploiting known facts about kinds of state deoptimization operand bundles are allowed to track. Reviewers: reames, majnemer, chandlerc, dexonsmith Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14551 llvm-svn: 252806	2015-11-11 21:38:02 +00:00
Paul Robinson	8ab79a1e8a	Report Windows error code in a fatal error after a system call. llvm-svn: 252800	2015-11-11 20:49:32 +00:00
David Blaikie	1070a09f17	unique_ptrify the AllocValueProfData helper function introduced in r252783 llvm-svn: 252799	2015-11-11 20:44:52 +00:00
Hemant Kulkarni	bdce12a01b	[Symbolizer]: Add -pretty-print option Differential Revision: http://reviews.llvm.org/D13671 llvm-svn: 252798	2015-11-11 20:41:43 +00:00
Akira Hatanaka	d932679c71	Move the enum attributes defined in Attributes.h to a table-gen file. This is a step towards consolidating some of the information regarding attributes in a single place. This patch moves the enum attributes in Attributes.h to the table-gen file. Additionally, it adds definitions of target independent string attributes that will be used in follow-up commits by the inliner to check attribute compatibility. rdar://problem/19836465 llvm-svn: 252796	2015-11-11 20:35:42 +00:00
Yunzhong Gao	ea7b3a2320	Add a libLTO diagnostic handler that supports lto_get_error_message API This is a follow-up from the previous discussion on the thread: http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20151019/307763.html The LibLTO lto_get_error_message() API reads error messages from a std::string sLastErrorString. Instead of passing this string around as an argument, this patch creates a diagnostic handler and then sends this handler to the constructor of LTOCodeGenerator. Differential Revision: http://reviews.llvm.org/D14313 llvm-svn: 252791	2015-11-11 19:59:08 +00:00
Geoff Berry	2ddfc5e60f	[DAGCombiner] Improve zextload optimization. Summary: Don't fold (zext (and (load x), cst)) -> (and (zextload x), (zext cst)) if (and (load x) cst) will match as a zextload already and has additional users. For example, the following IR: %load = load i32, i32* %ptr, align 8 %load16 = and i32 %load, 65535 %load64 = zext i32 %load16 to i64 store i32 %load16, i32* %dst1, align 4 store i64 %load64, i64* %dst2, align 8 used to produce the following aarch64 code: ldr w8, [x0] and w9, w8, #0xffff and x8, x8, #0xffff str w9, [x1] str x8, [x2] but with this change produces the following aarch64 code: ldrh w8, [x0] str w8, [x1] str x8, [x2] Reviewers: resistor, mcrosier Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14340 llvm-svn: 252789	2015-11-11 19:42:52 +00:00
David Blaikie	51c402838c	dwarfdump: DWP type unit index dumping skeleton llvm-svn: 252786	2015-11-11 19:40:49 +00:00
Xinliang David Li	4d1bef3f76	Refactoring and fix another instance of asan error llvm-svn: 252783	2015-11-11 19:31:53 +00:00
David Blaikie	0b44dcc44a	Format my previous commit llvm-svn: 252782	2015-11-11 19:30:47 +00:00
David Blaikie	65a8efe441	dwarfdump: First piece of support for DWP dumping Just a tiny piece of index dumping - the header in this instance. llvm-svn: 252781	2015-11-11 19:28:21 +00:00
Joseph Tremoulet	9f467353a5	[WinEH] Only generate UnwindHelp slot for MSVCXX Summary: Other personalities don't use this special frame slot. Reviewers: majnemer, andrew.w.kaylor, rnk Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14580 llvm-svn: 252778	2015-11-11 19:21:09 +00:00
Dawn Perchik	fc4e1c74ab	Support: Recognize Borland DWARF extensions. This patch adds DWARF values for the Delphi language and Borland C++ language extensions. Reviewed by: dblaikie Subscribers: llvm-commits, majnemer Differential Revision: http://reviews.llvm.org/D14522 llvm-svn: 252776	2015-11-11 18:47:36 +00:00
Matt Arsenault	d8fed1b793	Add target preference for GatherAllAliases max depth llvm-svn: 252775	2015-11-11 18:44:33 +00:00
Colin LeMahieu	da6cafffc0	Reverting r252760 llvm-svn: 252770	2015-11-11 18:11:06 +00:00
Dehao Chen	54511353e3	clang-format lib/CodeGen/AsmPrinter/DwarfCompileUnit.cpp llvm-svn: 252769	2015-11-11 18:09:47 +00:00
Dehao Chen	72fdf444b7	Emit discriminator for inlined callsites. Summary: Inlined callsites need to be emitted in debug info so that sample profile can be annotated to the correct inlined instance. Reviewers: dnovillo, dblaikie Subscribers: dblaikie, llvm-commits Differential Revision: http://reviews.llvm.org/D14511 llvm-svn: 252768	2015-11-11 18:08:18 +00:00
Diego Novillo	0354a9f67b	SamplePGO - Fix PR 25482 - Do not rely on llvm.dbg.cu for discriminators The discriminators pass relied on the presence of llvm.dbg.cu to decide whether to add discriminators, but this fails in the case where debug info is only enabled partially when -fprofile-sample-use is active. The reason llvm.dbg.cu is not present in these cases is to prevent codegen from emitting debug info (as it is only used for the sample profile pass). This changes the discriminators pass to also emit discriminators even when debug info is not being emitted. llvm-svn: 252763	2015-11-11 17:54:37 +00:00
Hemant Kulkarni	c6638c7561	[Symbolizer]: Add -pretty-print option Differential Revision: http://reviews.llvm.org/D13671 llvm-svn: 252760	2015-11-11 17:47:54 +00:00
Sanjay Patel	f740129198	[MIPS] add overrides for isCheapToSpeculateCttz() and isCheapToSpeculateCtlz() MIPS32 has instructions for efficient count-leading/trailing-zeros, so this should be considered a cheap operation (and therefore fair game for speculation) for any MIPS32 implementation. The net result of allowing this speculation for the regression tests in this patch is that we get this code: ctlz: jr $ra clz $2, $4 cttz: addiu $1, $4, -1 not $2, $4 and $1, $2, $1 clz $1, $1 addiu $2, $zero, 32 jr $ra subu $2, $2, $1 Instead of: ctlz: beqz $4, $BB0_2 addiu $2, $zero, 32 clz $2, $4 $BB0_2: jr $ra nop cttz: beqz $4, $BB1_2 addiu $2, $zero, 32 addiu $1, $4, -1 not $2, $4 and $1, $2, $1 clz $1, $1 addiu $2, $zero, 32 subu $2, $2, $1 $BB1_2: jr $ra nop See D14469 for the larger motivation. Differential Revision: http://reviews.llvm.org/D14500 llvm-svn: 252755	2015-11-11 17:24:56 +00:00
Diego Novillo	0767ae5896	Properly fix unused variable in disable-assert builds. I missed the side-effects of ParseBFI in my previous attempt (r252748). Thanks dblaikie for the suggestion of adding a void use of the unused variable instead. llvm-svn: 252751	2015-11-11 16:39:22 +00:00
Diego Novillo	29f88a2460	Remove unused variable in disable-assert builds. NFC. llvm-svn: 252748	2015-11-11 16:14:52 +00:00
Douglas Katzman	a14039764b	Visibly fail if attempting to encode register AH,BH,CH,DH in a REX-prefixed instruction. Differential Revision: http://reviews.llvm.org/D13316 Fixes PR25003 llvm-svn: 252743	2015-11-11 15:51:16 +00:00
James Molloy	ce12c92f66	[ARM] Combine BFIs together If we have a chain of BFIs, we may be able to combine several together into one merged BFI. We can do this if the "from" bits from one BFI OR'd with the "from" bits from the other BFI form a contiguous range, and the same with the "to" bits. llvm-svn: 252740	2015-11-11 15:40:40 +00:00
Charlie Turner	d82c9389e7	[SLP] Enable -slp-vectorize-hor by default. Measurements primarily on AArch64 have shown this feature does not significantly effect compile-time. The are no significant perf changes in LNT, but for AArch64 at least, there are wins in third party benchmarks. As discussed on llvm-dev, we're going to try turning this on by default and see how other targets react to the change. llvm-svn: 252733	2015-11-11 15:03:46 +00:00
Aaron Ballman	470b5f1a79	Silencing a signed vs unsigned type mismatch warning. llvm-svn: 252732	2015-11-11 14:57:28 +00:00
Aaron Ballman	107bb0d193	Silencing nine warnings for "enumeral and non-enumeral type in conditional expression"; NFC. llvm-svn: 252728	2015-11-11 13:44:06 +00:00
Michael Kuperstein	12982a816c	[X86] Replace LEAs with INC/DEC when profitable If possible and profitable, replace lea %reg, 1(%reg) and lea %reg, -1(%reg) with inc %reg and dec %reg respectively. Patch by: anton.nadolsky@intel.com Differential Revision: http://reviews.llvm.org/D14059 llvm-svn: 252722	2015-11-11 11:44:31 +00:00
Yury Gribov	d7731988ef	[ASan] Enable optional ASan recovery. Differential Revision: http://reviews.llvm.org/D14242 llvm-svn: 252719	2015-11-11 10:36:49 +00:00
Craig Topper	b24a58e28f	[X86] Fix feature flags on some MMX register instructions that really were introduced with SSE or SSE2. llvm-svn: 252709	2015-11-11 07:29:25 +00:00
Craig Topper	700a1a23d7	[X86] Remove redundant MMX isel patterns. llvm-svn: 252708	2015-11-11 07:29:22 +00:00
Dan Gohman	754cd11d90	[WebAssembly] Support non-legal argument and return types. llvm-svn: 252687	2015-11-11 01:33:02 +00:00
Ahmed Bougacha	4a85643907	[MC] Use LShr for constant evaluation of ">>" on non-arm64 darwin. Follow-up to r235963: this matches other assemblers and is less unexpected (e.g. PR23227). llvm-svn: 252681	2015-11-11 00:51:36 +00:00
Matthias Braun	2c98d0f477	MachineInstr: addRegisterDefReadUndef() => setRegisterDefReadUndef() This way we can not only add but also remove read undef flags. llvm-svn: 252678	2015-11-11 00:41:58 +00:00
Matt Arsenault	8246d4aead	AMDGPU: Print more fields in comments llvm-svn: 252677	2015-11-11 00:27:46 +00:00
Sanjoy Das	dc26df4abe	[ValueTracking] Remove untested / unreachable code, NFC Right now isTruePredicate is only ever called with Pred == ICMP_SLE or ICMP_ULE, and the ICMP_SLT and ICMP_ULT cases are dead. This change removes the untested dead code so that the function is not misleading. llvm-svn: 252676	2015-11-11 00:16:41 +00:00
Matt Arsenault	61cb6fa848	AMDGPU: Remove dead code llvm-svn: 252675	2015-11-11 00:01:36 +00:00
Matt Arsenault	6690d7de39	AMDGPU: Set isAllocatable = 0 on VS_32/VS_64 llvm-svn: 252674	2015-11-11 00:01:32 +00:00
Sanjoy Das	925681053d	[ValueTracking] Teach isImpliedCondition a new bitwise trick Summary: This change teaches isImpliedCondition to prove things like (A \| 15) < L ==> (A \| 14) < L if the low 4 bits of A are known to be zero. Depends on D14391 Reviewers: majnemer, reames, hfinkel Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14392 llvm-svn: 252673	2015-11-10 23:56:20 +00:00
Sanjoy Das	af1400f84b	[ValueTracking] Use m_APInt instead of m_ConstantInt, NFC This change would add functionality if isImpliedCondition worked on vector types; but since it bail out on vector predicates this change is an NFC. llvm-svn: 252672	2015-11-10 23:56:15 +00:00
Matthias Braun	4353b30542	TableGen: Emit LaneMask for register classes without subregisters as ~0u This makes it slightly easier to handle classes with and without subregister uniformly. llvm-svn: 252671	2015-11-10 23:23:05 +00:00
Reid Kleckner	7f84a939ed	[WinEH] Insert the MBB for EH_RESTORE after the catchret Inserting it before the target block could be bad, we might already have a fallthrough edge to it. llvm-svn: 252670	2015-11-10 23:22:20 +00:00
Kostya Serebryany	b7e286bed7	[libFuzzer] add UninstrumentedTest.cpp (missing from a previous commit) llvm-svn: 252658	2015-11-10 22:02:56 +00:00
Dan Gohman	16d314d300	[WebAssembly] Remove special cases for things that are no longer special. NFC. llvm-svn: 252656	2015-11-10 21:48:21 +00:00
Bill Schmidt	3c44c6f189	Add PPCMIPeephole.cpp to CMakeLists.txt llvm-svn: 252654	2015-11-10 21:43:45 +00:00
Dan Gohman	b84ae9bb38	[WebAssembly] Support for floating point min and max. llvm-svn: 252653	2015-11-10 21:40:21 +00:00
Bill Schmidt	34af5e1c76	[PowerPC] Add an MI SSA peephole pass. This patch adds a pass for doing PowerPC peephole optimizations at the MI level while the code is still in SSA form. This allows for easy modifications to the instructions while depending on a subsequent pass of DCE. Both passes are very fast due to the characteristics of SSA. At this time, the only peepholes added are for cleaning up various redundancies involving the XXPERMDI instruction. However, I would expect this will be a useful place to add more peepholes for inefficiencies generated during instruction selection. The pass is placed after VSX swap optimization, as it is best to let that pass remove unnecessary swaps before performing any remaining clean-ups. The utility of these clean-ups are demonstrated by changes to four existing test cases, all of which now have tighter expected code generation. I've also added Eric Schweiz's bugpoint-reduced test from PR25157, for which we now generate tight code. One other test started failing for me, and I've fixed it (test/Transforms/PlaceSafepoints/finite-loops.ll) as well; this is not related to my changes, and I'm not sure why it works before and not after. The problem is that the CHECK-NOT: of "statepoint" from test1 fails because of the "statepoint" in test2, and so forth. Adding a CHECK-LABEL in between keeps the different occurrences of that string properly scoped. llvm-svn: 252651	2015-11-10 21:38:26 +00:00
Teresa Johnson	2d5fb8cac4	Ensure ModuleLinker materializes complete comdat groups Summary: The module linker lazy links some "discardable if unused" global values (e.g. linkonce), materializing and linking them only if they are referenced in the module. If a comdat group contains a linkonce member that is not referenced, however, it would not be materialized and linked, leading to an incomplete comdat group. If there are other object files not part of the same LTO link that also define and use that comdat group, the linker may select the incomplete group leading to link time unsats. To solve this, whenever a global value body is linked, make sure we materialize any other members of the same comdat group that are not yet materialized. This ensures they are in the lazy link list and get linked as well. Added new test and adjusted old test to remove parts that didn't make sense with fix. Reviewers: rafael Subscribers: dexonsmith, davidxl, llvm-commits Differential Revision: http://reviews.llvm.org/D14516 llvm-svn: 252647	2015-11-10 21:09:06 +00:00
Sanjoy Das	bd1c1bfbd2	[IR] Make {Call,Invoke}::cloneImpl aware of operand bundles This was an omission in the patch that landed initial support for operand bundles. So far we haven't hit this, but we will once the inliner is able to inline calls to functions that contain calls with operand bundles. llvm-svn: 252645	2015-11-10 20:13:21 +00:00
Sanjoy Das	b9ca6dcc6b	[OperandBundles] Identify operand bundles with both their names and IDs No code uses this functionality yet. This change just exposes information / structure that was already present. llvm-svn: 252644	2015-11-10 20:13:15 +00:00
Sanjay Patel	33ec5dbe35	less indent; NFCI llvm-svn: 252643	2015-11-10 20:09:02 +00:00
Sanjay Patel	af1b48bfdc	[ARM] add overrides for isCheapToSpeculateCttz() and isCheapToSpeculateCtlz() ARM V6T2 has instructions for efficient count-leading/trailing-zeros, so this should be considered a cheap operation (and therefore fair game for speculation) for any ARM V6T2 implementation. The net result of allowing this speculation for the regression tests in this patch is that we get this code: ctlz: clz r0, r0 bx lr cttz: rbit r0, r0 clz r0, r0 bx lr Instead of: ctlz: cmp r0, #0 moveq r0, #32 clzne r0, r0 bx lr cttz: cmp r0, #0 moveq r0, #32 rbitne r0, r0 clzne r0, r0 bx lr This will help solve a general speculation/despeculation problem noted in PR24818: https://llvm.org/bugs/show_bug.cgi?id=24818 Differential Revision: http://reviews.llvm.org/D14469 llvm-svn: 252639	2015-11-10 19:24:31 +00:00
Matt Arsenault	aa118e299c	LegalizeDAG: Implement promote for scalar_to_vector This allows avoiding the default Expand behavior which introduces stack usage. Bitcast the scalar and replace the missing elements with undef. This is covered by existing tests and used by a future commit which makes 64-bit vectors legal types on AMDGPU. llvm-svn: 252632	2015-11-10 18:48:11 +00:00

... 6 7 8 9 10 ...

85123 Commits