llvm-project

Commit Graph

Author	SHA1	Message	Date
David Majnemer	b58f32f7a8	[LoopVectorize] Don't crash on zero-sized types in isInductionPHI isInductionPHI wants to calculate the stride based on the pointee size. However, this is not possible when the pointee is zero sized. This fixes PR23763. llvm-svn: 239143	2015-06-05 10:52:40 +00:00
Andrea Di Biagio	eb33134ce7	Simplify code; NFC. Also, moved test cases from CodeGen/X86/fold-buildvector-bug.ll into CodeGen/X86/buildvec-insertvec.ll and regenerated CHECK lines using update_llc_test_checks.py. llvm-svn: 239142	2015-06-05 10:29:55 +00:00
David Majnemer	6d8081835d	[InstCombine] Rephrase fix to SimplifyWithOpReplaced I don't have the IR which is causing the build bot breakage but I can postulate as to why they are timing out: 1. SimplifyWithOpReplaced was stripping flags from the simplified value. 2. visitSelectInstWithICmp was overriding SimplifyWithOpReplaced because it's simplification wasn't correct. 3. InstCombine would revisit the add instruction and note that it can rederive the flags. 4. By modifying the value, we chose to revisit instructions which reuse the value. One of the instructions is the original select, causing LLVM to never reach fixpoint. Instead, strip the flags only when we are sure we are going to perform the simplification. llvm-svn: 239141	2015-06-05 09:57:57 +00:00
Daniel Jasper	917fa5ee66	Revert "[InstCombine] Don't miscompile safe increment idiom" This is breaking a lot of build bots and is causing very long-running compiles (infinite loops)? Likely, we shouldn't return nullptr? llvm-svn: 239139	2015-06-05 09:31:20 +00:00
Simon Pilgrim	b4c562de87	[X86][SSE] Added tests for i8/i16 vector shifts Currently still scalarized, but D9474 should remedy that. llvm-svn: 239136	2015-06-05 08:24:23 +00:00
Alexey Samsonov	5c3ed00eb2	Revert "[Object, ELF] Fix segmentation fault in ELFFile::getSectionName()." This reverts commit r239124. llvm-svn: 239125	2015-06-04 23:58:31 +00:00
Alexey Samsonov	24558d5520	[Object, ELF] Fix segmentation fault in ELFFile::getSectionName(). Don't do a null dereference if .shstrtab section is missing. llvm-svn: 239124	2015-06-04 23:40:23 +00:00
Alexey Samsonov	49179ddba4	[Object, ELF] Don't assert on invalid magic in createELFObjectFile. Instead, return a proper error code from factory. llvm-svn: 239116	2015-06-04 23:14:43 +00:00
David Majnemer	00f7d9ecc8	[InstCombine] Don't miscompile safe increment idiom We cleverly handle cases where computation done in one argument of a select instruction is suitable for the other operand, thus obviating the need of the select and the comparison. However, the other operand cannot have flags. This fixes PR23757. llvm-svn: 239115	2015-06-04 23:11:30 +00:00
Swaroop Sridhar	70d18df18f	Statepoint: Fix handling of Far Immediate calls gc.statepoint intrinsics with a far immediate call target were lowered incorrectly as pc-rel32 calls. This change fixes the problem, and generates an indirect call via a scratch register. For example: Intrinsic: %safepoint_token = call i32 (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () inttoptr (i64 140727162896504 to void ()), i32 0, i32 0, i32 0, i32 0) Old Incorrect Lowering: callq 140727162896504 New Correct Lowering: movabsq $140727162896504, %rax callq %rax In lowerCallFromStatepoint(), the callee-target was modified and represented as a "TargetConstant" node, rather than a "Constant" node. Undoing this modification enabled LowerCall() to generate the correct CALL instruction. llvm-svn: 239114	2015-06-04 23:03:21 +00:00
Alexey Samsonov	18ad2e54ab	[Object, ELF] Don't call llvm_unreachable() from createELFObjectFile. Instead, return a proper error code from factory. llvm-svn: 239113	2015-06-04 22:58:25 +00:00
Charles Davis	da280728b6	[Target/X86] Don't use callee-saved registers in a Win64 tail call on non-Windows. Summary: A small bit that I missed when I updated the X86 backend to account for the Win64 calling convention on non-Windows. Now we don't use dead non-volatile registers when emitting a Win64 indirect tail call on non-Windows. Should fix PR23710. Test Plan: Added test for the correct behavior based on the case I posted to PR23710. Reviewers: rnk Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10258 llvm-svn: 239111	2015-06-04 22:50:05 +00:00
Alexey Samsonov	f8a7bf8c6e	[Object, MachO] Don't crash on incomplete MachO segment load commands. Report proper error code from MachOObjectFile constructor if we can't parse another segment load command (we already return a proper error if segment load command contents is suspicious). llvm-svn: 239109	2015-06-04 22:26:44 +00:00
Benjamin Kramer	ff0fb6936b	[SDAG switch lowering] Fix switch case -> or merging for 0 and INT_MIN The big/small ordering here is based on signed values so SmallValue will be INT_MIN and BigValue 0. This shouldn't be a problem but the code assumed that BigValue always had more bits set than SmallValue. We used to just miss the transformation, but a recent refactoring of mine turned this into an assertion failure. llvm-svn: 239105	2015-06-04 22:05:51 +00:00
Colin LeMahieu	348efdbd36	Shouldn't be XFAIL'ed. llvm-svn: 239103	2015-06-04 21:49:43 +00:00
Colin LeMahieu	c40be85adc	Revert r239095 incorrect test tree. llvm-svn: 239102	2015-06-04 21:32:42 +00:00
Jingyue Wu	a2f6027a31	[NVPTX] roll forward r239082 NVPTXISelDAGToDAG translates "addrspacecast to param" to NVPTX::nvvm_ptr_gen_to_param Added an llc test in bug21465. llvm-svn: 239100	2015-06-04 21:28:26 +00:00
Colin LeMahieu	fc52c11d80	[Hexagon] Adding functionality for duplexing. Duplexing is a way to compress commonly used pairs of instructions in order to reduce code size. The test case duplex.ll normally would be 8 bytes, assign register to 0 and jump to link register. After duplexing this is only 4 bytes. This also tests the HexagonMCShuffler code path which is used to make sure duplexed instructions still follow slot requirements. llvm-svn: 239095	2015-06-04 21:16:16 +00:00
Jingyue Wu	b8f38668d5	Revert r239082 llc crashed for NVPTX backend llvm-svn: 239094	2015-06-04 21:07:08 +00:00
Sergey Dmitrouk	3160d02b5b	Erase constant dbgloc on reuse in PHI node Basic block selection involves checking successor BBs for PHI nodes that depend on the current BB. In case such BBs are found, the value being selected is a constant and such constant already exists in current BB, it's value is reused. This might lead to wrong locations in some situations, especially if same constant value ends up being materialized twice in two different ways, which discards that sharing and leaves us with wrong debug location in the successor BB. In code this involves the following sequence of calls: SelectionDAGBuilder::HandlePHINodesInSuccessorBlocks -> SelectionDAGBuilder::CopyValueToVirtualRegister -> SelectionDAGBuilder::getNonRegisterValue llvm-svn: 239089	2015-06-04 20:48:40 +00:00
Ahmed Bougacha	8207641251	[GlobalMerge] Take into account minsize on Global users' parents. Now that we can look at users, we can trivially do this: when we would have otherwise disabled GlobalMerge (currently -O<3), we can just run it for minsize functions, as it's usually a codesize win. Differential Revision: http://reviews.llvm.org/D10054 llvm-svn: 239087	2015-06-04 20:39:23 +00:00
Jim Grosbach	7c76b4cc6e	MC: Remove obsolete MachO UseAggressiveSymbolFolding. Fix the FIXME and remove this old as(1) compat option. It was useful for bringup of the integrated assembler to diff object files, but now it's just causing more relocations than strictly necessary to be generated. rdar://21201804 llvm-svn: 239084	2015-06-04 20:27:42 +00:00
Jingyue Wu	f3a8079b75	[NVPTX] kernel pointer arguments point to the global address space Summary: With this patch, NVPTXLowerKernelArgs converts a kernel pointer argument to a pointer in the global address space. This change, along with NVPTXFavorNonGenericAddrSpaces, allows the NVPTX backend to emit ld.global.* and st.global.* for accessing kernel pointer arguments. Minor changes: 1. refactor: extract function convertToPointerInAddrSpace 2. fix a bug in the test case in bug21465.ll Test Plan: lower-kernel-ptr-arg.ll Reviewers: eliben, meheff, jholewinski Reviewed By: jholewinski Subscribers: wengxt, jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D10154 llvm-svn: 239082	2015-06-04 20:19:38 +00:00
Alexey Samsonov	074da9b5e7	[Object, MachO] Don't crash on invalid MachO segment load commands. Summary: Properly report the error in segment load commands from MachOObjectFile constructor instead of crashing the program. Adjust the test case accordingly. Test Plan: regression test suite Reviewers: rafael, filcab Subscribers: llvm-commits llvm-svn: 239081	2015-06-04 20:08:52 +00:00
Alexey Samsonov	de5a94a6b4	[Object, MachO] Don't crash on invalid MachO load commands. Summary: Currently all load commands are parsed in MachOObjectFile constructor. If the next load command cannot be parsed, or if command size is too small, properly report it through the error code and fail to construct the object, instead of crashing the program. Test Plan: regression test suite Reviewers: rafael, filcab Subscribers: llvm-commits llvm-svn: 239080	2015-06-04 19:57:46 +00:00
Alexey Samsonov	9f336636fe	[Object, MachO] Don't crash on parsing invalid MachO header. Summary: Instead, properly report this error from MachOObjectFile constructor. Test Plan: regression test suite Reviewers: rafael Subscribers: llvm-commits llvm-svn: 239078	2015-06-04 19:45:22 +00:00
Alexey Samsonov	3642508921	Fix buildbot failure on Windows by relaxing test expectations. llvm-svn: 239074	2015-06-04 19:22:00 +00:00
Alexei Starovoitov	310deada10	[bpf] add big- and host- endian support Summary: -march=bpf -> host endian -march=bpf_le -> little endian -match=bpf_be -> big endian Test Plan: v1 was tested by IBM s390 guys and appears to be working there. It bit rots too fast here. Reviewers: chandlerc, tstellarAMD Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10177 llvm-svn: 239071	2015-06-04 19:15:05 +00:00
Andrea Di Biagio	9ac8a6b13d	[DAGCombiner] Fix wrong folding of a build_vector into a blend with zero. Method 'visitBUILD_VECTOR' in the DAGCombiner knows how to combine a build_vector of a bunch of extract_vector_elt nodes and constant zero nodes into a shuffle blend with a zero vector. However, method 'visitBUILD_VECTOR' forgot that a floating point build_vector may contain negative zero as well as positive zero. Example: define <2 x double> @example(<2 x double> %A) { entry: %0 = extractelement <2 x double> %A, i32 0 %1 = insertelement <2 x double> undef, double %0, i32 0 %2 = insertelement <2 x double> %1, double -0.0, i32 1 ret <2 x double> %2 } Before this patch, llc (with -mattr=+sse4.1) wrongly generated movq %xmm0, %xmm0 # xmm0 = xmm0[0],zero So, the sign bit of the negative zero was effectively lost. This patch fixes the problem by adding explicit checks for positive zero. With this patch, llc produces the following code for the example above: movhpd .LCPI0_0(%rip), %xmm0 where .LCPI0_0 referes to a 'double -0'. llvm-svn: 239070	2015-06-04 19:15:01 +00:00
Alexey Samsonov	2b5fe3f5b2	Make test case more readable: move CHECK-lines next to corresponding RUN-lines. llvm-svn: 239068	2015-06-04 18:50:04 +00:00
Alexey Samsonov	50d0fbd2b9	llvm-objdump: return non-zero exit code for certain cases of invalid input * If the input file is missing; * If the type of input object file can't be recognized; * If the object file can't be parsed correctly. llvm-svn: 239065	2015-06-04 18:34:11 +00:00
Matt Arsenault	73e06fa262	R600/SI: Reimplement isLegalAddressingMode Now that we sometimes know the address space, this can theoretically do a better job. This needs better test coverage, but this mostly depends on first updating the loop optimizatiosn to provide the address space. llvm-svn: 239053	2015-06-04 16:17:42 +00:00
Matt Arsenault	81c7ae2bf5	R600/SI: Fix some cases for load / store of half Mostly argument loads were producing broken zextloads from an FP type. llvm-svn: 239049	2015-06-04 16:00:27 +00:00
Hans Wennborg	d922915685	Switch lowering: fix assert in buildBitTests (PR23738) When checking (High - Low + 1).sle(BitWidth), BitWidth would be truncated to the size of the left-hand side. In the case of this PR, the left-hand side was i4, so BitWidth=64 got truncated to 0 and the assert failed. llvm-svn: 239048	2015-06-04 15:55:00 +00:00
Rafael Espindola	a401eee22f	Omit unused section symbols from the symbol table. Section symbols exist as an optimization: instead of having multiple relocations point to different symbols, many of them can point to a single section symbol. When that optimization is unused, a section symbol is also unused and adds no extra information to the object file. This saves a bit of space on the object files and makes the output of llvm-objdump -t easier to read and consequently some tests get quite a bit simpler. llvm-svn: 239045	2015-06-04 15:33:30 +00:00
Rafael Espindola	09e5b1ca76	Move test that depends on x86 to the x86 directory. llvm-svn: 239043	2015-06-04 15:25:47 +00:00
Rafael Espindola	54a381463e	No need to check the raw relocation bytes if checking the parsed dump. llvm-svn: 239042	2015-06-04 15:21:17 +00:00
Rafael Espindola	20733034a7	llvm-readobj can parse relocations, no need to check the raw bytes.x llvm-svn: 239041	2015-06-04 15:15:12 +00:00
Rafael Espindola	7884c95c7e	Disassemble the start of sections even if there is no symbol there. We already handled a section with no symbols, extend that to also handle a section with symbols that don't include the section start. llvm-svn: 239039	2015-06-04 15:01:05 +00:00
James Molloy	37593732a4	Don't create a MIN/MAX node if the underlying compare has more than one use. If the compare in a select pattern has another use then it can't be removed, so we'd just be creating repeated code if we created a min/max node. Spotted by Matt Arsenault! llvm-svn: 239037	2015-06-04 13:48:23 +00:00
Elena Demikhovsky	2f1a0dabd0	AVX-512: I brought back vector-shuffle-512-v8.ll test. I re-generated it after all AVX-512 shuffle optimizations. llvm-svn: 239026	2015-06-04 07:49:56 +00:00
Igor Breger	8bdcb69413	Test commit llvm-svn: 239019	2015-06-04 07:23:38 +00:00
David Majnemer	0a99278f7f	Make the test introduced in r239015 more targeted. We don't need to go through LSR to trigger this bug. Instead, hand-craft a tricky GEP and get the constant folder to hack on it when parsing the IR. llvm-svn: 239017	2015-06-04 07:21:42 +00:00
Elena Demikhovsky	4078c75bd4	AVX-512: added all SKX forms of VPERMW/D/Q instructions. Added all forms of VPERMPS/PD instrcuctions. Added encoding tests. llvm-svn: 239016	2015-06-04 07:07:13 +00:00
David Majnemer	38eb9f46db	[ConstantFold] Don't skip the first gep index when folding geps We neglected to check if the first index made the GEP ineligible for 'inbounds'. This fixes PR23753. llvm-svn: 239015	2015-06-04 07:01:56 +00:00
Rafael Espindola	af5f51f6a8	Add testcase that would crash before the previous revert. llvm-svn: 239011	2015-06-04 05:51:13 +00:00
Sanjay Patel	667a7e2a0f	make reciprocal estimate code generation more flexible by adding command-line options (3rd try) The first try (r238051) to land this was reverted due to ExecutionEngine build failure; that was hopefully addressed by r238788. The second try (r238842) to land this was reverted due to BUILD_SHARED_LIBS failure; that was hopefully addressed by r238953. This patch adds a TargetRecip class for processing many recip codegen possibilities. The class is intended to handle both command-line options to llc as well as options passed in from a front-end such as clang with the -mrecip option. The x86 backend is updated to use the new functionality. Only -mcpu=btver2 with -ffast-math should see a functional change from this patch. All other x86 CPUs continue to not use reciprocal estimates by default with -ffast-math. Differential Revision: http://reviews.llvm.org/D8982 llvm-svn: 239001	2015-06-04 01:32:35 +00:00
Tom Stellard	1ba52feb96	R600: Re-enable sub-reg liveness The bug in the R600 backend that this uncovered has been fixed. llvm-svn: 238999	2015-06-04 01:20:04 +00:00
Alexey Samsonov	599dd89b33	Improve test added in r238481. llvm-svn: 238985	2015-06-03 22:36:17 +00:00
Frederic Riss	90e0bd96ff	Reapply r238941 - [dsymutil] Accept a YAML debug map as input instead of a binary. With a couple more constructors that GCC thinks are necessary. Original commit message: [dsymutil] Accept a YAML debug map as input instead of a binary. To do this, the user needs to pass the new -y flag. As it wasn't tested before, the debug map YAML deserialization was completely buggy (mainly because the DebugMapObject has a dual mapping that allows to search by name and by address, but only the StringMap got populated). It's fixed and tested in this commit by augmenting some test with a 2 stage dwarf link: a frist llvm-dsymutil reads the debug map and pipes it in a second instance that does the actual link without touching the initial binary. llvm-svn: 238959	2015-06-03 20:29:24 +00:00

1 2 3 4 5 ...

30331 Commits