llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	3ff9cc01f2	[X86] Use CMOVNS for abs instead of CMOVGE. CMOVGE reads SF and OF. CMOVNS only reads SF. This matches with other recent changes to use a single flag where possible. It also matches gcc codegen. I believe this technically changes whether the conditioanl move happens on INT_MIN, but for INT_MIN both registers are the same so it doesn't matter. Differential Revision: https://reviews.llvm.org/D111826	2021-10-14 12:28:28 -07:00
David Green	adec922361	[AArch64] Make -mcpu=generic schedule for an in-order core We would like to start pushing -mcpu=generic towards enabling the set of features that improves performance for some CPUs, without hurting any others. A blend of the performance options hopefully beneficial to all CPUs. The largest part of that is enabling in-order scheduling using the Cortex-A55 schedule model. This is similar to the Arm backend change from `eecb353d0e` which made -mcpu=generic perform in-order scheduling using the cortex-a8 schedule model. The idea is that in-order cpu's require the most help in instruction scheduling, whereas out-of-order cpus can for the most part out-of-order schedule around different codegen. Our benchmarking suggests that hypothesis holds. When running on an in-order core this improved performance by 3.8% geomean on a set of DSP workloads, 2% geomean on some other embedded benchmark and between 1% and 1.8% on a set of singlecore and multicore workloads, all running on a Cortex-A55 cluster. On an out-of-order cpu the results are a lot more noisy but show flat performance or an improvement. On the set of DSP and embedded benchmarks, run on a Cortex-A78 there was a very noisy 1% speed improvement. Using the most detailed results I could find, SPEC2006 runs on a Neoverse N1 show a small increase in instruction count (+0.127%), but a decrease in cycle counts (-0.155%, on average). The instruction count is very low noise, the cycle count is more noisy with a 0.15% decrease not being significant. SPEC2k17 shows a small decrease (-0.2%) in instruction count leading to a -0.296% decrease in cycle count. These results are within noise margins but tend to show a small improvement in general. When specifying an Apple target, clang will set "-target-cpu apple-a7" on the command line, so should not be affected by this change when running from clang. This also doesn't enable more runtime unrolling like -mcpu=cortex-a55 does, only changing the schedule used. A lot of existing tests have updated. This is a summary of the important differences: - Most changes are the same instructions in a different order. - Sometimes this leads to very minor inefficiencies, such as requiring an extra mov to move variables into r0/v0 for the return value of a test function. - misched-fusion.ll was no longer fusing the pairs of instructions it should, as per D110561. I've changed the schedule used in the test for now. - neon-mla-mls.ll now uses "mul; sub" as opposed to "neg; mla" due to the different latencies. This seems fine to me. - Some SVE tests do not always remove movprfx where they did before due to different register allocation giving different destructive forms. - The tests argument-blocks-array-of-struct.ll and arm64-windows-calls.ll produce two LDR where they previously produced an LDP due to store-pair-suppress kicking in. - arm64-ldp.ll and arm64-neon-copy.ll are missing pre/postinc on LPD. - Some tests such as arm64-neon-mul-div.ll and ragreedy-local-interval-cost.ll have more, less or just different spilling. - In aarch64_generated_funcs.ll.generated.expected one part of the function is no longer outlined. Interestingly if I switch this to use any other scheduled even less is outlined. Some of these are expected to happen, such as differences in outlining or register spilling. There will be places where these result in worse codegen, places where they are better, with the SPEC instruction counts suggesting it is not a decrease overall, on average. Differential Revision: https://reviews.llvm.org/D110830	2021-10-09 15:58:31 +01:00
Alex Richardson	547e5e4ae6	[update_llc_test_checks.py] Fix MIPS ASM regex for functions with EH On MIPS, functions with exception handling code emits an additional temporary label at the start of the function (due to UseAssignmentForEHBegin): _Z8do_catchv: # @_Z8do_catchv .Ltmp3: .set .Lfunc_begin0, .Ltmp3 .cfi_startproc .cfi_personality 128, DW.ref.__gxx_personality_v0 .cfi_lsda 0, .Lexception0 .frame $c11,48,$c17 .mask 0x00000000,0 .fmask 0x00000000,0 .set noreorder .set nomacro .set noat # %bb.0: # %entry The `[^:]*` regex was terminating the search after .Ltmp<N>: and therefore not detecting functions with exception handling. Reviewed By: atanasyan, MaskRay Differential Revision: https://reviews.llvm.org/D100027	2021-09-28 17:57:36 +01:00
Alex Richardson	ee3109b044	[update_llc_test_checks] Baseline test for D100027 Show that we fail to generate CHECK lines for MIPS64 functions with EH. Differential Revision: https://reviews.llvm.org/D110408	2021-09-28 17:57:36 +01:00
Sebastian Neubauer	ecd5145c27	[Utils] Replace llc with cat for tests Make the update_llc_test_checks script test independant of llc behavior by using cat with static files to simulate llc output. This allows changing llc without breaking the script test case. The update script is executed in a temporary directory, so the llc-generated assembly files are copied there. %T is deprecated, but it allows copying a file with a predictable filename. Differential Revision: https://reviews.llvm.org/D110143	2021-09-22 10:10:35 +02:00
Alex Richardson	817e23d481	[update_mir_test_checks.py] Use -NEXT FileCheck directories Previously the script emitted output using plain CHECK directives. This can result in a test passing even if there are some instructions between CHECK directives that should have been removed. It also makes debugging tests that have the output in a different order more difficult since FileCheck can match with a later line and then complain about the "wrong" directive not being found. This will cause quite large diffs when updating existing tests, but I'm not sure we need an opt-in flag here. Depends on D109765 (pre-commit tests) Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D109767	2021-09-20 12:55:56 +01:00
Alex Richardson	7b68c0725d	pre-commit test for D109767 Differential Revision: https://reviews.llvm.org/D109765	2021-09-20 12:55:56 +01:00
Jason Molenda	0d8cd4e2d5	[AArch64InstPrinter] Change printAddSubImm to comment imm value when shifted Add a comment when there is a shifted value, add x9, x0, #291, lsl #12 ; =1191936 but not when the immediate value is unshifted, subs x9, x0, #256 ; =256 when the comment adds nothing additional to the reader. Differential Revision: https://reviews.llvm.org/D107196	2021-08-03 02:28:46 -07:00
Sebastian Neubauer	4864893127	[Utils] Do not remove comments in llc test script When checking if two prefixes can be merged for a function, update_llc_test_checks.py removed IR comments before comparing llc outputs of different RUN lines. This means, if one RUN line emited lines starting with ';' and another RUN line emited the same lines except the ones starting with ';', both RUNs would be merged (if they share a prefix). However, CHECK-NEXT lines check the comments, otherwise they fail, so the script should not merge RUNs if they contain different comments. Differential Revision: https://reviews.llvm.org/D101312	2021-07-29 13:03:05 +02:00
Roman Lebedev	707224ea16	[NFC] Update arm_function_name.ll after `4de0c40031`	2021-06-23 16:41:43 +03:00
Jinsong Ji	882e4cbd74	[AIX][AsmPrinter] Print Symbol in comments for TOC load We are using TOCEntry symbols like `LC..0` in TOC loads, this is hard to read , at least requiring an additional step to figure out the loaded symbols. We should print out the name in comments. Reviewed By: #powerpc, shchenz Differential Revision: https://reviews.llvm.org/D102949	2021-05-25 16:37:40 +00:00
serge-sans-paille	4ab3041acb	Revert "[NFC] remove explicit default value for strboolattr attribute in tests" This reverts commit `bda6e5bee0`. See https://lab.llvm.org/buildbot/#/builders/109/builds/15424 for instance	2021-05-24 19:43:40 +02:00
serge-sans-paille	bda6e5bee0	[NFC] remove explicit default value for strboolattr attribute in tests Since `d6de1e1a71`, no attributes is quivalent to setting attribute to false. This is a preliminary commit for https://reviews.llvm.org/D99080	2021-05-24 19:31:04 +02:00
Fangrui Song	5d9ea36baf	[UpdateTestChecks] Default --x86_scrub_rip to False True is a bad default: the useful symbol names and `@GOTPCREL` are scrubbed. Change the default and add global variable tests to x86-basic.ll (renamed from x86_function_name.ll since we now also test variables). I updated some tests to show the differences. Updated LCPI regex to include Darwin style `LCPI_[0-9]+_[0-9]+` (no leading dot). Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D102588	2021-05-21 19:26:15 -07:00
Giorgis Georgakoudis	404fa9a6cf	[Utils] Add prof metadata to matched unnamed values Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D101742	2021-05-03 15:15:34 -07:00
Alex Richardson	9692811b26	[update_(llc_)test_checks.py] Support pre-processing commands This has been rather useful in our downstream CHERI target where we want to run tests both with addrspace(0) and addrspace(200) pointers. With this patch we can prefix the opt command with `sed -e 's/addrspace(200)/addrspace(0)/g' -e 's/-A200-P200-G200//g'` to test both cases using the same IR input. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D95137	2021-04-28 12:19:19 +01:00
Johannes Doerfert	cd1bd6e587	[Utils] Check for more global information in update_test_checks This allows to check for various globals (metadata/attributes/...) and also resolves problems with globals (metadata/attributes/...) being reused across different prefixes. Reviewed By: sstefan1 Differential Revision: https://reviews.llvm.org/D94741	2021-03-11 23:31:16 -06:00
Mircea Trofin	f31ea86c80	Revert "[Utils] Add a switch controlling prefix warnings in UpdateTestChecks" This reverts commit `87f8a08ce3`.	2021-02-08 11:21:56 -08:00
Mircea Trofin	87f8a08ce3	[Utils] Add a switch controlling prefix warnings in UpdateTestChecks The switch controls both unused prefix warnings, and warnings about functions which differ under different runs for a prefix, and, thus, end up not having asserts for that prefix. (If the latter case spans to all functions, then the former case kicks in) The switch is on by default, and can be disabled. Differential Revision: https://reviews.llvm.org/D95829	2021-02-01 18:04:18 -08:00
Ben Shi	2d7aa149a4	[update_llc_test_checks] Support AVR Reviewed By: arichardson Differential Revision: https://reviews.llvm.org/D95240	2021-01-26 17:50:56 +08:00
Yvan Roux	244ad228f3	[ARM][MachineOutliner] Add stack fixup feature This patch handles cases where we have to save/restore the link register into the stack and and load/store instruction which use the stack are part of the outlined region. It checks that there will be no overflow introduced by the new offset and fixup these instructions accordingly. Differential Revision: https://reviews.llvm.org/D92934	2021-01-19 10:59:09 +01:00
Qiu Chaofan	ae61485163	[UpdateTestChecks] Fix PowerPC RE to support AIX assembly Current update_llc_test_checks.py cannot generate checks for AIX (powerpc64-ibm-aix-xcoff) properly. Assembly generated is little bit different from Linux. So I use begin function comment here to capture function name. Reviewed By: MaskRay, steven.zhang Differential Revision: https://reviews.llvm.org/D93676	2021-01-05 10:28:00 +08:00
Fangrui Song	bf1160c1d6	[test] Add explicit dso_local to definitions in ELF static relocation model tests	2020-12-30 16:52:23 -08:00
Fangrui Song	a964e0f085	[test] Add explicit dso_local to definitions in ELF static relocation model tests	2020-12-30 15:47:16 -08:00
Fangrui Song	7181df1e49	[update_llc_test_checks] Support Windows .seh_proc for x86	2020-12-30 12:32:47 -08:00
Fangrui Song	294a196b04	[update_llc_test_checks] Support .Lfunc$local for x86 -relocation-model=pic dsolocal tests	2020-12-30 11:59:36 -08:00
Mircea Trofin	e2dc306b1a	[utils] Fix UpdateTestChecks case where 2 runs differ for last label Two RUN lines produce outputs that, each, have some common parts and some different parts. The common parts are checked under label A. The differing parts are associated to a function and checked under labels B and C, respectivelly. When build_function_body_dictionary is called for the first RUN line, it will attribute the function body to labels A and C. When the second RUN is passed to build_function_body_dictionary, it sees that the function body under A is different from what it has. If in this second RUN line, A were at the end of the prefixes list, A's body is still kept associated with the first run's function. When we output the function body (i.e. add_checks), we stop after emitting for the first prefix matching that function. So we end up with the wrong function body (first RUN's A-association). There is no reason to special-case the last label in the prefixes list, and the fix is to always clear a label association if we find a RUN line where the body is different. Differential Revision: https://reviews.llvm.org/D93078	2020-12-15 07:16:54 -08:00
Yvan Roux	dee452da7a	[UpdateTestChecks] Fix $ in function test for ARM. Removes AArch64 target checking inside 32bit ARM test to bring back buildbots to a green state. But $ are not well handled for ARM and it still need to be fixed.	2020-11-13 12:26:56 +01:00
Paul Robinson	def26af4ea	Revert "The arm64 triple requires AArch64 not ARM target" This reverts commit `e7256825d5`. apparently it's not that simple. https://lab.llvm.org:8011/#/builders/109/2412	2020-11-10 08:27:05 -08:00
Paul Robinson	e7256825d5	The arm64 triple requires AArch64 not ARM target Failure seen if you configure ARM target but not AArch64, as here: http://lab.llvm.org:8011/#/builders/59/builds/271	2020-11-10 08:19:02 -08:00
Hans Wennborg	0628bea513	Revert "[PM/CC1] Add -f[no-]split-cold-code CC1 option to toggle splitting" This broke Chromium's PGO build, it seems because hot-cold-splitting got turned on unintentionally. See comment on the code review for repro etc. > This patch adds -f[no-]split-cold-code CC1 options to clang. This allows > the splitting pass to be toggled on/off. The current method of passing > `-mllvm -hot-cold-split=true` to clang isn't ideal as it may not compose > correctly (say, with `-O0` or `-Oz`). > > To implement the -fsplit-cold-code option, an attribute is applied to > functions to indicate that they may be considered for splitting. This > removes some complexity from the old/new PM pipeline builders, and > behaves as expected when LTO is enabled. > > Co-authored by: Saleem Abdulrasool <compnerd@compnerd.org> > Differential Revision: https://reviews.llvm.org/D57265 > Reviewed By: Aditya Kumar, Vedant Kumar > Reviewers: Teresa Johnson, Aditya Kumar, Fedor Sergeev, Philip Pfaffe, Vedant Kumar This reverts commit `273c299d5d`.	2020-10-19 12:31:14 +02:00
Vedant Kumar	273c299d5d	[PM/CC1] Add -f[no-]split-cold-code CC1 option to toggle splitting This patch adds -f[no-]split-cold-code CC1 options to clang. This allows the splitting pass to be toggled on/off. The current method of passing `-mllvm -hot-cold-split=true` to clang isn't ideal as it may not compose correctly (say, with `-O0` or `-Oz`). To implement the -fsplit-cold-code option, an attribute is applied to functions to indicate that they may be considered for splitting. This removes some complexity from the old/new PM pipeline builders, and behaves as expected when LTO is enabled. Co-authored by: Saleem Abdulrasool <compnerd@compnerd.org> Differential Revision: https://reviews.llvm.org/D57265 Reviewed By: Aditya Kumar, Vedant Kumar Reviewers: Teresa Johnson, Aditya Kumar, Fedor Sergeev, Philip Pfaffe, Vedant Kumar	2020-10-15 23:13:33 +00:00
David Greene	b877933784	[UpdateTestChecks] Remove bug-exposing test Remove RISCV codegen tests for --include-generated-funcs because apparently MachineOutliner has a bug on that target that is exposed by expensive-checks.	2020-09-23 11:28:28 -05:00
Sebastian Neubauer	a343b9b032	Revert "[AMDGPU] Insert waitcnt after returning from call" This reverts commit `ca907bfb57`. According to michel.daenzer, > This completely broke the Mesa radeonsi driver on Navi 14. Xorg + > xterm come up with major corruption & psychedelic colours.	2020-09-23 17:16:39 +02:00
Sebastian Neubauer	ca907bfb57	[AMDGPU] Insert waitcnt after returning from call When memory operations are outstanding on function calls, either the caller or the callee can insert a waitcnt to ensure that all reads are finished. Calls need some time to be executed, so if the callee inserts the waitcnt, filling the instruction buffer and waiting for memory will be interleaved, hiding some latency. This comes at the cost of having a waitcnt inside functions that may not be needed as no memory operations are outstanding. For function calls, this is already implemented. The same principal applies to returns: If the caller inserts a waitcnt after the call, the callee does not have to wait and the return and memory operation can be run in parallel. This commit implements waiting in the caller after returning from a function call. Differential Revision: https://reviews.llvm.org/D87674	2020-09-23 12:17:59 +02:00
David Greene	7c8bb409f3	[UpdateCCTestChecks] Include generated functions if asked Add the --include-generated-funcs option to update_cc_test_checks.py so that any functions created by the compiler that don't exist in the source will also be checked. We need to maintain the output order of generated function checks so that CHECK-LABEL works properly. To do so, maintain a list of functions output for each prefix in the order they are output. Use this list to output checks for generated functions in the proper order. Differential Revision: https://reviews.llvm.org/D83004	2020-09-18 06:34:59 -05:00
David Greene	ce0eb81c72	[UpdateTestChecks] Allow $ in function names Some compilers generation functions with '$' in their names, so recognize those functions. This also requires recognizing function names inside quotes in some contexts in order to escape certain characters. Differential Revision: https://reviews.llvm.org/D82995	2020-09-16 14:34:18 -05:00
sstefan1	7128e64706	[UpdateTestChecks] include { in function signature check line After D85099, if we have attribute group in the function signature that hasn't been seen before, and later a callsite with the same attribute group, filecheck will evaluate the first attribute group to for example '#0 {'. We now include { in the args_and_sig group to avoid this. Differential Revision: https://reviews.llvm.org/D86769	2020-08-28 23:44:03 +02:00
Johannes Doerfert	97ce7fd89f	[UpdateTestChecks] Match unnamed values like "@[0-9]+" and "![0-9]+" With this patch we will match most uses of "temporary" named things in the IR via regular expressions, not their name at creation time. The new "values" we match are: - "unnamed" globals: `@[0-9]+` - debug metadata: `!dbg ![0-9]+` - loop metadata: `!loop ![0-9]+` - tbaa metadata: `!tbaa ![0-9]+` - range metadata: `!range ![0-9]+` - generic metadata: `metadata ![0-9]+` - attributes groups: `#[0-9]` We still don't match the declarations but that can be done later. This patch can introduce churn when existing check lines contain the old hardcoded versions of the above "values". We can add a flag to opt-out, or opt-in, if necessary. Reviewed By: arichardson, MaskRay Differential Revision: https://reviews.llvm.org/D85099	2020-08-12 01:04:16 -05:00
sstefan1	cf11050696	[Utils][Fix] remove unnecessary ; at the end	2020-07-19 20:48:28 +02:00
sstefan1	937bad3594	[Utils] Check function attributes in update_test_checks Summary: This introduces new flag to the update_test_checks and update_cc_test_checks that allows for function attributes to be checked in a check-line. If the flag is not set, the behavior should remain the same. Reviewers: jdoerfert Subscribers: arichardson, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83629	2020-07-19 20:07:24 +02:00
Alex Richardson	1be92dd207	Add missing REQUIRES: x86-registered-target This should fix build bot failures after `a80afc0328`	2020-07-08 11:39:29 +01:00
Alex Richardson	a80afc0328	[UpdateTestChecks] Add UTC_ARGS support for update_{llc,cc}_test_checks.py https://reviews.llvm.org/D69701 added support for on-the-fly argument changes for update scripts. I recently wanted to keep some manual check lines in a test generated by update_cc_test_checks.py in our CHERI fork, so this commit adds support for UTC_ARGS in update_cc_test_checks.py. And since I was refactoring the code to be in common.py, I also added it for update_llc_test_checks.py. Reviewed By: jdoerfert, MaskRay Differential Revision: https://reviews.llvm.org/D78478	2020-07-08 11:00:10 +01:00
Sanjay Patel	e5b8772756	[utils] change default nameless value to "TMP" This is effectively reverting rGbfdc2552664d to avoid test churn while we figure out a better way forward. We at least salvage the warning on name conflict from that patch though. If we change the default string again, we may want to mass update tests at the same time. Alternatively, we could live with the poor naming if we change -instnamer. This also adds a test to LLVM as suggested in the post-commit review. There's a clang test that is also affected. That seems like a layering violation, but I have not looked at fixing that yet. Differential Revision: https://reviews.llvm.org/D80584	2020-06-01 06:54:45 -04:00
Alex Richardson	f50bc823fe	[UpdateTestChecks] Make generation of UTC_ARGS: comment more robust We now use the argparse Action objects to determine the name of the flags. This fixes cases where the key for the stored result ('dest') is not the same as the command line flag (e.g. --enable/--disable). Also add a test that --disabled can be part of the initial UTC_ARGS. This is split out from D78478 Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D78617	2020-04-23 13:12:27 +01:00
Johannes Doerfert	a8b2fed0ae	[Utils][FIX] Properly deal with occasionally deleted functions While D68850 allowed functions to be deleted I accidentally saved some version of the function to be used once a suitable prefix was found. This turned out to be problematic when the occasionally deleted function is also occasionally modified. The test case is adjusted to resemble the case in which the problem was found. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D76586	2020-04-01 21:56:18 -05:00
Alex Richardson	61dd0603bd	Move update_cc_test_checks.py tests to clang Having tests that depend on clang inside llvm/ are not a good idea since it can break incremental `ninja check-llvm`. Fixes https://llvm.org/PR44798 Reviewed By: lebedev.ri, MaskRay, rsmith Differential Revision: https://reviews.llvm.org/D74051	2020-02-14 14:39:55 +00:00
Johannes Doerfert	1698cc7eba	[Utils] Allow "on-the-fly" argument changes for update_test_check scripts Update test scripts were limited because they performed a single action on the entire file and if that action was controlled by arguments, like the one introduced in D68819, there was no record of it. This patch introduces the capability of changing the arguments passed to the script "on-the-fly" while processing a test file. In addition, an "on/off" switch was added so that processing can be disabled for parts of the file where the content is simply copied. The last extension is a record of the invocation arguments in the auto generated NOTE. These arguments are also picked up in a subsequent invocation, allowing updates with special options enabled without user interaction. To change the arguments the string `UTC_ARGS:` has to be present in a line, followed by "additional command line arguments". That is everything that follows `UTC_ARGS:` will be added to a growing list of "command line arguments" which is reparsed after every update. Reviewed By: arichardson Differential Revision: https://reviews.llvm.org/D69701	2020-02-11 16:29:46 -06:00
Alex Richardson	3114435168	Re-enable a update_cc_test_checks.py tests This test was not running because it still had a REQUIRES: python3 line. As this is no longer necessary, remove the REQUIRES to run the test again.	2020-02-05 15:37:30 +00:00
Alex Richardson	1132f87fbf	[update_cc_test_checks] Don't attach CHECK lines to function declarations Previously we were adding the CHECK lines to both definitions and declarations. Update the JSON AST dump parsing code to skip all FunctionDecls without an "inner" node (i.e. no body). Reviewed By: MaskRay, greened Differential Revision: https://reviews.llvm.org/D73708	2020-02-04 08:41:26 +00:00

1 2

59 Commits