llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	90fa50d928	[llvm-mca][X86] Add missing CLWB/CLZERO/FSGSBASE/LWP/MWAITX/RDPID/SHA tests We're getting pretty close to matching/exceeding test coverage of the test\CodeGen\X86\*-schedule.ll files, which should allow us to get rid of -print-schedule and fix PR37160 llvm-svn: 351836	2019-01-22 16:39:28 +00:00
Simon Pilgrim	fc4b1e841e	[llvm-mca][X86] Add missing enter/leave, invlpg/invlpga, rdmsr/wrmsr, rdpmc and rdtsc/rdtscp tests llvm-svn: 351835	2019-01-22 16:29:26 +00:00
Simon Pilgrim	4e03b2496d	[llvm-mca][X86] Add missing mfence/pinsrw tests llvm-svn: 351831	2019-01-22 16:01:08 +00:00
Simon Pilgrim	05198a9b8a	[llvm-mca][X86] Add missing monitor/mwait tests These technically should be under a MONITOR cpuid bit, but we tag them as SSE3 so I've done that here as well. llvm-svn: 351829	2019-01-22 15:48:16 +00:00
Simon Pilgrim	9b3a2f96a1	[llvm-mca][X86] Add missing vperm2i128 tests llvm-svn: 351828	2019-01-22 14:54:24 +00:00
Simon Pilgrim	1d8d6c3bfb	[llvm-mca][X86] Add missing tzcntw tests llvm-svn: 351827	2019-01-22 14:53:52 +00:00
Simon Pilgrim	c4e2776f3b	[llvm-mca][x86] Add RDRAND/RDSEED instruction resource tests llvm-svn: 348622	2018-12-07 18:29:47 +00:00
Clement Courbet	e6b727e552	[X86] Fix VZEROUPPER scheduling info on SNB,HSW,BDW,SXL,SKX. Summary: Starting from SNB, VZEROUPPER is handled by the renamer and uses no proc resources. After HSW, it also has zero latency. This fixes PR35606. To reproduce: Uops: llvm-exegesis -mode=uops -opcode-name=VZEROUPPER Latency: echo -e '#LLVM-EXEGESIS-DEFREG XMM0 1\n#LLVM-EXEGESIS-DEFREG XMM1 1\nvzeroupper' \| /tmp/llvm-exegesis -mode=latency -snippets-file=- echo -e '#LLVM-EXEGESIS-DEFREG XMM0 1\n#LLVM-EXEGESIS-DEFREG XMM1 1\nvzeroupper\naddps %xmm0, %xmm1' \| /tmp/llvm-exegesis -mode=latency -snippets-file=- Reviewers: RKSimon, craig.topper, andreadb Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D54107 llvm-svn: 346482	2018-11-09 09:49:06 +00:00
Clement Courbet	a933fb237e	[X86][Sched] Update scheduling information for VZEROALL on HWS, BDW, SKX, SNB. Summary: While looking at PR35606, I found out that the scheduling info is incorrect. One can check that it's really a P5+P6 and not a 2*P56 with: echo -e 'vzeroall\nvandps %xmm1, %xmm2, %xmm3' \| ./bin/llvm-exegesis -mode=uops -snippets-file=- (vandps executes on P5 only) Reviewers: craig.topper, RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D52541 llvm-svn: 343447	2018-10-01 08:37:48 +00:00
Simon Pilgrim	b1108399bd	[LLVM-MCA][X86] Add missing VCMPESTR/VCMPESTR tests llvm-svn: 343421	2018-09-30 18:19:00 +00:00
Simon Pilgrim	00865a48d1	[X86] Split WriteIMul into 8/16/32/64 implementations (PR36931) Split WriteIMul by size and also by IMUL multiply-by-imm and multiply-by-reg cases. This removes all the scheduler overrides for gpr multiplies and stops WriteMULH being ignored for BMI2 MULX instructions. llvm-svn: 342892	2018-09-24 15:21:57 +00:00
Simon Pilgrim	9202c9fb47	[X86] RORmCL instruction models should match ROLmCL etc. Confirmed with Craig Topper - fix a typo that was missing a Port4 uop for ROR*mCL instructions on some Intel models. Yet another step on the scheduler model cleanup marathon...... llvm-svn: 342846	2018-09-23 19:16:01 +00:00
Andrea Di Biagio	8b647dcf4b	[llvm-mca] Report the number of dispatched micro opcodes in the DispatchStatistics view. This patch introduces the following changes to the DispatchStatistics view: * DispatchStatistics now reports the number of dispatched opcodes instead of the number of dispatched instructions. * The "Dynamic Dispatch Stall Cycles" table now also reports the percentage of stall cycles against the total simulated cycles. This change allows users to easily compare dispatch group sizes with the processor DispatchWidth. Before this change, it was difficult to correlate the two numbers, since DispatchStatistics view reported numbers of instructions (instead of opcodes). DispatchWidth defines the maximum size of a dispatch group in terms of number of micro opcodes. The other change introduced by this patch is related to how DispatchStage generates "instruction dispatch" events. In particular: * There can be multiple dispatch events associated with a same instruction * Each dispatch event now encapsulates the number of dispatched micro opcodes. The number of micro opcodes declared by an instruction may exceed the processor DispatchWidth. Therefore, we cannot assume that instructions are always fully dispatched in a single cycle. DispatchStage knows already how to handle instructions declaring a number of opcodes bigger that DispatchWidth. However, DispatchStage always emitted a single instruction dispatch event (during the first simulated dispatch cycle) for instructions dispatched. With this patch, DispatchStage now correctly notifies multiple dispatch events for instructions that cannot be dispatched in a single cycle. A few views had to be modified. Views can no longer assume that there can only be one dispatch event per instruction. Tests (and docs) have been updated. Differential Revision: https://reviews.llvm.org/D51430 llvm-svn: 341055	2018-08-30 10:50:20 +00:00
Andrew V. Tischenko	1fe3375620	[X86] MCA tests for XCHG, XADD and CMPXCHG* instructions Differential Revision: https://reviews.llvm.org/D49912 llvm-svn: 339145	2018-08-07 14:36:43 +00:00
Simon Pilgrim	b911d6721d	[llvm-mca][x86] Add CMPXCHG instruction resource tests I've put CMPXCHG8B/CMPXCHG16B in the same file, even though technically they are under separate CPUID bits all targets seem to support both (or neither). llvm-svn: 338595	2018-08-01 17:25:11 +00:00
Simon Pilgrim	dcfa732b2f	[llvm-mca][x86] Add PCLMUL instruction resource tests Renamed the btver2 file that already contained them - the other targets were only testing the AVX versions llvm-svn: 338583	2018-08-01 16:25:50 +00:00
Simon Pilgrim	34ac6533f4	[llvm-mca][x86] Add SET/TEST instruction resource tests llvm-svn: 338576	2018-08-01 15:29:47 +00:00
Simon Pilgrim	e364e57ac9	[llvm-mca][x86] Add LEA instruction resource tests We already added these to btver2, now add them to other targets, even though none of their models treat them specially (yet). llvm-svn: 338565	2018-08-01 14:25:33 +00:00
Simon Pilgrim	6754913e95	[llvm-mca][x86] Add more x86-64 system instruction resource tests CPUID, IN/OUT, INS/OUTS, INT, PAUSE, SCAS, UD2, XLAT llvm-svn: 338563	2018-08-01 14:18:09 +00:00
Simon Pilgrim	bd014f4d91	[llvm-mca][x86] Add CMPS/LODS/MOVS/STOS string instruction resource tests llvm-svn: 338532	2018-08-01 13:14:45 +00:00
Simon Pilgrim	18d025a732	[llvm-mca][x86] Add STC + STD instruction resource tests llvm-svn: 338514	2018-08-01 11:00:11 +00:00
Simon Pilgrim	1f4b9cb6fe	[llvm-mca][x86] Add 32-bit instruction resource tests These aren't exhaustive, but cover some instructions that are only available in 32-bit mode (where would we be without good BCD math performance?). llvm-svn: 338404	2018-07-31 17:33:08 +00:00
Simon Pilgrim	5e729dcc03	[llvm-mca][x86] Add movsx/movzx instructions to general x86_64 resource tests llvm-svn: 337586	2018-07-20 17:43:42 +00:00
Simon Pilgrim	03164dfa5e	[llvm-mca][x86] Add extend, carry-flag and CMP instructions to general x86_64 resource tests llvm-svn: 337306	2018-07-17 17:47:35 +00:00
Simon Pilgrim	92da01fed9	[llvm-mca][x86] Add MOVBE resource tests to all supporting targets SNB doesn't support MOVBE but the numbers in Generic (which use the SNB model) look sane. llvm-svn: 337305	2018-07-17 17:41:45 +00:00
Simon Pilgrim	94049e8b15	[llvm-mca][x86] Add BSWAP resource tests llvm-svn: 337302	2018-07-17 17:10:47 +00:00
Andrea Di Biagio	483db141e3	[X86] Fix MayLoad/HasSideEffect flag for (V)MOVLPSrm instructions. Before revision 336728, the "mayLoad" flag for instruction (V)MOVLPSrm was inferred directly from the "default" pattern associated with the instruction definition. r336728 removed special node X86Movlps, and all the patterns associated to it. Now instruction (V)MOVLPSrm doesn't have a pattern associated to it, and the 'mayLoad/hasSideEffects' flags are left unset. When the instruction info is emitted by tablegen, method CodeGenDAGPatterns::InferInstructionFlags() sees that (V)MOVLPSrm doesn't have a pattern, and flags are undefined. So, it conservatively sets the "hasSideEffects" flag for it. As a consequence, we were losing the 'mayLoad' flag, and we were gaining a 'hasSideEffect' flag in its place. This patch fixes the issue (originally reported by Michael Holmen). The mca tests show the differences in the instruction info flags. Instructions that were affected by this problem were: MOVLPSrm/VMOVLPSrm/VMOVLPSZ128rm. Differential Revision: https://reviews.llvm.org/D49182 llvm-svn: 336818	2018-07-11 15:27:50 +00:00
Andrea Di Biagio	d2e2c053cf	[llvm-mca] Use a different character to flag instructions with side-effects in the Instruction Info View. NFC This makes easier to identify changes in the instruction info flags. It also helps spotting potential regressions similar to the one recently introduced at r336728. Using the same character to mark MayLoad/MayStore/HasSideEffects is problematic for llvm-lit. When pattern matching substrings, llvm-lit consumes tabs and spaces. A change in position of the flag marker may not trigger a test failure. This patch only changes the character used for flag `hasSideEffects`. The reason why I didn't touch other flags is because I want to avoid spamming the mailing because of the massive diff due to the numerous tests affected by this change. In future, each instruction flag should be associated with a different character in the Instruction Info View. llvm-svn: 336797	2018-07-11 12:44:44 +00:00
Roman Lebedev	0e58dee284	[MCA][X86][NFC] Add BSF/BSR resource tests Reviewers: RKSimon, andreadb, courbet Reviewed By: RKSimon Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D48997 llvm-svn: 336510	2018-07-08 09:50:14 +00:00
Clement Courbet	7b9913fb9f	[X86] Add sched class WriteLAHFSAHF and fix values. Summary: I ran llvm-exegesis on SKX, SKL, BDW, HSW, SNB. Atom is from Agner and SLM is a guess. I've left AMD processors alone. Reviewers: RKSimon, craig.topper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D48079 llvm-svn: 335097	2018-06-20 06:13:39 +00:00
Clement Courbet	0d9da88d18	[X86] Fix NOOP sched overrides on BDW/HSW/SKL. Summary: Noop certainly does not use resources. Reviewers: RKSimon, craig.topper, andreadb Subscribers: gbedwell, llvm-commits, gchatelet Differential Revision: https://reviews.llvm.org/D48028 llvm-svn: 334927	2018-06-18 06:48:22 +00:00
Simon Pilgrim	7a48bb6e44	[llvm-mca][x86] Fix all resources-x86_64.s tests to use different registers in reg-reg cases I noticed while working on zero-idiom + dependency-breaking support (PR36671) that most of our binary instruction tests were reusing the same src registers, which would cause the tests to fail once we enable scalar zero-idiom support on btver2. Fixed in all targets to keep them in sync. llvm-svn: 334110	2018-06-06 18:20:25 +00:00
Sanjay Patel	59313be8d3	[CodeGen] assume max/default throughput for unspecified instructions This is a fix for the problem arising in D47374 (PR37678): https://bugs.llvm.org/show_bug.cgi?id=37678 We may not have throughput info because it's not specified in the model or it's not available with variant scheduling, so assume that those instructions can execute/complete at max-issue-width. Differential Revision: https://reviews.llvm.org/D47723 llvm-svn: 334055	2018-06-05 23:34:45 +00:00
Roman Lebedev	7b53d1454f	[llvm-mca] Make sure not to end the test files with an empty line. Summary: It's super irritating. [properly configured] git client then complains about that double-newline, and you have to use `--force` to ignore the warning, since even if you fix it manually, it will be reintroduced the very next runtime :/ Reviewers: RKSimon, andreadb, courbet, craig.topper, javed.absar, gbedwell Reviewed By: gbedwell Subscribers: javed.absar, tschuett, gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D47697 llvm-svn: 333887	2018-06-04 11:48:46 +00:00
Clement Courbet	2e41c5a79c	[X86] Introduce WriteFLDC for x87 constant loads. Summary: {FLDL2E, FLDL2T, FLDLG2, FLDLN2, FLDPI} were using WriteMicrocoded. - I've measured the values for Broadwell, Haswell, SandyBridge, Skylake. - For ZnVer1 and Atom, values were transferred form InstRWs. - For SLM and BtVer2, I've guessed some values :( Reviewers: RKSimon, craig.topper, andreadb Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D47585 llvm-svn: 333656	2018-05-31 14:22:01 +00:00
Clement Courbet	b78ab5097d	[X86] Extract latency of fldz/fld1 in separate classes. Summary: - I've measured the values for Broadwell, Haswell, SandyBridge, Skylake. - For ZnVer1 and Atom, values were transferred form `InstRW`s. - For SLM and BtVer2, values are from Agner. This is split off from https://reviews.llvm.org/D47377 Reviewers: RKSimon, andreadb Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D47523 llvm-svn: 333642	2018-05-31 11:41:27 +00:00
Clement Courbet	07c9ec6f2e	[X86][Sched] Add InstRW for CLC on Intel after SNB. Summary: After SNB, Intel CPUs can rename CF independently of other EFLAGS, so the renamer can zero it for free. Note that STC still consumes resources. To reproduce: `$ llvm-exegesis -mode=uops -opcode-name=CLC` On SNB: ``` --- key: opcode_name: CLC mode: uops config: '' cpu_name: sandybridge llvm_triple: x86_64-unknown-linux-gnu num_repetitions: 10000 measurements: - { key: '3', value: 0.0014, debug_string: SBPort0 } - { key: '4', value: 0.0013, debug_string: SBPort1 } - { key: '5', value: 0.0003, debug_string: SBPort4 } - { key: '6', value: 0.0029, debug_string: SBPort5 } - { key: '10', value: 0.0003, debug_string: SBPort23 } error: '' info: 'instruction is serial, repeating a random one. Snippet: CLC ' ... ``` On HSW: ``` --- key: opcode_name: CLC mode: uops config: '' cpu_name: haswell llvm_triple: x86_64-unknown-linux-gnu num_repetitions: 10000 measurements: - { key: '3', value: 0.001, debug_string: HWPort0 } - { key: '4', value: 0.0009, debug_string: HWPort1 } - { key: '5', value: 0.0004, debug_string: HWPort2 } - { key: '6', value: 0.0006, debug_string: HWPort3 } - { key: '7', value: 0.0002, debug_string: HWPort4 } - { key: '8', value: 0.0012, debug_string: HWPort5 } - { key: '9', value: 0.0022, debug_string: HWPort6 } - { key: '10', value: 0.0001, debug_string: HWPort7 } error: '' info: 'instruction is serial, repeating a random one. Snippet: CLC ' ... ``` Reviewers: craig.topper, RKSimon Subscribers: gchatelet, llvm-commits Differential Revision: https://reviews.llvm.org/D47362 llvm-svn: 333392	2018-05-29 06:19:39 +00:00
Simon Pilgrim	e389ea0e3e	[llvm-mca][X86] Add CMOV test files llvm-svn: 332622	2018-05-17 16:29:12 +00:00
Andrea Di Biagio	45ccdd1785	[llvm-mca] Regenerate tests after r332381 and r332361. NFC llvm-svn: 332447	2018-05-16 10:12:06 +00:00
Simon Pilgrim	2aa395abcf	[llvm-mca][x86] Add F16C instruction tests llvm-svn: 332347	2018-05-15 12:50:06 +00:00
Simon Pilgrim	4135de2e93	[llvm-mca][x86] Add scalar nt-store instruction tests llvm-svn: 332262	2018-05-14 17:10:33 +00:00
Simon Pilgrim	7340d88740	[llvm-mca][x86] Add and/not/or/xor instruction tests llvm-svn: 332257	2018-05-14 16:26:24 +00:00
Simon Pilgrim	ab34aa8294	[X86] Cleanup WriteFStore/WriteVecStore schedules MOVNTPD/MOVNTPS should be WriteFStore Standardized BDW/HSW/SKL/SKX WriteFStore/WriteVecStore - fixes some missed instregex patterns. (V)MASKMOVDQU was already using the default, its costs gets increased but is still nowhere near the real cost of that nasty instruction.... llvm-svn: 331864	2018-05-09 11:01:16 +00:00
Simon Pilgrim	4283924e08	[llvm-mca][x86] Add div/idiv, mul/imul and inc/dec/neg/nop instruction tests llvm-svn: 331765	2018-05-08 13:30:58 +00:00
Simon Pilgrim	061096d2c2	[llvm-mca][x86] Remove addsubpd from SSE2 tests llvm-svn: 331678	2018-05-07 21:10:48 +00:00
Simon Pilgrim	f3ae50fca2	[X86] Split WriteFRcp/WriteFRsqrt/WriteFSqrt schedule classes WriteFRcp/WriteFRsqrt are split to support scalar, XMM and YMM/ZMM instructions. WriteFSqrt is split into single/double/long-double sizes and scalar, XMM, YMM and ZMM instructions. This removes all InstrRW overrides for these instructions. NOTE: There were a couple of typos in the Znver1 model - notably a 1cy throughput for SQRT that is highly unlikely and doesn't tally with Agner. NOTE: I had to add Agner's numbers for several targets for WriteFSqrt80. llvm-svn: 331629	2018-05-07 11:50:44 +00:00
Simon Pilgrim	8962c344f9	[llvm-mca][X86] Add BT resource tests to all models llvm-svn: 331144	2018-04-29 15:45:31 +00:00
Simon Pilgrim	2d569361fc	[llvm-mca][X86] Add add/adc + sub/sbb resource tests to all models llvm-svn: 331140	2018-04-29 11:03:25 +00:00
Simon Pilgrim	318e9d39ab	[llvm-mca][X86] Add double shift resource tests to all relevant models llvm-svn: 331109	2018-04-28 15:18:49 +00:00
Simon Pilgrim	4d0187c893	[llvm-mca][X86] Add shift/rotate resource tests to all relevant models I intend to add further instruction tests to the resources-x86_64.s test file as required, but this initial commit is to help remove a load of unnecessary InstRW overrides in a future patch llvm-svn: 331108	2018-04-28 14:56:18 +00:00

1 2

57 Commits