llvm-project

Commit Graph

Author	SHA1	Message	Date
AndreyChurbanov	5dd4d0d46f	[OpenMP] libomp: fix dynamic loop dispatcher Restructured dynamic loop dispatcher code. Fixed use of dispatch buffers for nonmonotonic dynamic (static_steal) schedule: - eliminated possibility of stealing iterations of the wrong loop when victim thread changed its buffer to work on another loop; - fixed race when victim thread changed its buffer to work in nested parallel; - eliminated "static" property of the schedule, that is now a single thread can execute whole loop. Differential Revision: https://reviews.llvm.org/D103648	2021-06-22 16:29:01 +03:00
Terry Wilmarth	25073a4ecf	[OpenMP] Add Two-level Distributed Barrier Two-level distributed barrier is a new experimental barrier designed for Intel hardware that has better performance in some cases than the default hyper barrier. This barrier is designed to handle fine granularity parallelism where barriers are used frequently with little compute and memory access between barriers. There is no need to use it for codes with few barriers and large granularity compute, or memory intensive applications, as little difference will be seen between this barrier and the default hyper barrier. This barrier is designed to work optimally with a fixed number of threads, and has a significant setup time, so should NOT be used in situations where the number of threads in a team is varied frequently. The two-level distributed barrier is off by default -- hyper barrier is used by default. To use this barrier, you must set all barrier patterns to use this type, because it will not work with other barrier patterns. Thus, to turn it on, the following settings are required: KMP_FORKJOIN_BARRIER_PATTERN=dist,dist KMP_PLAIN_BARRIER_PATTERN=dist,dist KMP_REDUCTION_BARRIER_PATTERN=dist,dist Branching factors (set with KMP_FORKJOIN_BARRIER, KMP_PLAIN_BARRIER, and KMP_REDUCTION_BARRIER) are ignored by the two-level distributed barrier. Differential Revision: https://reviews.llvm.org/D103121	2021-06-16 15:34:55 -05:00
AndreyChurbanov	610fea65e2	[OpenMP] libomp: fixed implementation of OMP 5.1 inoutset task dependence type Refactored code of dependence processing and added new inoutset dependence type. Compiler can set dependence flag to 0x8 when call __kmpc_omp_task_with_deps. All dependence flags library gets so far and corresponding dependence types: 1 - IN, 2 - OUT, 3 - INOUT, 4 - MUTEXINOUTSET, 8 - INOUTSET. Differential Revision: https://reviews.llvm.org/D97085	2021-06-16 14:47:29 +03:00
Peyton, Jonathan L	0ddde4d865	[OpenMP] Lazily assign root affinity Lazily set affinity for root threads. Previously, the root thread executing middle initialization would attempt to assign affinity to other existing root threads. This was not working properly as the set_system_affinity() function wasn't setting the affinity for the target thread. Instead, the middle init thread was resetting its own affinity using the target thread's affinity mask. Differential Revision: https://reviews.llvm.org/D103625	2021-06-15 16:21:06 -05:00
AndreyChurbanov	9ce2e5e700	Revert "[OpenMP] libomp: implement OpenMP 5.1 inoutset task dependence type" This reverts commit `a1f550e052`. Revert in order to fix backwards compatibility breakage caused by type size change for task dependence flag.	2021-06-09 17:38:38 +03:00
AndreyChurbanov	a1f550e052	[OpenMP] libomp: implement OpenMP 5.1 inoutset task dependence type Refactored code of dependence processing and added new inoutset dependence type. Compiler can set dependence flag to 0x8 when call __kmpc_omp_task_with_deps. Size of type of the dependence flag changed from 1 to 4 bytes in clang. All dependence flags library gets so far and corresponding dependence types: 1 - IN, 2 - OUT, 3 - INOUT, 4 - MUTEXINOUTSET, 8 - INOUTSET. Differential Revision: https://reviews.llvm.org/D97085	2021-06-07 21:42:51 +03:00
Hansang Bae	7ba4e96ede	[OpenMP] Use new task type/flag for taskwait depend events. Differential Revision: https://reviews.llvm.org/D103464	2021-06-02 10:16:38 -05:00
Hansang Bae	95cefacfe1	[OpenMP] Fix crashing critical section with hint clause Runtime was using the default lock type without using the hint. Differential Revision: https://reviews.llvm.org/D102955	2021-05-24 17:25:01 -05:00
AndreyChurbanov	aa6e7e8da8	[OpenMP] libomp: move warnings to after library initialization Warnings on deprecated api cannot be suppressed if the library is not initialized. With this change it is possible to set KMP_WARNINGS=false to suppress the warnings. Differential Revision: https://reviews.llvm.org/D102676	2021-05-21 23:47:23 +03:00
Shilei Tian	af6511d730	[OpenMP] Fixed Bug 49356 Bug 49356 (https://bugs.llvm.org/show_bug.cgi?id=49356) reports crash in the test case `tasking/bug_taskwait_detach.cpp`, which is caused by the wrong function declaration. `gtid` in `__kmpc_omp_task` should be `kmp_int32`. Reviewed By: AndreyChurbanov Differential Revision: https://reviews.llvm.org/D102584	2021-05-17 12:14:54 -04:00
Peyton, Jonathan L	9982f33e2c	[OpenMP] Refactor/Rework topology discovery code This patch does the following: 1) Introduce kmp_topology_t as the runtime-friendly structure (the corresponding global variable is __kmp_topology) to determine the exact machine topology which can vary widely among current and future architectures. The current design is not easy to expand beyond the assumed three layer topology: sockets, cores, and threads so a rework capable of using the existing KMP_AFFINITY mechanisms is required. This new topology structure has: * The depth and types of the topology * Ratio count for each consecutive level (e.g., number of cores per socket, number of threads per core) * Absolute count for each level (e.g., 2 sockets, 16 cores, 32 threads) * Equivalent topology layer map (e.g., Numa domain is equivalent to socket, L1/L2 cache equivalent to core) * Whether it is uniform or not The hardware threads are represented with the kmp_hw_thread_t structure. This structure contains the ids (e.g., socket 0, core 1, thread 0) and other information grabbed from the previous Address structure. The kmp_topology_t structure contains an array of these. 2) Generalize the KMP_HW_SUBSET envirable for the new kmp_topology_t structure. The algorithm doesn't assume any order with tiles,numa domains,sockets,cores,threads. Instead it just parses the envirable, makes sure it is consistent with the detected topology (including taking into account equivalent layers) and then trims away the unneeded subset of hardware threads. To enable this, a new kmp_hw_subset_t structure is introduced which contains a vector of items (hardware type, number user wants, offset). Any keyword within __kmp_hw_get_keyword() can be used as a name and can be shortened as well. e.g., KMP_HW_SUBSET=1s,2numa,4tile,2c,3t can be used on the KNL SNC-4 machine. 3) Simplify topology detection functions so they only do the singular task of detecting the machine's topology. Printing, and all canonicalizing functionality is now done afterwards. So many lines of duplicated code are eliminated. 4) Add new ll_caches and numa_domains to OMP_PLACES, and consequently, KMP_AFFINITY's granularity setting. All the names within __kmp_hw_get_keyword() are available for use in OMP_PLACES or KMP_AFFINITY's granularity setting. 5) Simplify and future-proof code where explicit lists of allowed affinity settings keywords inside if() conditions. 6) Add x86 CPUID leaf 4 cache detection to existing x2apic id method so equivalent caches could be detected (in particular for the ll_caches place). Differential Revision: https://reviews.llvm.org/D100997	2021-05-03 18:00:24 -05:00
Peyton, Jonathan L	4457565757	[OpenMP] Implement GOMP task reductions Implement the remaining GOMP_* functions to support task reductions in taskgroup, parallel, loop, and taskloop constructs. The unused mem argument to many of the work-sharing constructs has to do with the scan() directive/ inscan() modifier. If mem is set, each function will call KMP_FATAL() and tell the user scan/inscan is unsupported. The GOMP reduction implementation is kept separate from our implementation because of how GOMP presents reduction data and computes the reductions. GOMP expects the privatized copies to be present even after a #pragma omp parallel reduction(task:...) region has ended so the data is stored inside GOMP's uintptr_t* data pseudo-structure. This style is tightly coupled with GCC compiler codegen. There also isn't any init(), combiner(), fini() functions in GOMP's codegen so the two implementations were too disparate to try to wrap GOMP's around our own. Differential Revision: https://reviews.llvm.org/D98806	2021-04-16 16:36:31 -05:00
Peyton, Jonathan L	5ebbb366c4	[OpenMP] Allow affinity to re-detect for child processes Current atfork() handler for child processes does not reset the affinity masks array which prevents users from setting their own affinity in child processes. Differential Revision: https://reviews.llvm.org/D99218	2021-04-16 16:34:02 -05:00
Hansang Bae	77dc7b4653	[OpenMP] Fix printing routine for OMP_TOOL_VERBOSE_INIT Also fixed typo in the verbose message. Differential Revision: https://reviews.llvm.org/D100414	2021-04-14 07:55:26 -05:00
Shilei Tian	2df65f87c1	[OpenMP] Fixed a crash in hidden helper thread It is reported that after enabling hidden helper thread, the program can hit the assertion `new_gtid < __kmp_threads_capacity` sometimes. The root cause is explained as follows. Let's say the default `__kmp_threads_capacity` is `N`. If hidden helper thread is enabled, `__kmp_threads_capacity` will be offset to `N+8` by default. If the number of threads we need exceeds `N+8`, e.g. via `num_threads` clause, we need to expand `__kmp_threads`. In `__kmp_expand_threads`, the expansion starts from `__kmp_threads_capacity`, and repeatedly doubling it until the new capacity meets the requirement. Let's assume the new requirement is `Y`. If `Y` happens to meet the constraint `(N+8)*2^X=Y` where `X` is the number of iterations, the new capacity is not enough because we have 8 slots for hidden helper threads. Here is an example. ``` #include <vector> int main(int argc, char *argv[]) { constexpr const size_t N = 1344; std::vector<int> data(N); #pragma omp parallel for for (unsigned i = 0; i < N; ++i) { data[i] = i; } #pragma omp parallel for num_threads(N) for (unsigned i = 0; i < N; ++i) { data[i] += i; } return 0; } ``` My CPU is 20C40T, then `__kmp_threads_capacity` is 160. After offset, `__kmp_threads_capacity` becomes 168. `1344 = (160+8)*2^3`, then the assertions hit. Reviewed By: protze.joachim Differential Revision: https://reviews.llvm.org/D98838	2021-03-18 18:25:36 -04:00
Hansang Bae	a6f9cb6adc	[OpenMP] Add runtime interface for OpenMP 5.1 error directive The proposed new interface is for supporting `at(execution)` clause in the error directive. Differential Revision: https://reviews.llvm.org/D98448	2021-03-16 08:55:25 -05:00
Peyton, Jonathan L	e2738b3758	[OpenMP] Fix potential integer overflow in dynamic schedule code Restrict the chunk_size * chunk_num to only occur for valid chunk_nums and reimplement calculating the limit to avoid overflow. Differential Revision: https://reviews.llvm.org/D96747	2021-03-08 09:43:05 -06:00
Joachim Protze	35ab6d6390	[OpenMP][Tests][NFC] rename macro to avoid naming clash When including <ostream>, the register_callback macro of the OMPT callback.h clashes with a function defined in ostream. This patch renames the macro and includes ompt into the macro name.	2021-02-24 18:03:54 +01:00
Peyton, Jonathan L	56223b1e91	[OpenMP] Help static loop code avoid over/underflow This code alleviates some pathological loop parameters (lower, upper, stride) within calculations involved in the static loop code. It bounds the chunk size to the trip count if it is greater than the trip count and also minimizes problematic code for when trip count < nth. Differential Revision: https://reviews.llvm.org/D96426	2021-02-22 13:22:01 -06:00
Peyton, Jonathan L	8c73be9d86	[OpenMP] Limit number of dispatch buffers This patch limits the number of dispatch buffers (used for loop worksharing construct) to between 1 and 4096. Differential Revision: https://reviews.llvm.org/D96749	2021-02-22 13:14:28 -06:00
AndreyChurbanov	dab5d6c2eb	[OpenMP] fix race condition in test	2021-02-18 02:27:49 +03:00
AndreyChurbanov	5631842d18	[OpenMP] NFC: fix test removing the target construct	2021-02-13 04:49:52 +03:00
AndreyChurbanov	091e8daa24	[OpenMP] fix test adding mapping of shared variables	2021-02-13 04:13:54 +03:00
Nawrin Sultana	4692bb4a8a	[OpenMP] Add lower and upper bound in num_teams clause This patch adds lower-bound and upper-bound to num_teams clause according to OpenMP 5.1 specification. The initial number of teams created is implementation defined, but it will be greater than or equal to lower-bound and less than or equal to upper-bound. If num_teams clause is not specified, the number of teams created is implementation defined, but it will be greater or equal to 1. Differential Revision: https://reviews.llvm.org/D95820	2021-02-10 13:58:50 -06:00
Shilei Tian	3c31b78455	[OpenMP] Fixed an issue that taskwait doesn't work on detachable task D77609 mistakenly changed the behavior of task waiting on detachable task that a detachable task is not waited, based on https://lists.llvm.org/pipermail/openmp-dev/2021-February/003836.html. This patch fixed it. Thank Raúl for the report. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D95798	2021-02-03 13:12:43 -05:00
AndreyChurbanov	d7b12004bd	[OpenMP] libomp: implement nteams-var and teams-thread-limit-var ICVs The change includes OMP_NUM_TEAMS, OMP_TEAMS_THREAD_LIMIT env variables, omp_set_num_teams, omp_get_max_teams, omp_set_teams_thread_limit, omp_get_teams_thread_limit routines. Differential Revision: https://reviews.llvm.org/D95003	2021-02-01 22:54:11 +03:00
Tobias Hieta	c3c02d0d5a	[OpenMP] Fix python3 compatibility in openmp's lit.cfg Differential Revision: https://reviews.llvm.org/D95669	2021-02-01 08:20:26 +01:00
AndreyChurbanov	ac70a53653	[OpenMP] NFC: disabled two flakey tests as the bug in libomp not fixed yet	2021-01-29 00:54:13 +03:00
Peyton, Jonathan L	8e67134364	[OpenMP] Fix misleading warning for OMP_PLACES When OMP_PLACES contains an invalid value, the warning informs the user that the fallback is OMP_PLACES=threads, but the actual internal setting is OMP_PLACES=cores and is detected as such with KMP_SETTINGS=1. This patch informs the user that OMP_PLACES=cores is being used instead of OMP_PLACES=threads. Differential Revision: https://reviews.llvm.org/D95170	2021-01-27 14:27:24 -06:00
Nawrin Sultana	927af4b3c5	[OpenMP] Modify OMP_ALLOCATOR environment variable This patch sets the def-allocator-var ICV based on the environment variables provided in OMP_ALLOCATOR. Previously, only allowed value for OMP_ALLOCATOR was a predefined memory allocator. OpenMP 5.1 specification allows predefined memory allocator, predefined mem space, or predefined mem space with traits in OMP_ALLOCATOR. If an allocator can not be created using the provided environment variables, the def-allocator-var is set to omp_default_mem_alloc. Differential Revision: https://reviews.llvm.org/D94985	2021-01-26 18:27:39 -06:00
Shilei Tian	9d64275ae0	[OpenMP] Added the support for hidden helper task in RTL The basic design is to create an outer-most parallel team. It is not a regular team because it is only created when the first hidden helper task is encountered, and is only responsible for the execution of hidden helper tasks. We first use `pthread_create` to create a new thread, let's call it the initial and also the main thread of the hidden helper team. This initial thread then initializes a new root, just like what RTL does in initialization. After that, it directly calls `__kmpc_fork_call`. It is like the initial thread encounters a parallel region. The wrapped function for this team is, for main thread, which is the initial thread that we create via `pthread_create` on Linux, waits on a condition variable. The condition variable can only be signaled when RTL is being destroyed. For other work threads, they just do nothing. The reason that main thread needs to wait there is, in current implementation, once the main thread finishes the wrapped function of this team, it starts to free the team which is not what we want. Two environment variables, `LIBOMP_NUM_HIDDEN_HELPER_THREADS` and `LIBOMP_USE_HIDDEN_HELPER_TASK`, are also set to configure the number of threads and enable/disable this feature. By default, the number of hidden helper threads is 8. Here are some open issues to be discussed: 1. The main thread goes to sleeping when the initialization is finished. As Andrey mentioned, we might need it to be awaken from time to time to do some stuffs. What kind of update/check should be put here? Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D77609	2021-01-25 22:16:17 -05:00
AndreyChurbanov	a60bc55c69	[OpenMP] libomp: cleanup parsing of OMP_ALLOCATOR env variable. Differential Revision: https://reviews.llvm.org/D94932	2021-01-19 16:21:22 +03:00
AndreyChurbanov	aa3a59e0c6	[OpenMP][NFC] Fix test The test fails if memkind library is accessible.	2021-01-19 00:05:34 +03:00
Shilei Tian	9bf843bdc8	Revert "[OpenMP] Added the support for hidden helper task in RTL" This reverts commit `ed939f853d`.	2021-01-18 06:57:52 -05:00
Shilei Tian	ed939f853d	[OpenMP] Added the support for hidden helper task in RTL The basic design is to create an outer-most parallel team. It is not a regular team because it is only created when the first hidden helper task is encountered, and is only responsible for the execution of hidden helper tasks. We first use `pthread_create` to create a new thread, let's call it the initial and also the main thread of the hidden helper team. This initial thread then initializes a new root, just like what RTL does in initialization. After that, it directly calls `__kmpc_fork_call`. It is like the initial thread encounters a parallel region. The wrapped function for this team is, for main thread, which is the initial thread that we create via `pthread_create` on Linux, waits on a condition variable. The condition variable can only be signaled when RTL is being destroyed. For other work threads, they just do nothing. The reason that main thread needs to wait there is, in current implementation, once the main thread finishes the wrapped function of this team, it starts to free the team which is not what we want. Two environment variables, `LIBOMP_NUM_HIDDEN_HELPER_THREADS` and `LIBOMP_USE_HIDDEN_HELPER_TASK`, are also set to configure the number of threads and enable/disable this feature. By default, the number of hidden helper threads is 8. Here are some open issues to be discussed: 1. The main thread goes to sleeping when the initialization is finished. As Andrey mentioned, we might need it to be awaken from time to time to do some stuffs. What kind of update/check should be put here? Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D77609	2021-01-16 14:13:35 -05:00
Terry Wilmarth	4fe17ada55	[OpenMP] Fix hierarchical barrier Hierarchical barrier is an experimental barrier algorithm that uses aspects of machine hierarchy to define the barrier tree structure. This patch fixes offset calculation in hierarchical barrier. The offset is used to store info on a flag about sleeping threads waiting on a location stored in the flag. This commit also fixes a potential deadlock in hierarchical barrier when using infinite blocktime by adjusting the offset value of leaf kids so that it matches the value of leaf state. It also adds testing of default barriers with infinite blocktime, and also tests hierarchical barrier algorithm with both default and infinite blocktime. Patch by Terry Wilmarth and Nawrin Sultana. Differential Revision: https://reviews.llvm.org/D94241	2021-01-13 10:22:57 -06:00
Nawrin Sultana	540007b427	[OpenMP] Add strict mode in num_tasks and grainsize This patch adds new API __kmpc_taskloop_5 to accommodate strict modifier (introduced in OpenMP 5.1) in num_tasks and grainsize clause. Differential Revision: https://reviews.llvm.org/D92352	2020-12-09 16:46:30 -06:00
Joachim Protze	d3ec512b1d	[OpenMP][OMPT] Make sure that 0 is never used as ID in tests (NFC)	2020-12-04 18:41:56 +01:00
Joachim Protze	fd3d1b09c1	[OpenMP][Tests][NFC] Use FileCheck from cmake config	2020-11-30 23:16:56 +01:00
Joachim Protze	723be4042a	[OpenMP][OMPT][NFC] Fix failing test The test would fail for gcc, when built with debug flag.	2020-11-29 19:07:42 +01:00
Joachim Protze	cdf9401df8	[OpenMP][OMPT][NFC] Fix flaky test The test had a chance to finish the first task before the second task is created. In this case, the dependences-pair event would not trigger.	2020-11-29 19:07:41 +01:00
Joachim Protze	6d3b81664a	[OpenMP][OMPT] Introduce a guard to handle OMPT return address This is an alternative approach to address inconsistencies pointed out in: D90078 This patch makes sure that the return address is reset, when leaving the scope. In some cases, I had to move the macro out of an if-statement to have it in the right scope, in some cases I added an additional block to restrict the scope. This patch does not handle inconsistencies, which might occur if the return address is still set when we call into the application. Test case (repeated_calls.c) provided by @hbae Differential Revision: https://reviews.llvm.org/D91692	2020-11-25 18:17:44 +01:00
Isabel Thärigen	b281a05dac	[OpenMP][OMPT] Implement verbose tool loading OpenMP 5.1 introduces the new env variable OMP_TOOL_VERBOSE_INIT=(disabled|stdout|stderr|<filename>) to enable verbose loading and initialization of OMPT tools. This env variable helps to understand the cause when loading of a tool fails (e.g., undefined symbols or dependency not in LD_LIBRARY_PATH) Output of OMP_TOOL_VERBOSE_INIT is added for OMP_DISPLAY_ENV Tests for this patch are integrated into the different existing tool loading tests, making these tests more verbose. An Archer specific verbose test is integrated into an existing Archer test. Patch prepared by: Isabel Thärigen Differential Revision: https://reviews.llvm.org/D91464	2020-11-25 18:17:44 +01:00
Nawrin Sultana	5439db05e7	[OpenMP] Add omp_realloc implementation This patch adds omp_realloc function implementation according to OpenMP 5.1 specification. Differential Revision: https://reviews.llvm.org/D90971	2020-11-17 13:43:00 -06:00
Nawrin Sultana	938f1b8581	[OpenMP] Add omp_calloc implementation This patch adds omp_calloc implementation according to OpenMP 5.1 specification. Differential Revision: https://reviews.llvm.org/D90967	2020-11-13 14:35:46 -06:00
Shilei Tian	24d0ef0f50	[OpenMP] Fixed a bug when displaying affinity Currently the affinity format string has initial value. When users set the format via OMP_AFFINITY_FORMAT, it will overwrite the format string. However, when copying the format, the trailing null is missing. As a result, if the user format string is shorter than default value, the remaining part in the default value still takes effect. This bug is not exposed because the test case doesn't check the end of a string. It only checks whether given output "contains" the check string. Reviewed By: AndreyChurbanov Differential Revision: https://reviews.llvm.org/D91309	2020-11-12 22:27:32 -05:00
Joachim Protze	ce0911b3e9	[OpenMP][Tests] Fix compiler warnings in OpenMP runtime tests This patch allows to pass the OpenMP runtime tests after configuring with `cmake . -DOPENMP_TEST_FLAGS:STRING="-Werror"`. The warnings for OMPT tests are addressed in D90752. Differential Revision: https://reviews.llvm.org/D91280	2020-11-11 20:13:21 +01:00
Joachim Protze	6213ed062b	[OpenMP][OMPT] Update the omp-tools header file to reflect 5.1 changes This doesn't add functionality, but just adds the new types and renames the master callback to masked callback. Differential Revision: https://reviews.llvm.org/D90752	2020-11-11 20:13:21 +01:00
Joachim Protze	b0eb19bf8a	[OpenMP][OMPT][NFC] Fix flaky test As reported by @ronlieb, the test shows intermittent fails. The test failed, if the dependent task was already finished, when the depending task was to be created. We have other tests to check for the dependences pair.	2020-11-03 13:15:32 +01:00
Joachim Protze	34b34e90fc	[OpenMP][Tests] NFC: fix flaky test failure caused by rare scheduling The worker thread can start execution of the task before creation of the second task Fixes the spurious failure reported in https://reviews.llvm.org/D61657	2020-10-05 16:55:32 +02:00
Joachim Protze	6104b30446	[OpenMP][OMPT] Update OMPT tests for newly added GOMP interface patches This patch updates the expected results for the GOMP interface patches: D87267, D87269, and D87271. The taskwait-depend test is changed to really use taskwait-depend and copied to an task_if0-depend test. To pass the tests, the handling of the return address was fixed. Differential Revision: https://reviews.llvm.org/D87680	2020-10-01 00:53:41 +02:00
Peyton, Jonathan L	ee1c04a926	[OpenMP] Fix if0 task with dependencies in the runtime The current GOMP interface for serialized tasks does not take into account task dependencies. Add the check and wait for dependencies. Fixes: https://bugs.llvm.org/show_bug.cgi?id=46573 Differential Revision: https://reviews.llvm.org/D87271	2020-09-24 09:47:53 -05:00
Peyton, Jonathan L	9089b4a5c5	[OpenMP] Introduce GOMP taskwait depend in the runtime This change introduces the GOMP_taskwait_depend() function. It implements the OpenMP 5.0 feature of #pragma omp taskwait with depend() clause by wrapping around __kmpc_omp_wait_deps(). Differential Revision: https://reviews.llvm.org/D87269	2020-09-24 09:45:14 -05:00
Peyton, Jonathan L	72ada5ae6c	[OpenMP] Introduce GOMP mutexinoutset in the runtime Encapsulate GOMP task dependencies in separate class and introduce the new mutexinoutset dependency type. This separate class allows future GOMP task APIs easier access to the task dependency functionality and better ability to propagate new dependency types to all existing GOMP task APIs which use task dependencies. Differential Revision: https://reviews.llvm.org/D87267	2020-09-24 09:45:13 -05:00
Peyton, Jonathan L	ea34d95e0a	[OpenMP] Introduce GOMP teams support in runtime Implement GOMP_teams_reg() function which enables GOMP support of the standalone teams construct. The GOMP_parallel* functions were modified to call __kmp_fork_call() unconditionally so that the teams-specific code could be reused within __kmp_fork_call() instead of reproduced inside the GOMP_* functions. Differential Revision: https://reviews.llvm.org/D87167	2020-09-24 09:45:13 -05:00
Saiyedul Islam	741e55aeed	[OpenMP] Temporarily disable failing runtime tests for clang-12 Following tests were disabled for clang-11 after upgrading to version 5.0 in D82963: 1. openmp/runtime/test/env/kmp_set_dispatch_buf.c 2. openmp/runtime/test/worksharing/for/kmp_set_dispatch_buf.c They are also failing for clang-12. Thus this temporary disabling until they are fixed. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D84241	2020-07-21 15:32:46 +00:00
AndreyChurbanov	617787ea77	[OpenMP] add missed REQUIRES:ompt for 2 OMPT tests	2020-07-21 16:31:17 +03:00
Joachim Protze	f226171429	[OpenMP][Tests][NFC] Mark compatibility with older versions of clang	2020-07-20 13:53:29 +02:00
Joachim Protze	0fa0cf8638	[OpenMP][Tests] Update compatibility with GCC (NFC) Commit `95a28df5c` provided implementation for GOMP_nonmonotonicruntime* functions. Now the tests succeed with gcc 9 and 10	2020-07-08 00:27:19 +02:00
Joachim Protze	6d9626d2da	[OpenMP][Tests] Fix/Mark compatibility for GCC Reviewed by: Hahnfeld, saiislam Differential Revision: https://reviews.llvm.org/D82267	2020-07-06 23:56:09 +02:00
Saiyedul Islam	4c4bda1630	[OpenMP] Temporarily disable failing runtime tests for OpenMP 5.0 Following tests are failing after upgrading to version 5.0 but are passing for version 4.5: 1. openmp/runtime/test/env/kmp_set_dispatch_buf.c 2. openmp/runtime/test/worksharing/for/kmp_set_dispatch_buf.c To be enabled as soon as these tests are fixed. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D82963	2020-07-06 14:04:43 +00:00
Joachim Protze	8289f2891e	[OpenMP][Tests] Flag compatibility of OpenMP runtime tests with GCC versions If the compilation fails, the test is marked as unsupported. -> This will never change for a specific version of gcc If the linking fails, the test is marked as expected to fail. -> This might change as LLVM/OpenMP implements the missing GOMP interface function Reviewed by: Hahnfeld Differential Revision: https://reviews.llvm.org/D83077	2020-07-05 22:49:54 +02:00
Joachim Protze	3fc97f9636	[OpenMP][Tests] NFC use type macro in printf	2020-07-05 09:17:18 +02:00
Joachim Protze	47cb8a0f0b	[OpenMP][OMPT]Add event callbacks for taskwait with depend This adds the missing event callbacks to express dependencies on included tasks and taskwait with depend clause. The test fails for GCC, see bug report: https://bugs.llvm.org/show_bug.cgi?id=46573 Reviewed by: hbae Differential Revision: https://reviews.llvm.org/D81891	2020-07-03 09:58:31 +02:00
Joachim Protze	63a3c5925d	[OpenMP][OMPT] Pass mutexinoutset to the tool Adds OMPT support for the mutexinoutset dependency Reviewed by: hbae Differential Revision: https://reviews.llvm.org/D81890	2020-06-19 12:51:18 +02:00
Alexey Bataev	08029595ca	[OPENMP]Fix overflow during counting the number of iterations. Summary: The OpenMP loops are normalized and transformed into the loops from 0 to max number of iterations. In some cases, original scheme may lead to overflow during calculation of number of iterations. If it is unknown, if we can end up with overflow or not (the bounds are not constant and we cannot define if there is an overflow), cast original type to the unsigned. Reviewers: jdoerfert Subscribers: yaxunl, guansong, sstefan1, openmp-commits, cfe-commits, caomhin Tags: #clang, #openmp Differential Revision: https://reviews.llvm.org/D81881	2020-06-17 08:47:01 -04:00
Joachim Protze	e9b8ed1fd7	[OpenMP][Tool] Header-only multiplexing of OMPT tools Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D76012	2020-06-17 09:16:46 +02:00
Joachim Protze	cbea36903e	[OpenMP][OMPT] Add callbacks for doacross loops Adds the callbacks for ordered with source/sink dependencies. The test for task dependencies changed, because callback.h now actually prints the passed dependencies and the test also checks for the address. Reviewed by: hbae Differential Revision: https://reviews.llvm.org/D81807	2020-06-16 16:53:40 +02:00
Joachim Protze	9e5aefc5f9	[OpenMP][Tests] fix data race in an OpenMP runtime test Reviewed by: AndreyChurbanov Differential Revision: https://reviews.llvm.org/D81804	2020-06-15 18:48:35 +02:00
Joachim Protze	d056d7592a	[OpenMP][Tool] Extend reuse of OMPT testing This patch allows to specify a prefix (default:empty) to be included into print-out written by callback.h. Also adding a cmake target to find the header file from other tests. Reviewed by: jdoerfert Differential Revision: https://reviews.llvm.org/D76008	2020-06-14 15:55:32 +02:00
AndreyChurbanov	abe64360ae	[openmp] Fixed nonmonotonic schedule implementation. Differential Revision: https://reviews.llvm.org/D80942	2020-06-04 15:39:45 +03:00
Joachim Protze	10995c77b4	[OpenMP][OMPT] Fix and add event callbacks for detached tasks The OpenMP spec has the task-fulfill event for a call to omp_fulfill_event. If the task did not yet finish execution, ompt_task_early_fulfill is used, otherwise ompt_task_late_fulfill. If a task does not complete, when the execution finishes (i.e., the task goes in detached mode), ompt_task_detach instead of ompt_task_complete must be used, when the next task is scheduled. A test for both cases is included, which only work with clang-11+ Reviewed By: hbae Differential revision: https://reviews.llvm.org/D80843	2020-06-02 09:52:40 +02:00
AndreyChurbanov	5e111c5df8	[openmp] Fixed taskloop recursive splitting so that taskloop tasks have same parent tasks. Differential Revision: https://reviews.llvm.org/D80577	2020-06-01 17:51:02 +03:00
AndreyChurbanov	57d8b8d6f0	[openmp] Fixed hang if detached task was serialized. The patch fixes https://bugs.llvm.org/show_bug.cgi?id=45904. Differential Revision: https://reviews.llvm.org/D79944	2020-05-18 15:32:13 +03:00
Joel E. Denny	dd5ba4b585	[OpenMP][NFC] Fix `not` substitution in tests D78566 introduced a `\bnot\b` lit substitution in OpenMP test suites. However, that would corrupt a command like `FileCheck -implicit-check-not` or any file name like `%t.not`. We could use lookbehind/lookahead assertions to avoid such cases, but this patch switches to `%not` (suggested during the D78566 review) as a safer option. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D79529	2020-05-11 14:53:48 -04:00
Joel E. Denny	ba942610f6	[OpenMP] Add scaffolding for negative runtime tests Without this patch, the openmp project's test suites do not appear to have support for negative tests. However, D78170 needs to add a test that an expected runtime failure occurs. This patch makes `not` visible in all of the openmp project's test suites. In all but `libomptarget/test`, it should be possible for a test author to insert `not` before a use of the lit substitution for running a test program. In `libomptarget/test`, that substitution is target-specific, and its value is `echo` when the target is not available. In that case, inserting `not` before a lit substitution would expect an `echo` fail, so this patch instead defines a separate lit substitution for expected runtime fails. Reviewed By: jdoerfert, Hahnfeld Differential Revision: https://reviews.llvm.org/D78566	2020-04-21 17:10:50 -04:00
Kazuaki Ishizaki	4201679110	[OpenMP] NFC: Fix trivial typo Differential Revision: https://reviews.llvm.org/D77430	2020-04-04 12:06:54 +09:00
Vitaly Buka	c9ae3c5e10	[openmp] Disable tests flaky on Debian https://bugs.llvm.org/show_bug.cgi?id=45397	2020-04-01 21:58:05 -07:00
Alexey Bataev	0fca766458	[OPENMP50]Fix PR45117: Orphaned task reduction should be allowed. Add support for orphaned task reductions.	2020-03-27 17:47:30 -04:00
AndreyChurbanov	ae044467ed	[openmp][runtime] Fixed hang for explicit task inside a taskloop. Added missed initialization of td_last_tied field for taskloop tasks. Differential Revision: https://reviews.llvm.org/D75673	2020-03-23 20:07:30 +03:00
AndreyChurbanov	95df6747cf	[openmp] OpenMP 5.1 omp_display_env function implementation. Patch by Michael Klemm. Differential Revision: https://reviews.llvm.org/D74956	2020-03-04 18:15:05 +03:00
serge-sans-paille	99b03c1c18	Detect and disable openmp tests that require multiple hardware processor to run Team tests seem to require at least two physical cores, and using the same trick as in https://reviews.llvm.org/D55598 doesn't work (why?) . Using lit configuration instead. Differential Revision: https://reviews.llvm.org/D74921	2020-02-21 14:02:12 +01:00
Johannes Doerfert	3ff4e2eee8	[OpenMP] Switch default C++ standard to C++ 14 Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D74258	2020-02-11 17:11:54 -06:00
Jonas Devlieghere	4fe839ef3a	[CMake] Rename EXCLUDE_FROM_ALL and make it an argument to add_lit_testsuite EXCLUDE_FROM_ALL means something else for add_lit_testsuite as it does for something like add_executable. Distinguish between the two by renaming the variable and making it an argument to add_lit_testsuite. Differential revision: https://reviews.llvm.org/D74168	2020-02-06 15:33:18 -08:00
protze@itc.rwth-aachen.de	90e4ebdce5	[OpenMP][OMPT] fix reduction test for 32-bit x86 Fixes [[ https://bugs.llvm.org/show_bug.cgi?id=44733 \| TEST 'libomp :: ompt/synchronization/reduction/tree_reduce.c' FAILED on 32-bit x86 ]] For 32-bit we need at least 3 variables to avoid atomic reduction to be chosen by runtime function `__kmp_determine_reduction_method`. This patch adds reduction variables to the testcase. Reviewers: mgorny, Hahnfeld Differential Revision: https://reviews.llvm.org/D73850	2020-02-04 12:19:10 +01:00
Kelvin Li	ad24cf2a94	[OpenMP] change omp_atk_* and omp_atv_* enumerators to lowercase [NFC] The OpenMP spec defines the OMP_ATK_* and OMP_ATV_* to be lowercase. Differential Revision: https://reviews.llvm.org/D73248	2020-01-23 11:15:44 -05:00
Kazuaki Ishizaki	4c6a098ad5	[OpenMP] NFC: Fix trivial typos in comments Reviewers: jdoerfert, Jim Reviewed By: Jim Subscribers: Jim, mgorny, guansong, jfb, openmp-commits Tags: #openmp Differential Revision: https://reviews.llvm.org/D72285	2020-01-07 14:05:03 +08:00
Kelvin Li	ed5fe64581	[OpenMP] NFC: Fix trivial typos in comments Submitted by: kiszk Differential Revision: https://reviews.llvm.org/D72171	2020-01-03 22:03:42 -05:00
protze@itc.rwth-aachen.de	3356e268f6	[OpenMP] Implementation of OMPT reduction callbacks Including two tests These callbacks were added late to the 5.0 specification, an implementation is missing. Reviewed By: jdoerfert Differential Review: https://reviews.llvm.org/D70395	2019-12-27 15:30:51 +01:00
AndreyChurbanov	bd2fb41c2d	[openmp] Fixed nonmonotonic schedule when #threads > #chunks in a loop. Differential Revision: https://reviews.llvm.org/D70713	2019-11-27 15:26:51 +03:00
Sylvestre Ledru	9b40a7f3bf	Remove +x permission on some files	2019-11-16 14:47:20 +01:00
Michał Górny	6f8ee2c575	[openmp] [test] Skip one more test that kills NetBSD buildbot	2019-11-07 17:29:57 +01:00
Andrey Churbanov	de44f434e8	fixed test: eliminated race condition which might cause deadlock llvm-svn: 372887	2019-09-25 15:25:52 +00:00
Andrey Churbanov	a1639b9bba	Enable tasks dependencies hashmaps resizing. Patch by viroulep (Philippe Virouleau) Differential Revision: https://reviews.llvm.org/D67447 llvm-svn: 372879	2019-09-25 14:40:19 +00:00
Hansang Bae	dcdbe6515b	[OpenMP] Fix broken build due to new OMPT tests New OMPT tests with teams construct should be disabled for GCC as it emits code with a GOMP entry not supported in the LLVM runtime. Differential Revision: https://reviews.llvm.org/D65757 llvm-svn: 367939	2019-08-05 21:46:13 +00:00
Hansang Bae	67e93a1ae0	Add OMPT support for teams construct This change adds OMPT support for events from teams construct. Differential Revision: https://reviews.llvm.org/D64025 llvm-svn: 367746	2019-08-03 02:38:53 +00:00
Jonas Hahnfeld	2488ae9df1	[OpenMP] RISCV64 port This is a port of libomp for the RISC-V 64-bit Linux target. We have tested this port on a HiFive Unleashed development board using a downstream LLVM that has support for the missing bits in upstream. As of now, all tests are passing, including OMPT. Patch by Ferran Pallarès! Differential Revision: https://reviews.llvm.org/D59880 llvm-svn: 367021	2019-07-25 14:36:20 +00:00
Jonas Hahnfeld	a2748c74d6	[OMPT] Cleanup reset of exit_frame pointer This is done at call-site and does not need to be handled in __kmp_invoke_microtask. It was already absent from the x86 and x86_64 assembly, this patch removes it from the generic implementation in z_Linux_util.cpp and adds documentation for AArch64 and PPC64 that it's actually not needed. I can't test on these architectures, so I don't want to change the code just because it looks right :) While at it, rename some variables for consistency and add a check in test/ompt/parallel/normal.c that the pointer was reset before entering the barrier. Differential Revision: https://reviews.llvm.org/D64442 llvm-svn: 366721	2019-07-22 18:46:02 +00:00
Jonathan Peyton	aa5cdafa40	Remove REQUIRES OMP spec version within lit tests This is a follow up patch to D64534 (r365963) which removed all OMP spec versioning within the OpenMP runtime codebase. This patch removes REQUIRES: openmp-x.y lines from lit tests. llvm-svn: 366341	2019-07-17 15:41:00 +00:00
Jonathan Peyton	e4b4f994d2	[OpenMP] Remove OMP spec versioning Remove all older OMP spec versioning from the runtime and build system. Patch by Terry Wilmarth Differential Revision: https://reviews.llvm.org/D64534 llvm-svn: 365963	2019-07-12 21:45:36 +00:00

339 Commits