llvm-project

Commit Graph

Author	SHA1	Message	Date
Martin Storsjö	97958c9bb8	[openmp] Support building for armv7 Windows with mingw tools This does things in the same way as D137168 / `a356782426` and D101173 / `4fb0aaf033` did for aarch64. This adds a C implementation of __kmp_invoke_microtask in the same way as the fallback C implementation in z_Linux_util.cpp. Both the existing C fallback used on arm linux, and this one added here, fail test/misc_bugs/many-microtask-args.c similarly (which could be considered as an XFAIL). Differential Revision: https://reviews.llvm.org/D138689	2022-11-25 21:46:31 +02:00
Jonathan Peyton	174502fc14	[OpenMP][libomp] Parameterize affinity functions This patch parameterizes the affinity initialization code to allow multiple affinity settings. Almost all global affinity settings are consolidated and put into a structure kmp_affinity_t. This is in anticipation of the addition of hidden helper affinity which will have the same syntax and semantics as KMP_AFFINITY only for the hidden helper team. Differential Revision: https://reviews.llvm.org/D135109	2022-10-28 15:21:06 -05:00
Jonathan Peyton	1234011b80	[OpenMP][libomp] Introduce oneAPI compiler support Introduce KMP_COMPILER_ICX macro to represent compilation with oneAPI compiler. Fix up flag detection and compiler ID detection in CMake. Older CMake versions detect IntelLLVM as Clang. Fix compiler warnings. Fix up many of the tests to have non-empty parallel regions as they are elided by oneAPI compiler.	2022-02-14 14:10:33 -06:00
Terry Wilmarth	2e02579a76	[OpenMP] Add use of TPAUSE Add use of TPAUSE (from WAITPKG) to the runtime for Intel hardware, with an envirable to turn it on in a particular C-state. Always uses TPAUSE if it is selected and enabled by Intel hardware and presence of WAITPKG, and if not, falls back to old way of checking __kmp_use_yield, etc. Differential Revision: https://reviews.llvm.org/D115758	2022-01-18 10:14:32 -06:00
Terry Wilmarth	d8e4cb9121	[OpenMP] libomp: Add new experimental barrier: two-level distributed barrier Two-level distributed barrier is a new experimental barrier designed for Intel hardware that has better performance in some cases than the default hyper barrier. This barrier is designed to handle fine granularity parallelism where barriers are used frequently with little compute and memory access between barriers. There is no need to use it for codes with few barriers and large granularity compute, or memory intensive applications, as little difference will be seen between this barrier and the default hyper barrier. This barrier is designed to work optimally with a fixed number of threads, and has a significant setup time, so should NOT be used in situations where the number of threads in a team is varied frequently. The two-level distributed barrier is off by default -- hyper barrier is used by default. To use this barrier, you must set all barrier patterns to use this type, because it will not work with other barrier patterns. Thus, to turn it on, the following settings are required: KMP_FORKJOIN_BARRIER_PATTERN=dist,dist KMP_PLAIN_BARRIER_PATTERN=dist,dist KMP_REDUCTION_BARRIER_PATTERN=dist,dist Branching factors (set with KMP_FORKJOIN_BARRIER, KMP_PLAIN_BARRIER, and KMP_REDUCTION_BARRIER) are ignored by the two-level distributed barrier. Patch fixed for ITTNotify disabled builds and non-x86 builds Co-authored-by: Jonathan Peyton <jonathan.l.peyton@intel.com> Co-authored-by: Vladislav Vinogradov <vlad.vinogradov@intel.com> Differential Revision: https://reviews.llvm.org/D103121	2021-07-29 14:09:26 -05:00
Johannes Doerfert	4eb90e893f	Revert "[OpenMP] Add Two-level Distributed Barrier" This reverts commit `25073a4ecf`. This breaks non-x86 OpenMP builds for a while now. Until a solution is ready to be upstreamed we revert the feature and unblock those builds. See: https://reviews.llvm.org/rG25073a4ecfc9b2e3cb76776185e63bfdb094cd98#1005821 and https://reviews.llvm.org/rG25073a4ecfc9b2e3cb76776185e63bfdb094cd98#1005821 The currently proposed fix (D104788) seems not to be ready yet: https://reviews.llvm.org/D104788#2841928	2021-06-29 09:38:27 -05:00
Terry Wilmarth	25073a4ecf	[OpenMP] Add Two-level Distributed Barrier Two-level distributed barrier is a new experimental barrier designed for Intel hardware that has better performance in some cases than the default hyper barrier. This barrier is designed to handle fine granularity parallelism where barriers are used frequently with little compute and memory access between barriers. There is no need to use it for codes with few barriers and large granularity compute, or memory intensive applications, as little difference will be seen between this barrier and the default hyper barrier. This barrier is designed to work optimally with a fixed number of threads, and has a significant setup time, so should NOT be used in situations where the number of threads in a team is varied frequently. The two-level distributed barrier is off by default -- hyper barrier is used by default. To use this barrier, you must set all barrier patterns to use this type, because it will not work with other barrier patterns. Thus, to turn it on, the following settings are required: KMP_FORKJOIN_BARRIER_PATTERN=dist,dist KMP_PLAIN_BARRIER_PATTERN=dist,dist KMP_REDUCTION_BARRIER_PATTERN=dist,dist Branching factors (set with KMP_FORKJOIN_BARRIER, KMP_PLAIN_BARRIER, and KMP_REDUCTION_BARRIER) are ignored by the two-level distributed barrier. Differential Revision: https://reviews.llvm.org/D103121	2021-06-16 15:34:55 -05:00
Martin Storsjö	b388c84c09	[OpenMP] Remove two entirely unused variables Differential Revision: https://reviews.llvm.org/D96583	2021-02-12 21:55:31 +02:00
Martin Storsjö	b3d84790fa	[OpenMP] Add void casts to silence unused variable warnings These variables are used only in certain build configurations, or marked with a todo comment indicating that they should be used/checked/reported. Differential Revision: https://reviews.llvm.org/D96582	2021-02-12 21:55:31 +02:00
Martin Storsjö	3f9519b768	[OpenMP] Only use #pragma comment(lib, ...) in MSVC build configurations MinGW build configurations don't support this pragma (unless compiling with clang, with -fms-extensions, and linking with lld), and at least clang warns about it. This library does end up linked by the cmake files anyway (as long as the check works properly). Differential Revision: https://reviews.llvm.org/D96581	2021-02-12 21:55:31 +02:00
Hansang Bae	ffb21e7f05	[OpenMP] Enable omp_get_num_devices() on Windows This patch enables omp_get_num_devices() and omp_get_initial_device() on Windows by providing an alternative to dlsym on Windows, and proposes to add a new libomptarget entry, __tgt_get_num_devices(). Differential Revision: https://reviews.llvm.org/D96182	2021-02-11 14:53:48 -06:00
AndreyChurbanov	498c4b6fc4	[OpenMP] libomp: fix build by clang-cl with vs2019 Problem reported by Joseph Shen <joseph.smeng@gmail.com>. The patch changes *(&<atomic-var>) to (&<atomic-var>)->load(). Differential Revision: https://reviews.llvm.org/D95485	2021-01-27 12:18:15 +03:00
Shilei Tian	9d64275ae0	[OpenMP] Added the support for hidden helper task in RTL The basic design is to create an outer-most parallel team. It is not a regular team because it is only created when the first hidden helper task is encountered, and is only responsible for the execution of hidden helper tasks. We first use `pthread_create` to create a new thread, let's call it the initial and also the main thread of the hidden helper team. This initial thread then initializes a new root, just like what RTL does in initialization. After that, it directly calls `__kmpc_fork_call`. It is as if the initial thread encounters a parallel region. In the wrapped function for this team, the main thread, which is the initial thread that we create via `pthread_create` on Linux, waits on a condition variable. The condition variable can only be signaled when RTL is being destroyed. The other worker threads just do nothing. The main thread needs to wait there because, in the current implementation, once the main thread finishes the wrapped function of this team, it starts to free the team, which is not what we want. Two environment variables, `LIBOMP_NUM_HIDDEN_HELPER_THREADS` and `LIBOMP_USE_HIDDEN_HELPER_TASK`, are also set to configure the number of threads and enable/disable this feature. By default, the number of hidden helper threads is 8. Here are some open issues to be discussed: 1. The main thread goes to sleep when the initialization is finished. As Andrey mentioned, we might need it to be awakened from time to time to do some work. What kind of update/check should be put here? Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D77609	2021-01-25 22:16:17 -05:00
Terry Wilmarth	e0665a9050	[OpenMP] Add support for Intel's umonitor/umwait These changes add support for Intel's umonitor/umwait usage in wait code, for architectures that support those intrinsic functions. Usage of umonitor/umwait is off by default, but can be turned on by setting the KMP_USER_LEVEL_MWAIT environment variable. Differential Revision: https://reviews.llvm.org/D91189	2020-12-01 14:07:46 -06:00
AndreyChurbanov	5644f734d6	Revert "[OpenMP] Add support for Intel's umonitor/umwait" This reverts commit `9cfad5f9c5`.	2020-11-20 12:16:34 +03:00
AndreyChurbanov	9cfad5f9c5	[OpenMP] Add support for Intel's umonitor/umwait Patch by tlwilmar (Terry Wilmarth) Differential Revision: https://reviews.llvm.org/D91189	2020-11-19 22:04:21 +03:00
Kelvin Li	ed5fe64581	[OpenMP] NFC: Fix trivial typos in comments Submitted by: kiszk Differential Revision: https://reviews.llvm.org/D72171	2020-01-03 22:03:42 -05:00
Jonathan Peyton	e4b4f994d2	[OpenMP] Remove OMP spec versioning Remove all older OMP spec versioning from the runtime and build system. Patch by Terry Wilmarth Differential Revision: https://reviews.llvm.org/D64534 llvm-svn: 365963	2019-07-12 21:45:36 +00:00
Andrey Churbanov	f8f788b205	Fixed second issue reported in https://bugs.llvm.org/show_bug.cgi?id=41584 . Added synchronization for possible concurrent initialization of mutexes by multiple threads. The need for synchronization was caused by commit r357927, which added the use of mutexes when threads move to/from the common pool (earlier the mutexes were used only at suspend/resume). Patch by Johnny Peyton. Differential Revision: https://reviews.llvm.org/D61995 llvm-svn: 360919	2019-05-16 17:52:53 +00:00
Jonathan Peyton	feac33ebb0	[OpenMP] Clean up load balancing dynamic mode This patch cleans up the bookkeeping code for the load balancing dynamic mode. When a thread is moved to or from the thread pool, the th_active_in_pool flag and the __kmp_thread_pool_active_nth global counter are both updated. This removes the need for the corrective code in the main wait loop. Another global counter, __kmp_thread_pool_nth, was removed completely, as it was only used for debugging, but was not under KMP_DEBUG. Patch by Terry Wilmarth Differential Revision: https://reviews.llvm.org/D59508 llvm-svn: 357927	2019-04-08 17:50:02 +00:00
Jonathan Peyton	b727d384a3	[OpenMP] Fix hang on Windows A debug dump on a large machine shows that when many OpenMP threads (401 in total) sleep on a barrier, one of the innermost nesting levels sleeps on a child's b_arrived flag whose value is equal to 4 and is equal to the checker value, i.e., (1) the sleep bit is 0, and (2) done_check() would return true if called. It is unclear how this might happen. It could be an error in Windows Server 2016's EnterCriticalSection / LeaveCriticalSection, or in WaitForSingleObject / SetEvent / ResetEvent, or an error in the library which is very difficult to find. As a workaround, change the INFINITE wait to a timed wait, so that each thread awakens every 5 seconds (the timeout was chosen arbitrarily so as not to disturb other threads much), checks the flag condition under the lock, and either goes to sleep again or stops sleeping as a result of the check. Patch by Andrey Churbanov Differential Revision: https://reviews.llvm.org/D59793 llvm-svn: 357722	2019-04-04 20:35:29 +00:00
Jonathan Peyton	e47d32f165	[OpenMP] Make use of sched_yield optional in runtime This patch cleans up the yielding code and makes it optional. An environment variable, KMP_USE_YIELD, was added. Yielding is still on by default (KMP_USE_YIELD=1), but can be turned off completely (KMP_USE_YIELD=0), or turned on only when oversubscription is detected (KMP_USE_YIELD=2). Note that oversubscription cannot always be detected by the runtime (for example, when the runtime is initialized and the process forks, oversubscription cannot be detected currently over multiple instances of the runtime). Because yielding can be controlled by user now, the library mode settings (from KMP_LIBRARY) for throughput and turnaround have been adjusted by altering blocktime, unless that was also explicitly set. In the original code, there were a number of places where a double yield might have been done under oversubscription. This version checks oversubscription and if that's not going to yield, then it does the spin check. Patch by Terry Wilmarth Differential Revision: https://reviews.llvm.org/D58148 llvm-svn: 355120	2019-02-28 19:11:29 +00:00
Chandler Carruth	57b08b0944	Update more file headers across all of the LLVM projects in the monorepo to reflect the new license. These used slightly different spellings that defeated my regular expressions. We understand that people may be surprised that we're moving the header entirely to discuss the new license. We checked this carefully with the Foundation's lawyer and we believe this is the correct approach. Essentially, all code in the project is now made available by the LLVM project under our new license, so you will see that the license headers include that license only. Some of our contributors have contributed code under our old license, and accordingly, we have retained a copy of our old license notice in the top-level files in each project and repository. llvm-svn: 351648	2019-01-19 10:56:40 +00:00
Jonathan Peyton	9b8bb323c9	[OpenMP] Add omp_pause_resource* API Add omp_pause_resource and omp_pause_resource_all API and enum, plus stub for internal implementation. Implemented callable helper function to do local pause, and added basic functionality for hard and soft pause. Patch by Terry Wilmarth Differential Revision: https://reviews.llvm.org/D55078 llvm-svn: 351372	2019-01-16 20:07:39 +00:00
Andrey Churbanov	f700e9ed8c	Support clang compiling under windows-gnu and windows-msvc Patch by Peiyuan Song <squallatf@gmail.com> Differential Revision: https://reviews.llvm.org/D53422 llvm-svn: 348756	2018-12-10 13:45:00 +00:00
Jonathan Peyton	8b3842fc99	[OpenMP] Convert KMP_DYNAMIC_LIB to a 0 or 1 guard everywhere llvm-svn: 343869	2018-10-05 17:59:39 +00:00
Jonathan Peyton	37e2ef5434	[OpenMP] Use C++11 Atomics - barrier, tasking, and lock code These are preliminary changes that attempt to use C++11 Atomics in the runtime. We are expecting better portability with this change across architectures/OSes. Here is the summary of the changes. Most variables that need synchronization operation were converted to generic atomic variables (std::atomic<T>). Variables that are updated with combined CAS are packed into a single atomic variable, and partial read/write is done through unpacking/packing Patch by Hansang Bae Differential Revision: https://reviews.llvm.org/D47903 llvm-svn: 336563	2018-07-09 17:36:22 +00:00
Jonathan Peyton	94a114fc39	Apply formatting changes .clang-format's comments are removed and a (hopefully) final set of formatting changes are applied. Differential Revision: https://reviews.llvm.org/D38837 Differential Revision: https://reviews.llvm.org/D38920 llvm-svn: 316227	2017-10-20 19:30:57 +00:00
Jonathan Peyton	bd3a7633f1	Remove unnecessary semicolons Removes semicolons after if {} blocks, function definitions, etc. I was able to apply the large OMPT patch cleanly on top of this one with no conflicts. llvm-svn: 314340	2017-09-27 20:36:27 +00:00
Jonathan Peyton	6a393f75f4	Minor code cleanup of Klocwork issues Minor code cleanup of Klocwork issues. Fatal messages are given no return attribute. Define and use KMP_NORETURN to work for multiple C++ versions. Patch by Terry Wilmarth Differential Revision: https://reviews.llvm.org/D37275 llvm-svn: 312538	2017-09-05 15:43:58 +00:00
Ed Maste	414544c9aa	remove deprecated register storage class specifier While importing libomp into the FreeBSD base system we encountered Clang warnings that "'register' storage class specifier is deprecated and incompatible with C++1z [-Wdeprecated-register]". Differential Revision: https://reviews.llvm.org/D35124 llvm-svn: 307441	2017-07-07 21:06:05 +00:00
Jonathan Peyton	3041982dd1	Clang-format and whitespace cleanup of source code This patch contains the clang-format and cleanup of the entire code base. Some of clang-format's changes made the code look worse in places. A best effort was made to resolve the bulk of these problems, but many remain. Most of the problems were mangled line breaks and tabbing of comments. Patch by Terry Wilmarth Differential Revision: https://reviews.llvm.org/D32659 llvm-svn: 302929	2017-05-12 18:01:32 +00:00
Jonathan Peyton	de4749b748	Follow up to r289732: Update comments in source files to reference .cpp files Patch by Hansang Bae llvm-svn: 289739	2016-12-14 23:01:24 +00:00
Jonathan Peyton	7cc577a4ef	Change source files from .c to .cpp Patch by Hansang Bae Differential Revision: https://reviews.llvm.org/D26688 llvm-svn: 289732	2016-12-14 22:39:11 +00:00
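
The workaround described in b727d384a3 (replace an infinite wait with a timed wait, then re-check the flag under the lock) can be illustrated with a small portable sketch. This is not the runtime's code: the commit concerns Windows primitives such as WaitForSingleObject, while the sketch below swaps in std::condition_variable, and the names SleepFlag, wait, release, and demo_timed_wait are hypothetical. Only the 5 second default period is taken from the commit message.

```cpp
#include <chrono>
#include <condition_variable>
#include <mutex>
#include <thread>

// Hypothetical stand-in for a flag that a thread sleeps on at a barrier.
struct SleepFlag {
  std::mutex mtx;
  std::condition_variable cv;
  bool done = false; // plays the role of the done_check() condition

  // Sleep until `done` is set, but wake up at least once per `period` and
  // re-check the condition while holding the lock, so a lost wakeup cannot
  // leave the thread asleep forever. Returns the number of timed-out wakeups.
  int wait(std::chrono::milliseconds period = std::chrono::seconds(5)) {
    std::unique_lock<std::mutex> lock(mtx);
    int timeouts = 0;
    while (!done) {
      if (cv.wait_for(lock, period) == std::cv_status::timeout)
        ++timeouts; // timed out: loop around and re-check under the lock
    }
    return timeouts;
  }

  // Set the condition under the lock, then wake all sleepers.
  void release() {
    {
      std::lock_guard<std::mutex> lock(mtx);
      done = true;
    }
    cv.notify_all();
  }
};

// Usage sketch: one sleeper with a short period, one thread that releases it.
inline int demo_timed_wait() {
  SleepFlag flag;
  std::thread waker([&flag] {
    std::this_thread::sleep_for(std::chrono::milliseconds(50));
    flag.release();
  });
  int timeouts = flag.wait(std::chrono::milliseconds(10));
  waker.join();
  return timeouts;
}
```

The trade-off is the one the commit message names: a periodic wakeup costs a little work per sleeping thread, in exchange for bounding how long a missed signal can keep a thread asleep.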

34 Commits
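
Commit 37e2ef5434 mentions packing variables that are updated with a combined CAS into a single atomic variable, with partial reads and writes done through unpacking and packing. A minimal sketch of that general technique follows. The types and names here (Pair, PackedPair, pack, unpack, set_tail) are hypothetical and are not taken from the runtime, and the two 32-bit fields are an assumption chosen for illustration.

```cpp
#include <atomic>
#include <cstdint>

// Hypothetical pair of 32-bit fields that must always change together.
struct Pair {
  std::uint32_t head;
  std::uint32_t tail;
};

// Pack both fields into one 64-bit word (head in the high half).
inline std::uint64_t pack(Pair p) {
  return (static_cast<std::uint64_t>(p.head) << 32) | p.tail;
}

// Recover the two fields from the packed word.
inline Pair unpack(std::uint64_t v) {
  return {static_cast<std::uint32_t>(v >> 32), static_cast<std::uint32_t>(v)};
}

class PackedPair {
  std::atomic<std::uint64_t> bits{0};

public:
  // Partial read: load the whole word atomically, then unpack.
  Pair load() const { return unpack(bits.load(std::memory_order_acquire)); }

  // Combined CAS: both fields are compared and replaced in one atomic step.
  bool cas(Pair expected, Pair desired) {
    std::uint64_t e = pack(expected);
    return bits.compare_exchange_strong(e, pack(desired),
                                        std::memory_order_acq_rel,
                                        std::memory_order_acquire);
  }

  // Partial write: change only `tail`, retrying if another thread changed
  // the word in between (compare_exchange_weak refreshes `old` on failure).
  void set_tail(std::uint32_t t) {
    std::uint64_t old = bits.load(std::memory_order_relaxed);
    while (!bits.compare_exchange_weak(old, pack({unpack(old).head, t}),
                                       std::memory_order_acq_rel,
                                       std::memory_order_relaxed)) {
    }
  }
};
```

Keeping both fields in one std::atomic word means a single compare-and-swap covers both, at the cost of a retry loop for updates that touch only one field.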