192 lines
		
	
	
		
			7.7 KiB
		
	
	
	
		
			ReStructuredText
		
	
	
	
			
		
		
	
	
			192 lines
		
	
	
		
			7.7 KiB
		
	
	
	
		
			ReStructuredText
		
	
	
	
| =============================
 | |
| Advanced Build Configurations
 | |
| =============================
 | |
| 
 | |
| .. contents::
 | |
|    :local:
 | |
| 
 | |
| Introduction
 | |
| ============
 | |
| 
 | |
| `CMake <http://www.cmake.org/>`_ is a cross-platform build-generator tool. CMake
 | |
| does not build the project, it generates the files needed by your build tool
 | |
| (GNU make, Visual Studio, etc.) for building LLVM.
 | |
| 
 | |
| If **you are a new contributor**, please start with the :doc:`GettingStarted` or
 | |
| :doc:`CMake` pages. This page is intended for users doing more complex builds.
 | |
| 
 | |
| Many of the examples below are written assuming specific CMake Generators.
 | |
| Unless otherwise explicitly called out these commands should work with any CMake
 | |
| generator.
 | |
| 
 | |
| Bootstrap Builds
 | |
| ================
 | |
| 
 | |
| The Clang CMake build system supports bootstrap (aka multi-stage) builds. At a
 | |
| high level a multi-stage build is a chain of builds that pass data from one
 | |
| stage into the next. The most common and simple version of this is a traditional
 | |
| bootstrap build.
 | |
| 
 | |
| In a simple two-stage bootstrap build, we build clang using the system compiler,
 | |
| then use that just-built clang to build clang again. In CMake this simplest form
 | |
| of a bootstrap build can be configured with a single option,
 | |
| CLANG_ENABLE_BOOTSTRAP.
 | |
| 
 | |
| .. code-block:: console
 | |
| 
 | |
|   $ cmake -G Ninja -DCLANG_ENABLE_BOOTSTRAP=On <path to source>
 | |
|   $ ninja stage2
 | |
| 
 | |
| This command itself isn't terribly useful because it assumes default
 | |
| configurations for each stage. The next series of examples utilize CMake cache
 | |
| scripts to provide more complex options.
 | |
| 
 | |
| By default, only a few CMake options will be passed between stages.
 | |
| The list, called _BOOTSTRAP_DEFAULT_PASSTHROUGH, is defined in clang/CMakeLists.txt.
 | |
| To force the passing of the variables between stages, use the -DCLANG_BOOTSTRAP_PASSTHROUGH
 | |
| CMake option, each variable separated by a ";". As example:
 | |
| 
 | |
| .. code-block:: console
 | |
| 
 | |
|   $ cmake -G Ninja -DCLANG_ENABLE_BOOTSTRAP=On -DCLANG_BOOTSTRAP_PASSTHROUGH="CMAKE_INSTALL_PREFIX;CMAKE_VERBOSE_MAKEFILE" <path to source>
 | |
|   $ ninja stage2
 | |
| 
 | |
| CMake options starting by ``BOOTSTRAP_`` will be passed only to the stage2 build.
 | |
| This gives the opportunity to use Clang specific build flags.
 | |
| For example, the following CMake call will enabled '-fno-addrsig' only during
 | |
| the stage2 build for C and C++.
 | |
| 
 | |
| .. code-block:: console
 | |
| 
 | |
|   $ cmake [..]  -DBOOTSTRAP_CMAKE_CXX_FLAGS='-fno-addrsig' -DBOOTSTRAP_CMAKE_C_FLAGS='-fno-addrsig' [..]
 | |
| 
 | |
| The clang build system refers to builds as stages. A stage1 build is a standard
 | |
| build using the compiler installed on the host, and a stage2 build is built
 | |
| using the stage1 compiler. This nomenclature holds up to more stages too. In
 | |
| general a stage*n* build is built using the output from stage*n-1*.
 | |
| 
 | |
| Apple Clang Builds (A More Complex Bootstrap)
 | |
| =============================================
 | |
| 
 | |
| Apple's Clang builds are a slightly more complicated example of the simple
 | |
| bootstrapping scenario. Apple Clang is built using a 2-stage build.
 | |
| 
 | |
| The stage1 compiler is a host-only compiler with some options set. The stage1
 | |
| compiler is a balance of optimization vs build time because it is a throwaway.
 | |
| The stage2 compiler is the fully optimized compiler intended to ship to users.
 | |
| 
 | |
| Setting up these compilers requires a lot of options. To simplify the
 | |
| configuration the Apple Clang build settings are contained in CMake Cache files.
 | |
| You can build an Apple Clang compiler using the following commands:
 | |
| 
 | |
| .. code-block:: console
 | |
| 
 | |
|   $ cmake -G Ninja -C <path to clang>/cmake/caches/Apple-stage1.cmake <path to source>
 | |
|   $ ninja stage2-distribution
 | |
| 
 | |
| This CMake invocation configures the stage1 host compiler, and sets
 | |
| CLANG_BOOTSTRAP_CMAKE_ARGS to pass the Apple-stage2.cmake cache script to the
 | |
| stage2 configuration step.
 | |
| 
 | |
| When you build the stage2-distribution target it builds the minimal stage1
 | |
| compiler and required tools, then configures and builds the stage2 compiler
 | |
| based on the settings in Apple-stage2.cmake.
 | |
| 
 | |
| This pattern of using cache scripts to set complex settings, and specifically to
 | |
| make later stage builds include cache scripts is common in our more advanced
 | |
| build configurations.
 | |
| 
 | |
| Multi-stage PGO
 | |
| ===============
 | |
| 
 | |
| Profile-Guided Optimizations (PGO) is a really great way to optimize the code
 | |
| clang generates. Our multi-stage PGO builds are a workflow for generating PGO
 | |
| profiles that can be used to optimize clang.
 | |
| 
 | |
| At a high level, the way PGO works is that you build an instrumented compiler,
 | |
| then you run the instrumented compiler against sample source files. While the
 | |
| instrumented compiler runs it will output a bunch of files containing
 | |
| performance counters (.profraw files). After generating all the profraw files
 | |
| you use llvm-profdata to merge the files into a single profdata file that you
 | |
| can feed into the LLVM_PROFDATA_FILE option.
 | |
| 
 | |
| Our PGO.cmake cache script automates that whole process. You can use it by
 | |
| running:
 | |
| 
 | |
| .. code-block:: console
 | |
| 
 | |
|   $ cmake -G Ninja -C <path_to_clang>/cmake/caches/PGO.cmake <source dir>
 | |
|   $ ninja stage2-instrumented-generate-profdata
 | |
| 
 | |
| If you let that run for a few hours or so, it will place a profdata file in your
 | |
| build directory. This takes a really long time because it builds clang twice,
 | |
| and you *must* have compiler-rt in your build tree.
 | |
| 
 | |
| This process uses any source files under the perf-training directory as training
 | |
| data as long as the source files are marked up with LIT-style RUN lines.
 | |
| 
 | |
| After it finishes you can use “find . -name clang.profdata” to find it, but it
 | |
| should be at a path something like:
 | |
| 
 | |
| .. code-block:: console
 | |
| 
 | |
|   <build dir>/tools/clang/stage2-instrumented-bins/utils/perf-training/clang.profdata
 | |
| 
 | |
| You can feed that file into the LLVM_PROFDATA_FILE option when you build your
 | |
| optimized compiler.
 | |
| 
 | |
| The PGO came cache has a slightly different stage naming scheme than other
 | |
| multi-stage builds. It generates three stages; stage1, stage2-instrumented, and
 | |
| stage2. Both of the stage2 builds are built using the stage1 compiler.
 | |
| 
 | |
| The PGO came cache generates the following additional targets:
 | |
| 
 | |
| **stage2-instrumented**
 | |
|   Builds a stage1 x86 compiler, runtime, and required tools (llvm-config,
 | |
|   llvm-profdata) then uses that compiler to build an instrumented stage2 compiler.
 | |
| 
 | |
| **stage2-instrumented-generate-profdata**
 | |
|   Depends on "stage2-instrumented" and will use the instrumented compiler to
 | |
|   generate profdata based on the training files in <clang>/utils/perf-training
 | |
| 
 | |
| **stage2**
 | |
|   Depends of "stage2-instrumented-generate-profdata" and will use the stage1
 | |
|   compiler with the stage2 profdata to build a PGO-optimized compiler.
 | |
| 
 | |
| **stage2-check-llvm**
 | |
|   Depends on stage2 and runs check-llvm using the stage2 compiler.
 | |
| 
 | |
| **stage2-check-clang**
 | |
|   Depends on stage2 and runs check-clang using the stage2 compiler.
 | |
| 
 | |
| **stage2-check-all**
 | |
|   Depends on stage2 and runs check-all using the stage2 compiler.
 | |
| 
 | |
| **stage2-test-suite**
 | |
|   Depends on stage2 and runs the test-suite using the stage3 compiler (requires
 | |
|   in-tree test-suite).
 | |
| 
 | |
| 3-Stage Non-Determinism
 | |
| =======================
 | |
| 
 | |
| In the ancient lore of compilers non-determinism is like the multi-headed hydra.
 | |
| Whenever its head pops up, terror and chaos ensue.
 | |
| 
 | |
| Historically one of the tests to verify that a compiler was deterministic would
 | |
| be a three stage build. The idea of a three stage build is you take your sources
 | |
| and build a compiler (stage1), then use that compiler to rebuild the sources
 | |
| (stage2), then you use that compiler to rebuild the sources a third time
 | |
| (stage3) with an identical configuration to the stage2 build. At the end of
 | |
| this, you have a stage2 and stage3 compiler that should be bit-for-bit
 | |
| identical.
 | |
| 
 | |
| You can perform one of these 3-stage builds with LLVM & clang using the
 | |
| following commands:
 | |
| 
 | |
| .. code-block:: console
 | |
| 
 | |
|   $ cmake -G Ninja -C <path_to_clang>/cmake/caches/3-stage.cmake <source dir>
 | |
|   $ cmake --build . --target stage3 --parallel
 | |
| 
 | |
| After the build you can compare the stage2 & stage3 compilers.
 |