1==================================== 2Getting Started with the LLVM System 3==================================== 4 5.. contents:: 6 :local: 7 8Overview 9======== 10 11Welcome to the LLVM project! 12 13The LLVM project has multiple components. The core of the project is 14itself called "LLVM". This contains all of the tools, libraries, and header 15files needed to process intermediate representations and converts it into 16object files. Tools include an assembler, disassembler, bitcode analyzer, and 17bitcode optimizer. It also contains basic regression tests. 18 19C-like languages use the `Clang <https://clang.llvm.org/>`_ front end. This 20component compiles C, C++, Objective C, and Objective C++ code into LLVM bitcode 21-- and from there into object files, using LLVM. 22 23Other components include: 24the `libc++ C++ standard library <https://libcxx.llvm.org>`_, 25the `LLD linker <https://lld.llvm.org>`_, and more. 26 27Getting the Source Code and Building LLVM 28========================================= 29 30The LLVM Getting Started documentation may be out of date. The `Clang 31Getting Started <https://clang.llvm.org/get_started.html>`_ page might have more 32accurate information. 33 34This is an example workflow and configuration to get and build the LLVM source: 35 36#. Checkout LLVM (including related subprojects like Clang): 37 38 * ``git clone https://github.com/llvm/llvm-project.git`` 39 * Or, on windows, ``git clone --config core.autocrlf=false 40 https://github.com/llvm/llvm-project.git`` 41 * To save storage and speed-up the checkout time, you may want to do a 42 `shallow clone <https://git-scm.com/docs/git-clone#Documentation/git-clone.txt---depthltdepthgt>`_. 43 For example, to get the latest revision of the LLVM project, use 44 ``git clone --depth 1 https://github.com/llvm/llvm-project.git`` 45 46#. Configure and build LLVM and Clang: 47 48 * ``cd llvm-project`` 49 * ``mkdir build`` 50 * ``cd build`` 51 * ``cmake -G <generator> [options] ../llvm`` 52 53 Some common build system generators are: 54 55 * ``Ninja`` --- for generating `Ninja <https://ninja-build.org>`_ 56 build files. Most llvm developers use Ninja. 57 * ``Unix Makefiles`` --- for generating make-compatible parallel makefiles. 58 * ``Visual Studio`` --- for generating Visual Studio projects and 59 solutions. 60 * ``Xcode`` --- for generating Xcode projects. 61 62 Some Common options: 63 64 * ``-DLLVM_ENABLE_PROJECTS='...'`` --- semicolon-separated list of the LLVM 65 subprojects you'd like to additionally build. Can include any of: clang, 66 clang-tools-extra, libcxx, libcxxabi, libunwind, lldb, compiler-rt, lld, 67 polly, or cross-project-tests. 68 69 For example, to build LLVM, Clang, libcxx, and libcxxabi, use 70 ``-DLLVM_ENABLE_PROJECTS="clang;libcxx;libcxxabi"``. 71 72 * ``-DCMAKE_INSTALL_PREFIX=directory`` --- Specify for *directory* the full 73 pathname of where you want the LLVM tools and libraries to be installed 74 (default ``/usr/local``). 75 76 * ``-DCMAKE_BUILD_TYPE=type`` --- Valid options for *type* are Debug, 77 Release, RelWithDebInfo, and MinSizeRel. Default is Debug. 78 79 * ``-DLLVM_ENABLE_ASSERTIONS=On`` --- Compile with assertion checks enabled 80 (default is Yes for Debug builds, No for all other build types). 81 82 * ``cmake --build . [--target <target>]`` or the build system specified 83 above directly. 84 85 * The default target (i.e. ``cmake --build .`` or ``make``) will build all of 86 LLVM. 87 88 * The ``check-all`` target (i.e. ``ninja check-all``) will run the 89 regression tests to ensure everything is in working order. 90 91 * CMake will generate build targets for each tool and library, and most 92 LLVM sub-projects generate their own ``check-<project>`` target. 93 94 * Running a serial build will be **slow**. To improve speed, try running a 95 parallel build. That's done by default in Ninja; for ``make``, use the 96 option ``-j NN``, where ``NN`` is the number of parallel jobs, e.g. the 97 number of available CPUs. 98 99 * For more information see `CMake <CMake.html>`__ 100 101 * If you get an "internal compiler error (ICE)" or test failures, see 102 `below`_. 103 104Consult the `Getting Started with LLVM`_ section for detailed information on 105configuring and compiling LLVM. Go to `Directory Layout`_ to learn about the 106layout of the source code tree. 107 108Requirements 109============ 110 111Before you begin to use the LLVM system, review the requirements given below. 112This may save you some trouble by knowing ahead of time what hardware and 113software you will need. 114 115Hardware 116-------- 117 118LLVM is known to work on the following host platforms: 119 120================== ===================== ============= 121OS Arch Compilers 122================== ===================== ============= 123Linux x86\ :sup:`1` GCC, Clang 124Linux amd64 GCC, Clang 125Linux ARM GCC, Clang 126Linux Mips GCC, Clang 127Linux PowerPC GCC, Clang 128Linux SystemZ GCC, Clang 129Solaris V9 (Ultrasparc) GCC 130FreeBSD x86\ :sup:`1` GCC, Clang 131FreeBSD amd64 GCC, Clang 132NetBSD x86\ :sup:`1` GCC, Clang 133NetBSD amd64 GCC, Clang 134macOS\ :sup:`2` PowerPC GCC 135macOS x86 GCC, Clang 136Cygwin/Win32 x86\ :sup:`1, 3` GCC 137Windows x86\ :sup:`1` Visual Studio 138Windows x64 x86-64 Visual Studio 139================== ===================== ============= 140 141.. note:: 142 143 #. Code generation supported for Pentium processors and up 144 #. Code generation supported for 32-bit ABI only 145 #. To use LLVM modules on Win32-based system, you may configure LLVM 146 with ``-DBUILD_SHARED_LIBS=On``. 147 148Note that Debug builds require a lot of time and disk space. An LLVM-only build 149will need about 1-3 GB of space. A full build of LLVM and Clang will need around 15015-20 GB of disk space. The exact space requirements will vary by system. (It 151is so large because of all the debugging information and the fact that the 152libraries are statically linked into multiple tools). 153 154If you are space-constrained, you can build only selected tools or only 155selected targets. The Release build requires considerably less space. 156 157The LLVM suite *may* compile on other platforms, but it is not guaranteed to do 158so. If compilation is successful, the LLVM utilities should be able to 159assemble, disassemble, analyze, and optimize LLVM bitcode. Code generation 160should work as well, although the generated native code may not work on your 161platform. 162 163Software 164-------- 165 166Compiling LLVM requires that you have several software packages installed. The 167table below lists those required packages. The Package column is the usual name 168for the software package that LLVM depends on. The Version column provides 169"known to work" versions of the package. The Notes column describes how LLVM 170uses the package and provides other details. 171 172=========================================================== ============ ========================================== 173Package Version Notes 174=========================================================== ============ ========================================== 175`CMake <http://cmake.org/>`__ >=3.13.4 Makefile/workspace generator 176`GCC <http://gcc.gnu.org/>`_ >=5.1.0 C/C++ compiler\ :sup:`1` 177`python <http://www.python.org/>`_ >=3.6 Automated test suite\ :sup:`2` 178`zlib <http://zlib.net>`_ >=1.2.3.4 Compression library\ :sup:`3` 179`GNU Make <http://savannah.gnu.org/projects/make>`_ 3.79, 3.79.1 Makefile/build processor\ :sup:`4` 180=========================================================== ============ ========================================== 181 182.. note:: 183 184 #. Only the C and C++ languages are needed so there's no need to build the 185 other languages for LLVM's purposes. See `below` for specific version 186 info. 187 #. Only needed if you want to run the automated test suite in the 188 ``llvm/test`` directory. 189 #. Optional, adds compression / uncompression capabilities to selected LLVM 190 tools. 191 #. Optional, you can use any other build tool supported by CMake. 192 193Additionally, your compilation host is expected to have the usual plethora of 194Unix utilities. Specifically: 195 196* **ar** --- archive library builder 197* **bzip2** --- bzip2 command for distribution generation 198* **bunzip2** --- bunzip2 command for distribution checking 199* **chmod** --- change permissions on a file 200* **cat** --- output concatenation utility 201* **cp** --- copy files 202* **date** --- print the current date/time 203* **echo** --- print to standard output 204* **egrep** --- extended regular expression search utility 205* **find** --- find files/dirs in a file system 206* **grep** --- regular expression search utility 207* **gzip** --- gzip command for distribution generation 208* **gunzip** --- gunzip command for distribution checking 209* **install** --- install directories/files 210* **mkdir** --- create a directory 211* **mv** --- move (rename) files 212* **ranlib** --- symbol table builder for archive libraries 213* **rm** --- remove (delete) files and directories 214* **sed** --- stream editor for transforming output 215* **sh** --- Bourne shell for make build scripts 216* **tar** --- tape archive for distribution generation 217* **test** --- test things in file system 218* **unzip** --- unzip command for distribution checking 219* **zip** --- zip command for distribution generation 220 221.. _below: 222.. _check here: 223 224Host C++ Toolchain, both Compiler and Standard Library 225------------------------------------------------------ 226 227LLVM is very demanding of the host C++ compiler, and as such tends to expose 228bugs in the compiler. We also attempt to follow improvements and developments in 229the C++ language and library reasonably closely. As such, we require a modern 230host C++ toolchain, both compiler and standard library, in order to build LLVM. 231 232LLVM is written using the subset of C++ documented in :doc:`coding 233standards<CodingStandards>`. To enforce this language version, we check the most 234popular host toolchains for specific minimum versions in our build systems: 235 236* Clang 3.5 237* Apple Clang 6.0 238* GCC 5.1 239* Visual Studio 2017 240 241Anything older than these toolchains *may* work, but will require forcing the 242build system with a special option and is not really a supported host platform. 243Also note that older versions of these compilers have often crashed or 244miscompiled LLVM. 245 246For less widely used host toolchains such as ICC or xlC, be aware that a very 247recent version may be required to support all of the C++ features used in LLVM. 248 249We track certain versions of software that are *known* to fail when used as 250part of the host toolchain. These even include linkers at times. 251 252**GNU ld 2.16.X**. Some 2.16.X versions of the ld linker will produce very long 253warning messages complaining that some "``.gnu.linkonce.t.*``" symbol was 254defined in a discarded section. You can safely ignore these messages as they are 255erroneous and the linkage is correct. These messages disappear using ld 2.17. 256 257**GNU binutils 2.17**: Binutils 2.17 contains `a bug 258<http://sourceware.org/bugzilla/show_bug.cgi?id=3111>`__ which causes huge link 259times (minutes instead of seconds) when building LLVM. We recommend upgrading 260to a newer version (2.17.50.0.4 or later). 261 262**GNU Binutils 2.19.1 Gold**: This version of Gold contained `a bug 263<http://sourceware.org/bugzilla/show_bug.cgi?id=9836>`__ which causes 264intermittent failures when building LLVM with position independent code. The 265symptom is an error about cyclic dependencies. We recommend upgrading to a 266newer version of Gold. 267 268Getting a Modern Host C++ Toolchain 269^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 270 271This section mostly applies to Linux and older BSDs. On macOS, you should 272have a sufficiently modern Xcode, or you will likely need to upgrade until you 273do. Windows does not have a "system compiler", so you must install either Visual 274Studio 2017 or a recent version of mingw64. FreeBSD 10.0 and newer have a modern 275Clang as the system compiler. 276 277However, some Linux distributions and some other or older BSDs sometimes have 278extremely old versions of GCC. These steps attempt to help you upgrade you 279compiler even on such a system. However, if at all possible, we encourage you 280to use a recent version of a distribution with a modern system compiler that 281meets these requirements. Note that it is tempting to install a prior 282version of Clang and libc++ to be the host compiler, however libc++ was not 283well tested or set up to build on Linux until relatively recently. As 284a consequence, this guide suggests just using libstdc++ and a modern GCC as the 285initial host in a bootstrap, and then using Clang (and potentially libc++). 286 287The first step is to get a recent GCC toolchain installed. The most common 288distribution on which users have struggled with the version requirements is 289Ubuntu Precise, 12.04 LTS. For this distribution, one easy option is to install 290the `toolchain testing PPA`_ and use it to install a modern GCC. There is 291a really nice discussions of this on the `ask ubuntu stack exchange`_ and a 292`github gist`_ with updated commands. However, not all users can use PPAs and 293there are many other distributions, so it may be necessary (or just useful, if 294you're here you *are* doing compiler development after all) to build and install 295GCC from source. It is also quite easy to do these days. 296 297.. _toolchain testing PPA: 298 https://launchpad.net/~ubuntu-toolchain-r/+archive/test 299.. _ask ubuntu stack exchange: 300 https://askubuntu.com/questions/466651/how-do-i-use-the-latest-gcc-on-ubuntu/581497#58149 301.. _github gist: 302 https://gist.github.com/application2000/73fd6f4bf1be6600a2cf9f56315a2d91 303 304Easy steps for installing GCC 5.1.0: 305 306.. code-block:: console 307 308 % gcc_version=5.1.0 309 % wget https://ftp.gnu.org/gnu/gcc/gcc-${gcc_version}/gcc-${gcc_version}.tar.bz2 310 % wget https://ftp.gnu.org/gnu/gcc/gcc-${gcc_version}/gcc-${gcc_version}.tar.bz2.sig 311 % wget https://ftp.gnu.org/gnu/gnu-keyring.gpg 312 % signature_invalid=`gpg --verify --no-default-keyring --keyring ./gnu-keyring.gpg gcc-${gcc_version}.tar.bz2.sig` 313 % if [ $signature_invalid ]; then echo "Invalid signature" ; exit 1 ; fi 314 % tar -xvjf gcc-${gcc_version}.tar.bz2 315 % cd gcc-${gcc_version} 316 % ./contrib/download_prerequisites 317 % cd .. 318 % mkdir gcc-${gcc_version}-build 319 % cd gcc-${gcc_version}-build 320 % $PWD/../gcc-${gcc_version}/configure --prefix=$HOME/toolchains --enable-languages=c,c++ 321 % make -j$(nproc) 322 % make install 323 324For more details, check out the excellent `GCC wiki entry`_, where I got most 325of this information from. 326 327.. _GCC wiki entry: 328 https://gcc.gnu.org/wiki/InstallingGCC 329 330Once you have a GCC toolchain, configure your build of LLVM to use the new 331toolchain for your host compiler and C++ standard library. Because the new 332version of libstdc++ is not on the system library search path, you need to pass 333extra linker flags so that it can be found at link time (``-L``) and at runtime 334(``-rpath``). If you are using CMake, this invocation should produce working 335binaries: 336 337.. code-block:: console 338 339 % mkdir build 340 % cd build 341 % CC=$HOME/toolchains/bin/gcc CXX=$HOME/toolchains/bin/g++ \ 342 cmake .. -DCMAKE_CXX_LINK_FLAGS="-Wl,-rpath,$HOME/toolchains/lib64 -L$HOME/toolchains/lib64" 343 344If you fail to set rpath, most LLVM binaries will fail on startup with a message 345from the loader similar to ``libstdc++.so.6: version `GLIBCXX_3.4.20' not 346found``. This means you need to tweak the -rpath linker flag. 347 348This method will add an absolute path to the rpath of all executables. That's 349fine for local development. If you want to distribute the binaries you build 350so that they can run on older systems, copy ``libstdc++.so.6`` into the 351``lib/`` directory. All of LLVM's shipping binaries have an rpath pointing at 352``$ORIGIN/../lib``, so they will find ``libstdc++.so.6`` there. Non-distributed 353binaries don't have an rpath set and won't find ``libstdc++.so.6``. Pass 354``-DLLVM_LOCAL_RPATH="$HOME/toolchains/lib64"`` to cmake to add an absolute 355path to ``libstdc++.so.6`` as above. Since these binaries are not distributed, 356having an absolute local path is fine for them. 357 358When you build Clang, you will need to give *it* access to modern C++ 359standard library in order to use it as your new host in part of a bootstrap. 360There are two easy ways to do this, either build (and install) libc++ along 361with Clang and then use it with the ``-stdlib=libc++`` compile and link flag, 362or install Clang into the same prefix (``$HOME/toolchains`` above) as GCC. 363Clang will look within its own prefix for libstdc++ and use it if found. You 364can also add an explicit prefix for Clang to look in for a GCC toolchain with 365the ``--gcc-toolchain=/opt/my/gcc/prefix`` flag, passing it to both compile and 366link commands when using your just-built-Clang to bootstrap. 367 368.. _Getting Started with LLVM: 369 370Getting Started with LLVM 371========================= 372 373The remainder of this guide is meant to get you up and running with LLVM and to 374give you some basic information about the LLVM environment. 375 376The later sections of this guide describe the `general layout`_ of the LLVM 377source tree, a `simple example`_ using the LLVM tool chain, and `links`_ to find 378more information about LLVM or to get help via e-mail. 379 380Terminology and Notation 381------------------------ 382 383Throughout this manual, the following names are used to denote paths specific to 384the local system and working environment. *These are not environment variables 385you need to set but just strings used in the rest of this document below*. In 386any of the examples below, simply replace each of these names with the 387appropriate pathname on your local system. All these paths are absolute: 388 389``SRC_ROOT`` 390 391 This is the top level directory of the LLVM source tree. 392 393``OBJ_ROOT`` 394 395 This is the top level directory of the LLVM object tree (i.e. the tree where 396 object files and compiled programs will be placed. It can be the same as 397 SRC_ROOT). 398 399Unpacking the LLVM Archives 400--------------------------- 401 402If you have the LLVM distribution, you will need to unpack it before you can 403begin to compile it. LLVM is distributed as a number of different 404subprojects. Each one has its own download which is a TAR archive that is 405compressed with the gzip program. 406 407The files are as follows, with *x.y* marking the version number: 408 409``llvm-x.y.tar.gz`` 410 411 Source release for the LLVM libraries and tools. 412 413``cfe-x.y.tar.gz`` 414 415 Source release for the Clang frontend. 416 417.. _checkout: 418 419Checkout LLVM from Git 420---------------------- 421 422You can also checkout the source code for LLVM from Git. 423 424.. note:: 425 426 Passing ``--config core.autocrlf=false`` should not be required in 427 the future after we adjust the .gitattribute settings correctly, but 428 is required for Windows users at the time of this writing. 429 430Simply run: 431 432.. code-block:: console 433 434 % git clone https://github.com/llvm/llvm-project.git 435 436or on Windows, 437 438.. code-block:: console 439 440 % git clone --config core.autocrlf=false https://github.com/llvm/llvm-project.git 441 442This will create an '``llvm-project``' directory in the current directory and 443fully populate it with all of the source code, test directories, and local 444copies of documentation files for LLVM and all the related subprojects. Note 445that unlike the tarballs, which contain each subproject in a separate file, the 446git repository contains all of the projects together. 447 448If you want to get a specific release (as opposed to the most recent revision), 449you can check out a tag after cloning the repository. E.g., `git checkout 450llvmorg-6.0.1` inside the ``llvm-project`` directory created by the above 451command. Use `git tag -l` to list all of them. 452 453Sending patches 454^^^^^^^^^^^^^^^ 455 456Please read `Developer Policy <DeveloperPolicy.html#one-off-patches>`_, too. 457 458We don't currently accept github pull requests, so you'll need to send patches 459either via emailing to llvm-commits, or, preferably, via :ref:`Phabricator 460<phabricator-reviews>`. 461 462You'll generally want to make sure your branch has a single commit, 463corresponding to the review you wish to send, up-to-date with the upstream 464``origin/main`` branch, and doesn't contain merges. Once you have that, you 465can start `a Phabricator review <Phabricator.html>`_ (or use ``git show`` or 466``git format-patch`` to output the diff, and attach it to an email message). 467 468However, using the "Arcanist" tool is often easier. After `installing 469arcanist`_, you can upload the latest commit using: 470 471.. code-block:: console 472 473 % arc diff HEAD~1 474 475Additionally, before sending a patch for review, please also try to ensure it's 476formatted properly. We use ``clang-format`` for this, which has git integration 477through the ``git-clang-format`` script. On some systems, it may already be 478installed (or be installable via your package manager). If so, you can simply 479run it -- the following command will format only the code changed in the most 480recent commit: 481 482.. code-block:: console 483 484 % git clang-format HEAD~1 485 486Note that this modifies the files, but doesn't commit them -- you'll likely want 487to run 488 489.. code-block:: console 490 491 % git commit --amend -a 492 493in order to update the last commit with all pending changes. 494 495.. note:: 496 If you don't already have ``clang-format`` or ``git clang-format`` installed 497 on your system, the ``clang-format`` binary will be built alongside clang, and 498 the git integration can be run from 499 ``clang/tools/clang-format/git-clang-format``. 500 501 502.. _commit_from_git: 503 504For developers to commit changes from Git 505^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 506 507Once a patch is reviewed, you should rebase it, re-test locally, and commit the 508changes to LLVM's main branch. This is done using `git push` if you have the 509required access rights. See `committing a change 510<Phabricator.html#committing-a-change>`_ for Phabricator based commits or 511`obtaining commit access <DeveloperPolicy.html#obtaining-commit-access>`_ 512for commit access. 513 514Here is an example workflow using git. This workflow assumes you have an 515accepted commit on the branch named `branch-with-change`. 516 517.. code-block:: console 518 519 # Go to the branch with your accepted commit. 520 % git checkout branch-with-change 521 # Rebase your change onto the latest commits on Github. 522 % git pull --rebase origin main 523 # Rerun the appropriate tests if needed. 524 % ninja check-$whatever 525 # Check that the list of commits about to be pushed is correct. 526 % git log origin/main...HEAD --oneline 527 # Push to Github. 528 % git push origin HEAD:main 529 530LLVM currently has a linear-history policy, which means that merge commits are 531not allowed. The `llvm-project` repo on github is configured to reject pushes 532that include merges, so the `git rebase` step above is required. 533 534Please ask for help if you're having trouble with your particular git workflow. 535 536 537.. _git_pre_push_hook: 538 539Git pre-push hook 540^^^^^^^^^^^^^^^^^ 541 542We include an optional pre-push hook that run some sanity checks on the revisions 543you are about to push and ask confirmation if you push multiple commits at once. 544You can set it up (on Unix systems) by running from the repository root: 545 546.. code-block:: console 547 548 % ln -sf ../../llvm/utils/git/pre-push.py .git/hooks/pre-push 549 550Bisecting commits 551^^^^^^^^^^^^^^^^^ 552 553See `Bisecting LLVM code <GitBisecting.html>`_ for how to use ``git bisect`` 554on LLVM. 555 556Reverting a change 557^^^^^^^^^^^^^^^^^^ 558 559When reverting changes using git, the default message will say "This reverts 560commit XYZ". Leave this at the end of the commit message, but add some details 561before it as to why the commit is being reverted. A brief explanation and/or 562links to bots that demonstrate the problem are sufficient. 563 564Local LLVM Configuration 565------------------------ 566 567Once checked out repository, the LLVM suite source code must be configured 568before being built. This process uses CMake. Unlinke the normal ``configure`` 569script, CMake generates the build files in whatever format you request as well 570as various ``*.inc`` files, and ``llvm/include/Config/config.h``. 571 572Variables are passed to ``cmake`` on the command line using the format 573``-D<variable name>=<value>``. The following variables are some common options 574used by people developing LLVM. 575 576+-------------------------+----------------------------------------------------+ 577| Variable | Purpose | 578+=========================+====================================================+ 579| CMAKE_C_COMPILER | Tells ``cmake`` which C compiler to use. By | 580| | default, this will be /usr/bin/cc. | 581+-------------------------+----------------------------------------------------+ 582| CMAKE_CXX_COMPILER | Tells ``cmake`` which C++ compiler to use. By | 583| | default, this will be /usr/bin/c++. | 584+-------------------------+----------------------------------------------------+ 585| CMAKE_BUILD_TYPE | Tells ``cmake`` what type of build you are trying | 586| | to generate files for. Valid options are Debug, | 587| | Release, RelWithDebInfo, and MinSizeRel. Default | 588| | is Debug. | 589+-------------------------+----------------------------------------------------+ 590| CMAKE_INSTALL_PREFIX | Specifies the install directory to target when | 591| | running the install action of the build files. | 592+-------------------------+----------------------------------------------------+ 593| PYTHON_EXECUTABLE | Forces CMake to use a specific Python version by | 594| | passing a path to a Python interpreter. By default | 595| | the Python version of the interpreter in your PATH | 596| | is used. | 597+-------------------------+----------------------------------------------------+ 598| LLVM_TARGETS_TO_BUILD | A semicolon delimited list controlling which | 599| | targets will be built and linked into llvm. | 600| | The default list is defined as | 601| | ``LLVM_ALL_TARGETS``, and can be set to include | 602| | out-of-tree targets. The default value includes: | 603| | ``AArch64, AMDGPU, ARM, AVR, BPF, Hexagon, Lanai, | 604| | Mips, MSP430, NVPTX, PowerPC, RISCV, Sparc, | 605| | SystemZ, WebAssembly, X86, XCore``. | 606| | | 607+-------------------------+----------------------------------------------------+ 608| LLVM_ENABLE_DOXYGEN | Build doxygen-based documentation from the source | 609| | code This is disabled by default because it is | 610| | slow and generates a lot of output. | 611+-------------------------+----------------------------------------------------+ 612| LLVM_ENABLE_PROJECTS | A semicolon-delimited list selecting which of the | 613| | other LLVM subprojects to additionally build. (Only| 614| | effective when using a side-by-side project layout | 615| | e.g. via git). The default list is empty. Can | 616| | include: clang, libcxx, libcxxabi, libunwind, lldb,| 617| | compiler-rt, lld, polly, or debuginfo-tests. | 618+-------------------------+----------------------------------------------------+ 619| LLVM_ENABLE_SPHINX | Build sphinx-based documentation from the source | 620| | code. This is disabled by default because it is | 621| | slow and generates a lot of output. Sphinx version | 622| | 1.5 or later recommended. | 623+-------------------------+----------------------------------------------------+ 624| LLVM_BUILD_LLVM_DYLIB | Generate libLLVM.so. This library contains a | 625| | default set of LLVM components that can be | 626| | overridden with ``LLVM_DYLIB_COMPONENTS``. The | 627| | default contains most of LLVM and is defined in | 628| | ``tools/llvm-shlib/CMakelists.txt``. This option is| 629| | not available on Windows. | 630+-------------------------+----------------------------------------------------+ 631| LLVM_OPTIMIZED_TABLEGEN | Builds a release tablegen that gets used during | 632| | the LLVM build. This can dramatically speed up | 633| | debug builds. | 634+-------------------------+----------------------------------------------------+ 635 636To configure LLVM, follow these steps: 637 638#. Change directory into the object root directory: 639 640 .. code-block:: console 641 642 % cd OBJ_ROOT 643 644#. Run the ``cmake``: 645 646 .. code-block:: console 647 648 % cmake -G "Unix Makefiles" -DCMAKE_INSTALL_PREFIX=/install/path 649 [other options] SRC_ROOT 650 651Compiling the LLVM Suite Source Code 652------------------------------------ 653 654Unlike with autotools, with CMake your build type is defined at configuration. 655If you want to change your build type, you can re-run cmake with the following 656invocation: 657 658 .. code-block:: console 659 660 % cmake -G "Unix Makefiles" -DCMAKE_BUILD_TYPE=type SRC_ROOT 661 662Between runs, CMake preserves the values set for all options. CMake has the 663following build types defined: 664 665Debug 666 667 These builds are the default. The build system will compile the tools and 668 libraries unoptimized, with debugging information, and asserts enabled. 669 670Release 671 672 For these builds, the build system will compile the tools and libraries 673 with optimizations enabled and not generate debug info. CMakes default 674 optimization level is -O3. This can be configured by setting the 675 ``CMAKE_CXX_FLAGS_RELEASE`` variable on the CMake command line. 676 677RelWithDebInfo 678 679 These builds are useful when debugging. They generate optimized binaries with 680 debug information. CMakes default optimization level is -O2. This can be 681 configured by setting the ``CMAKE_CXX_FLAGS_RELWITHDEBINFO`` variable on the 682 CMake command line. 683 684Once you have LLVM configured, you can build it by entering the *OBJ_ROOT* 685directory and issuing the following command: 686 687.. code-block:: console 688 689 % make 690 691If the build fails, please `check here`_ to see if you are using a version of 692GCC that is known not to compile LLVM. 693 694If you have multiple processors in your machine, you may wish to use some of the 695parallel build options provided by GNU Make. For example, you could use the 696command: 697 698.. code-block:: console 699 700 % make -j2 701 702There are several special targets which are useful when working with the LLVM 703source code: 704 705``make clean`` 706 707 Removes all files generated by the build. This includes object files, 708 generated C/C++ files, libraries, and executables. 709 710``make install`` 711 712 Installs LLVM header files, libraries, tools, and documentation in a hierarchy 713 under ``$PREFIX``, specified with ``CMAKE_INSTALL_PREFIX``, which 714 defaults to ``/usr/local``. 715 716``make docs-llvm-html`` 717 718 If configured with ``-DLLVM_ENABLE_SPHINX=On``, this will generate a directory 719 at ``OBJ_ROOT/docs/html`` which contains the HTML formatted documentation. 720 721Cross-Compiling LLVM 722-------------------- 723 724It is possible to cross-compile LLVM itself. That is, you can create LLVM 725executables and libraries to be hosted on a platform different from the platform 726where they are built (a Canadian Cross build). To generate build files for 727cross-compiling CMake provides a variable ``CMAKE_TOOLCHAIN_FILE`` which can 728define compiler flags and variables used during the CMake test operations. 729 730The result of such a build is executables that are not runnable on the build 731host but can be executed on the target. As an example the following CMake 732invocation can generate build files targeting iOS. This will work on macOS 733with the latest Xcode: 734 735.. code-block:: console 736 737 % cmake -G "Ninja" -DCMAKE_OSX_ARCHITECTURES="armv7;armv7s;arm64" 738 -DCMAKE_TOOLCHAIN_FILE=<PATH_TO_LLVM>/cmake/platforms/iOS.cmake 739 -DCMAKE_BUILD_TYPE=Release -DLLVM_BUILD_RUNTIME=Off -DLLVM_INCLUDE_TESTS=Off 740 -DLLVM_INCLUDE_EXAMPLES=Off -DLLVM_ENABLE_BACKTRACES=Off [options] 741 <PATH_TO_LLVM> 742 743Note: There are some additional flags that need to be passed when building for 744iOS due to limitations in the iOS SDK. 745 746Check :doc:`HowToCrossCompileLLVM` and `Clang docs on how to cross-compile in general 747<https://clang.llvm.org/docs/CrossCompilation.html>`_ for more information 748about cross-compiling. 749 750The Location of LLVM Object Files 751--------------------------------- 752 753The LLVM build system is capable of sharing a single LLVM source tree among 754several LLVM builds. Hence, it is possible to build LLVM for several different 755platforms or configurations using the same source tree. 756 757* Change directory to where the LLVM object files should live: 758 759 .. code-block:: console 760 761 % cd OBJ_ROOT 762 763* Run ``cmake``: 764 765 .. code-block:: console 766 767 % cmake -G "Unix Makefiles" SRC_ROOT 768 769The LLVM build will create a structure underneath *OBJ_ROOT* that matches the 770LLVM source tree. At each level where source files are present in the source 771tree there will be a corresponding ``CMakeFiles`` directory in the *OBJ_ROOT*. 772Underneath that directory there is another directory with a name ending in 773``.dir`` under which you'll find object files for each source. 774 775For example: 776 777 .. code-block:: console 778 779 % cd llvm_build_dir 780 % find lib/Support/ -name APFloat* 781 lib/Support/CMakeFiles/LLVMSupport.dir/APFloat.cpp.o 782 783Optional Configuration Items 784---------------------------- 785 786If you're running on a Linux system that supports the `binfmt_misc 787<http://en.wikipedia.org/wiki/binfmt_misc>`_ 788module, and you have root access on the system, you can set your system up to 789execute LLVM bitcode files directly. To do this, use commands like this (the 790first command may not be required if you are already using the module): 791 792.. code-block:: console 793 794 % mount -t binfmt_misc none /proc/sys/fs/binfmt_misc 795 % echo ':llvm:M::BC::/path/to/lli:' > /proc/sys/fs/binfmt_misc/register 796 % chmod u+x hello.bc (if needed) 797 % ./hello.bc 798 799This allows you to execute LLVM bitcode files directly. On Debian, you can also 800use this command instead of the 'echo' command above: 801 802.. code-block:: console 803 804 % sudo update-binfmts --install llvm /path/to/lli --magic 'BC' 805 806.. _Program Layout: 807.. _general layout: 808 809Directory Layout 810================ 811 812One useful source of information about the LLVM source base is the LLVM `doxygen 813<http://www.doxygen.org/>`_ documentation available at 814`<https://llvm.org/doxygen/>`_. The following is a brief introduction to code 815layout: 816 817``llvm/cmake`` 818-------------- 819Genereates system build files. 820 821``llvm/cmake/modules`` 822 Build configuration for llvm user defined options. Checks compiler version and 823 linker flags. 824 825``llvm/cmake/platforms`` 826 Toolchain configuration for Android NDK, iOS systems and non-Windows hosts to 827 target MSVC. 828 829``llvm/examples`` 830----------------- 831 832- Some simple examples showing how to use LLVM as a compiler for a custom 833 language - including lowering, optimization, and code generation. 834 835- Kaleidoscope Tutorial: Kaleidoscope language tutorial run through the 836 implementation of a nice little compiler for a non-trivial language 837 including a hand-written lexer, parser, AST, as well as code generation 838 support using LLVM- both static (ahead of time) and various approaches to 839 Just In Time (JIT) compilation. 840 `Kaleidoscope Tutorial for complete beginner 841 <https://llvm.org/docs/tutorial/MyFirstLanguageFrontend/index.html>`_. 842 843- BuildingAJIT: Examples of the `BuildingAJIT tutorial 844 <https://llvm.org/docs/tutorial/BuildingAJIT1.html>`_ that shows how LLVM’s 845 ORC JIT APIs interact with other parts of LLVM. It also, teaches how to 846 recombine them to build a custom JIT that is suited to your use-case. 847 848``llvm/include`` 849---------------- 850 851Public header files exported from the LLVM library. The three main subdirectories: 852 853``llvm/include/llvm`` 854 855 All LLVM-specific header files, and subdirectories for different portions of 856 LLVM: ``Analysis``, ``CodeGen``, ``Target``, ``Transforms``, etc... 857 858``llvm/include/llvm/Support`` 859 860 Generic support libraries provided with LLVM but not necessarily specific to 861 LLVM. For example, some C++ STL utilities and a Command Line option processing 862 library store header files here. 863 864``llvm/include/llvm/Config`` 865 866 Header files configured by ``cmake``. They wrap "standard" UNIX and 867 C header files. Source code can include these header files which 868 automatically take care of the conditional #includes that ``cmake`` 869 generates. 870 871``llvm/lib`` 872------------ 873 874Most source files are here. By putting code in libraries, LLVM makes it easy to 875share code among the `tools`_. 876 877``llvm/lib/IR/`` 878 879 Core LLVM source files that implement core classes like Instruction and 880 BasicBlock. 881 882``llvm/lib/AsmParser/`` 883 884 Source code for the LLVM assembly language parser library. 885 886``llvm/lib/Bitcode/`` 887 888 Code for reading and writing bitcode. 889 890``llvm/lib/Analysis/`` 891 892 A variety of program analyses, such as Call Graphs, Induction Variables, 893 Natural Loop Identification, etc. 894 895``llvm/lib/Transforms/`` 896 897 IR-to-IR program transformations, such as Aggressive Dead Code Elimination, 898 Sparse Conditional Constant Propagation, Inlining, Loop Invariant Code Motion, 899 Dead Global Elimination, and many others. 900 901``llvm/lib/Target/`` 902 903 Files describing target architectures for code generation. For example, 904 ``llvm/lib/Target/X86`` holds the X86 machine description. 905 906``llvm/lib/CodeGen/`` 907 908 The major parts of the code generator: Instruction Selector, Instruction 909 Scheduling, and Register Allocation. 910 911``llvm/lib/MC/`` 912 913 The libraries represent and process code at machine code level. Handles 914 assembly and object-file emission. 915 916``llvm/lib/ExecutionEngine/`` 917 918 Libraries for directly executing bitcode at runtime in interpreted and 919 JIT-compiled scenarios. 920 921``llvm/lib/Support/`` 922 923 Source code that corresponding to the header files in ``llvm/include/ADT/`` 924 and ``llvm/include/Support/``. 925 926``llvm/bindings`` 927---------------------- 928 929Contains bindings for the LLVM compiler infrastructure to allow 930programs written in languages other than C or C++ to take advantage of the LLVM 931infrastructure. 932LLVM project provides language bindings for Go, OCaml and Python. 933 934``llvm/projects`` 935----------------- 936 937Projects not strictly part of LLVM but shipped with LLVM. This is also the 938directory for creating your own LLVM-based projects which leverage the LLVM 939build system. 940 941``llvm/test`` 942------------- 943 944Feature and regression tests and other sanity checks on LLVM infrastructure. These 945are intended to run quickly and cover a lot of territory without being exhaustive. 946 947``test-suite`` 948-------------- 949 950A comprehensive correctness, performance, and benchmarking test suite 951for LLVM. This comes in a ``separate git repository 952<https://github.com/llvm/llvm-test-suite>``, because it contains a 953large amount of third-party code under a variety of licenses. For 954details see the :doc:`Testing Guide <TestingGuide>` document. 955 956.. _tools: 957 958``llvm/tools`` 959-------------- 960 961Executables built out of the libraries 962above, which form the main part of the user interface. You can always get help 963for a tool by typing ``tool_name -help``. The following is a brief introduction 964to the most important tools. More detailed information is in 965the `Command Guide <CommandGuide/index.html>`_. 966 967``bugpoint`` 968 969 ``bugpoint`` is used to debug optimization passes or code generation backends 970 by narrowing down the given test case to the minimum number of passes and/or 971 instructions that still cause a problem, whether it is a crash or 972 miscompilation. See `<HowToSubmitABug.html>`_ for more information on using 973 ``bugpoint``. 974 975``llvm-ar`` 976 977 The archiver produces an archive containing the given LLVM bitcode files, 978 optionally with an index for faster lookup. 979 980``llvm-as`` 981 982 The assembler transforms the human readable LLVM assembly to LLVM bitcode. 983 984``llvm-dis`` 985 986 The disassembler transforms the LLVM bitcode to human readable LLVM assembly. 987 988``llvm-link`` 989 990 ``llvm-link``, not surprisingly, links multiple LLVM modules into a single 991 program. 992 993``lli`` 994 995 ``lli`` is the LLVM interpreter, which can directly execute LLVM bitcode 996 (although very slowly...). For architectures that support it (currently x86, 997 Sparc, and PowerPC), by default, ``lli`` will function as a Just-In-Time 998 compiler (if the functionality was compiled in), and will execute the code 999 *much* faster than the interpreter. 1000 1001``llc`` 1002 1003 ``llc`` is the LLVM backend compiler, which translates LLVM bitcode to a 1004 native code assembly file. 1005 1006``opt`` 1007 1008 ``opt`` reads LLVM bitcode, applies a series of LLVM to LLVM transformations 1009 (which are specified on the command line), and outputs the resultant 1010 bitcode. '``opt -help``' is a good way to get a list of the 1011 program transformations available in LLVM. 1012 1013 ``opt`` can also run a specific analysis on an input LLVM bitcode 1014 file and print the results. Primarily useful for debugging 1015 analyses, or familiarizing yourself with what an analysis does. 1016 1017``llvm/utils`` 1018-------------- 1019 1020Utilities for working with LLVM source code; some are part of the build process 1021because they are code generators for parts of the infrastructure. 1022 1023 1024``codegen-diff`` 1025 1026 ``codegen-diff`` finds differences between code that LLC 1027 generates and code that LLI generates. This is useful if you are 1028 debugging one of them, assuming that the other generates correct output. For 1029 the full user manual, run ```perldoc codegen-diff'``. 1030 1031``emacs/`` 1032 1033 Emacs and XEmacs syntax highlighting for LLVM assembly files and TableGen 1034 description files. See the ``README`` for information on using them. 1035 1036``getsrcs.sh`` 1037 1038 Finds and outputs all non-generated source files, 1039 useful if one wishes to do a lot of development across directories 1040 and does not want to find each file. One way to use it is to run, 1041 for example: ``xemacs `utils/getsources.sh``` from the top of the LLVM source 1042 tree. 1043 1044``llvmgrep`` 1045 1046 Performs an ``egrep -H -n`` on each source file in LLVM and 1047 passes to it a regular expression provided on ``llvmgrep``'s command 1048 line. This is an efficient way of searching the source base for a 1049 particular regular expression. 1050 1051``TableGen/`` 1052 1053 Contains the tool used to generate register 1054 descriptions, instruction set descriptions, and even assemblers from common 1055 TableGen description files. 1056 1057``vim/`` 1058 1059 vim syntax-highlighting for LLVM assembly files 1060 and TableGen description files. See the ``README`` for how to use them. 1061 1062.. _simple example: 1063 1064An Example Using the LLVM Tool Chain 1065==================================== 1066 1067This section gives an example of using LLVM with the Clang front end. 1068 1069Example with clang 1070------------------ 1071 1072#. First, create a simple C file, name it 'hello.c': 1073 1074 .. code-block:: c 1075 1076 #include <stdio.h> 1077 1078 int main() { 1079 printf("hello world\n"); 1080 return 0; 1081 } 1082 1083#. Next, compile the C file into a native executable: 1084 1085 .. code-block:: console 1086 1087 % clang hello.c -o hello 1088 1089 .. note:: 1090 1091 Clang works just like GCC by default. The standard -S and -c arguments 1092 work as usual (producing a native .s or .o file, respectively). 1093 1094#. Next, compile the C file into an LLVM bitcode file: 1095 1096 .. code-block:: console 1097 1098 % clang -O3 -emit-llvm hello.c -c -o hello.bc 1099 1100 The -emit-llvm option can be used with the -S or -c options to emit an LLVM 1101 ``.ll`` or ``.bc`` file (respectively) for the code. This allows you to use 1102 the `standard LLVM tools <CommandGuide/index.html>`_ on the bitcode file. 1103 1104#. Run the program in both forms. To run the program, use: 1105 1106 .. code-block:: console 1107 1108 % ./hello 1109 1110 and 1111 1112 .. code-block:: console 1113 1114 % lli hello.bc 1115 1116 The second examples shows how to invoke the LLVM JIT, :doc:`lli 1117 <CommandGuide/lli>`. 1118 1119#. Use the ``llvm-dis`` utility to take a look at the LLVM assembly code: 1120 1121 .. code-block:: console 1122 1123 % llvm-dis < hello.bc | less 1124 1125#. Compile the program to native assembly using the LLC code generator: 1126 1127 .. code-block:: console 1128 1129 % llc hello.bc -o hello.s 1130 1131#. Assemble the native assembly language file into a program: 1132 1133 .. code-block:: console 1134 1135 % /opt/SUNWspro/bin/cc -xarch=v9 hello.s -o hello.native # On Solaris 1136 1137 % gcc hello.s -o hello.native # On others 1138 1139#. Execute the native code program: 1140 1141 .. code-block:: console 1142 1143 % ./hello.native 1144 1145 Note that using clang to compile directly to native code (i.e. when the 1146 ``-emit-llvm`` option is not present) does steps 6/7/8 for you. 1147 1148Common Problems 1149=============== 1150 1151If you are having problems building or using LLVM, or if you have any other 1152general questions about LLVM, please consult the `Frequently Asked 1153Questions <FAQ.html>`_ page. 1154 1155If you are having problems with limited memory and build time, please try 1156building with ninja instead of make. Please consider configuring the 1157following options with cmake: 1158 1159 * -G Ninja 1160 Setting this option will allow you to build with ninja instead of make. 1161 Building with ninja significantly improves your build time, especially with 1162 incremental builds, and improves your memory usage. 1163 1164 * -DLLVM_USE_LINKER 1165 Setting this option to lld will significantly reduce linking time for LLVM 1166 executables on ELF-based platforms, such as Linux. If you are building LLVM 1167 for the first time and lld is not available to you as a binary package, then 1168 you may want to use the gold linker as a faster alternative to GNU ld. 1169 1170 * -DCMAKE_BUILD_TYPE 1171 1172 - Debug --- This is the default build type. This disables optimizations while 1173 compiling LLVM and enables debug info. On ELF-based platforms (e.g. Linux) 1174 linking with debug info may consume a large amount of memory. 1175 1176 - Release --- Turns on optimizations and disables debug info. Combining the 1177 Release build type with -DLLVM_ENABLE_ASSERTIONS=ON may be a good trade-off 1178 between speed and debugability during development, particularly for running 1179 the test suite. 1180 1181 * -DLLVM_ENABLE_ASSERTIONS 1182 This option defaults to ON for Debug builds and defaults to OFF for Release 1183 builds. As mentioned in the previous option, using the Release build type and 1184 enabling assertions may be a good alternative to using the Debug build type. 1185 1186 * -DLLVM_PARALLEL_LINK_JOBS 1187 Set this equal to number of jobs you wish to run simultaneously. This is 1188 similar to the -j option used with make, but only for link jobs. This option 1189 can only be used with ninja. You may wish to use a very low number of jobs, 1190 as this will greatly reduce the amount of memory used during the build 1191 process. If you have limited memory, you may wish to set this to 1. 1192 1193 * -DLLVM_TARGETS_TO_BUILD 1194 Set this equal to the target you wish to build. You may wish to set this to 1195 X86; however, you will find a full list of targets within the 1196 llvm-project/llvm/lib/Target directory. 1197 1198 * -DLLVM_OPTIMIZED_TABLEGEN 1199 Set this to ON to generate a fully optimized tablegen during your build. This 1200 will significantly improve your build time. This is only useful if you are 1201 using the Debug build type. 1202 1203 * -DLLVM_ENABLE_PROJECTS 1204 Set this equal to the projects you wish to compile (e.g. clang, lld, etc.) If 1205 compiling more than one project, separate the items with a semicolon. Should 1206 you run into issues with the semicolon, try surrounding it with single quotes. 1207 1208 * -DCLANG_ENABLE_STATIC_ANALYZER 1209 Set this option to OFF if you do not require the clang static analyzer. This 1210 should improve your build time slightly. 1211 1212 * -DLLVM_USE_SPLIT_DWARF 1213 Consider setting this to ON if you require a debug build, as this will ease 1214 memory pressure on the linker. This will make linking much faster, as the 1215 binaries will not contain any of the debug information; however, this will 1216 generate the debug information in the form of a DWARF object file (with the 1217 extension .dwo). This only applies to host platforms using ELF, such as Linux. 1218 1219.. _links: 1220 1221Links 1222===== 1223 1224This document is just an **introduction** on how to use LLVM to do some simple 1225things... there are many more interesting and complicated things that you can do 1226that aren't documented here (but we'll gladly accept a patch if you want to 1227write something up!). For more information about LLVM, check out: 1228 1229* `LLVM Homepage <https://llvm.org/>`_ 1230* `LLVM Doxygen Tree <https://llvm.org/doxygen/>`_ 1231* `Starting a Project that Uses LLVM <https://llvm.org/docs/Projects.html>`_ 1232 1233.. _installing arcanist: https://secure.phabricator.com/book/phabricator/article/arcanist_quick_start/ 1234