11 This tutorial will guide you through the process of making a change to
12 LLVM, and contributing it back to the LLVM project. We'll be making a
13 change to Clang, but the steps for other parts of LLVM are the same.
14 Even though the change we'll be making is simple, we're going to cover
15 steps like building LLVM, running the tests, and code review. This is
16 good practice, and you'll be prepared for making larger changes.
20 - know how to use an editor,
22 - have basic C++ knowledge,
24 - know how to install software on your system,
26 - are comfortable with the command line,
28 - have basic knowledge of git.
31 The change we're making
32 -----------------------
34 Clang has a warning for infinite recursion:
38 $ echo "void foo() { foo(); }" > ~/test.cc
39 $ clang -c -Wall ~/test.cc
40 input.cc:1:14: warning: all paths through this function will call
41 itself [-Winfinite-recursion]
43 This is clear enough, but not exactly catchy. Let's improve the wording
48 input.cc:1:14: warning: to understand recursion, you must first
49 understand recursion [-Winfinite-recursion]
55 We're going to need some tools:
57 - git: to check out the LLVM source code,
59 - a C++ compiler: to compile LLVM source code. You'll want `a recent
60 version <https://llvm.org/docs/GettingStarted.html#host-c-toolchain-both-compiler-and-standard-library>`__
61 of Clang, GCC, or Visual Studio.
63 - CMake: used to configure how LLVM should be built on your system,
65 - ninja: runs the C++ compiler to (re)build specific parts of LLVM,
67 - python: to run the LLVM tests,
69 - arcanist: for uploading changes for review,
71 As an example, on Ubuntu:
75 $ sudo apt-get install git clang cmake ninja-build python arcanist
85 The source code is stored `on
86 Github <https://github.com/llvm/llvm-project>`__ in one large repository
89 It may take a while to download!
93 $ git clone https://github.com/llvm/llvm-project.git
95 This will create a directory "llvm-project" with all of the source
96 code.(Checking out anonymously is OK - pushing commits uses a different
97 mechanism, as we'll see later)
99 Configure your workspace
100 ------------------------
102 Before we can build the code, we must configure exactly how to build it
103 by running CMake. CMake combines information from three sources:
105 - explicit choices you make (is this a debug build?)
107 - settings detected from your system (where are libraries installed?)
109 - project structure (which files are part of 'clang'?)
111 First, create a directory to build in. Usually, this is
116 $ mkdir llvm-project/build
117 $ cd llvm-project/build
123 $ cmake -G Ninja ../llvm -DCMAKE_BUILD_TYPE=Release -DLLVM_ENABLE_PROJECTS=clang
125 If all goes well, you'll see a lot of "performing test" lines, and
132 Build files have been written to: /path/llvm-project/build
134 And you should see a build.ninja file.
136 Let's break down that last command a little:
138 - **-G Ninja**: we're going to use ninja to build; please create
141 - **../llvm**: this is the path to the source of the "main" LLVM
144 - The two **-D** flags set CMake variables, which override
145 CMake/project defaults:
147 - **CMAKE\ BUILD\ TYPE=Release**: build in optimized mode, which is
148 (surprisingly) the fastest option.
150 If you want to run under a debugger, you should use the default Debug
151 (which is totally unoptimized, and will lead to >10x slower test
152 runs) or RelWithDebInfo which is a halfway point.
153 **CMAKE\ BUILD\ TYPE** affects code generation only, assertions are
154 on by default regardless! **LLVM\ ENABLE\ ASSERTIONS=Off** disables
157 - **LLVM\ ENABLE\ PROJECTS=clang** : this lists the LLVM subprojects
158 you are interested in building, in addition to LLVM itself. Multiple
159 projects can be listed, separated by semicolons, such as "clang;
160 lldb".In this example, we'll be making a change to Clang, so we
163 Finally, create a symlink (or a copy) of
164 llvm-project/build/compile-commands.json into llvm-project/:
168 $ ln -s build/compile_commands.json ../
170 (This isn't strictly necessary for building and testing, but allows
171 tools like clang-tidy, clang-query, and clangd to work in your source
178 Finally, we can build the code! It's important to do this first, to
179 ensure we're in a good state before making changes. But what to build?
180 In ninja, you specify a **target**. If we just want to build the clang
181 binary, our target name is "clang" and we run:
187 The first time we build will be very slow - Clang + LLVM is a lot of
188 code. But incremental builds are fast: ninja will only rebuild the parts
189 that have changed. When it finally finishes you should have a working
190 clang binary. Try running:
194 $ bin/clang --version
196 There's also a target for building and running all the clang tests:
202 This is a common pattern in LLVM: check-llvm is all the checks for core,
203 other projects have targets like check-lldb.
213 We need to find the file containing the error message.
217 $ git grep "all paths through this function" ..
218 ../clang/include/clang/Basic/DiagnosticSemaKinds.td: "all paths through this function will call itself">,
220 The string that appears in DiagnosticSemaKinds.td is the one that is
221 printed by Clang. \*.td files define tables - in this case it's a list
222 of warnings and errors clang can emit and their messages. Let's update
223 the message in your favorite editor:
227 $ vi ../clang/include/clang/Basic/DiagnosticSemaKinds.td
229 Find the message (it should be under
230 warn\ *infinite*\ recursive_function)Change the message to "in order to
231 understand recursion, you must first understand recursion".
237 To verify our change, we can build clang and manually check that it
243 $ bin/clang -Wall ~/test.cc
245 **/path/test.cc:1:124:** **warning****: in order to understand recursion, you must
246 first understand recursion [-Winfinite-recursion]**
248 We should also run the tests to make sure we didn't break something.
254 Notice that it is much faster to build this time, but the tests take
255 just as long to run. Ninja doesn't know which tests might be affected,
261 Testing Time: 408.84s
264 Clang :: SemaCXX/warn-infinite-recursion.cpp
266 Well, that makes senseā¦ and the test output suggests it's looking for
267 the old string "call itself" and finding our new message instead.
268 Note that more tests may fail in a similar way as new tests are
271 Let's fix it by updating the expectation in the test.
275 $ vi ../clang/test/SemaCXX/warn-infinite-recursion.cpp
277 Everywhere we see `// expected-warning{{call itself}}` (or something similar
278 from the original warning text), let's replace it with
279 `// expected-warning{{to understand recursion}}`.
281 Now we could run **all** the tests again, but this is a slow way to
282 iterate on a change! Instead, let's find a way to re-run just the
283 specific test. There are two main types of tests in LLVM:
285 - **lit tests** (e.g. SemaCXX/warn-infinite-recursion.cpp).
287 These are fancy shell scripts that run command-line tools and verify the
288 output. They live in files like
289 clang/**test**/FixIt/dereference-addressof.c. Re-run like this:
293 $ bin/llvm-lit -v ../clang/test/SemaCXX/warn-infinite-recursion.cpp
295 - **unit tests** (e.g. ToolingTests/ReplacementTest.CanDeleteAllText)
297 These are C++ programs that call LLVM functions and verify the results.
298 They live in suites like ToolingTests. Re-run like this:
302 $ ninja ToolingTests && tools/clang/unittests/Tooling/ToolingTests
303 --gtest_filter=ReplacementTest.CanDeleteAllText
309 We'll save the change to a local git branch. This lets us work on other
310 things while the change is being reviewed. Changes should have a
311 description, to explain to reviewers and future readers of the code why
316 $ git checkout -b myfirstpatch
317 $ git commit -am "[Diagnostic] Clarify -Winfinite-recursion message"
319 Now we're ready to send this change out into the world! By the way,
320 There is a unwritten convention of using tag for your commit. Tags
321 usually represent modules that you intend to modify. If you don't know
322 the tags for your modules, you can look at the commit history :
323 https://github.com/llvm/llvm-project/commits/main.
333 Changes can be reviewed by anyone in the LLVM community who has commit
334 access.For larger and more complicated changes, it's important that the
335 reviewer has experience with the area of LLVM and knows the design goals
336 well. The author of a change will often assign a specific reviewer (git
337 blame and git log can be useful to find one).
339 As our change is fairly simple, we'll add the cfe-commits mailing list
340 as a subscriber; anyone who works on clang can likely pick up the
341 review. (For changes outside clang, llvm-commits is the usual list. See
342 `http://lists.llvm.org/ <http://lists.llvm.org/mailman/listinfo>`__ for
343 all the \*-commits mailing lists).
346 Uploading a change for review
347 -----------------------------
349 LLVM code reviews happen at https://reviews.llvm.org. The web interface
350 is called Phabricator, and the code review part is Differential. You
351 should create a user account there for reviews (click "Log In" and then
352 "Register new account").
354 Now you can upload your change for review:
360 This creates a review for your change, comparing your current commit
361 with the previous commit. You will be prompted to fill in the review
362 details. Your commit message is already there, so just add cfe-commits
363 under the "subscribers" section. It should print a code review URL:
364 https://reviews.llvm.org/D58291 You can always find your active reviews
365 on Phabricator under "My activity".
371 When you upload a change for review, an email is sent to you, the
372 cfe-commits list, and anyone else subscribed to these kinds of changes.
373 Within a few days, someone should start the review. They may add
374 themselves as a reviewer, or simply start leaving comments. You'll get
375 another email any time the review is updated. The details are in the
376 `https://llvm.org/docs/CodeReview/ <https://llvm.org/docs/CodeReview.html>`__.
382 The reviewer can leave comments on the change, and you can reply. Some
383 comments are attached to specific lines, and appear interleaved with the
384 code. You can either reply to these, or address them and mark them as
385 "done". Note that in-line replies are **not** sent straight away! They
386 become "draft" comments and you must click "Submit" at the bottom of the
393 If you make changes in response to a reviewer's comments, simply run
399 again to update the change and notify the reviewer. Typically this is a
400 good time to send any draft comments as well.
406 When the reviewer is happy with the change, they will **Accept** the
407 revision. They may leave some more minor comments that you should
408 address, but at this point the review is complete. It's time to get it
415 As this is your first change, you won't have access to commit it
416 yourself yet. The reviewer **doesn't know this**, so you need to tell
417 them! Leave a message on the review like:
419 Thanks @somellvmdev. I don't have commit access, can you land this
420 patch for me? Please use "My Name my@email" to commit the change.
422 The review will be updated when the change is committed.
428 In order to make LLVM a long-term sustainable effort, code needs to be
429 maintainable and well tested. Code reviews help to achieve that goal.
430 Especially for new contributors, that often means many rounds of reviews
431 and push-back on design decisions that do not fit well within the
432 overall architecture of the project.
434 For your first patches, this means:
436 - be kind, and expect reviewers to be kind in return - LLVM has a `Code
437 of Conduct <https://llvm.org/docs/CodeOfConduct.html>`__;
439 - be patient - understanding how a new feature fits into the
440 architecture of the project is often a time consuming effort, and
441 people have to juggle this with other responsibilities in their
442 lives; **ping the review once a week** when there is no response;
444 - if you can't agree, generally the best way is to do what the reviewer
445 asks; we optimize for readability of the code, which the reviewer is
446 in a better position to judge; if this feels like it's not the right
447 option, you can contact the cfe-dev mailing list to get more feedback
454 Once you've contributed a handful of patches to LLVM, start to think
455 about getting commit access yourself. It's probably a good idea if:
457 - you've landed 3-5 patches of larger scope than "fix a typo"
459 - you'd be willing to review changes that are closely related to yours
461 - you'd like to keep contributing to LLVM.
464 Getting commit access
465 ---------------------
467 LLVM uses Git for committing changes. The details are in the `developer
469 document <https://llvm.org/docs/DeveloperPolicy.html#obtaining-commit-access>`__.
475 Actually, this would be a great time to read the rest of the `developer
476 policy <https://llvm.org/docs/DeveloperPolicy.html>`__, too. At minimum,
477 you need to be subscribed to the relevant commits list before landing
478 changes (e.g. llvm-commits@lists.llvm.org), as discussion often happens
479 there if a new patch causes problems.
485 Let's say you have a change on a local git branch, reviewed and ready to
486 commit. Things to do first:
488 - if you used multiple fine-grained commits locally, squash them into a
489 single commit. LLVM prefers commits to match the code that was
490 reviewed. (If you created one commit and then used "arc diff", you're
493 - rebase your patch against the latest LLVM code. LLVM uses a linear
494 history, so everything should be based on an up-to-date origin/main.
498 $ git pull --rebase https://github.com/llvm/llvm-project.git main
500 - ensure the patch looks correct.
506 - run the tests one last time, for good luck
508 At this point git show should show a single commit on top of
511 Now you can push your commit with
515 $ git push https://github.com/llvm/llvm-project.git HEAD:main
517 You should see your change `on
518 GitHub <https://github.com/llvm/llvm-project/commits/main>`__ within
525 Once your change is submitted it will be picked up by automated build
526 bots that will build and test your patch in a variety of configurations.
528 You can see all configurations and their current state in a waterfall
529 view at http://lab.llvm.org/buildbot/#/waterfall. The waterfall view is good
530 to get a general overview over the tested configurations and to see
531 which configuration have been broken for a while.
533 The console view at http://lab.llvm.org/buildbot/#/console helps to get a
534 better understanding of the build results of a specific patch. If you
535 want to follow along how your change is affecting the build bots, **this
536 should be the first place to look at** - the colored bubbles correspond
537 to projects in the waterfall.
539 If you see a broken build, do not despair - some build bots are
540 continuously broken; if your change broke the build, you will see a red
541 bubble in the console view, while an already broken build will show an
542 orange bubble. Of course, even when the build was already broken, a new
543 change might introduce a hidden new failure.
545 | When you want to see more details how a specific build is broken,
546 click the red bubble.
547 | If post-commit error logs confuse you, do not worry too much -
548 everybody on the project is aware that this is a bit unwieldy, so
549 expect people to jump in and help you understand what's going on!
551 buildbots, overview of bots, getting error logs.
557 if in doubt, revert and re-land.
563 llvm is a land of contrasts.