1 Compile-time stack metadata validation
2 ======================================
8 The kernel CONFIG_STACK_VALIDATION option enables a host tool named
9 objtool which runs at compile time. It has a "check" subcommand which
10 analyzes every .o file and ensures the validity of its stack metadata.
11 It enforces a set of rules on asm code and C inline assembly code so
12 that stack traces can be reliable.
14 For each function, it recursively follows all possible code paths and
15 validates the correct frame pointer state at each instruction.
17 It also follows code paths involving special sections, like
18 .altinstructions, __jump_table, and __ex_table, which can add
19 alternative execution paths to a given instruction (or set of
20 instructions). Similarly, it knows how to follow switch statements, for
21 which gcc sometimes uses jump tables.
23 (Objtool also has an 'orc generate' subcommand which generates debuginfo
24 for the ORC unwinder. See Documentation/x86/orc-unwinder.rst in the
25 kernel tree for more details.)
28 Why do we need stack metadata validation?
29 -----------------------------------------
31 Here are some of the benefits of validating stack metadata:
33 a) More reliable stack traces for frame pointer enabled kernels
35 Frame pointers are used for debugging purposes. They allow runtime
36 code and debug tools to be able to walk the stack to determine the
37 chain of function call sites that led to the currently executing
40 For some architectures, frame pointers are enabled by
41 CONFIG_FRAME_POINTER. For some other architectures they may be
42 required by the ABI (sometimes referred to as "backchain pointers").
44 For C code, gcc automatically generates instructions for setting up
45 frame pointers when the -fno-omit-frame-pointer option is used.
47 But for asm code, the frame setup instructions have to be written by
48 hand, which most people don't do. So the end result is that
49 CONFIG_FRAME_POINTER is honored for C code but not for most asm code.
51 For stack traces based on frame pointers to be reliable, all
52 functions which call other functions must first create a stack frame
53 and update the frame pointer. If a first function doesn't properly
54 create a stack frame before calling a second function, the *caller*
55 of the first function will be skipped on the stack trace.
57 For example, consider the following example backtrace with frame
60 [<ffffffff81812584>] dump_stack+0x4b/0x63
61 [<ffffffff812d6dc2>] cmdline_proc_show+0x12/0x30
62 [<ffffffff8127f568>] seq_read+0x108/0x3e0
63 [<ffffffff812cce62>] proc_reg_read+0x42/0x70
64 [<ffffffff81256197>] __vfs_read+0x37/0x100
65 [<ffffffff81256b16>] vfs_read+0x86/0x130
66 [<ffffffff81257898>] SyS_read+0x58/0xd0
67 [<ffffffff8181c1f2>] entry_SYSCALL_64_fastpath+0x12/0x76
69 It correctly shows that the caller of cmdline_proc_show() is
72 If we remove the frame pointer logic from cmdline_proc_show() by
73 replacing the frame pointer related instructions with nops, here's
74 what it looks like instead:
76 [<ffffffff81812584>] dump_stack+0x4b/0x63
77 [<ffffffff812d6dc2>] cmdline_proc_show+0x12/0x30
78 [<ffffffff812cce62>] proc_reg_read+0x42/0x70
79 [<ffffffff81256197>] __vfs_read+0x37/0x100
80 [<ffffffff81256b16>] vfs_read+0x86/0x130
81 [<ffffffff81257898>] SyS_read+0x58/0xd0
82 [<ffffffff8181c1f2>] entry_SYSCALL_64_fastpath+0x12/0x76
84 Notice that cmdline_proc_show()'s caller, seq_read(), has been
85 skipped. Instead the stack trace seems to show that
86 cmdline_proc_show() was called by proc_reg_read().
88 The benefit of objtool here is that because it ensures that *all*
89 functions honor CONFIG_FRAME_POINTER, no functions will ever[*] be
90 skipped on a stack trace.
92 [*] unless an interrupt or exception has occurred at the very
93 beginning of a function before the stack frame has been created,
94 or at the very end of the function after the stack frame has been
95 destroyed. This is an inherent limitation of frame pointers.
97 b) ORC (Oops Rewind Capability) unwind table generation
99 An alternative to frame pointers and DWARF, ORC unwind data can be
100 used to walk the stack. Unlike frame pointers, ORC data is out of
101 band. So it doesn't affect runtime performance and it can be
102 reliable even when interrupts or exceptions are involved.
104 For more details, see Documentation/x86/orc-unwinder.rst.
106 c) Higher live patching compatibility rate
108 Livepatch has an optional "consistency model", which is needed for
109 more complex patches. In order for the consistency model to work,
110 stack traces need to be reliable (or an unreliable condition needs to
111 be detectable). Objtool makes that possible.
113 For more details, see the livepatch documentation in the Linux kernel
114 source tree at Documentation/livepatch/livepatch.rst.
119 To achieve the validation, objtool enforces the following rules:
121 1. Each callable function must be annotated as such with the ELF
122 function type. In asm code, this is typically done using the
123 ENTRY/ENDPROC macros. If objtool finds a return instruction
124 outside of a function, it flags an error since that usually indicates
125 callable code which should be annotated accordingly.
127 This rule is needed so that objtool can properly identify each
128 callable function in order to analyze its stack metadata.
130 2. Conversely, each section of code which is *not* callable should *not*
131 be annotated as an ELF function. The ENDPROC macro shouldn't be used
134 This rule is needed so that objtool can ignore non-callable code.
135 Such code doesn't have to follow any of the other rules.
137 3. Each callable function which calls another function must have the
138 correct frame pointer logic, if required by CONFIG_FRAME_POINTER or
139 the architecture's back chain rules. This can by done in asm code
140 with the FRAME_BEGIN/FRAME_END macros.
142 This rule ensures that frame pointer based stack traces will work as
143 designed. If function A doesn't create a stack frame before calling
144 function B, the _caller_ of function A will be skipped on the stack
147 4. Dynamic jumps and jumps to undefined symbols are only allowed if:
149 a) the jump is part of a switch statement; or
151 b) the jump matches sibling call semantics and the frame pointer has
152 the same value it had on function entry.
154 This rule is needed so that objtool can reliably analyze all of a
155 function's code paths. If a function jumps to code in another file,
156 and it's not a sibling call, objtool has no way to follow the jump
157 because it only analyzes a single file at a time.
159 5. A callable function may not execute kernel entry/exit instructions.
160 The only code which needs such instructions is kernel entry code,
161 which shouldn't be be in callable functions anyway.
163 This rule is just a sanity check to ensure that callable functions
170 For asm files, if you're getting an error which doesn't make sense,
171 first make sure that the affected code follows the above rules.
173 For C files, the common culprits are inline asm statements and calls to
174 "noreturn" functions. See below for more details.
176 Another possible cause for errors in C code is if the Makefile removes
177 -fno-omit-frame-pointer or adds -fomit-frame-pointer to the gcc options.
179 Here are some examples of common warnings reported by objtool, what
180 they mean, and suggestions for how to fix them.
183 1. file.o: warning: objtool: func()+0x128: call without frame pointer save/setup
185 The func() function made a function call without first saving and/or
186 updating the frame pointer, and CONFIG_FRAME_POINTER is enabled.
188 If the error is for an asm file, and func() is indeed a callable
189 function, add proper frame pointer logic using the FRAME_BEGIN and
190 FRAME_END macros. Otherwise, if it's not a callable function, remove
191 its ELF function annotation by changing ENDPROC to END, and instead
192 use the manual unwind hint macros in asm/unwind_hints.h.
194 If it's a GCC-compiled .c file, the error may be because the function
195 uses an inline asm() statement which has a "call" instruction. An
196 asm() statement with a call instruction must declare the use of the
197 stack pointer in its output operand. On x86_64, this means adding
198 the ASM_CALL_CONSTRAINT as an output constraint:
200 asm volatile("call func" : ASM_CALL_CONSTRAINT);
202 Otherwise the stack frame may not get created before the call.
205 2. file.o: warning: objtool: .text+0x53: unreachable instruction
207 Objtool couldn't find a code path to reach the instruction.
209 If the error is for an asm file, and the instruction is inside (or
210 reachable from) a callable function, the function should be annotated
211 with the ENTRY/ENDPROC macros (ENDPROC is the important one).
212 Otherwise, the code should probably be annotated with the unwind hint
213 macros in asm/unwind_hints.h so objtool and the unwinder can know the
214 stack state associated with the code.
216 If you're 100% sure the code won't affect stack traces, or if you're
217 a just a bad person, you can tell objtool to ignore it. See the
218 "Adding exceptions" section below.
220 If it's not actually in a callable function (e.g. kernel entry code),
221 change ENDPROC to END.
224 4. file.o: warning: objtool: func(): can't find starting instruction
226 file.o: warning: objtool: func()+0x11dd: can't decode instruction
228 Does the file have data in a text section? If so, that can confuse
229 objtool's instruction decoder. Move the data to a more appropriate
230 section like .data or .rodata.
233 5. file.o: warning: objtool: func()+0x6: unsupported instruction in callable function
235 This is a kernel entry/exit instruction like sysenter or iret. Such
236 instructions aren't allowed in a callable function, and are most
237 likely part of the kernel entry code. They should usually not have
238 the callable function annotation (ENDPROC) and should always be
239 annotated with the unwind hint macros in asm/unwind_hints.h.
242 6. file.o: warning: objtool: func()+0x26: sibling call from callable instruction with modified stack frame
244 This is a dynamic jump or a jump to an undefined symbol. Objtool
245 assumed it's a sibling call and detected that the frame pointer
246 wasn't first restored to its original state.
248 If it's not really a sibling call, you may need to move the
249 destination code to the local file.
251 If the instruction is not actually in a callable function (e.g.
252 kernel entry code), change ENDPROC to END and annotate manually with
253 the unwind hint macros in asm/unwind_hints.h.
256 7. file: warning: objtool: func()+0x5c: stack state mismatch
258 The instruction's frame pointer state is inconsistent, depending on
259 which execution path was taken to reach the instruction.
261 Make sure that, when CONFIG_FRAME_POINTER is enabled, the function
262 pushes and sets up the frame pointer (for x86_64, this means rbp) at
263 the beginning of the function and pops it at the end of the function.
264 Also make sure that no other code in the function touches the frame
267 Another possibility is that the code has some asm or inline asm which
268 does some unusual things to the stack or the frame pointer. In such
269 cases it's probably appropriate to use the unwind hint macros in
273 8. file.o: warning: objtool: funcA() falls through to next function funcB()
275 This means that funcA() doesn't end with a return instruction or an
276 unconditional jump, and that objtool has determined that the function
277 can fall through into the next function. There could be different
280 1) funcA()'s last instruction is a call to a "noreturn" function like
281 panic(). In this case the noreturn function needs to be added to
282 objtool's hard-coded global_noreturns array. Feel free to bug the
283 objtool maintainer, or you can submit a patch.
285 2) funcA() uses the unreachable() annotation in a section of code
286 that is actually reachable.
288 3) If funcA() calls an inline function, the object code for funcA()
289 might be corrupt due to a gcc bug. For more details, see:
290 https://gcc.gnu.org/bugzilla/show_bug.cgi?id=70646
292 9. file.o: warning: objtool: funcA() call to funcB() with UACCESS enabled
294 This means that an unexpected call to a non-whitelisted function exists
295 outside of arch-specific guards.
296 X86: SMAP (stac/clac): __uaccess_begin()/__uaccess_end()
297 ARM: PAN: uaccess_enable()/uaccess_disable()
299 These functions should be called to denote a minimal critical section around
300 access to __user variables. See also: https://lwn.net/Articles/517475/
302 The intention of the warning is to prevent calls to funcB() from eventually
303 calling schedule(), potentially leaking the AC flags state, and not
304 restoring them correctly.
306 It also helps verify that there are no unexpected calls to funcB() which may
307 access user space pages with protections against doing so disabled.
310 1) remove explicit calls to funcB() from funcA().
311 2) add the correct guards before and after calls to low level functions like
312 __get_user_size()/__put_user_size().
313 3) add funcB to uaccess_safe_builtin whitelist in tools/objtool/check.c, if
314 funcB obviously does not call schedule(), and is marked notrace (since
315 function tracing inserts additional calls, which is not obvious from the
318 10. file.o: warning: func()+0x5c: alternative modifies stack
320 This means that an alternative includes instructions that modify the
321 stack. The problem is that there is only one ORC unwind table, this means
322 that the ORC unwind entries must be valid for each of the alternatives.
323 The easiest way to enforce this is to ensure alternatives do not contain
324 any ORC entries, which in turn implies the above constraint.
326 11. file.o: warning: unannotated intra-function call
328 This warning means that a direct call is done to a destination which
329 is not at the beginning of a function. If this is a legit call, you
330 can remove this warning by putting the ANNOTATE_INTRA_FUNCTION_CALL
331 directive right before the call.
334 If the error doesn't seem to make sense, it could be a bug in objtool.
335 Feel free to ask the objtool maintainer for help.
341 If you _really_ need objtool to ignore something, and are 100% sure
342 that it won't affect kernel stack traces, you can tell objtool to
345 - To skip validation of a function, use the STACK_FRAME_NON_STANDARD
348 - To skip validation of a file, add
350 OBJECT_FILES_NON_STANDARD_filename.o := y
354 - To skip validation of a directory, add
356 OBJECT_FILES_NON_STANDARD := y