Re: [PATCH v4 1/4] x86/ibt: factor out cfi and fineibt offset

[PATCH v4 0/4] per-function storage support · Menglong Dong <hidden> · 2025-03-03
[PATCH v4 1/4] x86/ibt: factor out cfi and fineibt offset · Menglong Dong <hidden> · 2025-03-03
Re: [PATCH v4 1/4] x86/ibt: factor out cfi and fineibt offset · Peter Zijlstra <peterz@infradead.org> · 2025-03-03
Re: [PATCH v4 1/4] x86/ibt: factor out cfi and fineibt offset · Menglong Dong <hidden> · 2025-03-04
Re: [PATCH v4 1/4] x86/ibt: factor out cfi and fineibt offset · Peter Zijlstra <peterz@infradead.org> · 2025-03-04
Re: [PATCH v4 1/4] x86/ibt: factor out cfi and fineibt offset · Peter Zijlstra <peterz@infradead.org> · 2025-03-04
Re: [PATCH v4 1/4] x86/ibt: factor out cfi and fineibt offset · Menglong Dong <hidden> · 2025-03-04
Re: [PATCH v4 1/4] x86/ibt: factor out cfi and fineibt offset · Menglong Dong <hidden> · 2025-03-04
Re: [PATCH v4 1/4] x86/ibt: factor out cfi and fineibt offset · Peter Zijlstra <peterz@infradead.org> · 2025-03-04
Re: [PATCH v4 1/4] x86/ibt: factor out cfi and fineibt offset · "H. Peter Anvin" <hpa@zytor.com> · 2025-03-04
Re: [PATCH v4 1/4] x86/ibt: factor out cfi and fineibt offset · Menglong Dong <hidden> · 2025-03-05
Re: [PATCH v4 1/4] x86/ibt: factor out cfi and fineibt offset · "H. Peter Anvin" <hpa@zytor.com> · 2025-03-05
Re: [PATCH v4 1/4] x86/ibt: factor out cfi and fineibt offset · Menglong Dong <hidden> · 2025-03-05
Re: [PATCH v4 1/4] x86/ibt: factor out cfi and fineibt offset · Steven Rostedt <rostedt@goodmis.org> · 2025-03-05
Re: [PATCH v4 1/4] x86/ibt: factor out cfi and fineibt offset · Menglong Dong <hidden> · 2025-03-06
Re: [PATCH v4 1/4] x86/ibt: factor out cfi and fineibt offset · Alexei Starovoitov <hidden> · 2025-03-06
Re: [PATCH v4 1/4] x86/ibt: factor out cfi and fineibt offset · Menglong Dong <hidden> · 2025-03-06
Re: [PATCH v4 1/4] x86/ibt: factor out cfi and fineibt offset · Menglong Dong <hidden> · 2025-03-23
[PATCH v4 2/4] add per-function metadata storage support · Menglong Dong <hidden> · 2025-03-03
[PATCH v4 3/4] x86: implement per-function metadata storage for x86 · Menglong Dong <hidden> · 2025-03-03
[PATCH v4 4/4] arm64: implement per-function metadata storage for arm64 · Menglong Dong <hidden> · 2025-03-03
Re: [PATCH v4 4/4] arm64: implement per-function metadata storage for arm64 · Sami Tolvanen <samitolvanen@google.com> · 2025-03-03
Re: [PATCH v4 4/4] arm64: implement per-function metadata storage for arm64 · Menglong Dong <hidden> · 2025-03-04

From: Alexei Starovoitov <hidden>
Date: 2025-03-06 03:39:33
Also in: bpf, linux-arm-kernel, lkml, llvm, netdev

On Wed, Mar 5, 2025 at 6:59 PM Menglong Dong [off-list ref] wrote:

I'm not sure if it works. However, indirect call is also used
in function graph, so we still have better performance. Isn't it?

Let me have a look at the code of the function graph first :/

Menglong,

Function graph infra isn't going to help.
"call foo" isn't a problem either.

But we have to step back.
per-function metadata is an optimization and feels like
we're doing a premature optimization here without collecting
performance numbers first.

Let's implement multi-fentry with generic get_metadata_by_ip() first.
get_metadata_by_ip() will be a hashtable in such a case and
then we can compare its performance when it's implemented as
a direct lookup from ip-4 (this patch) vs hash table
(that does 'ip' to 'metadata' lookup).

If/when we decide to do this per-function metadata we can also
punt to generic hashtable for cfi, IBT, FineIBT, etc configs.
When mitigations are enabled the performance suffers anyway,
so hashtable lookup vs direct ip-4 lookup won't make much difference.
So we can enable per-function metadata only on non-mitigation configs
when FUNCTION_ALIGNMENT=16.
There will be some number of bytes available before every function
and if we can tell gcc/llvm to leave at least 5 bytes there
the growth of vmlinux .text will be within a noise.

So let's figure out the design of multi-fenty first with a hashtable
for metadata and decide next steps afterwards.

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help