Re: [PATCH] compiler_types: Introduce inline_for_performance
From: David Laight <hidden>
Date: 2026-01-19 09:33:43
Also in:
lkml
On Sun, 18 Jan 2026 16:01:25 -0800 Andrew Morton [off-list ref] wrote:
On Sun, 18 Jan 2026 22:58:02 +0000 David Laight [off-list ref] wrote:quoted
quoted
mm/ alone has 74 __always_inlines, none are documented, I don't know why they're present, many are probably wrong. Shit, uninlining only __get_user_pages_locked does this: text data bss dec hex filename 115703 14018 64 129785 1faf9 mm/gup.o 103866 13058 64 116988 1c8fc mm/gup.o-afterThe next questions are does anything actually run faster (either way), and should anything at all be marked 'inline' rather than 'always_inline'. After all, if you call a function twice (not in a loop) you may want a real function in order to avoid I-cache misses.yup
I had two adjacent strlen() calls in a bit of code, the first was an array (in a structure) and gcc inlined the 'word at a time' code, the second was a pointer and it called the library function. That had to be sub-optimal...
quoted
But I'm sure there is a lot of code that is 'inline_for_bloat' :-)ooh, can we please have that?
Or 'inline_to_speed_up_benchmark' and the associated 'unroll this loop because that must make it faster'.
I do think that every always_inline should be justified and commented, but I haven't been energetic about asking for that.
Apart from the 4-line functions where it is clearly obvious. Especially since the compiler can still decide to not-inline them if they are only 'inline'.
A fun little project would be go through each one, figure out whether were good reasons and if not, just remove them and see if anyone explains why that was incorrect.
It's not just always_inline, a lot of the inline are dubious. Probably why the networking code doesn't like it. Maybe persuade Linus to do some of that. He can use his 'god' bit to just change them. David