Re: [PATCH 1/7] General notification queue with user mmap()'able ring buffer
From: David Howells <dhowells@redhat.com>
Date: 2019-05-29 16:06:54
Also in:
keyrings, linux-block, linux-fsdevel, linux-security-module, lkml
Greg KH [off-list ref] wrote:
quoted
kref_put() could potentially add an unnecessary extra stack frame and would seem to be best avoided, though an optimising compiler ought to be able to inline if it can.If kref_put() is on your fast path, you have worse problems (kfree isn't fast, right?) Anyway, it's an inline function, how can it add an extra stack frame?
The call to the function pointer. Hopefully the compiler will optimise that away for an inlineable function.
quoted
Are you now on the convert all refcounts to krefs path?"now"? Remember, I wrote kref all those years ago,
Yes - and I thought it wasn't a good idea at the time. But this is the first time you've mentioned it to me, let alone pushed to change to it, that I recall.
everyone should use it. It saves us having to audit the same pattern over and over again. And, even nicer, it uses a refcount now, and as you are trying to reference count an object, it is exactly what this was written for. So yes, I do think it should be used here, unless it is deemed to not fit the pattern/usage model.
kref_put() enforces a very specific destructor signature. I know of places where that doesn't work because the destructor takes more than one argument (granted that this is not the case here). So why does kref_put() exist at all? Why not kref_dec_and_test()? Why doesn't refcount_t get merged into kref, or vice versa? Having both would seem redundant. Mind you, I've been gradually reverting atomic_t-to-refcount_t conversions because it seems I'm not allowed refcount_inc/dec_return() and I want to get at the point refcount for tracing purposes.
quoted
quoted
quoted
+module_exit(watch_queue_exit);module_misc_device()?warthog>git grep module_misc_device -- Documentation/ warthog1>Do I have to document all helper macros?
If you add an API, documenting it is your privilege ;-) It's an important test of the API - if you can't describe it, it's probably wrong. Now I will grant that you didn't add that function...
Anyway, it saves you boilerplate code, but if built in, it's at the module init level, not the fs init level, like you are asking for here. So that might not work, it's your call.
Actually, I probably shouldn't have a module exit function. It can't be a module as it's called by core code. I'll switch to builtin_misc_device().
And how does the tracing and perf ring buffers do this without needing volatile? Why not use the same type of interface they provide, as it's always good to share code that has already had all of the nasty corner cases worked out.
I've no idea how trace does it - or even where - or even if. As far as I can see, grepping for mmap in kernel/trace/*, there's no mmap support. Reading Documentation/trace/ring-buffer-design.txt the trace subsystem has some sort of transient page fifo which is a lot more complicated than what I want and doesn't look like it'll be mmap'able. Looking at the perf ring buffer, there appears to be a missing barrier in perf_aux_output_end(): rb->user_page->aux_head = rb->aux_head; should be: smp_store_release(&rb->user_page->aux_head, rb->aux_head); It should also be using smp_load_acquire(). See Documentation/core-api/circular-buffers.rst And a (partial) patch has been proposed: https://lkml.org/lkml/2018/5/10/249 David