Thread (33 messages) 33 messages, 6 authors, 2016-12-04

RE: Aw: Re: [PATCH] mlx4: give precise rx/tx bytes/packets counters

From: David Laight <hidden>
Date: 2016-11-30 15:28:10

From: Eric Dumazet
Sent: 29 November 2016 00:19
On Mon, 2016-11-28 at 23:02 +0100, Lino Sanfilippo wrote:
quoted
Hi Eric,

On 25.11.2016 20:19, Eric Dumazet wrote:
quoted
On Fri, 2016-11-25 at 17:30 +0100, Lino Sanfilippo wrote:
quoted
Hi,

quoted
The READ_ONCE() are documenting the fact that no lock is taken to fetch
the stats, while another cpus might being changing them.

I had no answer yet from https://patchwork.ozlabs.org/patch/698449/

So I thought it was not needed to explain this in the changelog, given
that it apparently is one of the few things that can block someone to
understand one of my changes :/

Apparently nobody really understands READ_ONCE() purpose, it is really a
pity we have to explain this over and over.
Even at the risk of showing once more a lack of understanding for
READ_ONCE():
Does not a READ_ONCE() have to e paired with some kind of
WRITE_ONCE()?
You are right.

Although in this case, the producers are using a lock, and do

ring->packets++;

We hopefully have compilers/cpus that do not put intermediate garbage in
ring->packets while doing the increment.

One problem with :

WRITE_ONCE(ring->packets, ring->packets + 1);

is that gcc no longer uses an INC instruction.
I see. So we would have to do something like

tmp = ring->packets;
tmp++;
WRITE_ONCE(ring->packets, tmp);

Well, gcc will generate a code with more instructions than a mere

"inc  offset(%register)"
Are you sure??
Last I looked gcc seemed to convert 'foo++' to 'foo = foo + 1' before
generating any code.
It might then optimise that back to a memory increment, but that would
also happen if you'd coded the latter form.
Which is kind of unfortunate, given it is the fast path.

Better add a comment, like :

/* We should use WRITE_ONCE() to pair with the READ_ONCE() found in xxxx()
 * But gcc would generate non optimal code.
 */
Actually while READ_ONCE() is generally useful - to get a snapshot of a changing value.

WRITE_ONCE() isn't a pairing - the compiler is highly unlikely to write a
location twice.
You might want an annotation to ensure is doesn't assume it can read the value
back (write through volatile pointer). But that has nothing to do with how readers behave.

	David
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help