Re: [PATCH] udp: Introduce special NULL pointers for hlist termination

(off-list ancestor, not in this archive)
Re: [PATCH 3/3] Convert the UDP hash lock to RCU · Benny Amorsen <hidden> · 2008-10-07
Re: [PATCH 3/3] Convert the UDP hash lock to RCU · Eric Dumazet <hidden> · 2008-10-07
Re: [PATCH 3/3] Convert the UDP hash lock to RCU · Stephen Hemminger <hidden> · 2008-10-07
Re: [PATCH 3/3] Convert the UDP hash lock to RCU · David Miller <davem@davemloft.net> · 2008-10-07
Re: [PATCH 3/3] Convert the UDP hash lock to RCU · Stephen Hemminger <hidden> · 2008-10-07
Re: [PATCH 3/3] Convert the UDP hash lock to RCU · Eric Dumazet <hidden> · 2008-10-08
Re: [PATCH 3/3] Convert the UDP hash lock to RCU · David Miller <davem@davemloft.net> · 2008-10-08
[PATCH 2/2] udp: RCU handling for Unicast packets. · Eric Dumazet <hidden> · 2008-10-28
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Eric Dumazet <hidden> · 2008-10-28
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · David Miller <davem@davemloft.net> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Eric Dumazet <hidden> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · David Miller <davem@davemloft.net> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Eric Dumazet <hidden> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · David Miller <davem@davemloft.net> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Eric Dumazet <hidden> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · David Miller <davem@davemloft.net> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Corey Minyard <hidden> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Eric Dumazet <hidden> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Corey Minyard <hidden> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Eric Dumazet <hidden> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Paul E. McKenney <hidden> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Corey Minyard <hidden> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Eric Dumazet <hidden> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Corey Minyard <hidden> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Paul E. McKenney <hidden> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Eric Dumazet <hidden> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Paul E. McKenney <hidden> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Corey Minyard <hidden> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Paul E. McKenney <hidden> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Eric Dumazet <hidden> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Eric Dumazet <hidden> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Corey Minyard <hidden> · 2008-10-30
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Eric Dumazet <hidden> · 2008-10-30
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · David Miller <davem@davemloft.net> · 2008-11-02
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · David Miller <davem@davemloft.net> · 2008-10-30
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Eric Dumazet <hidden> · 2008-10-30
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Eric Dumazet <hidden> · 2008-10-30
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · David Miller <davem@davemloft.net> · 2008-10-30
[PATCH] udp: Introduce special NULL pointers for hlist termination · Eric Dumazet <hidden> · 2008-10-30
Re: [PATCH] udp: Introduce special NULL pointers for hlist termination · Stephen Hemminger <hidden> · 2008-10-30
Re: [PATCH] udp: Introduce special NULL pointers for hlist termination · Corey Minyard <hidden> · 2008-10-30
Re: [PATCH] udp: Introduce special NULL pointers for hlist termination · Eric Dumazet <hidden> · 2008-10-31
Re: [PATCH] udp: Introduce special NULL pointers for hlist termination · Pavel Emelyanov <hidden> · 2008-10-31
Re: [PATCH] udp: Introduce special NULL pointers for hlist termination · David Miller <davem@davemloft.net> · 2008-11-02
Re: [PATCH] udp: Introduce special NULL pointers for hlist termination · Eric Dumazet <hidden> · 2008-10-30
Re: [PATCH] udp: Introduce special NULL pointers for hlist termination · David Miller <davem@davemloft.net> · 2008-10-31
Re: [PATCH] udp: Introduce special NULL pointers for hlist termination · Peter Zijlstra <hidden> · 2008-10-30
[PATCH 0/3] net: RCU lookups for UDP, DCCP and TCP protocol · Eric Dumazet <hidden> · 2008-11-13
Re: [PATCH 0/3] net: RCU lookups for UDP, DCCP and TCP protocol · Andi Kleen <hidden> · 2008-11-13
Re: [PATCH 0/3] net: RCU lookups for UDP, DCCP and TCP protocol · David Miller <davem@davemloft.net> · 2008-11-17
Re: [PATCH 0/3] net: RCU lookups for UDP, DCCP and TCP protocol · Christoph Lameter <hidden> · 2008-11-19
[PATCH 1/3] rcu: Introduce hlist_nulls variant of hlist · Eric Dumazet <hidden> · 2008-11-13
Re: [PATCH 1/3] rcu: Introduce hlist_nulls variant of hlist · Peter Zijlstra <hidden> · 2008-11-13
Re: [PATCH 1/3] rcu: Introduce hlist_nulls variant of hlist · Eric Dumazet <hidden> · 2008-11-13
[PATCH 4/3] rcu: documents rculist_nulls · Eric Dumazet <hidden> · 2008-11-13
Re: [PATCH 4/3] rcu: documents rculist_nulls · Peter Zijlstra <hidden> · 2008-11-14
Re: [PATCH 4/3] rcu: documents rculist_nulls · David Miller <davem@davemloft.net> · 2008-11-17
Re: [PATCH 4/3] rcu: documents rculist_nulls · Paul E. McKenney <hidden> · 2008-11-19
Re: [PATCH 1/3] rcu: Introduce hlist_nulls variant of hlist · Peter Zijlstra <hidden> · 2008-11-14
Re: [PATCH 1/3] rcu: Introduce hlist_nulls variant of hlist · Paul E. McKenney <hidden> · 2008-11-19
Re: [PATCH 1/3] rcu: Introduce hlist_nulls variant of hlist · Eric Dumazet <hidden> · 2008-11-19
Re: [PATCH 1/3] rcu: Introduce hlist_nulls variant of hlist · Paul E. McKenney <hidden> · 2008-11-19
Re: [PATCH 1/3] rcu: Introduce hlist_nulls variant of hlist · Arnaldo Carvalho de Melo <hidden> · 2008-11-19
Re: [PATCH 1/3] rcu: Introduce hlist_nulls variant of hlist · Paul E. McKenney <hidden> · 2008-11-19
Re: [PATCH 1/3] rcu: Introduce hlist_nulls variant of hlist · Eric Dumazet <hidden> · 2008-11-19
Re: [PATCH 1/3] rcu: Introduce hlist_nulls variant of hlist · Paul E. McKenney <hidden> · 2008-11-19
[PATCH 2/3] udp: Use hlist_nulls in UDP RCU code · Eric Dumazet <hidden> · 2008-11-13
Re: [PATCH 2/3] udp: Use hlist_nulls in UDP RCU code · Paul E. McKenney <hidden> · 2008-11-19
Re: [PATCH 2/3] udp: Use hlist_nulls in UDP RCU code · Eric Dumazet <hidden> · 2008-11-19
[PATCH 3/3] net: Convert TCP & DCCP hash tables to use RCU / hlist_nulls · Eric Dumazet <hidden> · 2008-11-13
Re: [PATCH 3/3] net: Convert TCP & DCCP hash tables to use RCU / hlist_nulls · Peter Zijlstra <hidden> · 2008-11-13
Re: [PATCH 3/3] net: Convert TCP & DCCP hash tables to use RCU / hlist_nulls · Eric Dumazet <hidden> · 2008-11-13
Re: [PATCH 3/3] net: Convert TCP & DCCP hash tables to use RCU / hlist_nulls · Christoph Lameter <hidden> · 2008-11-13
Re: [PATCH 3/3] net: Convert TCP & DCCP hash tables to use RCU / hlist_nulls · Peter Zijlstra <hidden> · 2008-11-13
Re: [PATCH 3/3] net: Convert TCP & DCCP hash tables to use RCU / hlist_nulls · Christoph Lameter <hidden> · 2008-11-13
Re: [PATCH 3/3] net: Convert TCP & DCCP hash tables to use RCU / hlist_nulls · Paul E. McKenney <hidden> · 2008-11-19
[PATCH] net: Convert TCP/DCCP listening hash tables to use RCU · Eric Dumazet <hidden> · 2008-11-23
Re: [PATCH] net: Convert TCP/DCCP listening hash tables to use RCU · Paul E. McKenney <hidden> · 2008-11-23
Re: [PATCH] net: Convert TCP/DCCP listening hash tables to use RCU · Eric Dumazet <hidden> · 2008-11-23
Re: [PATCH] net: Convert TCP/DCCP listening hash tables to use RCU · Paul E. McKenney <hidden> · 2008-11-23
Re: [PATCH] net: Convert TCP/DCCP listening hash tables to use RCU · Eric Dumazet <hidden> · 2008-11-23
Re: [PATCH] net: Convert TCP/DCCP listening hash tables to use RCU · Paul E. McKenney <hidden> · 2008-11-23
Re: [PATCH] net: Convert TCP/DCCP listening hash tables to use RCU · David Miller <davem@davemloft.net> · 2008-11-24
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Peter Zijlstra <hidden> · 2008-10-30
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Eric Dumazet <hidden> · 2008-10-30
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Paul E. McKenney <hidden> · 2008-10-31
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Eric Dumazet <hidden> · 2008-10-31
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Paul E. McKenney <hidden> · 2008-11-01
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Eric Dumazet <hidden> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Paul E. McKenney <hidden> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · David Miller <davem@davemloft.net> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Paul E. McKenney <hidden> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Eric Dumazet <hidden> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · David Miller <davem@davemloft.net> · 2008-10-29
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Peter Zijlstra <hidden> · 2008-10-30
Re: [PATCH 2/2] udp: RCU handling for Unicast packets. · Eric Dumazet <hidden> · 2008-10-30
[PATCH 1/2] udp: introduce struct udp_table and multiple rwlocks · Eric Dumazet <hidden> · 2008-10-28
Re: [PATCH 1/2] udp: introduce struct udp_table and multiple rwlocks · Christian Bell <hidden> · 2008-10-28
Re: [PATCH 1/2] udp: introduce struct udp_table and multiple rwlocks · Evgeniy Polyakov <hidden> · 2008-10-28
Re: [PATCH 1/2] udp: introduce struct udp_table and multiple rwlocks · Eric Dumazet <hidden> · 2008-10-28
Re: [PATCH 1/2] udp: introduce struct udp_table and multiple rwlocks · Evgeniy Polyakov <hidden> · 2008-10-28
[PATCH 0/2] udp: Convert the UDP hash lock to RCU · Eric Dumazet <hidden> · 2008-10-28
Re: [PATCH 0/2] udp: Convert the UDP hash lock to RCU · Stephen Hemminger <hidden> · 2008-10-28
Re: [PATCH 0/2] udp: Convert the UDP hash lock to RCU · Eric Dumazet <hidden> · 2008-10-28
Re: [PATCH 3/3] Convert the UDP hash lock to RCU · Corey Minyard <hidden> · 2008-10-07

From: Corey Minyard <hidden>
Date: 2008-10-30 16:31:56

Stephen Hemminger wrote:

On Thu, 30 Oct 2008 16:40:01 +0100
Eric Dumazet [off-list ref] wrote:

quoted

Eric Dumazet a écrit :

quoted

Paul E. McKenney a écrit :

quoted

On Wed, Oct 29, 2008 at 09:00:13PM +0100, Eric Dumazet wrote:

quoted

Hum... Another way of handling all those cases and avoid memory barriers
would be to have different "NULL" pointers.

Each hash chain should have a unique "NULL" pointer (in the case of 
UDP, it
can be the 128 values : [ (void*)0 .. (void *)127 ]

Then, when performing a lookup, a reader should check the "NULL" pointer
it get at the end of its lookup has is the "hash" value of its chain.

If not -> restart the loop, aka "goto begin;" :)

We could avoid memory barriers then.

In the two cases Corey mentioned, this trick could let us avoid 
memory barriers.
(existing one in sk_add_node_rcu(sk, &hslot->head); should be enough)

What do you think ?

Kinky!!!  ;-)

Then the rcu_dereference() would be supplying the needed memory barriers.

Hmmm...  I guess that the only confusion would be if the element got
removed and then added to the same list.  But then if its pointer was
pseudo-NULL, then that would mean that all subsequent elements had been
removed, and all preceding ones added after the scan started.

Which might well be harmless, but I must defer to you on this one at
the moment.

If you need a larger hash table, another approach would be to set the
pointer's low-order bit, allowing the upper bits to be a full-sized
index -- or even a pointer to the list header.  Just make very sure
to clear the pointer when freeing, or an element on the freelist
could end up looking like a legitimate end of list...  Which again
might well be safe, but why inflict this on oneself?

Ok, here is an updated and tested patch.

Thanks everybody

[PATCH] udp: Introduce special NULL pointers for hlist termination

In order to safely detect changes in chains, we would like to have different
'NULL' pointers. Each chain in hash table is terminated by an unique 'NULL'
value, so that the lockless readers can detect their lookups evaded from
their starting chain.

We introduce a new type of hlist implementation, named hlist_nulls, were
we use the least significant bit of the 'ptr' to tell if its a "NULL" value
or a pointer to an object. We expect to use this new hlist variant for TCP
as well.

For UDP/UDP-Lite hash table, we use 128 different "NULL" values,
(UDP_HTABLE_SIZE=128)

Using hlist_nulls saves memory barriers (a read barrier to fetch 'next'
pointers *before* checking key values) we added in commit 
96631ed16c514cf8b28fab991a076985ce378c26
(udp: introduce sk_for_each_rcu_safenext())

This also saves a write memory barrier in udp_lib_get_port(), between
sk->sk_hash update and sk->next update)

Signed-off-by: Eric Dumazet <redacted>
---

IMHO this goes over the edge into tricky hack. Is it really worth it?
Is there a better simpler way?

The only think I've thought of is to do a single smp_rmb() after the 
loop scanning the list and check the sk_hash value again.  That's better 
than the read barrier for every list element, but still not as good as 
this list from a performance point of view.

IMHO, this is a tricky hack, but it if is well abstracted and documented 
I think it's ok.  I'd guess something like this will become more often 
used as we get larger numbers of processors on systems.

It is annoying that it doesn't help the performance for multicast.  
However, I think the current patch will solve the DOS issue for 
multicast, since it switches to a normal spinlock and has a per-list lock.

-corey

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help