Re: [PATCH 2/2] udp: RCU handling for Unicast packets.


From: Eric Dumazet <hidden>
Date: 2008-10-29 17:47:03

Corey Minyard wrote:

Paul E. McKenney wrote:

On Wed, Oct 29, 2008 at 05:09:53PM +0100, Eric Dumazet wrote:

Corey Minyard wrote:

Eric Dumazet wrote:

Corey Minyard found a race added in commit 
271b72c7fa82c2c7a795bc16896149933110672d
(udp: RCU handling for Unicast packets.)

"If the socket is moved from one list to another list in-between
the time the hash is calculated and the next field is accessed,
and the socket has moved to the end of the new list, the traversal
will not complete properly on the list it should have, since the
socket will be on the end of the new list and there's not a way to
tell it's on a new list and restart the list traversal. I think
that this can be solved by pre-fetching the "next" field (with
proper barriers) before checking the hash."

This patch corrects this problem, introducing a new 
sk_for_each_rcu_safenext()
macro.
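
To make the idea concrete, here is a minimal user-space sketch of a single scan pass that fetches "next" before checking the hash. It is not the macro from the patch: struct item, scan_once() and the SCAN_* values are hypothetical names used only for illustration, and the memory barriers a real RCU reader needs are left out.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical, simplified stand-in for a socket on a hash chain. */
struct item {
	struct item *next;
	unsigned int hash;	/* chain the item currently belongs to */
	int port;
};

enum scan_result { SCAN_FOUND, SCAN_MISS, SCAN_RESTART };

/*
 * One pass over the chain starting at head, which should only hold
 * items whose hash equals `hash`.  The next pointer is read before the
 * hash is checked, so we never follow a pointer taken from an item
 * that is already seen to sit on another chain.
 */
static enum scan_result scan_once(struct item *head, unsigned int hash,
				  int port, struct item **out)
{
	struct item *it, *next;

	assert(out != NULL);
	for (it = head; it != NULL; it = next) {
		next = it->next;		/* fetch next first */
		if (it->hash != hash)
			return SCAN_RESTART;	/* item moved: caller rescans */
		if (it->port == port) {
			*out = it;
			return SCAN_FOUND;
		}
	}
	return SCAN_MISS;
}
```

A caller would loop on scan_once() for as long as it returns SCAN_RESTART, which corresponds to the "goto begin" discussed in this thread.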

You also need the appropriate smp_wmb() in udp_lib_get_port() after 
sk_hash is set, I think, so the next field is guaranteed to be 
changed after the hash value is changed.

Not sure about this one Corey.

If a reader catches the previous value of item->sk_hash, two cases are to 
be taken into account:

1) its udp_hashfn(net, sk->sk_hash) is != hash   -> goto begin : 
Reader will redo its scan

2) its udp_hashfn(net, sk->sk_hash) is == hash
 -> next pointer is good enough : it points to next item in same hash 
chain.
    No need to rescan the chain at this point.
    Yes we could miss the fact that a new port was bound and this UDP 
message could be lost.

3) its udp_hashfn(net, sk->sk_hash) is == hash, but only because it was
removed, freed, reallocated, and then readded with the same hash value,
possibly carrying the reader to a new position in the same list.

If I understand this, without the smp_wmb(), it is possible that the 
next field can be written to main memory before the hash value is 
written.  If that happens, the following can occur:

 CPU1                    CPU2
 next is set to NULL (end of new list)

Well, if this item is injected into the same chain, next won't be set to NULL.

That would mean previous writers deleted all items from the chain.

In this case, readers can see NULL, it is not a problem at all.
List is/was empty.
An application cannot complain a packet is not
handled if its bind() syscall is not yet completed :)

If item is injected on another chain, we will detect hash mismatch and redo full scan.

                         fetch next
                         calculate hash and compare to sk_hash
 sk_hash is set to new value

So I think in the above cases, your case #2 is not necessarily valid 
without the barrier.
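
The ordering being argued about can be sketched in user-space C11 as follows. This is only an analogy under stated assumptions: the release and acquire fences stand in for the kernel's smp_wmb() and smp_rmb(), and struct node, move_node() and read_next_checked() are hypothetical names, not code from the patch.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stddef.h>

/* Hypothetical node; both fields are read by lockless readers. */
struct node {
	_Atomic(struct node *) next;
	atomic_uint hash;
};

/* Writer: publish the new hash before the new next pointer. */
static void move_node(struct node *n, unsigned int new_hash,
		      struct node *new_next)
{
	atomic_store_explicit(&n->hash, new_hash, memory_order_relaxed);
	/* Plays the role of smp_wmb(): hash is visible before next. */
	atomic_thread_fence(memory_order_release);
	atomic_store_explicit(&n->next, new_next, memory_order_relaxed);
}

/* Reader: fetch next first, then check the hash. */
static struct node *read_next_checked(struct node *n,
				      unsigned int expected_hash, int *moved)
{
	struct node *next;

	assert(moved != NULL);
	next = atomic_load_explicit(&n->next, memory_order_relaxed);
	/* Plays the role of smp_rmb(): hash is read after next. */
	atomic_thread_fence(memory_order_acquire);
	*moved = atomic_load_explicit(&n->hash, memory_order_relaxed)
		 != expected_hash;
	return next;
}
```

With this pairing, a reader that observes the new next pointer also observes the new hash, so it reports the move instead of trusting a pointer into the wrong chain. A reader that still sees the old next pointer may see either hash value, and both outcomes are harmless.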

And another possible issue.  If sk_hash is written before next, and CPU1 
is interrupted before CPU2, CPU2 will continually spin on the list until 
CPU1 comes back and moves it to the new list.  Not sure if that is an 
issue.

Probably not. Previously, readers were spinning on read_lock() while 
a writer was inside its critical section (write_lock()/write_unlock()).
So instead of spinning inside read_lock(), issuing stupid memory 
transactions, the readers can now spin reading the hash chain and populate
the cpu cache :)
