Thread (26 messages), 5 authors, 2025-01-17

Re: [PATCH net v2 1/5] vsock/virtio: discard packets if the transport changes

From: Stefano Garzarella <sgarzare@redhat.com>
Date: 2025-01-13 11:05:18
Also in: bpf, kvm, lkml, stable, virtualization

On Mon, Jan 13, 2025 at 11:12:52AM +0100, Michal Luczaj wrote:
On 1/13/25 10:07, Stefano Garzarella wrote:
On Mon, 13 Jan 2025 at 09:57, Stefano Garzarella [off-list ref] wrote:
On Sun, Jan 12, 2025 at 11:42:30PM +0100, Michal Luczaj wrote:
[...]
So, if I get this right:
1. vsock_create() (refcnt=1) calls vsock_insert_unbound() (refcnt=2)
2. transport->release() calls vsock_remove_bound() without checking if sk
  was bound and moved to bound list (refcnt=1)
3. vsock_bind() assumes sk is in unbound list and before
  __vsock_insert_bound(vsock_bound_sockets()) calls
  __vsock_remove_bound() which does:
     list_del_init(&vsk->bound_table); // nop
     sock_put(&vsk->sk);               // refcnt=0
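
The three steps above can be replayed in a small userspace model. This is only a sketch: every model_* name is hypothetical and none of it is kernel code. It assumes that one list node serves both the bound and unbound lists, that the "is it in a list" check is a list-emptiness test on that node, and that each list membership holds one reference.

```c
/* Userspace model only: the model_* names are hypothetical, not kernel APIs. */
#include <assert.h>
#include <stdbool.h>

struct model_list { struct model_list *next, *prev; };

struct model_sock {
    struct model_list bound_table; /* assumed shared by bound and unbound lists */
    int refcnt;
};

static void model_list_init(struct model_list *n) { n->next = n; n->prev = n; }

static bool model_list_empty(const struct model_list *n) { return n->next == n; }

static void model_list_add(struct model_list *n, struct model_list *head)
{
    n->next = head->next;
    n->prev = head;
    head->next->prev = n;
    head->next = n;
}

/* Unlink and re-initialise the node; a no-op when the node is already unlinked. */
static void model_list_del_init(struct model_list *n)
{
    n->prev->next = n->next;
    n->next->prev = n->prev;
    model_list_init(n);
}

/* Step 1: creation holds one reference, the unbound list holds a second one. */
static void model_create(struct model_sock *sk, struct model_list *unbound)
{
    model_list_init(&sk->bound_table);
    sk->refcnt = 1;
    model_list_add(&sk->bound_table, unbound);
    sk->refcnt++;
}

/* Unconditional removal: drops a reference even if the node is unlinked. */
static void model_remove_unchecked(struct model_sock *sk)
{
    model_list_del_init(&sk->bound_table);
    sk->refcnt--;
}

/* Step 2: release removes the socket if it is linked in any list. */
static void model_release(struct model_sock *sk)
{
    if (!model_list_empty(&sk->bound_table))
        model_remove_unchecked(sk);
}

/* Replays steps 1-3 and returns the final reference count. */
static int model_run_sequence(void)
{
    struct model_list unbound;
    struct model_sock sk;

    model_list_init(&unbound);
    model_create(&sk, &unbound);   /* refcnt is 2 */
    model_release(&sk);            /* refcnt is 1, socket is in no list */
    assert(model_list_empty(&sk.bound_table));
    model_remove_unchecked(&sk);   /* bind path: list op is a no-op, refcnt is 0 */
    return sk.refcnt;
}
```

In this model the count reaches zero while the bind path is still about to insert the socket into the bound list, which is the imbalance described in step 3.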

The following fixes things for me. I'm just not certain that's the only
place where transport destruction may lead to an unbound socket being
removed from the unbound list.
diff --git a/net/vmw_vsock/virtio_transport_common.c b/net/vmw_vsock/virtio_transport_common.c
index 7f7de6d88096..0fe807c8c052 100644
--- a/net/vmw_vsock/virtio_transport_common.c
+++ b/net/vmw_vsock/virtio_transport_common.c
@@ -1303,7 +1303,8 @@ void virtio_transport_release(struct vsock_sock *vsk)

      if (remove_sock) {
              sock_set_flag(sk, SOCK_DONE);
-              virtio_transport_remove_sock(vsk);
+              if (vsock_addr_bound(&vsk->local_addr))
+                      virtio_transport_remove_sock(vsk);
I don't get this fix: virtio_transport_remove_sock() calls
   vsock_remove_sock()
     vsock_remove_bound()
       if (__vsock_in_bound_table(vsk))
           __vsock_remove_bound(vsk);


So this should already avoid the issue, no?
I got it wrong. I see now what you are trying to do, but I don't think
we should skip virtio_transport_remove_sock() entirely, since it also
purges the rx_queue.
Isn't rx_queue empty-by-definition in case of !__vsock_in_bound_table(vsk)?
It could be.

But I see some other issues:
- we also need to fix this in the other transports, since they do the same
- we also need to check the delayed cancel work, which calls
   virtio_transport_remove_sock()

An alternative approach, which would perhaps allow us to avoid all this, 
is to re-insert the socket in the unbound list after calling release() 
when we deassign the transport.

WDYT?
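
A rough userspace sketch of that alternative, under the assumption that each list membership holds one reference and that the bind path always removes the socket from the unbound list before inserting it into the bound one. All model_* names are hypothetical, and the real deassign path may need locking and checks that are not shown here.

```c
/* Userspace model only: every model_* name is hypothetical, not a kernel API. */
#include <assert.h>

enum model_list_id { MODEL_NONE, MODEL_UNBOUND, MODEL_BOUND };

struct model_sock {
    enum model_list_id list; /* which socket list currently holds the socket */
    int refcnt;              /* owner's reference plus one per list membership */
};

static void model_insert(struct model_sock *sk, enum model_list_id list)
{
    assert(sk->list == MODEL_NONE);
    sk->list = list;
    sk->refcnt++;
}

/* Checked removal: does nothing if the socket is in no list. */
static void model_remove(struct model_sock *sk)
{
    if (sk->list == MODEL_NONE)
        return;
    sk->list = MODEL_NONE;
    sk->refcnt--;
}

/* Stand-in for a transport's release(): takes the socket out of its list. */
static void model_transport_release(struct model_sock *sk)
{
    model_remove(sk);
}

/* Proposed idea: after release() on deassign, re-insert into the unbound list. */
static void model_deassign_transport(struct model_sock *sk)
{
    model_transport_release(sk);
    model_insert(sk, MODEL_UNBOUND);
}

/* Bind path as described in the thread: assumes the socket is unbound-listed. */
static void model_bind(struct model_sock *sk)
{
    sk->list = MODEL_NONE;   /* unconditional removal ... */
    sk->refcnt--;            /* ... which always drops a reference */
    model_insert(sk, MODEL_BOUND);
}

/* Create, deassign the transport, then bind; returns the final refcount. */
static int model_refcnt_after_rebind(void)
{
    struct model_sock sk = { .list = MODEL_NONE, .refcnt = 1 };

    model_insert(&sk, MODEL_UNBOUND);  /* refcnt is 2 */
    model_deassign_transport(&sk);     /* drops to 1, then back to 2 */
    model_bind(&sk);                   /* drops to 1, then back to 2 */
    return sk.refcnt;
}
```

With the re-insert in place, the bind path's unconditional removal is balanced again in this model, so the individual transports would not each need a bound check.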

Stefano
Can the problem be in vsock_bind() ?
Well, I wouldn't say so.
Is this issue pre-existing or introduced by this series?
I think this is pre-existing, can you confirm?
Yup, I agree, pre-existing.
In that case, I'd not stop this series, and fix it in another patch/series.
Yeah, sure thing.

Thanks,
Michal
  