Re: [PATCH v4 1/2] eventdev: add device stop flush callback

From: Eads, Gage <hidden>
Date: 2018-03-26 21:59:30

-----Original Message-----
From: Van Haaren, Harry
Sent: Friday, March 23, 2018 11:57 AM
To: Eads, Gage <redacted>; dev@dpdk.org
Cc: jerin.jacob@caviumnetworks.com; hemant.agrawal@nxp.com; Richardson,
Bruce [off-list ref]; santosh.shukla@caviumnetworks.com;
nipun.gupta@nxp.com
Subject: RE: [PATCH v4 1/2] eventdev: add device stop flush callback

quoted

From: Eads, Gage
Sent: Tuesday, March 20, 2018 2:13 PM
To: dev@dpdk.org
Cc: jerin.jacob@caviumnetworks.com; Van Haaren, Harry
[off-list ref]; hemant.agrawal@nxp.com; Richardson,
Bruce [off-list ref]; santosh.shukla@caviumnetworks.com;
nipun.gupta@nxp.com
Subject: [PATCH v4 1/2] eventdev: add device stop flush callback

When an event device is stopped, it drains all event queues. These
events may contain pointers, so to prevent memory leaks eventdev now
supports a user-provided flush callback that is called during the queue drain

process.

quoted

This callback is stored in process memory, so the callback must be
registered by any process that may call rte_event_dev_stop().

This commit also clarifies the behavior of rte_event_dev_stop().

This follows this mailing list discussion:
http://dpdk.org/ml/archives/dev/2018-January/087484.html

Signed-off-by: Gage Eads <redacted>

<snip most of the code - looks good!>

quoted

 /**
- * Stop an event device. The device can be restarted with a call to
- * rte_event_dev_start()
+ * Stop an event device.
+ *
+ * This function causes all queued events to be drained. While
+ draining
events
+ * out of the device, this function calls the user-provided flush
+ callback
+ * (if one was registered) once per event.
+ *
+ * This function does not drain events from event ports; the
+ application is
+ * responsible for flushing events from all ports before stopping the
device.


Question about how an application is expected to correctly cleanup all the
events here. Note in particular the last part: "application is responsible for
flushing events from all ports **BEFORE** stopping the device".

Given the event device is still running, how can the application be sure it has
flushed all the events (from the dequeue side in particular)?

Appreciate the feedback -- good points all around.

I was expecting that the application would unlink queues from the ports, and then dequeue until each port has no events. However, there are PMDs for which runtime port link/unlink is not supported, so I see that this is not a viable approach. Plus, this adds the application burden that you describe below.

In order to drain all events from the ports, I was expecting the following:

// stop scheduling new events to worker cores
rte_event_dev_stop()
---> callback gets called for each event

// to dequeue events from each port, and app cleans them up?
FOR_EACH_PORT( rte_event_dev_dequeue(..., port_id, ...) )


I'd like to avoid the dequeue-each-port() approach in application, as it adds extra
burden to clean up correctly...

Agreed, but for a different reason: that approach means we'd have to change the documented eventdev behavior. rte_eventdev.h states that the "schedule, enqueue and dequeue functions should not be invoked when the device is stopped," and this patch reiterates that in the rte_event_dev_stop() documentation ("Threads that continue to enqueue/dequeue while the device is stopped, or being stopped, will result in undefined behavior"). Since a PMD's stop cleanup code could just be repeated calls to a PMD's dequeue code, allowing applications to dequeue simultaneously could be troublesome.

What if we say that dequeue() returns zero after stop() (leaving events possibly
in the port-dequeue side SW buffers), and these events which were about to be
dequeued by the worker core are also passed to the dev_stop_flush callback?

I'd prefer to have dequeue-while-stopped be unsupported, so we don't need an additional check or synchronization in the datapath, but passing the events in a port to the callback should work (for the sw PMD, at least). How does that sound?

Thanks,
Gage

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help