Thread (13 messages) 13 messages, 5 authors, 2025-06-09

Re: [PATCH net-next] net: xilinx: axienet: Configure and report coalesce parameters in DMAengine flow

From: Sean Anderson <sean.anderson@linux.dev>
Date: 2025-05-30 20:45:02
Also in: linux-arm-kernel, lkml

On 5/30/25 06:18, Gupta, Suraj wrote:
[AMD Official Use Only - AMD Internal Distribution Only]
quoted
-----Original Message-----
From: Sean Anderson <sean.anderson@linux.dev>
Sent: Thursday, May 29, 2025 9:48 PM
To: Gupta, Suraj <redacted>; andrew+netdev@lunn.ch;
davem@davemloft.net; edumazet@google.com; kuba@kernel.org;
pabeni@redhat.com; vkoul@kernel.org; Simek, Michal [off-list ref];
Pandey, Radhey Shyam [off-list ref]; horms@kernel.org
Cc: netdev@vger.kernel.org; linux-arm-kernel@lists.infradead.org; linux-
kernel@vger.kernel.org; git (AMD-Xilinx) [off-list ref]; Katakam, Harini
[off-list ref]
Subject: Re: [PATCH net-next] net: xilinx: axienet: Configure and report coalesce
parameters in DMAengine flow

Caution: This message originated from an External Source. Use proper caution
when opening attachments, clicking links, or responding.


On 5/28/25 08:00, Gupta, Suraj wrote:
quoted
[AMD Official Use Only - AMD Internal Distribution Only]
quoted
-----Original Message-----
From: Sean Anderson <sean.anderson@linux.dev>
Sent: Tuesday, May 27, 2025 9:47 PM
To: Gupta, Suraj <redacted>; andrew+netdev@lunn.ch;
davem@davemloft.net; edumazet@google.com; kuba@kernel.org;
pabeni@redhat.com; vkoul@kernel.org; Simek, Michal
[off-list ref]; Pandey, Radhey Shyam
[off-list ref]; horms@kernel.org
Cc: netdev@vger.kernel.org; linux-arm-kernel@lists.infradead.org;
linux- kernel@vger.kernel.org; git (AMD-Xilinx) [off-list ref];
Katakam, Harini [off-list ref]
Subject: Re: [PATCH net-next] net: xilinx: axienet: Configure and
report coalesce parameters in DMAengine flow

Caution: This message originated from an External Source. Use proper
caution when opening attachments, clicking links, or responding.


On 5/25/25 06:22, Suraj Gupta wrote:
quoted
Add support to configure / report interrupt coalesce count and
delay via ethtool in DMAEngine flow.
Netperf numbers are not good when using non-dmaengine default
values, so tuned coalesce count and delay and defined separate
default values in dmaengine flow.

Netperf numbers and CPU utilisation change in DMAengine flow after
introducing coalescing with default parameters:
coalesce parameters:
   Transfer type        Before(w/o coalescing)  After(with coalescing)
TCP Tx, CPU utilisation%      925, 27                 941, 22
TCP Rx, CPU utilisation%      607, 32                 741, 36
UDP Tx, CPU utilisation%      857, 31                 960, 28
UDP Rx, CPU utilisation%      762, 26                 783, 18

Above numbers are observed with 4x Cortex-a53.
How does this affect latency? I would expect these RX settings to
increase latency around 5-10x. I only use these settings with DIM
since it will disable coalescing during periods of light load for better latency.

(of course the way to fix this in general is RSS or some other method
involving multiple queues).
I took values before NAPI addition in legacy flow (rx_threshold: 24, rx_usec: 50) as
reference. But netperf numbers were low with them, so tried tuning both and
selected the pair which gives good numbers.

Yeah, but the reason is that you are trading latency for throughput.
There is only one queue, so when the interface is saturated you will not get good
latency anyway (since latency-sensitive packets will get head-of-line blocked). But
when activity is sparse you can good latency if there is no coalescing. So I think
coalescing should only be used when there is a lot of traffic. Hence why I only
adjusted the settings once I implemented DIM. I think you should be able to
implement it by calling net_dim from axienet_dma_rx_cb, but it will not be as efficient
without NAPI.
Ok, got it. I'll keep default values used before NAPI in legacy flow (coalesce count: 24, delay: 50) for both Tx and Rx and remove perf comparisons.
Those settings are actually probably even worse for latency. I'd leave
the settings at 0/0 (coalescing disabled) to match the existing
behavior. I think the perf comparisons are helpful, especially for
people who know they are going to be throughput-limited.

My main point is that I think extending the dmaengine API to allow for
DIM will have practical benefits in reduced latency.
quoted
Actually, if you are looking into improving performance, I think lack of NAPI is
probably the biggest limitation with the dmaengine backend.
Yes, I agree. NAPI for DMAEngine implementation is underway and will be sent to mainline soon.
Looking forward to it.
quoted
quoted
quoted
quoted
Signed-off-by: Suraj Gupta <redacted>
---
This patch depend on following AXI DMA dmengine driver changes sent
to dmaengine mailing list as pre-requisit series:
https://lore.kernel.org/all/20250525101617.1168991-1-suraj.gupta2@amd.
com/
---
 drivers/net/ethernet/xilinx/xilinx_axienet.h  |  6 +++
.../net/ethernet/xilinx/xilinx_axienet_main.c | 53
+++++++++++++++++++
 2 files changed, 59 insertions(+)
diff --git a/drivers/net/ethernet/xilinx/xilinx_axienet.h
b/drivers/net/ethernet/xilinx/xilinx_axienet.h
index 5ff742103beb..cdf6cbb6f2fd 100644
--- a/drivers/net/ethernet/xilinx/xilinx_axienet.h
+++ b/drivers/net/ethernet/xilinx/xilinx_axienet.h
@@ -126,6 +126,12 @@
 #define XAXIDMA_DFT_TX_USEC          50
 #define XAXIDMA_DFT_RX_USEC          16

+/* Default TX/RX Threshold and delay timer values for SGDMA mode
+with
DMAEngine */
quoted
+#define XAXIDMAENGINE_DFT_TX_THRESHOLD       16
+#define XAXIDMAENGINE_DFT_TX_USEC    5
+#define XAXIDMAENGINE_DFT_RX_THRESHOLD       24
+#define XAXIDMAENGINE_DFT_RX_USEC    16
+
 #define XAXIDMA_BD_CTRL_TXSOF_MASK   0x08000000 /* First tx packet
*/
quoted
quoted
quoted
 #define XAXIDMA_BD_CTRL_TXEOF_MASK   0x04000000 /* Last tx packet
*/
quoted
quoted
quoted
 #define XAXIDMA_BD_CTRL_ALL_MASK     0x0C000000 /* All control bits */
diff --git a/drivers/net/ethernet/xilinx/xilinx_axienet_main.c
b/drivers/net/ethernet/xilinx/xilinx_axienet_main.c
index 1b7a653c1f4e..f9c7d90d4ecb 100644
--- a/drivers/net/ethernet/xilinx/xilinx_axienet_main.c
+++ b/drivers/net/ethernet/xilinx/xilinx_axienet_main.c
@@ -1505,6 +1505,7 @@ static int axienet_init_dmaengine(struct
net_device *ndev)  {
      struct axienet_local *lp = netdev_priv(ndev);
      struct skbuf_dma_descriptor *skbuf_dma;
+     struct dma_slave_config tx_config, rx_config;
      int i, ret;

      lp->tx_chan = dma_request_chan(lp->dev, "tx_chan0"); @@
-1520,6
+1521,22 @@ static int axienet_init_dmaengine(struct net_device
+*ndev)
              goto err_dma_release_tx;
      }

+     tx_config.coalesce_cnt = XAXIDMAENGINE_DFT_TX_THRESHOLD;
+     tx_config.coalesce_usecs = XAXIDMAENGINE_DFT_TX_USEC;
+     rx_config.coalesce_cnt = XAXIDMAENGINE_DFT_RX_THRESHOLD;
+     rx_config.coalesce_usecs =  XAXIDMAENGINE_DFT_RX_USEC;
I think it would be clearer to just do something like

        struct dma_slave_config tx_config = {
                .coalesce_cnt = 16,
                .coalesce_usecs = 5,
        };

since these are only used once. And this ensures that you initialize the whole
struct.
quoted
quoted
But what tree are you using? I don't see these members on net-next or
dmaengine.
quoted
These changes are proposed in separate series in dmaengine
https://lore.kernel.org/all/20250525101617.1168991-2-suraj.gupta2@amd.com/ (local) and I
described it here below my SOB.

I think you should post those patches with this series to allow them to be reviewed
appropriately.

--Sean
DMAengine series functionality depends on commit
(https://git.kernel.org/pub/scm/linux/kernel/git/vkoul/dmaengine.git/commit/drivers/dma/xilinx?h=next&id=7e01511443c30a55a5ae78d3debd46d4d872517e)
in dmaengine which is currently not there in net-next. So I sent that
to dmaengine only. Please let me know if any way to send as single
series.
It looks like this won't cause any conflicts, so I think you can just
send the whole series with a note in the cover letter like

| This series depends on commit 7e01511443c3 ("dmaengine: xilinx_dma:
| Set dma_device directions") currently in dmaengine/next.

--Sean
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help