Re: [4.4-RT PATCH RFC/RFT] drivers: net: cpsw: mark rx/tx irq as IRQF_NO_THREAD
From: Sebastian Andrzej Siewior <bigeasy@linutronix.de>
Date: 2016-08-12 08:32:06
Also in:
linux-omap, netdev
On 2016-08-11 19:15:40 [+0300], Grygorii Strashko wrote:
Mark CPSW Rx/Tx IRQs as IRQF_NO_THREAD and avoid double scheduling on -RT
where this IRQs are forced threaded:
rx-irq
|- schedule threaded rx-irq handler
...
|- threaded rx-irq handler -> cpsw_rx_interrupt()
|- napi_schedule()
|- __raise_softirq_irqoff()
|- wakeup_proper_softirq()
...
napiThis should not be the default path. The default should be napi running in the context of the threaded rx-irq handler once the handler is done. The wakeup_proper_softirq() part is only done if napi thinks that the callback functions runs for too long. So in *that* case you continue NAPI in the softirq-thread which runs at SCHED_OTHER.
after:
rx-irq
|- cpsw_rx_interrupt()
|- napi_schedule()
|- irq_exit()
|- invoke_softirq()
|- wakeup_softirqd()
...
napiSince you schedule the softirq from an IRQ-off region / without a process context you force the softirq to run in the thread at SCHED_OTHER priority.
And, as result, get benefits from the following improvements (tested on am57xx-evm): 1) "[ 78.348599] NOHZ: local_softirq_pending 80" message will not be seen any more. Now these warnings can be seen once iperf is started. # iperf -c $IPERFHOST -w 128K -d -t 60
Do you also see "sched: RT throttling activated"? Because I don't see otherwise why this should pop up.
2) latency reduction when cyclictest is run in parallel with network load where net_perf.sh is: iperf -c $IPERFHOST -w 8K -d -t 60 iperf -c $IPERFHOST -w 16K -d -t 60 iperf -c $IPERFHOST -w 32K -d -t 60 iperf -c $IPERFHOST -w 64K -d -t 60 iperf -c $IPERFHOST -w 128K -d -t 60 before: T: 0 ( 1326) P:98 I:1000 C: 240000 Min: 8 Act: 13 Avg: 18 Max: 70 T: 1 ( 1327) P:98 I:1500 C: 159981 Min: 9 Act: 15 Avg: 16 Max: 43 after: T: 0 ( 1331) P:98 I:1000 C: 240000 Min: 8 Act: 15 Avg: 14 Max: 51 T: 1 ( 1332) P:98 I:1500 C: 159953 Min: 8 Act: 16 Avg: 15 Max: 33
-d 0 to have I: set to the same value. What does -i 250 say? And without network load we are where we were at "after" values? What happens if s/__raise_softirq_irqoff_ksoft/__raise_softirq_irqoff/ in net/core/dev.c and chrt the priority of you network interrupt handlers to SCHED_OTHER priority?
3) network performance increase win, K Mbits/s before after % 8K 354 350.3 0.0 16K 412 551 33.7 32K 423 659.5 55.9 64K 436 728.3 67.0 128K 537 845 57.4
How close are the after numbers to !RT? Sebastian