Thread (16 messages) 16 messages, 6 authors, 2020-08-28

RE: [PATCH net-next] Add Mellanox BlueField Gigabit Ethernet driver

From: Asmaa Mnebhi <asmaa@nvidia.com>
Date: 2020-08-12 20:38:01

-----Original Message-----
From: Andrew Lunn <andrew@lunn.ch>
Sent: Tuesday, August 11, 2020 4:07 PM
To: Asmaa Mnebhi <asmaa@nvidia.com>
Cc: David Thompson <redacted>;
netdev@vger.kernel.org; davem@davemloft.net; kuba@kernel.org; Jiri
Pirko [off-list ref]; Asmaa Mnebhi [off-list ref]
Subject: Re: [PATCH net-next] Add Mellanox BlueField Gigabit Ethernet
driver

On Tue, Aug 11, 2020 at 07:53:35PM +0000, Asmaa Mnebhi wrote:
quoted
Hi Andrew,

Thanks again for your feedback.
quoted
quoted
+	/* Finally check if this interrupt is from PHY device.
+	 * Return if it is not.
+	 */
+	val = readl(priv->gpio_io +
+			MLXBF_GIGE_GPIO_CAUSE_OR_CAUSE_EVTEN0);
+	if (!(val & priv->phy_int_gpio_mask))
+		return IRQ_NONE;
+
+	/* Clear interrupt when done, otherwise, no further interrupt
+	 * will be triggered.
+	 * Writing 0x1 to the clear cause register also clears the
+	 * following registers:
+	 * cause_gpio_arm_coalesce0
+	 * cause_rsh_coalesce0
+	 */
+	val = readl(priv->gpio_io +
+			MLXBF_GIGE_GPIO_CAUSE_OR_CLRCAUSE);
+	val |= priv->phy_int_gpio_mask;
+	writel(val, priv->gpio_io +
+			MLXBF_GIGE_GPIO_CAUSE_OR_CLRCAUSE);
Shoudn't there be a call into the PHY driver at this point?
quoted
+
+	return IRQ_HANDLED;
+}
So these last three functions seem to be an interrupt controller?
So why not model it as a Linux interrupt controller?
Apologies for the confusion. The plan is to remove support to the polling
and instead support the HW interrupt as follows (from the probe):
quoted
irq = platform_get_irq(pdev, MLXBF_GIGE_PHY_INT_N);
         if (irq < 0) {
                 dev_err(dev, "Failed to retrieve irq 0x%x\n", irq);
                 return -ENODEV;
         }
         priv->mdiobus->irq[phy_addr] = irq;
O.K, that is one way to do it. The other is via the MAC driver calling
phy_mac_interrupt().
quoted
I guess my question is should we model it as a linux interrupt
controller rather than use phy_connect_direct ?
It seems like there are other interrupt sources, not just the PHY. Do you plan
to use any of them? It can be easier to debug issues if you have an interrupt
controller, can see counters in /proc/interrupts, etc. Also, if you need to
export the lines to some other driver, e.g. SFP, it is easier to do when there is
an interrupt controller.
quoted
Using phy_connect_direct to register my interrupt handler, I have
encountered a particular issue where the PHY interrupt is triggered
before the phy link status bit (reg 0x1 of the PHY device) is set to
1 (indicating link is up).
So the hardware is broken :-(

What about the other way, link down? Same problem?

Polling is probably your best bet, since it is robust against broken interrupts.
If i remember correctly, this is an off the shelf 1G PHY?
Microchip? Is there an errata for this? Maybe the errata suggests a work
around?
So let me explain further and would greatly appreciate your input.
Technically, when this driver gets loaded, we shouldn't need the interrupt when bringing up the link for the first time, do we?
Correct me if I am wrong, "phy_start" should bring up the link. Phy_start calls phy_start_aneg, which eventually calls phy_check_link_status.
phy_check_link_status , reads the link state bit of the BMSR register (only twice),  and based on that determines whether to bring up/down the link. In our case, that bit is still 0 when the read is donw. A little bit later, it gets set to 1.

This is why polling works in this case. Phy_start fails to bring up the link but the polling eventually bring it up. If we choose to use the interrupt, we should make sure that the 
Interrupt is enabled a little bit after phy_start, otherwise, it would just be wasted.

Best,
Asmaa
     Andrew
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help