Re: [PATCH linux-2.6.13-rc3] SATA: rewritten sil24 driver

From: Tejun Heo <hidden>
Date: 2005-08-03 14:30:41
Also in: lkml

 Greetings, Edward and Jeff.

 Jeff, as my previous patchset against sil24 driver hasn't been
accepted yet, and it seems that most of them can be done better with
info from Edward, please ignore the patchset.  I'll post a new
patchset tomorrow (hopefully).  Sorry about the hassle.

On Wed, Aug 03, 2005 at 12:09:05AM -0400, Jeff Garzik wrote:

Edward Falk wrote:

quoted

Hi Tejun; I'm the guy at Google working on SATA drivers (port 
multipliers right now).

 Great.

quoted

As soon as I can (next week perhaps, I'll start 
looking at the driver you wrote.  From what I can see, it looks quite good.

 Thanks a lot.  I'll try to make my driver saner by then.

quoted

+
+static u8 sil24_check_status(struct ata_port *ap)
+{
+    return ATA_DRDY;
+}
+
+static u8 sil24_check_err(struct ata_port *ap)
+{
+    return 0;
+}


For these two functions, we need to grab the values for these 
registers from the D2H Register FIS.  These should not be constant 
values, they are used in probing.

Sadly I don't know where the values are.  The original driver seems
to assume that taskfile registers are overlayed with PORT_PRB, but
they are not.  Values are bogus there.  Again, in need of hardware
docs here.


The original driver is broken.  Taskfile registers have to be read back 
from the returned FIS block after a command completion.

Correct.

 Ahh.. Thanks a lot.  I thought PRB was used just for command issues.
I'll fix the driver ASAP.

quoted

Hmmmm, perhaps libata should provide an ata_fis_to_tf() function that 
examines a FIS block, confirms that it's a device-to-host type FIS, and 
read the taskfile registers back out.

Such as ata_tf_from_fis() in libata-core.c? :)

quoted

The original rewritten sil24 driver against NCQ helpers had simple
status register emulation using normal/error completion interrupt.  I
don't know why I stripped that while converting the driver over
vanilla libata (sorry).  I'll restore it.  It's not good, but it
correctly indicates error on error.


It's still better than what we have.

 Hmmm.. Now that I know how to access the D2H FIS, I can properly
emulate status and error registers from interrupt handler.  I'll
submit a patch soon.

quoted

+static void sil24_phy_reset(struct ata_port *ap)
+{
+    __sata_phy_reset(ap);
+    /*
+     * No ATAPI yet.  Just unconditionally indicate ATA device.
+     * If ATAPI device is attached, it will fail ATA_CMD_ID_ATA
+     * and libata core will ignore the device.
+     */
+    if (!(ap->flags & ATA_FLAG_PORT_DISABLED))
+        ap->device[0].class = ATA_DEV_ATA;
+}


We need to use the standard probing code to figure this out. 
ata_dev_classify(), etc.


Again, the problem here is that I don't know how to access the
signature values after reset.


Again, you would need to fetch them from the returned FIS structure.

Correct.

 Again, thanks.  ;-)

quoted

+static void sil24_irq_clear(struct ata_port *ap)
+{
+    /* unused */
+}


we need to fill this in.


I think this will work (adapted from sil_interrupt():

static void sil_irq_clear(struct ata_port *ap)
{
       struct sil_port_priv *pp = ap->private_data;
       struct Port_Registers *port_base = pp->pregs;
   unsigned long port_int;

   port_int  = readl((void *)&port_base->IntStatus);
   writel(port_int, &port_base->IntStatus);
}

I'm assuming that this entry point is expected to clear all interrupts, no?

Correct.

 I'll verify with the error register clearing part of the original
driver and submit a patch.

quoted

+    /* Max ~100ms */
+    for (cnt = 0; cnt < 1000; cnt++) {
+        udelay(100);
+        tmp = readl(port + PORT_CTRL_STAT);
+        if (!(tmp & PORT_CS_DEV_RST))
+            break;
+    }


Use mdelay.  We should be able to sleep here?

Either way, we want to avoid long delays like this.


Does mdelay() sleep?  I thought it just called udelay().

mdelay() does not sleep.

quoted

At any rate, I think 100ms is only the worst-case delay.

Is it safe to call msleep() at boot time?

Yes.

quoted

+    /* GPIO off */
+    writel(0, host_base + HOST_FLASH_CMD);
+
+    /* Mask interrupts during initialization */
+    writel(0, host_base + HOST_CTRL);


add a readl() to flush this write out to the PCI bus ("PCI posting")


Sure.  And, out of curiosity, isn't sync unnecessary unless we're
gonna perform some kind of timed waiting following it?  We don't have
any timing requirement after above interrupt masking, do we?



I think we're ok here; the code reads PORT_CTRL_STAT a few lines down; 
that will flush the write.  I don't think there's a race condition 
involved.

No race condition.  Typically there is often a mistaken assumption that

	writel(...)
	udelay(...)

will accomplish the desired effect.  Due to posted writes, the write may 
be posted to the PCI device after some delay, such that, the udelay() 
and the posted write execute concurrently, skewing the desired timing 
effect.

 So... we don't really need flushing here, right?  No delay
requirement here.

quoted

OK, a couple of questions of my own:

Any documentation on NCQ helpers or other related kernel code?

 Not yet.  I'm waiting for Jeff's comment before converting all
drivers and possibly writing up a document.  If Jeff doesn't like it,
I'll have to tear it down and do it some other way, so you better not
invest too much into the helpers yet.  However, the usage is really
simple, just take a look at interrupt and error hanndlers of ahci.c
driver.  Other NCQ enabled drivers shouldn't be very different.

quoted

What's the proper way to implement very long delays.  I want to 
implement staggered disk spinup in my port multiplier code.  Would 
mdelay(5000) be terribly anti-social?  I'm guessing yes, but then, how

Yes :)  msleep(5000) is far less anti-social, but still yucky...

 Heh... but w/ staggered spin-up, I don't think we have many other
choices.  Maybe it would be nice if we can do multi-threaded device
driver initialization with checkpoints to order necessary stuff...
BTW, doesn't the BIOS spin up drives as in SCSI HBAs?

quoted

do I ensure that no process tries to access the disk until I've spun it up?

The probe code is typically serialized, to prevent problems like this. 
Specifically, you'll have to be aware of when libata adds devices to the 
SCSI layer.

quoted

Port multiplier spec says that the GSCR[2] register indicates how many 
ports the port multiplier supports.  On my 5-port device, this register 
reads back as six.  Are they counting the control port, or is this 
device defective?

Good question!  ;-)

 Hmmm... that field is 4-bit wide, so, if the count includes the
control port, the actual maximum number of supported ports would be 14
instead of 15 as specified in the spec.  I guess the device is faulty.
And my spec (rev 1.1) says GSCR[2] 3-0: Number of exposed device
fan-out ports.  That's pretty clear, I think.

quoted

What causes a disk to spin up?  Is it the COMRESET?  Soft reset?  In 
either case, it sounds like I need to spin up a drive just to detect it. 
Not optimal.

It depends.  COMRESET or power management can do it.  This stuff is 
documented in the SATA docs.  This is more on the device end than the 
host end, but in general you must be aware of
* host controller power state(s)
* SATA bus power state(s)
* SATA device power state(s)

	Jeff

 Thanks, Edward & Jeff.

-- 
tejun

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help