[PATCH V2 1/5] arm: mvebu: Added support for coherency fabric in mach-mvebu
From: Gregory CLEMENT <hidden>
Date: 2012-11-15 15:54:39
Also in:
linux-devicetree
On 11/15/2012 11:17 AM, Will Deacon wrote:
On Wed, Nov 14, 2012 at 08:00:32PM +0000, Gregory CLEMENT wrote:quoted
On 11/13/2012 11:43 AM, Will Deacon wrote:quoted
On Mon, Nov 12, 2012 at 08:21:07PM +0000, Gregory CLEMENT wrote:quoted
On 11/05/2012 03:02 PM, Will Deacon wrote:quoted
These writels may expand to code containing calls to outer_sync(), which will attempt to take a spinlock for the aurora l2. Given that the CPU isn't coherent, how does this play out with the exclusive store instruction in the lock?I dug a little this subject: and I am not sure there is problem. In SMP mode, only the system cache mode of Aurora is used. In this mode, outer_cache.sync is void then outer_sync() won't call any function, so there will be no access to any spinlock.Hmm, that is pretty subtle and it doesn't really solve the bigger picture. printk takes logbuf_lock, for example, and I'm sure that by the time you get to this code you will have relied on exclusives behaving correctly.Hi Will, I get an answer from Marvell engineers: "STREX on non-shareable and/or non-cacheable memory regions is supported."Interesting, thanks for asking them about this. Does this mean that:
Here come the answers to your new questions
1. When not running coherently (i.e. before initialising the coherency fabric), memory is treated as non-shareable, non-cacheable?
It can be cacheable. The shared memory (as defined on the page table) will NOT be coherent by HW.
2. If (1), then are exclusive accesses the only way to achieve coherent memory accesses in this scenario?
I quote: "I suspect there is terminology miss-use: exclusive accesses are NOT used to achieve memory coherency - they are used to achieve atomicity. To achieve memory coherency while fabric is configured to be non-coherent, SW should use maintenance operations over the L1 caches.suspect there is terminology miss-use: exclusive accesses are NOT used to achieve memory coherency - "they are used to achieve atomicity. To achieve memory coherency while fabric is configured to be non-coherent, SW should use maintenance operations over the L1 caches.
If so, you still have a problem with write locks, where the unlock code does
a regular str to clear the status. atomic_{read,set} also uses regular
memory accesses, so I think you'll get some surprises there when you add
explicit memory barriers and expect things to be visible between threads.
Do memory barriers have different semantics depending on the state of your
coherency fabric?No -- Gregory Clement, Free Electrons Kernel, drivers, real-time and embedded Linux development, consulting, training and support. http://free-electrons.com