Re: [PATCH 1/2] sched/topology: Introduce sched_numa_hop_mask()
From: Valentin Schneider <vschneid@redhat.com>
Date: 2022-08-15 14:21:02
Also in:
lkml
On 14/08/22 11:19, Tariq Toukan wrote:
The API is indeed easy to use, the driver part looks straight forward. I appreciate the tricks you used to make it work! However, the implementation is relatively complicated, not easy to read or understand, and touches several files. I do understand what you did here, but I guess not all respective maintainers will like it. Let's see.
Dumping it all into a single diff also doesn't help :-) I think the changes to get a for_each_cpu_andnot() are straightforward enough, the one eyesore is the macro but I consider it a necessary evil to get an allocation-free interface.
One alternative to consider, that will simplify things up, is switching back to returning an array of cpus, ordered by their distance, up to a provided argument 'npus'. This way, you will iterate over sched_numa_hop_mask() internally, easily maintaining the cpumask diffs between two hops, without the need of making it on-the-fly as part an an exposed for-loop macro.
That requires extra storage however: at the very least the array, and a temp cpumask to remember already-visited CPUs (the alternative being scanning the array every CPU iteration to figure out if it's been added already). I'm going to submit the cpumask / sched changes, hopefully I get to something by the time you're back from PTO.