Re: [PATCH net-next V4 1/3] sched/topology: Add NUMA-based CPUs spread API

[PATCH net-next V4 0/3] Introduce and use NUMA distance metrics · Tariq Toukan <tariqt@nvidia.com> · 2022-07-28
[PATCH net-next V4 1/3] sched/topology: Add NUMA-based CPUs spread API · Tariq Toukan <tariqt@nvidia.com> · 2022-07-28
Re: [PATCH net-next V4 1/3] sched/topology: Add NUMA-based CPUs spread API · Tariq Toukan <hidden> · 2022-07-30
Re: [PATCH net-next V4 1/3] sched/topology: Add NUMA-based CPUs spread API · Tariq Toukan <hidden> · 2022-08-02
Re: [PATCH net-next V4 1/3] sched/topology: Add NUMA-based CPUs spread API · Valentin Schneider <vschneid@redhat.com> · 2022-08-02
Re: [PATCH net-next V4 1/3] sched/topology: Add NUMA-based CPUs spread API · Jakub Kicinski <kuba@kernel.org> · 2022-08-02
Re: [PATCH net-next V4 1/3] sched/topology: Add NUMA-based CPUs spread API · Valentin Schneider <vschneid@redhat.com> · 2022-08-04
Re: [PATCH net-next V4 1/3] sched/topology: Add NUMA-based CPUs spread API · Tariq Toukan <hidden> · 2022-08-08
Re: [PATCH net-next V4 1/3] sched/topology: Add NUMA-based CPUs spread API · Valentin Schneider <vschneid@redhat.com> · 2022-08-09
Re: [PATCH net-next V4 1/3] sched/topology: Add NUMA-based CPUs spread API · Tariq Toukan <hidden> · 2022-08-09
Re: [PATCH net-next V4 1/3] sched/topology: Add NUMA-based CPUs spread API · Valentin Schneider <vschneid@redhat.com> · 2022-08-09
Re: [PATCH net-next V4 1/3] sched/topology: Add NUMA-based CPUs spread API · Tariq Toukan <hidden> · 2022-08-09
Re: [PATCH net-next V4 1/3] sched/topology: Add NUMA-based CPUs spread API · Valentin Schneider <vschneid@redhat.com> · 2022-08-09
Re: [PATCH net-next V4 1/3] sched/topology: Add NUMA-based CPUs spread API · Valentin Schneider <vschneid@redhat.com> · 2022-08-10
[PATCH 1/2] sched/topology: Introduce sched_numa_hop_mask() · Valentin Schneider <vschneid@redhat.com> · 2022-08-10
[PATCH 2/2] net/mlx5e: Leverage sched_numa_hop_mask() · Valentin Schneider <vschneid@redhat.com> · 2022-08-10
Re: [PATCH 2/2] net/mlx5e: Leverage sched_numa_hop_mask() · Tariq Toukan <hidden> · 2022-08-10
Re: [PATCH 2/2] net/mlx5e: Leverage sched_numa_hop_mask() · Jakub Kicinski <kuba@kernel.org> · 2022-08-10
Re: [PATCH 2/2] net/mlx5e: Leverage sched_numa_hop_mask() · Valentin Schneider <vschneid@redhat.com> · 2022-08-11
Re: [PATCH 1/2] sched/topology: Introduce sched_numa_hop_mask() · Tariq Toukan <hidden> · 2022-08-10
Re: [PATCH 1/2] sched/topology: Introduce sched_numa_hop_mask() · Tariq Toukan <hidden> · 2022-08-10
Re: [PATCH 1/2] sched/topology: Introduce sched_numa_hop_mask() · Valentin Schneider <vschneid@redhat.com> · 2022-08-11
Re: [PATCH 1/2] sched/topology: Introduce sched_numa_hop_mask() · Tariq Toukan <hidden> · 2022-08-14
Re: [PATCH 1/2] sched/topology: Introduce sched_numa_hop_mask() · Tariq Toukan <hidden> · 2022-08-14
Re: [PATCH 1/2] sched/topology: Introduce sched_numa_hop_mask() · Valentin Schneider <vschneid@redhat.com> · 2022-08-15
[PATCH net-next V4 2/3] net/mlx5e: Improve remote NUMA preferences used for the IRQ affinity hints · Tariq Toukan <tariqt@nvidia.com> · 2022-07-28
[PATCH net-next V4 3/3] enic: Use NUMA distances logic when setting affinity hints · Tariq Toukan <tariqt@nvidia.com> · 2022-07-28

From: Tariq Toukan <hidden>
Date: 2022-08-09 14:04:18
Also in: lkml


On 8/9/2022 3:52 PM, Valentin Schneider wrote:

On 09/08/22 13:18, Tariq Toukan wrote:

quoted

On 8/9/2022 1:02 PM, Valentin Schneider wrote:

quoted

Are there cases where we can't figure this out in advance? From what I grok
out of the two callsites you patched, all vectors will be used unless some
error happens, so compressing the CPUs in a single cpumask seemed
sufficient.

All vectors will be initialized to support the maximum number of traffic
rings. However, the actual number of traffic rings can be controlled and
set to a lower number N_actual < N. In this case, we'll be using only
N_actual instances and we want them to be the first/closest.
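To make the ordering concern concrete, here is a small userspace sketch (not kernel code). It assumes a toy machine where CPUs 4-7 are local to the device and CPUs 0-3 are remote; the helper names `first_n_from_mask()` and `first_n_from_ordered()` are hypothetical and exist only for illustration. Taking the first N_actual CPUs from a flat bitmask follows CPU-id order, while taking them from a closest-first array keeps the closest CPUs.

```c
#include <stdint.h>

/*
 * Hypothetical helper: take the first n CPUs out of a flat bitmask.
 * Iteration is in CPU-id order, so any distance ordering is lost.
 */
static int first_n_from_mask(uint64_t mask, int out[], int n)
{
    int filled = 0;

    for (int cpu = 0; cpu < 64 && filled < n; cpu++) {
        if (mask & ((uint64_t)1 << cpu))
            out[filled++] = cpu;
    }
    return filled;
}

/*
 * Hypothetical helper: take the first n CPUs out of an array that is
 * already sorted closest-first. The prefix is always the closest set.
 */
static int first_n_from_ordered(const int ordered[], int len, int out[], int n)
{
    int filled = 0;

    while (filled < n && filled < len) {
        out[filled] = ordered[filled];
        filled++;
    }
    return filled;
}
```

With all eight CPUs in the mask (`0xFF`) and N_actual = 2, the mask walk picks CPUs 0 and 1 (both remote in this toy layout), whereas the ordered array `{4, 5, 6, 7, 0, 1, 2, 3}` yields CPUs 4 and 5.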

Ok, that makes sense, thank you.

In that case I wonder if we'd want a public-facing iterator for
sched_domains_numa_masks[%i][node], rather than copy a portion of
it. Something like the below (naming and implementation haven't been
thought about too much).

   const struct cpumask *sched_numa_level_mask(int node, int level)
   {
           struct cpumask ***masks = rcu_dereference(sched_domains_numa_masks);

           if (node >= nr_node_ids || level >= sched_domains_numa_levels)
                   return NULL;

           if (!masks)
                   return NULL;

           return masks[level][node];
   }
   EXPORT_SYMBOL_GPL(sched_numa_level_mask);

The above can be kept static, exposing only the foo() function below,
similar to my sched_cpus_set_spread().

LGTM.
How do you suggest we proceed?
Do you want to formalize it, or should I take it from here?

   #define for_each_numa_level_mask(node, lvl, mask)	    \
           for (mask = sched_numa_level_mask(node, lvl); mask;	\
                mask = sched_numa_level_mask(node, ++lvl))

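   void foo(int node, int cpus[], int ncpus)
   {
           const struct cpumask *mask;
           int lvl = 0;
           int i = 0;
           int cpu;

           /* Nothing to fill; also avoids writing cpus[0] when ncpus is 0 */
           if (ncpus <= 0)
                   return;

           rcu_read_lock();
           /* Walk the NUMA levels from closest to furthest */
           for_each_numa_level_mask(node, lvl, mask) {
                   for_each_cpu(cpu, mask) {
                           cpus[i] = cpu;
                           /* Stop as soon as the caller's array is full */
                           if (++i == ncpus)
                                   goto done;
                   }
           }
   done:
           rcu_read_unlock();
   }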
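
For experimenting with the level-by-level walk outside the kernel, the following is a minimal userspace sketch of the same idea as foo(). It is not the kernel API: cpumasks are modeled as `uint64_t` bitmasks, and `toy_cpumask_t` and `toy_spread_cpus()` are hypothetical names. It also assumes that each level's mask is a superset of the closer levels' masks (which is my understanding of how the per-node NUMA level masks are built), so it masks out CPUs already emitted to avoid listing them twice.

```c
#include <stdint.h>

/* Toy stand-in for a cpumask: bit N set means CPU N is in the mask. */
typedef uint64_t toy_cpumask_t;

/*
 * Fill cpus[] with up to ncpus CPU ids for one node, closest level first.
 * levels[0] is the node-local mask; levels[i] is assumed to include every
 * CPU of levels[i - 1] plus the CPUs one distance step further away.
 * Returns the number of entries written.
 */
static int toy_spread_cpus(const toy_cpumask_t levels[], int nr_levels,
                           int cpus[], int ncpus)
{
    toy_cpumask_t seen = 0;
    int filled = 0;

    for (int lvl = 0; lvl < nr_levels && filled < ncpus; lvl++) {
        /* Only CPUs that were not part of a closer level are new here */
        toy_cpumask_t fresh = levels[lvl] & ~seen;

        for (int cpu = 0; cpu < 64 && filled < ncpus; cpu++) {
            if (fresh & ((toy_cpumask_t)1 << cpu))
                cpus[filled++] = cpu;
        }
        seen |= levels[lvl];
    }
    return filled;
}
```

For a node whose local CPUs are 4-5, with CPUs 2-3 one hop away and the rest two hops away, the level masks would be `{0x30, 0x3C, 0xFF}`, and asking for six CPUs yields 4, 5, 2, 3, 0, 1.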
