Re: [PATCH 06/18] chainlint.pl: validate test scripts in parallel

[PATCH 00/18] make test "linting" more comprehensive · Eric Sunshine via GitGitGadget <hidden> · 2022-09-01
[PATCH 01/18] t: add skeleton chainlint.pl · Eric Sunshine via GitGitGadget <hidden> · 2022-09-01
Re: [PATCH 01/18] t: add skeleton chainlint.pl · Ævar Arnfjörð Bjarmason <hidden> · 2022-09-01
Re: [PATCH 01/18] t: add skeleton chainlint.pl · Eric Sunshine <hidden> · 2022-09-02
[PATCH 02/18] chainlint.pl: add POSIX shell lexical analyzer · Eric Sunshine via GitGitGadget <hidden> · 2022-09-01
Re: [PATCH 02/18] chainlint.pl: add POSIX shell lexical analyzer · Ævar Arnfjörð Bjarmason <hidden> · 2022-09-01
Re: [PATCH 02/18] chainlint.pl: add POSIX shell lexical analyzer · Eric Sunshine <hidden> · 2022-09-03
[PATCH 04/18] chainlint.pl: add parser to validate tests · Eric Sunshine via GitGitGadget <hidden> · 2022-09-01
[PATCH 03/18] chainlint.pl: add POSIX shell parser · Eric Sunshine via GitGitGadget <hidden> · 2022-09-01
[PATCH 06/18] chainlint.pl: validate test scripts in parallel · Eric Sunshine via GitGitGadget <hidden> · 2022-09-01
Re: [PATCH 06/18] chainlint.pl: validate test scripts in parallel · Ævar Arnfjörð Bjarmason <hidden> · 2022-09-01
Re: [PATCH 06/18] chainlint.pl: validate test scripts in parallel · Eric Sunshine <hidden> · 2022-09-03
Re: [PATCH 06/18] chainlint.pl: validate test scripts in parallel · Eric Wong <hidden> · 2022-09-06
Re: [PATCH 06/18] chainlint.pl: validate test scripts in parallel · Eric Sunshine <hidden> · 2022-09-06
Re: [PATCH 06/18] chainlint.pl: validate test scripts in parallel · Jeff King <hidden> · 2022-09-06
Re: [PATCH 06/18] chainlint.pl: validate test scripts in parallel · Eric Sunshine <hidden> · 2022-11-21
Re: [PATCH 06/18] chainlint.pl: validate test scripts in parallel · Ævar Arnfjörð Bjarmason <hidden> · 2022-11-21
Re: [PATCH 06/18] chainlint.pl: validate test scripts in parallel · Eric Sunshine <hidden> · 2022-11-21
Re: [PATCH 06/18] chainlint.pl: validate test scripts in parallel · Ævar Arnfjörð Bjarmason <hidden> · 2022-11-21
Re: [PATCH 06/18] chainlint.pl: validate test scripts in parallel · Eric Sunshine <hidden> · 2022-11-21
Re: [PATCH 06/18] chainlint.pl: validate test scripts in parallel · Jeff King <hidden> · 2022-11-21
Re: [PATCH 06/18] chainlint.pl: validate test scripts in parallel · Eric Sunshine <hidden> · 2022-11-21
Re: [PATCH 06/18] chainlint.pl: validate test scripts in parallel · Eric Sunshine <hidden> · 2022-11-21
Re: [PATCH 06/18] chainlint.pl: validate test scripts in parallel · Jeff King <hidden> · 2022-11-21
Re: [PATCH 06/18] chainlint.pl: validate test scripts in parallel · Eric Sunshine <hidden> · 2022-11-21
Re: [PATCH 06/18] chainlint.pl: validate test scripts in parallel · Jeff King <hidden> · 2022-11-21
Re: [PATCH 06/18] chainlint.pl: validate test scripts in parallel · Ævar Arnfjörð Bjarmason <hidden> · 2022-11-22
[PATCH 05/18] chainlint.pl: add parser to identify test definitions · Eric Sunshine via GitGitGadget <hidden> · 2022-09-01
[PATCH 07/18] chainlint.pl: don't require `return|exit|continue` to end with `&&` · Eric Sunshine via GitGitGadget <hidden> · 2022-09-01
[PATCH 10/18] chainlint.pl: don't flag broken &&-chain if `$?` handled explicitly · Eric Sunshine via GitGitGadget <hidden> · 2022-09-01
[PATCH 12/18] chainlint.pl: complain about loops lacking explicit failure handling · Eric Sunshine via GitGitGadget <hidden> · 2022-09-01
[PATCH 09/18] chainlint.pl: don't require `&` background command to end with `&&` · Eric Sunshine via GitGitGadget <hidden> · 2022-09-01
[PATCH 08/18] t/Makefile: apply chainlint.pl to existing self-tests · Eric Sunshine via GitGitGadget <hidden> · 2022-09-01
[PATCH 11/18] chainlint.pl: don't flag broken &&-chain if failure indicated explicitly · Eric Sunshine via GitGitGadget <hidden> · 2022-09-01
[PATCH 13/18] chainlint.pl: allow `|| echo` to signal failure upstream of a pipe · Eric Sunshine via GitGitGadget <hidden> · 2022-09-01
[PATCH 14/18] t/chainlint: add more chainlint.pl self-tests · Eric Sunshine via GitGitGadget <hidden> · 2022-09-01
[PATCH 15/18] test-lib: retire "lint harder" optimization hack · Eric Sunshine via GitGitGadget <hidden> · 2022-09-01
[PATCH 16/18] test-lib: replace chainlint.sed with chainlint.pl · Eric Sunshine via GitGitGadget <hidden> · 2022-09-01
Re: [PATCH 16/18] test-lib: replace chainlint.sed with chainlint.pl · Elijah Newren <hidden> · 2022-09-03
Re: [PATCH 16/18] test-lib: replace chainlint.sed with chainlint.pl · Eric Sunshine <hidden> · 2022-09-03
[PATCH 18/18] t: retire unused chainlint.sed · Eric Sunshine via GitGitGadget <hidden> · 2022-09-01
Re: several messages · Johannes Schindelin <hidden> · 2022-09-02
Re: several messages · Eric Sunshine <hidden> · 2022-09-02
Re: several messages · Jeff King <hidden> · 2022-09-02
[PATCH 17/18] t/Makefile: teach `make test` and `make prove` to run chainlint.pl · Eric Sunshine via GitGitGadget <hidden> · 2022-09-01
Re: [PATCH 00/18] make test "linting" more comprehensive · Jeff King <hidden> · 2022-09-11
Re: [PATCH 00/18] make test "linting" more comprehensive · Eric Sunshine <hidden> · 2022-09-11
Re: [PATCH 00/18] make test "linting" more comprehensive · Jeff King <hidden> · 2022-09-11
Re: [PATCH 00/18] make test "linting" more comprehensive · Eric Sunshine <hidden> · 2022-09-12
Re: [PATCH 00/18] make test "linting" more comprehensive · Jeff King <hidden> · 2022-09-13

From: Ævar Arnfjörð Bjarmason <hidden>
Date: 2022-11-21 13:32:29

On Sun, Nov 20 2022, Eric Sunshine wrote:

On Tue, Sep 6, 2022 at 7:27 PM Jeff King [off-list ref] wrote:

quoted

I did some timings the other night, and I found something quite curious
with the thread stuff.

I was quite surprised that it made things slower! It's nice that we're
only calling it once per script instead of once per test, but it seems
the startup overhead of the script is really high.

And since in this mode we're only feeding it one script at a time, I
tried reverting the "chainlint.pl: validate test scripts in parallel"
commit. And indeed, now things are much faster:

  Benchmark 1: make
    Time (mean ± σ):     61.544 s ±  3.364 s    [User: 556.486 s, System: 384.001 s]
    Range (min … max):   57.660 s … 63.490 s    3 runs

And you can see the same thing just running chainlint by itself:

  $ time perl chainlint.pl /dev/null
  real  0m0.069s
  user  0m0.042s
  sys   0m0.020s

  $ git revert HEAD^{/validate.test.scripts.in.parallel}
  $ time perl chainlint.pl /dev/null
  real  0m0.014s
  user  0m0.010s
  sys   0m0.004s

I didn't track down the source of the slowness. Maybe it's loading extra
modules, or maybe it's opening /proc/cpuinfo, or maybe it's the thread
setup. But it's a surprising slowdown.

It is surprising, and unfortunate. Ditching "ithreads" would probably
be a good idea. (more on that below)

quoted

Now of course your intent is to do a single repo-wide invocation. And
that is indeed a bit faster. Here it is without the parallel code:

  Benchmark 1: make
    Time (mean ± σ):     61.727 s ±  2.140 s    [User: 507.712 s, System: 377.753 s]
    Range (min … max):   59.259 s … 63.074 s    3 runs

The wall-clock time didn't improve much, but the CPU time did. Restoring
the parallel code does improve the wall-clock time a bit, but at the
cost of some extra CPU:

  Benchmark 1: make
    Time (mean ± σ):     59.029 s ±  2.851 s    [User: 515.690 s, System: 380.369 s]
    Range (min … max):   55.736 s … 60.693 s    3 runs

which makes sense. If I do a with/without of just "make test-chainlint",
the parallelism is buying a few seconds of wall-clock:

  Benchmark 1: make test-chainlint
    Time (mean ± σ):     900.1 ms ± 102.9 ms    [User: 12049.8 ms, System: 79.7 ms]
    Range (min … max):   704.2 ms … 994.4 ms    10 runs

  Benchmark 1: make test-chainlint
    Time (mean ± σ):      3.778 s ±  0.042 s    [User: 3.756 s, System: 0.023 s]
    Range (min … max):    3.706 s …  3.833 s    10 runs

I'm not sure what it all means. For Linux, I think I'd be just as happy
with a single non-parallelized test-chainlint run for each file. But
maybe on Windows the startup overhead is worse? OTOH, the whole test run
is so much worse there. One process per script is not going to be that
much in relative terms either way.

Somehow Windows manages to be unbelievably slow no matter what. I
mentioned elsewhere (after you sent this) that I tested on a five or
six year old 8-core dual-boot machine. Booted to Linux, running a
single chainlint.pl invocation using all 8 cores to check all scripts
in the project took under 1 second walltime. The same machine booted
to Windows using all 8 cores took just under two minutes(!) walltime
for the single Perl invocation to check all scripts in the project.

So, at this point, I have no hope for making linting fast on Windows;
it seems to be a lost cause.

I'd be really interested in seeing e.g. the NYTProf output for that run,
compared with that on *nix (if you could upload the HTML versions of
both somewhere, even better).

Maybe "chainlint.pl" is doing something odd, but this goes against the
usual wisdom about what is and isn't slow in Perl on windows, as I
understand it.

I.e. process star-up etc. is slow there, and I/O's a bit slower, but
once you're started up and e.g. slurping up all of those files & parsing
them you're just running "perl-native" code.

Which shouldn't be much slower at all. A perl compiled with ithreads is
(last I checked) around 10-20% slower, and the Windows version is always
compiled with that (it's needed for "fork" emulation).

But most *nix versions are compiled with that too, and certainly the one
you're using with "threads", so that's not the difference.

So I suspect something odd's going on...

quoted

And if we did cache the results and avoid extra invocations via "make",
then we'd want all the parallelism to move to there anyway.

Maybe that gives you more food for thought about whether perl's "use
threads" is worth having.

I'm not especially happy about the significant overhead of "ithreads";
on my (old) machine, although it does improve perceived time
significantly, it eats up quite a bit of additional user-time. As
such, I would not be unhappy to see "ithreads" go away, especially
since fast linting on Windows seems unattainable (at least with Perl).

Overall, I think Ævar's plan to parallelize linting via "make" is
probably the way to go.

Yeah, but that seems to me to be orthagonal to why it's this slow on
Windows, and if it is that wouldn't help much, except for incremental
re-runs.

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help