Re: [PATCH v3 5/8] reftable/record: store "val1" hashes as static arrays

[PATCH 0/7] reftable: fixes and optimizations (pt.2) · Patrick Steinhardt <hidden> · 2023-12-20
[PATCH 1/7] reftable/stack: do not overwrite errors when compacting · Patrick Steinhardt <hidden> · 2023-12-20
[PATCH 2/7] reftable/writer: fix index corruption when writing multiple indices · Patrick Steinhardt <hidden> · 2023-12-20
[PATCH 3/7] reftable/record: constify some parts of the interface · Patrick Steinhardt <hidden> · 2023-12-20
[PATCH 4/7] reftable/record: store "val1" hashes as static arrays · Patrick Steinhardt <hidden> · 2023-12-20
Re: [PATCH 4/7] reftable/record: store "val1" hashes as static arrays · Patrick Steinhardt <hidden> · 2023-12-20
[PATCH 5/7] reftable/record: store "val2" hashes as static arrays · Patrick Steinhardt <hidden> · 2023-12-20
[PATCH 6/7] reftable/merged: really reuse buffers to compute record keys · Patrick Steinhardt <hidden> · 2023-12-20
[PATCH 7/7] reftable/merged: transfer ownership of records when iterating · Patrick Steinhardt <hidden> · 2023-12-20
[PATCH v2 0/8] reftable: fixes and optimizations (pt.2) · Patrick Steinhardt <hidden> · 2023-12-28
[PATCH v2 1/8] reftable/stack: do not overwrite errors when compacting · Patrick Steinhardt <hidden> · 2023-12-28
[PATCH v2 2/8] reftable/stack: do not auto-compact twice in `reftable_stack_add()` · Patrick Steinhardt <hidden> · 2023-12-28
[PATCH v2 3/8] reftable/writer: fix index corruption when writing multiple indices · Patrick Steinhardt <hidden> · 2023-12-28
[PATCH v2 4/8] reftable/record: constify some parts of the interface · Patrick Steinhardt <hidden> · 2023-12-28
[PATCH v2 5/8] reftable/record: store "val1" hashes as static arrays · Patrick Steinhardt <hidden> · 2023-12-28
[PATCH v2 6/8] reftable/record: store "val2" hashes as static arrays · Patrick Steinhardt <hidden> · 2023-12-28
[PATCH v2 7/8] reftable/merged: really reuse buffers to compute record keys · Patrick Steinhardt <hidden> · 2023-12-28
[PATCH v2 8/8] reftable/merged: transfer ownership of records when iterating · Patrick Steinhardt <hidden> · 2023-12-28
[PATCH v3 0/8] reftable: fixes and optimizations (pt.2) · Patrick Steinhardt <hidden> · 2024-01-03
[PATCH v3 1/8] reftable/stack: do not overwrite errors when compacting · Patrick Steinhardt <hidden> · 2024-01-03
Re: [PATCH v3 1/8] reftable/stack: do not overwrite errors when compacting · Han-Wen Nienhuys <hidden> · 2024-02-14
Re: [PATCH v3 1/8] reftable/stack: do not overwrite errors when compacting · Patrick Steinhardt <hidden> · 2024-02-15
[PATCH v3 2/8] reftable/stack: do not auto-compact twice in `reftable_stack_add()` · Patrick Steinhardt <hidden> · 2024-01-03
[PATCH v3 3/8] reftable/writer: fix index corruption when writing multiple indices · Patrick Steinhardt <hidden> · 2024-01-03
[PATCH v3 4/8] reftable/record: constify some parts of the interface · Patrick Steinhardt <hidden> · 2024-01-03
[PATCH v3 5/8] reftable/record: store "val1" hashes as static arrays · Patrick Steinhardt <hidden> · 2024-01-03
Re: [PATCH v3 5/8] reftable/record: store "val1" hashes as static arrays · Karthik Nayak <hidden> · 2024-02-05
Re: [PATCH v3 5/8] reftable/record: store "val1" hashes as static arrays · Patrick Steinhardt <hidden> · 2024-02-06
[PATCH v3 6/8] reftable/record: store "val2" hashes as static arrays · Patrick Steinhardt <hidden> · 2024-01-03
[PATCH v3 7/8] reftable/merged: really reuse buffers to compute record keys · Patrick Steinhardt <hidden> · 2024-01-03
[PATCH v3 8/8] reftable/merged: transfer ownership of records when iterating · Patrick Steinhardt <hidden> · 2024-01-03
Re: [PATCH v3 0/8] reftable: fixes and optimizations (pt.2) · Karthik Nayak <hidden> · 2024-02-05

From: Patrick Steinhardt <hidden>
Date: 2024-02-06 06:03:24

On Mon, Feb 05, 2024 at 03:39:31AM -0800, Karthik Nayak wrote:

Patrick Steinhardt [off-list ref] writes:

quoted

When reading ref records of type "val1", we store its object ID in an
allocated array. This results in an additional allocation for every
single ref record we read, which is rather inefficient especially when
iterating over refs.

Refactor the code to instead use an embedded array of `GIT_MAX_RAWSZ`
bytes. While this means that `struct ref_record` is bigger now, we
typically do not store all refs in an array anyway and instead only
handle a limited number of records at the same point in time.

Using `git show-ref --quiet` in a repository with ~350k refs this leads
to a significant drop in allocations. Before:

    HEAP SUMMARY:
        in use at exit: 21,098 bytes in 192 blocks
      total heap usage: 2,116,683 allocs, 2,116,491 frees, 76,098,060 bytes allocated

After:

    HEAP SUMMARY:
        in use at exit: 21,098 bytes in 192 blocks
      total heap usage: 1,419,031 allocs, 1,418,839 frees, 62,145,036 bytes allocated

Curious, did you also do perf benchmarking on this?

I didn't back then, but here you go. The following test shows a single
ref matching a specific pattern out of 1 million refs:

    Benchmark 1: show-ref: single matching ref (revision = HEAD~)
      Time (mean ± σ):     191.1 ms ±   5.2 ms    [User: 188.1 ms, System: 2.8 ms]
      Range (min … max):   186.2 ms … 214.5 ms    100 runs

    Benchmark 2: show-ref: single matching ref (revision = HEAD)
      Time (mean ± σ):     189.7 ms ±   5.3 ms    [User: 186.7 ms, System: 2.8 ms]
      Range (min … max):   184.1 ms … 213.4 ms    100 runs

    Summary
      show-ref: single matching ref (revision = HEAD) ran
        1.01 ± 0.04 times faster than show-ref: single matching ref (revision = HEAD~)

Not much of a win here, which is probably expected. On glibc the
allocator seems to be really efficient churning out many small blocks of
memory, which is also something I have noticed in other contexts. I do
expect that other platorms might see more significant results.

Patrick

Attachments

signature.asc [application/pgp-signature] 833 bytes

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help