Re: [PATCH 08/15] cache: compare the entire buffer for struct object_id

[PATCH 00/15] SHA-256 / SHA-1 interop, part 1 · brian m. carlson <hidden> · 2021-04-10
[PATCH 01/15] sha1-file: allow hashing objects literally with any algorithm · brian m. carlson <hidden> · 2021-04-10
Re: [PATCH 01/15] sha1-file: allow hashing objects literally with any algorithm · Denton Liu <hidden> · 2021-04-15
Re: [PATCH 01/15] sha1-file: allow hashing objects literally with any algorithm · brian m. carlson <hidden> · 2021-04-15
Re: [PATCH 01/15] sha1-file: allow hashing objects literally with any algorithm · Ævar Arnfjörð Bjarmason <hidden> · 2021-04-16
[PATCH 04/15] Always use oidread to read into struct object_id · brian m. carlson <hidden> · 2021-04-10
[PATCH 05/15] hash: add a function to finalize object IDs · brian m. carlson <hidden> · 2021-04-10
[PATCH 07/15] builtin/pack-redundant: avoid casting buffers to struct object_id · brian m. carlson <hidden> · 2021-04-10
[PATCH 08/15] cache: compare the entire buffer for struct object_id · brian m. carlson <hidden> · 2021-04-10
Re: [PATCH 08/15] cache: compare the entire buffer for struct object_id · Chris Torek <hidden> · 2021-04-11
Re: [PATCH 08/15] cache: compare the entire buffer for struct object_id · Ævar Arnfjörð Bjarmason <hidden> · 2021-04-11
Re: [PATCH 08/15] cache: compare the entire buffer for struct object_id · brian m. carlson <hidden> · 2021-04-11
[PATCH 10/15] hash: provide per-algorithm null OIDs · brian m. carlson <hidden> · 2021-04-10
Re: [PATCH 10/15] hash: provide per-algorithm null OIDs · Junio C Hamano <hidden> · 2021-04-11
Re: [PATCH 10/15] hash: provide per-algorithm null OIDs · brian m. carlson <hidden> · 2021-04-11
[PATCH 09/15] hash: set and copy algo field in struct object_id · brian m. carlson <hidden> · 2021-04-10
Re: [PATCH 09/15] hash: set and copy algo field in struct object_id · Ævar Arnfjörð Bjarmason <hidden> · 2021-04-11
Re: [PATCH 09/15] hash: set and copy algo field in struct object_id · brian m. carlson <hidden> · 2021-04-11
Re: [PATCH 09/15] hash: set and copy algo field in struct object_id · Ævar Arnfjörð Bjarmason <hidden> · 2021-04-11
Re: [PATCH 09/15] hash: set and copy algo field in struct object_id · brian m. carlson <hidden> · 2021-04-11
[PATCH 0/2] C99: harder dependency on variadic macros · Ævar Arnfjörð Bjarmason <hidden> · 2021-04-12
[PATCH 2/2] C99 support: remove non-HAVE_VARIADIC_MACROS code · Ævar Arnfjörð Bjarmason <hidden> · 2021-04-12
Re: [PATCH 2/2] C99 support: remove non-HAVE_VARIADIC_MACROS code · Jonathan Nieder <hidden> · 2021-05-21
[PATCH 1/2] git-compat-util.h: clarify comment on GCC-specific code · Ævar Arnfjörð Bjarmason <hidden> · 2021-04-12
Re: [PATCH 1/2] git-compat-util.h: clarify comment on GCC-specific code · Jeff King <hidden> · 2021-04-13
Re: [PATCH 1/2] git-compat-util.h: clarify comment on GCC-specific code · Jonathan Nieder <hidden> · 2021-05-21
Re: [PATCH 0/2] C99: harder dependency on variadic macros · Bagas Sanjaya <hidden> · 2021-04-12
Re: [PATCH 0/2] C99: harder dependency on variadic macros · Ævar Arnfjörð Bjarmason <hidden> · 2021-04-12
Re: [PATCH 0/2] C99: harder dependency on variadic macros · brian m. carlson <hidden> · 2021-04-12
[PATCH v2 0/2] C99: remove hardcoded-out !HAVE_VARIADIC_MACROS code · Ævar Arnfjörð Bjarmason <hidden> · 2022-01-28
[PATCH v2 1/2] git-compat-util.h: clarify GCC v.s. C99-specific in comment · Ævar Arnfjörð Bjarmason <hidden> · 2022-01-28
[PATCH v2 2/2] C99: remove hardcoded-out !HAVE_VARIADIC_MACROS code · Ævar Arnfjörð Bjarmason <hidden> · 2022-01-28
[PATCH v3 0/3] C99: remove dead !HAVE_VARIADIC_MACROS code · Ævar Arnfjörð Bjarmason <hidden> · 2022-02-19
[PATCH v3 1/3] git-compat-util.h: clarify GCC v.s. C99-specific in comment · Ævar Arnfjörð Bjarmason <hidden> · 2022-02-19
[PATCH v3 3/3] trace.h: remove never-used TRACE_CONTEXT · Ævar Arnfjörð Bjarmason <hidden> · 2022-02-19
Re: [PATCH v3 3/3] trace.h: remove never-used TRACE_CONTEXT · Junio C Hamano <hidden> · 2022-02-20
Re: [PATCH v3 3/3] trace.h: remove never-used TRACE_CONTEXT · Ævar Arnfjörð Bjarmason <hidden> · 2022-02-20
[PATCH v3 2/3] C99: remove hardcoded-out !HAVE_VARIADIC_MACROS code · Ævar Arnfjörð Bjarmason <hidden> · 2022-02-19
[PATCH v4 0/2] C99: remove dead !HAVE_VARIADIC_MACROS code · Ævar Arnfjörð Bjarmason <hidden> · 2022-02-21
[PATCH v4 1/2] git-compat-util.h: clarify GCC v.s. C99-specific in comment · Ævar Arnfjörð Bjarmason <hidden> · 2022-02-21
[PATCH v4 2/2] C99: remove hardcoded-out !HAVE_VARIADIC_MACROS code · Ævar Arnfjörð Bjarmason <hidden> · 2022-02-21
Re: [PATCH 09/15] hash: set and copy algo field in struct object_id · Junio C Hamano <hidden> · 2021-04-12
Re: [PATCH 09/15] hash: set and copy algo field in struct object_id · Ævar Arnfjörð Bjarmason <hidden> · 2021-04-12
[PATCH 11/15] builtin/show-index: set the algorithm for object IDs · brian m. carlson <hidden> · 2021-04-10
[PATCH 12/15] commit-graph: don't store file hashes as struct object_id · brian m. carlson <hidden> · 2021-04-10
[PATCH 15/15] hex: print objects using the hash algorithm member · brian m. carlson <hidden> · 2021-04-10
[PATCH 02/15] builtin/hash-object: allow literally hashing with a given algorithm · brian m. carlson <hidden> · 2021-04-10
Re: [PATCH 02/15] builtin/hash-object: allow literally hashing with a given algorithm · Ævar Arnfjörð Bjarmason <hidden> · 2021-04-11
Re: [PATCH 02/15] builtin/hash-object: allow literally hashing with a given algorithm · brian m. carlson <hidden> · 2021-04-11
Re: [PATCH 02/15] builtin/hash-object: allow literally hashing with a given algorithm · Ævar Arnfjörð Bjarmason <hidden> · 2021-04-16
Re: [PATCH 02/15] builtin/hash-object: allow literally hashing with a given algorithm · Ævar Arnfjörð Bjarmason <hidden> · 2021-04-16
[PATCH 03/15] cache: add an algo member to struct object_id · brian m. carlson <hidden> · 2021-04-10
Re: [PATCH 03/15] cache: add an algo member to struct object_id · Ævar Arnfjörð Bjarmason <hidden> · 2021-04-11
Re: [PATCH 03/15] cache: add an algo member to struct object_id · brian m. carlson <hidden> · 2021-04-11
Re: [PATCH 03/15] cache: add an algo member to struct object_id · Derrick Stolee <hidden> · 2021-04-13
Re: [PATCH 03/15] cache: add an algo member to struct object_id · brian m. carlson <hidden> · 2021-04-14
Re: [PATCH 03/15] cache: add an algo member to struct object_id · Ævar Arnfjörð Bjarmason <hidden> · 2021-04-15
Re: [PATCH 03/15] cache: add an algo member to struct object_id · brian m. carlson <hidden> · 2021-04-15
[PATCH 06/15] Use the final_oid_fn to finalize hashing of object IDs · brian m. carlson <hidden> · 2021-04-10
[PATCH 13/15] builtin/pack-objects: avoid using struct object_id for pack hash · brian m. carlson <hidden> · 2021-04-10
[PATCH 14/15] hex: default to the_hash_algo on zero algorithm value · brian m. carlson <hidden> · 2021-04-10

From: brian m. carlson <hidden>
Date: 2021-04-11 21:05:57

On 2021-04-11 at 11:36:33, Ævar Arnfjörð Bjarmason wrote:

On Sat, Apr 10 2021, brian m. carlson wrote:

quoted

Currently, when we compare two object IDs, we have to take a branch to
determine what the hash size is supposed to be.  The compiler can
optimize well for a single length, but has trouble when there are two
possible lengths.

This would benefit from some performance/perf numbers. When this code
was first changed like this in 183a638b7da (hashcmp: assert constant
hash size, 2018-08-23) we had:

      Test     v2.18.0             v2.19.0-rc0               HEAD
      ------------------------------------------------------------------------------
      0001.2:  34.24(33.81+0.43)   34.83(34.42+0.40) +1.7%   33.90(33.47+0.42) -1.0%

Then it was later modified in 0dab7129ab1 (cache: make hashcmp and
hasheq work with larger hashes, 2018-11-14).

I can do some perf numbers.

quoted

@@ -205,7 +205,7 @@ static inline int hashcmp(const unsigned char *sha1, const unsigned char *sha2)
 
 static inline int oidcmp(const struct object_id *oid1, const struct object_id *oid2)
 {
-	return hashcmp(oid1->hash, oid2->hash);
+	return memcmp(oid1->hash, oid2->hash, GIT_MAX_RAWSZ);
 }

hashcmp is now:

        if (the_hash_algo->rawsz == GIT_MAX_RAWSZ)
                return memcmp(sha1, sha2, GIT_MAX_RAWSZ);
        return memcmp(sha1, sha2, GIT_SHA1_RAWSZ);

Wouldn't it make more sense to amend it to just be a memcmp
wrapper/macro if we're going to not make this conditional on the hash
algorithm, or are there other callsites where we still want the old way
of doing it?

No, we can't do that.  With oidcmp, we know the buffer is large enough.
However, in some cases, the buffer in hashcmp is not large enough.  For
example, we may be at the end of a SHA-1 tree object and we'd segfault.
I did try that and I quickly found that it was totally broken.
-- 
brian m. carlson (he/him or they/them)
Houston, Texas, US

Attachments

signature.asc [application/pgp-signature] 263 bytes

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help