Re: [PATCH 4/9] get_short_oid: sort ambiguous objects by type, then SHA-1


From: Ævar Arnfjörð Bjarmason <hidden>
Date: 2018-05-01 12:36:46

On Tue, May 01 2018, Derrick Stolee wrote:

On 5/1/2018 7:27 AM, Ævar Arnfjörð Bjarmason wrote:

On Tue, May 01 2018, Derrick Stolee wrote:

On 4/30/2018 6:07 PM, Ævar Arnfjörð Bjarmason wrote:

Since we show the commit data in the output that's nicely aligned once
we sort by object type. The decision to show tags before commits is
pretty arbitrary, but it's much less likely that we'll display a tag,
so if there is one it makes sense to show it first.

Here's a non-arbitrary reason: the object types are ordered
topologically (ignoring self-references):

tag -> commit, tree, blob
commit -> tree
tree -> blob

Thanks. I'll add a patch with that comment to v2.

@@ -421,7 +451,12 @@ static int get_short_oid(const char *name, int len, struct object_id *oid,
 			ds.fn = NULL;
 		advise(_("The candidates are:"));
-		for_each_abbrev(ds.hex_pfx, show_ambiguous_object, &ds);
+		for_each_abbrev(ds.hex_pfx, collect_ambiguous, &collect);
+		QSORT(collect.oid, collect.nr, sort_ambiguous);

I was wondering how the old code sorted by SHA even when the ambiguous
objects were loaded from different sources (multiple pack-files, loose
objects). Turns out that for_each_abbrev() does its own sort after
collecting the SHAs and then calls the given function pointer only
once per distinct object. This avoids multiple instances of the same
object, which may appear multiple times across pack-files.

I only ask because now we are doing two sorts. I wonder if it would be
more elegant to provide your sorting algorithm to for_each_abbrev()
and let it call show_ambiguous_object as before.

Another question is if we should use this sort generally for all calls
to for_each_abbrev(). The only other case I see is in
builtin/rev-parse.c.

When preparing v2 I realized how confusing this was, so I'd added this
to the commit message of my WIP re-roll which should explain this:

     A note on the implementation: I started out with something much
     simpler which just replaced oid_array_sort() in sha1-array.c with a
     custom sort function before calling oid_array_for_each_unique(). But
     then dumbly noticed that it doesn't work because the output function
     was tangled up with the code added in fad6b9e590 ("for_each_abbrev:
     drop duplicate objects", 2016-09-26) to ensure we don't display
     duplicate objects.

     That's why we're doing two passes here, first we need to sort the list
     and de-duplicate the objects, then sort them in our custom order, and
     finally output them without re-sorting them. I suppose we could also
     make oid_array_for_each_unique() maintain a hashmap of emitted
     objects, but that would increase its memory profile and wouldn't be
     worth the complexity for this one-off use-case;
     oid_array_for_each_unique() is used in many other places.
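
The two passes described in that commit message can be sketched roughly as follows. This is a toy model, not the code from the patch: `struct toy_obj`, `enum toy_type`, the comparators and `toy_collect_sorted()` are all hypothetical names, and objects are identified by a plain hex string rather than a `struct object_id`. The type order (tag, commit, tree, blob) follows the ordering discussed earlier in this message.

```c
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-ins, not git's real types. */
enum toy_type { TOY_TAG, TOY_COMMIT, TOY_TREE, TOY_BLOB };

struct toy_obj {
	const char *hex;	/* object name as a hex string */
	enum toy_type type;
};

/* Pass 1 order: by object name only, so equal names become adjacent. */
static int cmp_by_hex(const void *a, const void *b)
{
	const struct toy_obj *x = a, *y = b;
	return strcmp(x->hex, y->hex);
}

/* Pass 2 order: by type first, then by object name within a type. */
static int cmp_by_type_then_hex(const void *a, const void *b)
{
	const struct toy_obj *x = a, *y = b;
	if (x->type != y->type)
		return x->type < y->type ? -1 : 1;
	return strcmp(x->hex, y->hex);
}

/*
 * Sort by name, drop adjacent duplicates in place, then re-sort the
 * survivors by type and name. Returns the number of unique objects.
 */
static size_t toy_collect_sorted(struct toy_obj *objs, size_t nr)
{
	size_t i, out = 0;

	if (!nr)
		return 0;
	qsort(objs, nr, sizeof(*objs), cmp_by_hex);
	for (i = 0; i < nr; i++) {
		if (out && !strcmp(objs[out - 1].hex, objs[i].hex))
			continue;	/* same name as the last kept entry */
		objs[out++] = objs[i];
	}
	qsort(objs, out, sizeof(*objs), cmp_by_type_then_hex);
	return out;
}
```

After the call, the first entries of the array hold each object once, tags first and blobs last, and can be printed in order without any further sorting.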

How would sorting in our custom order before de-duplicating fail the
de-duplication? We will still pair identical OIDs as consecutive
elements and oid_array_for_each_unique only cares about consecutive
elements having distinct OIDs, not lex-ordered OIDs.

Because there's no de-duplication without the array first being sorted
in oidcmp() order, which oid_array_for_each_unique() checks for and
re-sorts if !array->sorted. I.e. its de-duplication is just a state
machine where it won't call the callback if the currently processed
element has the same SHA1 as the last one.

Perhaps the noise is because we rely on oid_array_sort() to mark the
array as sorted inside oid_array_for_each_unique(), but that could be
remedied by calling our QSORT() inside for_each_abbrev() and marking
the array as sorted before calling oid_array_for_each_unique().

As noted above this won't work, because the function inherently relies
on the array being sorted to be able to de-duplicate. Doing this will
yield duplicate entries.
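
For readers following along, here is a minimal sketch of the kind of "skip if same as the previous element" state machine described above. It is an illustration under assumptions, not the body of oid_array_for_each_unique(): `toy_emit_unique()` is a hypothetical helper and object names are plain strings. It only shows that this style of de-duplication depends on equal entries being adjacent; it does not say whether any particular custom ordering keeps them adjacent.

```c
#include <stddef.h>
#include <string.h>

/*
 * Copy each name to 'out' unless it equals the name just before it.
 * Returns the number of names emitted. 'out' must have room for 'nr'
 * entries. Hypothetical helper, for illustration only.
 */
static size_t toy_emit_unique(const char **names, size_t nr,
			      const char **out)
{
	size_t i, n = 0;

	for (i = 0; i < nr; i++) {
		if (i && !strcmp(names[i], names[i - 1]))
			continue;	/* duplicate of the previous entry */
		out[n++] = names[i];
	}
	return n;
}
```

With the input "aa", "aa", "bb" the helper emits two names. With "aa", "bb", "aa" it emits all three, because the second "aa" is never compared against the first.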
