Page MenuHomeSoftware Heritage
Feed Advanced Search

Sep 17 2021

marmoute added a comment to T3581: List heptapod instance foss.heptapod.net.

The hg-git type are served as regular Mercurial repository. So they can be listed as Mercurial repository safely

Sep 17 2021, 11:31 AM · Archive coverage, System administration, Origin-GitLab

Jun 3 2021

marmoute added a comment to T3352: Define a ref mapping naming scheme for all Mercurial "pointers" (heads, closed heads, bookmarks, tip).

I think @marmoute's intention was to more closely convey the semantics of Mercurial's branching system. A branch tip or head are not a branches themselves, so it would be "wrong" to put them under branches/. Thus, since the plural of "a branch head" is "branch heads" I don't feel like the second change would be appropriate either.

But then again, you have the final say, and I won't die on this hill. :)

Jun 3 2021, 2:35 PM · Mercurial loader
marmoute added a comment to T3352: Define a ref mapping naming scheme for all Mercurial "pointers" (heads, closed heads, bookmarks, tip).
In T3352#65752, @zack wrote:

My point here is for user looking at the structure to easily distinguish between the different mapping format. Something based on the "visit data" and associated documentation seems quite fragile.

But a version number for the mapping format will be completely meaningless for a user. It's too Software Heritage-specific. The more I think of it the more I'm convinced we should just offer names that are as natural and self-explanatory as possible. Our notion of what is self-explanatory might change in the future (also based on user feedback), but so be it. It is not going to be a new problem in the archive, as @olasd pointed out, so we will not be making it any worse.

In practical terms here I concur that that means dropping the "refs/hg" prefix.

Jun 3 2021, 1:18 PM · Mercurial loader
marmoute added a comment to T3352: Define a ref mapping naming scheme for all Mercurial "pointers" (heads, closed heads, bookmarks, tip).
In T3352#65749, @olasd wrote:
In T3352#65655, @zack wrote:

I'm happy with the HEAD branch being an alias to a name that's more representative of the corresponding mercurial concept (which would be tip, I guess).

Jun 3 2021, 12:42 PM · Mercurial loader

Jun 1 2021

marmoute added a comment to T3352: Define a ref mapping naming scheme for all Mercurial "pointers" (heads, closed heads, bookmarks, tip).
In T3352#65657, @zack wrote:

So, to pivot the question around, what is the minimal (also in the sense that it is shorter / has less cruft) naming scheme that would allow us to represent without ambiguity all the Mercurial naming aspects that you want to capture?

Jun 1 2021, 12:29 AM · Mercurial loader

May 31 2021

marmoute added a comment to T3352: Define a ref mapping naming scheme for all Mercurial "pointers" (heads, closed heads, bookmarks, tip).
In T3352#65655, @zack wrote:

Is the ability to recognize that a snapshot comes from Mercurial an actual goal here? I don't think we care about "clashes" between snapshot created from different VCS, but maybe I'm missing something.

May 31 2021, 9:26 PM · Mercurial loader
marmoute added a comment to T3352: Define a ref mapping naming scheme for all Mercurial "pointers" (heads, closed heads, bookmarks, tip).

@Alphare proposal is missing the branch-name part for branch-tip. I would ajust it as such:

May 31 2021, 5:13 PM · Mercurial loader
marmoute added a comment to T3352: Define a ref mapping naming scheme for all Mercurial "pointers" (heads, closed heads, bookmarks, tip).

I would go: refs/hg/branch-tip/ instead of just tip

May 31 2021, 4:34 PM · Mercurial loader

Dec 8 2020

marmoute added a comment to T2849: Design and implement a mapping from "original VCS ids" to SWHIDs to help incremental loaders.
In T2849#54256, @olasd wrote:
In T2849#54254, @zack wrote:

Thanks, this would be an important new feature. Some comments/random thoughts below.

Further design point, which I think it's already implicit in what you wrote, but that it'd be useful to make explicit:

  • this new mapping table is needed "only" to speed up things, if we lose it, it will just mean we will be slower in doing future archival (at least for a while), but won't be the end of the world

assuming this is true, loaders will need to be designed with graceful degradation for the incompleteness (or entire disappearance) of the mapping table.

Agreed. Missing objects in the mapping table would just mean that the loader does a bit more work to re-create the associated objects, and, hopefully, generate the same SWHIDs again (which would then be used to populate the mapping table).

Dec 8 2020, 3:28 PM · Storage manager

Dec 3 2020

marmoute added a comment to T2849: Design and implement a mapping from "original VCS ids" to SWHIDs to help incremental loaders.
In T2849#53968, @olasd wrote:

scope: mercurial will only need the mapping for revisions;

Mercurial could also use that mapping for every object. However, just keeping that mapping for revision is already providing a very large complexity boost (or reduction I should says) so that is "good enough" for us. I strongly suspect the same will apply to baazar.

Ack. I think having this for all objects in the short term would make it "too large", and having it for "objects with a history where computing the swhid is (deeply) recursive and expensive" (i.e. revisions and releases) is a decent balance.

Dec 3 2020, 1:48 PM · Storage manager
marmoute added a comment to T2849: Design and implement a mapping from "original VCS ids" to SWHIDs to help incremental loaders.

scope: mercurial will only need the mapping for revisions;

Dec 3 2020, 11:57 AM · Storage manager

Nov 26 2020

marmoute accepted D4540: Add tree diffing in HgLoaderFromDisk.

great

Nov 26 2020, 3:48 PM
marmoute requested changes to D4540: Add tree diffing in HgLoaderFromDisk.
Nov 26 2020, 2:44 PM

Nov 25 2020

marmoute accepted D3435: Add mercurial.from_disk.HgLoaderFromDisk.
Nov 25 2020, 3:22 PM
marmoute accepted D3435: Add mercurial.from_disk.HgLoaderFromDisk.

Ther is (maybe) a last small bit to document, but the patch looks good overall.

Nov 25 2020, 3:01 PM
marmoute requested changes to D4540: Add tree diffing in HgLoaderFromDisk.

almost there.

Nov 25 2020, 2:19 PM
marmoute added a comment to D3435: Add mercurial.from_disk.HgLoaderFromDisk.

I only have one last question. The rest seems fine.

Nov 25 2020, 2:17 PM

Nov 24 2020

marmoute added inline comments to D3435: Add mercurial.from_disk.HgLoaderFromDisk.
Nov 24 2020, 2:34 PM
marmoute added inline comments to D3435: Add mercurial.from_disk.HgLoaderFromDisk.
Nov 24 2020, 2:28 PM
marmoute added inline comments to D3435: Add mercurial.from_disk.HgLoaderFromDisk.
Nov 24 2020, 2:27 PM
marmoute added inline comments to D4540: Add tree diffing in HgLoaderFromDisk.
Nov 24 2020, 2:22 PM

Nov 23 2020

marmoute added inline comments to D3435: Add mercurial.from_disk.HgLoaderFromDisk.
Nov 23 2020, 10:09 PM

Nov 20 2020

marmoute requested changes to D4540: Add tree diffing in HgLoaderFromDisk.
Nov 20 2020, 12:14 PM
marmoute added a comment to D4541: Add content lru cache to HgLoaderFromDisk.

I have two small feedback. this looks good otherwise.

Nov 20 2020, 12:08 PM
marmoute requested changes to D3435: Add mercurial.from_disk.HgLoaderFromDisk.
Nov 20 2020, 12:04 PM

Nov 13 2020

marmoute added inline comments to D3435: Add mercurial.from_disk.HgLoaderFromDisk.
Nov 13 2020, 10:31 AM

Nov 9 2020

marmoute accepted D4216: add swh-hg-identify a cli to identify hg objects.

Looks overall good. I made various feedback that could be included in this patch or done as follow up.

Nov 9 2020, 7:30 PM

Oct 21 2020

marmoute requested changes to D4216: add swh-hg-identify a cli to identify hg objects.
Oct 21 2020, 9:55 AM

Oct 12 2020

marmoute requested changes to D3435: Add mercurial.from_disk.HgLoaderFromDisk.
Oct 12 2020, 1:26 PM

Oct 10 2020

marmoute added a comment to D4216: add swh-hg-identify a cli to identify hg objects.

Also: the build seems to fails, can you check why ?

Oct 10 2020, 2:51 PM
marmoute requested changes to D4216: add swh-hg-identify a cli to identify hg objects.

Nothing major found, but a swarm of small feedback.

Oct 10 2020, 2:51 PM
marmoute added a comment to D4193: swh identify: add --exclude.

Except for the small feedback about the exception, this looks fine.

Oct 10 2020, 2:41 PM

Oct 8 2020

marmoute added inline comments to D4193: swh identify: add --exclude.
Oct 8 2020, 11:15 AM