Page MenuHomeSoftware Heritage
Feed All Stories

Oct 1 2021

vlorentz accepted D6383: fix TypeError due to wrong LoadCvsRepository task description.
Oct 1 2021, 11:49 AM
stsp requested review of D6383: fix TypeError due to wrong LoadCvsRepository task description.
Oct 1 2021, 11:49 AM
moranegg added a comment to T3036: Create features/services page on the website connecting to archive/docs.

The features page is online and accessible for admins and ambassadors to get feedback:
https://www.softwareheritage.org/features/

Oct 1 2021, 11:42 AM · Documentation
ardumont abandoned D6377: wip: git: Group objects per type early to drop the packfile reference asap.
Oct 1 2021, 11:21 AM
ardumont added a comment to D6377: wip: git: Group objects per type early to drop the packfile reference asap.

I truly doubt that proceeding like this will optimize the memory consumption of the loader as objects in a pack file are gzip compressed and usually deltified to optimize size.

Your approach will store all git objects uncompressed in a dict so few chances that it will eat less memory than the pack file here, nevertheless it will optimize loading performance.

Oct 1 2021, 11:21 AM
stsp requested review of D6382: make "yarn install" non-fatal in swh-web entrypoint.
Oct 1 2021, 10:28 AM
anlambert added a comment to D6377: wip: git: Group objects per type early to drop the packfile reference asap.

I truly doubt that proceeding like this will optimize the memory consumption of the loader as objects in a pack file are gzip compressed and usually deltified to optimize size.

Oct 1 2021, 10:19 AM
ardumont added a comment to D6381: Add support for Django 3.

I'm assuming it's just to demonstrate we could migrate to django3 right?
Not a "let's migrate to django3"?

Oct 1 2021, 9:21 AM
swh-public-ci added a comment to D6380: Allow partial snapshot creation during ingestion.

Build is green

Oct 1 2021, 9:12 AM
ardumont updated the diff for D6380: Allow partial snapshot creation during ingestion.

Adapt according to suggestion

Oct 1 2021, 8:58 AM
ardumont added a comment to D6380: Allow partial snapshot creation during ingestion.

what about naming the parameter create_snapshot instead?

Oct 1 2021, 8:45 AM

Sep 30 2021

vlorentz committed rDSNIPe866727cdccf: analyze_consistency_failures.py: Make multiprocess + handle more edge cases (authored by vlorentz).
analyze_consistency_failures.py: Make multiprocess + handle more edge cases
Sep 30 2021, 7:39 PM
vlorentz added a comment to D6380: Allow partial snapshot creation during ingestion.

Prior to this
commit, it was implied that the store_data could only be called once. It's a limitation
that needs to change for some ongoing optimizations in the loader git.

Sep 30 2021, 7:31 PM
vlorentz added a comment to D6380: Allow partial snapshot creation during ingestion.

what about naming the parameter create_snapshot instead?

Sep 30 2021, 7:30 PM
ardumont created P1189 draft: ryout runs on loader git improvments.
Sep 30 2021, 6:46 PM
ardumont added a revision to T3625: Reduce git loader memory footprint: D6380: Allow partial snapshot creation during ingestion.
Sep 30 2021, 6:01 PM · Git loader
ardumont updated the summary of D6380: Allow partial snapshot creation during ingestion.
Sep 30 2021, 6:01 PM
ardumont triaged T3625: Reduce git loader memory footprint as High priority.
Sep 30 2021, 6:01 PM · Git loader
vlorentz requested review of D6381: Add support for Django 3.
Sep 30 2021, 5:36 PM
vsellier updated subscribers of T3624: Update swh-graph from 0.3.0 to 0.5.0 on granet.

cc @seirl

Sep 30 2021, 5:10 PM · Compressed graph service, System administration
ardumont requested review of D6380: Allow partial snapshot creation during ingestion.
Sep 30 2021, 5:09 PM
zack updated the title for P1187 import-ics from Command-Line Input to import-ics.
Sep 30 2021, 5:00 PM
zack created P1187 import-ics.
Sep 30 2021, 5:00 PM
ardumont committed rDSNIP9a1e273feefc: Add autoscale configuration (authored by ardumont).
Add autoscale configuration
Sep 30 2021, 4:49 PM
Harbormaster failed to build B24133: rDOBJSc0bac6bc0941: Blacklist azure-storage-blob 12.9.0, to fix mypy errors for rDOBJSc0bac6bc0941: Blacklist azure-storage-blob 12.9.0, to fix mypy errors!
Sep 30 2021, 4:49 PM
vlorentz closed D6379: Blacklist azure-storage-blob 12.5.0, to fix mypy errors.
Sep 30 2021, 4:47 PM
vlorentz committed rDOBJSc0bac6bc0941: Blacklist azure-storage-blob 12.9.0, to fix mypy errors (authored by vlorentz).
Blacklist azure-storage-blob 12.9.0, to fix mypy errors
Sep 30 2021, 4:47 PM
vlorentz requested review of D6379: Blacklist azure-storage-blob 12.5.0, to fix mypy errors.
Sep 30 2021, 4:43 PM
swh-public-ci added a comment to D6339: Add support for remote backend on existing storage tests.

Build is green

Sep 30 2021, 4:27 PM
zack created P1186 Command-Line Input.
Sep 30 2021, 4:25 PM
swh-public-ci added a comment to D6165: Add new RabbitMQ-based client/server API.

Build is green

Sep 30 2021, 4:23 PM
aeviso updated the diff for D6339: Add support for remote backend on existing storage tests.

rebase

Sep 30 2021, 4:21 PM
aeviso updated the diff for D6165: Add new RabbitMQ-based client/server API.

rebase

Sep 30 2021, 4:20 PM
marla.dasilva closed T3162: Services page iconography as Resolved.
Sep 30 2021, 3:58 PM · Unknown Object (Project)
ardumont committed rDLDG940a5e7e6f42: Drop spurious print statement (authored by ardumont).
Drop spurious print statement
Sep 30 2021, 3:44 PM
vlorentz lowered the priority of T3624: Update swh-graph from 0.3.0 to 0.5.0 on granet from High to Normal.
Sep 30 2021, 3:21 PM · Compressed graph service, System administration
vlorentz updated the task description for T3624: Update swh-graph from 0.3.0 to 0.5.0 on granet.
Sep 30 2021, 3:07 PM · Compressed graph service, System administration
vlorentz lowered the priority of T3623: Run swh-graph with gunicorn to support multiple/parallel requests from Normal to Low.

Hmm actually this might be harder than just using gunicorn, because the java subprocess needs to be shared between workers, hmm...

Sep 30 2021, 3:04 PM · Compressed graph service, System administration
vlorentz triaged T3624: Update swh-graph from 0.3.0 to 0.5.0 on granet as High priority.
Sep 30 2021, 3:04 PM · Compressed graph service, System administration
vlorentz updated the task description for T3623: Run swh-graph with gunicorn to support multiple/parallel requests.
Sep 30 2021, 2:32 PM · Compressed graph service, System administration
vlorentz updated the task description for T3623: Run swh-graph with gunicorn to support multiple/parallel requests.
Sep 30 2021, 2:31 PM · Compressed graph service, System administration
vlorentz updated the task description for T3623: Run swh-graph with gunicorn to support multiple/parallel requests.
Sep 30 2021, 2:30 PM · Compressed graph service, System administration
vlorentz updated the task description for T3623: Run swh-graph with gunicorn to support multiple/parallel requests.
Sep 30 2021, 2:30 PM · Compressed graph service, System administration
vsellier updated the task description for T3487: Installation of the new provenance server.
Sep 30 2021, 2:27 PM · System administration
vlorentz triaged T3623: Run swh-graph with gunicorn to support multiple/parallel requests as Normal priority.
Sep 30 2021, 2:27 PM · Compressed graph service, System administration
vsellier closed D6378: provenance: Declare 10 pre-provisioned databases for the different experiments.
Sep 30 2021, 2:23 PM
vsellier committed rSPSITEe42b581fc789: provenance: Declare 10 pre-provisioned databases for the different experiments (authored by vsellier).
provenance: Declare 10 pre-provisioned databases for the different experiments
Sep 30 2021, 2:23 PM
ardumont accepted D6378: provenance: Declare 10 pre-provisioned databases for the different experiments.
Sep 30 2021, 2:22 PM
vsellier updated the task description for T3487: Installation of the new provenance server.
Sep 30 2021, 12:50 PM · System administration
vsellier requested review of D6378: provenance: Declare 10 pre-provisioned databases for the different experiments.
Sep 30 2021, 12:50 PM
vsellier added a revision to T3487: Installation of the new provenance server: D6378: provenance: Declare 10 pre-provisioned databases for the different experiments.
Sep 30 2021, 12:50 PM · System administration
ardumont closed T3613: svn: An error occurred when running svnrdump and no exploitable dump file has been generated. as Resolved.

Deployed.

Sep 30 2021, 12:16 PM · SVN Loader
ardumont closed T3622: svn: Failing to start the ingestion "No such revision 1', 160006)" as Resolved.
Sep 30 2021, 12:16 PM · SVN Loader
ardumont added a comment to T3622: svn: Failing to start the ingestion "No such revision 1', 160006)".

Deployed.

Sep 30 2021, 12:16 PM · SVN Loader
ardumont updated the task description for T3487: Installation of the new provenance server.
Sep 30 2021, 12:15 PM · System administration
vlorentz added a comment to T75: Check integrity of directories, revisions, and releases.
  1. some negative UTCs don't have the boolean set to true (the only example I have is also missing a space in the same line, so this might be why)
  2. "0000" as offset (with no sign), dulwich can't even parse this
  3. "+0575" as offset; parsed as "+0615"
  4. "\r" in an email address was stripped
Sep 30 2021, 12:00 PM · Archive content, Restricted Project
ardumont retitled D6377: wip: git: Group objects per type early to drop the packfile reference asap from git: Group objects per type early to drop the packfile reference asap to wip: git: Group objects per type early to drop the packfile reference asap.
Sep 30 2021, 10:11 AM
ardumont updated the title for P1185 computers say no to D6377 from untitled to computers say no to D6377.
Sep 30 2021, 10:10 AM
ardumont edited P1185 computers say no to D6377.
Sep 30 2021, 10:09 AM
ardumont closed D6376: SvnLoaderFromRemoteDump: Fix failed visit which should be an uneventful visit.
Sep 30 2021, 10:07 AM
ardumont committed rDLDSVN7ee2795d5cc2: SvnLoaderFromRemoteDump: Fix failed visit which should be an uneventful visit (authored by ardumont).
SvnLoaderFromRemoteDump: Fix failed visit which should be an uneventful visit
Sep 30 2021, 10:07 AM
ardumont created P1185 computers say no to D6377.
Sep 30 2021, 10:06 AM
swh-public-ci added a comment to D6376: SvnLoaderFromRemoteDump: Fix failed visit which should be an uneventful visit.

Build is green

Sep 30 2021, 9:56 AM
ardumont updated the diff for D6376: SvnLoaderFromRemoteDump: Fix failed visit which should be an uneventful visit.

Realign commit message and diff description

Sep 30 2021, 9:54 AM
ardumont retitled D6376: SvnLoaderFromRemoteDump: Fix failed visit which should be an uneventful visit from Fix failed visit as uneventful one for the SvnLoaderFromRemoteDump to SvnLoaderFromRemoteDump: Fix failed visit which should be an uneventful visit.
Sep 30 2021, 9:53 AM
ardumont added a comment to D6376: SvnLoaderFromRemoteDump: Fix failed visit which should be an uneventful visit.

Looks ok (not sure I really understand the fix however, more precisely, what was the purpose of the revision_start != 1 condition), but I really don't understand the commit message:

Sep 30 2021, 9:52 AM
ardumont updated the summary of D6376: SvnLoaderFromRemoteDump: Fix failed visit which should be an uneventful visit.
Sep 30 2021, 9:51 AM
douardda accepted D6376: SvnLoaderFromRemoteDump: Fix failed visit which should be an uneventful visit.

Looks ok (not sure I really understand the fix however, more precisely, what was the purpose of the revision_start != 1 condition), but I really don't understand the commit message:

Sep 30 2021, 9:40 AM
swh-public-ci added a comment to D6377: wip: git: Group objects per type early to drop the packfile reference asap.

Build is green

Sep 30 2021, 9:39 AM
ardumont updated the diff for D6377: wip: git: Group objects per type early to drop the packfile reference asap.

Drop no longer needs to rewind the packfile prior to read it

Sep 30 2021, 9:37 AM
ardumont requested review of D6377: wip: git: Group objects per type early to drop the packfile reference asap.
Sep 30 2021, 9:29 AM
swh-public-ci added a comment to D6376: SvnLoaderFromRemoteDump: Fix failed visit which should be an uneventful visit.

Build is green

Sep 30 2021, 8:49 AM
ardumont updated the diff for D6376: SvnLoaderFromRemoteDump: Fix failed visit which should be an uneventful visit.

Rework commit message/diff description

Sep 30 2021, 8:47 AM
ardumont retitled D6376: SvnLoaderFromRemoteDump: Fix failed visit which should be an uneventful visit from svn: Fix missed uneventful visits on some edge case to Fix failed visit as uneventful one for the SvnLoaderFromRemoteDump.
Sep 30 2021, 8:46 AM
ardumont updated the summary of D6376: SvnLoaderFromRemoteDump: Fix failed visit which should be an uneventful visit.
Sep 30 2021, 8:45 AM

Sep 29 2021

vlorentz committed rDSNIP9f7568f0c61d: vlorentz/analyze_consistency_failures.py: Initial commit (authored by vlorentz).
vlorentz/analyze_consistency_failures.py: Initial commit
Sep 29 2021, 8:35 PM
vlorentz committed rDSNIP4c59e23a7afd: check_consistency.py: Fix deprecation warnings on swh-model >= 3.0.0 (authored by vlorentz).
check_consistency.py: Fix deprecation warnings on swh-model >= 3.0.0
Sep 29 2021, 8:35 PM
vlorentz committed rDSNIP80a5ca1ffe86: check_consistency.py: Fix Kafka message corruption issues (authored by vlorentz).
check_consistency.py: Fix Kafka message corruption issues
Sep 29 2021, 8:35 PM
swh-public-ci added a comment to D6376: SvnLoaderFromRemoteDump: Fix failed visit which should be an uneventful visit.

Build is green

Sep 29 2021, 8:10 PM
ardumont updated the diff for D6376: SvnLoaderFromRemoteDump: Fix failed visit which should be an uneventful visit.

Assert snapshot is not null

Sep 29 2021, 8:09 PM
swh-public-ci added a comment to D6376: SvnLoaderFromRemoteDump: Fix failed visit which should be an uneventful visit.

Build is green

Sep 29 2021, 8:08 PM
ardumont retitled D6376: SvnLoaderFromRemoteDump: Fix failed visit which should be an uneventful visit from wip: svn: Incremental loading fails sometimes with SvnLoaderFromRemoteDump to svn: Fix missed uneventful visits on some edge case.
Sep 29 2021, 8:06 PM
ardumont updated the diff for D6376: SvnLoaderFromRemoteDump: Fix failed visit which should be an uneventful visit.

Fix

Sep 29 2021, 8:06 PM
ardumont retitled D6376: SvnLoaderFromRemoteDump: Fix failed visit which should be an uneventful visit from svn: Incremental loading fails sometimes with SvnLoaderFromRemoteDump to wip: svn: Incremental loading fails sometimes with SvnLoaderFromRemoteDump.
Sep 29 2021, 7:42 PM
ardumont requested review of D6376: SvnLoaderFromRemoteDump: Fix failed visit which should be an uneventful visit.
Sep 29 2021, 7:40 PM
ardumont closed T3424: Opam support as Resolved.

And it's now deployed.
Closing.

Sep 29 2021, 7:39 PM · Archive coverage, Opam
ardumont added a revision to T3622: svn: Failing to start the ingestion "No such revision 1', 160006)": D6376: SvnLoaderFromRemoteDump: Fix failed visit which should be an uneventful visit.
Sep 29 2021, 7:38 PM · SVN Loader
ardumont closed D6373: svn: Capture svnrdump failure as not_found instead of failed.
Sep 29 2021, 7:19 PM
ardumont committed rDLDSVN666f32a01cc3: svn: Capture svnrdump failure as not_found instead of failed (authored by ardumont).
svn: Capture svnrdump failure as not_found instead of failed
Sep 29 2021, 7:19 PM
anlambert accepted D6373: svn: Capture svnrdump failure as not_found instead of failed.

Looks good to me !

Sep 29 2021, 7:16 PM
swh-public-ci added a comment to D6373: svn: Capture svnrdump failure as not_found instead of failed.

Build is green

Sep 29 2021, 7:15 PM
ardumont updated the diff for D6373: svn: Capture svnrdump failure as not_found instead of failed.

Fix typo

Sep 29 2021, 7:13 PM
anlambert requested changes to D6373: svn: Capture svnrdump failure as not_found instead of failed.

Wrong variable used in code (see inline comment), I will accept the diff afterwards.

Sep 29 2021, 7:07 PM
swh-public-ci added a comment to D6373: svn: Capture svnrdump failure as not_found instead of failed.

Build is green

Sep 29 2021, 7:02 PM
ardumont updated the diff for D6373: svn: Capture svnrdump failure as not_found instead of failed.

Adapt to a better implementation

Sep 29 2021, 7:00 PM
anlambert requested changes to D6373: svn: Capture svnrdump failure as not_found instead of failed.

We should capture all svn errors in order to:

  • raise exception based on an error code
  • give more details about the failure by appending error messages to fallback exception
Sep 29 2021, 6:50 PM
swh-public-ci added a comment to D6373: svn: Capture svnrdump failure as not_found instead of failed.

Build is green

Sep 29 2021, 6:44 PM
ardumont updated the diff for D6373: svn: Capture svnrdump failure as not_found instead of failed.

Update test docstring

Sep 29 2021, 6:42 PM
ardumont updated the task description for T3622: svn: Failing to start the ingestion "No such revision 1', 160006)".
Sep 29 2021, 6:41 PM · SVN Loader
vlorentz added a comment to T75: Check integrity of directories, revisions, and releases.
  1. "author xxx<yyy@googlemail.com> <xxx<yyy@googlemail.com>> 1282956323 +0200" parsed as "author xxx<yyy@googlemail.com> <xxx <yy@googlemail.com>> 1282956323 +0200" (yes there is a fullname in the email)
  2. a few commits with this timezone: "--700", parsed as "+0700"
  3. Some weird offsets parsed as "+0000": "+1400"/"+051800"/"+1558601"
  4. looks like old git loaders tried to decode and re-encode name and/or email, which messed it up when not UTF8
Sep 29 2021, 6:38 PM · Archive content, Restricted Project