Page MenuHomeSoftware Heritage
Feed All Stories

Aug 20 2021

anlambert closed D6119: conf/cassandra: Remove no longer existing configuration entries.
Aug 20 2021, 5:07 PM
anlambert committed rDENVba1ee362dc26: conf/cassandra: Remove no longer existing configuration entries (authored by anlambert).
conf/cassandra: Remove no longer existing configuration entries
Aug 20 2021, 5:07 PM
vlorentz added projects to T3494: Implement citation button for directories with codemeta or CFF: Web app, Intrinsic metadata.
Aug 20 2021, 4:39 PM · Intrinsic metadata, Web app
vlorentz accepted D6119: conf/cassandra: Remove no longer existing configuration entries.
Aug 20 2021, 3:32 PM
anlambert requested review of D6119: conf/cassandra: Remove no longer existing configuration entries.
Aug 20 2021, 3:22 PM
swh-public-ci added a comment to D6118: cassandra: Make content_missing query in batches.

Build is green

Aug 20 2021, 2:46 PM
moranegg triaged T3494: Implement citation button for directories with codemeta or CFF as Normal priority.
Aug 20 2021, 2:46 PM · Intrinsic metadata, Web app
Harbormaster failed remote builds in B23096: Diff 22138 for D6118: cassandra: Make content_missing query in batches!
Aug 20 2021, 2:36 PM
swh-public-ci added a comment to D6118: cassandra: Make content_missing query in batches.

Build was aborted

Aug 20 2021, 2:36 PM
vlorentz updated the summary of D6118: cassandra: Make content_missing query in batches.
Aug 20 2021, 2:15 PM
vlorentz updated the diff for D6118: cassandra: Make content_missing query in batches.

mention schema change in commit

Aug 20 2021, 2:15 PM
vlorentz requested review of D6118: cassandra: Make content_missing query in batches.
Aug 20 2021, 2:00 PM
anlambert closed D6116: api/metadata: Fix issues detected with hypothesis.
Aug 20 2021, 1:29 PM
anlambert committed rDWAPPSc7548f93a171: api/metadata: Fix issues detected with hypothesis (authored by anlambert).
api/metadata: Fix issues detected with hypothesis
Aug 20 2021, 1:29 PM
swh-public-ci added a comment to D6116: api/metadata: Fix issues detected with hypothesis.

Build is green

Aug 20 2021, 1:23 PM
anlambert updated the diff for D6116: api/metadata: Fix issues detected with hypothesis.

Rebase

Aug 20 2021, 1:08 PM
Jenkins for Software Heritage <jenkins@jenkins-debian1.internal.softwareheritage.org> committed rDSTO55c3f0c8857c: Updated backport on buster-swh from debian/0.35.1-1_swh1 (unstable-swh) (authored by Jenkins for Software Heritage <jenkins@jenkins-debian1.internal.softwareheritage.org>).
Updated backport on buster-swh from debian/0.35.1-1_swh1 (unstable-swh)
Aug 20 2021, 12:07 PM
Jenkins for Software Heritage <jenkins@jenkins-debian1.internal.softwareheritage.org> committed rDSTO8f1e8cd1539b: Merge tag 'debian/0.35.1-1_swh1' into debian/buster-swh (authored by Jenkins for Software Heritage <jenkins@jenkins-debian1.internal.softwareheritage.org>).
Merge tag 'debian/0.35.1-1_swh1' into debian/buster-swh
Aug 20 2021, 12:07 PM
Jenkins for Software Heritage <jenkins@jenkins-debian1.internal.softwareheritage.org> committed rDSTO05c2b18e60ed: pristine-tar data for swh-storage_0.35.1.orig.tar.gz (authored by Jenkins for Software Heritage <jenkins@jenkins-debian1.internal.softwareheritage.org>).
pristine-tar data for swh-storage_0.35.1.orig.tar.gz
Aug 20 2021, 12:01 PM
Jenkins for Software Heritage <jenkins@jenkins-debian1.internal.softwareheritage.org> committed rDSTOae70564eab2b: Updated debian changelog for version 0.35.1 (authored by Jenkins for Software Heritage <jenkins@jenkins-debian1.internal.softwareheritage.org>).
Updated debian changelog for version 0.35.1
Aug 20 2021, 12:01 PM
Jenkins for Software Heritage <jenkins@jenkins-debian1.internal.softwareheritage.org> committed rDSTOca5ee3d67305: Update upstream source from tag 'debian/upstream/0.35.1' (authored by Jenkins for Software Heritage <jenkins@jenkins-debian1.internal.softwareheritage.org>).
Update upstream source from tag 'debian/upstream/0.35.1'
Aug 20 2021, 12:01 PM
Jenkins for Software Heritage <jenkins@jenkins-debian1.internal.softwareheritage.org> committed rDSTO1c038f0507a4: New upstream version 0.35.1 (authored by Jenkins for Software Heritage <jenkins@jenkins-debian1.internal.softwareheritage.org>).
New upstream version 0.35.1
Aug 20 2021, 12:01 PM
vlorentz accepted D6116: api/metadata: Fix issues detected with hypothesis.

thanks!

Aug 20 2021, 11:39 AM
KShivendu created P1126 search.yml.
Aug 20 2021, 10:59 AM
KShivendu created P1125 swh-{indexer,scheduler,search}-journal-client containers exit (unhealthy).
Aug 20 2021, 10:49 AM

Aug 19 2021

vlorentz added a comment to T3465: Test multidatacenter replication.

Starting with 10 nodes will allow to have some remaining space.

Aug 19 2021, 7:53 PM · System administration, Storage manager
vlorentz added a comment to T3493: [cassandra] Git loader performance are very bad.

Can you try with this patch? P1118

Aug 19 2021, 7:48 PM · System administration, Storage manager
anlambert committed rDWAPPSe18d30e5bc0a: webpack: Upgrade webpack-dev-server to 4.0.0 (authored by anlambert).
webpack: Upgrade webpack-dev-server to 4.0.0
Aug 19 2021, 7:27 PM
vsellier changed the status of T3465: Test multidatacenter replication, a subtask of T3357: Perform some tests of the cassandra storage on Grid5000, from Open to Work in Progress.
Aug 19 2021, 7:19 PM · System administration, Storage manager
vsellier changed the status of T3465: Test multidatacenter replication from Open to Work in Progress.
Aug 19 2021, 7:19 PM · System administration, Storage manager
vsellier added a comment to T3465: Test multidatacenter replication.

The gros cluster at Nancy[1] has a lot of nodes(124) with small reservable SSD of 960Go. This can be a good candidate to create the second cluster. It will also allow to check the performance with data (and commit logs) on SSDs.
According to the main cluster, a minimum of 8 nodes are necessary to handle the volume of data (7.3 To and growing). Starting with 10 nodes will allow to have some remaining space.

Aug 19 2021, 7:11 PM · System administration, Storage manager
vsellier added a comment to T3493: [cassandra] Git loader performance are very bad.

it seems some more precise information can be logged by activating the full query logs without a big performance impact: https://cassandra.apache.org/doc/latest/cassandra/new/fqllogging.html

Aug 19 2021, 6:52 PM · System administration, Storage manager
anlambert planned changes to D6004: misc/coverage: Revamp and improve archive coverage widget.

Next step: write tests for that updated view.

Aug 19 2021, 5:22 PM
anlambert updated the summary of D6004: misc/coverage: Revamp and improve archive coverage widget.
Aug 19 2021, 5:20 PM
anlambert closed D6117: Makefile.local: add dependency between test and ts-build-so targets.
Aug 19 2021, 5:11 PM
anlambert committed rDSEA26f800cde3cf: Makefile.local: add dependency between test and ts-build-so targets (authored by anlambert).
Makefile.local: add dependency between test and ts-build-so targets
Aug 19 2021, 5:11 PM
vlorentz accepted D6117: Makefile.local: add dependency between test and ts-build-so targets.
Aug 19 2021, 5:05 PM
swh-public-ci added a comment to D6117: Makefile.local: add dependency between test and ts-build-so targets.

Build is green

Aug 19 2021, 5:01 PM
anlambert retitled D6117: Makefile.local: add dependency between test and ts-build-so targets from Makefile.local: Fix make test command and add dependency to ts-build-so to Makefile.local: add dependency between test and ts-build-so targets.
Aug 19 2021, 4:58 PM
anlambert updated the diff for D6117: Makefile.local: add dependency between test and ts-build-so targets.

Remove TEST_DIRS modification.

Aug 19 2021, 4:57 PM
anlambert added a comment to D6117: Makefile.local: add dependency between test and ts-build-so targets.

I don't think that issue is specific to swh-search, I have it with other packages from time to time. rm build/ -rf should fix it.

Aug 19 2021, 4:55 PM
vlorentz added a comment to D6117: Makefile.local: add dependency between test and ts-build-so targets.

I don't think that issue is specific to swh-search, I have it with other packages from time to time. rm build/ -rf should fix it.

Aug 19 2021, 4:53 PM
anlambert retitled D6117: Makefile.local: add dependency between test and ts-build-so targets from Makefile.local: Fix make test command and add dependency to ts-build-soFor some reasons, I have the following error when calling `make test`.```python3 -m pytest . to Makefile.local: Fix make test command and add dependency to ts-build-so.
Aug 19 2021, 4:52 PM
vlorentz merged task T3491: Origin visit ids restart from 1 even if there is previous visits into T3492: cassandra: origin_visit_add should increase next_visit_id even when upserting.
Aug 19 2021, 4:50 PM · System administration, Storage manager
vlorentz merged T3491: Origin visit ids restart from 1 even if there is previous visits into T3492: cassandra: origin_visit_add should increase next_visit_id even when upserting.
Aug 19 2021, 4:50 PM · Storage manager
anlambert retitled D6117: Makefile.local: add dependency between test and ts-build-so targets from Makefile.local: Fix make test command and add dependency to ts-build-so For some reasons, I have the following error when calling `make test`. ``` python3 -m pytest . to Makefile.local: Fix make test command and add dependency to ts-build-soFor some reasons, I have the following error when calling `make test`.```python3 -m pytest ..
Aug 19 2021, 4:50 PM
anlambert requested review of D6117: Makefile.local: add dependency between test and ts-build-so targets.
Aug 19 2021, 4:49 PM
vlorentz added a comment to T3491: Origin visit ids restart from 1 even if there is previous visits.

you mean T3492

Aug 19 2021, 4:49 PM · System administration, Storage manager
vsellier added a comment to T3491: Origin visit ids restart from 1 even if there is previous visits.

Should be fixed by T3482

Aug 19 2021, 4:34 PM · System administration, Storage manager
vsellier triaged T3493: [cassandra] Git loader performance are very bad as Normal priority.
Aug 19 2021, 4:32 PM · System administration, Storage manager
vlorentz triaged T3492: cassandra: origin_visit_add should increase next_visit_id even when upserting as Normal priority.
Aug 19 2021, 4:31 PM · Storage manager
vsellier triaged T3491: Origin visit ids restart from 1 even if there is previous visits as Normal priority.
Aug 19 2021, 4:20 PM · System administration, Storage manager
swh-public-ci added a comment to D6114: swh-scanner: retrieve additional information about software artifacts.

Build is green

Aug 19 2021, 4:13 PM
DanSeraf updated the diff for D6114: swh-scanner: retrieve additional information about software artifacts.

extra info description

Aug 19 2021, 4:10 PM
swh-public-ci added a comment to D6004: misc/coverage: Revamp and improve archive coverage widget.

Build is green

Aug 19 2021, 4:09 PM
anlambert added inline comments to D6004: misc/coverage: Revamp and improve archive coverage widget.
Aug 19 2021, 3:58 PM
anlambert updated the diff for D6004: misc/coverage: Revamp and improve archive coverage widget.

Update:

  • Rename section titles according to @zack suggestions
  • Remove vertical padding around counters to gain vertical space
  • Add fallback when scheduler metrics or deposit lists are not available, widget with logos will stil be displayed but without counters info
Aug 19 2021, 3:56 PM
anlambert added a comment to D6004: misc/coverage: Revamp and improve archive coverage widget.

As a minor suggestion I propose the following heading changes:

 listed origins -> regular crawling
 legacy origins -> discontinued hosting
deposited origins -> on demand archival
Aug 19 2021, 3:13 PM
anlambert added inline comments to D6113: vault API: Rename bundle types and use SWHIDs to identify objects.
Aug 19 2021, 3:09 PM
swh-public-ci added a comment to D6116: api/metadata: Fix issues detected with hypothesis.

Build is green

Aug 19 2021, 3:08 PM
anlambert accepted D6112: Rename bundle types and use SWHIDs everywhere instead of raw sha1_git.

Looks good but the SQL schema migration file is missing in sql/upgrades so I cannot accept the diff yet.

Actually, we're going to drop the database + objstorage and recreate it, writing a migration for this looks like too much trouble for a cache.

Aug 19 2021, 3:02 PM
anlambert closed D6115: tests: Ensure they all can be run with multiple hypothesis examples.
Aug 19 2021, 3:00 PM
anlambert committed rDWAPPS87cc9e042dc2: tests: Ensure they all can be run with multiple hypothesis examples (authored by anlambert).
tests: Ensure they all can be run with multiple hypothesis examples
Aug 19 2021, 3:00 PM
anlambert updated the diff for D6116: api/metadata: Fix issues detected with hypothesis.

Update: Provide RawExtrinsicMetadata targetting core SWHIDs as test inputs and reverse related changes.

Aug 19 2021, 2:55 PM
anlambert added a comment to D6116: api/metadata: Fix issues detected with hypothesis.

We don't want to allow extended SWHID in the public API, you should restrict data generated by hypothesis instead

Aug 19 2021, 2:17 PM
anlambert added a comment to D6115: tests: Ensure they all can be run with multiple hypothesis examples.

Does it mean we need to do def test_inner on *every* test that uses hypothesis?

Aug 19 2021, 2:09 PM
vlorentz closed D6110: Replace index-fossology-license-for-range with index-fossology-license-for-partition.
Aug 19 2021, 1:59 PM
vlorentz committed rDSCH28ae1d86aad7: Replace index-fossology-license-for-range with index-fossology-license-for… (authored by vlorentz).
Replace index-fossology-license-for-range with index-fossology-license-for…
Aug 19 2021, 1:59 PM
vlorentz closed D6111: Add support for releases pointing to other releases or contents..
Aug 19 2021, 1:59 PM
vlorentz committed rDVAUd9e712cf7082: Add support for releases pointing to other releases or contents. (authored by vlorentz).
Add support for releases pointing to other releases or contents.
Aug 19 2021, 1:59 PM
vlorentz added inline comments to D6114: swh-scanner: retrieve additional information about software artifacts.
Aug 19 2021, 1:58 PM
Harbormaster failed remote builds in B23082: Diff 22125 for D6113: vault API: Rename bundle types and use SWHIDs to identify objects!
Aug 19 2021, 1:54 PM
swh-public-ci added a comment to D6113: vault API: Rename bundle types and use SWHIDs to identify objects.

Build was aborted

Aug 19 2021, 1:54 PM
vlorentz accepted D6115: tests: Ensure they all can be run with multiple hypothesis examples.

I really can't find a better way to do this :(

Aug 19 2021, 1:53 PM
swh-public-ci added a comment to D6116: api/metadata: Fix issues detected with hypothesis.

Build is green

Aug 19 2021, 1:32 PM
vlorentz added a comment to D6115: tests: Ensure they all can be run with multiple hypothesis examples.

Does it mean we need to do def test_inner on *every* test that uses hypothesis?

Aug 19 2021, 1:25 PM
vlorentz added inline comments to D6081: Fix api_raw_extrinsic_metadata_swhid-related bugs found when using the 'slow' hypothesis profile.
Aug 19 2021, 1:22 PM
vlorentz added a comment to D6116: api/metadata: Fix issues detected with hypothesis.

We don't want to allow extended SWHID in the public API, you should restrict data generated by hypothesis instead

Aug 19 2021, 1:20 PM
swh-public-ci added a comment to D6115: tests: Ensure they all can be run with multiple hypothesis examples.

Build is green

Aug 19 2021, 1:19 PM
vlorentz updated the diff for D6113: vault API: Rename bundle types and use SWHIDs to identify objects.

update JS code

Aug 19 2021, 1:17 PM
swh-public-ci added a comment to D6112: Rename bundle types and use SWHIDs everywhere instead of raw sha1_git.

Build is green

Aug 19 2021, 1:17 PM
vlorentz added a comment to D6112: Rename bundle types and use SWHIDs everywhere instead of raw sha1_git.

Looks good but the SQL schema migration file is missing in sql/upgrades so I cannot accept the diff yet.

Aug 19 2021, 1:16 PM
vlorentz updated the diff for D6112: Rename bundle types and use SWHIDs everywhere instead of raw sha1_git.

fix task arg serialization/deserialization

Aug 19 2021, 1:15 PM
anlambert updated the diff for D6116: api/metadata: Fix issues detected with hypothesis.

Rebase

Aug 19 2021, 1:05 PM
anlambert updated the diff for D6115: tests: Ensure they all can be run with multiple hypothesis examples.

Fix a comment

Aug 19 2021, 1:05 PM
anlambert added inline comments to D6081: Fix api_raw_extrinsic_metadata_swhid-related bugs found when using the 'slow' hypothesis profile.
Aug 19 2021, 1:01 PM
anlambert added a comment to D6116: api/metadata: Fix issues detected with hypothesis.

Oh I missed D6081, I guess we both encounter the same kind of issues but fixes are not exactly the same.

Aug 19 2021, 12:57 PM
anlambert accepted D6110: Replace index-fossology-license-for-range with index-fossology-license-for-partition.
Aug 19 2021, 12:52 PM
anlambert added inline comments to D6115: tests: Ensure they all can be run with multiple hypothesis examples.
Aug 19 2021, 12:51 PM
anlambert requested review of D6116: api/metadata: Fix issues detected with hypothesis.
Aug 19 2021, 12:40 PM
vsellier updated the task description for T3487: Installation of the new provenance server.
Aug 19 2021, 12:29 PM · System administration
anlambert requested changes to D6112: Rename bundle types and use SWHIDs everywhere instead of raw sha1_git.

Looks good but the SQL schema migration file is missing in sql/upgrades so I cannot accept the diff yet.

Aug 19 2021, 12:28 PM
anlambert requested review of D6115: tests: Ensure they all can be run with multiple hypothesis examples.
Aug 19 2021, 12:27 PM
anlambert accepted D6111: Add support for releases pointing to other releases or contents..
Aug 19 2021, 12:18 PM
anlambert added a comment to D6083: hypothesis: Run with more examples by default.

@vlorentz, I have submitted D6115 and D6116 to ensure swh-web tests can be safely executed with multiple hypothesis examples.

Aug 19 2021, 12:15 PM
anlambert added a revision to T1695: Make hypothesis strategies for swh-web stateless: D6115: tests: Ensure they all can be run with multiple hypothesis examples.
Aug 19 2021, 12:12 PM · Web app
jayeshv added a comment to T3487: Installation of the new provenance server.

@vsellier I am not sure about this.
The idea is to use this machine as the production server. (I guess this will host either postgres or mongodb after we decide on a preferred backend. But that is going to take some time)
@olasd or @aeviso will know better.

Aug 19 2021, 11:49 AM · System administration
landingbubble updated landingbubble.
Aug 19 2021, 10:18 AM
zack updated the task description for T3490: Collect metadata from ClearlyDefined.
Aug 19 2021, 10:13 AM · Extrinsic metadata
vsellier added a comment to T3485: extid topic is misconfigured in staging and production.

In ~40h, the backfill is done at ~5% for staging and less than 1% for the production

Aug 19 2021, 10:08 AM · System administration