Page MenuHomeSoftware Heritage
Feed All Stories

Nov 13 2022

bchauvet triaged T4678: Automation of add forge now workflow as Normal priority.
Nov 13 2022, 1:14 PM · Add Forge Now

Nov 12 2022

zack committed rMSLD6f491d0d9a23: check-in slides for talk at Biennale Tecnologia 2022, Turin, Italy (authored by zack).
check-in slides for talk at Biennale Tecnologia 2022, Turin, Italy
Nov 12 2022, 2:45 PM
zack committed rMSLDa239fce04a2f: move reusable ESE slides from 2022-09-28 talk to modules/ (refactoring) (authored by zack).
move reusable ESE slides from 2022-09-28 talk to modules/ (refactoring)
Nov 12 2022, 2:07 PM

Nov 11 2022

KShivendu added a comment to D8386: feat(fedora): Introduce fedora lister.

I noticed that https://archive.softwareheritage.org/browse/origin/directory/?origin_url=deb://Ubuntu/packages/nginx has duplicate branch names, which is very confusing. In fact, even the default branch is repeated twice and I see two check marks. If we use branch names like 0.3.9-15.fc26, won't the same happen with Fedora listers? It doesn't seem to differentiate between the editions. (or does it?)

Nov 11 2022, 8:29 AM
swh-public-ci added a comment to D8386: feat(fedora): Introduce fedora lister.

Build is green

Nov 11 2022, 7:10 AM
KShivendu updated the diff for D8386: feat(fedora): Introduce fedora lister.
  • Send origins in the last iteration
  • Change origin url format
  • Use intrinsic version for pkg version key
Nov 11 2022, 7:04 AM

Nov 10 2022

vlorentz requested review of D8832: luigi: Add DownloadFromS3 task.
Nov 10 2022, 4:43 PM
ardumont added a comment to D8825: Configure gitlab projects declaratively.

I wonder whether we should merge this config file and that of the other script, so that we get all settings in one place.

What do you think?

Yes, i think so too. But which script, to be sure, manage_user_groups right?

I just started it that way to simplify the reading.
I think that i'll adapt according to what you proposed here then update the diff then we land.

Nov 10 2022, 4:41 PM
vlorentz closed D8829: Add luigi tasks.
Nov 10 2022, 4:38 PM
vlorentz closed D8828: cli: Move the main code of export_graph to its own function.
Nov 10 2022, 4:38 PM
vlorentz committed rDDATASET058e568492ba: Add luigi tasks (authored by vlorentz).
Add luigi tasks
Nov 10 2022, 4:38 PM
vlorentz committed rDDATASETeea3e15bf7e4: cli: Move the main code of export_graph to its own function (authored by vlorentz).
cli: Move the main code of export_graph to its own function
Nov 10 2022, 4:38 PM
vlorentz closed D8827: athena: Fix create_table to work with restricted permissions.
Nov 10 2022, 4:38 PM
vlorentz committed rDDATASET5087a463974e: athena: Fix create_table to work with restricted permissions (authored by vlorentz).
athena: Fix create_table to work with restricted permissions
Nov 10 2022, 4:38 PM
vlorentz updated the summary of D8829: Add luigi tasks.
Nov 10 2022, 4:37 PM
vlorentz added a comment to D8829: Add luigi tasks.

Overall, that feels like luigi is a kinda "distributed makefile" (with a python dsl).

Nov 10 2022, 4:28 PM
ardumont updated the test plan for D8831: Merge gitlab tools to manage groups and projects into a unified cli.
Nov 10 2022, 4:23 PM
ardumont updated the diff for D8831: Merge gitlab tools to manage groups and projects into a unified cli.

Add cli.py

Nov 10 2022, 4:22 PM
ardumont requested review of D8831: Merge gitlab tools to manage groups and projects into a unified cli.
Nov 10 2022, 4:20 PM
ardumont closed D8825: Configure gitlab projects declaratively.
Nov 10 2022, 3:45 PM
ardumont committed rDSNIPecab8f25d50c: Configure gitlab projects declaratively (authored by ardumont).
Configure gitlab projects declaratively
Nov 10 2022, 3:45 PM
ardumont accepted D8828: cli: Move the main code of export_graph to its own function.
Nov 10 2022, 3:44 PM
ardumont accepted D8829: Add luigi tasks.
Nov 10 2022, 3:43 PM
ardumont added a comment to D8829: Add luigi tasks.

one typo inline.

Nov 10 2022, 3:43 PM
ardumont accepted D8827: athena: Fix create_table to work with restricted permissions.
Nov 10 2022, 3:27 PM
ardumont retitled D8825: Configure gitlab projects declaratively from Allow gitlab projects configuration declaratively to Configure gitlab projects declaratively.
Nov 10 2022, 3:26 PM
lunar closed D8830: Stop scheduling mirror tests while building Docker images.
Nov 10 2022, 3:26 PM
lunar committed rCJSWH948efe646897: Stop scheduling mirror tests while building Docker images (authored by lunar).
Stop scheduling mirror tests while building Docker images
Nov 10 2022, 3:26 PM
olasd accepted D8830: Stop scheduling mirror tests while building Docker images.
Nov 10 2022, 3:25 PM
ardumont accepted D8830: Stop scheduling mirror tests while building Docker images.
Nov 10 2022, 3:24 PM
ardumont updated the summary of D8830: Stop scheduling mirror tests while building Docker images.
Nov 10 2022, 3:24 PM
ardumont updated the diff for D8825: Configure gitlab projects declaratively.

Adapt according to last remark, dump partial json updated project.

Nov 10 2022, 3:23 PM
lunar requested review of D8830: Stop scheduling mirror tests while building Docker images.
Nov 10 2022, 3:21 PM
ardumont updated the test plan for D8825: Configure gitlab projects declaratively.
Nov 10 2022, 3:19 PM
ardumont added a comment to P1526 manage project run in staging.

And fix some staging issues (after that first run):

Nov 10 2022, 3:19 PM
ardumont updated the title for P1526 manage project run in staging from give it a run to manage project run in staging.
Nov 10 2022, 3:12 PM
anlambert requested changes to D8753: feat: Introduce RPM loader.

@KShivendu , I added some inline comments to improve the loader output.

Nov 10 2022, 3:05 PM
olasd accepted D8825: Configure gitlab projects declaratively.

I'm not too sure about having a single "big" object as json output, as dumping that full json structure will only work out if the process didn't crash in the middle (and when that does, you won't really know what had happened until then). Maybe output one json object per project mutated instead?

Nov 10 2022, 3:03 PM
ardumont created P1526 manage project run in staging.
Nov 10 2022, 2:59 PM
ardumont updated the diff for D8825: Configure gitlab projects declaratively.
  • Manage only a list of namespaces and skip the other projects
  • Make the script outputs json (for easier parsing)
Nov 10 2022, 2:48 PM
ardumont updated the summary of D8825: Configure gitlab projects declaratively.
Nov 10 2022, 2:47 PM
ardumont added a comment to D8825: Configure gitlab projects declaratively.

Looks great, thanks.

Nov 10 2022, 2:07 PM
anlambert added a comment to T4632: Origin URLs generated for Fedora origins.

@anlambert What about non-Fedora RPM repositories? (RHEL, SUSE, Rocky Linux, ...)

Nov 10 2022, 1:57 PM · Archive coverage, Data Model
anlambert requested changes to D8386: feat(fedora): Introduce fedora lister.

@KShivendu thanks for the adaptations !

Nov 10 2022, 1:33 PM
vlorentz added a comment to T4632: Origin URLs generated for Fedora origins.

@anlambert What about non-Fedora RPM repositories? (RHEL, SUSE, Rocky Linux, ...)

Nov 10 2022, 1:29 PM · Archive coverage, Data Model
vlorentz requested changes to D8663: Hackage: Implement incremental mode.

buuuut you are using a strict inequality, so you need to subtract one day, in order not to miss uploads submitted after the previous run of the lister but on the same day.

Nov 10 2022, 1:23 PM
vlorentz accepted D8663: Hackage: Implement incremental mode.

Oh, I missed that you actually convert using .date() in the code, my bad.

Nov 10 2022, 1:19 PM
vlorentz requested changes to D8663: Hackage: Implement incremental mode.

lastUpload is documented as working on dates rather than datetime, so we are going to need to do the same stuff as with NuGet.

Nov 10 2022, 1:18 PM
vlorentz accepted D8748: Nuget: Implement incremental listing.
Nov 10 2022, 1:06 PM
vlorentz requested changes to D8760: Add payloads to ExtIDs.
Nov 10 2022, 1:05 PM
vlorentz added a comment to T4675: /browse/search/ crashes when swh-search results are not in swh-storage.

It seems the fix is to only encode the ? in an origin URL when it is provided as URL argument.

Nov 10 2022, 12:59 PM · Web app
olasd added a comment to D8825: Configure gitlab projects declaratively.

Looks great, thanks.

Nov 10 2022, 12:00 PM
ardumont retitled D8825: Configure gitlab projects declaratively from gitlab projects: Unify project configuration to Allow gitlab projects configuration declaratively.
Nov 10 2022, 11:55 AM
ardumont updated the diff for D8825: Configure gitlab projects declaratively.

Adapt according to discussion and more

Nov 10 2022, 11:54 AM
anlambert added a comment to T4632: Origin URLs generated for Fedora origins.

Actually for fedora, I found a better origin URL pattern: https://src.fedoraproject.org/rpms/{pkg_name}

Nov 10 2022, 11:27 AM · Archive coverage, Data Model
ardumont added inline comments to D8825: Configure gitlab projects declaratively.
Nov 10 2022, 11:19 AM
ardumont added a comment to D8825: Configure gitlab projects declaratively.
In D8825#229537, @olasd wrote:

Looks nice!

I guess we may want to merge three levels of settings:

  • global (default) settings
  • per-namespace settings (for instance, to have all projects under infra/puppet/3rdparty have the same config overrides, e.g. disabling the issue tracker, changing the destination of webhooks to #swh-sysadm, etc.)
  • and per-repository settings (in the case of visibility, it's quite obvious)
Nov 10 2022, 11:09 AM
olasd added inline comments to D8825: Configure gitlab projects declaratively.
Nov 10 2022, 11:03 AM
anlambert requested changes to D8824: Cpan: Implement incremental mode.

@franckbret, as explained in my inline comment we cannot use the date filtering on the release index of CPAN elasticsearch.

Nov 10 2022, 11:01 AM
vlorentz requested review of D8829: Add luigi tasks.
Nov 10 2022, 10:55 AM
olasd added a comment to D8825: Configure gitlab projects declaratively.

Looks nice!

Nov 10 2022, 10:50 AM
vlorentz added a revision to T2579: swh-graph: display server and dataset versions in the live server instance: D8829: Add luigi tasks.
Nov 10 2022, 10:42 AM · Compressed graph service
vlorentz added a task to D8828: cli: Move the main code of export_graph to its own function: T4676: Add Luigi workflow in swh-dataset.
Nov 10 2022, 10:42 AM
vlorentz added a task to D8827: athena: Fix create_table to work with restricted permissions: T4676: Add Luigi workflow in swh-dataset.
Nov 10 2022, 10:42 AM
vlorentz added revisions to T4676: Add Luigi workflow in swh-dataset: D8827: athena: Fix create_table to work with restricted permissions, D8828: cli: Move the main code of export_graph to its own function, D8829: Add luigi tasks.
Nov 10 2022, 10:42 AM · Datasets, Compressed graph service
vlorentz added a parent task for T4676: Add Luigi workflow in swh-dataset: T4677: Add support for generating subdatasets in swh.dataset.luigi.
Nov 10 2022, 10:42 AM · Datasets, Compressed graph service
vlorentz added a subtask for T4677: Add support for generating subdatasets in swh.dataset.luigi: T4676: Add Luigi workflow in swh-dataset.
Nov 10 2022, 10:42 AM · Datasets
vlorentz triaged T4677: Add support for generating subdatasets in swh.dataset.luigi as Normal priority.
Nov 10 2022, 10:42 AM · Datasets
vlorentz triaged T4676: Add Luigi workflow in swh-dataset as High priority.
Nov 10 2022, 10:41 AM · Datasets, Compressed graph service
vlorentz claimed T2220: swh-graph in production.
Nov 10 2022, 10:39 AM · Roadmap 2022, meta-task, Roadmap 2021, Compressed graph service
vlorentz requested review of D8828: cli: Move the main code of export_graph to its own function.
Nov 10 2022, 10:19 AM
vlorentz requested review of D8827: athena: Fix create_table to work with restricted permissions.
Nov 10 2022, 10:14 AM
swh-public-ci added a comment to D8386: feat(fedora): Introduce fedora lister.

Build is green

Nov 10 2022, 6:31 AM
KShivendu updated the diff for D8386: feat(fedora): Introduce fedora lister.
Nov 10 2022, 6:25 AM

Nov 9 2022

ardumont added inline comments to D8825: Configure gitlab projects declaratively.
Nov 9 2022, 10:51 PM
anlambert closed T4675: /browse/search/ crashes when swh-search results are not in swh-storage as Resolved.

Fixed and deployed.

Nov 9 2022, 6:25 PM · Web app
Harbormaster failed to build B32760: rDWAPPSed38856c601e: package.json: Upgrade dependencies for rDWAPPSed38856c601e: package.json: Upgrade dependencies!
Nov 9 2022, 5:37 PM
anlambert committed rDWAPPSed38856c601e: package.json: Upgrade dependencies (authored by anlambert).
package.json: Upgrade dependencies
Nov 9 2022, 5:26 PM
anlambert closed D8826: browse/origin-search: Percent encode ? character only in origin URL.
Nov 9 2022, 5:26 PM
anlambert committed rDWAPPS4bd225b8225c: browse/origin-search: Percent encode ? character only in origin URL (authored by anlambert).
browse/origin-search: Percent encode ? character only in origin URL
Nov 9 2022, 5:26 PM
anlambert requested review of D8826: browse/origin-search: Percent encode ? character only in origin URL.
Nov 9 2022, 5:25 PM
anlambert added a revision to T4675: /browse/search/ crashes when swh-search results are not in swh-storage: D8826: browse/origin-search: Percent encode ? character only in origin URL.
Nov 9 2022, 5:05 PM · Web app
anlambert added a comment to T4675: /browse/search/ crashes when swh-search results are not in swh-storage.

It seems the fix is to only encode the ? in an origin URL when it is provided as URL argument.
The issue we hit in production might be related to the varnish cache I think.

Nov 9 2022, 4:51 PM · Web app
ardumont requested review of D8825: Configure gitlab projects declaratively.
Nov 9 2022, 4:49 PM
anlambert added a comment to T4675: /browse/search/ crashes when swh-search results are not in swh-storage.

Hmm, I do not hit the issue locally so that is why tests did not spot the issue. This seems to only happen in production.

Nov 9 2022, 4:37 PM · Web app
anlambert added a comment to T4675: /browse/search/ crashes when swh-search results are not in swh-storage.

Actually this is not related to the replica lag but it is a regression induced by recent commit rDWAPPS4cc9676a54cc368394c05b7f19c92ea072f8041e.

Nov 9 2022, 4:26 PM · Web app
anlambert added a comment to T4675: /browse/search/ crashes when swh-search results are not in swh-storage.

Ah right I noticed that behavior when fixing a recent bug in the webapp (D8820), will fix that asap.

Nov 9 2022, 3:56 PM · Web app
vlorentz updated the task description for T4675: /browse/search/ crashes when swh-search results are not in swh-storage.
Nov 9 2022, 3:49 PM · Web app
franckbret requested review of D8824: Cpan: Implement incremental mode.
Nov 9 2022, 3:48 PM
vlorentz updated the task description for T4675: /browse/search/ crashes when swh-search results are not in swh-storage.
Nov 9 2022, 3:45 PM · Web app
anlambert closed D8823: docker: Add support for incoming rpm loader.
Nov 9 2022, 3:45 PM
anlambert committed rDENV5af2fc028973: docker: Add support for incoming rpm loader (authored by anlambert).
docker: Add support for incoming rpm loader
Nov 9 2022, 3:45 PM
vlorentz raised the priority of T4675: /browse/search/ crashes when swh-search results are not in swh-storage from Normal to High.
Nov 9 2022, 3:44 PM · Web app
vlorentz placed T4675: /browse/search/ crashes when swh-search results are not in swh-storage up for grabs.
Nov 9 2022, 3:44 PM · Web app
swh-sentry-integration assigned T4675: /browse/search/ crashes when swh-search results are not in swh-storage to vlorentz.
Nov 9 2022, 3:44 PM · Web app
anlambert added a comment to D8386: feat(fedora): Introduce fedora lister.

@KShivendu I forgot to mention in my review that we should also get the checksums associated to a rpm archive, the loader will then use it to check download integrity.

Nov 9 2022, 3:19 PM
ardumont accepted D8823: docker: Add support for incoming rpm loader.
Nov 9 2022, 3:02 PM
anlambert requested review of D8823: docker: Add support for incoming rpm loader.
Nov 9 2022, 2:58 PM
anlambert requested changes to D8386: feat(fedora): Introduce fedora lister.

@KShivendu , after testing the lister in docker environment there is room for improvements before we can accept that diff.

Nov 9 2022, 2:46 PM
anlambert added a comment to T4632: Origin URLs generated for Fedora origins.

After reviewing and hacking on the fedora lister, I think we should use origin URL in the form https://packages.fedoraproject.org/pkgs/{src_pkg_name} for a fedora source package.

Nov 9 2022, 2:09 PM · Archive coverage, Data Model