Rescheduled the lister instance to scrape clojars; it now continues on, skipping (rather than failing) when it cannot retrieve artifact information:
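The skip-instead-of-fail behavior can be sketched roughly as follows. This is purely illustrative, assuming a hypothetical fetch_artifact callable; it is not the actual swh-lister code:

```python
# Sketch of the skip-instead-of-fail pattern, assuming a hypothetical
# fetch_artifact callable; not the actual swh-lister implementation.
import logging

logger = logging.getLogger(__name__)


def list_artifacts(artifact_ids, fetch_artifact):
    """Yield artifact info, skipping (and logging) retrieval failures."""
    for artifact_id in artifact_ids:
        try:
            yield fetch_artifact(artifact_id)
        except Exception as exc:
            # Log and continue instead of aborting the whole listing run
            logger.warning("Skipping artifact %s: %s", artifact_id, exc)
            continue
```

With this pattern, a 404 on one artifact only drops that artifact from the results instead of killing the whole listing run.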
Apr 14 2022
The maven loader is now ingesting without failing:
Deploy new version:
root@pergamon:~# clush -b -w @staging-workers "dpkg -l python3-swh.lister python3-swh.loader.core"
---------------
worker[0-3].internal.staging.swh.network (4)
---------------
Desired=Unknown/Install/Remove/Purge/Hold
| Status=Not/Inst/Conf-files/Unpacked/halF-conf/Half-inst/trig-aWait/Trig-pend
|/ Err?=(none)/Reinst-required (Status,Err: uppercase=bad)
||/ Name                    Version              Architecture Description
+++-=======================-====================-============-=================================================================
ii  python3-swh.lister      2.8.0-1~swh2~bpo10+1 all          Software Heritage Listers (bitbucket, git(lab|hub), pypi, etc...)
ii  python3-swh.loader.core 2.6.2-1~swh1~bpo10+1 all          Software Heritage Loader Core
Apr 13 2022
So the gist of the deployment is done; let's fix those lister and loader issues in the dedicated task [1].
The swh-scheduler-scheduler-recurrent service needed a restart to pick up the maven tasks to be loaded.
Maven Central listing is actually ongoing (at least until the lister finds some 404, at which point it will behave the same: crash and stop).
Still, some origins are now present in listed_origins:
Maven central scheduled as well:
Scheduled it and the lister kicked in [1], but it fails on a 404 [2], which stopped the listing.
Apr 11 2022
Triggering the run for maven-central caused an issue due to the high volume of data for that one.
I'll debug some more tomorrow.
Trigger the run for clojars:
root@maven-exporter0:~# systemctl start maven_index_exporter@clojars
root@maven-exporter0:~# systemctl status maven_index_exporter@clojars
● maven_index_exporter@clojars.service - Software Heritage Maven Index Exporter clojars
     Loaded: loaded (/etc/systemd/system/maven_index_exporter@.service; enabled; vendor preset: enabled)
    Drop-In: /etc/systemd/system/maven_index_exporter@clojars.service.d
             └─parameters.conf
     Active: active (running) since Mon 2022-04-11 16:53:51 UTC; 2s ago
TriggeredBy: ● maven_index_exporter@clojars.timer
   Main PID: 4569 (bash)
      Tasks: 9 (limit: 4675)
     Memory: 56.4M
        CPU: 160ms
     CGroup: /system.slice/system-maven_index_exporter.slice/maven_index_exporter@clojars.service
             ├─4569 bash /usr/local/bin/run_maven_index_exporter.sh clojars
             └─4571 docker run -v /srv/softwareheritage/maven-index-exporter//clojars/work:/work -v /var/www/maven_index_exporter:/publish -e MVN_IDX_EXPORTER_BASE_URL=http://clojars.org/repo/ softwareheritage/maven-index-exporter:v0.2.0
...
root@maven-exporter0:~# ls -lah /var/www/maven_index_exporter/export-clojars.fld
-rwxrwxrwx 1 root root 61M Apr 11 16:54 /var/www/maven_index_exporter/export-clojars.fld
root@maven-exporter0:~# zfs get all | grep compress
data                 compressratio     8.26x   -
data                 compression       off     default
data                 refcompressratio  1.00x   -
data/mvn-idx-publish compressratio     18.88x  -
data/mvn-idx-publish compression       zstd    local
data/mvn-idx-publish refcompressratio  18.88x  -
data/mvn-idx-work    compressratio     5.84x   -
data/mvn-idx-work    compression       zstd    local
data/mvn-idx-work    refcompressratio  5.84x   -
Configure zfs partitions:
root@maven-exporter0:~# lsblk | grep vdb
vdb    254:16   0  50G  0 disk
root@maven-exporter0:~# zpool create -f data /dev/vdb
root@maven-exporter0:~# zpool status
  pool: data
 state: ONLINE
config:
Apr 8 2022
Apr 7 2022
Apr 6 2022
Apr 5 2022
Apr 1 2022
Yes, thx. Will do.
I see there is a lot of progress here, nice!
I try to follow the thread as time allows, but if you're stuck please do not hesitate to notify me.
Mar 28 2022
Mar 24 2022
Mar 22 2022
Mar 17 2022
The worst case scenario is that someone maliciously creates repositories generated on the fly that refer to each other via .gitmodules, so we end up in an infinite loop of loading garbage.
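One common guard against that worst case (purely illustrative, with a hypothetical fetch_submodule_urls hook; not the actual swh-loader-git logic) is a visited set plus a recursion depth cap while following submodule references:

```python
# Illustrative guard against mutually-referencing submodules; a sketch,
# not the actual swh-loader-git implementation.

def load_with_submodules(url, fetch_submodule_urls, visited=None, depth=0,
                         max_depth=10):
    """Return the set of repository URLs loaded, following submodules.

    fetch_submodule_urls(url) -> list of submodule URLs (hypothetical hook).
    The visited set and the depth cap prevent infinite loops on repositories
    whose .gitmodules files refer to each other.
    """
    if visited is None:
        visited = set()
    if url in visited or depth > max_depth:
        return visited
    visited.add(url)
    for sub_url in fetch_submodule_urls(url):
        load_with_submodules(sub_url, fetch_submodule_urls, visited, depth + 1)
    return visited
```

Even if two on-the-fly repositories reference each other via .gitmodules, the traversal terminates after each URL has been seen once.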
Mar 16 2022
In T3311#80997, @olasd wrote: I'm not comfortable always creating high priority tasks in this context either, as I'm not sure what the throttling implications are when we inevitably end up on a repository that references a commit in a submodule that doesn't exist.
I think the approach in D7332 is interesting, but it feels a bit expensive to be doing it for every instance of a .gitmodules file found in any new directory for all git repos that are being loaded, as well as doing it again for the top level of any known branch in the git snapshot being loaded currently.
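A cheap mitigation for that repeated cost (again only a sketch with a hypothetical parser hook, not the D7332 implementation) would be to memoize the work on the .gitmodules blob hash, so each distinct file is only processed once per load:

```python
# Sketch: memoize .gitmodules parsing by blob hash so each distinct file
# is only processed once per repository load. Hypothetical helper names.
import hashlib


def make_cached_gitmodules_parser(parse_gitmodules):
    """Wrap parse_gitmodules so identical blobs are parsed only once."""
    cache = {}

    def parse(blob_bytes):
        key = hashlib.sha1(blob_bytes).hexdigest()
        if key not in cache:
            cache[key] = parse_gitmodules(blob_bytes)
        return cache[key]

    return parse
```

Since the same .gitmodules content typically recurs across many directories and branches of a repository, keying on the blob hash collapses those repeated checks into one.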
Mar 14 2022
It's been more or less discussed above, but IMHO it would make sense to:
Mar 10 2022
Feb 22 2022
Feb 18 2022
The actual issues were not being reported properly; that's now fixed.
Feb 17 2022
Currently deployed in staging and production.
So future listing will do the right thing.
Feb 16 2022
Feb 15 2022
Could you please also open a diff with the necessary changes required for the docker stack (the swh-environment/docker changes you had to make to actually have the loader run properly)?
Feb 14 2022
Feb 13 2022
Hi @ardumont, sorry for the delay, wild week here. And thanks for the ISO 8601 fix.