Software Heritage

Dec 9 2021

stsp added a revision to T3691: Implement CVS loader: D6813: fix Log keyword expansion with trailing whitespace in prefix.
Dec 9 2021, 3:48 PM · CVS loader, Archive coverage
olasd closed T3785: Update scheduler metrics routine is taking too long as Resolved.

SQL schema version 32 (from D6812) with the updated update_metrics function has been deployed in staging and prod.

Dec 9 2021, 3:22 PM · Archive coverage, System administration
stsp added a comment to T3691: Implement CVS loader.

I have started test conversions of the OpenBSD CVS repository.

Dec 9 2021, 3:21 PM · CVS loader, Archive coverage
ardumont updated the task description for T3785: Update scheduler metrics routine is taking too long.
Dec 9 2021, 3:18 PM · Archive coverage, System administration
ardumont placed T3785: Update scheduler metrics routine is taking too long up for grabs.
Dec 9 2021, 3:14 PM · Archive coverage, System administration
ardumont changed the status of T3785: Update scheduler metrics routine is taking too long from Open to Work in Progress.
Dec 9 2021, 3:14 PM · Archive coverage, System administration
ardumont added a revision to T3785: Update scheduler metrics routine is taking too long: D6812: Use a temporary table to update scheduler metrics.
Dec 9 2021, 3:14 PM · Archive coverage, System administration
ardumont triaged T3785: Update scheduler metrics routine is taking too long as Unbreak Now! priority.
Dec 9 2021, 3:13 PM · Archive coverage, System administration
ardumont added a comment to T2400: Ingest current and historical Ubuntu releases.

The update scheduler metrics routine that the graph (mentioned in the description)
relies upon is taking a long time to finish (investigation ongoing [3]), hence the
apparently stale data.

Dec 9 2021, 2:54 PM · System administration, Debian loader, Package Loader, Archive coverage

Dec 8 2021

ardumont moved T2400: Ingest current and historical Ubuntu releases from in-progress to deployed/landed/monitoring on the System administration board.
Dec 8 2021, 4:44 PM · System administration, Debian loader, Package Loader, Archive coverage
stsp added a revision to T3691: Implement CVS loader: D6791: support custom keywords during rsync:// conversion.
Dec 8 2021, 3:53 PM · CVS loader, Archive coverage
ardumont updated the task description for T2400: Ingest current and historical Ubuntu releases.
Dec 8 2021, 3:44 PM · System administration, Debian loader, Package Loader, Archive coverage
ardumont added a revision to T2400: Ingest current and historical Ubuntu releases: D6790: changelog: Reference the Ubuntu releases ingestions.
Dec 8 2021, 3:44 PM · System administration, Debian loader, Package Loader, Archive coverage
ardumont updated the task description for T2400: Ingest current and historical Ubuntu releases.
Dec 8 2021, 3:33 PM · System administration, Debian loader, Package Loader, Archive coverage
ardumont updated the task description for T2400: Ingest current and historical Ubuntu releases.
Dec 8 2021, 3:31 PM · System administration, Debian loader, Package Loader, Archive coverage
ardumont updated the task description for T2400: Ingest current and historical Ubuntu releases.
Dec 8 2021, 3:23 PM · System administration, Debian loader, Package Loader, Archive coverage
ardumont added a comment to T2400: Ingest current and historical Ubuntu releases.

Scheduling has been done [1]. The listing process happened [2]. Now let the Debian
loaders do their job (ongoing) [3].

Dec 8 2021, 3:18 PM · System administration, Debian loader, Package Loader, Archive coverage
ardumont renamed T2400: Ingest current and historical Ubuntu releases from ingest Ubuntu to Ingest current and history Ubuntu releases.
Dec 8 2021, 2:43 PM · System administration, Debian loader, Package Loader, Archive coverage
ardumont moved T2400: Ingest current and historical Ubuntu releases from Backlog to in-progress on the System administration board.
Dec 8 2021, 2:43 PM · System administration, Debian loader, Package Loader, Archive coverage
ardumont added projects to T2400: Ingest current and historical Ubuntu releases: Package Loader, Debian loader, System administration.
Dec 8 2021, 2:43 PM · System administration, Debian loader, Package Loader, Archive coverage
ardumont added a comment to T2400: Ingest current and historical Ubuntu releases.

Adaptations in the loader core for the Ubuntu historical releases have been deployed within the v2.0 scope.
I'm looking into triggering the necessary tasks now.

Dec 8 2021, 2:41 PM · System administration, Debian loader, Package Loader, Archive coverage
ardumont closed T3774: Deploy swh.loader.core v2.0.0, a subtask of T2400: Ingest current and historical Ubuntu releases, as Resolved.
Dec 8 2021, 2:33 PM · System administration, Debian loader, Package Loader, Archive coverage
ardumont updated the task description for T1724: Maven Central repository support.
Dec 8 2021, 12:32 PM · Maven loader, Maven lister, GSoC 2019, Archive coverage
borisbaldassari added a comment to T1724: Maven Central repository support.

On second thoughts: in order to run the docker-dev setup, I also had to run a virtual machine alongside the swh setup to host the text index file, and make sure the swh vm could access it.
I suppose that any vm/docker/baremetal machine with an apache/nginx server could do for that, as long as the lister can http-fetch the .fld file.
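As a minimal sketch of that idea, a small Python process can stand in for the apache/nginx server. The `serve_directory` helper is hypothetical and not part of swh; it only relies on the standard library to expose a directory (assumed to contain the exported .fld file) over HTTP.

```python
import functools
import http.server
import threading


def serve_directory(directory, host="127.0.0.1", port=0):
    """Serve `directory` over HTTP from a background thread.

    Hypothetical helper for local testing only. port=0 lets the OS pick
    a free port; read it back from `server.server_address[1]`.
    """
    handler = functools.partial(
        http.server.SimpleHTTPRequestHandler, directory=directory
    )
    server = http.server.ThreadingHTTPServer((host, port), handler)
    # Daemon thread: the server stops when the main program exits.
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server
```

To let another VM or container reach it, the host would likely need to be `0.0.0.0` rather than the loopback default used here.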

Dec 8 2021, 12:18 PM · Maven loader, Maven lister, GSoC 2019, Archive coverage
borisbaldassari added a comment to T1724: Maven Central repository support.

I'm asking you for a diff with the exact changes you had to make in the
swh-environment/docker/docker-compose.yml (and other folders) to actually make it run.
That will definitely help for the deployment on staging.

Dec 8 2021, 12:06 PM · Maven loader, Maven lister, GSoC 2019, Archive coverage
borisbaldassari added a revision to T1724: Maven Central repository support: D6784: maven: diff docker dev setup.
Dec 8 2021, 11:58 AM · Maven loader, Maven lister, GSoC 2019, Archive coverage
ardumont updated the task description for T1724: Maven Central repository support.
Dec 8 2021, 11:31 AM · Maven loader, Maven lister, GSoC 2019, Archive coverage
ardumont added a comment to T1724: Maven Central repository support.

I'm not sure what you mean by the docker diff.

Dec 8 2021, 11:25 AM · Maven loader, Maven lister, GSoC 2019, Archive coverage
stsp added a revision to T3691: Implement CVS loader: D6781: fix the top-level directory path of imported CVS modules.
Dec 8 2021, 9:51 AM · CVS loader, Archive coverage
ardumont added a subtask for T2400: Ingest current and historical Ubuntu releases: T3774: Deploy swh.loader.core v2.0.0.
Dec 8 2021, 9:06 AM · System administration, Debian loader, Package Loader, Archive coverage

Dec 7 2021

stsp added a revision to T3691: Implement CVS loader: D6762: update test suite documentation.
Dec 7 2021, 11:43 AM · CVS loader, Archive coverage
stsp added a revision to T3691: Implement CVS loader: D6758: make CVS loader create one snapshot per visit.
Dec 7 2021, 10:59 AM · CVS loader, Archive coverage

Dec 6 2021

borisbaldassari added a comment to T1724: Maven Central repository support.

I'm not sure what you mean by the docker diff. Is that the update of the maven-index-exporter repository at D6740?
The above-mentioned repository has documentation to build, test and run the text index generation. As mentioned there I've also created a bunch of compressed text index exports, that can be used to test the lister/loader without running the docker image immediately. They are all real-world extracts obtained by running the docker image on the list of Maven repositories I could get as of last week. They together represent a few million artefacts.

Dec 6 2021, 10:16 PM · Maven loader, Maven lister, GSoC 2019, Archive coverage
anlambert added a revision to T2400: Ingest current and historical Ubuntu releases: D6755: hashutil: Add support for md5 sum.
Dec 6 2021, 5:43 PM · System administration, Debian loader, Package Loader, Archive coverage
ardumont updated the task description for T1724: Maven Central repository support.
Dec 6 2021, 2:59 PM · Maven loader, Maven lister, GSoC 2019, Archive coverage
anlambert added a revision to T2400: Ingest current and historical Ubuntu releases: D6750: debian: Add md5 sum fallback when sha* checksum is missing in metadata.
Dec 6 2021, 2:36 PM · System administration, Debian loader, Package Loader, Archive coverage
anlambert added a comment to T2400: Ingest current and historical Ubuntu releases.

I also tested the listing of Ubuntu historical releases and it went fine.

Dec 6 2021, 1:06 PM · System administration, Debian loader, Package Loader, Archive coverage
ardumont added a comment to T2400: Ingest current and historical Ubuntu releases.

After discussing with @olasd, the plan is mostly ok for the "current" release.

Dec 6 2021, 12:39 PM · System administration, Debian loader, Package Loader, Archive coverage
ardumont updated the task description for T1724: Maven Central repository support.
Dec 6 2021, 10:24 AM · Maven loader, Maven lister, GSoC 2019, Archive coverage

Dec 4 2021

stsp added a revision to T3691: Implement CVS loader: D6745: fix expansion of the Log keyword with rsync origins.
Dec 4 2021, 5:39 PM · CVS loader, Archive coverage

Dec 3 2021

anlambert added a revision to T2400: Ingest current and historical Ubuntu releases: D6744: debian: Update last_update for a package when required.
Dec 3 2021, 5:50 PM · System administration, Debian loader, Package Loader, Archive coverage
ardumont changed the status of T2400: Ingest current and historical Ubuntu releases from Open to Work in Progress.
Dec 3 2021, 5:38 PM · System administration, Debian loader, Package Loader, Archive coverage
ardumont closed T3765: Deploy latest swh.loader.core and swh.lister, a subtask of T2400: Ingest current and historical Ubuntu releases, as Resolved.
Dec 3 2021, 5:35 PM · System administration, Debian loader, Package Loader, Archive coverage
ardumont updated subscribers of T2400: Ingest current and historical Ubuntu releases.

Heads up: now that the lister/loader are unstuck on production (and Debian origins are
actually scheduled), this is a technical go for this.

Dec 3 2021, 5:32 PM · System administration, Debian loader, Package Loader, Archive coverage
ardumont added a subtask for T2400: Ingest current and historical Ubuntu releases: T3765: Deploy latest swh.loader.core and swh.lister.
Dec 3 2021, 5:29 PM · System administration, Debian loader, Package Loader, Archive coverage
anlambert added a revision to T2400: Ingest current and historical Ubuntu releases: D6743: debian: Provide last_update to produced ListedOrigin models.
Dec 3 2021, 4:14 PM · System administration, Debian loader, Package Loader, Archive coverage
anlambert added a comment to T2400: Ingest current and historical Ubuntu releases.

I also tested the listing of Linux Mint source packages. Apart from a small fix to add to the lister (D6741), the listing went fine:

(swh) ✘-1 ~/swh/swh-environment/docker [master|✚ 1⚑ 41] 
14:22 $ docker-compose exec swh-scheduler swh scheduler task add list-debian-distribution -p oneshot distribution=LinuxMint mirror_url=https://mirror.crexio.com/linuxmint/packages/ suites="[betsy, cindy, debbie, debian, elyssa, felicia, gloria, helena, helena-fluxbox, helena-kde, helena-lxde, helena-xfce, isadora, isadora-fluxbox, isadora-kde, isadora-lxde, isadora-xfce, julia, julia-fluxbox, julia-kde, julia-lxde, julia-xfce, katya, katya-fluxbox, katya-kde, katya-lxde, lisa, lisa-kde, lisa-lxde, maya, nadia, olivia, petra, qiana, rafaela, rebecca, romeo, rosa, sarah, serena, sonya, sylvia, tara, tessa, tina, tricia, ulyana, ulyssa, uma, una]" components="[backport, debian, import, incoming, main, romeo, upstream]"
Dec 3 2021, 2:36 PM · System administration, Debian loader, Package Loader, Archive coverage
anlambert added a comment to T2400: Ingest current and historical Ubuntu releases.

After trying to list and load Ubuntu packages in the docker environment, I found a couple of issues in the Debian lister and loader that were preventing successful listing and loading.
Those issues have been fixed in D6735, D6737 and D6738.

Dec 3 2021, 1:58 PM · System administration, Debian loader, Package Loader, Archive coverage
borisbaldassari added a revision to T1724: Maven Central repository support: D6740: Update maven-index-exporter from gh/borisbaldassari/maven-index-exporter.
Dec 3 2021, 1:57 PM · Maven loader, Maven lister, GSoC 2019, Archive coverage
anlambert added a revision to T2400: Ingest current and historical Ubuntu releases: D6738: debian: Update extra_loader_arguments dict produced ListedOrigin models.
Dec 3 2021, 11:19 AM · System administration, Debian loader, Package Loader, Archive coverage
anlambert added a revision to T2400: Ingest current and historical Ubuntu releases: D6737: debian: Fix a couple of issues in the loader.
Dec 3 2021, 11:18 AM · System administration, Debian loader, Package Loader, Archive coverage

Dec 2 2021

anlambert added a revision to T2400: Ingest current and historical Ubuntu releases: D6735: debian: Add missing file URIs in lister output.
Dec 2 2021, 5:31 PM · System administration, Debian loader, Package Loader, Archive coverage

Dec 1 2021

zack raised the priority of T2400: Ingest current and historical Ubuntu releases from Normal to High.
Dec 1 2021, 12:39 PM · System administration, Debian loader, Package Loader, Archive coverage
borisbaldassari updated the task description for T1724: Maven Central repository support.
Dec 1 2021, 10:35 AM · Maven loader, Maven lister, GSoC 2019, Archive coverage
ardumont updated the task description for T1724: Maven Central repository support.
Dec 1 2021, 10:32 AM · Maven loader, Maven lister, GSoC 2019, Archive coverage

Nov 29 2021

stsp added a revision to T3691: Implement CVS loader: D6708: fix expansion of multiple RCS keywords on a line via rsync.
Nov 29 2021, 7:11 PM · CVS loader, Archive coverage

Nov 24 2021

stsp added a revision to T3691: Implement CVS loader: D6684: fix regular expression used for matching RCS keywords.
Nov 24 2021, 12:30 PM · CVS loader, Archive coverage
stsp added a revision to T3691: Implement CVS loader: D6678: attempt to avoid content differences due to paths in keywords.
Nov 24 2021, 12:30 PM · CVS loader, Archive coverage
stsp added a comment to T3691: Implement CVS loader.

D6684 addresses another keyword expansion issue found while testing conversion of CVS's own history.

Nov 24 2021, 12:30 PM · CVS loader, Archive coverage
stsp added a comment to T3691: Implement CVS loader.

The above problem with the Header keyword can be worked around (at least for the GNU savannah site) with the patch in D6678.

Nov 24 2021, 10:53 AM · CVS loader, Archive coverage

Nov 23 2021

borisbaldassari added a comment to T1724: Maven Central repository support.

Hi there,

Nov 23 2021, 9:21 AM · Maven loader, Maven lister, GSoC 2019, Archive coverage

Nov 22 2021

ardumont renamed T3746: staging: Deploy maven indexer/lister/loader from staging: Deploy maven exporter/lister/loader to staging: Deploy maven indexer/lister/loader.
Nov 22 2021, 2:38 PM · Maven loader, Maven lister, System administration, Archive coverage
ardumont triaged T3746: staging: Deploy maven indexer/lister/loader as Normal priority.
Nov 22 2021, 12:01 PM · Maven loader, Maven lister, System administration, Archive coverage
ardumont updated the task description for T1724: Maven Central repository support.
Nov 22 2021, 11:56 AM · Maven loader, Maven lister, GSoC 2019, Archive coverage
ardumont updated the task description for T1724: Maven Central repository support.
Nov 22 2021, 11:56 AM · Maven loader, Maven lister, GSoC 2019, Archive coverage
ardumont renamed T1724: Maven Central repository support from Maven Central repository Lister to Maven Central repository support.
Nov 22 2021, 11:54 AM · Maven loader, Maven lister, GSoC 2019, Archive coverage

Nov 18 2021

ardumont moved T3374: Ingest sourceforge repositories (origins of type git, svn, hg) from code-review/await-feedback/pause to done on the System administration board.
Nov 18 2021, 3:17 PM · System administration, Archive coverage, Origin-SourceForge
ardumont moved T3599: List and ingest heptapod instances from code-review/await-feedback/pause to done on the System administration board.
Nov 18 2021, 3:17 PM · System administration, Archive coverage
ardumont closed T3717: Ingest opam instance https://coq.inria.fr/opam/released/ as Resolved.
Nov 18 2021, 3:16 PM · System administration, Archive coverage, Opam
ardumont moved T3717: Ingest opam instance https://coq.inria.fr/opam/released/ from code-review/await-feedback/pause to deployed/landed/monitoring on the System administration board.
Nov 18 2021, 3:16 PM · System administration, Archive coverage, Opam

Nov 17 2021

ardumont moved T3717: Ingest opam instance https://coq.inria.fr/opam/released/ from in-progress to code-review/await-feedback/pause on the System administration board.
Nov 17 2021, 3:00 PM · System administration, Archive coverage, Opam
ardumont added a revision to T3717: Ingest opam instance https://coq.inria.fr/opam/released/: D6647: changelog: Reference the opam coq repository ingestion.
Nov 17 2021, 2:08 PM · System administration, Archive coverage, Opam

Nov 12 2021

stsp added a comment to T3691: Implement CVS loader.

Another problem with keyword expansion found during testing:

Nov 12 2021, 11:41 AM · CVS loader, Archive coverage
stsp added a revision to T3691: Implement CVS loader: D6638: preserve empty lines in CVS log messages over pserver.
Nov 12 2021, 11:38 AM · CVS loader, Archive coverage

Nov 10 2021

ardumont added a comment to T3717: Ingest opam instance https://coq.inria.fr/opam/released/.

@ardumont For production, drop the / at the end of the URL; in staging, it's duplicated...

Nov 10 2021, 6:27 PM · System administration, Archive coverage, Opam
ardumont changed the status of T3717: Ingest opam instance https://coq.inria.fr/opam/released/ from Open to Work in Progress.
Nov 10 2021, 6:27 PM · System administration, Archive coverage, Opam
ardumont added a project to T3717: Ingest opam instance https://coq.inria.fr/opam/released/: System administration.
Nov 10 2021, 6:24 PM · System administration, Archive coverage, Opam
ardumont added a comment to T3717: Ingest opam instance https://coq.inria.fr/opam/released/.

This got added to the staging infrastructure, which led to some surprises.
This mostly got fixed in the commits ^.

Nov 10 2021, 6:23 PM · System administration, Archive coverage, Opam
stsp added a comment to T3691: Implement CVS loader.
In T3691#73518, @stsp wrote:

There is another problem related to keywords: Some CVS-based projects use custom keywords, instead of the standard $Id$ keyword. This prevents wrong expansion of $Id$ when code is imported from one project to another. Usually the project's name will be used as the custom keyword name, such as $OpenBSD$ or $NetBSD$, instead of $Id$. At present, to expand keywords correctly in this case, we need to use the pserver access method to benefit from server-side keyword expansion. But we will end up with different hashes if rsync is used to import the same origin again. We might be able to auto-detect use of custom keywords if the rsync server allows access to the CVSROOT folder, but this is not always the case. If CVSROOT is hidden from rsync, the only reliable way to detect custom keywords would be a parameter that gets passed into the loader. We could, for example, allow passing the name of a custom keyword as a parameter embedded in the origin URL.
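A minimal sketch of that last idea follows. It assumes a hypothetical `keyword` query parameter in the origin URL; the parameter name and both helper functions are illustrative, not the loader's actual interface. It also assumes that when a custom keyword is given, the standard $Id$ keyword is left unexpanded.

```python
import re
from urllib.parse import parse_qs, urlsplit, urlunsplit


def split_custom_keyword(origin_url):
    """Return (url_without_query, custom_keyword_or_None).

    Assumes a hypothetical `keyword` query parameter, for example
    rsync://example.org/cvs/module?keyword=OpenBSD
    """
    parts = urlsplit(origin_url)
    keyword = parse_qs(parts.query).get("keyword", [None])[0]
    clean_url = urlunsplit((parts.scheme, parts.netloc, parts.path, "", ""))
    return clean_url, keyword


def expand_id_keyword(line, expansion, custom_keyword=None):
    """Expand $Id$, or $<custom_keyword>$ when one is given, on one line."""
    name = custom_keyword or "Id"
    # Matches the collapsed form ($Id$) and an expanded form ($Id: ... $).
    pattern = re.compile(r"\$" + re.escape(name) + r"(?::[^$\n]*)?\$")
    # A function replacement avoids interpreting backslashes in `expansion`.
    return pattern.sub(lambda m: "$%s: %s $" % (name, expansion), line)
```

With this shape, an rsync origin and a pserver origin for the same project could be given the same keyword name, so client-side expansion has a chance of matching what the server would produce.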

Nov 10 2021, 1:12 PM · CVS loader, Archive coverage

Nov 9 2021

stsp added a comment to T3691: Implement CVS loader.

As of D6623 the CVS loader is able to convert GNU dino correctly over both rsync and pserver access.

Nov 9 2021, 1:30 PM · CVS loader, Archive coverage
stsp added a revision to T3691: Implement CVS loader: D6623: add CVS commit ID support to rlog.py.
Nov 9 2021, 1:27 PM · CVS loader, Archive coverage

Nov 8 2021

zack updated the task description for T3717: Ingest opam instance https://coq.inria.fr/opam/released/.
Nov 8 2021, 2:41 PM · System administration, Archive coverage, Opam
zack added a project to T3717: Ingest opam instance https://coq.inria.fr/opam/released/: Archive coverage.
Nov 8 2021, 2:41 PM · System administration, Archive coverage, Opam

Nov 5 2021

stsp added a comment to T3691: Implement CVS loader.

Status update:

Nov 5 2021, 2:31 PM · CVS loader, Archive coverage

Oct 31 2021

stsp added a revision to T3691: Implement CVS loader: D6593: handle Attic-only RCS files over CVS pserver.
Oct 31 2021, 11:32 PM · CVS loader, Archive coverage

Oct 30 2021

stsp added a revision to T3691: Implement CVS loader: D6590: add support for RCS keyword expansion over pserver protocol.
Oct 30 2021, 11:49 AM · CVS loader, Archive coverage

Oct 28 2021

ardumont closed T3667: Orchestrate origins scheduling according to scheduler metrics feedback, a subtask of T2345: Improve handling of recurrent loading tasks in scheduler, as Resolved.
Oct 28 2021, 5:13 PM · Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont closed T3667: Orchestrate origins scheduling according to scheduler metrics feedback as Resolved.
Oct 28 2021, 5:13 PM · System administration, Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont moved T3667: Orchestrate origins scheduling according to scheduler metrics feedback from in-progress to deployed/landed/monitoring on the System administration board.
Oct 28 2021, 4:43 PM · System administration, Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont moved T3667: Orchestrate origins scheduling according to scheduler metrics feedback from Backlog to in-progress on the System administration board.
Oct 28 2021, 4:43 PM · System administration, Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont added a project to T3667: Orchestrate origins scheduling according to scheduler metrics feedback: System administration.
Oct 28 2021, 4:42 PM · System administration, Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont added a comment to T3667: Orchestrate origins scheduling according to scheduler metrics feedback.

Deployed in production.

Oct 28 2021, 4:42 PM · System administration, Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont changed the status of T3667: Orchestrate origins scheduling according to scheduler metrics feedback, a subtask of T2345: Improve handling of recurrent loading tasks in scheduler, from Open to Work in Progress.
Oct 28 2021, 4:34 PM · Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont changed the status of T3667: Orchestrate origins scheduling according to scheduler metrics feedback from Open to Work in Progress.

Deployed in staging.

Oct 28 2021, 4:34 PM · System administration, Sprint 2021 01, Archive coverage, Scheduling utilities
olasd added a revision to T3667: Orchestrate origins scheduling according to scheduler metrics feedback: D6583: scheduler: Add service to schedule recurrent visits.
Oct 28 2021, 4:16 PM · System administration, Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont added a revision to T3667: Orchestrate origins scheduling according to scheduler metrics feedback: D6574: scheduler: Add schedule recurrent tasks service.
Oct 28 2021, 11:35 AM · System administration, Sprint 2021 01, Archive coverage, Scheduling utilities

Oct 27 2021

stsp added a revision to T3691: Implement CVS loader: D6566: test checkout of file lacking trailing \n over pserver protocol.
Oct 27 2021, 4:20 PM · CVS loader, Archive coverage
stsp added a revision to T3691: Implement CVS loader: D6559: cvsclient: handle additional responses sent by server.
Oct 27 2021, 3:55 PM · CVS loader, Archive coverage
stsp added a revision to T3691: Implement CVS loader: D6561: rlog: fix loading of CVS commits which have a commit ID.
Oct 27 2021, 3:55 PM · CVS loader, Archive coverage
stsp added a revision to T3691: Implement CVS loader: D6560: rlog: fix parsing of multiple file revisions.
Oct 27 2021, 3:55 PM · CVS loader, Archive coverage