Page MenuHomeSoftware Heritage
Feed Advanced Search

Sep 22 2021

ardumont added projects to T3597: List heptapod instance https://forge.extranet.logilab.fr/: Archive coverage, System administration.
Sep 22 2021, 2:38 PM · System administration, Archive coverage
ardumont claimed T3590: opam loader: Ensure required opam state is shared amongst ingestion/listing runs.
Sep 22 2021, 2:12 PM · Archive coverage, Opam
ardumont changed the status of T3590: opam loader: Ensure required opam state is shared amongst ingestion/listing runs from Open to Work in Progress.
Sep 22 2021, 2:12 PM · Archive coverage, Opam
ardumont changed the status of T3590: opam loader: Ensure required opam state is shared amongst ingestion/listing runs, a subtask of T3424: Opam support, from Open to Work in Progress.
Sep 22 2021, 2:12 PM · Archive coverage, Opam
ardumont added a revision to T3590: opam loader: Ensure required opam state is shared amongst ingestion/listing runs: D6316: opam: Share opam root directory even on multiple instances.
Sep 22 2021, 2:11 PM · Archive coverage, Opam
ardumont added a revision to T3590: opam loader: Ensure required opam state is shared amongst ingestion/listing runs: D6318: opam: Allow shared state between loader runs using multi-instance opam root.
Sep 22 2021, 2:11 PM · Archive coverage, Opam

Sep 21 2021

ardumont added a revision to T3590: opam loader: Ensure required opam state is shared amongst ingestion/listing runs: D6317: opam: Initialize opam root directory outside the constructor.
Sep 21 2021, 7:36 PM · Archive coverage, Opam
ardumont renamed T3590: opam loader: Ensure required opam state is shared amongst ingestion/listing runs from opam loader: Ensure the current opam state is shared amongst ingestion to opam loader: Ensure required opam state is shared amongst ingestion/listing runs.
Sep 21 2021, 7:28 PM · Archive coverage, Opam

Sep 20 2021

ardumont added a revision to T3590: opam loader: Ensure required opam state is shared amongst ingestion/listing runs: D6312: opam: Make the instance optional and derived from the url.
Sep 20 2021, 5:57 PM · Archive coverage, Opam
ardumont added a revision to T3590: opam loader: Ensure required opam state is shared amongst ingestion/listing runs: D6306: opam: Allow defining where to actually install the opam_root folder.
Sep 20 2021, 3:10 PM · Archive coverage, Opam
ardumont added a revision to T3590: opam loader: Ensure required opam state is shared amongst ingestion/listing runs: D6305: opam: Install and maintain up-to-date shared opam root directories.
Sep 20 2021, 3:05 PM · Archive coverage, Opam
ardumont triaged T3590: opam loader: Ensure required opam state is shared amongst ingestion/listing runs as Normal priority.
Sep 20 2021, 2:51 PM · Archive coverage, Opam
ardumont closed T3468: staging: current opam loading issues, a subtask of T3424: Opam support, as Resolved.
Sep 20 2021, 2:44 PM · Archive coverage, Opam
ardumont changed the status of T3468: staging: current opam loading issues, a subtask of T3424: Opam support, from Open to Work in Progress.
Sep 20 2021, 2:44 PM · Archive coverage, Opam

Sep 18 2021

zack added a project to T3425: Opam loader: Archive coverage.
Sep 18 2021, 8:29 AM · Archive coverage, Opam
zack added a project to T3358: Opam lister: Archive coverage.
Sep 18 2021, 8:29 AM · Archive coverage, Lister
zack added a project to T3424: Opam support: Archive coverage.
Sep 18 2021, 8:28 AM · Archive coverage, Opam

Sep 14 2021

ardumont closed T3538: Send scheduler metrics to prometheus, a subtask of T2345: Improve handling of recurrent loading tasks in scheduler, as Resolved.
Sep 14 2021, 11:00 AM · Sprint 2021 01, Archive coverage, Scheduling utilities

Sep 3 2021

ardumont added a subtask for T2345: Improve handling of recurrent loading tasks in scheduler: T3538: Send scheduler metrics to prometheus.
Sep 3 2021, 5:17 PM · Sprint 2021 01, Archive coverage, Scheduling utilities

Aug 30 2021

borisbaldassari added a revision to T1724: Maven Central repository support: D6159: maven jar-loader: Initalise files, add archive format..
Aug 30 2021, 10:22 AM · Maven loader, Maven lister, GSoC 2019, Archive coverage

Aug 29 2021

borisbaldassari added a revision to T1724: Maven Central repository support: D6158: maven jar-loader: Initalise files..
Aug 29 2021, 10:21 PM · Maven loader, Maven lister, GSoC 2019, Archive coverage

Aug 27 2021

ardumont added a revision to T1724: Maven Central repository support: D6133: maven-lister: initialise lister..
Aug 27 2021, 2:57 PM · Maven loader, Maven lister, GSoC 2019, Archive coverage

Aug 26 2021

ardumont added a comment to T2345: Improve handling of recurrent loading tasks in scheduler.

What's next, as a summary, subsequent subtasks should be created later:

Aug 26 2021, 5:45 PM · Sprint 2021 01, Archive coverage, Scheduling utilities

Aug 13 2021

ardumont added a comment to T2345: Improve handling of recurrent loading tasks in scheduler.
  • Refactor a bit the journal client to update a docstring and inline one function (done, that'd be the 2 previous commits mentioned here just below that comment ^).
  • Deactivate failing visits (delegating to listers the act of activating back those origins which gets live again). I have diffs which deal with this that needs some rebase and work according to latest change (I need to get back to it) [1].
  • Deploy the current scheduler implementation (master) when that previous point is done. (That's gonna be my goal to reach prior to some vacation break).
Aug 13 2021, 4:44 PM · Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont moved T3471: production: Deploy swh.scheduler v0.17 from deployed/landed/monitoring to done on the System administration board.
Aug 13 2021, 3:49 PM · System administration, Archive coverage, Scheduling utilities
ardumont closed T3471: production: Deploy swh.scheduler v0.17, a subtask of T2345: Improve handling of recurrent loading tasks in scheduler, as Resolved.
Aug 13 2021, 3:48 PM · Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont closed T3471: production: Deploy swh.scheduler v0.17 as Resolved.
Aug 13 2021, 3:48 PM · System administration, Archive coverage, Scheduling utilities
ardumont updated subscribers of T3471: production: Deploy swh.scheduler v0.17.

including the next-gen scheduler runner not yet puppetized [4]

All got done except this part ^.
This needs first the following:

  • D5809 to be rebased on latest master branch (v0.17)
  • the saatchi venv (in swhscheduler home) to be updated with it
Aug 13 2021, 3:48 PM · System administration, Archive coverage, Scheduling utilities
ardumont moved T3471: production: Deploy swh.scheduler v0.17 from code-review/await-feedback/pause to in-progress on the System administration board.
Aug 13 2021, 10:33 AM · System administration, Archive coverage, Scheduling utilities
ardumont moved T3471: production: Deploy swh.scheduler v0.17 from in-progress to code-review/await-feedback/pause on the System administration board.
Aug 13 2021, 10:33 AM · System administration, Archive coverage, Scheduling utilities
ardumont changed the status of T3471: production: Deploy swh.scheduler v0.17, a subtask of T2345: Improve handling of recurrent loading tasks in scheduler, from Open to Work in Progress.
Aug 13 2021, 10:33 AM · Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont changed the status of T3471: production: Deploy swh.scheduler v0.17 from Open to Work in Progress.
Aug 13 2021, 10:33 AM · System administration, Archive coverage, Scheduling utilities
ardumont edited projects for T3471: production: Deploy swh.scheduler v0.17, added: System administration; removed Sprint 2021 01.
Aug 13 2021, 10:33 AM · System administration, Archive coverage, Scheduling utilities
ardumont added a comment to T3471: production: Deploy swh.scheduler v0.17.

including the next-gen scheduler runner not yet puppetized [4]

Aug 13 2021, 10:32 AM · System administration, Archive coverage, Scheduling utilities

Aug 12 2021

ardumont moved T3471: production: Deploy swh.scheduler v0.17 from Backlog to in-progress on the Sprint 2021 01 board.
Aug 12 2021, 8:41 AM · System administration, Archive coverage, Scheduling utilities
ardumont added a comment to T3471: production: Deploy swh.scheduler v0.17.

Following actions in order:

Aug 12 2021, 8:41 AM · System administration, Archive coverage, Scheduling utilities

Aug 10 2021

zack updated the task description for T3475: leverage Shodan scans to find and ingest the "penumbra" of FOSS.
Aug 10 2021, 12:21 PM · Archive coverage
zack updated the task description for T3475: leverage Shodan scans to find and ingest the "penumbra" of FOSS.
Aug 10 2021, 12:19 PM · Archive coverage
zack triaged T3475: leverage Shodan scans to find and ingest the "penumbra" of FOSS as Low priority.
Aug 10 2021, 12:19 PM · Archive coverage

Aug 9 2021

ardumont closed T3456: staging: Deploy scheduler v0.17 as Resolved.
Aug 9 2021, 11:07 AM · System administration, Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont closed T3456: staging: Deploy scheduler v0.17, a subtask of T2345: Improve handling of recurrent loading tasks in scheduler, as Resolved.
Aug 9 2021, 11:07 AM · Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont moved T3456: staging: Deploy scheduler v0.17 from code-review/await-feedback/pause to deployed/landed/monitoring on the System administration board.
Aug 9 2021, 11:07 AM · System administration, Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont triaged T3471: production: Deploy swh.scheduler v0.17 as High priority.
Aug 9 2021, 11:06 AM · System administration, Archive coverage, Scheduling utilities

Aug 6 2021

ardumont triaged T3470: lister-sourceforge: Activate sourceforge origins when listed as High priority.
Aug 6 2021, 4:09 PM · System administration, Archive coverage, Origin-SourceForge
ardumont moved T3456: staging: Deploy scheduler v0.17 from in-progress to code-review/await-feedback/pause on the System administration board.
Aug 6 2021, 3:11 PM · System administration, Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont updated the task description for T3456: staging: Deploy scheduler v0.17.
Aug 6 2021, 3:11 PM · System administration, Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont added a comment to T3456: staging: Deploy scheduler v0.17.

Ensure the journal client is doing its new job

Aug 6 2021, 3:10 PM · System administration, Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont updated the task description for T3456: staging: Deploy scheduler v0.17.
Aug 6 2021, 3:09 PM · System administration, Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont updated the task description for T3456: staging: Deploy scheduler v0.17.
Aug 6 2021, 12:12 PM · System administration, Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont changed the status of T3456: staging: Deploy scheduler v0.17, a subtask of T2345: Improve handling of recurrent loading tasks in scheduler, from Open to Work in Progress.
Aug 6 2021, 12:11 PM · Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont changed the status of T3456: staging: Deploy scheduler v0.17 from Open to Work in Progress.
Aug 6 2021, 12:11 PM · System administration, Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont added a project to T3456: staging: Deploy scheduler v0.17: System administration.
Aug 6 2021, 12:11 PM · System administration, Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont updated the task description for T3456: staging: Deploy scheduler v0.17.
Aug 6 2021, 11:10 AM · System administration, Sprint 2021 01, Archive coverage, Scheduling utilities

Aug 5 2021

ardumont claimed T3456: staging: Deploy scheduler v0.17.
Aug 5 2021, 3:25 PM · System administration, Sprint 2021 01, Archive coverage, Scheduling utilities

Aug 4 2021

ardumont triaged T3456: staging: Deploy scheduler v0.17 as High priority.
Aug 4 2021, 10:10 AM · System administration, Sprint 2021 01, Archive coverage, Scheduling utilities

Aug 3 2021

ardumont added a revision to T3374: Ingest sourceforge repositories (origins of type git, svn, hg): D6051: changelog: Reference first completion of sourceforge hg origins.
Aug 3 2021, 11:43 AM · System administration, Archive coverage, Origin-SourceForge
ardumont updated the task description for T3374: Ingest sourceforge repositories (origins of type git, svn, hg).
Aug 3 2021, 10:17 AM · System administration, Archive coverage, Origin-SourceForge
ardumont added a comment to T2345: Improve handling of recurrent loading tasks in scheduler.

Deactivate failing visits (delegating to listers the act of activating back those
origins which gets live again). I have diffs which deal with this that needs some
rebase and work according to latest change (I need to get back to it) [1].

Aug 3 2021, 8:58 AM · Sprint 2021 01, Archive coverage, Scheduling utilities

Jul 30 2021

ardumont changed the status of T2345: Improve handling of recurrent loading tasks in scheduler from Open to Work in Progress.

(^ for a while ;)

Jul 30 2021, 3:55 PM · Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont added a comment to T2345: Improve handling of recurrent loading tasks in scheduler.

Status on this, after the recent refactoring we did with @olasd to simplify the actual
implementation (backend and journal client). There remains to:

Jul 30 2021, 3:54 PM · Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont moved T3374: Ingest sourceforge repositories (origins of type git, svn, hg) from in-progress to code-review/await-feedback/pause on the System administration board.
Jul 30 2021, 11:22 AM · System administration, Archive coverage, Origin-SourceForge
ardumont updated the task description for T3374: Ingest sourceforge repositories (origins of type git, svn, hg).
Jul 30 2021, 10:25 AM · System administration, Archive coverage, Origin-SourceForge

Jul 29 2021

ardumont moved T3374: Ingest sourceforge repositories (origins of type git, svn, hg) from Backlog to in-progress on the System administration board.
Jul 29 2021, 5:45 PM · System administration, Archive coverage, Origin-SourceForge
ardumont added a project to T3374: Ingest sourceforge repositories (origins of type git, svn, hg): System administration.
Jul 29 2021, 5:45 PM · System administration, Archive coverage, Origin-SourceForge
ardumont renamed T3374: Ingest sourceforge repositories (origins of type git, svn, hg) from Ingest sourceforge repositories to Ingest sourceforge repositories (origins of type git, svn, hg).
Jul 29 2021, 5:45 PM · System administration, Archive coverage, Origin-SourceForge
ardumont added a comment to T3374: Ingest sourceforge repositories (origins of type git, svn, hg).

The 'hg' ingestion started now that the latest loader mercurial got deployed.

Jul 29 2021, 5:43 PM · System administration, Archive coverage, Origin-SourceForge
ardumont moved T3350: Deploy sourceforge lister in production from in-progress to done on the System administration board.
Jul 29 2021, 1:23 PM · Archive coverage, System administration, Origin-SourceForge

Jul 22 2021

ardumont updated the task description for T3374: Ingest sourceforge repositories (origins of type git, svn, hg).
Jul 22 2021, 10:15 AM · System administration, Archive coverage, Origin-SourceForge
ardumont added a comment to T3374: Ingest sourceforge repositories (origins of type git, svn, hg).

The 'git' ingestion caught up [1] so now let's make the svn origins finish [2]. In
effect, making the loader run as before. Activating back the svn queue consumption where
it remains few origins to consume.

Jul 22 2021, 9:43 AM · System administration, Archive coverage, Origin-SourceForge
ardumont updated the task description for T3374: Ingest sourceforge repositories (origins of type git, svn, hg).
Jul 22 2021, 9:40 AM · System administration, Archive coverage, Origin-SourceForge

Jul 20 2021

ardumont added a comment to T3374: Ingest sourceforge repositories (origins of type git, svn, hg).

So, to improve the current situation, the svn ingestion got put in stand-by to let the
git ingestion progress as well.

Jul 20 2021, 10:09 AM · System administration, Archive coverage, Origin-SourceForge
ardumont updated the task description for T3374: Ingest sourceforge repositories (origins of type git, svn, hg).
Jul 20 2021, 9:56 AM · System administration, Archive coverage, Origin-SourceForge
ardumont added a comment to T3374: Ingest sourceforge repositories (origins of type git, svn, hg).

So a bit of status report.

Jul 20 2021, 9:53 AM · System administration, Archive coverage, Origin-SourceForge

Jul 19 2021

ardumont updated the task description for T3374: Ingest sourceforge repositories (origins of type git, svn, hg).
Jul 19 2021, 9:42 AM · System administration, Archive coverage, Origin-SourceForge

Jul 13 2021

vlorentz added a comment to T3311: Use .gitmodules to discover origins.

if it would be worth submitting these recursive origins with "save code now" so we can try to get submodule updates close to the update of the main repository

Jul 13 2021, 12:10 PM · Archive coverage, Git loader

Jul 12 2021

olasd added a comment to T3311: Use .gitmodules to discover origins.

I also wonder if we have a somewhat common approach to handle the SVN externals as well.

Jul 12 2021, 3:48 PM · Archive coverage, Git loader
olasd added a comment to T3311: Use .gitmodules to discover origins.

I think this is worthwhile in general, at least for repositories that are still live.

Jul 12 2021, 3:47 PM · Archive coverage, Git loader

Jul 9 2021

ardumont updated the task description for T3374: Ingest sourceforge repositories (origins of type git, svn, hg).
Jul 9 2021, 4:27 PM · System administration, Archive coverage, Origin-SourceForge
ardumont added a comment to T2345: Improve handling of recurrent loading tasks in scheduler.

Updated stats in descending order on the no_last_update column:

Jul 9 2021, 3:11 PM · Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont added a comment to T2345: Improve handling of recurrent loading tasks in scheduler.

Relatedly to this task, some work has been started to make the pypi lister list its
origins with the last_update information in the diff D5977 / T3399 (review got done
and the implementation needs to be improved but still ;).

Jul 9 2021, 3:05 PM · Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont closed T3399: Improve PyPI lister to pull last update information when running incrementally, a subtask of T2345: Improve handling of recurrent loading tasks in scheduler, as Resolved.
Jul 9 2021, 2:52 PM · Sprint 2021 01, Archive coverage, Scheduling utilities

Jul 8 2021

ardumont added a comment to T2345: Improve handling of recurrent loading tasks in scheduler.

Status on the latest development for this task, "Baseline for the recurrence of origin
visits" chapter has been implemented in the following stacked diffs (in review):

Jul 8 2021, 12:30 PM · Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont added a revision to T2345: Improve handling of recurrent loading tasks in scheduler: D5980: journal_client: Disable origins when too many visited attempts failed.
Jul 8 2021, 11:26 AM · Sprint 2021 01, Archive coverage, Scheduling utilities

Jul 7 2021

ardumont added a revision to T2345: Improve handling of recurrent loading tasks in scheduler: D5978: Add a successive_visits counter to origin visit stats.
Jul 7 2021, 5:26 PM · Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont updated the task description for T3374: Ingest sourceforge repositories (origins of type git, svn, hg).
Jul 7 2021, 11:13 AM · System administration, Archive coverage, Origin-SourceForge

Jul 5 2021

ardumont updated the task description for T3374: Ingest sourceforge repositories (origins of type git, svn, hg).
Jul 5 2021, 11:18 AM · System administration, Archive coverage, Origin-SourceForge

Jul 1 2021

ardumont added a revision to T2345: Improve handling of recurrent loading tasks in scheduler: D5956: Introduce new scheduling policy to grab origins without last update.
Jul 1 2021, 12:34 PM · Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont added a revision to T2345: Improve handling of recurrent loading tasks in scheduler: D5950: journal_client: Compute next position for origin visit.
Jul 1 2021, 10:14 AM · Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont updated the task description for T3374: Ingest sourceforge repositories (origins of type git, svn, hg).
Jul 1 2021, 10:05 AM · System administration, Archive coverage, Origin-SourceForge
ardumont added a revision to T3374: Ingest sourceforge repositories (origins of type git, svn, hg): D5952: changelog: Reference first completion of sourceforge git/svn origins.
Jul 1 2021, 9:26 AM · System administration, Archive coverage, Origin-SourceForge

Jun 28 2021

ardumont updated the task description for T3374: Ingest sourceforge repositories (origins of type git, svn, hg).
Jun 28 2021, 12:04 PM · System administration, Archive coverage, Origin-SourceForge
ardumont updated the task description for T3374: Ingest sourceforge repositories (origins of type git, svn, hg).
Jun 28 2021, 12:03 PM · System administration, Archive coverage, Origin-SourceForge

Jun 24 2021

ardumont changed the status of T3374: Ingest sourceforge repositories (origins of type git, svn, hg) from Open to Work in Progress.
Jun 24 2021, 4:48 PM · System administration, Archive coverage, Origin-SourceForge

Jun 23 2021

ardumont added a revision to T2345: Improve handling of recurrent loading tasks in scheduler: D5919: Start handling of recurrent loading tasks in scheduler.
Jun 23 2021, 6:11 PM · Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont added a revision to T2345: Improve handling of recurrent loading tasks in scheduler: D5914: backend: Auto-generate origin visit stats upsert query.
Jun 23 2021, 3:32 PM · Sprint 2021 01, Archive coverage, Scheduling utilities

Jun 22 2021

ardumont updated the task description for T3374: Ingest sourceforge repositories (origins of type git, svn, hg).
Jun 22 2021, 11:54 AM · System administration, Archive coverage, Origin-SourceForge

Jun 21 2021

borisbaldassari added a comment to T1724: Maven Central repository support.

Updates:

  • A ticket has been submitted in the Sonatype JIRA to let them know we will fetch maven poms and src jars soon.
  • An email has been sent on the maven-dev mailing list with a few kind answers, mainly stating to let Sonatype know through a JIRA issue.
  • Hervé Bouthemy provided some precious insights about the best way to use the poms; it seems we can get a near-complete list of maven repositories worldwide by parsing some pom arguments and following dependencies up. It should probably not be used directly by the lister (which should provide only the list of src jars and scm attributes to the loaders), but we can output it somewhere to feed the lister manually.
Jun 21 2021, 8:59 PM · Maven loader, Maven lister, GSoC 2019, Archive coverage
ardumont updated the task description for T2345: Improve handling of recurrent loading tasks in scheduler.
Jun 21 2021, 5:50 PM · Sprint 2021 01, Archive coverage, Scheduling utilities
olasd added a comment to T2345: Improve handling of recurrent loading tasks in scheduler.

Summary of the data available in the listed_origins table, broken down by lister and "known state" of origins:

Jun 21 2021, 2:27 PM · Sprint 2021 01, Archive coverage, Scheduling utilities

Jun 17 2021

ardumont updated the task description for T3374: Ingest sourceforge repositories (origins of type git, svn, hg).
Jun 17 2021, 10:28 AM · System administration, Archive coverage, Origin-SourceForge