Page MenuHomeSoftware Heritage
Feed All Stories

Oct 14 2021

ardumont added a parent task for T3635: git loader: enable "partial" global deduplication of revisions via the extid mapping table: T3653: Stabilize loader git.
Oct 14 2021, 10:37 AM · Git loader
ardumont added subtasks for T3653: Stabilize loader git: T3625: Reduce git loader memory footprint, T3635: git loader: enable "partial" global deduplication of revisions via the extid mapping table, T3640: Make long running task stop fast when warm shutdown is triggered.
Oct 14 2021, 10:37 AM · Git loader
ardumont renamed T1969: graph: reduce RAM usage for /walk from reduce RAM usage for /walk to graph: reduce RAM usage for /walk.
Oct 14 2021, 10:37 AM · Compressed graph service
ardumont triaged T3653: Stabilize loader git as Normal priority.
Oct 14 2021, 10:36 AM · Git loader
ardumont triaged T3652: Cannot ingest git repositories with (too) large packfiles as Normal priority.
Oct 14 2021, 10:29 AM · Git loader
vsellier added a comment to T3573: [cassandra] directory and content read benchmarks.

Some flame graphs of storage was performed during the ingestion with 50 workers in //

Oct 14 2021, 10:08 AM · System administration, Storage manager
zack committed rMSLD26cb7cb8692f: check in slides for talk at Telecom Paris, DIG team (authored by zack).
check in slides for talk at Telecom Paris, DIG team
Oct 14 2021, 10:04 AM
zack retitled D6470: Make it explicit that the "main" docs page is actually devel doc from Explicit the main docs page is actually the devel instance to Make it explicit that the "main" docs page is actually devel doc.
Oct 14 2021, 9:49 AM
ardumont requested review of D6470: Make it explicit that the "main" docs page is actually devel doc.
Oct 14 2021, 9:34 AM
ardumont requested review of D6469: Drop unneeded "Software Heritage" from page titles.
Oct 14 2021, 9:30 AM
ardumont added a revision to T3651: doc: drop unneeded "Software Heritage" from page titles: D6469: Drop unneeded "Software Heritage" from page titles.
Oct 14 2021, 9:30 AM · Easy hack, Documentation
moranegg added a subtask for T3649: Add access from one docs dimension to another: T3650: documentation: rename docs.s.o/users/ (plural) to docs.s.o/user/ (singular).
Oct 14 2021, 9:25 AM · Documentation
moranegg added a parent task for T3650: documentation: rename docs.s.o/users/ (plural) to docs.s.o/user/ (singular): T3649: Add access from one docs dimension to another.
Oct 14 2021, 9:25 AM · Documentation
moranegg added a comment to T3154: sysadm docs: Move relevant and public doc from intranet to swh-docs.

@olasd , @ardumont and @vsellier : I think that we can gently resolve this task.

Oct 14 2021, 9:24 AM · System administration, Documentation
ardumont updated the diff for D6467: Rename users documentation to user.

Add the redirection setup as well

Oct 14 2021, 9:24 AM
ardumont requested review of D6468: ci: Rename users documentation to user.
Oct 14 2021, 9:20 AM
ardumont added a revision to T3650: documentation: rename docs.s.o/users/ (plural) to docs.s.o/user/ (singular): D6468: ci: Rename users documentation to user.
Oct 14 2021, 9:20 AM · Documentation
ardumont added a revision to T3650: documentation: rename docs.s.o/users/ (plural) to docs.s.o/user/ (singular): D6467: Rename users documentation to user.
Oct 14 2021, 9:18 AM · Documentation
ardumont requested review of D6467: Rename users documentation to user.
Oct 14 2021, 9:18 AM

Oct 13 2021

zack triaged T3651: doc: drop unneeded "Software Heritage" from page titles as Low priority.
Oct 13 2021, 8:14 PM · Easy hack, Documentation
zack renamed T3650: documentation: rename docs.s.o/users/ (plural) to docs.s.o/user/ (singular) from documentation: rename docs.s.o/users/ (plural) to docs.s.o/user/ (singula) to documentation: rename docs.s.o/users/ (plural) to docs.s.o/user/ (singular).
Oct 13 2021, 8:11 PM · Documentation
zack triaged T3650: documentation: rename docs.s.o/users/ (plural) to docs.s.o/user/ (singular) as High priority.
Oct 13 2021, 8:11 PM · Documentation
zack added a comment to T3649: Add access from one docs dimension to another.

+1 on dropping the / -> /devel/ redirect and have at / a landing page allowing to choose between the 3 bodies of documentation.

Oct 13 2021, 8:09 PM · Documentation
vlorentz added a comment to T75: Check integrity of directories, revisions, and releases.

My script finished running on releases. Result: all 644k releases are recoverable (mostly just missing gpg signatures), except 75k whose origin does not exist anymore.

Oct 13 2021, 6:40 PM · Archive content, Restricted Project
ardumont accepted D6466: Proposal for network page.

Some suggestions to attend to prior to merge but otherwise, great ;)

Oct 13 2021, 6:24 PM
ardumont committed rSPSITEab408a55220d: worker17: Align with actual configuration (authored by ardumont).
worker17: Align with actual configuration
Oct 13 2021, 6:09 PM
ardumont committed rSPSITE7c7e6fe6bc1d: worker17: Increase the load warning (authored by ardumont).
worker17: Increase the load warning
Oct 13 2021, 6:09 PM
ardumont added a comment to P1200 `make -C docs clean html` failure to build the devel docs.

Yes, after a migration, the postgres version got upgraded.
It was missing the user role correctly configured in the db:

Oct 13 2021, 6:01 PM
ardumont committed rDDOC5dd615c57dc4: Fix document isn't included in any toctree warning (authored by ardumont).
Fix document isn't included in any toctree warning
Oct 13 2021, 5:40 PM
olasd added a comment to D6458: tests: Turn origin* hypothesis strategies into pytest fixtures.
In D6458#167771, @olasd wrote:

Yeah, sure, I don't have a problem with that.

Oct 13 2021, 5:34 PM
olasd added a comment to D6458: tests: Turn origin* hypothesis strategies into pytest fixtures.

@olasd Could you open a task, so anlambert can land this stack of diffs now before we discuss the next step?

Oct 13 2021, 5:31 PM
ardumont created P1200 `make -C docs clean html` failure to build the devel docs.
Oct 13 2021, 5:21 PM
swh-public-ci added a comment to D6463: tests: Turn remaining hypothesis strategies into pytest fixtures.

Build is green

Oct 13 2021, 5:09 PM
swh-public-ci added a comment to D6461: tests: Turn snapshot* hypothesis strategies into pytest fixtures.

Build is green

Oct 13 2021, 5:07 PM
vlorentz added inline comments to D6466: Proposal for network page.
Oct 13 2021, 5:07 PM
vlorentz committed rDSNIP2dbd41a55ab2: analyze_consistency_failures.py: add support for a fallback swh-graph instance (authored by vlorentz).
analyze_consistency_failures.py: add support for a fallback swh-graph instance
Oct 13 2021, 5:01 PM
vlorentz committed rDSNIP04b5bcd5af2a: analyze_consistency_failures.py: minor tweaks (authored by vlorentz).
analyze_consistency_failures.py: minor tweaks
Oct 13 2021, 5:01 PM
moranegg committed rDDOC46a33656368c: docs/devel: add tutorial to contribute docs (authored by moranegg).
docs/devel: add tutorial to contribute docs
Oct 13 2021, 4:58 PM
vsellier updated the summary of D6466: Proposal for network page.
Oct 13 2021, 4:56 PM
ardumont added a comment to T3649: Add access from one docs dimension to another.

Small discussion about a possible implementation of this:

16:49 <+olasd> ardumont: morane's point is not about having cross links to individual pages; it's about having an entry point and cross referencing the docs *instances* overall
16:49 <+olasd> intersphinx doesn't solve that
...
16:51 <+ardumont> for the main point, i recall we discussed a while back having an index page which would display the main doc instances (user, sysadm, devel)
16:51 <+moranegg> This is a good solution, if each page has a link to this parent page
16:51 <+ardumont> today we have a redirect from docs.s.o to docs.s.o/devel
16:52 <+ardumont> that may probably need to go away and have that main page instead
Oct 13 2021, 4:56 PM · Documentation
vsellier requested review of D6466: Proposal for network page.
Oct 13 2021, 4:55 PM
vsellier added a revision to T3154: sysadm docs: Move relevant and public doc from intranet to swh-docs: D6466: Proposal for network page.
Oct 13 2021, 4:55 PM · System administration, Documentation
anlambert updated the diff for D6463: tests: Turn remaining hypothesis strategies into pytest fixtures.

Rebase

Oct 13 2021, 4:53 PM
anlambert updated the diff for D6461: tests: Turn snapshot* hypothesis strategies into pytest fixtures.

Add missing tests_data parameter to snapshot_swhid fixture

Oct 13 2021, 4:51 PM
moranegg triaged T3649: Add access from one docs dimension to another as High priority.
Oct 13 2021, 4:33 PM · Documentation
anlambert added a comment to D6458: tests: Turn origin* hypothesis strategies into pytest fixtures.
In D6458#167702, @olasd wrote:

Thanks for working on reducing the number of hypothesis fixtures!

I'm a bit concerned about the reproducibility of test results, given fixtures that pull random list elements, with no control on the sequence of test executions and on the seed of the python random module when the fixture is called. (Now that I've looked at swh.web.tests.data, I'm even more concerned :-))

I don't have an answer about "what to use?", unfortunately, except just going for exhaustive tests (i.e. running the test functions for *all* values of the origins in the test data set), which doesn't sound very compelling unless the size of the sample dataset is small, which doesn't look to be the case.

https://github.com/pytest-dev/pytest/issues/5463 has some background about concerns with respect to random seeding in tests.

Apart from that, I see that some of the function-level fixtures are doing ""heavy"" querying on the test data for information that is, in effect, static (e.g. the list of origins with more than two visits, etc.). I wonder if it would be possible to extract this logic to only run it once on initialization of the test data?

I initially wrote: we may want to initialize a single, module scoped seed_storage fixture with all data inserted, and make the storage fixture used by tests a function-scoped fixture which would clone this seed storage instance - I assume some tests have to *write* to the storage, so you can't just have one global read only storage fixture - but I now see that's what swh.web.tests.data does. Maybe _init_tests_data could be turned into that seed_storage module-scoped pytest fixture, instead of the current ad-hoc logic? This would also help us control the random seed used for generating the test data (allowing us to override it to reproduce test results)?

Oct 13 2021, 4:10 PM
vsellier updated the test plan for D6465: sphinx: fix pip cache directory permissions.
Oct 13 2021, 3:41 PM
vsellier updated the test plan for D6465: sphinx: fix pip cache directory permissions.
Oct 13 2021, 3:41 PM
vsellier requested review of D6465: sphinx: fix pip cache directory permissions.
Oct 13 2021, 3:41 PM
vlorentz accepted D6461: tests: Turn snapshot* hypothesis strategies into pytest fixtures.
Oct 13 2021, 3:40 PM
vlorentz accepted D6460: tests: Turn revision* hypothesis strategies into pytest fixtures.
Oct 13 2021, 3:40 PM
vlorentz added a comment to D6458: tests: Turn origin* hypothesis strategies into pytest fixtures.

@olasd Could you open a task, so anlambert can land this stack of diffs now before we discuss the next step?

Oct 13 2021, 3:40 PM
douardda closed D6442: Extract the path slicing logic in a dedicated PathSlicer class.
Oct 13 2021, 3:19 PM
douardda committed rDOBJS23b7f81c1483: Extract the path slicing logic in a dedicated PathSlicer class (authored by douardda).
Extract the path slicing logic in a dedicated PathSlicer class
Oct 13 2021, 3:19 PM
vlorentz accepted D6463: tests: Turn remaining hypothesis strategies into pytest fixtures.
Oct 13 2021, 3:00 PM
ardumont committed rDDOC2dbdfdf521bd: sysadm/mirror-operations: Fix nonexisting document warning (authored by ardumont).
sysadm/mirror-operations: Fix nonexisting document warning
Oct 13 2021, 10:15 AM
ardumont committed rDDOCe61e144cad40: sysadm: Fix toctree contains reference to nonexisting document (authored by ardumont).
sysadm: Fix toctree contains reference to nonexisting document
Oct 13 2021, 10:15 AM
ardumont closed D6464: sysadm: Fix remaining warning on sysadm docs.
Oct 13 2021, 10:14 AM
ardumont committed rDDOCe96db5c48130: sysadm: Fix undefined label warning (authored by ardumont).
sysadm: Fix undefined label warning
Oct 13 2021, 10:14 AM
olasd accepted D6464: sysadm: Fix remaining warning on sysadm docs.
Oct 13 2021, 10:14 AM
ardumont requested review of D6464: sysadm: Fix remaining warning on sysadm docs.
Oct 13 2021, 10:12 AM
ardumont added a revision to T3648: Fix swh-docs's broken dev build: D6464: sysadm: Fix remaining warning on sysadm docs.
Oct 13 2021, 10:12 AM · Documentation

Oct 12 2021

borisbaldassari requested review of D6395: lister: Add new maven lister.
Oct 12 2021, 11:05 PM
olasd added a comment to D6458: tests: Turn origin* hypothesis strategies into pytest fixtures.

Thanks for working on reducing the number of hypothesis fixtures!

Oct 12 2021, 6:46 PM
vsellier added a comment to T3577: Parallel loaders performances .

Some runs with the fix:
It globally improves the stability of the benchmark by reducing the timeouts.

Oct 12 2021, 6:27 PM · System administration, Storage manager
vsellier closed T3407: Upgrade sphinx docker image to use a more recent version of plantuml as Resolved.
Oct 12 2021, 5:59 PM · System administration, Documentation
vsellier closed D6462: sphinx: update the plantuml version installed by the debian package.
Oct 12 2021, 5:59 PM
vsellier committed rCDFJ727112cbfeac: sphinx: update the plantuml version installed by the debian package (authored by vsellier).
sphinx: update the plantuml version installed by the debian package
Oct 12 2021, 5:59 PM
swh-public-ci added a comment to D6442: Extract the path slicing logic in a dedicated PathSlicer class.

Build is green

Oct 12 2021, 5:54 PM
douardda updated the diff for D6442: Extract the path slicing logic in a dedicated PathSlicer class.

forgotten print statement...

Oct 12 2021, 5:52 PM
douardda added inline comments to D6442: Extract the path slicing logic in a dedicated PathSlicer class.
Oct 12 2021, 5:50 PM
anlambert requested review of D6463: tests: Turn remaining hypothesis strategies into pytest fixtures.
Oct 12 2021, 5:43 PM
olasd accepted D6462: sphinx: update the plantuml version installed by the debian package.
Oct 12 2021, 5:43 PM
vsellier updated the diff for D6462: sphinx: update the plantuml version installed by the debian package.

Remove an unnecessary linefeed

Oct 12 2021, 5:34 PM
anlambert added a revision to T3603: Replace stateful hypothesis strategies by pytest fixtures: D6463: tests: Turn remaining hypothesis strategies into pytest fixtures.
Oct 12 2021, 5:25 PM · Web app
vsellier added a revision to T3407: Upgrade sphinx docker image to use a more recent version of plantuml: D6462: sphinx: update the plantuml version installed by the debian package.
Oct 12 2021, 5:14 PM · System administration, Documentation
vsellier requested review of D6462: sphinx: update the plantuml version installed by the debian package.
Oct 12 2021, 5:14 PM
olasd committed rDDOC8efe43320d21: sysadm: add stub to the data silo pages (authored by olasd).
sysadm: add stub to the data silo pages
Oct 12 2021, 5:10 PM
olasd committed rDDOCb1d41f68fba1: sysadm: Add some meat to the PostgreSQL section (authored by olasd).
sysadm: Add some meat to the PostgreSQL section
Oct 12 2021, 5:10 PM
vsellier changed the status of T3407: Upgrade sphinx docker image to use a more recent version of plantuml from Open to Work in Progress.
Oct 12 2021, 5:04 PM · System administration, Documentation
swh-public-ci added a comment to D6424: Perfect hashmap C implementation.

Build is green

Oct 12 2021, 4:56 PM
dachary updated the diff for D6424: Perfect hashmap C implementation.

create and lookup a Read Shard with a perfect hash

Oct 12 2021, 4:55 PM
douardda committed rDDOC807d63991a8e: sysadm: fill the mirror deployment section (authored by douardda).
sysadm: fill the mirror deployment section
Oct 12 2021, 4:53 PM
dachary added a comment to D6424: Perfect hashmap C implementation.

@ardumont @olasd this is ready for review... I think :-D

Oct 12 2021, 4:49 PM
Harbormaster failed remote builds in B24390: Diff 23473 for D6424: Perfect hashmap C implementation!
Oct 12 2021, 4:48 PM
swh-public-ci added a comment to D6424: Perfect hashmap C implementation.

Build has FAILED

Oct 12 2021, 4:48 PM
dachary updated the diff for D6424: Perfect hashmap C implementation.

create and lookup a Read Shard with a perfect hash

Oct 12 2021, 4:48 PM
olasd committed rDDOC85752ba1f728: Make sure `make clean` cleans images too (authored by olasd).
Make sure `make clean` cleans images too
Oct 12 2021, 4:17 PM
olasd committed rDDOCab66ccb0b6b1: Add stubs for data silos (authored by olasd).
Add stubs for data silos
Oct 12 2021, 4:17 PM
anlambert requested review of D6461: tests: Turn snapshot* hypothesis strategies into pytest fixtures.
Oct 12 2021, 4:08 PM
zack created P1199 Command-Line Input.
Oct 12 2021, 4:03 PM
ardumont committed rDDOC41baba1d92e2: sysadm: Add software deployment overview for staging environment (authored by ardumont).
sysadm: Add software deployment overview for staging environment
Oct 12 2021, 4:01 PM
anlambert requested review of D6460: tests: Turn revision* hypothesis strategies into pytest fixtures.
Oct 12 2021, 3:57 PM
anlambert added a revision to T3603: Replace stateful hypothesis strategies by pytest fixtures: D6461: tests: Turn snapshot* hypothesis strategies into pytest fixtures.
Oct 12 2021, 3:53 PM · Web app
anlambert added a revision to T3603: Replace stateful hypothesis strategies by pytest fixtures: D6460: tests: Turn revision* hypothesis strategies into pytest fixtures.
Oct 12 2021, 3:27 PM · Web app
olasd committed rCJSWHcb5bb02aa0e4: swh-docs/dev: publish all built docs in jenkins (authored by olasd).
swh-docs/dev: publish all built docs in jenkins
Oct 12 2021, 3:19 PM
vsellier committed rDDOC31a362fe33b9: sysadm/life-cycle: complete the tools life-cycle page (authored by vsellier).
sysadm/life-cycle: complete the tools life-cycle page
Oct 12 2021, 3:15 PM
Harbormaster failed to build B24387: rDLS5bba1a783a85: Let sourceforge origins be enabled by default for rDLS5bba1a783a85: Let sourceforge origins be enabled by default!
Oct 12 2021, 3:10 PM
ardumont closed D6450: Let sourceforge origins be enabled by default.
Oct 12 2021, 3:07 PM
ardumont committed rDLS5bba1a783a85: Let sourceforge origins be enabled by default (authored by ardumont).
Let sourceforge origins be enabled by default
Oct 12 2021, 3:07 PM