
Oct 27 2021

douardda requested review of D6565: Pass the object_type to JournalClient.value_serializer().
Oct 27 2021, 4:23 PM
douardda requested review of D6564: Do call consumer.commit() even if not objects have been received.
Oct 27 2021, 4:22 PM
douardda added a revision to T3693: Provide a mecanism to report (with persistence) objects that fails to get replayed (mirror): D6565: Pass the object_type to JournalClient.value_serializer().
Oct 27 2021, 4:19 PM · Storage manager
douardda accepted D6563: grab_next_visits: avoid time interval calculations in PostgreSQL.

A bit sad, but hey, thanks!

Oct 27 2021, 4:03 PM
douardda closed T3668: Improve the seaweedfs backend as Resolved.

Let's consider this done, since 2 of the 3 bullets have been canceled (for now).

Oct 27 2021, 10:20 AM · Object storage
douardda updated the task description for T3668: Improve the seaweedfs backend.
Oct 27 2021, 10:18 AM · Object storage

Oct 26 2021

douardda requested review of D6554: [WIP] Add a (redis-based) validation error reporting facility.
Oct 26 2021, 5:56 PM
douardda added a revision to T3693: Provide a mecanism to report (with persistence) objects that fails to get replayed (mirror): D6554: [WIP] Add a (redis-based) validation error reporting facility.
Oct 26 2021, 5:48 PM · Storage manager
douardda triaged T3693: Provide a mecanism to report (with persistence) objects that fails to get replayed (mirror) as High priority.
Oct 26 2021, 5:41 PM · Storage manager

Oct 22 2021

douardda requested review of D6538: Remove the RADOS backend.
Oct 22 2021, 12:26 PM

Oct 21 2021

douardda closed D6521: Add a simple read-only HTTP backend.
Oct 21 2021, 2:23 PM
douardda committed rDOBJS8ed5f4ebc915: Add a simple read-only HTTP backend (authored by douardda).
Add a simple read-only HTTP backend
Oct 21 2021, 2:23 PM
douardda updated the diff for D6521: Add a simple read-only HTTP backend.

rebase

Oct 21 2021, 2:09 PM
douardda closed D6526: Reorganise the seaweedfs backend in a subpackage.
Oct 21 2021, 2:06 PM
douardda committed rDOBJSbcbbfd466987: Reorganise the seaweedfs backend in a subpackage (authored by douardda).
Reorganise the seaweedfs backend in a subpackage
Oct 21 2021, 2:06 PM
douardda closed D6525: Use get_objstorage in seaweedfs tests instead of direct class instanciation.
Oct 21 2021, 2:06 PM
douardda closed D6524: Add support for deprecation of objstorage cls in factory.
Oct 21 2021, 2:06 PM
douardda committed rDOBJS38c02dcfae2b: Add support for deprecation of objstorage cls in factory (authored by douardda).
Add support for deprecation of objstorage cls in factory
Oct 21 2021, 2:06 PM
douardda committed rDOBJS82d9714b0ae5: Use get_objstorage in seaweedfs tests instead of direct class (authored by douardda).
Use get_objstorage in seaweedfs tests instead of direct class
Oct 21 2021, 2:06 PM
douardda updated the diff for D6526: Reorganise the seaweedfs backend in a subpackage.

and fix the LOGGER.error usage

Oct 21 2021, 2:03 PM
douardda updated the diff for D6526: Reorganise the seaweedfs backend in a subpackage.

remove hardcoded log levels

Oct 21 2021, 1:53 PM
douardda added inline comments to D6526: Reorganise the seaweedfs backend in a subpackage.
Oct 21 2021, 1:49 PM
douardda requested review of D6526: Reorganise the seaweedfs backend in a subpackage.
Oct 21 2021, 1:09 PM
douardda requested review of D6525: Use get_objstorage in seaweedfs tests instead of direct class instanciation.
Oct 21 2021, 1:09 PM
douardda requested review of D6524: Add support for deprecation of objstorage cls in factory.
Oct 21 2021, 1:08 PM
douardda updated the diff for D6521: Add a simple read-only HTTP backend.

Use ReadOnlyObjStorage and NonIterableObjStorage instead of NotImplementedError

Oct 21 2021, 12:50 PM
douardda added inline comments to D6521: Add a simple read-only HTTP backend.
Oct 21 2021, 12:24 PM

Oct 20 2021

douardda added inline comments to D6424: Perfect hashmap C implementation.
Oct 20 2021, 5:17 PM
douardda updated the diff for D6521: Add a simple read-only HTTP backend.

document the build_objstorage() test helper function

Oct 20 2021, 5:00 PM
douardda updated the diff for D6521: Add a simple read-only HTTP backend.

remove useless statement

Oct 20 2021, 4:55 PM
douardda updated the diff for D6521: Add a simple read-only HTTP backend.

remove mistakenly committed mypy.ini file

Oct 20 2021, 4:53 PM
douardda requested review of D6521: Add a simple read-only HTTP backend.
Oct 20 2021, 4:53 PM
douardda committed rDOBJS6269067ca7b3: Improve a bit the seaweedfs backend (authored by douardda).
Improve a bit the seaweedfs backend
Oct 20 2021, 2:40 PM
douardda closed D6517: Improve tests of the seaweedfs backend.
Oct 20 2021, 2:40 PM
douardda committed rDOBJS55ff4b95d306: Improve tests of the seaweedfs backend (authored by douardda).
Improve tests of the seaweedfs backend
Oct 20 2021, 2:40 PM
douardda updated the diff for D6492: Add support for pathslicing in seaweedfs backend.

rebase

Oct 20 2021, 2:36 PM
douardda updated the diff for D6517: Improve tests of the seaweedfs backend.

remove a (comment) garbage line

Oct 20 2021, 2:35 PM
douardda updated the diff for D6492: Add support for pathslicing in seaweedfs backend.

rebase

Oct 20 2021, 2:31 PM
douardda updated the diff for D6517: Improve tests of the seaweedfs backend.

slight simplification as suggested by vlorentz

Oct 20 2021, 2:30 PM
douardda added inline comments to D6517: Improve tests of the seaweedfs backend.
Oct 20 2021, 2:17 PM
douardda updated the diff for D6492: Add support for pathslicing in seaweedfs backend.

respawn jenkins

Oct 20 2021, 2:15 PM
douardda updated the diff for D6517: Improve tests of the seaweedfs backend.

respawn jenkins

Oct 20 2021, 2:13 PM
douardda requested review of D6517: Improve tests of the seaweedfs backend.
Oct 20 2021, 2:08 PM
douardda updated the diff for D6492: Add support for pathslicing in seaweedfs backend.

rebase

Oct 20 2021, 12:34 PM
douardda added inline comments to D6492: Add support for pathslicing in seaweedfs backend.
Oct 20 2021, 12:23 PM
douardda updated the diff for D6492: Add support for pathslicing in seaweedfs backend.

split the diff in 2

Oct 20 2021, 12:14 PM

Oct 19 2021

douardda created P1203 (An Untitled Masterwork).
Oct 19 2021, 11:43 AM

Oct 18 2021

douardda triaged T3668: Improve the seaweedfs backend as Normal priority.
Oct 18 2021, 3:34 PM · Object storage
douardda created T3668: Improve the seaweedfs backend.
Oct 18 2021, 3:33 PM · Object storage
douardda created P1202 (An Untitled Masterwork).
Oct 18 2021, 2:47 PM
douardda added a comment to T3627: Consider dropping pull request references from the git loader ingestion.

B3 I am not convinced a "synthetic" flag on the Snapshot branch makes sense, or at least I find this name confusing, especially considering we already have a synthetic flag on Revision: such a branch is not synthetic in that sense, since it is not an object crafted by SWH but comes from the origin.

Oct 18 2021, 11:59 AM · Git loader

Oct 15 2021

douardda requested review of D6492: Add support for pathslicing in seaweedfs backend.
Oct 15 2021, 6:35 PM
douardda triaged T3663: Make the swh-environment jenkins job green and activate notifications as High priority.
Oct 15 2021, 10:45 AM · System administration

Oct 14 2021

douardda added a comment to T3635: git loader: enable "partial" global deduplication of revisions via the extid mapping table.

Ok, I think what puzzles me in this description is the fact that the first 2 bullets of the "git loader adaptations" are actually only one point: at the end of a successful loading, store a mapping in the extid table.

Oct 14 2021, 11:23 AM · Git loader

Oct 13 2021

douardda closed D6442: Extract the path slicing logic in a dedicated PathSlicer class.
Oct 13 2021, 3:19 PM
douardda committed rDOBJS23b7f81c1483: Extract the path slicing logic in a dedicated PathSlicer class (authored by douardda).
Extract the path slicing logic in a dedicated PathSlicer class
Oct 13 2021, 3:19 PM

Oct 12 2021

douardda updated the diff for D6442: Extract the path slicing logic in a dedicated PathSlicer class.

forgotten print statement...

Oct 12 2021, 5:52 PM
douardda added inline comments to D6442: Extract the path slicing logic in a dedicated PathSlicer class.
Oct 12 2021, 5:50 PM
douardda committed rDDOC807d63991a8e: sysadm: fill the mirror deployment section (authored by douardda).
sysadm: fill the mirror deployment section
Oct 12 2021, 4:53 PM
douardda committed rDDOCfefabca8e6d3: conf: add swh-sysadm intershpinx mapping entry (authored by douardda).
conf: add swh-sysadm intershpinx mapping entry
Oct 12 2021, 2:27 PM
douardda committed rDDOCe6ebb39c4b6d: sysadm: add mirror-operations without content (authored by douardda).
sysadm: add mirror-operations without content
Oct 12 2021, 2:07 PM

Oct 11 2021

douardda added a comment to T3627: Consider dropping pull request references from the git loader ingestion.

An alternative to annotating synthetic refs: add a "type" or "forge_type" attribute to snapshots.

Oct 11 2021, 12:33 PM · Git loader
douardda added a comment to T3632: Investigate the ContentDisallowed exception.

What's the difference in deployed dependencies versions (staging vs. prod)?

Oct 11 2021, 12:15 PM · Scheduling utilities
douardda added a comment to T3621: Create a production read-only objstorage.

For ENEA I'd like to test different scenarios for the source objstorage:

Oct 11 2021, 12:12 PM · System administration
douardda added a comment to T3592: POC elastic worker infrastructure.

just a quick remark about the scheduling of (sub)tasks of this task: IMHO the autoscaling should come last; all the supervision/monitoring/logging related tasks are much more important than the autoscaling.

Oct 11 2021, 10:29 AM · System administration

Oct 8 2021

douardda updated the diff for D6442: Extract the path slicing logic in a dedicated PathSlicer class.

Better docstrings and kill a few map()

Oct 8 2021, 4:19 PM
douardda added inline comments to D6442: Extract the path slicing logic in a dedicated PathSlicer class.
Oct 8 2021, 3:53 PM
douardda closed D6444: docker: configure and document the APP evironment variable for celery.
Oct 8 2021, 3:51 PM
douardda committed rDENVeefd5e532124: docker: configure and document the APP evironment variable for celery (authored by douardda).
docker: configure and document the APP evironment variable for celery
Oct 8 2021, 3:51 PM
douardda accepted D6443: buffer: add a threshold for the number of directory entries in one batch.

Thx

Oct 8 2021, 3:46 PM
douardda updated the diff for D6444: docker: configure and document the APP evironment variable for celery.

be a bit more consistent...

Oct 8 2021, 3:29 PM
douardda added a comment to D6410: Allow application/x-msgpack deserialization again.

as @vlorentz pointed out [1], this change should be irrelevant though...

[1] https://github.com/celery/kombu/blob/master/kombu/serialization.py#L369-L372

does not seem to be the proper fix.

FTR, using the celery CLI tool directly from a development venv to interact with the celery server running in the docker compose test setup (as described there) used to work fine, but no longer does.

One has to specify the app, like:

celery --app=swh.scheduler.celery_backend.config.app status

[edit] I use celery 5.1.2 in my venv.

Oct 8 2021, 3:27 PM
douardda requested review of D6444: docker: configure and document the APP evironment variable for celery.
Oct 8 2021, 3:25 PM
douardda added a comment to T3632: Investigate the ContentDisallowed exception.

Unless I'm mistaken, this error does not appear in sentry any more, right?

Oct 8 2021, 3:06 PM · Scheduling utilities
douardda added a comment to D6410: Allow application/x-msgpack deserialization again.

as @vlorentz pointed out [1], this change should be irrelevant though...

[1] https://github.com/celery/kombu/blob/master/kombu/serialization.py#L369-L372

Oct 8 2021, 3:04 PM
douardda accepted D6427: swh.storage filter/buffer improvements.

looks fine to me

Oct 8 2021, 2:53 PM
douardda accepted D6428: docs: Add a save forge documentation.

Ok but see my 2 (nitpicky) comments

Oct 8 2021, 2:49 PM
douardda accepted D6431: Rename imports of swh.model.identifiers to fix deprecation warnings..

LGTM thx

Oct 8 2021, 2:46 PM
douardda added a comment to T3104: Persistent readonly perfect hash table.

[FTR] See https://forge.softwareheritage.org/source/swh-perfecthash/

Oct 8 2021, 2:43 PM · Object storage (RedHat collaboration)
douardda updated the diff for D6442: Extract the path slicing logic in a dedicated PathSlicer class.

allow the pathslicer to be a noop (with an empty slicing)

Oct 8 2021, 2:12 PM
douardda requested review of D6442: Extract the path slicing logic in a dedicated PathSlicer class.
Oct 8 2021, 2:10 PM

Oct 7 2021

douardda accepted D6401: Filter out pull request related branches.

LGTM

Oct 7 2021, 9:32 AM

Oct 6 2021

douardda added a comment to T3627: Consider dropping pull request references from the git loader ingestion.

FTR, without D6401, the packfile received from GH for the CocoaPods/Specs repo contains 21162 references, 21146 of which start with /refs/pull/ and 7126 of which end with /merge (even though those have explicitly not been asked for, thanks to the filtering in RepoRepresentation.determine_wanted()).
When D6401 is applied, we only get the 20-ish references that are not pull request related.
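
As an illustration of the kind of filtering discussed here, a minimal sketch follows. It is not the code from D6401: the helper names (is_pull_request_ref, drop_pull_request_refs) and the sample refs are hypothetical, and it assumes ref names are byte strings in the usual refs/pull/<number>/head and refs/pull/<number>/merge layout.

```python
from typing import Dict

# Assumption: pull request refs live under this namespace (GitHub-style layout).
PULL_REQUEST_PREFIX = b"refs/pull/"


def is_pull_request_ref(ref_name: bytes) -> bool:
    """Return True if the ref belongs to the pull request namespace."""
    return ref_name.startswith(PULL_REQUEST_PREFIX)


def drop_pull_request_refs(refs: Dict[bytes, bytes]) -> Dict[bytes, bytes]:
    """Return a copy of the ref mapping without pull request related refs."""
    return {
        name: target
        for name, target in refs.items()
        if not is_pull_request_ref(name)
    }


# Hypothetical sample of advertised refs (name -> hex object id).
refs = {
    b"refs/heads/master": b"a" * 40,
    b"refs/tags/v1.0": b"b" * 40,
    b"refs/pull/1/head": b"c" * 40,
    b"refs/pull/1/merge": b"d" * 40,
}

kept = drop_pull_request_refs(refs)
print(sorted(kept))
```

With a ref list like the one described above, only the handful of branch and tag refs would remain after such a filter.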

Oct 6 2021, 2:56 PM · Git loader

Oct 5 2021

douardda committed rMSLDfef6e8ca5b60: EOSC-Pillar F2F meeting: presentation of the UC6.4 (authored by douardda).
EOSC-Pillar F2F meeting: presentation of the UC6.4
Oct 5 2021, 5:03 PM
douardda added a comment to T3633: staging/production - Kafka access for ENEA mirror.

token for the prod will be needed after that as well, thanks

Oct 5 2021, 3:40 PM · System administration
douardda committed rCDFPe00b10ea28c8: Fix memcache config in web.yml (authored by douardda).
Fix memcache config in web.yml
Oct 5 2021, 2:32 PM
douardda committed rCDFP44d8b4cad1ed: Fix replayers' entrypoint script (authored by douardda).
Fix replayers' entrypoint script
Oct 5 2021, 2:32 PM
douardda committed rCDFP1831360b6c84: Improve posgresql config, especially for swh-web (authored by douardda).
Improve posgresql config, especially for swh-web
Oct 5 2021, 2:32 PM
douardda committed rCDFP5de6a2ec92ea: Dockerfile: install postgresql-client in swh-web image (authored by douardda).
Dockerfile: install postgresql-client in swh-web image
Oct 5 2021, 2:32 PM
douardda committed rCDFPb466ad7a743f: Improve nginx config (authored by douardda).
Improve nginx config
Oct 5 2021, 2:32 PM
douardda committed rCDFPc36f34d1e137: Add support for postgresql as swh-web database (authored by douardda).
Add support for postgresql as swh-web database
Oct 5 2021, 2:32 PM
douardda committed rCDFPf1cf061a3177: Add explicit rw and Z to volume definitions (authored by Jonas Eriksson <jonas.eriksson@fossid.com>).
Add explicit rw and Z to volume definitions
Oct 5 2021, 2:32 PM
douardda committed rCDFP0fb09c414448: Storage conf: Point to correct objstorage port (authored by Jonas Eriksson <jonas.eriksson@fossid.com>).
Storage conf: Point to correct objstorage port
Oct 5 2021, 2:32 PM
douardda closed D6403: docker: use a dedicated container for the cron-like job of swh-web.
Oct 5 2021, 10:56 AM
douardda committed rDENVaf0a2af3e7c8: docker: use a dedicated container for the cron-like job of swh-web (authored by douardda).
docker: use a dedicated container for the cron-like job of swh-web
Oct 5 2021, 10:56 AM
douardda closed D6402: docker: Do not limit the list of task types handled by swh-scheduler-runner-priority.
Oct 5 2021, 10:56 AM
douardda committed rDENVebb07bdae059: docker: Do not limit the list of task types handled by swh-scheduler-runner… (authored by douardda).
docker: Do not limit the list of task types handled by swh-scheduler-runner…
Oct 5 2021, 10:56 AM
douardda added a comment to D6165: Add new RabbitMQ-based client/server API.

Also there is no real value in keeping 3 revisions: the last 2 revisions actually improve/modify the code from the first revision.

Oct 5 2021, 10:47 AM
douardda added a comment to D6339: Add support for remote backend on existing storage tests.

this should be squashed with the previous diff, and my previous question about .gitignore still stands

Oct 5 2021, 10:45 AM
douardda accepted D6165: Add new RabbitMQ-based client/server API.

As others (and I) said, this must come with actual documentation.
As is, I have a hard time understanding how this actually works (even after reading the document in HedgeDoc).

Oct 5 2021, 10:39 AM