In D6570#171059, @douardda wrote:In D6570#170758, @vlorentz wrote:You checked on the whole journal, right?
not yet. The shouldn't be any left, but... I'll check as soon as I have credentials for the production kafka cluster.
- Queries
- All Stories
- Search
- Advanced Search
- Transactions
- Transaction Logs
Feed Advanced Search
Advanced Search
Advanced Search
Oct 29 2021
Oct 29 2021
Oct 28 2021
Oct 28 2021
douardda updated the diff for D6571: Add support for a redis-based reporting for invalid mirrorred objects.
Bump the dependency on swh-journal to 0.9
douardda committed rDJNL79d28b51ae08: Make the test_client_with_deserializer reliable (authored by douardda).
Make the test_client_with_deserializer reliable
In D6570#170941, @olasd wrote:While we're here, could you check whether the other fixers are really still needed too?
In D6570#170758, @vlorentz wrote:You checked on the whole journal, right?
douardda committed rDJNLf92d4acf30e6: Pass the object_type to JournalClient.value_serializer() (authored by douardda).
Pass the object_type to JournalClient.value_serializer()
douardda committed rDJNL88054da89653: Do call consumer.commit() even if not objects have been received (authored by douardda).
Do call consumer.commit() even if not objects have been received
^x^s...
typo
Overall it looks very good to me. There is room for improvement, for sure, but let's put this to work and see how it performs.
Document a bit more the value_deserializer and add a test for it
douardda requested review of D6571: Add support for a redis-based reporting for invalid mirrorred objects.
asking for review even if tests are expected to fail because it depends on D6565 (in swh-journal)
douardda added a comment to D6564: Do call consumer.commit() even if not objects have been received.
In D6564#170522, @ardumont wrote:why if i may ask?
Oct 27 2021
Oct 27 2021
douardda requested review of D6564: Do call consumer.commit() even if not objects have been received.
A bit sad, but hey, thanks!
Let's consider this is done since 2 of 3 bullets have been canceled (for now)
Oct 26 2021
Oct 26 2021
douardda triaged T3693: Provide a mecanism to report (with persistence) objects that fails to get replayed (mirror) as High priority.
Oct 22 2021
Oct 22 2021
Oct 21 2021
Oct 21 2021
Add a simple read-only HTTP backend
douardda committed rDOBJSbcbbfd466987: Reorganise the seaweedfs backend in a subpackage (authored by douardda).
Reorganise the seaweedfs backend in a subpackage
douardda committed rDOBJS38c02dcfae2b: Add support for deprecation of objstorage cls in factory (authored by douardda).
Add support for deprecation of objstorage cls in factory
douardda committed rDOBJS82d9714b0ae5: Use get_objstorage in seaweedfs tests instead of direct class (authored by douardda).
Use get_objstorage in seaweedfs tests instead of direct class
and fix the LOGGER.error usage
remove hardcoded log levels
Use ReadOnlyObjStorage and NonIterableObjStorage instead of NotImplementedError
Oct 20 2021
Oct 20 2021
document the build_objstorage() test helper function
remove useless statement
remove mistakenly commited mypy.ini file
Improve a bit the seaweedfs backend
douardda committed rDOBJS55ff4b95d306: Improve tests of the seaweedfs backend (authored by douardda).
Improve tests of the seaweedfs backend
rebase
remove a (comment) garbage line
rebase
slight simplification as suggested by vlorentz
respawn jenkins
respawn jenkins
rebase
split the diff in 2
Oct 19 2021
Oct 19 2021
Oct 18 2021
Oct 18 2021
douardda added a comment to T3627: Consider dropping pull request references from the git loader ingestion.
B3 I am not convinced a "synthetic" flag on the Snapshot branch makes sense, or at least I find this name confusing, especially considering we already have a synthetic flag on Revision: it's not synthetic in the sense of it's not object crafted by SWH, it comes from the origin.
Oct 15 2021
Oct 15 2021
douardda triaged T3663: Make the swh-environment jenkins job green and activate notifications as High priority.
Oct 14 2021
Oct 14 2021
douardda added a comment to T3635: git loader: enable "partial" global deduplication of revisions via the extid mapping table.
Ok I think what puzzle me in this description is the fact the 2 first bullets of the "git loader adaptations" are actually only one point: at the end of a successful loading, store a mapping in the extid table.
Oct 13 2021
Oct 13 2021
douardda committed rDOBJS23b7f81c1483: Extract the path slicing logic in a dedicated PathSlicer class (authored by douardda).
Extract the path slicing logic in a dedicated PathSlicer class
Oct 12 2021
Oct 12 2021
douardda updated the diff for D6442: Extract the path slicing logic in a dedicated PathSlicer class.
forgotten print statement...
douardda added inline comments to D6442: Extract the path slicing logic in a dedicated PathSlicer class.
douardda committed rDDOC807d63991a8e: sysadm: fill the mirror deployment section (authored by douardda).
sysadm: fill the mirror deployment section
douardda committed rDDOCfefabca8e6d3: conf: add swh-sysadm intershpinx mapping entry (authored by douardda).
conf: add swh-sysadm intershpinx mapping entry
douardda committed rDDOCe6ebb39c4b6d: sysadm: add mirror-operations without content (authored by douardda).
sysadm: add mirror-operations without content
Oct 11 2021
Oct 11 2021
douardda added a comment to T3627: Consider dropping pull request references from the git loader ingestion.
In T3627#71819, @vlorentz wrote:An alternative to annotating synthetic refs: add a "type" or "forge_type" attribute to snapshots.
What's the difference in deployed dependencies versions (staging vs. prod)?
For ENEA I'd llike to test different scenarios for the source objstorage:
just a quick remark about the scheduling of (sub)tasks of this task: IMHO the autoscaling should come last; all the supervision/monitoring/logging related tasks are much more important than the autoscaling.
Oct 8 2021
Oct 8 2021
douardda updated the diff for D6442: Extract the path slicing logic in a dedicated PathSlicer class.
Better docstrings and kill a few map()
douardda added inline comments to D6442: Extract the path slicing logic in a dedicated PathSlicer class.
douardda committed rDENVeefd5e532124: docker: configure and document the APP evironment variable for celery (authored by douardda).
docker: configure and document the APP evironment variable for celery
Thx
douardda updated the diff for D6444: docker: configure and document the APP evironment variable for celery.
be a bit more consistent...
In D6410#167121, @douardda wrote:In D6410#166398, @ardumont wrote:as @vlorentz pointed out [1], this change should be irrelevant though...
[1] https://github.com/celery/kombu/blob/master/kombu/serialization.py#L369-L372
does not seem to be the proper fix.
FTR, using the celery cli tool directly from a development venv to interact with the celery server running in the docker compose test setup (as described there ) used to work ok, but not any more.
One have to specify the app, like:
celery --app=swh.scheduler.celery_backend.config.app status[edit] I use celery 5.1.2 in my venv.
douardda requested review of D6444: docker: configure and document the APP evironment variable for celery.
Unless I'm mistaken, this error does not appear in sentry any more, right?
In D6410#166398, @ardumont wrote:as @vlorentz pointed out [1], this change should be irrelevant though...
[1] https://github.com/celery/kombu/blob/master/kombu/serialization.py#L369-L372
Ok but see my 2 (nitpicky) comments
LGTM thx