Mar 29 2022

douardda requested review of D7463: Delay the unsubscribe to the end of handle_messages.
Mar 29 2022, 5:47 PM
douardda requested review of D7462: Move exporter config entries in dedicated sections.
Mar 29 2022, 5:46 PM
douardda closed D7388: Export revision extra headers in a dedicated ORC file.
Mar 29 2022, 5:46 PM
douardda closed D7387: Add the type fields for revision and origin_visit_status ORC table.
Mar 29 2022, 5:46 PM
douardda closed D7389: Add the raw_manifest column for revision, release and directory ORC files.
Mar 29 2022, 5:46 PM
douardda committed rDDATASET45c8124b7a31: Add the type fields for revision and origin_visit_status ORC table (authored by douardda).
Mar 29 2022, 5:46 PM
douardda committed rDDATASET5c652bb058e2: Export revision extra headers in a dedicated ORC file (authored by douardda).
Mar 29 2022, 5:46 PM
douardda committed rDDATASETfd3f9aa61de3: Add the raw_manifest column for revision, release and directory ORC files (authored by douardda).
Mar 29 2022, 5:46 PM
douardda requested review of D7461: Add support for limited row numbers in ORC files.
Mar 29 2022, 5:46 PM
douardda updated the diff for D7389: Add the raw_manifest column for revision, release and directory ORC files.

rebase

Mar 29 2022, 5:41 PM
douardda updated the diff for D7388: Export revision extra headers in a dedicated ORC file.

rebase

Mar 29 2022, 5:40 PM
douardda updated the diff for D7387: Add the type fields for revision and origin_visit_status ORC table.

rebase

Mar 29 2022, 5:40 PM
douardda closed D7385: Write related ORC files in the same directory using the same UUID.
Mar 29 2022, 5:39 PM
douardda committed rDDATASET5a8a8a7847f6: Write related ORC files in the same directory using the same UUID (authored by douardda).
Mar 29 2022, 5:39 PM
douardda updated the diff for D7385: Write related ORC files in the same directory using the same UUID.

forgot vlorentz' comment...

Mar 29 2022, 5:36 PM
douardda closed D7384: Add some user metadata in generated ORC files.
Mar 29 2022, 5:25 PM
douardda committed rDDATASET729ae64f36cd: Add some user metadata in generated ORC files (authored by douardda).
Mar 29 2022, 5:25 PM
douardda closed D7383: Implement test_orc exporter as a simple function instead of a fixture.
Mar 29 2022, 5:24 PM
douardda committed rDDATASET2298fb342280: Implement test_orc exporter as a simple function instead of a fixture (authored by douardda).
Mar 29 2022, 5:24 PM
douardda closed D7382: Make the kafka group_id prefix configurable in the config file.
Mar 29 2022, 5:24 PM
douardda committed rDDATASET68899901c7e5: Make the kafka group_id prefix configurable in the config file (authored by douardda).
Mar 29 2022, 5:24 PM
douardda closed D7381: Use a named logger for journalprocessor.py.
Mar 29 2022, 5:24 PM
douardda committed rDDATASET769b6a77d250: Use a named logger for journalprocessor.py (authored by douardda).
Mar 29 2022, 5:24 PM
douardda closed D7380: Update JournalClientOffsetRanges for swh.journal 0.9.
Mar 29 2022, 5:24 PM
douardda committed rDDATASETd7c332e4e7e1: Update JournalClientOffsetRanges for swh.journal 0.9 (authored by douardda).
Mar 29 2022, 5:24 PM
douardda abandoned D7386: Use the same 'id' column name everywhere in ORC files.
Mar 29 2022, 5:22 PM
douardda updated the diff for D7385: Write related ORC files in the same directory using the same UUID.

rebase

Mar 29 2022, 5:14 PM
douardda updated the diff for D7384: Add some user metadata in generated ORC files.

rebase

Mar 29 2022, 5:13 PM
douardda updated the diff for D7383: Implement test_orc exporter as a simple function instead of a fixture.

rebase

Mar 29 2022, 5:13 PM
douardda updated the diff for D7382: Make the kafka group_id prefix configurable in the config file.

rebase

Mar 29 2022, 5:13 PM
douardda updated the diff for D7381: Use a named logger for journalprocessor.py.

rebase

Mar 29 2022, 5:13 PM
douardda updated the diff for D7380: Update JournalClientOffsetRanges for swh.journal 0.9.

rebase

Mar 29 2022, 5:13 PM
douardda closed D7379: Encode TimestampWithTimezone as (sec, usec, offset) in ORC file.
Mar 29 2022, 3:20 PM
douardda committed rDDATASETf588e20a41af: Encode TimestampWithTimezone as (timestamp, offset, raw_offset_bytes) in ORC… (authored by douardda).
Mar 29 2022, 3:20 PM
douardda updated the diff for D7389: Add the raw_manifest column for revision, release and directory ORC files.

rebase

Mar 29 2022, 3:19 PM
douardda updated the diff for D7388: Export revision extra headers in a dedicated ORC file.

rebase

Mar 29 2022, 3:18 PM
douardda updated the diff for D7387: Add the type fields for revision and origin_visit_status ORC table.

rebase

Mar 29 2022, 3:18 PM
douardda updated the diff for D7386: Use the same 'id' column name everywhere in ORC files.

rebase

Mar 29 2022, 3:17 PM
douardda updated the diff for D7385: Write related ORC files in the same directory using the same UUID.

rebase

Mar 29 2022, 3:16 PM
douardda updated the diff for D7384: Add some user metadata in generated ORC files.

rebase

Mar 29 2022, 3:16 PM
douardda updated the diff for D7383: Implement test_orc exporter as a simple function instead of a fixture.

rebase

Mar 29 2022, 3:16 PM
douardda updated the diff for D7382: Make the kafka group_id prefix configurable in the config file.

rebase

Mar 29 2022, 3:15 PM
douardda updated the diff for D7381: Use a named logger for journalprocessor.py.

rebase

Mar 29 2022, 3:15 PM
douardda updated the diff for D7380: Update JournalClientOffsetRanges for swh.journal 0.9.

rebase

Mar 29 2022, 3:14 PM
douardda updated the diff for D7379: Encode TimestampWithTimezone as (sec, usec, offset) in ORC file.

Rename date fields back to their previous names and use smallint for offsets

Mar 29 2022, 3:14 PM
douardda added a comment to D7451: Add missing cypress tests to 'add-forge-now' request dashboard.

note there is a typo in the commit message (add-fore-now)

Mar 29 2022, 1:11 PM
douardda accepted D6990: Add partial implementation of `ArchiveGraph` class.
Mar 29 2022, 11:31 AM

Mar 24 2022

douardda added inline comments to D7397: checks: Add type annotation to extra_validator.
Mar 24 2022, 11:41 AM

Mar 23 2022

douardda updated the diff for D7379: Encode TimestampWithTimezone as (sec, usec, offset) in ORC file.

add a couple of missing 'is not None' checks, as asked by vlorentz

Mar 23 2022, 10:00 AM

Mar 22 2022

douardda updated the diff for D7389: Add the raw_manifest column for revision, release and directory ORC files.

rebase

Mar 22 2022, 5:30 PM
douardda updated the diff for D7388: Export revision extra headers in a dedicated ORC file.

rebase

Mar 22 2022, 5:30 PM
douardda updated the diff for D7387: Add the type fields for revision and origin_visit_status ORC table.

rebase

Mar 22 2022, 5:30 PM
douardda updated the diff for D7386: Use the same 'id' column name everywhere in ORC files.

rebase

Mar 22 2022, 5:30 PM
douardda updated the diff for D7385: Write related ORC files in the same directory using the same UUID.

rebase

Mar 22 2022, 5:30 PM
douardda updated the diff for D7384: Add some user metadata in generated ORC files.

rebase

Mar 22 2022, 5:29 PM
douardda updated the diff for D7383: Implement test_orc exporter as a simple function instead of a fixture.

rebase

Mar 22 2022, 5:29 PM
douardda updated the diff for D7382: Make the kafka group_id prefix configurable in the config file.

rebase

Mar 22 2022, 5:29 PM
douardda updated the diff for D7381: Use a named logger for journalprocessor.py.

rebase

Mar 22 2022, 5:28 PM
douardda updated the diff for D7380: Update JournalClientOffsetRanges for swh.journal 0.9.

rebase

Mar 22 2022, 5:28 PM
douardda updated the diff for D7379: Encode TimestampWithTimezone as (sec, usec, offset) in ORC file.

attempt to please mypy

Mar 22 2022, 5:23 PM
douardda updated the diff for D7379: Encode TimestampWithTimezone as (sec, usec, offset) in ORC file.

typo

Mar 22 2022, 4:04 PM
douardda updated the diff for D7389: Add the raw_manifest column for revision, release and directory ORC files.

rebase

Mar 22 2022, 3:50 PM
douardda updated the diff for D7388: Export revision extra headers in a dedicated ORC file.

rebase

Mar 22 2022, 3:49 PM
douardda updated the diff for D7387: Add the type fields for revision and origin_visit_status ORC table.

rebase

Mar 22 2022, 3:49 PM
douardda updated the diff for D7386: Use the same 'id' column name everywhere in ORC files.

rebase

Mar 22 2022, 3:49 PM
douardda updated the diff for D7385: Write related ORC files in the same directory using the same UUID.

rebase

Mar 22 2022, 3:49 PM
douardda updated the diff for D7384: Add some user metadata in generated ORC files.

rebase

Mar 22 2022, 3:48 PM
douardda updated the diff for D7383: Implement test_orc exporter as a simple function instead of a fixture.

rebase

Mar 22 2022, 3:48 PM
douardda updated the diff for D7382: Make the kafka group_id prefix configurable in the config file.

rebase

Mar 22 2022, 3:47 PM
douardda updated the diff for D7381: Use a named logger for journalprocessor.py.

rebase

Mar 22 2022, 3:47 PM
douardda updated the diff for D7380: Update JournalClientOffsetRanges for swh.journal 0.9.

rebase

Mar 22 2022, 3:47 PM
douardda updated the diff for D7379: Encode TimestampWithTimezone as (sec, usec, offset) in ORC file.

Update the encoding of TimestampWithTimezone to keep the timestamp part as an ORC timestamp

Mar 22 2022, 3:46 PM
douardda accepted D7400: Add support for None as author or committer of a Revision.

I would go for the more compact version of the code in from_dict, otherwise lgtm.

Mar 22 2022, 12:07 PM
douardda added inline comments to D7400: Add support for None as author or committer of a Revision.
Mar 22 2022, 12:05 PM
douardda added inline comments to D7400: Add support for None as author or committer of a Revision.
Mar 22 2022, 12:04 PM

Mar 21 2022

douardda added inline comments to D6990: Add partial implementation of `ArchiveGraph` class.
Mar 21 2022, 5:28 PM
douardda added inline comments to D6990: Add partial implementation of `ArchiveGraph` class.
Mar 21 2022, 5:18 PM
douardda added inline comments to D6990: Add partial implementation of `ArchiveGraph` class.
Mar 21 2022, 5:16 PM
douardda added inline comments to D6990: Add partial implementation of `ArchiveGraph` class.
Mar 21 2022, 5:11 PM
douardda added inline comments to D6990: Add partial implementation of `ArchiveGraph` class.
Mar 21 2022, 5:07 PM
douardda updated the task description for T3415: Specify the Vitam archiving format.
Mar 21 2022, 10:18 AM

Mar 18 2022

douardda added inline comments to D7389: Add the raw_manifest column for revision, release and directory ORC files.
Mar 18 2022, 4:11 PM
douardda added a comment to D7388: Export revision extra headers in a dedicated ORC file.

couldn't it be an extra column in the revision export? I haven't seen any long extra header on revisions

Mar 18 2022, 4:10 PM
douardda updated the summary of D7388: Export revision extra headers in a dedicated ORC file.
Mar 18 2022, 4:08 PM
douardda added a comment to D7386: Use the same 'id' column name everywhere in ORC files.

Why? This may lead to confusion

Mar 18 2022, 4:05 PM
douardda added inline comments to D7382: Make the kafka group_id prefix configurable in the config file.
Mar 18 2022, 3:57 PM
douardda updated the diff for D7389: Add the raw_manifest column for revision, release and directory ORC files.

rebase

Mar 18 2022, 3:52 PM
douardda updated the diff for D7388: Export revision extra headers in a dedicated ORC file.

rebase

Mar 18 2022, 3:50 PM
douardda updated the diff for D7387: Add the type fields for revision and origin_visit_status ORC table.

rebase

Mar 18 2022, 3:50 PM
douardda updated the diff for D7386: Use the same 'id' column name everywhere in ORC files.

rebase

Mar 18 2022, 3:50 PM
douardda updated the diff for D7385: Write related ORC files in the same directory using the same UUID.

rebase

Mar 18 2022, 3:50 PM
douardda updated the diff for D7384: Add some user metadata in generated ORC files.

rebase

Mar 18 2022, 3:49 PM
douardda updated the diff for D7383: Implement test_orc exporter as a simple function instead of a fixture.

rebase

Mar 18 2022, 3:49 PM
douardda updated the diff for D7382: Make the kafka group_id prefix configurable in the config file.

rebase

Mar 18 2022, 3:49 PM
douardda updated the diff for D7381: Use a named logger for journalprocessor.py.

rebase and fixes suggested by ardumont

Mar 18 2022, 3:48 PM
douardda updated the diff for D7380: Update JournalClientOffsetRanges for swh.journal 0.9.

rebase

Mar 18 2022, 3:48 PM
douardda updated the diff for D7379: Encode TimestampWithTimezone as (sec, usec, offset) in ORC file.

Do not use TimestampWithTimezone object in swh_date_to_tuple()

Mar 18 2022, 3:47 PM
douardda added inline comments to D7379: Encode TimestampWithTimezone as (sec, usec, offset) in ORC file.
Mar 18 2022, 3:07 PM
douardda added inline comments to D7379: Encode TimestampWithTimezone as (sec, usec, offset) in ORC file.
Mar 18 2022, 3:04 PM
douardda added inline comments to D7381: Use a named logger for journalprocessor.py.
Mar 18 2022, 2:58 PM