Feb 15 2021

dachary changed the status of T3051: Using EOS to store objects from Open to Work in Progress.
Feb 15 2021, 9:36 PM · Object storage
dachary updated the task description for T3050: Using libcephsqlite to store objects.
Feb 15 2021, 9:05 PM · Object storage
dachary added a comment to T3050: Using libcephsqlite to store objects.

A mail was sent to Patrick Donnelly to ask for his opinion on the matter.

Feb 15 2021, 9:03 PM · Object storage
dachary triaged T3050: Using libcephsqlite to store objects as Normal priority.
Feb 15 2021, 9:03 PM · Object storage
dachary changed the status of T3050: Using libcephsqlite to store objects from Open to Work in Progress.
Feb 15 2021, 9:02 PM · Object storage
dachary closed T3014: Using an RBD image to store artifacts as Resolved.
Feb 15 2021, 9:00 PM · Object storage
dachary closed T3014: Using an RBD image to store artifacts, a subtask of T3049: Benchmarking an RBD based object container, as Resolved.
Feb 15 2021, 9:00 PM · Object storage
dachary added a comment to T3014: Using an RBD image to store artifacts.

This preliminary exploration is complete; the work now moves on to benchmarking to discover blockers.

Feb 15 2021, 8:59 PM · Object storage
dachary added a parent task for T3014: Using an RBD image to store artifacts: T3049: Benchmarking an RBD based object container.
Feb 15 2021, 8:58 PM · Object storage
dachary added a subtask for T3049: Benchmarking an RBD based object container: T3014: Using an RBD image to store artifacts.
Feb 15 2021, 8:58 PM · Object storage
dachary updated the task description for T3049: Benchmarking an RBD based object container.
Feb 15 2021, 8:51 PM · Object storage
dachary updated the task description for T3048: Using a custom Sorted String Table format.
Feb 15 2021, 8:09 PM · Object storage
dachary added a comment to T3048: Using a custom Sorted String Table format.

Updated the description, even simpler.

Feb 15 2021, 8:09 PM · Object storage
dachary updated the task description for T3049: Benchmarking an RBD based object container.
Feb 15 2021, 8:08 PM · Object storage
dachary added a comment to T3049: Benchmarking an RBD based object container.

Thanks for the comment. Let's keep just the SWHID then.

Feb 15 2021, 8:07 PM · Object storage
dachary updated the task description for T3049: Benchmarking an RBD based object container.
Feb 15 2021, 6:19 PM · Object storage
dachary changed the status of T3049: Benchmarking an RBD based object container from Open to Work in Progress.
Feb 15 2021, 6:16 PM · Object storage
dachary updated the task description for T3048: Using a custom Sorted String Table format.
Feb 15 2021, 6:04 PM · Object storage
dachary renamed T3048: Using a custom Sorted String Table format from Using custom format for 1TB archive to Using a custom format for 1TB archive.
Feb 15 2021, 5:43 PM · Object storage
dachary added a parent task for T3046: Using xz-file-format for 1TB archive: T3048: Using a custom Sorted String Table format.
Feb 15 2021, 5:43 PM · Object storage
dachary added subtasks for T3048: Using a custom Sorted String Table format: T3046: Using xz-file-format for 1TB archive, T3045: Using pixz for 1TB archives.
Feb 15 2021, 5:43 PM · Object storage
dachary added a parent task for T3045: Using pixz for 1TB archives: T3048: Using a custom Sorted String Table format.
Feb 15 2021, 5:43 PM · Object storage
dachary changed the status of T3048: Using a custom Sorted String Table format from Open to Work in Progress.
Feb 15 2021, 5:41 PM · Object storage
dachary closed T3046: Using xz-file-format for 1TB archive as Resolved.
Feb 15 2021, 5:37 PM · Object storage
dachary added a comment to T3046: Using xz-file-format for 1TB archive.

Although simple and close to what is needed, Xz is not an exact match: the index would need to be maintained.

Feb 15 2021, 5:37 PM · Object storage
dachary added a comment to T3046: Using xz-file-format for 1TB archive.

Xz format inadequate for long-term archiving

Feb 15 2021, 5:35 PM · Object storage
dachary changed the status of T3014: Using an RBD image to store artifacts from Open to Work in Progress.
Feb 15 2021, 2:13 PM · Object storage
dachary added a comment to T3046: Using xz-file-format for 1TB archive.

The zstd format is tightly associated with the compression algorithm and is therefore more complex. It can however be a sequence of independently compressed content and could be used for the same purpose as xz.

Feb 15 2021, 10:42 AM · Object storage
dachary added a comment to T3046: Using xz-file-format for 1TB archive.

The 7z format is more complex because it knows about files, directories, etc. It is not just a compressed data format.

Feb 15 2021, 9:56 AM · Object storage
dachary changed the status of T3046: Using xz-file-format for 1TB archive from Open to Work in Progress.
Feb 15 2021, 9:36 AM · Object storage
dachary closed T3045: Using pixz for 1TB archives as Resolved.
Feb 15 2021, 9:28 AM · Object storage
dachary added a comment to T3045: Using pixz for 1TB archives.

There are two blockers:

Feb 15 2021, 9:28 AM · Object storage
dachary added a comment to T3045: Using pixz for 1TB archives.

When extracting a single file (-x file), the in-memory index is walked sequentially to find the file.
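
To illustrate why a sequential walk matters at this scale, here is a minimal sketch contrasting a linear scan with a binary search over a sorted index. It does not reproduce pixz code: the function names and the (name, offset) layout are hypothetical, chosen only for illustration.

```python
import bisect

def find_sequential(index, name):
    """Linear scan over (name, offset) pairs: cost grows with the entry count."""
    for entry_name, offset in index:
        if entry_name == name:
            return offset
    return None

def find_sorted(names, offsets, name):
    """Binary search over names kept in sorted order: logarithmic cost."""
    i = bisect.bisect_left(names, name)
    if i < len(names) and names[i] == name:
        return offsets[i]
    return None

# Hypothetical index: object names mapped to byte offsets in the archive.
names = sorted(f"obj{n:06d}" for n in range(1000))
offsets = [n * 4096 for n in range(1000)]
index = list(zip(names, offsets))
```

Both lookups return the same offset; only the number of comparisons differs, which becomes significant when the index holds many millions of entries.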

Feb 15 2021, 9:20 AM · Object storage
dachary updated the task description for T3045: Using pixz for 1TB archives.
Feb 15 2021, 9:02 AM · Object storage
dachary added a comment to T3045: Using pixz for 1TB archives.

XZ file format https://tukaani.org/xz/format.html & https://tukaani.org/xz/xz-file-format.txt

Feb 15 2021, 8:54 AM · Object storage
dachary added a comment to T3045: Using pixz for 1TB archives.

The index is located at the end of the file.
The content of the archive is compressed as successive blocks of a given size.
The index is compressed as a single block of unlimited size.
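
The layout described above can be sketched with a toy container. This is not the pixz or xz on-disk format: the JSON index, the 16-byte footer and the names pack and read_one are assumptions made for illustration, and each object is compressed as its own block for simplicity.

```python
import json
import lzma
import struct

# Toy footer (an assumption, not part of xz): index offset and index length.
FOOTER = struct.Struct("<QQ")

def pack(objects):
    """Write independently compressed blocks, then a single compressed index."""
    out = bytearray()
    index = {}
    for name, payload in objects.items():
        block = lzma.compress(payload)
        index[name] = (len(out), len(block))  # where the block starts, how long it is
        out += block
    raw_index = lzma.compress(json.dumps(index).encode())
    footer = FOOTER.pack(len(out), len(raw_index))
    return bytes(out) + raw_index + footer

def read_one(archive, name):
    """Locate the index from the end of the archive, then decompress one block."""
    index_offset, index_length = FOOTER.unpack(archive[-FOOTER.size:])
    raw_index = archive[index_offset:index_offset + index_length]
    index = json.loads(lzma.decompress(raw_index))
    offset, length = index[name]
    return lzma.decompress(archive[offset:offset + length])

archive = pack({"a": b"hello", "b": b"world" * 100})
```

Reading one object only touches the footer, the index and a single block. Because the index is one compressed block of unlimited size, it must be decompressed in full before any lookup.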

Feb 15 2021, 8:54 AM · Object storage
dachary triaged T3045: Using pixz for 1TB archives as Normal priority.
Feb 15 2021, 8:49 AM · Object storage

Feb 14 2021

dachary added a comment to T3014: Using an RBD image to store artifacts.
Feb 14 2021, 7:11 PM · Object storage
dachary added a comment to D398: [WIP] "packing" object storage design documentation.

No idea whether it's of any interest for our subject, but we may also have a look at openio

Feb 14 2021, 6:27 PM · Object storage
dachary added a comment to T2216: Packing object storage.

About Ceph RGW and the lack of packing https://lists.ceph.io/hyperkitty/list/ceph-users@ceph.io/thread/AEMW6O7WVJFMUIX7QGI2KM7HKDSTNIYT/

Feb 14 2021, 6:24 PM · Object storage, Roadmap 2020
dachary added a comment to T3014: Using an RBD image to store artifacts.

https://github.com/vasi/pixz is a candidate for the 1TB archive content

Feb 14 2021, 5:29 PM · Object storage
dachary added a watcher for Storage manager: dachary.
Feb 14 2021, 4:56 PM
dachary added a watcher for Object storage: dachary.
Feb 14 2021, 10:20 AM
dachary added a comment to T3014: Using an RBD image to store artifacts.

For the record, yesterday's IRC log:

Feb 14 2021, 9:27 AM · Object storage

Feb 13 2021

dachary added a comment to T3014: Using an RBD image to store artifacts.

For the record, today's IRC log:

Feb 13 2021, 12:21 AM · Object storage

Feb 6 2021

dachary added a comment to T3014: Using an RBD image to store artifacts.

Benchmarking S3 in Ceph with COSBench could be interesting (the video is not yet available). In the past COSBench was difficult to use, but it may have improved. This is off-topic, but I don't know where else to write it down at the moment.

Feb 6 2021, 2:03 PM · Object storage

Feb 4 2021

dachary updated the task description for T3014: Using an RBD image to store artifacts.
Feb 4 2021, 12:38 PM · Object storage

Feb 2 2021

dachary updated the task description for T3014: Using an RBD image to store artifacts.
Feb 2 2021, 9:36 PM · Object storage

Feb 1 2021

dachary updated the task description for T3014: Using an RBD image to store artifacts.
Feb 1 2021, 9:56 PM · Object storage
dachary added a comment to T3014: Using an RBD image to store artifacts.

A trivial test case (attached) shows that an RBD image backed by a k=4,m=2 erasure coded pool (RAID6 equivalent) can store 4GB of data using 6GB of disk. The metadata overhead is small. It would be great if someone could repeat the test to make sure I did not accidentally obtain these results.
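
The expected figure can be checked with a short calculation. This is a sketch that ignores metadata and stripe padding; the function name raw_usage is hypothetical, and GB is assumed to mean 10**9 bytes.

```python
def raw_usage(data_bytes, k, m):
    """Raw bytes consumed by a k+m erasure coded pool.

    Each stripe holds k data chunks plus m coding chunks, so the raw
    footprint is (k + m) / k times the logical size. Metadata and
    padding are ignored here.
    """
    return data_bytes * (k + m) / k

GB = 10**9  # assumption: decimal gigabytes
usage = raw_usage(4 * GB, k=4, m=2)
print(f"{usage / GB} GB of raw disk for 4 GB of data")
```

With k=4 and m=2 the multiplier is 1.5, which matches the 6GB observed for 4GB of data.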

Feb 1 2021, 9:16 PM · Object storage
dachary created T3014: Using an RBD image to store artifacts.
Feb 1 2021, 9:10 PM · Object storage

Jan 11 2021

dachary added a comment to T2946: pre-commit is required by bin/update.

Thanks for the merge :-) It feels really good to see a commit, however simple, being merged on a Monday morning!

Jan 11 2021, 10:53 AM · Easy hack, Documentation

Jan 9 2021

dachary added a comment to T2946: pre-commit is required by bin/update.

bin/update contains

Jan 9 2021, 7:25 PM · Easy hack, Documentation
dachary updated the diff for D4832: bin/update fails unless pre-commit is installed.

reword commit title

Jan 9 2021, 12:02 AM
dachary retitled D4832: bin/update fails unless pre-commit is installed from bin/update fails unless pip install pre-commit to bin/update fails unless pre-commit is installed.
Jan 9 2021, 12:01 AM
dachary retitled D4832: bin/update fails unless pre-commit is installed from bin/update unless pip install pre-commit to bin/update fails unless pip install pre-commit.
Jan 9 2021, 12:01 AM

Jan 8 2021

dachary updated the diff for D4832: bin/update fails unless pre-commit is installed.

forgot the leading (swh)

Jan 8 2021, 11:58 PM
dachary requested review of D4832: bin/update fails unless pre-commit is installed.
Jan 8 2021, 11:54 PM
dachary added a project to T2946: pre-commit is required by bin/update: Easy hack.
Jan 8 2021, 11:45 PM · Easy hack, Documentation
dachary triaged T2946: pre-commit is required by bin/update as Low priority.
Jan 8 2021, 11:41 PM · Easy hack, Documentation