Page MenuHomeSoftware Heritage
Feed Advanced Search

Mar 9 2021

dachary closed T3107: Scale out object storage explorations as Resolved.
Mar 9 2021, 1:09 PM · Object storage

Mar 8 2021

dachary added a parent task for T3106: TCO of a production ready Ambry vs implementing the design: T3054: Scale out object storage design.
Mar 8 2021, 10:35 PM · Object storage
dachary added a subtask for T3054: Scale out object storage design: T3106: TCO of a production ready Ambry vs implementing the design.
Mar 8 2021, 10:35 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary triaged T3106: TCO of a production ready Ambry vs implementing the design as Normal priority.
Mar 8 2021, 10:35 PM · Object storage
dachary added a parent task for T3105: LSM vs Hash Table / RocksDB vs Citus: T3054: Scale out object storage design.
Mar 8 2021, 10:29 PM · Object storage
dachary added a subtask for T3054: Scale out object storage design: T3105: LSM vs Hash Table / RocksDB vs Citus.
Mar 8 2021, 10:29 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary triaged T3105: LSM vs Hash Table / RocksDB vs Citus as Normal priority.
Mar 8 2021, 10:29 PM · Object storage
dachary added a parent task for T3104: Persistent readonly perfect hash table: T3054: Scale out object storage design.
Mar 8 2021, 10:09 PM · Object storage (RedHat collaboration)
dachary added a subtask for T3054: Scale out object storage design: T3104: Persistent readonly perfect hash table.
Mar 8 2021, 10:09 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary changed the status of T3104: Persistent readonly perfect hash table from Open to Work in Progress.
Mar 8 2021, 10:08 PM · Object storage (RedHat collaboration)
dachary added a comment to T3048: Using a custom Sorted String Table format.

https://en.wikipedia.org/wiki/Perfect_hash_function could be used to get O(1) instead of O(log(N))

Mar 8 2021, 9:53 PM · Object storage
dachary added a comment to T3054: Scale out object storage design.

Very interesting to see how this problem was presented & solved in the Hadoop ecosystem, thanks for the links.

Mar 8 2021, 10:06 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task

Mar 5 2021

dachary closed T3088: Object storage index with redis, a subtask of T3054: Scale out object storage design, as Invalid.
Mar 5 2021, 8:32 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary closed T3088: Object storage index with redis as Invalid.
Mar 5 2021, 8:32 AM · Object storage
dachary updated the task description for T3088: Object storage index with redis.
Mar 5 2021, 8:32 AM · Object storage
dachary updated the task description for T3088: Object storage index with redis.
Mar 5 2021, 8:27 AM · Object storage

Mar 4 2021

dachary updated the task description for T3054: Scale out object storage design.
Mar 4 2021, 11:23 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary added a subtask for T3054: Scale out object storage design: T3088: Object storage index with redis.
Mar 4 2021, 10:45 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary added a parent task for T3088: Object storage index with redis: T3054: Scale out object storage design.
Mar 4 2021, 10:45 PM · Object storage
dachary changed the status of T3088: Object storage index with redis from Open to Work in Progress.
Mar 4 2021, 10:45 PM · Object storage

Mar 3 2021

dachary updated the task description for T3054: Scale out object storage design.
Mar 3 2021, 1:48 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Mar 3 2021, 1:17 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Mar 3 2021, 8:23 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Mar 3 2021, 1:20 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Mar 3 2021, 12:52 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task

Mar 2 2021

dachary updated the task description for T3054: Scale out object storage design.
Mar 2 2021, 11:47 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Mar 2 2021, 11:43 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Mar 2 2021, 4:30 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Mar 2 2021, 3:32 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task

Mar 1 2021

dachary updated the task description for T3054: Scale out object storage design.
Mar 1 2021, 4:06 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Mar 1 2021, 11:57 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Mar 1 2021, 11:55 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task

Feb 24 2021

dachary added a comment to T3054: Scale out object storage design.

For the record the half baked benchmark script for the proposed designed I worked on today. To be continued!


Feb 24 2021, 6:16 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task

Feb 23 2021

dachary updated the task description for T3048: Using a custom Sorted String Table format.
Feb 23 2021, 11:26 PM · Object storage
dachary renamed T3048: Using a custom Sorted String Table format from Using a custom Sorted String format to Using a custom Sorted String Table format.
Feb 23 2021, 11:24 PM · Object storage
dachary added a comment to T3048: Using a custom Sorted String Table format.

Reopening for benchmarking purposes because there does not seem to be anything ready to use T3068.

Feb 23 2021, 11:22 PM · Object storage
dachary reopened T3048: Using a custom Sorted String Table format, a subtask of T3054: Scale out object storage design, as Work in Progress.
Feb 23 2021, 11:21 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary renamed T3048: Using a custom Sorted String Table format from Using a custom format for 1TB archive to Using a custom Sorted String format.
Feb 23 2021, 11:21 PM · Object storage
dachary updated the task description for T3068: Using Sorted String Tables as a file format.
Feb 23 2021, 1:53 PM · Object storage
dachary updated the task description for T3054: Scale out object storage design.
Feb 23 2021, 1:52 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Feb 23 2021, 1:51 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Feb 23 2021, 1:40 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Feb 23 2021, 1:16 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary closed T3069: Using MZ as a file format as Invalid.
Feb 23 2021, 1:15 PM · Object storage
dachary added a comment to T3068: Using Sorted String Tables as a file format.

jumpDB is 100% python therefore less than ideal for CPU performance but for the purpose of benchmarking I/O and space usage it is conveniently ready to use.

Feb 23 2021, 12:15 PM · Object storage
dachary updated the task description for T3068: Using Sorted String Tables as a file format.
Feb 23 2021, 12:13 PM · Object storage
dachary updated the task description for T3068: Using Sorted String Tables as a file format.
Feb 23 2021, 12:06 PM · Object storage
dachary added a subtask for T3054: Scale out object storage design: T3068: Using Sorted String Tables as a file format.
Feb 23 2021, 12:04 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary added a parent task for T3068: Using Sorted String Tables as a file format: T3054: Scale out object storage design.
Feb 23 2021, 12:04 PM · Object storage
dachary changed the status of T3068: Using Sorted String Tables as a file format from Open to Work in Progress.
Feb 23 2021, 12:04 PM · Object storage
dachary added a comment to T3066: Using RocksDB SST as a file format.

There is not enough tooling to use SST files independently of RocksDB. Maybe it is possible to use the RocksDB with a configuration that makes it so it only uses a single SST file ?

Feb 23 2021, 11:45 AM · Object storage
dachary updated the task description for T3066: Using RocksDB SST as a file format.
Feb 23 2021, 11:41 AM · Object storage
dachary updated the task description for T3066: Using RocksDB SST as a file format.
Feb 23 2021, 11:34 AM · Object storage
dachary updated the task description for T3066: Using RocksDB SST as a file format.
Feb 23 2021, 11:33 AM · Object storage
dachary updated the task description for T3066: Using RocksDB SST as a file format.
Feb 23 2021, 9:31 AM · Object storage
dachary updated the task description for T3054: Scale out object storage design.
Feb 23 2021, 9:31 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary added a subtask for T3054: Scale out object storage design: T3066: Using RocksDB SST as a file format.
Feb 23 2021, 9:30 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary added a parent task for T3066: Using RocksDB SST as a file format: T3054: Scale out object storage design.
Feb 23 2021, 9:30 AM · Object storage
dachary changed the status of T3066: Using RocksDB SST as a file format from Open to Work in Progress.
Feb 23 2021, 9:30 AM · Object storage

Feb 22 2021

dachary updated the task description for T3054: Scale out object storage design.
Feb 22 2021, 10:58 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Feb 22 2021, 4:28 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Feb 22 2021, 4:26 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Feb 22 2021, 4:24 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Feb 22 2021, 4:04 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Feb 22 2021, 2:06 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Feb 22 2021, 11:33 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Feb 22 2021, 11:12 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Feb 22 2021, 10:32 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Feb 22 2021, 10:28 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Feb 22 2021, 10:20 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Feb 22 2021, 10:19 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Feb 22 2021, 10:10 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Feb 22 2021, 9:54 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Feb 22 2021, 8:34 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Feb 22 2021, 12:28 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary changed the status of T3050: Using libcephsqlite to store objects from Work in Progress to Open.
Feb 22 2021, 12:25 AM · Object storage
dachary changed the status of T3050: Using libcephsqlite to store objects, a subtask of T3054: Scale out object storage design, from Work in Progress to Open.
Feb 22 2021, 12:25 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary closed T3064: Using ambry to store objects, a subtask of T3054: Scale out object storage design, as Invalid.
Feb 22 2021, 12:25 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary closed T3064: Using ambry to store objects as Invalid.
Feb 22 2021, 12:25 AM · Object storage
dachary added a comment to T3064: Using ambry to store objects.

Ambry has been a great source of inspiration and the best fit for the software heritage use case. Including the partition UUID in the object takes advantage of the immutability of the objects allows all readers to have a scale out object storage.

Feb 22 2021, 12:24 AM · Object storage
dachary changed the status of T3065: Using git to store objects, a subtask of T3054: Scale out object storage design, from Work in Progress to Open.
Feb 22 2021, 12:17 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary changed the status of T3065: Using git to store objects from Work in Progress to Open.
Feb 22 2021, 12:17 AM · Object storage
dachary closed T3048: Using a custom Sorted String Table format, a subtask of T3054: Scale out object storage design, as Invalid.
Feb 22 2021, 12:16 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary closed T3048: Using a custom Sorted String Table format as Invalid.
Feb 22 2021, 12:16 AM · Object storage
dachary added a comment to T3048: Using a custom Sorted String Table format.

It turns out there are a number of suitable formats (SST from RocksDB for one), no need to re-invent this wheel.

Feb 22 2021, 12:16 AM · Object storage
dachary closed T3052: Reducing Ceph bluestore_min_alloc_size from 64K to 4K, a subtask of T3056: Ceph as an object storage, as Invalid.
Feb 22 2021, 12:13 AM · Object storage (RedHat collaboration)
dachary closed T3052: Reducing Ceph bluestore_min_alloc_size from 64K to 4K as Invalid.
Feb 22 2021, 12:13 AM · Object storage
dachary added a comment to T3052: Reducing Ceph bluestore_min_alloc_size from 64K to 4K.

In the T3054 proposed design, objects are packed into larger files and there is no reason to continue in this direction. There seems to be a consensus that tenths of billions of individual objects is problematic. It takes very long to enumerate, for one thing. And noone is doing that which is not a great sign.

Feb 22 2021, 12:13 AM · Object storage
dachary added a comment to T3049: Benchmarking an RBD based object container.

The T3054 design evolved and this benchmark won't be needed

Feb 22 2021, 12:09 AM · Object storage
dachary updated the task description for T3054: Scale out object storage design.
Feb 22 2021, 12:08 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary closed T3049: Benchmarking an RBD based object container as Invalid.
Feb 22 2021, 12:06 AM · Object storage
dachary closed T3049: Benchmarking an RBD based object container, a subtask of T3056: Ceph as an object storage, as Invalid.
Feb 22 2021, 12:06 AM · Object storage (RedHat collaboration)
dachary updated the task description for T3054: Scale out object storage design.
Feb 22 2021, 12:04 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task

Feb 21 2021

dachary updated the task description for T3054: Scale out object storage design.
Feb 21 2021, 8:54 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Feb 21 2021, 8:49 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Feb 21 2021, 5:45 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary added a comment to T3064: Using ambry to store objects.

Readonly partitions are stored in Sorted String Table format.

Feb 21 2021, 5:41 PM · Object storage
dachary updated the task description for T3054: Scale out object storage design.
Feb 21 2021, 12:22 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3064: Using ambry to store objects.
Feb 21 2021, 12:13 PM · Object storage
dachary added a comment to T3064: Using ambry to store objects.

Open sourcing DataHub: LinkedIn’s metadata search and discovery platform explains how developers work on DataHub and the relationship between code internal to Linkedin and what is published as Free Software. It is not about ambry and maybe the ambry team has a completely different behavior. A similar article about ambry is dated 2016:

Feb 21 2021, 12:11 PM · Object storage