My API idea was to simply have something like ENTRIES_REGEXP = r'^.*:.*$' as a class attribute of each type of directory, and a validate_entry(self, name: str) method which, by default, just checks that it matches the regexp.
- Queries
- All Stories
- Search
- Advanced Search
- Transactions
- Transaction Logs
Advanced Search
Dec 8 2020
Dec 3 2020
We also need to discuss what exactly we put in cache/. I thought about symlinks to archive/ and meta/, what do you think? Removing the symlinks also means removing the data from the cache.
Dec 2 2020
Dec 1 2020
Nov 27 2020
Nov 25 2020
Nov 20 2020
Nov 19 2020
Nov 18 2020
Nov 16 2020
Nov 13 2020
Nov 12 2020
I think I understand what your fill_direntry_cache function is trying to do: you want to avoid fetching the history multiple times by doing the request only once and writing the direntry cache of all the children recursively?
Would it be maybe better to instead have a small LRU cache for the API queries, and keep the direntry code simple and fully lazy?
Nov 5 2020
Nov 4 2020
Looks good apart from two small things.
Nov 3 2020
One thing I don't really like here is that FuseEntries cannot easily list their own entries easily using the cache when available. I would much rather have the cache logic moved inside FuseEntry like what we discussed.
Oct 22 2020
Oct 21 2020
Oct 16 2020
Oct 14 2020
Oct 13 2020
lookup() should ideally be O(1).
Oct 12 2020
Oct 9 2020
Oct 8 2020
Oct 7 2020
Oct 6 2020
This is looking pretty great. I see three more good refactoring possibilities:
Oct 5 2020
Fix BVGraph/ImmutableGraph implicit naming
Oct 3 2020
Yes, the correct method name is enter_context
Sep 30 2020
Sep 29 2020
Sep 24 2020
Sep 23 2020
Sep 22 2020
I would do a bit of refactoring:
Sep 18 2020
Sep 16 2020
No, only the edge part is done, we still need a parquet and a CSV exporter :/
It is already running on granet :-)
Sep 15 2020
We considered three possibilities for the schema (assuming that we want to get rid of the three separate tables for dir_entries, rev_entries and file_entries -- otherwise, there's 6 possibilities).
Sep 14 2020
Sep 10 2020
Sep 4 2020
Sep 3 2020
Fix useless dependency
Sep 1 2020
Aug 13 2020
Aug 12 2020
Aug 3 2020
Rebase + add commit message
Jul 27 2020
- Remove debug prints
Jul 23 2020
Obsoleted by D3011
Just to be clear, the problem here wasn't directly linked to swhgraphshm but simply to the amount of available memory, because the MAP_PRIVATE flag tried to reserve all that memory to be able to perform copy on write. Using MAP_SHARED + PROT_READ avoids having this memory reservation and fixes the issue. swhgraphshm was just a random process taking a lot of the available ram, not specifically the reason why it failed.
Fixed and added a test for that problem.