Query: Advanced Search

	Include stories about projects I am a member of.

Can we associate the name of the temporary storage directory for a load with that loader's pid, and then make every new loader instance compare existing temp storage dirs during init? If a storage directory exists for a process that does not exist (because the process was killed) then it can be deleted.

I worry that RAM is way more constrained than disk space is. It seems like the biggest problem is/was

If cache files are sticking around, then of course the code should make sure that they go away when done or aborted. But I think that a few G used during processing of extremely large repos should be acceptable. :/

I think 6e12c90b160ad3277a1edea27a05f9adea1bc92f may be a bad idea. Have you tested how much RAM it takes to hold the whole dirs dict in memory on a very large repo like mozilla-unified?

I agree with taking tags from both sides and discarding all lines that don't fit the pattern.

As discussed in irc a short while ago (just leaving this as note here), seeing 2 caches is normal and expected, since one is spawned inside reader and one in loader. Will have to also pass that argument to the reader instance.

use parent rev ids instead of node ids

don't keep hglib attached. it can use a lot of ram

Does it make sense to open that in the loader's configuration property?

The bundle loader is tunable to use less ram and therefore more disk for its live caching (though I need to revisit the counter to make the tuning argument less arbitrary and more representative of real bytes used, because it currently ignores overhead and python data has a lot of overhead).

The bundle step, for some repository, is at the moment needing quite some ram

Well I'm not sure what just happened, but I commited a patch (and apparently also some duplicate history).

Merge branch 'master' of ssh://forge.softwareheritage.org/source/swh-loader…

Bump requirements for new swh.loader.core

objects: make all functions conform to flake8

swh.mercurial.loader: Fix slow_loader release computation

swh.loader.mercurial: Remove unneeded flush method

swh.mercurial.loader: Fix slow_loader + replace occs with snapshot

revert fbdd798b0e32a4cc0ef50b08ae2217d45f95e7ad and skip some work when possible

swh.mercurial.loader: Replace occurrences with snapshot

comment about compression

I'll do it as part of my patch, but I will need you to look at it. You made the original changes for good reasons, so I just want to make sure that the reasons are preserved.

Also commit fbdd798b0e32a4cc0ef50b08ae2217d45f95e7ad is very problematic.

I propose to treat remote and local repositories the same (for now at least) with hg incoming to write the bundle in bundle20_loader:prepare. (This may require building mercurial from available 4.5 source to not hit some giant memory leak)

For fetching the blob, the only gotcha i see is that possibly we have contents without data (the big one are filtered out).

as entertained in the code, only bundle20 format support

force v2 bundle generation

prepare args were reordered in previous commit

go back to creation of chunked_reader

chunked reader for hg20 bundles

fix logic and speed way up

remove the intermediate file read object

allow second resolution in time offsets

HG bundle20 parser first prototype

I don't know if I should finish cleaning up slow_loader for code review, since the hglib interface is so slow as to be next to useless.

might as well push the slow hglib mercurial loader

When would be a good time to try to get this running?

or f['perms'] != parent_dir[fname]['perms'])):  # please don't remove the double parens. pydocstyle needs them.

Refactor lister code

Formatted the test responses and added commentary on the life goals of tasks.

what does this button do?

shot at making the lister base and intermediate classes agnostic to the transport layer

updated requirements

rebase on origin/master, more tests + bug fixes

longer tests, more refactoring

Advanced Search
Use Results
Edit Query
Hide Query

Feb 21 2018

Feb 20 2018

Feb 15 2018

Feb 14 2018

Feb 13 2018

Feb 9 2018

Feb 8 2018

Feb 7 2018

Feb 2 2018

Jan 12 2018

Dec 26 2017

Dec 25 2017

Oct 16 2017

Jun 9 2017

May 30 2017

May 24 2017

May 23 2017

May 21 2017

May 17 2017

May 9 2017

Apr 21 2017

Apr 4 2017

Mar 17 2017

Mar 10 2017

Mar 6 2017

Mar 2 2017

Feb 23 2017

Feb 20 2017

Feb 18 2017

Advanced SearchUse ResultsEdit QueryHide Query