
Dec 8 2017

ardumont changed the status of T863: loader-svn: Fix origin clashes for homonym but distinct svn dumps, a subtask of T847: loader-svn: Some SVN origins have occurrences that point to non-existent objects, from Open to Work in Progress.
Dec 8 2017, 1:42 PM · SVN Loader
ardumont changed the status of T863: loader-svn: Fix origin clashes for homonym but distinct svn dumps from Open to Work in Progress.
Dec 8 2017, 1:42 PM · Origin-GoogleCode, SVN Loader
ardumont claimed T863: loader-svn: Fix origin clashes for homonym but distinct svn dumps.
Dec 8 2017, 1:42 PM · Origin-GoogleCode, SVN Loader
ardumont changed the status of T847: loader-svn: Some SVN origins have occurrences that point to non-existent objects from Open to Work in Progress.
Dec 8 2017, 1:40 PM · SVN Loader
ardumont closed T876: loader-svn: Reschedule origins with missing data as Resolved.
Dec 8 2017, 1:39 PM · Origin-GoogleCode, SVN Loader
ardumont closed T876: loader-svn: Reschedule origins with missing data, a subtask of T847: loader-svn: Some SVN origins have occurrences that point to non-existent objects, as Resolved.
Dec 8 2017, 1:39 PM · SVN Loader
ardumont added a comment to T847: loader-svn: Some SVN origins have occurrences that point to non-existent objects.

I stand by my previous remarks; here is the breakdown of origins per issue:

Dec 8 2017, 1:39 PM · SVN Loader
ardumont renamed T876: loader-svn: Reschedule origins with missing data from Reschedule origins with missing data to loader-svn: Reschedule origins with missing data.
Dec 8 2017, 1:37 PM · Origin-GoogleCode, SVN Loader
ardumont created T876: loader-svn: Reschedule origins with missing data.
Dec 8 2017, 1:37 PM · Origin-GoogleCode, SVN Loader

Dec 1 2017

ardumont added a project to T863: loader-svn: Fix origin clashes for homonym but distinct svn dumps: Origin-GoogleCode.
Dec 1 2017, 11:45 AM · Origin-GoogleCode, SVN Loader
ardumont updated the task description for T863: loader-svn: Fix origin clashes for homonym but distinct svn dumps.
Dec 1 2017, 11:45 AM · Origin-GoogleCode, SVN Loader

Nov 29 2017

ardumont updated the task description for T863: loader-svn: Fix origin clashes for homonym but distinct svn dumps.
Nov 29 2017, 11:29 AM · Origin-GoogleCode, SVN Loader
ardumont renamed T863: loader-svn: Fix origin clashes for homonym but distinct svn dumps from loader-svn: Investigate potential origin clash for homonyms but distinct svn dump loading to loader-svn: Investigate potential origin clash for homonym but distinct svn dumps.
Nov 29 2017, 11:23 AM · Origin-GoogleCode, SVN Loader
ardumont created T863: loader-svn: Fix origin clashes for homonym but distinct svn dumps.
Nov 29 2017, 11:22 AM · Origin-GoogleCode, SVN Loader

Nov 23 2017

ardumont added a comment to T847: loader-svn: Some SVN origins have occurrences that point to non-existent objects.

For information, for the first URL, it's more of a bug in the origin_url computation, which results in the same origin_url for 2 different dumps with the same name:

ardumont@uffizi:~% grep ich-sys /srv/storage/space/mirrors/code.google.com/sources/INDEX-svn-dumps
http://ich-sys.googlecode.com/svn/ /srv/storage/space/mirrors/code.google.com/sources/v2/code.google.com/i/ich-sys/ich-sys-repo.svndump.gz
http://ich-sys.googlecode.com/svn/ /srv/storage/space/mirrors/code.google.com/sources/v2/eclipselabs.org/i/ich-sys/ich-sys-repo.svndump.gz
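A minimal sketch of how such clashes could be listed across the whole index, assuming each line of INDEX-svn-dumps has the form "origin_url dump_path" as in the grep output above. The function name find_origin_clashes is hypothetical and not part of the loader:

```python
from collections import defaultdict


def find_origin_clashes(index_lines):
    """Return {origin_url: [dump_path, ...]} for URLs mapped to several dumps.

    Assumes each non-empty line is "<origin_url> <dump_path>", separated by
    the first space (an assumption based on the grep output above).
    """
    dumps_by_url = defaultdict(list)
    for line in index_lines:
        line = line.strip()
        if not line:
            continue
        # Split on the first space only, so a path with spaces stays whole.
        origin_url, dump_path = line.split(" ", 1)
        dumps_by_url[origin_url].append(dump_path)
    # Keep only the URLs that more than one dump claims.
    return {url: paths for url, paths in dumps_by_url.items() if len(paths) > 1}


if __name__ == "__main__":
    sample = [
        "http://ich-sys.googlecode.com/svn/ /srv/storage/space/mirrors/code.google.com/sources/v2/code.google.com/i/ich-sys/ich-sys-repo.svndump.gz",
        "http://ich-sys.googlecode.com/svn/ /srv/storage/space/mirrors/code.google.com/sources/v2/eclipselabs.org/i/ich-sys/ich-sys-repo.svndump.gz",
    ]
    for url, paths in find_origin_clashes(sample).items():
        print(url, len(paths))
```

On the real index, the lines would come from iterating over the opened file instead of the sample list.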
Nov 23 2017, 3:13 PM · SVN Loader

Nov 14 2017

ardumont added a comment to T847: loader-svn: Some SVN origins have occurrences that point to non-existent objects.

Possibly related to rDLDBASEd74506f6b53dc3ffdae66e8446fe561d006a64f9

Nov 14 2017, 12:12 PM · SVN Loader
olasd created T847: loader-svn: Some SVN origins have occurrences that point to non-existent objects.
Nov 14 2017, 11:27 AM · SVN Loader

Nov 10 2017

ardumont added a comment to T844: google code svn import: symlink: embedded null character in src.

The latest fix from T839 removes this edge case as well.
The symlink referenced here was not actually a symlink,
which is consistent with my first, disconcerting analysis.

Nov 10 2017, 3:59 PM · SVN Loader
ardumont closed T839: google code svn import: filename too long as Resolved by committing rDLDSVN2214719194c8: swh.loader.svn.loader: Fix special symlink case.
Nov 10 2017, 3:44 PM · SVN Loader
ardumont closed T844: google code svn import: symlink: embedded null character in src, a subtask of T676: Google Code SVN import: Examine ingestion logs for errors and list them if any, as Resolved.
Nov 10 2017, 3:44 PM · SVN Loader
ardumont closed T839: google code svn import: filename too long, a subtask of T676: Google Code SVN import: Examine ingestion logs for errors and list them if any, as Resolved.
Nov 10 2017, 3:44 PM · SVN Loader
ardumont closed T844: google code svn import: symlink: embedded null character in src as Resolved by committing rDLDSVN2214719194c8: swh.loader.svn.loader: Fix special symlink case.
Nov 10 2017, 3:44 PM · SVN Loader
ardumont added a comment to T839: google code svn import: filename too long.

So that means the fix is:

Nov 10 2017, 1:42 PM · SVN Loader
ardumont added a comment to T839: google code svn import: filename too long.

That's awesome.

Nov 10 2017, 12:29 PM · SVN Loader
ardumont reopened T839: google code svn import: filename too long, a subtask of T676: Google Code SVN import: Examine ingestion logs for errors and list them if any, as Open.
Nov 10 2017, 11:51 AM · SVN Loader
ardumont reopened T839: google code svn import: filename too long as "Open".

Another repository triggered something similar. So, not quite yet fixed.

Nov 10 2017, 11:51 AM · SVN Loader
ardumont added a subtask for T676: Google Code SVN import: Examine ingestion logs for errors and list them if any: T844: google code svn import: symlink: embedded null character in src.
Nov 10 2017, 11:30 AM · SVN Loader
ardumont added a parent task for T844: google code svn import: symlink: embedded null character in src: T676: Google Code SVN import: Examine ingestion logs for errors and list them if any.
Nov 10 2017, 11:30 AM · SVN Loader
ardumont created T844: google code svn import: symlink: embedded null character in src.
Nov 10 2017, 11:30 AM · SVN Loader

Nov 8 2017

ardumont added a comment to T811: Google code svn import - 'Too many open file' errors.

As usual, to be complete...

Nov 8 2017, 3:27 PM · SVN Loader

Nov 7 2017

ardumont renamed T811: Google code svn import - 'Too many open file' errors from Google code svn dump - 'Too many open file' errors to Google code svn import - 'Too many open file' errors.
Nov 7 2017, 6:45 PM · SVN Loader
ardumont renamed T812: Google code svn import - 'Eventful partial visit. Detail: too many values to unpack' from Google code svn dump - 'Eventful partial visit. Detail: too many values to unpack' to Google code svn import - 'Eventful partial visit. Detail: too many values to unpack'.
Nov 7 2017, 6:45 PM · SVN Loader
ardumont added a comment to T676: Google Code SVN import: Examine ingestion logs for errors and list them if any.

The latest run, from last weekend, surfaced some new bugs (new tasks are or will be opened for detailed analysis and fixes):

Nov 7 2017, 6:44 PM · SVN Loader
ardumont closed T811: Google code svn import - 'Too many open file' errors as Resolved.
Nov 7 2017, 6:38 PM · SVN Loader
ardumont closed T811: Google code svn import - 'Too many open file' errors, a subtask of T676: Google Code SVN import: Examine ingestion logs for errors and list them if any, as Resolved.
Nov 7 2017, 6:38 PM · SVN Loader
ardumont added a parent task for T840: google code svn import: file exists: T676: Google Code SVN import: Examine ingestion logs for errors and list them if any.
Nov 7 2017, 6:37 PM · SVN Loader
ardumont added a subtask for T676: Google Code SVN import: Examine ingestion logs for errors and list them if any: T840: google code svn import: file exists.
Nov 7 2017, 6:37 PM · SVN Loader
ardumont closed T839: google code svn import: filename too long, a subtask of T676: Google Code SVN import: Examine ingestion logs for errors and list them if any, as Resolved.
Nov 7 2017, 6:37 PM · SVN Loader
ardumont closed T839: google code svn import: filename too long as Resolved.
Nov 7 2017, 6:37 PM · SVN Loader
ardumont added a subtask for T676: Google Code SVN import: Examine ingestion logs for errors and list them if any: T839: google code svn import: filename too long.
Nov 7 2017, 6:36 PM · SVN Loader
ardumont added a parent task for T839: google code svn import: filename too long: T676: Google Code SVN import: Examine ingestion logs for errors and list them if any.
Nov 7 2017, 6:36 PM · SVN Loader
ardumont added a comment to T839: google code svn import: filename too long.

Other repository impacted: /srv/storage/space/mirrors/code.google.com/sources/v2/code.google.com/h/humanroot/humanroot-repo.svndump.gz

Nov 7 2017, 6:35 PM · SVN Loader
ardumont closed T840: google code svn import: file exists as Resolved by committing rDLDSVN8bd410278d98: ra: Fix not cleaned up path leading to file exists error.
Nov 7 2017, 6:21 PM · SVN Loader

Nov 6 2017

ardumont added a comment to T840: google code svn import: file exists.

Apparently a link exists with the same path. Thus the error.

Nov 6 2017, 5:58 PM · SVN Loader
ardumont created T840: google code svn import: file exists.
Nov 6 2017, 5:48 PM · SVN Loader
ardumont updated the task description for T839: google code svn import: filename too long.
Nov 6 2017, 5:01 PM · SVN Loader
ardumont created T839: google code svn import: filename too long.
Nov 6 2017, 4:43 PM · SVN Loader

Oct 26 2017

ardumont added a comment to T676: Google Code SVN import: Examine ingestion logs for errors and list them if any.

Heads up on this, I fixed some wrong behaviors and bugs:

  • uneventful visits (with no changes since the last visit) were recorded as 'partial' visits instead of 'full'.
  • a space in a symlink path broke the loading (T812)
  • the 'too many open files' issue (T811)
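
As an illustration of the symlink item (T812), here is a minimal sketch of how a space in a link target can produce a 'too many values to unpack' error. It relies on the fact that Subversion stores a symlink as a special file whose content is "link <target>". The function names are hypothetical, and this is one plausible shape of the bug, not the loader's actual code:

```python
def parse_link_naive(content: bytes) -> bytes:
    """Fragile parsing: breaks as soon as the target contains a space."""
    # b"link a b".split(b" ") yields 3 items, so the 2-name unpacking fails
    # with "ValueError: too many values to unpack (expected 2)".
    prefix, target = content.split(b" ")
    return target


def parse_link(content: bytes) -> bytes:
    """Split on the first space only, keeping spaces inside the target."""
    prefix, sep, target = content.partition(b" ")
    if prefix != b"link" or not sep:
        raise ValueError("not an svn special link: %r" % content)
    return target


if __name__ == "__main__":
    data = b"link some dir/target file"
    print(parse_link(data))
    try:
        parse_link_naive(data)
    except ValueError as exc:
        print("naive parsing failed:", exc)
```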
Oct 26 2017, 9:58 AM · SVN Loader

Oct 25 2017

ardumont added a comment to T811: Google code svn import - 'Too many open file' errors.

Looking further into it.
Defining such an option in our puppet manifest would eventually make it end up in /etc/systemd/system/swh-worker@${SERVICE_NAME}.service.d/parameters.conf.
In this case, ${SERVICE_NAME} is swh_loader_svn.

Oct 25 2017, 11:16 AM · SVN Loader
ardumont added a comment to T811: Google code svn import - 'Too many open file' errors.

We could use the systemd [Service] property LimitNOFILE for loader-svn:

  • 'LimitNOFILE= ulimit -n Number of File Descriptors'.
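
To check which limit a worker process actually ends up with once LimitNOFILE is set, a minimal sketch using Python's standard resource module (Unix only) could look like this. The helper names are hypothetical, and nothing here is taken from the loader itself:

```python
import resource


def nofile_limits():
    """Return the (soft, hard) limits on open file descriptors."""
    return resource.getrlimit(resource.RLIMIT_NOFILE)


def describe(limit):
    """Render a limit value, mapping RLIM_INFINITY to 'unlimited'."""
    return "unlimited" if limit == resource.RLIM_INFINITY else str(limit)


if __name__ == "__main__":
    soft, hard = nofile_limits()
    # The soft limit is the one that triggers 'Too many open files'.
    print("soft=%s hard=%s" % (describe(soft), describe(hard)))
```

Run from inside the service, the soft value should reflect what the systemd property sets.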
Oct 25 2017, 10:50 AM · SVN Loader

Oct 24 2017

ardumont added a comment to T676: Google Code SVN import: Examine ingestion logs for errors and list them if any.

The 'googlecode' and 'unknown' key entries are googlecode svn related.
Most are stored under the 'unknown' entry key because we somehow don't have the task's input args (origin url + dump file), which is unfortunate for rescheduling them.

Oct 24 2017, 6:12 PM · SVN Loader
ardumont added a subtask for T676: Google Code SVN import: Examine ingestion logs for errors and list them if any: T812: Google code svn import - 'Eventful partial visit. Detail: too many values to unpack'.
Oct 24 2017, 5:59 PM · SVN Loader
ardumont added a parent task for T812: Google code svn import - 'Eventful partial visit. Detail: too many values to unpack': T676: Google Code SVN import: Examine ingestion logs for errors and list them if any.
Oct 24 2017, 5:59 PM · SVN Loader
ardumont added a subtask for T676: Google Code SVN import: Examine ingestion logs for errors and list them if any: T811: Google code svn import - 'Too many open file' errors.
Oct 24 2017, 5:59 PM · SVN Loader
ardumont added a parent task for T811: Google code svn import - 'Too many open file' errors: T676: Google Code SVN import: Examine ingestion logs for errors and list them if any.
Oct 24 2017, 5:59 PM · SVN Loader
ardumont closed T812: Google code svn import - 'Eventful partial visit. Detail: too many values to unpack' as Resolved by committing rDLDSVNbae364bbe8ec: swh.loader.svn.ra: Fix blank space in symlink path split problem.
Oct 24 2017, 4:40 PM · SVN Loader
ardumont created T812: Google code svn import - 'Eventful partial visit. Detail: too many values to unpack'.
Oct 24 2017, 4:25 PM · SVN Loader
ardumont added a comment to T811: Google code svn import - 'Too many open file' errors.

There is no file descriptor leak.

Oct 24 2017, 3:34 PM · SVN Loader
ardumont created T811: Google code svn import - 'Too many open file' errors.
Oct 24 2017, 2:03 PM · SVN Loader

Oct 17 2017

zack added a parent task for T328: svn / subversion loader: T807: dogfooding: ingest the Software Heritage forge into the archive (via the canonical URLs).
Oct 17 2017, 3:44 PM · SVN Loader

Oct 10 2017

ardumont created P182 scheduler - new task type for loading one-shot task (svn mount and load repository, load deposit archive)....
Oct 10 2017, 7:53 PM · Scheduling utilities

Oct 9 2017

ardumont closed T798: Empty partial visits when we fail to load due to externals property as Resolved by committing rDLDSVNe2d300a42ea6: swh.loader.svn.loader: Fix missing revision id key.
Oct 9 2017, 3:53 PM · SVN Loader
ardumont added a comment to T798: Empty partial visits when we fail to load due to externals property.

Well, well, at that moment, the revision has no 'id' key yet...
Ok, fixing it.

Oct 9 2017, 12:09 PM · SVN Loader
ardumont added a comment to T798: Empty partial visits when we fail to load due to externals property.

Well (remembering now), the fact that only 400 revisions are stored is normal (it is implemented that way).

Oct 9 2017, 12:04 PM · SVN Loader
ardumont claimed T798: Empty partial visits when we fail to load due to externals property.
Oct 9 2017, 11:41 AM · SVN Loader
ardumont added a comment to T798: Empty partial visits when we fail to load due to externals property.

Stacktrace of the reproduced error:

Oct 9 2017, 11:41 AM · SVN Loader
ardumont added a comment to T798: Empty partial visits when we fail to load due to externals property.

Ok, just so you know, I did not reproduce this behavior immediately (even though: swh-env updated, db rebuilt, configuration file 'almost' identical to prod).
I had another issue prior to the one described (about DentryPerms.directory).

Oct 9 2017, 11:29 AM · SVN Loader

Oct 6 2017

olasd created T798: Empty partial visits when we fail to load due to externals property.
Oct 6 2017, 4:51 PM · SVN Loader

Oct 4 2017

ardumont updated the task description for T676: Google Code SVN import: Examine ingestion logs for errors and list them if any.
Oct 4 2017, 3:59 PM · SVN Loader
ardumont added a comment to T676: Google Code SVN import: Examine ingestion logs for errors and list them if any.

First, an important detail: those runs loaded svn dumps from disk.
So the loader first mounts a gzipped dump as an svn repository and then processes the history log.
This can be quite resource-consuming (disk, memory).
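
The mount step can be sketched as follows, assuming the standard svnadmin create and svnadmin load commands are available. This is an illustration only: the function names are hypothetical and the loader's actual implementation is not shown here.

```python
import gzip
import shutil
import subprocess
from pathlib import Path
from typing import List, Tuple


def svnadmin_commands(repo_dir: Path) -> Tuple[List[str], List[str]]:
    """Build the two svnadmin invocations used to mount a dump."""
    create = ["svnadmin", "create", str(repo_dir)]
    load = ["svnadmin", "load", str(repo_dir)]
    return create, load


def mount_dump(dump_gz: Path, repo_dir: Path) -> Path:
    """Create an empty repository and stream the gzipped dump into it."""
    create, load = svnadmin_commands(repo_dir)
    subprocess.run(create, check=True)
    proc = subprocess.Popen(load, stdin=subprocess.PIPE)
    # Decompress on the fly so the full dump is never held in memory.
    with gzip.open(dump_gz, "rb") as dump:
        shutil.copyfileobj(dump, proc.stdin)
    proc.stdin.close()
    if proc.wait() != 0:
        raise subprocess.CalledProcessError(proc.returncode, load)
    return repo_dir
```

Streaming keeps memory usage bounded, but the mounted repository still needs disk space for the whole uncompressed history, which matches the resource concern above.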

Oct 4 2017, 3:58 PM · SVN Loader
ardumont added a comment to T676: Google Code SVN import: Examine ingestion logs for errors and list them if any.

Out of the 140k remaining svn repositories to mount and load, I have 606 errors.

Oct 4 2017, 12:43 PM · SVN Loader

Oct 3 2017

ardumont added a parent task for T328: svn / subversion loader: T617: ingest Google Code Subversion repositories.
Oct 3 2017, 9:56 AM · SVN Loader
ardumont removed a subtask for T328: svn / subversion loader: T617: ingest Google Code Subversion repositories.
Oct 3 2017, 9:55 AM · SVN Loader
ardumont reopened T617: ingest Google Code Subversion repositories, a subtask of T328: svn / subversion loader, as Open.
Oct 3 2017, 9:49 AM · SVN Loader
ardumont added a subtask for T328: svn / subversion loader: T617: ingest Google Code Subversion repositories.
Oct 3 2017, 9:47 AM · SVN Loader
ardumont removed a parent task for T328: svn / subversion loader: T617: ingest Google Code Subversion repositories.
Oct 3 2017, 9:46 AM · SVN Loader
ardumont added a parent task for T676: Google Code SVN import: Examine ingestion logs for errors and list them if any: T617: ingest Google Code Subversion repositories.
Oct 3 2017, 9:46 AM · SVN Loader
ardumont removed a subtask for T676: Google Code SVN import: Examine ingestion logs for errors and list them if any: T617: ingest Google Code Subversion repositories.
Oct 3 2017, 9:46 AM · SVN Loader

Oct 2 2017

ardumont closed T617: ingest Google Code Subversion repositories, a subtask of T676: Google Code SVN import: Examine ingestion logs for errors and list them if any, as Resolved.
Oct 2 2017, 4:22 PM · SVN Loader

Feb 15 2017

zack added a parent task for T617: ingest Google Code Subversion repositories: T676: Google Code SVN import: Examine ingestion logs for errors and list them if any.
Feb 15 2017, 4:11 PM · Archive coverage, Origin-GoogleCode, SVN Loader
zack added a subtask for T676: Google Code SVN import: Examine ingestion logs for errors and list them if any: T617: ingest Google Code Subversion repositories.
Feb 15 2017, 4:11 PM · SVN Loader
zack removed a parent task for T676: Google Code SVN import: Examine ingestion logs for errors and list them if any: T617: ingest Google Code Subversion repositories.
Feb 15 2017, 4:11 PM · SVN Loader
zack removed a subtask for T617: ingest Google Code Subversion repositories: T676: Google Code SVN import: Examine ingestion logs for errors and list them if any.
Feb 15 2017, 4:11 PM · Archive coverage, Origin-GoogleCode, SVN Loader
zack added a parent task for T328: svn / subversion loader: T617: ingest Google Code Subversion repositories.
Feb 15 2017, 4:08 PM · SVN Loader
zack added a subtask for T617: ingest Google Code Subversion repositories: T328: svn / subversion loader.
Feb 15 2017, 4:08 PM · Archive coverage, Origin-GoogleCode, SVN Loader
zack added a parent task for T617: ingest Google Code Subversion repositories: T367: ingest Google Code repositories.
Feb 15 2017, 4:08 PM · Archive coverage, Origin-GoogleCode, SVN Loader
zack removed a subtask for T328: svn / subversion loader: T617: ingest Google Code Subversion repositories.
Feb 15 2017, 4:07 PM · SVN Loader
zack removed a parent task for T617: ingest Google Code Subversion repositories: T328: svn / subversion loader.
Feb 15 2017, 4:07 PM · Archive coverage, Origin-GoogleCode, SVN Loader
zack removed a parent task for T328: svn / subversion loader: T367: ingest Google Code repositories.
Feb 15 2017, 4:06 PM · SVN Loader
ardumont added a comment to T617: ingest Google Code Subversion repositories.

Command used to trigger the production of tasks:

Feb 15 2017, 12:42 PM · Archive coverage, Origin-GoogleCode, SVN Loader

Feb 14 2017

olasd closed T680: fix off-by-1us timestamp in revisions coming from SVN loader as Resolved.

This issue has been solved and the fix deployed everywhere.

Feb 14 2017, 11:21 PM · Storage manager, SVN Loader, Restricted Project
olasd added a comment to T680: fix off-by-1us timestamp in revisions coming from SVN loader.

I just actually stopped the SVN loaders :)

Feb 14 2017, 2:52 PM · Storage manager, SVN Loader, Restricted Project
zack lowered the priority of T617: ingest Google Code Subversion repositories from High to Normal.
Feb 14 2017, 9:52 AM · Archive coverage, Origin-GoogleCode, SVN Loader
zack created T680: fix off-by-1us timestamp in revisions coming from SVN loader.
Feb 14 2017, 9:50 AM · Storage manager, SVN Loader, Restricted Project

Feb 12 2017

zack moved T617: ingest Google Code Subversion repositories from Restricted Project Column to Restricted Project Column on the Restricted Project board.
Feb 12 2017, 6:37 PM · Archive coverage, Origin-GoogleCode, SVN Loader
zack lowered the priority of T676: Google Code SVN import: Examine ingestion logs for errors and list them if any from High to Normal.
Feb 12 2017, 6:31 PM · SVN Loader
zack renamed T617: ingest Google Code Subversion repositories from Ingest googlecode's svn dump repositories to ingest Google Code Subversion repositories.
Feb 12 2017, 6:15 PM · Archive coverage, Origin-GoogleCode, SVN Loader

Feb 10 2017

ardumont renamed T676: Google Code SVN import: Examine ingestion logs for errors and list them if any from Google Code svn import: reference errors after ingestion to Google Code SVN import: Examine ingestion logs for errors and list them if any.
Feb 10 2017, 3:17 PM · SVN Loader
olasd renamed T676: Google Code SVN import: Examine ingestion logs for errors and list them if any from Google Code: reference errors after ingestion to Google Code svn import: reference errors after ingestion.
Feb 10 2017, 2:39 PM · SVN Loader
olasd renamed T676: Google Code SVN import: Examine ingestion logs for errors and list them if any from reference errors after ingestion to Google Code: reference errors after ingestion.
Feb 10 2017, 2:38 PM · SVN Loader