
Oct 10 2018

olasd triaged T1260: Extend the release object model to allow synthetic objects as Normal priority.
Oct 10 2018, 1:58 PM · Archive content
olasd triaged T1258: Synthesize release objects for all upstream things that match the concept of a release as Normal priority.
Oct 10 2018, 1:22 PM · Archive content
zack added a comment to T1257: Formalize the default branch convention for snapshots.
  • the default branch for snapshots is defined to be HEAD.
    • if the concept of HEAD exists with the same name in the upstream VCS (e.g. git, svn), this branch should be a literal pointer to the corresponding archived object
    • if the concept of HEAD doesn't exist with the same name in the upstream VCS (e.g. mercurial), this branch should be an alias pointing at the default branch, named using the upstream VCS context (e.g. in the mercurial case, that would be an alias for the tip of the default branch)
    • if the concept of a default branch/version doesn't exist in the upstream VCS, no HEAD branch should exist in the snapshot
Oct 10 2018, 1:21 PM · Archive content
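
The three cases of the T1257 HEAD convention above can be sketched as follows. This is a minimal illustration, not the archive's actual data model: `Target`, `Alias`, and `add_head` are hypothetical names introduced here, and the caller is assumed to know whether the upstream VCS has a native HEAD or only a default branch.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Union


@dataclass(frozen=True)
class Target:
    """Literal pointer to an archived object (hypothetical model)."""
    object_type: str  # e.g. "revision" or "release"
    object_id: str


@dataclass(frozen=True)
class Alias:
    """Alias pointing at another branch of the same snapshot, by name."""
    branch_name: str


Branch = Union[Target, Alias]


def add_head(
    branches: Dict[str, Branch],
    upstream_head: Optional[Target] = None,
    upstream_default_branch: Optional[str] = None,
) -> Dict[str, Branch]:
    """Return a copy of `branches` with HEAD set per the convention."""
    result = dict(branches)
    if upstream_head is not None:
        # Upstream VCS has its own HEAD (e.g. git, svn): literal pointer.
        result["HEAD"] = upstream_head
    elif upstream_default_branch is not None:
        # No native HEAD (e.g. mercurial): alias to the default branch.
        result["HEAD"] = Alias(upstream_default_branch)
    # Otherwise upstream has no default branch concept: no HEAD at all.
    return result
```

For a mercurial origin, `add_head({"default": Target("revision", "abc123")}, upstream_default_branch="default")` would yield a snapshot whose HEAD is `Alias("default")`.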
olasd triaged T1257: Formalize the default branch convention for snapshots as High priority.
Oct 10 2018, 11:58 AM · Archive content

Oct 4 2018

zack added a comment to T838: SQL storage: drop the entity tables.

agreed, they should be removed (I've updated the task title accordingly)

Oct 4 2018, 12:19 PM · Storage manager, Archive content
zack renamed T838: SQL storage: drop the entity tables from Decide what to do with the entity tables to SQL storage: drop the entity tables.
Oct 4 2018, 12:18 PM · Storage manager, Archive content
zack removed a parent task for T1156: Fix release targets of already loaded mercurial type origins: T336: "save code now".
Oct 4 2018, 11:46 AM · Archive content

Oct 3 2018

olasd added a comment to T830: Remove tables occurrence and occurrence_history.

All the old visits have now been migrated to snapshots.

Oct 3 2018, 3:04 PM · Storage manager, Archive content
olasd claimed T830: Remove tables occurrence and occurrence_history.

There's some work in progress by @zack in branch feature/schema-revamp of rDSTO, I'll pick that up to drive it to completion.

Oct 3 2018, 12:18 PM · Storage manager, Archive content
olasd renamed T830: Remove tables occurrence and occurrence_history from Remove table occurrence to Remove tables occurrence and occurrence_history.
Oct 3 2018, 12:05 PM · Storage manager, Archive content

Sep 30 2018

ardumont added a comment to T958: googlecode import: Clean up googlecode origin's origin_visits.

And also make sure the one visit date is the right one:

Sep 30 2018, 11:39 AM · SVN Loader, Origin-GoogleCode, Archive content

Sep 28 2018

ardumont closed T946: loader-svn: googlecode import: UnicodeDecodeError in user svn properties fails the loading, a subtask of T879: Reschedule googlecode svn origins from scratch, as Resolved.
Sep 28 2018, 3:04 PM · Origin-GoogleCode, SVN Loader, Archive content

Sep 27 2018

zack assigned T1211: reingest missing early objects to olasd.
Sep 27 2018, 11:38 AM · Archive content
zack triaged T1211: reingest missing early objects as Normal priority.
Sep 27 2018, 11:36 AM · Archive content

Sep 21 2018

ardumont added a comment to T1156: Fix release targets of already loaded mercurial type origins.

I added T336 as parent so that the save code now is deployed only when this one is fixed.

Sep 21 2018, 9:43 AM · Archive content

Sep 20 2018

ardumont closed T1158: hg loader: Clean up wrong snapshots/releases during hg loading of googlecode, a subtask of T1156: Fix release targets of already loaded mercurial type origins, as Resolved.
Sep 20 2018, 11:29 AM · Archive content
ardumont closed T1158: hg loader: Clean up wrong snapshots/releases during hg loading of googlecode as Resolved.
Sep 20 2018, 11:29 AM · Archive content
ardumont added a comment to T1156: Fix release targets of already loaded mercurial type origins.

Related D409

Sep 20 2018, 11:27 AM · Archive content

Sep 19 2018

ardumont closed T879: Reschedule googlecode svn origins from scratch as Resolved.

That's been done for a while now.

Sep 19 2018, 1:56 PM · Origin-GoogleCode, SVN Loader, Archive content

Sep 12 2018

olasd added a comment to P286 mercurial loading release issue - Cleanup release/snapshot/origin_visits procedure.

Spot checks:

Sep 12 2018, 5:01 PM · Archive content

Aug 24 2018

ardumont added a comment to T1158: hg loader: Clean up wrong snapshots/releases during hg loading of googlecode.

Heads up on this btw.

Aug 24 2018, 11:04 AM · Archive content
ardumont edited P286 mercurial loading release issue - Cleanup release/snapshot/origin_visits procedure.
Aug 24 2018, 11:03 AM · Archive content

Aug 22 2018

ardumont added a comment to P286 mercurial loading release issue - Cleanup release/snapshot/origin_visits procedure.

Last edit: all data is now written to a "temporary" table prior to the actual cleanup (so far this was only done for release and snapshot; it's now also done for origin_visit and fetch_history).

Aug 22 2018, 3:44 PM · Archive content
ardumont edited P286 mercurial loading release issue - Cleanup release/snapshot/origin_visits procedure.
Aug 22 2018, 3:41 PM · Archive content

Aug 3 2018

ardumont added a parent task for T1156: Fix release targets of already loaded mercurial type origins: T682: Ingest Google Code Mercurial repositories.
Aug 3 2018, 3:03 PM · Archive content

Jul 26 2018

zack added a comment to T1158: hg loader: Clean up wrong snapshots/releases during hg loading of googlecode.

@zack or @olasd if you have some time to review P286 at one point in time, that would be awesome.

Jul 26 2018, 6:18 PM · Archive content
ardumont added a comment to T1158: hg loader: Clean up wrong snapshots/releases during hg loading of googlecode.

Local tests are fine.

Jul 26 2018, 3:19 PM · Archive content
ardumont added a subtask for T1156: Fix release targets of already loaded mercurial type origins: T1155: Mercurial loader: release target is invalid.
Jul 26 2018, 3:17 PM · Archive content
ardumont updated subscribers of T1158: hg loader: Clean up wrong snapshots/releases during hg loading of googlecode.

@zack or @olasd if you have some time to review P286 at one point in time, that would be awesome.

Jul 26 2018, 3:13 PM · Archive content
ardumont added a comment to T1156: Fix release targets of already loaded mercurial type origins.

I added T336 as parent so that the save code now is deployed only when this one is fixed.

Jul 26 2018, 3:09 PM · Archive content
ardumont added a parent task for T1156: Fix release targets of already loaded mercurial type origins: T336: "save code now".
Jul 26 2018, 3:08 PM · Archive content
ardumont added a comment to P286 mercurial loading release issue - Cleanup release/snapshot/origin_visits procedure.

Last edit [1] to update code comments.

Jul 26 2018, 2:37 PM · Archive content
ardumont edited P286 mercurial loading release issue - Cleanup release/snapshot/origin_visits procedure.
Jul 26 2018, 2:36 PM · Archive content
ardumont edited P286 mercurial loading release issue - Cleanup release/snapshot/origin_visits procedure.
Jul 26 2018, 2:18 PM · Archive content
ardumont edited P286 mercurial loading release issue - Cleanup release/snapshot/origin_visits procedure.
Jul 26 2018, 1:33 PM · Archive content
ardumont added a comment to T1158: hg loader: Clean up wrong snapshots/releases during hg loading of googlecode.

P286 is still WIP, as a so-far unexplained exception occurs.

Jul 26 2018, 12:03 PM · Archive content
ardumont edited P286 mercurial loading release issue - Cleanup release/snapshot/origin_visits procedure.
Jul 26 2018, 12:02 PM · Archive content
ardumont edited P286 mercurial loading release issue - Cleanup release/snapshot/origin_visits procedure.
Jul 26 2018, 11:57 AM · Archive content

Jul 25 2018

ardumont changed the status of T1158: hg loader: Clean up wrong snapshots/releases during hg loading of googlecode from Open to Work in Progress.
Jul 25 2018, 6:28 PM · Archive content
ardumont changed the status of T1158: hg loader: Clean up wrong snapshots/releases during hg loading of googlecode, a subtask of T1156: Fix release targets of already loaded mercurial type origins, from Open to Work in Progress.
Jul 25 2018, 6:28 PM · Archive content
ardumont added a comment to T1158: hg loader: Clean up wrong snapshots/releases during hg loading of googlecode.

P286 is still WIP, as a so-far unexplained exception occurs.

Jul 25 2018, 6:28 PM · Archive content
ardumont updated the title for P286 mercurial loading release issue - Cleanup release/snapshot/origin_visits procedure from Cleanup wrong data from mercurial loading to Cleanup wrong data from mercurial loading - wip.
Jul 25 2018, 6:23 PM · Archive content
ardumont created P286 mercurial loading release issue - Cleanup release/snapshot/origin_visits procedure.
Jul 25 2018, 6:23 PM · Archive content
ardumont added a comment to T1158: hg loader: Clean up wrong snapshots/releases during hg loading of googlecode.

To find those releases:

Jul 25 2018, 2:25 PM · Archive content
ardumont renamed T1158: hg loader: Clean up wrong snapshots/releases during hg loading of googlecode from hg loader: Clean up wrong releases during hg loading of googlecode to hg loader: Clean up wrong snapshots/releases during hg loading of googlecode.
Jul 25 2018, 2:04 PM · Archive content
ardumont updated the task description for T1159: hg loader: Schedule oneshot tasks for googlecode origin ingestion.
Jul 25 2018, 11:43 AM · Archive content
ardumont triaged T1159: hg loader: Schedule oneshot tasks for googlecode origin ingestion as Normal priority.
Jul 25 2018, 11:43 AM · Archive content
ardumont renamed T1158: hg loader: Clean up wrong snapshots/releases during hg loading of googlecode from Clean up wrong releases to hg loader: Clean up wrong releases during hg loading of googlecode.
Jul 25 2018, 11:38 AM · Archive content
ardumont triaged T1158: hg loader: Clean up wrong snapshots/releases during hg loading of googlecode as Normal priority.
Jul 25 2018, 11:37 AM · Archive content

Jul 19 2018

anlambert updated the task description for T1156: Fix release targets of already loaded mercurial type origins.
Jul 19 2018, 4:17 PM · Archive content
anlambert triaged T1156: Fix release targets of already loaded mercurial type origins as Normal priority.
Jul 19 2018, 4:15 PM · Archive content

Jun 19 2018

zack edited Description on Archive content.
Jun 19 2018, 3:22 PM
zack added a subtask for T1111: ingest GitLab.com (meta-task): T989: Implement GitLab lister.
Jun 19 2018, 3:21 PM · Archive coverage, General, Origin-GitLab
zack triaged T1111: ingest GitLab.com (meta-task) as High priority.
Jun 19 2018, 3:21 PM · Archive coverage, General, Origin-GitLab

Jun 6 2018

zack triaged T1086: ingest Debian's Alioth (archived) repositories (meta-task) as Normal priority.
Jun 6 2018, 1:42 PM · Archive coverage
zack added a project to T1002: ingest Hackage, the Haskell package repository (meta task): Archive content.
Jun 6 2018, 1:41 PM · Hackage loader, Hackage lister, Archive coverage

Apr 12 2018

ardumont closed T312: Gitorious import: ingest repositories as Resolved.
Apr 12 2018, 2:05 PM · Archive coverage, Restricted Project, Origin-Gitorious, Format-Git
ardumont closed T674: Gitorious import: Examine ingestion logs for errors and list them if any, a subtask of T312: Gitorious import: ingest repositories, as Resolved.
Apr 12 2018, 2:04 PM · Archive coverage, Restricted Project, Origin-Gitorious, Format-Git

Mar 21 2018

ardumont added a comment to T682: Ingest Google Code Mercurial repositories.
$ cat ~/.config/swh/kibana/query.yml
indexes:
  - swh_workers-2018.03.*
Mar 21 2018, 3:07 PM · Archive coverage, Mercurial loader
ardumont added a comment to T682: Ingest Google Code Mercurial repositories.

Why are no errors reported at all in the logs (or any logs, for that matter)? With all filters removed, the logs seem to stop around the 7th of March 2018.

Mar 21 2018, 2:29 PM · Archive coverage, Mercurial loader
ardumont added a comment to T682: Ingest Google Code Mercurial repositories.

Current status, the queue is empty.

Mar 21 2018, 12:31 PM · Archive coverage, Mercurial loader

Mar 19 2018

olasd changed the status of T997: Debian loader sometimes thinks a package has been loaded when it has not, a subtask of T995: Investigate and fix snapshots with broken links, from Open to Work in Progress.
Mar 19 2018, 7:55 PM · Archive content

Mar 16 2018

olasd triaged T995: Investigate and fix snapshots with broken links as High priority.
Mar 16 2018, 5:53 PM · Archive content
olasd added a subtask for T829: Remove duplication between fetch_history and origin_visit: T994: origin_visit: distinguish "fetch date" and "injection date".
Mar 16 2018, 4:56 PM · Storage manager, Archive content
olasd added a parent task for T994: origin_visit: distinguish "fetch date" and "injection date": T829: Remove duplication between fetch_history and origin_visit.
Mar 16 2018, 4:56 PM · Archive content
olasd triaged T994: origin_visit: distinguish "fetch date" and "injection date" as Normal priority.
Mar 16 2018, 4:56 PM · Archive content

Mar 14 2018

ardumont added a comment to T682: Ingest Google Code Mercurial repositories.

Finally, rescheduled using swh-scheduler.
Heading towards T986.

Mar 14 2018, 3:25 PM · Archive coverage, Mercurial loader
ardumont added a comment to T682: Ingest Google Code Mercurial repositories.

As in https://forge.softwareheritage.org/T879#16396, a limit of 2 GiB on dump size was used to separate origins.
The current lists are stored at:

Mar 14 2018, 2:17 PM · Archive coverage, Mercurial loader
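
The size-based split mentioned in the T682 comment above could look roughly like this. It is only a sketch: `split_by_dump_size` is a hypothetical helper, the dump sizes are assumed to be known in bytes, and whether a dump of exactly 2 GiB counts as small or large is an assumption made here.

```python
from typing import Dict, List, Tuple

LIMIT_BYTES = 2 * 1024**3  # 2 GiB


def split_by_dump_size(
    sizes: Dict[str, int], limit: int = LIMIT_BYTES
) -> Tuple[List[str], List[str]]:
    """Split origins into (small, large) lists by dump size in bytes.

    Assumption: a dump of exactly `limit` bytes is treated as small.
    """
    small = sorted(origin for origin, size in sizes.items() if size <= limit)
    large = sorted(origin for origin, size in sizes.items() if size > limit)
    return small, large
```

Each resulting list can then be scheduled separately.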

Mar 5 2018

anlambert closed T980: Identify and fix releases that are stored as revisions as Wontfix.

As this is not a bug, I am closing that task.

Mar 5 2018, 10:55 AM · Archive content

Mar 2 2018

rdicosmo added a comment to T980: Identify and fix releases that are stored as revisions.

This example comes from parmap, see https://github.com/rdicosmo/parmap/

Mar 2 2018, 6:58 PM · Archive content
zack added a comment to T980: Identify and fix releases that are stored as revisions.

Thanks for the report. I haven't looked into this specific case, so it's indeed possible it's a bug, but in the general case this is potentially normal behavior.
Branches can point to either releases or revisions (or, in fact, anything at all).
In the Git case, which is where your example seems to come from, a plain "git tag" creates a ref pointing to a revision, whereas "git tag -a" (annotated tag) creates a release object (pointing to a revision) and a ref pointing to that release object. So an author who switched from "git tag" to "git tag -a" would explain what you have seen.

Mar 2 2018, 5:53 PM · Archive content
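
The distinction described in the T980 comment above can be modelled in a few lines. This is a toy model, not Git or archive code: `Revision`, `Release`, `make_tag`, and `target_kind` are hypothetical names used only to show why refs from the same repository can end up pointing at different object types.

```python
from dataclasses import dataclass
from typing import Dict, Union


@dataclass(frozen=True)
class Revision:
    id: str


@dataclass(frozen=True)
class Release:
    name: str
    target: Revision


Ref = Union[Revision, Release]


def make_tag(
    refs: Dict[str, Ref], name: str, revision: Revision, annotated: bool = False
) -> None:
    """Record a tag the way the comment describes the two Git cases."""
    if annotated:
        # "git tag -a": a release object is created, the ref points to it.
        refs[name] = Release(name=name, target=revision)
    else:
        # plain "git tag": the ref points directly to the revision.
        refs[name] = revision


def target_kind(refs: Dict[str, Ref], name: str) -> str:
    """Return "revision" or "release" for the object a ref points to."""
    return type(refs[name]).__name__.lower()
```

A repository whose author tagged `v1.0` with a plain tag and `v1.1` with an annotated one would therefore show one ref targeting a revision and the other targeting a release.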
zack triaged T980: Identify and fix releases that are stored as revisions as Normal priority.
Mar 2 2018, 5:49 PM · Archive content

Feb 24 2018

ardumont closed T976: google import: Clean up wrong revisions as Resolved.
Feb 24 2018, 5:32 PM · Archive content, Mercurial loader
ardumont closed T976: google import: Clean up wrong revisions, a subtask of T682: Ingest Google Code Mercurial repositories, as Resolved.
Feb 24 2018, 5:32 PM · Archive coverage, Mercurial loader

Feb 23 2018

ardumont changed the status of T976: google import: Clean up wrong revisions from Open to Work in Progress.

Status:

  • [DONE] backup
  • [IN-PROGRESS] Clean up in progress
Feb 23 2018, 10:30 AM · Archive content, Mercurial loader
ardumont changed the status of T976: google import: Clean up wrong revisions, a subtask of T682: Ingest Google Code Mercurial repositories, from Open to Work in Progress.
Feb 23 2018, 10:30 AM · Archive coverage, Mercurial loader

Feb 21 2018

ardumont added a comment to T976: google import: Clean up wrong revisions.

Thanks for the heads up.

Feb 21 2018, 6:46 PM · Archive content, Mercurial loader
olasd added a comment to T976: google import: Clean up wrong revisions.

FWIW the backup has now completed.

Feb 21 2018, 4:57 PM · Archive content, Mercurial loader
fiendish added a comment to T970: mercurial loader: What to do in case of .hgtags?.

I agree with taking tags from both sides and discarding all lines that don't fit the pattern.

Feb 21 2018, 4:28 AM · Archive content, Mercurial loader
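
The approach suggested in the T970 comment above (keep tags from both sides, drop every line that doesn't fit the pattern) might be sketched like this. It is not the loader's code: `parse_hgtags` is a hypothetical helper, and it assumes the usual `.hgtags` line layout of a 40-character hexadecimal node, one space, then the tag name, with later entries overriding earlier ones.

```python
import re
from typing import Dict, Iterable

# One well-formed entry: 40 hex digits, a single space, then the tag name.
HGTAGS_LINE = re.compile(r"^([0-9a-f]{40}) (.+)$")


def parse_hgtags(lines: Iterable[str]) -> Dict[str, str]:
    """Map tag name to node, skipping lines that don't fit the pattern."""
    tags: Dict[str, str] = {}
    for raw in lines:
        match = HGTAGS_LINE.match(raw.rstrip("\r\n"))
        if match is None:
            # Conflict markers or corrupted lines: ignore and carry on.
            continue
        node, name = match.groups()
        tags[name] = node  # assumption: a later entry wins
    return tags
```

Because merge-conflict markers never match the pattern, feeding a conflicted file through this function keeps the entries from both sides and silently drops the markers.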

Feb 20 2018

ardumont closed T965: googlecode import: Analyze and fix errors as Resolved.
Feb 20 2018, 4:49 PM · Archive content, Mercurial loader
ardumont closed T965: googlecode import: Analyze and fix errors, a subtask of T682: Ingest Google Code Mercurial repositories, as Resolved.
Feb 20 2018, 4:49 PM · Archive coverage, Mercurial loader
ardumont added a comment to T965: googlecode import: Analyze and fix errors.

Latest version with fixes deployed.
Actions on T964 and cleanup on T976 still remain before scheduling another run.

Feb 20 2018, 4:49 PM · Archive content, Mercurial loader
ardumont added a comment to T976: google import: Clean up wrong revisions.

postgres@prado:~$ pg_dump --format tar --table revision_history --table revision softwareheritage | gzip -c - > /srv/remote-backups/postgres/T970/revision-revision-history.tar.gz

Feb 20 2018, 4:40 PM · Archive content, Mercurial loader
ardumont updated the task description for T976: google import: Clean up wrong revisions.
Feb 20 2018, 4:25 PM · Archive content, Mercurial loader
ardumont added a comment to T976: google import: Clean up wrong revisions.

Running a full backup of the table for the handful of revisions concerned here is a bit overkill! (better be safe than sorry and all that, but still...)

Feb 20 2018, 4:23 PM · Archive content, Mercurial loader
olasd added a comment to T976: google import: Clean up wrong revisions.

In any case, we need to make a backup dump prior to touching those tables!

Backup running on prado:

postgres@prado:~$ pg_dump --format tar --table revision_history --table revision softwareheritage | gzip -c - > /srv/remote-backups/postgres/T970/revision-revision-history.tar.gz
Feb 20 2018, 4:14 PM · Archive content, Mercurial loader
ardumont added a comment to T976: google import: Clean up wrong revisions.

In any case, we need to make a backup dump prior to touching those tables!

Feb 20 2018, 3:45 PM · Archive content, Mercurial loader
ardumont added a comment to T976: google import: Clean up wrong revisions.
  1. We can 'simply' delete the revisions of type 'hg', as no other mercurial revisions exist as of today.
Feb 20 2018, 2:43 PM · Archive content, Mercurial loader
ardumont renamed T976: google import: Clean up wrong revisions from google import: Clean up wrong revision to google import: Clean up wrong revisions.
Feb 20 2018, 12:58 PM · Archive content, Mercurial loader
ardumont created T976: google import: Clean up wrong revisions.
Feb 20 2018, 12:57 PM · Archive content, Mercurial loader

Feb 16 2018

ardumont closed T970: mercurial loader: What to do in case of .hgtags? as Resolved by committing rDLDHGe0b48c6c6e9a: bundle20_loader: Warn about wrong pattern in tags & continue loading.
Feb 16 2018, 4:00 PM · Archive content, Mercurial loader
ardumont closed T970: mercurial loader: What to do in case of .hgtags?, a subtask of T965: googlecode import: Analyze and fix errors, as Resolved.
Feb 16 2018, 4:00 PM · Archive content, Mercurial loader
ardumont added a comment to T970: mercurial loader: What to do in case of .hgtags?.

I've chosen 3. to comply with the doc's suggestion.
As usual nothing is set in stone.

Feb 16 2018, 3:59 PM · Archive content, Mercurial loader
ardumont updated the task description for T970: mercurial loader: What to do in case of .hgtags?.
Feb 16 2018, 3:58 PM · Archive content, Mercurial loader
ardumont created T970: mercurial loader: What to do in case of .hgtags?.
Feb 16 2018, 3:43 PM · Archive content, Mercurial loader
ardumont added a comment to T965: googlecode import: Analyze and fix errors.

Well, sure, if the .hgtags is corrupted...

Feb 16 2018, 3:26 PM · Archive content, Mercurial loader
ardumont added a comment to T965: googlecode import: Analyze and fix errors.

Last relevant error found in crossing multiple streams... (noooooo).

Feb 16 2018, 3:19 PM · Archive content, Mercurial loader
ardumont added a comment to T965: googlecode import: Analyze and fix errors.

I have a problem in the log fetcher (that's why it's said to be incomplete in the task description).

Feb 16 2018, 3:11 PM · Archive content, Mercurial loader
ardumont added a comment to T965: googlecode import: Analyze and fix errors.

I have a problem in the log fetcher (that's why it's said to be incomplete in the task description).

Feb 16 2018, 2:29 PM · Archive content, Mercurial loader
ardumont added a comment to T965: googlecode import: Analyze and fix errors.
"PatoolError('error extracting /srv/storage/space/m": 19,
Feb 16 2018, 2:26 PM · Archive content, Mercurial loader