Page MenuHomeSoftware Heritage
Feed Advanced Search

Oct 29 2018

olasd updated the task description for T1139: ingest major gitlab instances.
Oct 29 2018, 10:48 AM · Archive coverage, Origin-GitLab
douardda updated the task description for T1139: ingest major gitlab instances.
Oct 29 2018, 10:01 AM · Archive coverage, Origin-GitLab
douardda updated the task description for T1139: ingest major gitlab instances.
Oct 29 2018, 10:00 AM · Archive coverage, Origin-GitLab

Oct 22 2018

zack added a comment to T1262: wiki: Update suggestion box if `all Debian derivatives` can be noted as ingested.

the internship topic on this is now available here: https://wiki.softwareheritage.org/wiki/Ingest_all_Debian_derivatives_(internship)

Oct 22 2018, 7:59 PM · Archive coverage
olasd added a comment to T1262: wiki: Update suggestion box if `all Debian derivatives` can be noted as ingested.
In T1262#23695, @zack wrote:

That's a very good idea, which I'll be happy to draft as a proper internship proposal. Before doing so, however, can you confirm that, scheduling wise, tracking something like ~100 additional derivatives wouldn't be a problem for us in terms of load?

Oct 22 2018, 6:45 PM · Archive coverage
ardumont updated the task description for T1246: pypi loader: Analyze existing errors.
Oct 22 2018, 10:24 AM · Archive coverage, Origin-Pypi

Oct 18 2018

ardumont added a comment to T1246: pypi loader: Analyze existing errors.

Ok, so reworked the group_by_exception snippet to have a more sensible output:

Oct 18 2018, 11:27 AM · Archive coverage, Origin-Pypi

Oct 17 2018

ardumont added a comment to T1246: pypi loader: Analyze existing errors.

In any case, for now, like i said in [2], we will first schedule back
those 1409 origins in error.

Oct 17 2018, 4:22 PM · Archive coverage, Origin-Pypi

Oct 16 2018

zack added a comment to T1262: wiki: Update suggestion box if `all Debian derivatives` can be noted as ingested.
In T1262#23681, @olasd wrote:

Automating the addition of distributions from the Debian derivatives census to Software Heritage would probably be a good topic for an internship, e.g. a Google Summer of Code/Outreachy project.

Oct 16 2018, 5:41 PM · Archive coverage
ardumont added a comment to T1246: pypi loader: Analyze existing errors.

Here is the pypi report about the loading errors.

Oct 16 2018, 2:03 PM · Archive coverage, Origin-Pypi
olasd added a comment to T1262: wiki: Update suggestion box if `all Debian derivatives` can be noted as ingested.

Debian derivatives (that is, distributions that are forks of Debian, not Debian itself) are not being archived.

Oct 16 2018, 12:19 PM · Archive coverage
moranegg updated the task description for T1262: wiki: Update suggestion box if `all Debian derivatives` can be noted as ingested.
Oct 16 2018, 11:55 AM · Archive coverage
moranegg renamed T1262: wiki: Update suggestion box if `all Debian derivatives` can be noted as ingested from wiki: Update suggestion box to wiki: Update suggestion box if `all Debian derivatives` can be noted as ingested.
Oct 16 2018, 11:54 AM · Archive coverage

Oct 12 2018

zack added a comment to T1262: wiki: Update suggestion box if `all Debian derivatives` can be noted as ingested.

My point was just that you didn't list here the entries that you think have to be updated, so it wasn't actionable.
It would be great if you can update the task description with all the entries that you think deserve an update (even if you've doubts about them).

Oct 12 2018, 11:13 AM · Archive coverage
moranegg added a comment to T1262: wiki: Update suggestion box if `all Debian derivatives` can be noted as ingested.

I suggested this task instead of editing because I wasn't sure about item no° 3 (Debian).
And I didn't know if entries should be dropped or do we want to keep all items in the list and have a checkbox when we get to them.

Oct 12 2018, 11:09 AM · Archive coverage

Oct 11 2018

zack added a comment to T1262: wiki: Update suggestion box if `all Debian derivatives` can be noted as ingested.

Can you clarify the scope of this task?

Oct 11 2018, 8:22 PM · Archive coverage
moranegg triaged T1262: wiki: Update suggestion box if `all Debian derivatives` can be noted as ingested as Low priority.
Oct 11 2018, 2:41 PM · Archive coverage

Oct 5 2018

ardumont renamed T1246: pypi loader: Analyze existing errors from Analyze pypi errors to pypi loader: Analyze existing errors.
Oct 5 2018, 6:31 PM · Archive coverage, Origin-Pypi
ardumont added a comment to T1246: pypi loader: Analyze existing errors.

kibana dashboard will help in that matters (P311 because it's noisy).

Oct 5 2018, 6:30 PM · Archive coverage, Origin-Pypi
ardumont triaged T1246: pypi loader: Analyze existing errors as Normal priority.
Oct 5 2018, 6:28 PM · Archive coverage, Origin-Pypi
zack added a project to T1139: ingest major gitlab instances: Archive coverage.
Oct 5 2018, 4:34 PM · Archive coverage, Origin-GitLab

Sep 21 2018

ardumont closed T421: PyPI loader, a subtask of T419: ingest PyPI into the Software Heritage archive (meta task), as Resolved.
Sep 21 2018, 6:35 PM · Archive coverage, Origin-Pypi

Sep 20 2018

ardumont closed T1181: pypi: Schedule ingestion, a subtask of T419: ingest PyPI into the Software Heritage archive (meta task), as Resolved.
Sep 20 2018, 11:17 AM · Archive coverage, Origin-Pypi
ardumont closed T1181: pypi: Schedule ingestion as Resolved.
Sep 20 2018, 11:17 AM · Archive coverage, Origin-Pypi
ardumont added a comment to T1181: pypi: Schedule ingestion.

Now, it's scheduled. Just need to wait for the swh-scheduler-runner.service to finish its loop on task_types.

Sep 20 2018, 9:52 AM · Archive coverage, Origin-Pypi
ardumont added a comment to T1181: pypi: Schedule ingestion.
swhscheduler@saatchi:~$ python3 -m swh.scheduler.cli task list-pending -t swh-lister-pypi
Found 1 tasks
Sep 20 2018, 9:48 AM · Archive coverage, Origin-Pypi
ardumont updated the task description for T1181: pypi: Schedule ingestion.
Sep 20 2018, 9:47 AM · Archive coverage, Origin-Pypi
ardumont added a comment to T1181: pypi: Schedule ingestion.

Schedule the lister-pypi:

Sep 20 2018, 9:47 AM · Archive coverage, Origin-Pypi

Sep 19 2018

ardumont changed the status of T1181: pypi: Schedule ingestion, a subtask of T419: ingest PyPI into the Software Heritage archive (meta task), from Open to Work in Progress.
Sep 19 2018, 7:52 PM · Archive coverage, Origin-Pypi
ardumont changed the status of T1181: pypi: Schedule ingestion from Open to Work in Progress.
Sep 19 2018, 7:52 PM · Archive coverage, Origin-Pypi
ardumont closed T879: Reschedule googlecode svn origins from scratch, a subtask of T617: ingest Google Code Subversion repositories, as Resolved.
Sep 19 2018, 1:56 PM · Archive coverage, Origin-GoogleCode, SVN Loader

Sep 6 2018

ardumont updated the task description for T1181: pypi: Schedule ingestion.
Sep 6 2018, 5:38 PM · Archive coverage, Origin-Pypi
ardumont renamed T1181: pypi: Schedule ingestion from pypi: Trigger listing task to pypi: Schedule ingestion.
Sep 6 2018, 5:37 PM · Archive coverage, Origin-Pypi
ardumont triaged T1181: pypi: Schedule ingestion as Normal priority.
Sep 6 2018, 5:31 PM · Archive coverage, Origin-Pypi
ardumont closed T422: PyPI lister, a subtask of T419: ingest PyPI into the Software Heritage archive (meta task), as Resolved.
Sep 6 2018, 5:31 PM · Archive coverage, Origin-Pypi

Sep 4 2018

ardumont closed T1111: ingest GitLab.com (meta-task) as Resolved.
Sep 4 2018, 6:17 PM · Archive coverage, General, Origin-GitLab

Aug 24 2018

ardumont added a comment to T1111: ingest GitLab.com (meta-task).

A priori, at current speed, there remains ~7.5 days till the end of the gitlab origins ingestion.

Aug 24 2018, 12:06 PM · Archive coverage, General, Origin-GitLab

Aug 3 2018

ardumont added a comment to T682: Ingest Google Code Mercurial repositories.

First pass have been done complete a while back.

Aug 3 2018, 3:05 PM · Archive coverage, Mercurial loader
ardumont added a subtask for T682: Ingest Google Code Mercurial repositories: T1156: Fix release targets of already loaded mercurial type origins.
Aug 3 2018, 3:03 PM · Archive coverage, Mercurial loader
ardumont closed T329: hg / mercurial loader, a subtask of T593: ingest bitbucket hg/mercurial repositories, as Resolved.
Aug 3 2018, 3:03 PM · Archive coverage, Origin-Bitbucket
ardumont closed T329: hg / mercurial loader, a subtask of T682: Ingest Google Code Mercurial repositories, as Resolved.
Aug 3 2018, 3:03 PM · Archive coverage, Mercurial loader

Aug 1 2018

ardumont changed the status of T422: PyPI lister, a subtask of T419: ingest PyPI into the Software Heritage archive (meta task), from Open to Work in Progress.
Aug 1 2018, 3:10 PM · Archive coverage, Origin-Pypi
ardumont changed the status of T421: PyPI loader, a subtask of T419: ingest PyPI into the Software Heritage archive (meta task), from Open to Work in Progress.
Aug 1 2018, 3:10 PM · Archive coverage, Origin-Pypi

Jul 26 2018

ardumont closed T420: mirror PyPI, a subtask of T419: ingest PyPI into the Software Heritage archive (meta task), as Wontfix.
Jul 26 2018, 3:33 PM · Archive coverage, Origin-Pypi

Jul 25 2018

ardumont changed the status of T1111: ingest GitLab.com (meta-task) from Open to Work in Progress.
Jul 25 2018, 6:29 PM · Archive coverage, General, Origin-GitLab

Jul 20 2018

ardumont closed T1151: Start listing gitlab.com as Resolved.
Jul 20 2018, 1:21 PM · Scheduling utilities, Archive coverage, General, Origin-GitLab
ardumont closed T1151: Start listing gitlab.com , a subtask of T1111: ingest GitLab.com (meta-task), as Resolved.
Jul 20 2018, 1:21 PM · Archive coverage, General, Origin-GitLab

Jul 19 2018

ardumont changed the status of T1151: Start listing gitlab.com from Open to Work in Progress.
Jul 19 2018, 11:46 AM · Scheduling utilities, Archive coverage, General, Origin-GitLab
ardumont changed the status of T1151: Start listing gitlab.com , a subtask of T1111: ingest GitLab.com (meta-task), from Open to Work in Progress.
Jul 19 2018, 11:46 AM · Archive coverage, General, Origin-GitLab

Jul 18 2018

ardumont updated the task description for T1151: Start listing gitlab.com .
Jul 18 2018, 6:39 PM · Scheduling utilities, Archive coverage, General, Origin-GitLab
ardumont triaged T1151: Start listing gitlab.com as High priority.
Jul 18 2018, 4:28 PM · Scheduling utilities, Archive coverage, General, Origin-GitLab

Jul 17 2018

ardumont closed T989: Implement GitLab lister, a subtask of T1111: ingest GitLab.com (meta-task), as Resolved.
Jul 17 2018, 6:46 PM · Archive coverage, General, Origin-GitLab

Jul 5 2018

ardumont updated subscribers of T1111: ingest GitLab.com (meta-task).

Some repositories @olasd mentioned to me that qualifies as gitlab repositories (in parenthesis, their current size in term of repositories):

Jul 5 2018, 9:36 AM · Archive coverage, General, Origin-GitLab

Jun 25 2018

ardumont changed the status of T989: Implement GitLab lister, a subtask of T1111: ingest GitLab.com (meta-task), from Open to Work in Progress.
Jun 25 2018, 3:13 PM · Archive coverage, General, Origin-GitLab

Jun 19 2018

zack edited projects for T682: Ingest Google Code Mercurial repositories, added: Archive coverage; removed Archive content.
Jun 19 2018, 3:30 PM · Archive coverage, Mercurial loader
zack edited projects for T592: ingest bitbucket git repositories, added: Archive coverage; removed Archive content.
Jun 19 2018, 3:29 PM · Archive coverage, Origin-Bitbucket
zack edited projects for T561: ingest bitbucket (meta task), added: Archive coverage; removed Archive content.
Jun 19 2018, 3:29 PM · Archive coverage, Origin-Bitbucket
zack edited projects for T419: ingest PyPI into the Software Heritage archive (meta task), added: Archive coverage; removed Archive content.
Jun 19 2018, 3:29 PM · Archive coverage, Origin-Pypi
zack edited projects for T376: ingest git.eclipse.org repositories, added: Archive coverage; removed Archive content.
Jun 19 2018, 3:29 PM · Archive coverage
zack edited projects for T593: ingest bitbucket hg/mercurial repositories, added: Archive coverage; removed Archive content.
Jun 19 2018, 3:28 PM · Archive coverage, Origin-Bitbucket
zack edited projects for T367: ingest Google Code repositories, added: Archive coverage; removed Archive content.
Jun 19 2018, 3:28 PM · Archive coverage, Restricted Project
zack edited projects for T617: ingest Google Code Subversion repositories, added: Archive coverage; removed Archive content.
Jun 19 2018, 3:28 PM · Archive coverage, Origin-GoogleCode, SVN Loader
zack edited projects for T1002: ingest Hackage, the Haskell package repository (meta task), added: Archive coverage; removed Archive content, General.
Jun 19 2018, 3:27 PM · Hackage loader, Hackage lister, Archive coverage
zack edited projects for T1086: ingest Debian's Alioth (archived) repositories (meta-task), added: Archive coverage; removed Archive content, General.
Jun 19 2018, 3:27 PM · Archive coverage
zack edited projects for T312: Gitorious import: ingest repositories, added: Archive coverage; removed Archive content.
Jun 19 2018, 3:27 PM · Archive coverage, Restricted Project, Origin-Gitorious, Format-Git
zack edited projects for T673: ingest Google Code Git repositories, added: Archive coverage; removed Archive content.
Jun 19 2018, 3:27 PM · Archive coverage
zack edited projects for T1111: ingest GitLab.com (meta-task), added: Archive coverage; removed Archive content.
Jun 19 2018, 3:27 PM · Archive coverage, General, Origin-GitLab
zack created Archive coverage.
Jun 19 2018, 3:24 PM