@jbertran Indeed, for the moment, locally, it is rather /api/1/provenance/
Sep 1 2016
/browse/<>/ should probably be replaced by a more meaningful and useful route as well.
Aug 30 2016
Local cache: we consider "revisions we haven't seen" to be revisions not seen in the past for a specific origin (the one being visited).
As per yesterday's F2F discussion, we are going to experiment (first) with 2.B (new revisions only with local cache).
The rationale is twofold:
- there is no loss of information with it (if we want, we can always further "unroll" transitive revisions later)
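To make the "new revisions only, with local cache" idea concrete, here is a minimal sketch. The class name, method name, and the in-memory dict are assumptions made for illustration only; they are not the actual implementation, which would presumably persist the cache somewhere.

```python
from collections import defaultdict


class SeenRevisionCache:
    """Hypothetical per-origin cache of revision ids seen in past visits."""

    def __init__(self):
        # origin identifier -> set of revision ids already seen for it
        self._seen = defaultdict(set)

    def new_revisions(self, origin, revisions):
        """Return the revisions not yet seen for `origin`, preserving order,
        and record them as seen for later visits of the same origin."""
        seen = self._seen[origin]
        fresh = []
        for rev in revisions:
            if rev not in seen:
                seen.add(rev)
                fresh.append(rev)
        return fresh
```

Note that the cache is keyed by origin: a revision already seen for one origin is still reported as new the first time it shows up while visiting another origin.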
Ack on all the above. Just a clarification on the revisions→origin mapping.
Aug 16 2016
Jul 26 2016
Jul 22 2016
Jul 19 2016
Jul 13 2016
Jul 11 2016
Jun 26 2016
Still missing quite a few entries, but the foundations are there, so OK for closing this. It will grow by itself now.
Jun 25 2016
I've reviewed @olasd's entries and completed some more.
I've also standardized how we add common information to terms, e.g. "Examples:" sections, or "Also known as:" and "Note:".
Jun 14 2016
Jun 13 2016
May 31 2016
No more errors.
May 30 2016
We need at least a draft of the glossary in good shape before the grand opening.
Remember to do this on: https://wg.softwareheritage.org/index.php?title=Glossary
Only 4132 out of 1379346 files were in error during checks (~0.30%).
May 29 2016
May 25 2016
May 13 2016
May 6 2016
About 120k done.
It's rather slow, around 1.1 files/s.
May 5 2016
- done in 86c1353
- packaged in python3-swh.fetcher.googlecode v0.0.3
- deployed on worker01
- worker01 is currently checking those archives
May 3 2016
Rescheduled and no more errors now.
After checks, there are:
- 342 files in error (problem during fetch time)
- 158 corrupted files (bad length or MD5 checksum mismatch)
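For reference, the kind of check that separates "corrupted" files from good ones can be sketched as follows. This is only an illustration: the function name is made up, and it assumes the expected length and a hex-encoded MD5 digest are available from the archive metadata, which may not match how the fetcher actually receives them.

```python
import hashlib
import os


def check_archive(path, expected_length, expected_md5_hex):
    """Return None if the file matches, else a short reason string.

    Hypothetical helper: `expected_length` is in bytes and
    `expected_md5_hex` is assumed to be a hexadecimal digest.
    """
    # Cheap check first: compare the on-disk size with the expected one.
    if os.path.getsize(path) != expected_length:
        return "bad length"

    # Hash the file in 1 MiB chunks to avoid loading it whole in memory.
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):
            digest.update(chunk)

    if digest.hexdigest() != expected_md5_hex.lower():
        return "md5 mismatch"
    return None
```

The length comparison comes first because it avoids reading the file at all when the size is already wrong.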
May 1 2016
It's a second round-trip.
worker01 is done.
Apr 27 2016
Apr 15 2016
Apr 13 2016
The type of repository can be extracted using the main API of the Google Code Archive. It's something extra that we should do in addition to the file download, but it'd be much better than applying heuristics to the downloaded files (no matter how trivial they would be).
Apr 12 2016
worker01 is now fetching and checking the source archives from the Google Code Archive.
repository: https://forge.softwareheritage.org/diffusion/61/
Apr 11 2016
Apr 9 2016
Attention: as of today, we have a bit less than two months left before Google erases *all* the original VCS repositories from Google Code. After that date, only the archived version will remain, which may be incorrect.
So we have a bit less than two months to report bugs to them.