Page MenuHomeSoftware Heritage

FindEarliestRevision: Add earliest_ts and rev_occurrences columns
ClosedPublic

Authored by vlorentz on Oct 11 2022, 8:20 PM.

Details

Summary

This is an import from the replication package in
https://annex.softwareheritage.org/public/dataset/license-blobs/2021-03-23/
based on the closest commit I could find + a merge commit
(to reflect the actual history)

Diff Detail

Event Timeline

Harbormaster returned this revision to the author for changes because remote builds failed.Oct 11 2022, 8:21 PM
Harbormaster failed remote builds in B32232: Diff 31281!

ugh that's nonesense, there are undefined vars because of 352d27f4b3f6b230757966ea73eb7c123d56337c

actually, that's salveageable

Harbormaster returned this revision to the author for changes because remote builds failed.Oct 11 2022, 8:31 PM
Harbormaster returned this revision to the author for changes because remote builds failed.Oct 11 2022, 8:33 PM
Harbormaster failed remote builds in B32233: Diff 31282!

not sure I understand what's going on here.

java/src/main/java/org/softwareheritage/graph/utils/FindEarliestRevision.java
2

-2022 ?

3–4

I guess it comes from the merge, but why adding individual copyright statements here?

63

ah it took me a bit to figure out what TSV was... I see no advantage of using this instead of CSV (even if the sep id the tab)

95

'misssing' is MIN_VALUE and epoch is 0 right?

Just in case, in provenance, I think we found there are also quite some "bogus" TS in an [EPOCH, EPOCH+epsilon] range

Also I don't understand what "this is an import from replication package" means in this context...

Also I don't understand what "this is an import from replication package" means in this context...

It means I checked out 0068f61008e6209d8739738659c86464417f020c, extracted replication/java/FindEarliestRevision.java from https://annex.softwareheritage.org/public/dataset/license-blobs/2021-03-23/replication-package.tar.gz , committed, then merged with current master

java/src/main/java/org/softwareheritage/graph/utils/FindEarliestRevision.java
2

hmm yes, I guess

3–4

yes, it is

95

hmm yes, that makes sense. I'll send another diff.

vlorentz marked an inline comment as done.

fix range

This revision was not accepted when it landed; it landed in state Needs Review.Oct 17 2022, 5:52 PM
This revision was landed with ongoing or failed builds.
This revision was automatically updated to reflect the committed changes.