FindEarliestRevision: Add earliest_ts and rev_occurrences columns
ClosedPublic

Authored by vlorentz on Oct 11 2022, 8:20 PM.

Details

Summary

This is an import from the replication package in
https://annex.softwareheritage.org/public/dataset/license-blobs/2021-03-23/
based on the closest commit I could find + a merge commit
(to reflect the actual history)

Diff Detail

Repository
rDGRPH Compressed graph representation
Lint
Automatic diff as part of commit; lint not applicable.
Unit
Automatic diff as part of commit; unit tests not applicable.

Event Timeline

Harbormaster returned this revision to the author for changes because remote builds failed. Oct 11 2022, 8:21 PM
Harbormaster failed remote builds in B32232: Diff 31281!

ugh that's nonsense, there are undefined vars because of 352d27f4b3f6b230757966ea73eb7c123d56337c

actually, that's salvageable

Harbormaster returned this revision to the author for changes because remote builds failed. Oct 11 2022, 8:31 PM
Harbormaster returned this revision to the author for changes because remote builds failed. Oct 11 2022, 8:33 PM
Harbormaster failed remote builds in B32233: Diff 31282!

not sure I understand what's going on here.

java/src/main/java/org/softwareheritage/graph/utils/FindEarliestRevision.java
1–2

I guess it comes from the merge, but why add individual copyright statements here?

1–6

-2022 ?

63

ah it took me a bit to figure out what TSV was... I see no advantage in using this instead of CSV (even if the separator is a tab)

99

'missing' is MIN_VALUE and epoch is 0, right?

Just in case: in provenance, I think we found there are also quite a few "bogus" timestamps in an [EPOCH, EPOCH+epsilon] range
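
For illustration, the filtering discussed in this comment could look roughly like the sketch below. It is not the code from this diff: the class name, the helper names and the value of EPSILON are all hypothetical, and it only assumes what is stated above (missing is Long.MIN_VALUE, epoch is 0, and values in [EPOCH, EPOCH+epsilon] are suspect).

```java
import java.util.OptionalLong;

// Hypothetical helper, not part of FindEarliestRevision.java.
public class EarliestTimestamp {
    // Sentinel for a missing timestamp, as described in the review comment.
    static final long MISSING = Long.MIN_VALUE;
    // Unix epoch, in seconds.
    static final long EPOCH = 0L;
    // Assumed width of the "bogus" window after the epoch (one day, in seconds).
    // The real epsilon is not given in the review; this value is a placeholder.
    static final long EPSILON = 24L * 60 * 60;

    /** True unless the timestamp is missing or falls in [EPOCH, EPOCH + EPSILON]. */
    static boolean isPlausible(long ts) {
        if (ts == MISSING) {
            return false;
        }
        // Values before the epoch are left alone here; the comment only
        // mentions the window starting at the epoch.
        return ts < EPOCH || ts > EPOCH + EPSILON;
    }

    /** Smallest plausible timestamp, or empty if there is none. */
    static OptionalLong earliest(long[] timestamps) {
        boolean found = false;
        long min = 0L;
        for (long ts : timestamps) {
            if (isPlausible(ts) && (!found || ts < min)) {
                min = ts;
                found = true;
            }
        }
        return found ? OptionalLong.of(min) : OptionalLong.empty();
    }

    public static void main(String[] args) {
        long[] sample = {MISSING, 0L, 42L, 1_600_000_000L, 1_500_000_000L};
        // The missing value, the epoch itself and the near-epoch value are skipped.
        System.out.println(earliest(sample));
    }
}
```

Returning an OptionalLong rather than a sentinel keeps "no usable timestamp" distinct from any real value; whether the actual tool should skip or merely flag near-epoch timestamps is a separate decision.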

Also I don't understand what "this is an import from replication package" means in this context...

Also I don't understand what "this is an import from replication package" means in this context...

It means I checked out 0068f61008e6209d8739738659c86464417f020c, extracted replication/java/FindEarliestRevision.java from https://annex.softwareheritage.org/public/dataset/license-blobs/2021-03-23/replication-package.tar.gz, committed, then merged with current master

java/src/main/java/org/softwareheritage/graph/utils/FindEarliestRevision.java
1–2

yes, it is

1–6

hmm yes, I guess

99

hmm yes, that makes sense. I'll send another diff.

vlorentz marked an inline comment as done.

fix range

This revision was not accepted when it landed; it landed in state Needs Review. Oct 17 2022, 5:52 PM
This revision was landed with ongoing or failed builds.
This revision was automatically updated to reflect the committed changes.