I'm not really sure this new algorithm does the same as the previous one. Some subtle things were changed and I have the feeling the semantics are different now. Also, I found the previous version clearer; I would rather stay with it.
May 26 2021
May 25 2021
aeviso committed R230:c0adb80b9318: Updated client to use new module's interface. (authored by aeviso).
aeviso committed rDPROVa6d3b9debe13: Refactors to commit after several revisions. (authored by aeviso).
May 14 2021
aeviso committed rDPROVb2ddef88ebf2: Merge branch 'master' of ssh://forge.softwareheritage.org/source/swh-provenance (authored by aeviso).
aeviso committed rDPROVbc0845fe7885: Fixed timezone related bug and added logging. (authored by aeviso).
May 7 2021
Several fixes.
May 5 2021
aeviso committed rDPROV243aa290acb1: Fixed edge-case bug in build_isochrone_graph. (authored by aeviso).
Apr 30 2021
Updated client/server.
Apr 27 2021
Minor changes.
aeviso committed rDPROV4bf3056a3d15: Fix for revisions with the same timestamp. (authored by aeviso).
Mar 26 2021
aeviso committed rDPROVf3616bcc907f: Merge branch 'master' of ssh://forge.softwareheritage.org/source/swh-provenance (authored by aeviso).
Mar 23 2021
Mar 22 2021
Updated scripts to latest provenance API.
aeviso committed rDPROV7ff72e3925ae: Refactored heuristics to be selected from CLI. (authored by aeviso).
Mar 20 2021
Code clean-up.
aeviso committed R230:3933c17f5e01: Fixed histogram calculation to distinguish root. (authored by aeviso).
Added histogram utility.
Mar 18 2021
aeviso committed R230:1d12a7e0c647: Updated tools to use latest version of the module. (authored by aeviso).
Merged latest heuristics fixes.
Mar 8 2021
aeviso committed R230:155287f49e09: Refactored server to take command line parameters. (authored by aeviso).
aeviso committed R230:82d7f033f82f: New utility to query occurrences of a given file. (authored by aeviso).
aeviso committed R230:ffc97956853b: Added stats about the root directory as frontier. (authored by aeviso).
aeviso committed R230:0155d5b39884: Refactored utility get revisions from the archive. (authored by aeviso).
Mar 2 2021
aeviso committed rDPROV4bb53c1e456a: Potential fix to the empty directory issue. (authored by aeviso).
Mar 1 2021
Feb 22 2021
aeviso committed rDPROVe2f05eba8cbf: Made code compliant with black and flake8. (authored by aeviso).
Feb 18 2021
Minor fixes to metrics script.
Added script to get db metrics.
Defined different frontier criteria.
I guess this is the correct way to define interfaces in Python. I don't get why ArchiveStorage doesn't inherit from ArchiveInterface, though.
aeviso added a comment to D5098: Add basic tests for the provenanceDB and archive abstraction classes.
I don't really understand what's going on here, so there is not much I can review.
OK with this refactoring, although this will probably change in the future. I think we actually want to receive the status or visit status as a parameter, not the origins themselves.
I agree with this revision. I only have those few comments/questions above.
Feb 9 2021
Improved reuse of queried dates.
Feb 6 2021
Code cleanup and minor improvements.
New implementation of the algorithm.
Jan 31 2021
aeviso committed rDPROVceebf2ea606b: Bug fixed. Added psql backend without path saving. (authored by aeviso).
Ignore list updated.
Jan 18 2021
New script to check db structure.
Jan 17 2021
aeviso committed R230:a2d25d1b85be: Refactored to use old queries over first db. (authored by aeviso).
aeviso committed rDPROV6ba2dc38758b: Refactored to use internal id for objects. (authored by aeviso).
Jan 14 2021
Force commit retry on fail.
Jan 13 2021
aeviso committed R230:402476374f2f: Refactor client to parallelize in two levels. (authored by aeviso).
Jan 7 2021
aeviso committed rDPROV3928ff796f15: Minor fixes and comments for multithread approach. (authored by aeviso).
aeviso committed R230:53cc0182e3be: Server/client tools for provenance db creation. (authored by aeviso).
aeviso committed R230:be770b5139ee: New script to compare dbs by first occurrence. (authored by aeviso).
Jan 4 2021
aeviso committed rDPROV0e5fa2782b71: Fixed treatment of remaining candidates at the end. (authored by aeviso).
Attempt to fix parallel processing.
Apply black to the solution.
Removed trailing commas.
aeviso committed rDPROVbdb28fbfb0b4: Add mutex for thread-safe archive queries. (authored by aeviso).
Removed PosixPath dependency.
aeviso committed rDPROV5c506a174001: Fix to allow processing revision out of order. (authored by aeviso).
Dec 21 2020
Bug fixing and minor CLI improvements.
Dec 18 2020
Removed unused files.
Improved logging and cli feedback.
aeviso committed rDPROVbfaa96ce67c9: Fixed missing frontier subdirectories issue. (authored by aeviso).
aeviso committed rDPROV3506fba42ee6: Fixed issue with parent directory computation. (authored by aeviso).
Dec 17 2020
Output comparison tool.
Dec 11 2020
Content comparison script
Dec 10 2020
Bug fixing in revisited version.
Dec 9 2020
aeviso committed rDPROV0c7e5d17ced9: Fix to allow the root directory in the frontier. (authored by aeviso).
aeviso committed rDPROV78e38e5b17b0: Resolved TODO's in the revisited version. (authored by aeviso).
aeviso committed rDPROV6cdd592762ed: Merged different versions into the main one. (authored by aeviso).
Dec 7 2020
Made the code compliant with flake8
Applied black for code formatting.
Added missing swh requirements.
Solved most type annotation issues.
aeviso committed rDPROVbea86ac60b17: Resolved SQL conflicts to avoid reprocessing. Empirical cache size. (authored by aeviso).
Nov 17 2020
Added cache support for archive queries.
Nov 16 2020
Bug fixing.
Removed duplicated functions.
aeviso committed rDPROV9d45b455d053: Merge branch 'master' of https://forge.softwareheritage. (authored by aeviso).
Patched default configuration.
Nov 15 2020
aeviso committed rDPROV53a666e63034: Fixed bug that made the algorithm unnecessarily walk already known directories (authored by aeviso).
Nov 9 2020
Minor fixes on configuration.
Nov 5 2020
aeviso committed rDPROV79a8901853af: Code restructured to separate different backends' implementations. (authored by aeviso).
Improved profiling support.
aeviso committed rDPROV9dabe5d33625: Origin-revision layer updated to use ArchiveInterface. (authored by aeviso).
Oct 30 2020
aeviso committed rDPROV855f583350fb: Minor fixes and comments during profiling. (authored by aeviso).
Oct 27 2020
aeviso committed rDPROV0666a436a8b0: First version of frontend/backend for provenance and archive queries. (authored by aeviso).
Added profiling option.
Oct 16 2020
Fixed some minor cli bugs.
Oct 15 2020
aeviso committed rDPROV26aa2fbae229: Moved all database related code to provenance.py (authored by aeviso).
aeviso committed rDPROV730948897249: Content-revision layer refactored and cleaned-up. (authored by aeviso).
aeviso committed rDPROV1a6bd2411f0a: Revision-origin layer refactored into the swh module. (authored by aeviso).
Oct 13 2020
aeviso committed rDPROVff6995ce94b9: Content-revision layer refactored to postpone insertions. (authored by aeviso).
Code restructuring into swh module
Ignore list and README updated.
aeviso committed rDPROV037c0cd7a58b: Refactored model.py to use the storage API. (authored by aeviso).
aeviso committed rDPROV068dbc8c5759: Some utilities to graphically verify results. (authored by aeviso).