Page MenuHomeSoftware Heritage
Feed Advanced Search

Dec 16 2022

vlorentz accepted D8962: Add a --size-limit cli option to the replay command.
Dec 16 2022, 1:15 PM

Dec 15 2022

vlorentz closed T4676: Add Luigi workflow in swh-dataset, a subtask of T2220: swh-graph in production, as Resolved.
Dec 15 2022, 1:00 PM · Roadmap 2022, meta-task, Roadmap 2021, Compressed graph service
vlorentz closed T4676: Add Luigi workflow in swh-dataset, a subtask of T4677: Add support for generating subdatasets in swh.dataset.luigi, as Resolved.
Dec 15 2022, 1:00 PM · Datasets
vlorentz closed T4676: Add Luigi workflow in swh-dataset as Resolved.
Dec 15 2022, 1:00 PM · Datasets, Compressed graph service
vlorentz added inline comments to D8959: github: Export statsd metrics about API requests and token usage.
Dec 15 2022, 1:00 PM
vlorentz requested review of D8959: github: Export statsd metrics about API requests and token usage.
Dec 15 2022, 12:30 PM
vlorentz added a revision to T4728: Add monitoring of API token usage: D8959: github: Export statsd metrics about API requests and token usage.
Dec 15 2022, 12:22 PM · Metrics/monitoring, Origin-GitHub
vlorentz triaged T4728: Add monitoring of API token usage as Normal priority.
Dec 15 2022, 12:18 PM · Metrics/monitoring, Origin-GitHub
vlorentz accepted D8911: Remove ambiguous item cursors.

thx

Dec 15 2022, 12:07 PM
vlorentz closed T4719: indexer storage crashes on kafka errors, because of integers in the key as Resolved.
Dec 15 2022, 10:31 AM · Indexer
vlorentz accepted D8953: Make query introspection configurable in the settings.
Dec 15 2022, 10:11 AM
vlorentz added a comment to D8911: Remove ambiguous item cursors.

Please add comments next to commented out code to explain why it's commented out. (it could be a link to this diff)

Dec 15 2022, 10:08 AM
vlorentz accepted D8897: Change the input type in contentByHashes entrypoint.
Dec 15 2022, 10:07 AM
vlorentz added a comment to D8953: Make query introspection configurable in the settings.

Shouldn't it be enabled in prod too, then?

Dec 15 2022, 10:06 AM
vlorentz accepted D8939: Rework the replaying exception handling.

Could you use a logger instance,

what do you mean by "use a logger instance"?

Dec 15 2022, 10:04 AM
vlorentz accepted D8952: Add missing __init__.py so find_packages keep finding sql modules.
Dec 15 2022, 9:55 AM
vlorentz added a comment to D8953: Make query introspection configurable in the settings.

to avoid attacks

Dec 15 2022, 9:54 AM
vlorentz accepted D8954: docs/persistent-identifiers: Fix some broken links for browsing SWHIDs.
Dec 15 2022, 9:53 AM

Dec 14 2022

vlorentz updated the task description for T4726: ValueError: Hash tree computation divergence detected while loading https://svn.code.sf.net/p/openautomation/code.
Dec 14 2022, 3:05 PM · SVN Loader
vlorentz placed T4726: ValueError: Hash tree computation divergence detected while loading https://svn.code.sf.net/p/openautomation/code up for grabs.
Dec 14 2022, 3:04 PM · SVN Loader
vlorentz closed D8877: Fix incorrect error messages when failing to connect.
Dec 14 2022, 2:46 PM
vlorentz committed rDVAUeb055b2c0487: Fix incorrect error messages when failing to connect (authored by vlorentz).
Fix incorrect error messages when failing to connect
Dec 14 2022, 2:46 PM
vlorentz placed T4725: Crash when changing the language of .ipynb files up for grabs.
Dec 14 2022, 2:29 PM · Easy hack, Web app

Dec 13 2022

vlorentz closed T4724: UnicodeDecodeError on branch names in git loader as Resolved.
Dec 13 2022, 1:35 PM · Git loader
vlorentz closed D8956: Fix crash on non-UTF8 branch names.
Dec 13 2022, 1:35 PM
vlorentz committed rDLDG5018c6ad5be6: Fix crash on non-UTF8 branch names (authored by vlorentz).
Fix crash on non-UTF8 branch names
Dec 13 2022, 1:35 PM
vlorentz requested review of D8956: Fix crash on non-UTF8 branch names.
Dec 13 2022, 1:26 PM
vlorentz triaged T4724: UnicodeDecodeError on branch names in git loader as Normal priority.
Dec 13 2022, 1:24 PM · Git loader
vlorentz added a revision to T4724: UnicodeDecodeError on branch names in git loader: D8956: Fix crash on non-UTF8 branch names.
Dec 13 2022, 1:23 PM · Git loader
vlorentz updated subscribers of T4438: Look into triple-stores suitable as swh-search backends.

@KShivendu pointed out that Neo4j may be a strong option too. It doesn't natively support SPARQL, but we could have it as a layer on top of the Tinkerpop API

Dec 13 2022, 12:24 PM · Archive search
vlorentz accepted D8955: utils: Fix unquoted SWHID URLs generated by get_swhids_info.
Dec 13 2022, 12:09 PM
vlorentz added a comment to T4438: Look into triple-stores suitable as swh-search backends.

Actually the open-source version of Virtuoso is unmaintained, and I couldn't figure out how to use it. Blazegraph and Apache Jena look promising too, though.

Dec 13 2022, 12:05 PM · Archive search

Dec 12 2022

vlorentz closed T4355: Guix event presentation as Resolved.

Source code is at https://forge.softwareheritage.org/source/slides/browse/master/talks-public/2022-09-16-Guix/

Dec 12 2022, 10:53 AM · Community Building

Dec 8 2022

vlorentz requested review of D8947: Document statsd metrics and link to dashboards.
Dec 8 2022, 1:20 PM
vlorentz closed D8935: Add dataset name to the export id.
Dec 8 2022, 11:47 AM
vlorentz committed rDGRPH94b1d2c14fe8: Add dataset name to the export id (authored by vlorentz).
Add dataset name to the export id
Dec 8 2022, 11:47 AM
vlorentz closed T4354: Contribute terms to ForgeFed as Resolved.
Dec 8 2022, 11:44 AM · Archive search, Metadata workflow
vlorentz closed T4354: Contribute terms to ForgeFed, a subtask of T4249: Choose/define an ontology to use for indexed extrinsic origin metadata, as Resolved.
Dec 8 2022, 11:44 AM · Archive search, Metadata workflow
vlorentz updated the task description for T4354: Contribute terms to ForgeFed.
Dec 8 2022, 11:44 AM · Archive search, Metadata workflow
vlorentz accepted D8944: replay: Copy dir states and external paths in copy_from operations.
Dec 8 2022, 11:39 AM
vlorentz accepted D8882: replay: Do not ignore externals in copyfrom operations.
Dec 8 2022, 11:38 AM
vlorentz accepted D8941: replay: Simplify FileEditor implementation.

huh, nice

Dec 8 2022, 11:38 AM
vlorentz added a comment to D8944: replay: Copy dir states and external paths in copy_from operations.

Are you sure the path argument to add_directory cannot start with a / or contain ..?

Dec 8 2022, 11:36 AM
vlorentz accepted D8942: utils: Raise ValueError when external definition could not be parsed.
Dec 8 2022, 11:33 AM
vlorentz added a comment to D8939: Rework the replaying exception handling.

Could you use a logger instance, and add if logger.isEnabledFor(logging.DEBUG): before logger.debug statements that use hash_to_hex?

Dec 8 2022, 11:31 AM
vlorentz accepted D8946: svn_retry: Reduce max number of retry attempts to 3.
Dec 8 2022, 2:29 AM
vlorentz accepted D8945: api, browse: Ensure to sanitize filename passed to django FileResponse.
Dec 8 2022, 2:29 AM
vlorentz requested changes to D8943: svn: Use urllib.parse.quote to percent encode svn URLs.
Dec 8 2022, 2:27 AM

Dec 7 2022

vlorentz closed D8932: Replace RunAll with RunExportCompressUpload.
Dec 7 2022, 5:15 PM
vlorentz committed rDGRPHcd69e48b5acc: Replace RunAll with RunExportCompressUpload (authored by vlorentz).
Replace RunAll with RunExportCompressUpload
Dec 7 2022, 5:15 PM
vlorentz closed D8931: Prevent incorrect warning from being printed to output files.
Dec 7 2022, 5:15 PM
vlorentz committed rDGRPH233b0508395a: Prevent incorrect warning from being printed to output files (authored by vlorentz).
Prevent incorrect warning from being printed to output files
Dec 7 2022, 5:15 PM
vlorentz committed rDGRPH042af3adf5b6: Fix crash when the sensitive dataset directory does not exist (authored by vlorentz).
Fix crash when the sensitive dataset directory does not exist
Dec 7 2022, 5:15 PM
vlorentz requested review of D8935: Add dataset name to the export id.
Dec 7 2022, 4:57 PM
vlorentz closed D8934: Remove tool ids from Kafka messages.
Dec 7 2022, 4:34 PM
vlorentz committed rDCIDXe8549400bc54: Remove tool ids from Kafka messages (authored by vlorentz).
Remove tool ids from Kafka messages
Dec 7 2022, 4:34 PM
vlorentz updated the diff for D8932: Replace RunAll with RunExportCompressUpload.

rebase

Dec 7 2022, 3:24 PM
vlorentz updated the diff for D8931: Prevent incorrect warning from being printed to output files.

I'm tired

Dec 7 2022, 3:24 PM
vlorentz updated the diff for D8932: Replace RunAll with RunExportCompressUpload.

rebase

Dec 7 2022, 3:18 PM
vlorentz updated the diff for D8931: Prevent incorrect warning from being printed to output files.

remove useless function

Dec 7 2022, 3:18 PM
vlorentz updated the diff for D8932: Replace RunAll with RunExportCompressUpload.

rebase

Dec 7 2022, 3:10 PM
vlorentz updated the diff for D8931: Prevent incorrect warning from being printed to output files.

less awful fix

Dec 7 2022, 3:09 PM
vlorentz planned changes to D8931: Prevent incorrect warning from being printed to output files.
Dec 7 2022, 2:44 PM
vlorentz added a comment to D8931: Prevent incorrect warning from being printed to output files.
In D8931#232231, @olasd wrote:

Why not just touch all the files?

Dec 7 2022, 2:43 PM
vlorentz created P1541 (An Untitled Masterwork).
Dec 7 2022, 2:31 PM
vlorentz requested review of D8934: Remove tool ids from Kafka messages.
Dec 7 2022, 2:20 PM
vlorentz added a revision to T4719: indexer storage crashes on kafka errors, because of integers in the key: D8934: Remove tool ids from Kafka messages.
Dec 7 2022, 2:08 PM · Indexer
vlorentz triaged T4719: indexer storage crashes on kafka errors, because of integers in the key as Normal priority.
Dec 7 2022, 2:08 PM · Indexer
vlorentz requested review of D8932: Replace RunAll with RunExportCompressUpload.
Dec 7 2022, 12:55 PM
vlorentz accepted D8933: task add: Ensure task type provided exist and raise otherwise.
Dec 7 2022, 12:55 PM
vlorentz requested review of D8931: Prevent incorrect warning from being printed to output files.
Dec 7 2022, 12:54 PM
vlorentz committed rDGRPH66253a872d6b: Add missing dependency on pytest-mock (authored by vlorentz).
Add missing dependency on pytest-mock
Dec 7 2022, 12:47 PM
vlorentz closed D8930: origin_contributors: Fix typo and improve readability.
Dec 7 2022, 10:40 AM
vlorentz committed rDGRPHb8ddd6ceadbd: origin_contributors: Fix typo and improve readability (authored by vlorentz).
origin_contributors: Fix typo and improve readability
Dec 7 2022, 10:40 AM
vlorentz closed D8910: Regenerate the test dataset to include a release with no author.
Dec 7 2022, 10:40 AM
vlorentz closed D8919: Add CLI script to generate Luigi config and call it.
Dec 7 2022, 10:40 AM
vlorentz closed D8917: Split swh/graph/luigi.py into modules.
Dec 7 2022, 10:40 AM
vlorentz closed D8912: ListOriginContributors: Ignore null author/committer in revisions/releases.
Dec 7 2022, 10:40 AM
vlorentz committed rDGRPHe65858a73918: Split swh/graph/luigi.py into modules (authored by vlorentz).
Split swh/graph/luigi.py into modules
Dec 7 2022, 10:40 AM
vlorentz committed rDGRPHb76801259953: Add CLI script to generate Luigi config and call it (authored by vlorentz).
Add CLI script to generate Luigi config and call it
Dec 7 2022, 10:40 AM
vlorentz committed rDGRPHdfd4c1dc3b22: ListOriginContributors: Ignore null author/committer in revisions/releases (authored by vlorentz).
ListOriginContributors: Ignore null author/committer in revisions/releases
Dec 7 2022, 10:40 AM
vlorentz closed D8908: Add ListOriginContributors.
Dec 7 2022, 10:40 AM
vlorentz committed rDGRPHab2703efcb9a: Add Luigi task TopoSort and add a simple test (authored by vlorentz).
Add Luigi task TopoSort and add a simple test
Dec 7 2022, 10:40 AM
vlorentz committed rDGRPHf3235e318485: Add ListOriginContributors (authored by vlorentz).
Add ListOriginContributors
Dec 7 2022, 10:40 AM
vlorentz committed rDGRPH559d4068bfe1: Regenerate the test dataset to include a release with no author (authored by vlorentz).
Regenerate the test dataset to include a release with no author
Dec 7 2022, 10:40 AM
vlorentz committed rDGRPH7bee5d47a6eb: revert multithreading, it's actually twice as slow as singlethread (authored by vlorentz).
revert multithreading, it's actually twice as slow as singlethread
Dec 7 2022, 10:40 AM
vlorentz committed rDGRPH58f44785816b: Improve comments (authored by vlorentz).
Improve comments
Dec 7 2022, 10:40 AM
vlorentz committed rDGRPH922894410b6e: Add a sample of two ancestor with each node (authored by vlorentz).
Add a sample of two ancestor with each node
Dec 7 2022, 10:40 AM
vlorentz closed D8883: Add a script to generate a topological sort.
Dec 7 2022, 10:39 AM
vlorentz committed rDGRPH30dad16a2365: tentative multithread DFS (authored by vlorentz).
tentative multithread DFS
Dec 7 2022, 10:39 AM
vlorentz committed rDGRPHed6636c26be8: Implement a naive topological sort (authored by vlorentz).
Implement a naive topological sort
Dec 7 2022, 10:39 AM
vlorentz closed D8903: luigi: Add tasks UploadGraphToS3 and DownloadGraphFromS3.
Dec 7 2022, 10:39 AM
vlorentz committed rDGRPHb8dc411ccd30: luigi: Add tasks UploadGraphToS3 and DownloadGraphFromS3 (authored by vlorentz).
luigi: Add tasks UploadGraphToS3 and DownloadGraphFromS3
Dec 7 2022, 10:39 AM
vlorentz added a comment to D8908: Add ListOriginContributors.

fixed by D8930

Dec 7 2022, 10:09 AM
vlorentz updated the diff for D8919: Add CLI script to generate Luigi config and call it.

rebase + fix typos + improve readability

Dec 7 2022, 10:08 AM
vlorentz updated the diff for D8917: Split swh/graph/luigi.py into modules.

rebase

Dec 7 2022, 10:08 AM
vlorentz updated the diff for D8912: ListOriginContributors: Ignore null author/committer in revisions/releases.

rebase

Dec 7 2022, 10:08 AM
vlorentz updated the diff for D8910: Regenerate the test dataset to include a release with no author.

rebase

Dec 7 2022, 10:08 AM
vlorentz updated the diff for D8908: Add ListOriginContributors.

rebase

Dec 7 2022, 10:08 AM
vlorentz updated the diff for D8883: Add a script to generate a topological sort.

rebase

Dec 7 2022, 10:08 AM