Page MenuHomeSoftware Heritage

DanSeraf (Daniele Serafini)
User

Projects

User Details

User Since
Sep 23 2019, 3:10 PM (69 w, 5 d)

Recent Activity

Mon, Jan 18

DanSeraf closed D4875: scanner-benchmark: the temporary directory is removed by tempdir.
Mon, Jan 18, 12:32 PM
DanSeraf committed rDTSCN5cd9f762467e: fix: the temporary directory is removed by tempfile (authored by DanSeraf).
fix: the temporary directory is removed by tempfile
Mon, Jan 18, 12:32 PM
DanSeraf requested review of D4875: scanner-benchmark: the temporary directory is removed by tempdir.
Mon, Jan 18, 10:35 AM

Wed, Jan 13

DanSeraf committed rDTSCN7a289332f730: print results as a csv (authored by DanSeraf).
print results as a csv
Wed, Jan 13, 7:46 PM
DanSeraf closed D4851: scanner benchmark: output format and repository extraction in temporary directories.
Wed, Jan 13, 7:45 PM
DanSeraf committed rDTSCN9e4df16d9486: extract repositories in temporary directories (authored by DanSeraf).
extract repositories in temporary directories
Wed, Jan 13, 7:45 PM
DanSeraf requested review of D4851: scanner benchmark: output format and repository extraction in temporary directories.
Wed, Jan 13, 11:20 AM

Dec 19 2020

DanSeraf committed rDTSCN7bd1939949dc: scanner experiments (authored by DanSeraf).
scanner experiments
Dec 19 2020, 4:46 PM
DanSeraf closed D4721: WIP: scanner benchmark.
Dec 19 2020, 4:46 PM
DanSeraf updated the diff for D4721: WIP: scanner benchmark.

wrong algorithm name in example

Dec 19 2020, 4:41 PM

Dec 17 2020

DanSeraf updated the diff for D4721: WIP: scanner benchmark.

variable name in run_benchmark.sh

Dec 17 2020, 2:12 PM
DanSeraf updated the diff for D4721: WIP: scanner benchmark.

remove git missing imports in mypy.ini

Dec 17 2020, 2:05 PM
DanSeraf updated the diff for D4721: WIP: scanner benchmark.

requested changes
+ algorithms can be specified from run_benchmark.sh
+ if "random" algorithm is specified, benchmark.py will run three experiments using the default seeds (10, 20, 30)

Dec 17 2020, 7:51 AM

Dec 11 2020

DanSeraf created D4721: WIP: scanner benchmark.
Dec 11 2020, 12:46 PM

Nov 24 2020

DanSeraf closed T2760: swh-scanner: add support for local DB of known SWHIDs as Resolved.
Nov 24 2020, 1:54 PM · Code scanner
DanSeraf committed rDTSCN09c28d60f1ad: 'db serve' option to start the API service (authored by DanSeraf).
'db serve' option to start the API service
Nov 24 2020, 1:51 PM
DanSeraf closed D4552: 'db serve' option to start the API service.
Nov 24 2020, 1:51 PM
DanSeraf updated the diff for D4552: 'db serve' option to start the API service.

Minor changes

Nov 24 2020, 1:49 PM
DanSeraf updated the diff for D4552: 'db serve' option to start the API service.

rebase

Nov 24 2020, 11:48 AM
DanSeraf updated the diff for D4552: 'db serve' option to start the API service.
  • changed module name
  • query the database with only one cursor
  • get SWHID known status directly when generating the response
Nov 24 2020, 11:09 AM

Nov 23 2020

DanSeraf added inline comments to D4552: 'db serve' option to start the API service.
Nov 23 2020, 5:01 PM

Nov 22 2020

DanSeraf added a revision to T2760: swh-scanner: add support for local DB of known SWHIDs: D4552: 'db serve' option to start the API service.
Nov 22 2020, 4:19 PM · Code scanner
DanSeraf created D4552: 'db serve' option to start the API service.
Nov 22 2020, 4:19 PM

Nov 21 2020

DanSeraf committed rDTSCN521420e7d5eb: 'db import' option to create local database with known swhids (authored by DanSeraf).
'db import' option to create local database with known swhids
Nov 21 2020, 3:01 PM
DanSeraf closed D4508: scanner: 'db import' option to create local database with known swhids.
Nov 21 2020, 3:01 PM

Nov 20 2020

DanSeraf updated the diff for D4508: scanner: 'db import' option to create local database with known swhids.

minor changes:

  • mypy annotation
  • tests
Nov 20 2020, 10:03 AM

Nov 19 2020

DanSeraf updated the diff for D4508: scanner: 'db import' option to create local database with known swhids.

minor changes

Nov 19 2020, 5:08 PM
DanSeraf updated the diff for D4508: scanner: 'db import' option to create local database with known swhids.

requested changes:

  • SWHID as PRIMARY KEY in db
  • SWHID insertion without query the Web API
  • bulk insert of SWHID values in db
Nov 19 2020, 4:04 PM

Nov 18 2020

DanSeraf updated the diff for D4508: scanner: 'db import' option to create local database with known swhids.

requested changes

Nov 18 2020, 5:43 PM
DanSeraf added a revision to T2760: swh-scanner: add support for local DB of known SWHIDs: D4508: scanner: 'db import' option to create local database with known swhids.
Nov 18 2020, 2:24 PM · Code scanner
DanSeraf created D4508: scanner: 'db import' option to create local database with known swhids.
Nov 18 2020, 2:24 PM

Nov 16 2020

DanSeraf changed the status of T2760: swh-scanner: add support for local DB of known SWHIDs from Open to Work in Progress.
Nov 16 2020, 10:41 AM · Code scanner

Oct 13 2020

DanSeraf triaged T2692: Move the output related functions to another (sub)module as Normal priority.
Oct 13 2020, 9:57 AM · Code scanner
DanSeraf committed rDTSCNc2768d171a78: model: dropped _iter_nodes_attr function (authored by DanSeraf).
model: dropped _iter_nodes_attr function
Oct 13 2020, 9:36 AM
DanSeraf closed T2690: swh scanner reports double results in ndjson format as Resolved by committing rDTSCNc2768d171a78: model: dropped _iter_nodes_attr function.
Oct 13 2020, 9:36 AM · Code scanner
DanSeraf closed D4241: scanner: removed _iter_nodes_attr function in model (causes results duplication).
Oct 13 2020, 9:35 AM
DanSeraf created D4241: scanner: removed _iter_nodes_attr function in model (causes results duplication).
Oct 13 2020, 8:54 AM

Sep 11 2020

DanSeraf committed rDTSCNf838fed672d8: cli: don't check for glob pattern before translating into regex (authored by DanSeraf).
cli: don't check for glob pattern before translating into regex
Sep 11 2020, 9:20 AM
DanSeraf closed D3924: cli: don't check for glob pattern before translating into regex.
Sep 11 2020, 9:20 AM
DanSeraf updated the diff for D3924: cli: don't check for glob pattern before translating into regex.

tests

Sep 11 2020, 9:16 AM

Sep 10 2020

DanSeraf created D3924: cli: don't check for glob pattern before translating into regex.
Sep 10 2020, 6:05 PM
DanSeraf committed rDTSCN2fb9cb1c59e2: docs: readme and cli description update (authored by DanSeraf).
docs: readme and cli description update
Sep 10 2020, 5:23 PM
DanSeraf closed D3876: readme and cli description update.
Sep 10 2020, 5:23 PM
DanSeraf updated the diff for D3876: readme and cli description update.

rebase

Sep 10 2020, 5:22 PM
DanSeraf updated the diff for D3876: readme and cli description update.

changes

Sep 10 2020, 4:44 PM
DanSeraf added inline comments to D3876: readme and cli description update.
Sep 10 2020, 4:41 PM
DanSeraf accepted D3919: cli: speedup the `swh` cli command startup time.
Sep 10 2020, 4:39 PM

Sep 9 2020

DanSeraf updated the diff for D3876: readme and cli description update.

requested changes

Sep 9 2020, 5:42 PM

Sep 8 2020

DanSeraf committed rDDOCccb3629bc484: index: add swh.scanner (authored by DanSeraf).
index: add swh.scanner
Sep 8 2020, 1:52 PM
DanSeraf closed D3886: index: add swh.scanner.
Sep 8 2020, 1:52 PM
DanSeraf created D3886: index: add swh.scanner.
Sep 8 2020, 1:47 PM

Sep 3 2020

DanSeraf added inline comments to D3876: readme and cli description update.
Sep 3 2020, 5:22 PM
DanSeraf created D3876: readme and cli description update.
Sep 3 2020, 4:01 PM

Jul 3 2020

DanSeraf created P712 (An Untitled Masterwork).
Jul 3 2020, 12:43 PM

Jun 22 2020

DanSeraf committed rDTSCN0f10ec6ae8fe: dashboard: file visualization per directory path (authored by DanSeraf).
dashboard: file visualization per directory path
Jun 22 2020, 7:39 PM
DanSeraf closed T2364: scanner: file browser in the sunburst/dashboard output as Resolved by committing rDTSCN0f10ec6ae8fe: dashboard: file visualization per directory path.
Jun 22 2020, 7:39 PM · Code scanner
DanSeraf closed D3293: scanner: dashboard file visualization per directory path.
Jun 22 2020, 7:39 PM
DanSeraf closed T2336: scanner: add support for an exclusion list as Resolved.
Jun 22 2020, 2:57 PM · Code scanner
DanSeraf updated the diff for D3293: scanner: dashboard file visualization per directory path.
  • non-minified css in assets
Jun 22 2020, 2:50 PM

Jun 20 2020

DanSeraf updated the diff for D3293: scanner: dashboard file visualization per directory path.
  • workaround to check the table body
  • bootstrap css update
Jun 20 2020, 4:37 PM
DanSeraf added a comment to D3293: scanner: dashboard file visualization per directory path.

Unfortunately the expected values can't be tested.

Why not?

Because the dash_html_components checks for object identity only. I wrote a comment inside the test.

Jun 20 2020, 4:37 PM

Jun 19 2020

DanSeraf added a comment to D3293: scanner: dashboard file visualization per directory path.
  • It's missing tests. I understand it's not easy to do for a GUI, but could you see what you can do about it?
Jun 19 2020, 4:54 PM
DanSeraf updated the diff for D3293: scanner: dashboard file visualization per directory path.
  • init.py in dashboard directory
Jun 19 2020, 4:51 PM
DanSeraf updated the diff for D3293: scanner: dashboard file visualization per directory path.
  • css as static asset
Jun 19 2020, 4:41 PM

Jun 16 2020

DanSeraf updated the summary of D3293: scanner: dashboard file visualization per directory path.
Jun 16 2020, 6:45 PM
DanSeraf created D3293: scanner: dashboard file visualization per directory path.
Jun 16 2020, 6:31 PM
DanSeraf committed rDTSCNbf9e586436d8: model: get file attributes from a specific directory (authored by DanSeraf).
model: get file attributes from a specific directory
Jun 16 2020, 5:54 PM
DanSeraf closed D3284: scanner: retrieve file attributes.
Jun 16 2020, 5:54 PM
DanSeraf updated the diff for D3284: scanner: retrieve file attributes.
  • wrong check when using the source path
Jun 16 2020, 1:10 PM
DanSeraf updated the diff for D3284: scanner: retrieve file attributes.

wrong directory check

Jun 16 2020, 12:15 PM
DanSeraf added inline comments to D3284: scanner: retrieve file attributes.
Jun 16 2020, 12:15 PM

Jun 15 2020

DanSeraf created D3284: scanner: retrieve file attributes.
Jun 15 2020, 6:35 PM

Jun 5 2020

DanSeraf committed rDTSCN13ac68a9471e: model: iterate only the child nodes instead of the child nodes attributes (authored by DanSeraf).
model: iterate only the child nodes instead of the child nodes attributes
Jun 5 2020, 4:57 PM
DanSeraf closed D3230: scanner: nodes iteration function in model.
Jun 5 2020, 4:57 PM
DanSeraf created D3230: scanner: nodes iteration function in model.
Jun 5 2020, 12:00 PM

Jun 4 2020

DanSeraf committed rDTSCNe126d2ac98c9: interactive dashboard (authored by DanSeraf).
interactive dashboard
Jun 4 2020, 11:46 AM
DanSeraf closed D3216: scanner: Interactive dashboard.
Jun 4 2020, 11:46 AM

Jun 3 2020

DanSeraf created D3216: scanner: Interactive dashboard.
Jun 3 2020, 4:29 PM

Apr 30 2020

DanSeraf closed T2365: scanner: add color legend for sunburst output as Resolved by committing rDTSCNfb8ae03e494c: plot: color legend.
Apr 30 2020, 12:41 PM · Code scanner
DanSeraf committed rDTSCNfb8ae03e494c: plot: color legend (authored by DanSeraf).
plot: color legend
Apr 30 2020, 12:41 PM
DanSeraf closed D3099: plot: color legend.
Apr 30 2020, 12:41 PM
DanSeraf created D3099: plot: color legend.
Apr 30 2020, 12:02 PM

Apr 29 2020

DanSeraf committed rDTSCN623a9dbe6157: ndjson output format (authored by DanSeraf).
ndjson output format
Apr 29 2020, 4:40 PM
DanSeraf closed T2363: scanner: json output should return both known and unknown files/dirs as Resolved by committing rDTSCN623a9dbe6157: ndjson output format.
Apr 29 2020, 4:40 PM · Code scanner
DanSeraf closed D3085: scanner: ndjson output format.
Apr 29 2020, 4:40 PM
DanSeraf updated the diff for D3085: scanner: ndjson output format.

rebase

Apr 29 2020, 4:40 PM
DanSeraf updated the diff for D3085: scanner: ndjson output format.

missing import in mypy.ini

Apr 29 2020, 2:28 PM
DanSeraf updated the summary of D3085: scanner: ndjson output format.
Apr 29 2020, 1:17 PM
DanSeraf created D3085: scanner: ndjson output format.
Apr 29 2020, 1:16 PM
DanSeraf committed rDTSCN3f00bb004b4d: flat json output with known and swhid values (authored by DanSeraf).
flat json output with known and swhid values
Apr 29 2020, 12:57 PM
DanSeraf closed D3069: scanner: json output format.
Apr 29 2020, 12:57 PM
DanSeraf updated the diff for D3069: scanner: json output format.

rebase

Apr 29 2020, 12:56 PM
DanSeraf committed rDTSCNbbf296f7523a: model: known attribute in Tree structure (authored by DanSeraf).
model: known attribute in Tree structure
Apr 29 2020, 12:36 PM
DanSeraf closed D3070: model: known attribute in Tree structure.
Apr 29 2020, 12:36 PM

Apr 27 2020

DanSeraf updated the diff for D3070: model: known attribute in Tree structure.

requested changes

Apr 27 2020, 7:26 PM
DanSeraf updated the diff for D3069: scanner: json output format.

update

Apr 27 2020, 5:18 PM
DanSeraf updated the diff for D3069: scanner: json output format.

update

Apr 27 2020, 4:54 PM
DanSeraf updated the diff for D3070: model: known attribute in Tree structure.

rebase

Apr 27 2020, 4:52 PM
DanSeraf updated the diff for D3069: scanner: json output format.

parent

Apr 27 2020, 4:46 PM
DanSeraf updated the diff for D3070: model: known attribute in Tree structure.

child diff

Apr 27 2020, 4:46 PM