
DanSeraf (Daniele Serafini)
User

User Details

User Since
Sep 23 2019, 3:10 PM (57 w, 3 d)

Recent Activity

Tue, Oct 13

DanSeraf triaged T2692: Move the output related functions to another (sub)module as Normal priority.
Tue, Oct 13, 9:57 AM · Code scanner
DanSeraf committed rDTSCNc2768d171a78: model: dropped _iter_nodes_attr function (authored by DanSeraf).
Tue, Oct 13, 9:36 AM
DanSeraf closed T2690: swh scanner reports double results in ndjson format as Resolved by committing rDTSCNc2768d171a78: model: dropped _iter_nodes_attr function.
Tue, Oct 13, 9:36 AM · Code scanner
DanSeraf closed D4241: scanner: removed _iter_nodes_attr function in model (causes results duplication).
Tue, Oct 13, 9:35 AM
DanSeraf created D4241: scanner: removed _iter_nodes_attr function in model (causes results duplication).
Tue, Oct 13, 8:54 AM

Sep 11 2020

DanSeraf committed rDTSCNf838fed672d8: cli: don't check for glob pattern before translating into regex (authored by DanSeraf).
Sep 11 2020, 9:20 AM
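
The commit above concerns translating glob exclusion patterns into regular expressions. A minimal sketch of that idea follows, using `fnmatch.translate` from the Python standard library. The helper names `compile_exclude_patterns` and `is_excluded` and the sample patterns are hypothetical and not taken from swh-scanner. A string without wildcards translates to a regex that matches only itself, which is probably why no separate "is this a glob?" check is needed before translating.

```python
import fnmatch
import re
from typing import Iterable, List, Pattern


def compile_exclude_patterns(patterns: Iterable[str]) -> List[Pattern[str]]:
    """Translate each glob pattern into a compiled regex (hypothetical helper)."""
    # fnmatch.translate handles plain strings too: "docs" becomes a regex
    # that matches exactly "docs", so literals and globs share one code path.
    return [re.compile(fnmatch.translate(p)) for p in patterns]


def is_excluded(path: str, compiled: List[Pattern[str]]) -> bool:
    """Return True when the path matches at least one compiled pattern."""
    return any(rx.match(path) for rx in compiled)


# Sample patterns, chosen only for illustration.
compiled = compile_exclude_patterns(["*.pyc", "docs"])
```

In this sketch, `is_excluded("pkg/mod.pyc", compiled)` is true, while `is_excluded("docs2", compiled)` is false because the translated regex is anchored at the end of the string.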
DanSeraf closed D3924: cli: don't check for glob pattern before translating into regex.
Sep 11 2020, 9:20 AM
DanSeraf updated the diff for D3924: cli: don't check for glob pattern before translating into regex.

tests

Sep 11 2020, 9:16 AM

Sep 10 2020

DanSeraf created D3924: cli: don't check for glob pattern before translating into regex.
Sep 10 2020, 6:05 PM
DanSeraf committed rDTSCN2fb9cb1c59e2: docs: readme and cli description update (authored by DanSeraf).
Sep 10 2020, 5:23 PM
DanSeraf closed D3876: readme and cli description update.
Sep 10 2020, 5:23 PM
DanSeraf updated the diff for D3876: readme and cli description update.

rebase

Sep 10 2020, 5:22 PM
DanSeraf updated the diff for D3876: readme and cli description update.

changes

Sep 10 2020, 4:44 PM
DanSeraf added inline comments to D3876: readme and cli description update.
Sep 10 2020, 4:41 PM
DanSeraf accepted D3919: cli: speedup the `swh` cli command startup time.
Sep 10 2020, 4:39 PM

Sep 9 2020

DanSeraf updated the diff for D3876: readme and cli description update.

requested changes

Sep 9 2020, 5:42 PM

Sep 8 2020

DanSeraf committed rDDOCccb3629bc484: index: add swh.scanner (authored by DanSeraf).
Sep 8 2020, 1:52 PM
DanSeraf closed D3886: index: add swh.scanner.
Sep 8 2020, 1:52 PM
DanSeraf created D3886: index: add swh.scanner.
Sep 8 2020, 1:47 PM

Sep 3 2020

DanSeraf added inline comments to D3876: readme and cli description update.
Sep 3 2020, 5:22 PM
DanSeraf created D3876: readme and cli description update.
Sep 3 2020, 4:01 PM

Jul 3 2020

DanSeraf created P712 (An Untitled Masterwork).
Jul 3 2020, 12:43 PM

Jun 22 2020

DanSeraf committed rDTSCN0f10ec6ae8fe: dashboard: file visualization per directory path (authored by DanSeraf).
Jun 22 2020, 7:39 PM
DanSeraf closed T2364: scanner: file browser in the sunburst/dashboard output as Resolved by committing rDTSCN0f10ec6ae8fe: dashboard: file visualization per directory path.
Jun 22 2020, 7:39 PM · Code scanner
DanSeraf closed D3293: scanner: dashboard file visualization per directory path.
Jun 22 2020, 7:39 PM
DanSeraf closed T2336: scanner: add support for an exclusion list as Resolved.
Jun 22 2020, 2:57 PM · Code scanner
DanSeraf updated the diff for D3293: scanner: dashboard file visualization per directory path.
  • non-minified css in assets
Jun 22 2020, 2:50 PM

Jun 20 2020

DanSeraf updated the diff for D3293: scanner: dashboard file visualization per directory path.
  • workaround to check the table body
  • bootstrap css update
Jun 20 2020, 4:37 PM
DanSeraf added a comment to D3293: scanner: dashboard file visualization per directory path.

Unfortunately the expected values can't be tested.

Why not?

Because dash_html_components checks for object identity only. I wrote a comment inside the test.

Jun 20 2020, 4:37 PM

Jun 19 2020

DanSeraf added a comment to D3293: scanner: dashboard file visualization per directory path.
  • It's missing tests. I understand it's not easy to do for a GUI, but could you see what you can do about it?
Jun 19 2020, 4:54 PM
DanSeraf updated the diff for D3293: scanner: dashboard file visualization per directory path.
  • __init__.py in dashboard directory
Jun 19 2020, 4:51 PM
DanSeraf updated the diff for D3293: scanner: dashboard file visualization per directory path.
  • css as static asset
Jun 19 2020, 4:41 PM

Jun 16 2020

DanSeraf updated the summary of D3293: scanner: dashboard file visualization per directory path.
Jun 16 2020, 6:45 PM
DanSeraf created D3293: scanner: dashboard file visualization per directory path.
Jun 16 2020, 6:31 PM
DanSeraf committed rDTSCNbf9e586436d8: model: get file attributes from a specific directory (authored by DanSeraf).
Jun 16 2020, 5:54 PM
DanSeraf closed D3284: scanner: retrieve file attributes.
Jun 16 2020, 5:54 PM
DanSeraf updated the diff for D3284: scanner: retrieve file attributes.
  • wrong check when using the source path
Jun 16 2020, 1:10 PM
DanSeraf updated the diff for D3284: scanner: retrieve file attributes.

wrong directory check

Jun 16 2020, 12:15 PM
DanSeraf added inline comments to D3284: scanner: retrieve file attributes.
Jun 16 2020, 12:15 PM

Jun 15 2020

DanSeraf created D3284: scanner: retrieve file attributes.
Jun 15 2020, 6:35 PM

Jun 5 2020

DanSeraf committed rDTSCN13ac68a9471e: model: iterate only the child nodes instead of the child nodes attributes (authored by DanSeraf).
Jun 5 2020, 4:57 PM
DanSeraf closed D3230: scanner: nodes iteration function in model.
Jun 5 2020, 4:57 PM
DanSeraf created D3230: scanner: nodes iteration function in model.
Jun 5 2020, 12:00 PM

Jun 4 2020

DanSeraf committed rDTSCNe126d2ac98c9: interactive dashboard (authored by DanSeraf).
Jun 4 2020, 11:46 AM
DanSeraf closed D3216: scanner: Interactive dashboard.
Jun 4 2020, 11:46 AM

Jun 3 2020

DanSeraf created D3216: scanner: Interactive dashboard.
Jun 3 2020, 4:29 PM

Apr 30 2020

DanSeraf closed T2365: scanner: add color legend for sunburst output as Resolved by committing rDTSCNfb8ae03e494c: plot: color legend.
Apr 30 2020, 12:41 PM · Code scanner
DanSeraf committed rDTSCNfb8ae03e494c: plot: color legend (authored by DanSeraf).
Apr 30 2020, 12:41 PM
DanSeraf closed D3099: plot: color legend.
Apr 30 2020, 12:41 PM
DanSeraf created D3099: plot: color legend.
Apr 30 2020, 12:02 PM

Apr 29 2020

DanSeraf committed rDTSCN623a9dbe6157: ndjson output format (authored by DanSeraf).
Apr 29 2020, 4:40 PM
DanSeraf closed T2363: scanner: json output should return both known and unknown files/dirs as Resolved by committing rDTSCN623a9dbe6157: ndjson output format.
Apr 29 2020, 4:40 PM · Code scanner
DanSeraf closed D3085: scanner: ndjson output format.
Apr 29 2020, 4:40 PM
DanSeraf updated the diff for D3085: scanner: ndjson output format.

rebase

Apr 29 2020, 4:40 PM
DanSeraf updated the diff for D3085: scanner: ndjson output format.

missing import in mypy.ini

Apr 29 2020, 2:28 PM
DanSeraf updated the summary of D3085: scanner: ndjson output format.
Apr 29 2020, 1:17 PM
DanSeraf created D3085: scanner: ndjson output format.
Apr 29 2020, 1:16 PM
DanSeraf committed rDTSCN3f00bb004b4d: flat json output with known and swhid values (authored by DanSeraf).
Apr 29 2020, 12:57 PM
DanSeraf closed D3069: scanner: json output format.
Apr 29 2020, 12:57 PM
DanSeraf updated the diff for D3069: scanner: json output format.

rebase

Apr 29 2020, 12:56 PM
DanSeraf committed rDTSCNbbf296f7523a: model: known attribute in Tree structure (authored by DanSeraf).
Apr 29 2020, 12:36 PM
DanSeraf closed D3070: model: known attribute in Tree structure.
Apr 29 2020, 12:36 PM

Apr 27 2020

DanSeraf updated the diff for D3070: model: known attribute in Tree structure.

requested changes

Apr 27 2020, 7:26 PM
DanSeraf updated the diff for D3069: scanner: json output format.

update

Apr 27 2020, 5:18 PM
DanSeraf updated the diff for D3069: scanner: json output format.

update

Apr 27 2020, 4:54 PM
DanSeraf updated the diff for D3070: model: known attribute in Tree structure.

rebase

Apr 27 2020, 4:52 PM
DanSeraf updated the diff for D3069: scanner: json output format.

parent

Apr 27 2020, 4:46 PM
DanSeraf updated the diff for D3070: model: known attribute in Tree structure.

child diff

Apr 27 2020, 4:46 PM
DanSeraf updated the diff for D3070: model: known attribute in Tree structure.

squash

Apr 27 2020, 4:42 PM
DanSeraf updated the diff for D3070: model: known attribute in Tree structure.

child diff

Apr 27 2020, 4:38 PM
DanSeraf added a comment to D3069: scanner: json output format.

Could you split this into two diffs?

Apr 27 2020, 4:17 PM
DanSeraf updated the summary of D3069: scanner: json output format.
Apr 27 2020, 4:09 PM
DanSeraf created D3070: model: known attribute in Tree structure.
Apr 27 2020, 4:05 PM
DanSeraf created D3069: scanner: json output format.
Apr 27 2020, 11:25 AM

Apr 22 2020

DanSeraf added a comment to T2363: scanner: json output should return both known and unknown files/dirs.

The new json output will be like the following:

Apr 22 2020, 6:13 PM · Code scanner

Apr 17 2020

DanSeraf committed rDTSCNbe3d3c124ac7: requirements-swh: swh.model version >= 0.0.64 (authored by DanSeraf).
Apr 17 2020, 12:32 PM
DanSeraf committed rDTSCNd52afd0eaedd: scanner: option to exclude specific patterns (authored by DanSeraf).
Apr 17 2020, 12:32 PM
DanSeraf closed D2989: scanner: exclusion list through glob patterns.
Apr 17 2020, 12:32 PM

Apr 16 2020

DanSeraf updated the diff for D2989: scanner: exclusion list through glob patterns.

requested changes

Apr 16 2020, 5:16 PM

Apr 15 2020

DanSeraf closed T2362: scanner: aiohttp.client_exceptions.ServerDisconnectedError: None as Invalid.
Apr 15 2020, 5:58 PM · Code scanner

Apr 14 2020

DanSeraf added inline comments to D2989: scanner: exclusion list through glob patterns.
Apr 14 2020, 4:36 PM

Apr 10 2020

DanSeraf updated the diff for D2989: scanner: exclusion list through glob patterns.
  • requirements-swh: swh.model version >= 0.0.65
  • match pattern with fnmatch
  • tests adapted to accept exclude patterns
Apr 10 2020, 3:51 PM

Apr 9 2020

DanSeraf added a comment to D2989: scanner: exclusion list through glob patterns.

(maybe not immediately though, it's currently building ;).

it's built so it's fine now ;)

Apr 9 2020, 5:07 PM
DanSeraf added a comment to D2989: scanner: exclusion list through glob patterns.

Instead of building the set of all paths to exclude (possibly very large), I think you should pass patterns to directory_filter and use [[ https://docs.python.org/3/library/fnmatch.html | fnmatch ]] there

Apr 9 2020, 4:23 PM
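
The review comment above suggests matching patterns on the fly instead of first building a set of every excluded path. A minimal sketch of that approach, assuming a simple "keep this path?" predicate: the factory name `make_exclude_filter` and the sample patterns are hypothetical, and the real `directory_filter` signature in swh-scanner is not shown in this feed. `fnmatch.fnmatchcase` is used so that matching is case-sensitive on every platform.

```python
import fnmatch
from typing import Callable, Iterable


def make_exclude_filter(patterns: Iterable[str]) -> Callable[[str], bool]:
    """Build a predicate that returns True when a path should be kept.

    Hypothetical helper: only the patterns are stored, never an expanded
    list of paths, so memory use stays proportional to the pattern count.
    """
    stored = tuple(patterns)

    def keep(path: str) -> bool:
        # Each path is tested lazily against the patterns as it is visited.
        return not any(fnmatch.fnmatchcase(path, p) for p in stored)

    return keep


# Sample patterns, chosen only for illustration.
keep = make_exclude_filter(["*.pyc", "*/node_modules/*"])
```

Note that in `fnmatch` the `*` wildcard also matches path separators, so `*/node_modules/*` excludes entries at any depth below a `node_modules` directory.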
DanSeraf added inline comments to D2989: scanner: exclusion list through glob patterns.
Apr 9 2020, 4:18 PM
DanSeraf created D2989: scanner: exclusion list through glob patterns.
Apr 9 2020, 1:05 PM

Apr 8 2020

DanSeraf committed rDMOD5d6883b26594: from_disk: path parameter to dir_filter functions (authored by DanSeraf).
Apr 8 2020, 11:58 AM
DanSeraf closed D2960: swh-model: exclude path support for the function from_disk in Directory.
Apr 8 2020, 11:58 AM
DanSeraf added a comment to D2960: swh-model: exclude path support for the function from_disk in Directory.

Looks good to me.

Don't you want to implement your callable here (the one which actually excludes stuff)?

Because now, the diff title is no longer in sync with the code ;)

Or do you plan to add it in swh-scanner?

Apr 8 2020, 11:58 AM
DanSeraf updated the diff for D2960: swh-model: exclude path support for the function from_disk in Directory.

requested changes

Apr 8 2020, 11:33 AM
DanSeraf updated the diff for D2960: swh-model: exclude path support for the function from_disk in Directory.

requested changes

Apr 8 2020, 9:47 AM

Apr 6 2020

DanSeraf added a comment to D2960: swh-model: exclude path support for the function from_disk in Directory.

I can define a dir_filter function, but how could I get the values I need, like the full path generated inside from_disk, without touching the implementation?

I'm not sure I understand the question.
from_disk has a save_path parameter. If set to True, the paths are stored alongside the output result.
Isn't that enough?

Apr 6 2020, 8:27 PM
DanSeraf added a comment to D2960: swh-model: exclude path support for the function from_disk in Directory.

I think in the end, this should be implemented with the dir_filter parameter instead.

Please check that you can do this. If you can, there is no need to change the current implementation.
Instead, either:

  • do not touch swh.model at all: define your ignore_path function in swh-scanner and use it when calling from_disk.
  • if you think this can be shared behavior (I guess it can), define a callable here in swh-model (like accept_all_directories and the other ones next to it).

In both cases, this won't touch Directory.from_disk though.

Apr 6 2020, 6:10 PM
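
The design discussed in the comment above, where the traversal stays untouched and the caller supplies a filter callable, can be sketched as follows. This is an illustration under stated assumptions and not the swh.model code: `list_directories` is a hypothetical stand-in for the traversal, the two-argument `(dirpath, dirname)` filter signature is assumed, and `ignore_git` is an invented example filter. Only the name `accept_all_directories` comes from the comment itself.

```python
import os
from typing import Callable, List

# Assumed filter signature for this sketch: (full path, directory name) -> keep?
DirFilter = Callable[[str, str], bool]


def accept_all_directories(dirpath: str, dirname: str) -> bool:
    """Default filter: keep every directory."""
    return True


def list_directories(root: str, dir_filter: DirFilter = accept_all_directories) -> List[str]:
    """Hypothetical traversal returning kept directories, relative to root."""
    kept: List[str] = []
    for current, dirnames, _files in os.walk(root):
        # Prune in place so os.walk never descends into rejected directories.
        dirnames[:] = sorted(
            d for d in dirnames if dir_filter(os.path.join(current, d), d)
        )
        for d in dirnames:
            kept.append(os.path.relpath(os.path.join(current, d), root))
    return kept


def ignore_git(dirpath: str, dirname: str) -> bool:
    """Caller-side filter: reject .git directories, keep everything else."""
    return dirname != ".git"
```

A caller would then write `list_directories(some_root, ignore_git)`. The exclusion policy lives entirely in the caller, which is the point made in the comment: the traversal function itself does not change.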
DanSeraf updated the summary of D2960: swh-model: exclude path support for the function from_disk in Directory.
Apr 6 2020, 4:10 PM
DanSeraf created D2960: swh-model: exclude path support for the function from_disk in Directory.
Apr 6 2020, 4:10 PM

Apr 2 2020

DanSeraf committed rDTSCNa618bf17c049: new visualization format: sunburst chart (authored by DanSeraf).
Apr 2 2020, 2:08 PM
DanSeraf committed rDTSCNe4bb51b7d894: changed structure of temp_folder fixture (authored by DanSeraf).
Apr 2 2020, 2:08 PM
DanSeraf committed rDTSCNa49c4e5ec053: model: get information about directories (authored by DanSeraf).
Apr 2 2020, 2:08 PM
DanSeraf closed D2863: scanner: show result with a sunburst chart.
Apr 2 2020, 2:08 PM
DanSeraf updated the diff for D2863: scanner: show result with a sunburst chart.

rebase

Apr 2 2020, 2:07 PM