I've tried replacing the content of foo.txt with something unknown to the archive (random garbage) and the sunburst rendering still shows 100.0%.
So it could also be a rounding error instead.
Either way, it is misleading and should be fixed.
- Queries
- All Stories
- Search
- Advanced Search
- Transactions
- Transaction Logs
Feed Advanced Search
Advanced Search
Advanced Search
Nov 29 2021
Nov 29 2021
zack triaged T3754: scanning sunburst rendering fail with "ValueError: Empty data passed with indices specified." as Normal priority.
Jul 21 2021
Jul 21 2021
DanSeraf closed T3420: scanner: make the various query algorithms user-selectable as Resolved by committing rDTSCNd5a070e1429d: add scan policies.
Jul 15 2021
Jul 15 2021
Jul 8 2021
Jul 8 2021
DanSeraf closed T3349: use swh.model.merkle/from_disk instead of swh.scanner.model, a subtask of T2730: scanner: should output the root SWHID as well, as Resolved.
DanSeraf closed T3349: use swh.model.merkle/from_disk instead of swh.scanner.model, a subtask of T3420: scanner: make the various query algorithms user-selectable, as Resolved.
zack changed the status of T2730: scanner: should output the root SWHID as well from Open to Work in Progress.
zack changed the status of T2692: Move the output related functions to another (sub)module from Open to Work in Progress.
zack moved T3318: scanner should use the known() method of web.client from In progress to Backlog on the Code scanner board.
zack added a subtask for T3318: scanner should use the known() method of web.client: T2635: web client: add async API.
Jul 5 2021
Jul 5 2021
zack changed the status of T3420: scanner: make the various query algorithms user-selectable from Open to Work in Progress.
zack changed the status of T3318: scanner should use the known() method of web.client from Open to Work in Progress.
Jun 30 2021
Jun 30 2021
Jun 25 2021
Jun 25 2021
Jun 15 2021
Jun 15 2021
zack closed T3209: Fix swh-scanner for python > 3.7 as Resolved by committing rDTSCNd58bcb59a099: Fix swh-scanner for python 3.7 and >= 3.8.
Jun 11 2021
Jun 11 2021
zack renamed T3349: use swh.model.merkle/from_disk instead of swh.scanner.model from consider using swh.model.merkle/from_disk instead of swh.scanner.model to use swh.model.merkle/from_disk instead of swh.scanner.model.
May 28 2021
May 28 2021
zack changed the status of T3349: use swh.model.merkle/from_disk instead of swh.scanner.model from Open to Work in Progress.
May 10 2021
May 10 2021
Apr 23 2021
Apr 23 2021
Apr 18 2021
Apr 18 2021
I'm sorry for the delay. I was unaware that this task was assigned to me. I missed it but solved it as soon as I came to know about it. Also, I wanted to ask if there is a way to install a library mentioned in requirements.txt according to the python version used?
Apr 6 2021
Apr 6 2021
Mar 26 2021
Mar 26 2021
Mar 19 2021
Mar 19 2021
Mar 16 2021
Mar 16 2021
@shashikant231 please do not claim tasks, thanks.
Hi @rdicosmo, can you guide me to start working on this issue.I have already built this project in my system.
Mar 15 2021
Mar 15 2021
rdicosmo added a subtask for T3136: Prior art detection service: T3112: Provenance index for the full archive.
Mar 10 2021
Mar 10 2021
KShivendu moved T2731: scanner: strip the path passed as argument from output from Backlog to In progress on the Easy hack board.
Mar 8 2021
Mar 8 2021
The issue should be fixed now
vlorentz added a project to T3101: Latest versions on Pypi are an incompatible combination: Code scanner.
Mar 7 2021
Mar 7 2021
Mar 3 2021
Mar 3 2021
Dec 19 2020
Dec 19 2020
zack closed T2813: swh scanner db import does not validate SWHIDs as Resolved by committing rDTSCN33a9cd4eb965: DB import: skip invalid SWHIDs during import.
Dec 15 2020
Dec 15 2020
zack renamed T2812: scanner import db is slow, improve its performances from scanner: improve SWHID (txt) -> sqlite import time to scanner import db is slow, improve its performances.
Dec 2 2020
Dec 2 2020
zack added a project to T2836: swh scanner db import loads keeps all input SWHIDs in memory: Easy hack.
zack triaged T2836: swh scanner db import loads keeps all input SWHIDs in memory as Normal priority.
Nov 25 2020
Nov 25 2020
zack closed T2680: proxy support for swh scanner as Resolved by committing rDTSCN65f0b8e4c6ea: honor HTTP(S)_PROXY environment variables, to support HTTP proxies.
Nov 24 2020
Nov 24 2020
Nov 22 2020
Nov 22 2020
Nov 18 2020
Nov 18 2020
Nov 16 2020
Nov 16 2020
DanSeraf changed the status of T2760: swh-scanner: add support for local DB of known SWHIDs from Open to Work in Progress.
Nov 6 2020
Nov 6 2020
Oct 24 2020
Oct 24 2020
Oct 13 2020
Oct 13 2020
DanSeraf triaged T2692: Move the output related functions to another (sub)module as Normal priority.
DanSeraf closed T2690: swh scanner reports double results in ndjson format as Resolved by committing rDTSCNc2768d171a78: model: dropped _iter_nodes_attr function.
Oct 12 2020
Oct 12 2020
Oct 9 2020
Oct 9 2020
Sep 28 2020
Sep 28 2020
Sep 25 2020
Sep 25 2020
Sep 23 2020
Sep 23 2020
Sep 14 2020
Sep 14 2020
Sep 9 2020
Sep 9 2020
Sep 8 2020
Sep 8 2020
zack triaged T2572: swh-scanner: add support for authentication token to lift rate-limit as Normal priority.
zack renamed T2300: swh-scanner: print a nicer error message when rate limit is hit from scanner: print a nicer error message when rate limit is hit to swh-scanner: print a nicer error message when rate limit is hit.
Jun 22 2020
Jun 22 2020
DanSeraf closed T2364: scanner: file browser in the sunburst/dashboard output as Resolved by committing rDTSCN0f10ec6ae8fe: dashboard: file visualization per directory path.
Apr 30 2020
Apr 30 2020
DanSeraf closed T2365: scanner: add color legend for sunburst output as Resolved by committing rDTSCNfb8ae03e494c: plot: color legend.
Apr 29 2020
Apr 29 2020
DanSeraf closed T2363: scanner: json output should return both known and unknown files/dirs as Resolved by committing rDTSCN623a9dbe6157: ndjson output format.
Apr 23 2020
Apr 23 2020
olasd added a comment to T2363: scanner: json output should return both known and unknown files/dirs.
Just jumping in, I suggest using ndjson (newline-delimited json) instead of a full json tree, as the former is easier to stream / parse incrementally for large outputs (like the linux kernel).
zack added a comment to T2363: scanner: json output should return both known and unknown files/dirs.
In T2363#43710, @DanSeraf wrote:$ swh scanner scan -f json /tmp/test { "dir1": { "children": { "subdir1": { "children": { "text.txt": { "known": true, "swhid": "swh:1:cnt:ff5b57b7095eb5d168a36db6552ad2ce1f219bf6" }
Apr 22 2020
Apr 22 2020
DanSeraf added a comment to T2363: scanner: json output should return both known and unknown files/dirs.
The new json output will be like the following:
Apr 15 2020
Apr 15 2020
zack updated the task description for T2363: scanner: json output should return both known and unknown files/dirs.