Already implemented in D4193
- Queries
- All Stories
- Search
- Advanced Search
- Transactions
- Transaction Logs
Feed Advanced Search
Advanced Search
Advanced Search
Mar 26 2021
Mar 26 2021
DanSeraf closed T2570: swh-identify: support exclusion patterns (e.g., for .git/) as swh-scanner does as Resolved.
removed unnecessary conversion
Feb 5 2021
Feb 5 2021
reimplement algo_min
improve logging information
Feb 4 2021
Feb 4 2021
run random algorithm only once
DanSeraf committed rDTSCN3004b66787b2: use os.listdir() instead of os.walk() to avoid symlinks (authored by DanSeraf).
use os.listdir() instead of os.walk() to avoid symlinks
DanSeraf updated the diff for D5011: scanner-benchmark: use os.listdir() instead of os.walk() to avoid symlinks.
rebase
DanSeraf updated the diff for D5011: scanner-benchmark: use os.listdir() instead of os.walk() to avoid symlinks.
rebase
DanSeraf updated the diff for D5011: scanner-benchmark: use os.listdir() instead of os.walk() to avoid symlinks.
reword commit
DanSeraf updated the diff for D5011: scanner-benchmark: use os.listdir() instead of os.walk() to avoid symlinks.
exclude symlink
Feb 2 2021
Feb 2 2021
DanSeraf committed rDTSCN3d3665a4f5bb: algo_min: delete the upstream directories if a (sub)directory is unknown (authored by DanSeraf).
algo_min: delete the upstream directories if a (sub)directory is unknown
DanSeraf requested review of D4996: algo_min: delete the upstream directories if a (sub)directory is unknown.
DanSeraf committed rDTSCNc42e643aa512: model: wrong iteration in 'iterate_bfs' function (authored by DanSeraf).
model: wrong iteration in 'iterate_bfs' function
DanSeraf committed rDTSCN0d3b5cb86144: make 'set_children_status' works with different kind of nodes (authored by DanSeraf).
make 'set_children_status' works with different kind of nodes
DanSeraf committed rDTSCNb601f382db64: file_priority: remove children only when the unset directory is known (authored by DanSeraf).
file_priority: remove children only when the unset directory is known
Jan 28 2021
Jan 28 2021
DanSeraf committed rDTSCN5e01c09af4c6: retry request in case of backend failure (authored by DanSeraf).
retry request in case of backend failure
DanSeraf committed rDTSCNebad16c02da6: algo_min: remove the current node as well (authored by DanSeraf).
algo_min: remove the current node as well
Jan 26 2021
Jan 26 2021
DanSeraf updated the diff for D4947: scanner-benchmark: algo_min fixed, retry mechanism on request error.
commit message
DanSeraf retitled D4947: scanner-benchmark: algo_min fixed, retry mechanism on request error from scanner-benchmark: algo_miv fixed, retry mechanism on request error to scanner-benchmark: algo_min fixed, retry mechanism on request error.
DanSeraf retitled D4947: scanner-benchmark: algo_min fixed, retry mechanism on request error from scanner-benchmark: algo_mix fixed, retry mechanism on request error to scanner-benchmark: algo_miv fixed, retry mechanism on request error.
DanSeraf requested review of D4947: scanner-benchmark: algo_min fixed, retry mechanism on request error.
Jan 18 2021
Jan 18 2021
DanSeraf committed rDTSCN5cd9f762467e: fix: the temporary directory is removed by tempfile (authored by DanSeraf).
fix: the temporary directory is removed by tempfile
DanSeraf requested review of D4875: scanner-benchmark: the temporary directory is removed by tempdir.
Jan 13 2021
Jan 13 2021
print results as a csv
DanSeraf committed rDTSCN9e4df16d9486: extract repositories in temporary directories (authored by DanSeraf).
extract repositories in temporary directories
Dec 19 2020
Dec 19 2020
scanner experiments
wrong algorithm name in example
Dec 17 2020
Dec 17 2020
variable name in run_benchmark.sh
remove git missing imports in mypy.ini
requested changes
+ algorithms can be specified from run_benchmark.sh
+ if "random" algorithm is specified, benchmark.py will run three experiments using the default seeds (10, 20, 30)
Dec 11 2020
Dec 11 2020
Nov 24 2020
Nov 24 2020
DanSeraf committed rDTSCN09c28d60f1ad: 'db serve' option to start the API service (authored by DanSeraf).
'db serve' option to start the API service
Minor changes
rebase
- changed module name
- query the database with only one cursor
- get SWHID known status directly when generating the response
Nov 23 2020
Nov 23 2020
Nov 22 2020
Nov 22 2020
Nov 21 2020
Nov 21 2020
DanSeraf committed rDTSCN521420e7d5eb: 'db import' option to create local database with known swhids (authored by DanSeraf).
'db import' option to create local database with known swhids
Nov 20 2020
Nov 20 2020
DanSeraf updated the diff for D4508: scanner: 'db import' option to create local database with known swhids.
minor changes:
- mypy annotation
- tests
Nov 19 2020
Nov 19 2020
DanSeraf updated the diff for D4508: scanner: 'db import' option to create local database with known swhids.
minor changes
DanSeraf updated the diff for D4508: scanner: 'db import' option to create local database with known swhids.
requested changes:
- SWHID as PRIMARY KEY in db
- SWHID insertion without query the Web API
- bulk insert of SWHID values in db
Nov 18 2020
Nov 18 2020
DanSeraf updated the diff for D4508: scanner: 'db import' option to create local database with known swhids.
requested changes
Nov 16 2020
Nov 16 2020
DanSeraf changed the status of T2760: swh-scanner: add support for local DB of known SWHIDs from Open to Work in Progress.
Oct 13 2020
Oct 13 2020
DanSeraf triaged T2692: Move the output related functions to another (sub)module as Normal priority.
DanSeraf committed rDTSCNc2768d171a78: model: dropped _iter_nodes_attr function (authored by DanSeraf).
model: dropped _iter_nodes_attr function
DanSeraf closed T2690: swh scanner reports double results in ndjson format as Resolved by committing rDTSCNc2768d171a78: model: dropped _iter_nodes_attr function.
Sep 11 2020
Sep 11 2020
DanSeraf committed rDTSCNf838fed672d8: cli: don't check for glob pattern before translating into regex (authored by DanSeraf).
cli: don't check for glob pattern before translating into regex
DanSeraf updated the diff for D3924: cli: don't check for glob pattern before translating into regex.
tests
Sep 10 2020
Sep 10 2020
DanSeraf committed rDTSCN2fb9cb1c59e2: docs: readme and cli description update (authored by DanSeraf).
docs: readme and cli description update
Sep 9 2020
Sep 9 2020
requested changes
Sep 8 2020
Sep 8 2020
index: add swh.scanner
Sep 3 2020
Sep 3 2020
Jul 3 2020
Jul 3 2020
Jun 22 2020
Jun 22 2020
DanSeraf committed rDTSCN0f10ec6ae8fe: dashboard: file visualization per directory path (authored by DanSeraf).
dashboard: file visualization per directory path
DanSeraf closed T2364: scanner: file browser in the sunburst/dashboard output as Resolved by committing rDTSCN0f10ec6ae8fe: dashboard: file visualization per directory path.
- non-minified css in assets
Jun 20 2020
Jun 20 2020
- workaround to check the table body
- bootstrap css update
In D3293#81217, @vlorentz wrote:In D3293#81209, @DanSeraf wrote:Unfortunately the expected values can't be tested.
Why not?
Because the dash_html_components checks for object identity only. I wrote a comment inside the test.
Jun 19 2020
Jun 19 2020
In D3293#80514, @vlorentz wrote:
- It's missing tests. I understand it's not easy to do for a GUI, but could you see what you can do about it?
- init.py in dashboard directory
- css as static asset
Jun 16 2020
Jun 16 2020