Page MenuHomeSoftware Heritage

Code scannerFolder
ActivePublic

Members

  • This project does not have any members.
  • View All

Details

Description

Source code scanner using the Software Heritage archive as knowledge base.

Recent Activity

Mon, May 10

zack triaged T3318: scanner should use the known() method of web.client as Low priority.
Mon, May 10, 9:02 AM · Code scanner

Fri, Apr 23

vlorentz assigned T3136: Prior art detection service to zack.
Fri, Apr 23, 4:51 PM · Code scanner, Scientific Community Building, Roadmap 2021, meta-task

Sun, Apr 18

aastha1999 added a comment to T3209: Fix swh-scanner for python > 3.7.

I'm sorry for the delay. I was unaware that this task was assigned to me. I missed it but solved it as soon as I came to know about it. Also, I wanted to ask if there is a way to put a limit in requirements.txt according to the python version used?

Sun, Apr 18, 3:54 PM · Code scanner

Apr 6 2021

zack added a project to T3209: Fix swh-scanner for python > 3.7: Code scanner.
Apr 6 2021, 12:01 PM · Code scanner

Mar 26 2021

DanSeraf closed T2679: Use the `swh.model` version of `extract_regex_objs` as Resolved.
Mar 26 2021, 4:54 PM · Code scanner
DanSeraf added a revision to T2679: Use the `swh.model` version of `extract_regex_objs`: D5359: scanner: use 'extract_regex_objs' from swh.model.
Mar 26 2021, 2:39 PM · Code scanner

Mar 19 2021

vlorentz triaged T3136: Prior art detection service as Normal priority.
Mar 19 2021, 4:24 PM · Code scanner, Scientific Community Building, Roadmap 2021, meta-task

Mar 16 2021

zack placed T3136: Prior art detection service up for grabs.

@shashikant231 please do not claim tasks, thanks.

Mar 16 2021, 7:00 PM · Code scanner, Scientific Community Building, Roadmap 2021, meta-task
shashikant231 claimed T3136: Prior art detection service.

Hi @rdicosmo, can you guide me to start working on this issue.I have already built this project in my system.

Mar 16 2021, 4:00 PM · Code scanner, Scientific Community Building, Roadmap 2021, meta-task

Mar 15 2021

rdicosmo added a subtask for T3136: Prior art detection service: T3112: Provenance index for the full archive.
Mar 15 2021, 8:59 PM · Code scanner, Scientific Community Building, Roadmap 2021, meta-task
rdicosmo added a project to T3136: Prior art detection service: Code scanner.
Mar 15 2021, 8:58 PM · Code scanner, Scientific Community Building, Roadmap 2021, meta-task

Mar 10 2021

KShivendu closed T2731: scanner: strip the path passed as argument from output as Resolved.
Mar 10 2021, 6:59 PM · Easy hack, Code scanner
KShivendu moved T2731: scanner: strip the path passed as argument from output from Backlog to In progress on the Easy hack board.
Mar 10 2021, 5:34 PM · Easy hack, Code scanner

Mar 8 2021

vlorentz closed T3101: Latest versions on Pypi are an incompatible combination as Resolved.

The issue should be fixed now

Mar 8 2021, 5:03 PM · Code scanner
vlorentz added a project to T3101: Latest versions on Pypi are an incompatible combination: Code scanner.
Mar 8 2021, 4:31 PM · Code scanner

Mar 7 2021

KShivendu added a revision to T2731: scanner: strip the path passed as argument from output: D5213: swh/scanner : Strip root path from json output.
Mar 7 2021, 2:34 PM · Easy hack, Code scanner

Mar 3 2021

zack added a project to T2730: scanner: should output the root SWHID as well: Easy hack.
Mar 3 2021, 9:49 AM · Easy hack, Code scanner
zack added a project to T2731: scanner: strip the path passed as argument from output: Easy hack.
Mar 3 2021, 9:49 AM · Easy hack, Code scanner

Dec 19 2020

zack placed T2300: swh-scanner: print a nicer error message when rate limit is hit up for grabs.
Dec 19 2020, 9:48 PM · Easy hack, Code scanner
zack closed T2813: swh scanner db import does not validate SWHIDs as Resolved by committing rDTSCN33a9cd4eb965: DB import: skip invalid SWHIDs during import.
Dec 19 2020, 9:47 PM · Code scanner
zack closed T2812: scanner import db is slow, improve its performances as Resolved by committing rDTSCNfe84403087cc: DB import: massive speed up, via sqlite tuning and better mem handling.
Dec 19 2020, 9:47 PM · Code scanner
zack closed T2836: swh scanner db import loads keeps all input SWHIDs in memory as Resolved by committing rDTSCNfe84403087cc: DB import: massive speed up, via sqlite tuning and better mem handling.
Dec 19 2020, 9:47 PM · Easy hack, Code scanner

Dec 15 2020

zack updated the task description for T2812: scanner import db is slow, improve its performances.
Dec 15 2020, 5:57 PM · Code scanner
zack updated the task description for T2812: scanner import db is slow, improve its performances.
Dec 15 2020, 5:50 PM · Code scanner
zack renamed T2812: scanner import db is slow, improve its performances from scanner: improve SWHID (txt) -> sqlite import time to scanner import db is slow, improve its performances.
Dec 15 2020, 5:48 PM · Code scanner

Dec 2 2020

zack added a project to T2836: swh scanner db import loads keeps all input SWHIDs in memory: Easy hack.
Dec 2 2020, 9:26 AM · Easy hack, Code scanner
zack triaged T2836: swh scanner db import loads keeps all input SWHIDs in memory as Normal priority.
Dec 2 2020, 9:26 AM · Easy hack, Code scanner

Nov 25 2020

zack triaged T2813: swh scanner db import does not validate SWHIDs as Low priority.
Nov 25 2020, 10:37 PM · Code scanner
zack triaged T2812: scanner import db is slow, improve its performances as Low priority.
Nov 25 2020, 10:00 PM · Code scanner
zack closed T2680: proxy support for swh scanner as Resolved by committing rDTSCN65f0b8e4c6ea: honor HTTP(S)_PROXY environment variables, to support HTTP proxies.
Nov 25 2020, 4:42 PM · Easy hack, Code scanner

Nov 24 2020

DanSeraf closed T2760: swh-scanner: add support for local DB of known SWHIDs as Resolved.
Nov 24 2020, 1:54 PM · Code scanner

Nov 22 2020

DanSeraf added a revision to T2760: swh-scanner: add support for local DB of known SWHIDs: D4552: 'db serve' option to start the API service.
Nov 22 2020, 4:19 PM · Code scanner

Nov 18 2020

DanSeraf added a revision to T2760: swh-scanner: add support for local DB of known SWHIDs: D4508: scanner: 'db import' option to create local database with known swhids.
Nov 18 2020, 2:24 PM · Code scanner

Nov 16 2020

DanSeraf changed the status of T2760: swh-scanner: add support for local DB of known SWHIDs from Open to Work in Progress.
Nov 16 2020, 10:41 AM · Code scanner

Nov 6 2020

zack updated the task description for T2760: swh-scanner: add support for local DB of known SWHIDs.
Nov 6 2020, 2:50 PM · Code scanner
zack triaged T2760: swh-scanner: add support for local DB of known SWHIDs as Normal priority.
Nov 6 2020, 2:32 PM · Code scanner

Oct 24 2020

zack triaged T2731: scanner: strip the path passed as argument from output as Low priority.
Oct 24 2020, 5:01 PM · Easy hack, Code scanner
zack updated the task description for T2730: scanner: should output the root SWHID as well.
Oct 24 2020, 4:58 PM · Easy hack, Code scanner
zack updated the task description for T2730: scanner: should output the root SWHID as well.
Oct 24 2020, 4:58 PM · Easy hack, Code scanner
zack triaged T2730: scanner: should output the root SWHID as well as Normal priority.
Oct 24 2020, 4:58 PM · Easy hack, Code scanner

Oct 13 2020

DanSeraf triaged T2692: Move the output related functions to another (sub)module as Normal priority.
Oct 13 2020, 9:57 AM · Code scanner
DanSeraf closed T2690: swh scanner reports double results in ndjson format as Resolved by committing rDTSCNc2768d171a78: model: dropped _iter_nodes_attr function.
Oct 13 2020, 9:36 AM · Code scanner

Oct 12 2020

zack triaged T2679: Use the `swh.model` version of `extract_regex_objs` as Low priority.
Oct 12 2020, 6:59 PM · Code scanner
zack triaged T2690: swh scanner reports double results in ndjson format as Normal priority.
Oct 12 2020, 6:59 PM · Code scanner
zvr created T2690: swh scanner reports double results in ndjson format.
Oct 12 2020, 6:47 PM · Code scanner

Oct 9 2020

zack added a project to T2680: proxy support for swh scanner: Easy hack.
Oct 9 2020, 2:59 PM · Easy hack, Code scanner
zack triaged T2680: proxy support for swh scanner as Normal priority.
Oct 9 2020, 2:58 PM · Easy hack, Code scanner
acezar created T2679: Use the `swh.model` version of `extract_regex_objs`.
Oct 9 2020, 2:47 PM · Code scanner

Sep 28 2020

tenma closed T2632: swh scanner fail to start when configuration file is missing as Resolved.
Sep 28 2020, 10:14 AM · Code scanner

Sep 25 2020

tenma added a revision to T2632: swh scanner fail to start when configuration file is missing: D4046: Fix default config file may be absent in scanner cli.
Sep 25 2020, 11:44 AM · Code scanner