Page MenuHomeSoftware Heritage
Feed All Stories

Jan 29 2021

ardumont claimed T2999: Optimize the number of HTTP requests sent by the cgit lister.
Jan 29 2021, 10:54 AM · CGit lister
ardumont added a comment to T376: ingest git.eclipse.org repositories.

yes, agreed.

Jan 29 2021, 10:34 AM · Archive coverage
ardumont added a comment to T2999: Optimize the number of HTTP requests sent by the cgit lister.

Analyzing further the suggestions using the deprecated swh-lister cache db table as data
point (production data) [1], 3 instances so far will generate sometimes wrong origin
urls with the suggested approach.

Jan 29 2021, 10:25 AM · CGit lister
rdicosmo added a comment to T376: ingest git.eclipse.org repositories.

Thanks @ardumont for experimenting with this. The 500 seems normal: we need to tell Eclipse about us first, I'll put you in touch. So maybe it's still a no-brainer, and we just need to document the "contant the owner to get whitelisted" human step :-)

Jan 29 2021, 10:04 AM · Archive coverage
vlorentz closed D4916: Run simulator tests on all known scheduling policies.
Jan 29 2021, 10:00 AM
vlorentz closed D4917: simulator: stop validating the scheduling policy in the CLI.
Jan 29 2021, 10:00 AM
vlorentz closed D4914: simulator: stop using the database as a cache for origin data.
Jan 29 2021, 10:00 AM
vlorentz closed D4915: simulator: record visit metrics alongside scheduler metrics.
Jan 29 2021, 10:00 AM
vlorentz committed rDSCHcf0583b07959: simulator: stop validating the scheduling policy in the CLI (authored by olasd).
simulator: stop validating the scheduling policy in the CLI
Jan 29 2021, 10:00 AM
vlorentz committed rDSCHebb5847ea2ee: Run simulator tests on all known scheduling policies (authored by olasd).
Run simulator tests on all known scheduling policies
Jan 29 2021, 10:00 AM
vlorentz committed rDSCH889839446eb8: simulator: stop using the database as a cache for origin data (authored by olasd).
simulator: stop using the database as a cache for origin data
Jan 29 2021, 10:00 AM
vlorentz committed rDSCH1f77521d486c: simulator: record visit metrics alongside scheduler metrics (authored by olasd).
simulator: record visit metrics alongside scheduler metrics
Jan 29 2021, 10:00 AM
vlorentz closed D4912: grab_next_visits: don't re-schedule visits too fast.
Jan 29 2021, 10:00 AM
vlorentz closed D4911: Allow overriding the timestamp of grab_next_visits.
Jan 29 2021, 10:00 AM
vlorentz committed rDSCH2b39cbcabf99: Allow overriding the timestamp of grab_next_visits (authored by olasd).
Allow overriding the timestamp of grab_next_visits
Jan 29 2021, 10:00 AM
vlorentz committed rDSCHc92ead5875ec: grab_next_visits: don't re-schedule visits too fast (authored by olasd).
grab_next_visits: don't re-schedule visits too fast
Jan 29 2021, 10:00 AM
vlorentz closed D4910: Construct grab_next_visits query arguments incrementally.
Jan 29 2021, 10:00 AM
vlorentz committed rDSCH7ffbdd1b3eb5: Construct grab_next_visits query arguments incrementally (authored by olasd).
Construct grab_next_visits query arguments incrementally
Jan 29 2021, 10:00 AM
vlorentz committed rDSCHea068b46a89e: simulator: add simple lister simulation (authored by vlorentz).
simulator: add simple lister simulation
Jan 29 2021, 10:00 AM
vlorentz closed D4909: simulator: add lister simulation.
Jan 29 2021, 10:00 AM
vlorentz committed rDSCH7af98e2bc048: Factor out ListedOrigin generation to use the OriginModel (authored by vlorentz).
Factor out ListedOrigin generation to use the OriginModel
Jan 29 2021, 10:00 AM
ardumont updated the task description for T2999: Optimize the number of HTTP requests sent by the cgit lister.
Jan 29 2021, 9:30 AM · CGit lister
ardumont added a parent task for T2999: Optimize the number of HTTP requests sent by the cgit lister: T376: ingest git.eclipse.org repositories.
Jan 29 2021, 9:24 AM · CGit lister
ardumont added a subtask for T376: ingest git.eclipse.org repositories: T2999: Optimize the number of HTTP requests sent by the cgit lister.
Jan 29 2021, 9:24 AM · Archive coverage
ardumont updated the task description for T2998: Deploy lister next-gen in staging.
Jan 29 2021, 9:22 AM · System administration, Lister
ardumont moved T2998: Deploy lister next-gen in staging from in-progress to deployed/landed/monitoring on the System administration board.
Jan 29 2021, 9:19 AM · System administration, Lister
ardumont updated the task description for T2998: Deploy lister next-gen in staging.
Jan 29 2021, 9:19 AM · System administration, Lister
ardumont updated the task description for T2998: Deploy lister next-gen in staging.
Jan 29 2021, 9:18 AM · System administration, Lister
ardumont added a comment to T2998: Deploy lister next-gen in staging.

Current status of all listings:

Jan 29 2021, 9:11 AM · System administration, Lister
ardumont updated the task description for T2998: Deploy lister next-gen in staging.
Jan 29 2021, 9:11 AM · System administration, Lister
ardumont added a comment to T2998: Deploy lister next-gen in staging.

npm listing done, so status ok as well:

Jan 29 2021, 9:10 AM · System administration, Lister
swh-public-ci added a comment to D4931: Add mapping of definitions and harvests.

Build is green

Jan 29 2021, 7:43 AM
TG1999 updated the diff for D4931: Add mapping of definitions and harvests.

Add mapping for definitions and harvests
Add functions map_row, map_definition, map_harvest to check whether swh archive is able to map clearlydefined object or not

Jan 29 2021, 7:42 AM
Harbormaster failed remote builds in B18871: Diff 17730 for D4931: Add mapping of definitions and harvests!
Jan 29 2021, 7:40 AM
swh-public-ci added a comment to D4931: Add mapping of definitions and harvests.

Build has FAILED

Jan 29 2021, 7:40 AM
TG1999 updated the diff for D4931: Add mapping of definitions and harvests.

Add mapping for definitions and harvests
Add functions map_row, map_definition, map_harvest to check whether swh archive is able to map clearlydefined object or not

Jan 29 2021, 7:39 AM

Jan 28 2021

swh-public-ci added a comment to D4931: Add mapping of definitions and harvests.

Build is green

Jan 28 2021, 8:27 PM
TG1999 updated the diff for D4931: Add mapping of definitions and harvests.

Add mapping for definitions and harvests
Add functions map_row, map_definition, map_harvest to check whether swh archive is able to map clearlydefined object or not

Jan 28 2021, 8:26 PM
rdicosmo added a comment to T2912: Next generation archive counters.

Bloom filters are still on the table for other use cases, like testing super quickly for contents that we do not have, but if nobody has strong objections, this seems the way to go for the counters (very small footprint, small under/over counting errors, thanks Philippe Flajolet's magic :-))

Jan 28 2021, 7:27 PM · Roadmap 2021, System administration, Monitoring, Web app
anlambert closed D4967: launchpad: Remove call to dataclasses.asdict on lister state.
Jan 28 2021, 7:26 PM
anlambert committed rDLS5aa7c8f2b21f: launchpad: Remove call to dataclasses.asdict on lister state (authored by anlambert).
launchpad: Remove call to dataclasses.asdict on lister state
Jan 28 2021, 7:26 PM
ardumont accepted D4967: launchpad: Remove call to dataclasses.asdict on lister state.
Jan 28 2021, 7:22 PM
anlambert requested review of D4967: launchpad: Remove call to dataclasses.asdict on lister state.
Jan 28 2021, 7:21 PM
anlambert added a revision to T3003: next gen lister: Make lister flush their visit state regularly: D4967: launchpad: Remove call to dataclasses.asdict on lister state.
Jan 28 2021, 7:18 PM · Lister
anlambert closed D4966: launchpad: Prevent error due to origin listed twice.
Jan 28 2021, 7:17 PM
anlambert committed rDLS46f5a50099f9: launchpad: Prevent error due to origin listed twice (authored by anlambert).
launchpad: Prevent error due to origin listed twice
Jan 28 2021, 7:17 PM
ardumont accepted D4966: launchpad: Prevent error due to origin listed twice.

Nice catch

Jan 28 2021, 7:15 PM
anlambert requested review of D4966: launchpad: Prevent error due to origin listed twice.
Jan 28 2021, 7:13 PM
anlambert added a revision to T3003: next gen lister: Make lister flush their visit state regularly: D4966: launchpad: Prevent error due to origin listed twice.
Jan 28 2021, 7:10 PM · Lister
ardumont updated the task description for T2998: Deploy lister next-gen in staging.
Jan 28 2021, 7:01 PM · System administration, Lister
ardumont added a comment to T2998: Deploy lister next-gen in staging.

npm run scheduled, run in progress:

Jan 28 2021, 7:00 PM · System administration, Lister
ardumont updated the task description for T2998: Deploy lister next-gen in staging.
Jan 28 2021, 6:53 PM · System administration, Lister
ardumont added a comment to T2998: Deploy lister next-gen in staging.

pypi run triggered itself and went well all alone (cool):

Jan 28 2021, 6:53 PM · System administration, Lister
ardumont committed rDLS130ad7d73ee0: Make debian lister constructors compatible with credentials (authored by ardumont).
Make debian lister constructors compatible with credentials
Jan 28 2021, 6:51 PM
ardumont updated the task description for T2998: Deploy lister next-gen in staging.
Jan 28 2021, 6:50 PM · System administration, Lister
ardumont updated the task description for T2998: Deploy lister next-gen in staging.
Jan 28 2021, 6:50 PM · System administration, Lister
ardumont updated the task description for T2998: Deploy lister next-gen in staging.
Jan 28 2021, 6:49 PM · System administration, Lister
olasd committed rSPSITE09a6a005fea0: Single-thread git loader (authored by olasd).
Single-thread git loader
Jan 28 2021, 6:47 PM
ardumont added a comment to T2998: Deploy lister next-gen in staging.

Status lister: ok (with local patch):

Jan 28 2021, 6:40 PM · System administration, Lister
anlambert closed D4964: launchpad/tasks: Fix ping task function name.
Jan 28 2021, 6:18 PM
anlambert committed rDLSe8725eb2476c: launchpad/tasks: Fix ping task function name (authored by anlambert).
launchpad/tasks: Fix ping task function name
Jan 28 2021, 6:18 PM
swh-public-ci added a comment to D4931: Add mapping of definitions and harvests.

Build is green

Jan 28 2021, 6:16 PM
TG1999 updated the diff for D4931: Add mapping of definitions and harvests.

Add mapping for definitions and harvests
Add functions map_row, map_definition, map_harvest to check whether swh archive is able to map clearlydefined object or not

Jan 28 2021, 6:15 PM
vlorentz updated the task description for T3004: swh-storage documentation needs a better introduction.
Jan 28 2021, 6:02 PM · Documentation, Storage manager
vlorentz updated the task description for T3004: swh-storage documentation needs a better introduction.
Jan 28 2021, 6:02 PM · Documentation, Storage manager
vsellier added a comment to T2975: Disk replacement on esnode1.

Ticket opened via the dell support.
The disk should be delivered the Monday 1st February 2021, the DSI is informed

Jan 28 2021, 5:55 PM · System administration
ardumont created P932 launchpad lister incremental run failing.
Jan 28 2021, 5:42 PM
ardumont added a comment to T3003: next gen lister: Make lister flush their visit state regularly.

I thought having fixed that bug (also encountered when developing the lister).

Jan 28 2021, 5:40 PM · Lister
swh-public-ci added a comment to D4964: launchpad/tasks: Fix ping task function name.

Build is green

Jan 28 2021, 5:39 PM
ardumont updated the task description for T2998: Deploy lister next-gen in staging.
Jan 28 2021, 5:38 PM · System administration, Lister
ardumont added a comment to T2998: Deploy lister next-gen in staging.

lister-cran status: run ko [1]

swhworker@worker0:~$ SWH_CONFIG_FILENAME=/etc/softwareheritage/lister.yml swh lister run --lister cran
Traceback (most recent call last):
  File "/usr/bin/swh", line 11, in <module>
    load_entry_point('swh.core==0.11.0', 'console_scripts', 'swh')()
...
  File "/usr/lib/python3/dist-packages/swh/core/api/__init__.py", line 344, in raise_for_status
    raise exception from None
swh.core.api.RemoteException: <RemoteException 500 TypeError: ["can not serialize 'Attribute' object"]>

[1] https://sentry.softwareheritage.org/share/issue/2cd53c7575834b1aaf65760b80bcbcef/

Jan 28 2021, 5:37 PM · System administration, Lister
anlambert updated the diff for D4964: launchpad/tasks: Fix ping task function name.

Rebase

Jan 28 2021, 5:36 PM
moranegg closed T2578: Review Wikidata property name and meaning [P6138] as Resolved.
Jan 28 2021, 5:33 PM · Software Stories
ardumont accepted D4964: launchpad/tasks: Fix ping task function name.

:)

Jan 28 2021, 5:33 PM
ardumont closed T3002: Current next-gen lister cran failing to list as Resolved.

It's ok now.

Jan 28 2021, 5:27 PM · Origin-CRAN, Lister
ardumont added a comment to T3002: Current next-gen lister cran failing to list.

Yes, I completely forgot about your diff that fixed it.

Jan 28 2021, 5:25 PM · Origin-CRAN, Lister
ardumont created P931 staging: cgit instance eclipse: remote server closed connection.
Jan 28 2021, 5:21 PM
ardumont updated the task description for T2998: Deploy lister next-gen in staging.
Jan 28 2021, 5:20 PM · System administration, Lister
ardumont added a comment to T2998: Deploy lister next-gen in staging.

Node patch D4965, this gets better, the launchpad listed origins:

Jan 28 2021, 5:19 PM · System administration, Lister
ardumont updated the task description for T2998: Deploy lister next-gen in staging.
Jan 28 2021, 5:19 PM · System administration, Lister
olasd committed rSPSITEa9a0b1e77acf: Kick down git loader concurrency (authored by olasd).
Kick down git loader concurrency
Jan 28 2021, 5:13 PM
vlorentz updated the task description for T3004: swh-storage documentation needs a better introduction.
Jan 28 2021, 5:12 PM · Documentation, Storage manager
vlorentz updated the task description for T3004: swh-storage documentation needs a better introduction.
Jan 28 2021, 5:12 PM · Documentation, Storage manager
vlorentz updated the task description for T3004: swh-storage documentation needs a better introduction.
Jan 28 2021, 5:12 PM · Documentation, Storage manager
ardumont added a project to T3003: next gen lister: Make lister flush their visit state regularly: Lister.
Jan 28 2021, 5:12 PM · Lister
vlorentz triaged T3004: swh-storage documentation needs a better introduction as Normal priority.
Jan 28 2021, 5:12 PM · Documentation, Storage manager
ardumont closed D4965: pattern: Make lister flush regularly origins to scheduler.
Jan 28 2021, 5:11 PM
ardumont committed rDLS0ad37740d9d7: pattern: Make lister flush regularly origins to scheduler (authored by ardumont).
pattern: Make lister flush regularly origins to scheduler
Jan 28 2021, 5:11 PM
ardumont added a comment to D4965: pattern: Make lister flush regularly origins to scheduler.

Nevertheless, errors like T3003#57551 can still appear if there is duplicate origins in the sent list.

Jan 28 2021, 5:10 PM
ardumont added a comment to T376: ingest git.eclipse.org repositories.

In the context of deploying the next gen lister in staging (T2998), i also tried the eclipse cgit instance

Jan 28 2021, 5:09 PM · Archive coverage
anlambert accepted D4965: pattern: Make lister flush regularly origins to scheduler.

Looks good to me !

Jan 28 2021, 5:05 PM
anlambert added a comment to T3003: next gen lister: Make lister flush their visit state regularly.

I thought having fixed that bug (also encountered when developing the lister).

Jan 28 2021, 5:02 PM · Lister
ardumont added a comment to T2998: Deploy lister next-gen in staging.

Note that this lister seems to need some writing improvments though.
It seemed to have flushed the writing only at the end of the listing.
If that's the real behavior (i'll need to check), that won't bode well for relatively high dimensioned instance like the cgit eclispe instance for example.

cgit lister should flush origins after each page, which instance has been listed here ?

Some listers like debian might flush a large amount of origins per page, will be curious to see how it goes.

Jan 28 2021, 5:02 PM · System administration, Lister
ardumont requested review of D4965: pattern: Make lister flush regularly origins to scheduler.
Jan 28 2021, 4:58 PM
ardumont added a revision to T3003: next gen lister: Make lister flush their visit state regularly: D4965: pattern: Make lister flush regularly origins to scheduler.
Jan 28 2021, 4:55 PM · Lister
ardumont updated the task description for T2998: Deploy lister next-gen in staging.
Jan 28 2021, 4:39 PM · System administration, Lister
ardumont added a comment to T2998: Deploy lister next-gen in staging.

lister launchpad run ko, see details [1]

Jan 28 2021, 4:38 PM · System administration, Lister
ardumont added a comment to T3003: next gen lister: Make lister flush their visit state regularly.

Make them flush their current state at regular interval sounds like a better behavior.

Jan 28 2021, 4:37 PM · Lister
anlambert added a comment to T3002: Current next-gen lister cran failing to list.

My guess is that swh-scheduler 0.9.2 is not installed on staging, issue has been fixed in rDSCH2906b4e8a08517f5e3d86272232ad8ba926a43d7.

Jan 28 2021, 4:22 PM · Origin-CRAN, Lister
anlambert requested review of D4964: launchpad/tasks: Fix ping task function name.
Jan 28 2021, 4:16 PM