
Mar 16 2021

vlorentz updated the task description for T3090: Make loaders not rely on the 'metadata' column of the 'revision' table.
Mar 16 2021, 12:28 PM · Core Loader
vlorentz updated the task description for T3090: Make loaders not rely on the 'metadata' column of the 'revision' table.
Mar 16 2021, 12:27 PM · Core Loader
vlorentz updated the task description for T3090: Make loaders not rely on the 'metadata' column of the 'revision' table.
Mar 16 2021, 12:27 PM · Core Loader
vlorentz triaged T3141: Make loaders pass ExtId objects between functions instead of "known_artifact" as Normal priority.
Mar 16 2021, 12:22 PM · Core Loader
vlorentz triaged T3140: Make loaders use ExtId to find what artifacts are already loaded, in addition to revision.metadata as Normal priority.
Mar 16 2021, 12:19 PM · Core Loader
vlorentz triaged T3139: Make loaders stop reading from the 'metadata' column of the 'revision' table as Normal priority.
Mar 16 2021, 12:17 PM · Core Loader
vlorentz triaged T3138: Make loaders stop writing in the 'metadata' column of the 'revision' table as Normal priority.
Mar 16 2021, 12:17 PM · Core Loader

Mar 15 2021

vlorentz added a subtask for T2471: NPM package angular-ts-manage fails to be properly loaded: T3089: Remove the 'metadata' column of the 'revision' table.
Mar 15 2021, 12:32 PM · Core Loader
vlorentz lowered the priority of T2471: NPM package angular-ts-manage fails to be properly loaded from High to Normal.
Mar 15 2021, 12:32 PM · Core Loader

Mar 11 2021

douardda closed T2849: Design and implement a mapping from "original VCS ids" to SWHIDs to help incremental loaders, a subtask of T3090: Make loaders not rely on the 'metadata' column of the 'revision' table, as Resolved.
Mar 11 2021, 2:55 PM · Core Loader

Mar 5 2021

vlorentz added a subtask for T3090: Make loaders not rely on the 'metadata' column of the 'revision' table: T2849: Design and implement a mapping from "original VCS ids" to SWHIDs to help incremental loaders.
Mar 5 2021, 12:29 PM · Core Loader
vlorentz triaged T3090: Make loaders not rely on the 'metadata' column of the 'revision' table as Normal priority.
Mar 5 2021, 12:29 PM · Core Loader

Feb 17 2021

anlambert added a revision to T3030: Improve loaders to deal with new visit status events: D5096: common/origin_save: Handle new "not_found" origin visit status.
Feb 17 2021, 4:45 PM · Core Loader
ardumont closed D5071: Unify loader instantiation.
Feb 17 2021, 12:03 PM · Core Loader

Feb 16 2021

swh-public-ci added a comment to D5071: Unify loader instantiation.

Build is green

Feb 16 2021, 7:11 PM · Core Loader
ardumont updated the diff for D5071: Unify loader instantiation.

Rework commit message (aligns with diff)

Feb 16 2021, 7:09 PM · Core Loader
olasd accepted D5071: Unify loader instantiation.

I wonder what would break if the new methods were just put in BaseLoader and the PackageLoader was made to inherit BaseLoader

I don't think anything would break. I'm just not sure the "indirection" would be clear in terms of code readability...

I was wrong. mypy is not happy.

Some signatures would need changing, notably load, prepare and prepare_origin_visit, to drop the spurious (I think) {*args, **kwargs} we are declaring.
(I don't think they are used any more, aside from the extra logging arguments.)

But still, if we go that way, I'd be comfortable doing it in another diff (maybe).

Feb 16 2021, 6:57 PM · Core Loader
ardumont updated the summary of D5071: Unify loader instantiation.
Feb 16 2021, 6:57 PM · Core Loader
ardumont added a comment to D5071: Unify loader instantiation.

I wonder what would break if the new methods were just put in BaseLoader and the PackageLoader was made to inherit BaseLoader

I don't think anything would break. I'm just not sure the "indirection" would be clear in terms of code readability...

I was wrong. mypy is not happy.

Some signatures would need changing, notably load, prepare and prepare_origin_visit, to drop the spurious (I think) {*args, **kwargs} we are declaring.
(I don't think they are used any more, aside from the extra logging arguments.)

But still, if we go that way, I'd be comfortable doing it in another diff (maybe).

$ tox -e mypy
GLOB sdist-make: /home/tony/work/inria/repo/swh/swh-environment/swh-loader-core/setup.py
...
mypy run-test: commands[0] | mypy swh
swh/loader/package/loader.py:289: error: Signature of "load" incompatible with supertype "BaseLoader"
swh/loader/package/loader.py:381: error: Argument "date" to "OriginVisit" has incompatible type "Optional[datetime]"; expected "datetime"
swh/loader/package/loader.py:570: error: Argument "discovery_date" to "RawExtrinsicMetadata" has incompatible type "Optional[datetime]"; expected "datetime"
swh/loader/package/loader.py:692: error: Argument "discovery_date" to "RawExtrinsicMetadata" has incompatible type "Optional[datetime]"; expected "datetime"
swh/loader/package/loader.py:727: error: Argument "discovery_date" to "RawExtrinsicMetadata" has incompatible type "Optional[datetime]"; expected "datetime"
swh/loader/package/loader.py:756: error: Argument "discovery_date" to "RawExtrinsicMetadata" has incompatible type "Optional[datetime]"; expected "datetime"
swh/loader/package/pypi/loader.py:151: error: Item "None" of "Optional[datetime]" has no attribute "isoformat"
swh/loader/package/npm/loader.py:177: error: Item "None" of "Optional[datetime]" has no attribute "isoformat"
swh/loader/package/nixguix/loader.py:218: error: Item "None" of "Optional[datetime]" has no attribute "isoformat"
swh/loader/package/deposit/loader.py:208: error: Item "None" of "Optional[datetime]" has no attribute "isoformat"
swh/loader/package/deposit/loader.py:244: error: Signature of "load" incompatible with supertype "BaseLoader"
swh/loader/package/debian/loader.py:237: error: Item "None" of "Optional[datetime]" has no attribute "isoformat"
swh/loader/package/cran/loader.py:129: error: Item "None" of "Optional[datetime]" has no attribute "isoformat"
swh/loader/package/archive/loader.py:170: error: Item "None" of "Optional[datetime]" has no attribute "isoformat"
Found 14 errors in 8 files (checked 71 source files)
Feb 16 2021, 6:52 PM · Core Loader
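Most of the errors in the mypy run above have the same shape: a value typed Optional[datetime] is used where a plain datetime is expected. The following is a minimal sketch of that pattern and one common way to satisfy the type checker. The class bodies, the visit_date attribute and the returned dictionaries are assumptions made for illustration; they are not the actual swh-loader-core code.

```python
from datetime import datetime, timezone
from typing import Any, Dict, Optional


class BaseLoader:
    """Hypothetical stand-in for the real base class (illustration only)."""

    def __init__(self, visit_date: Optional[datetime] = None) -> None:
        # The date may be unknown at construction time, hence Optional.
        self.visit_date = visit_date

    def load(self, *args: Any, **kwargs: Any) -> Dict[str, str]:
        return {"status": "uneventful"}


class PackageLoader(BaseLoader):
    """Hypothetical subclass; it keeps the parent's parameters for load()."""

    def load(self, *args: Any, **kwargs: Any) -> Dict[str, str]:
        visit_date = self.visit_date
        # Calling visit_date.isoformat() directly would be rejected by mypy,
        # because None has no such attribute. Narrow the type first.
        if visit_date is None:
            visit_date = datetime.now(tz=timezone.utc)
        # From here on, mypy knows visit_date is a datetime.
        return {"status": "eventful", "date": visit_date.isoformat()}
```

Narrowing with an explicit None check is only one option. Whether a default date is acceptable or an error should be raised instead is a design decision this sketch does not settle.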
ardumont added a comment to D5071: Unify loader instantiation.

I wonder what would break if the new methods were just put in BaseLoader and the PackageLoader was made to inherit BaseLoader

Feb 16 2021, 6:33 PM · Core Loader
swh-public-ci added a comment to D5071: Unify loader instantiation.

Build is green

Feb 16 2021, 6:23 PM · Core Loader
ardumont updated the diff for D5071: Unify loader instantiation.

Adapt according to review:

  • Drop swh.loader.pattern and move class Loader in swh.loader.core.loader module
  • Drop unneeded self.create_authorities, self.create_fetchers
Feb 16 2021, 6:21 PM · Core Loader
ardumont added a comment to D5071: Unify loader instantiation.

@ardumont points out that the base PackageLoader doesn't inherit from BaseLoader, which explains the new (common) base class. I think the new class could just as well be next to BaseLoader, and doesn't warrant the introduction of a pattern module.

Feb 16 2021, 6:20 PM · Core Loader
olasd added a comment to D5071: Unify loader instantiation.

@ardumont points out that the base PackageLoader doesn't inherit from BaseLoader, which explains the new (common) base class. I think the new class could just as well be next to BaseLoader, and doesn't warrant the introduction of a pattern module.

Feb 16 2021, 6:07 PM · Core Loader
ardumont added a comment to D5071: Unify loader instantiation.

Are the new pattern module / pattern.Loader class really needed? It looks like these methods could live in the BaseLoader class directly.

Feb 16 2021, 5:35 PM · Core Loader
olasd added a comment to D5071: Unify loader instantiation.

This is great, thanks!

Feb 16 2021, 5:33 PM · Core Loader
swh-public-ci added a comment to D5071: Unify loader instantiation.

Build is green

Feb 16 2021, 10:20 AM · Core Loader
ardumont updated the diff for D5071: Unify loader instantiation.

Add missing test on cli run edge case

Feb 16 2021, 10:18 AM · Core Loader
swh-public-ci added a comment to D5071: Unify loader instantiation.

Build is green

Feb 16 2021, 10:00 AM · Core Loader
ardumont updated the diff for D5071: Unify loader instantiation.

Fix unused import

Feb 16 2021, 9:58 AM · Core Loader
Harbormaster failed remote builds in B19231: Diff 18127 for D5071: Unify loader instantiation!
Feb 16 2021, 9:54 AM · Core Loader
swh-public-ci added a comment to D5071: Unify loader instantiation.

Build has FAILED

Feb 16 2021, 9:54 AM · Core Loader
ardumont updated the diff for D5071: Unify loader instantiation.
  • Add missing test on Deposit.from_configfile
  • Drop unneeded conditional in cli
Feb 16 2021, 9:53 AM · Core Loader

Feb 15 2021

ardumont updated the summary of D5071: Unify loader instantiation.
Feb 15 2021, 6:12 PM · Core Loader
vlorentz accepted D5071: Unify loader instantiation.
Feb 15 2021, 4:43 PM · Core Loader

Feb 12 2021

ardumont updated the summary of D5071: Unify loader instantiation.
Feb 12 2021, 8:17 PM · Core Loader
ardumont updated the summary of D5071: Unify loader instantiation.
Feb 12 2021, 8:12 PM · Core Loader
ardumont updated the summary of D5071: Unify loader instantiation.
Feb 12 2021, 5:51 PM · Core Loader
ardumont updated the summary of D5071: Unify loader instantiation.
Feb 12 2021, 5:08 PM · Core Loader

Feb 9 2021

vsellier added a revision to T3030: Improve loaders to deal with new visit status events: D5052: loader: mark visit as 'not_found' when relevant.
Feb 9 2021, 5:09 PM · Core Loader
vsellier added a revision to T3030: Improve loaders to deal with new visit status events: D5049: loader: Mark visit status as failed when relevant.
Feb 9 2021, 3:57 PM · Core Loader
ardumont added a revision to T3030: Improve loaders to deal with new visit status events: D5046: loader.svn: Mark visit status as not_found when relevant.
Feb 9 2021, 11:58 AM · Core Loader

Feb 8 2021

ardumont added a revision to T3030: Improve loaders to deal with new visit status events: D5042: loader.svn: Explicit the failure test cases.
Feb 8 2021, 7:07 PM · Core Loader
ardumont added a revision to T3030: Improve loaders to deal with new visit status events: D5041: loader.git: Mark visit status as not_found when relevant.
Feb 8 2021, 5:51 PM · Core Loader
ardumont added a revision to T3030: Improve loaders to deal with new visit status events: D5040: loader.git: Explicit the failure test cases.
Feb 8 2021, 5:50 PM · Core Loader
ardumont updated the task description for T3030: Improve loaders to deal with new visit status events.
Feb 8 2021, 5:06 PM · Core Loader
ardumont added a revision to T3030: Improve loaders to deal with new visit status events: D5039: core.loader: Allow vcs loaders to deal with not_found status.
Feb 8 2021, 5:02 PM · Core Loader

Feb 5 2021

ardumont added a revision to T3030: Improve loaders to deal with new visit status events: D5035: package: Mark visit as not_found when relevant.
Feb 5 2021, 5:49 PM · Core Loader
ardumont renamed T3030: Improve loaders to deal with new visit status events from Improve loader to deal with new visit status events to Improve loaders to deal with new visit status events.
Feb 5 2021, 3:40 PM · Core Loader
zack added a project to T3030: Improve loaders to deal with new visit status events: Core Loader.
Feb 5 2021, 3:40 PM · Core Loader

Nov 20 2020

ardumont closed T2537: Extend new deposit endpoint to support metadata-only deposits, a subtask of T1021: SWORD deposit of metadata about an existing SWH object, as Resolved.
Nov 20 2020, 10:20 AM · Core Loader, SWORD deposit

Nov 18 2020

ardumont added a subtask for T1021: SWORD deposit of metadata about an existing SWH object: T2537: Extend new deposit endpoint to support metadata-only deposits.
Nov 18 2020, 11:33 AM · Core Loader, SWORD deposit

Nov 3 2020

ardumont moved T871: Migrate swh-storage api functions relative to indexers to swh-indexer from Backlog to Archived on the SWORD deposit board.
Nov 3 2020, 4:07 PM · SWORD deposit, Core Loader, Web app, Development environment, Storage manager, Indexer
ardumont moved T872: Deploy and restart indexers from Backlog to Archived on the SWORD deposit board.
Nov 3 2020, 4:07 PM · SWORD deposit, Core Loader, Storage manager, Web app, Puppet recipes, Indexer

Oct 26 2020

douardda added projects to T2585: Properly handle deleted/no longer available origins: Core Loader, Lister.
Oct 26 2020, 2:41 PM · Lister, Core Loader

Sep 25 2020

ardumont closed D4012: core.loader: Log information about origin currently being ingested.
Sep 25 2020, 9:04 AM · Core Loader
ardumont added a comment to D4012: core.loader: Log information about origin currently being ingested.

Our logging handler swh.core.logger.JournalHandler already knows how to pull some metadata from the celery tasks:
...

Sep 25 2020, 9:02 AM · Core Loader
olasd accepted D4012: core.loader: Log information about origin currently being ingested.
Sep 25 2020, 9:00 AM · Core Loader
swh-public-ci added a comment to D4012: core.loader: Log information about origin currently being ingested.

Build is green

Sep 25 2020, 8:58 AM · Core Loader
ardumont added inline comments to D4012: core.loader: Log information about origin currently being ingested.
Sep 25 2020, 8:56 AM · Core Loader
ardumont updated the diff for D4012: core.loader: Log information about origin currently being ingested.

Adapt according to suggestion

Sep 25 2020, 8:56 AM · Core Loader

Sep 24 2020

olasd added a comment to D4012: core.loader: Log information about origin currently being ingested.

To be clear, my main issue today is that when I try to look through our logs to
investigate or simply read what's going on (after a deployment, for example), I
don't have any immediate clues...

In my mind, the kibana information is not enough by itself, so I think I need
to cross-reference it with, say, sentry to get some more context... It's
currently quite frustrating... up to the eventual point of, "oh well, I have
some other urgent matters somewhere else..." (sometimes I push through but
sometimes, I fail).

Sep 24 2020, 4:59 PM · Core Loader
swh-public-ci added a comment to D4012: core.loader: Log information about origin currently being ingested.

Build is green

Sep 24 2020, 4:06 PM · Core Loader
ardumont updated the summary of D4012: core.loader: Log information about origin currently being ingested.
Sep 24 2020, 4:05 PM · Core Loader
ardumont updated the diff for D4012: core.loader: Log information about origin currently being ingested.

Simplify to just one log statement

Sep 24 2020, 4:05 PM · Core Loader
ardumont added inline comments to D4012: core.loader: Log information about origin currently being ingested.
Sep 24 2020, 3:58 PM · Core Loader
ardumont added a comment to D4012: core.loader: Log information about origin currently being ingested.

OK, I'll adapt.
I would not be against a nudge in the right direction to actually improve the logging.

Sep 24 2020, 3:56 PM · Core Loader
douardda added a comment to D4012: core.loader: Log information about origin currently being ingested.
In D4012#99525, @olasd wrote:

I don't think the origin url and visit type should be sent in the task result; they're arguments of the task already.

If we want them logged by the worker when the task ends (which I agree would be useful), then we should improve logging on the worker/celery side to show some of the task arguments (for instance, a "url" argument if there is one) instead of, or in addition to, the task id.

Sep 24 2020, 3:28 PM · Core Loader
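The suggestion above, showing selected task arguments in worker logs alongside the task id, can be sketched with the standard logging module alone. The argument names in LOGGED_ARGUMENTS, the helper task_log_context and the adapter class are hypothetical; this does not use or describe the actual swh or celery logging code.

```python
import logging
from typing import Any, Dict, Mapping, MutableMapping, Tuple

# Hypothetical selection of task arguments worth showing in the logs.
LOGGED_ARGUMENTS = ("url", "visit_type")


def task_log_context(task_id: str, kwargs: Mapping[str, Any]) -> Dict[str, Any]:
    """Build the log context: the task id plus any selected arguments present."""
    context: Dict[str, Any] = {"task_id": task_id}
    for name in LOGGED_ARGUMENTS:
        if name in kwargs:
            context[name] = kwargs[name]
    return context


class TaskContextAdapter(logging.LoggerAdapter):
    """Prefix every message with the task context passed in as `extra`."""

    def process(
        self, msg: Any, kwargs: MutableMapping[str, Any]
    ) -> Tuple[Any, MutableMapping[str, Any]]:
        details = " ".join(f"{key}={value}" for key, value in self.extra.items())
        return f"[{details}] {msg}", kwargs


if __name__ == "__main__":
    logging.basicConfig(level=logging.INFO)
    context = task_log_context(
        "1234", {"url": "https://example.org/repo", "visit_type": "git"}
    )
    log = TaskContextAdapter(logging.getLogger("worker"), context)
    log.info("Task finished")
```

Keeping the context on the logging side means the task result does not have to repeat its own arguments, which matches the objection raised in the review.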
douardda added inline comments to D4012: core.loader: Log information about origin currently being ingested.
Sep 24 2020, 3:26 PM · Core Loader
ardumont added a comment to D4012: core.loader: Log information about origin currently being ingested.

I don't think the origin url and visit type should be sent in the task
result; they're arguments of the task already.

Sep 24 2020, 10:33 AM · Core Loader
ardumont added inline comments to D4012: core.loader: Log information about origin currently being ingested.
Sep 24 2020, 10:22 AM · Core Loader
olasd added a comment to D4012: core.loader: Log information about origin currently being ingested.

I don't think the origin url and visit type should be sent in the task result; they're arguments of the task already.

Sep 24 2020, 9:14 AM · Core Loader

Sep 23 2020

zack triaged T2634: swh-core: missing test dependency on requests as Low priority.
Sep 23 2020, 2:23 PM · Easy hack, Core & foundations

Sep 22 2020

olasd added a comment to T1461: Add loader-related metrics to swh-loader-core.

I think the second point mostly happened: the storage is returning statistics to the loader, but the loaders don't generally collect them.

Sep 22 2020, 6:13 PM · Core Loader, Metrics/monitoring
olasd updated the task description for T1461: Add loader-related metrics to swh-loader-core.
Sep 22 2020, 6:11 PM · Core Loader, Metrics/monitoring
olasd updated the task description for T1461: Add loader-related metrics to swh-loader-core.
Sep 22 2020, 6:10 PM · Core Loader, Metrics/monitoring
olasd placed T1461: Add loader-related metrics to swh-loader-core up for grabs.
Sep 22 2020, 6:10 PM · Core Loader, Metrics/monitoring
ardumont updated the summary of D4012: core.loader: Log information about origin currently being ingested.
Sep 22 2020, 6:07 PM · Core Loader

Aug 31 2020

vlorentz added a comment to T2471: NPM package angular-ts-manage fails to be properly loaded.

I'm afraid the only way to properly solve this is to wait until we stop writing metadata to the revision table

Aug 31 2020, 4:38 PM · Core Loader

Aug 19 2020

vlorentz triaged T2523: Archive opensource.samsung.com as Normal priority.
Aug 19 2020, 7:40 PM · Lister, Archive coverage

Jul 27 2020

ardumont closed T2481: Migrate dvcs loader tests code to pytest as Resolved.
Jul 27 2020, 3:21 PM · SVN Loader, Mercurial loader, Git loader, Core Loader
ardumont added a parent task for T2481: Migrate dvcs loader tests code to pytest: T2221: Development workflow & code quality.
Jul 27 2020, 3:20 PM · SVN Loader, Mercurial loader, Git loader, Core Loader

Jul 20 2020

ardumont closed T2483: tests: Make check-snapshot utility test function recursively check targetted object exists, a subtask of T2481: Migrate dvcs loader tests code to pytest, as Resolved.
Jul 20 2020, 9:17 AM · SVN Loader, Mercurial loader, Git loader, Core Loader
ardumont closed T2483: tests: Make check-snapshot utility test function recursively check targetted object exists as Resolved.
Jul 20 2020, 9:17 AM · SVN Loader, Mercurial loader, Git loader, Core Loader
ardumont closed T2484: Move sharable fixtures out of conftest into a dedicated pytest plugin, a subtask of T2481: Migrate dvcs loader tests code to pytest, as Resolved.
Jul 20 2020, 9:16 AM · SVN Loader, Mercurial loader, Git loader, Core Loader
ardumont closed T2484: Move sharable fixtures out of conftest into a dedicated pytest plugin as Resolved.
Jul 20 2020, 9:16 AM · SVN Loader, Mercurial loader, Git loader, Core Loader

Jul 17 2020

ardumont added a revision to T2484: Move sharable fixtures out of conftest into a dedicated pytest plugin: D3551: tests: Reuse pytest fixtures from swh.loader.core.
Jul 17 2020, 12:12 PM · SVN Loader, Mercurial loader, Git loader, Core Loader
ardumont added a revision to T2484: Move sharable fixtures out of conftest into a dedicated pytest plugin: D3550: tests: Reuse pytest fixtures from swh.loader.core.
Jul 17 2020, 12:04 PM · SVN Loader, Mercurial loader, Git loader, Core Loader
ardumont added a revision to T2484: Move sharable fixtures out of conftest into a dedicated pytest plugin: D3549: tests: Reuse pytest fixtures from swh.loader.core.
Jul 17 2020, 12:04 PM · SVN Loader, Mercurial loader, Git loader, Core Loader

Jul 16 2020

ardumont closed T2488: Drop loader.core BaseLoaderTest and BaseLoaderStorageTest, a subtask of T2481: Migrate dvcs loader tests code to pytest, as Resolved.
Jul 16 2020, 3:18 PM · SVN Loader, Mercurial loader, Git loader, Core Loader
ardumont closed T2488: Drop loader.core BaseLoaderTest and BaseLoaderStorageTest as Resolved.
Jul 16 2020, 3:18 PM · SVN Loader, Mercurial loader, Git loader, Core Loader
ardumont added a revision to T2488: Drop loader.core BaseLoaderTest and BaseLoaderStorageTest: D3522: tests: Drop no longer used loader base tests code.
Jul 16 2020, 11:45 AM · SVN Loader, Mercurial loader, Git loader, Core Loader
ardumont added a revision to T2484: Move sharable fixtures out of conftest into a dedicated pytest plugin: D3515: pytest_plugin: Avoid fixture client to declare optional dependency.
Jul 16 2020, 10:30 AM · SVN Loader, Mercurial loader, Git loader, Core Loader

Jul 10 2020

ardumont added revisions to T2483: tests: Make check-snapshot utility test function recursively check targetted object exists: D3502: check_snapshot: Check existence down to the first level directories, D3503: check_snapshot: Check existence down to contents.
Jul 10 2020, 7:01 PM · SVN Loader, Mercurial loader, Git loader, Core Loader
ardumont removed revisions from T2484: Move sharable fixtures out of conftest into a dedicated pytest plugin: D3503: check_snapshot: Check existence down to contents, D3502: check_snapshot: Check existence down to the first level directories.
Jul 10 2020, 7:01 PM · SVN Loader, Mercurial loader, Git loader, Core Loader
ardumont added a revision to T2484: Move sharable fixtures out of conftest into a dedicated pytest plugin: D3503: check_snapshot: Check existence down to contents.
Jul 10 2020, 6:14 PM · SVN Loader, Mercurial loader, Git loader, Core Loader
ardumont added a revision to T2484: Move sharable fixtures out of conftest into a dedicated pytest plugin: D3502: check_snapshot: Check existence down to the first level directories.
Jul 10 2020, 5:40 PM · SVN Loader, Mercurial loader, Git loader, Core Loader
ardumont reopened T2484: Move sharable fixtures out of conftest into a dedicated pytest plugin, a subtask of T2481: Migrate dvcs loader tests code to pytest, as Open.
Jul 10 2020, 3:44 PM · SVN Loader, Mercurial loader, Git loader, Core Loader
ardumont reopened T2484: Move sharable fixtures out of conftest into a dedicated pytest plugin as "Open".

Reopening as I'm still refactoring/cleaning up more modules.

Jul 10 2020, 3:44 PM · SVN Loader, Mercurial loader, Git loader, Core Loader
ardumont added a revision to T2484: Move sharable fixtures out of conftest into a dedicated pytest plugin: D3495: Expose a pytest_plugin module so other loaders can reuse for tests.
Jul 10 2020, 3:42 PM · SVN Loader, Mercurial loader, Git loader, Core Loader