Details

E + where False = <built-in method startswith of str object at 0x7f3c706974e0>('Usage: list [OPTIONS] archive|cran|debian|deposit|npm|pypi\n\n List supported loaders and optionally their arguments\n\nOptions:\n -h, --help Show this message and exit.\n')
E + where <built-in method startswith of str object at 0x7f3c706974e0> = 'Usage: list [OPTIONS] archive|cran|debian|deposit|npm|pypi|functional\n\n List

My guess is to you need to fix the help message in the cli...
That happens because in the setup.py, we are referencing endpoints and that adapt the cli to allow you to run the loader through the cli:
swh loader run functional ...

Also, i prefer reviewing "green" diffs ;)

Nonethesless, if you stack diff (as you did), you also need those to be green to avoid yourself some confusion ;)

Cheers,

functional: order entry points
cli: add the functional loader in the cli tests

Build has FAILED

Link to build: https://jenkins.softwareheritage.org/job/DLDBASE/job/tox/347/
See console output for more information: https://jenkins.softwareheritage.org/job/DLDBASE/job/tox/347/console

Harbormaster failed remote builds in B11063: Diff 9991!Mar 11 2020, 11:30 AM

cli: add the functional loader in the cli tests

Build has FAILED

Link to build: https://jenkins.softwareheritage.org/job/DLDBASE/job/tox/348/
See console output for more information: https://jenkins.softwareheritage.org/job/DLDBASE/job/tox/348/console

Harbormaster failed remote builds in B11064: Diff 9992!Mar 11 2020, 11:36 AM

cli: add the functional loader in the cli tests

Build is green
See https://jenkins.softwareheritage.org/job/DLDBASE/job/tox/349/ for more details.

Harbormaster completed remote builds in B11065: Diff 9993.Mar 11 2020, 11:58 AM

lewo edited the summary of this revision. (Show Details)Mar 11 2020, 12:19 PM

package.loader: add origin argument
Add the functional loader
cli: add the functional loader in the cli tests

@ardumont I squashed commits and do some cleaning and remove the WIP status.

swh/loader/package/functional/loader.py
32	It looks strange to me to put a TODO in the the docstring :/

lewo retitled this revision from wip: Functional Package Loader to Functional Package Loader.Mar 11 2020, 3:00 PM

Build is green
See https://jenkins.softwareheritage.org/job/DLDBASE/job/tox/350/ for more details.

Harbormaster completed remote builds in B11070: Diff 9998.Mar 11 2020, 3:01 PM

functional: add test for non json sources files

Build is green
See https://jenkins.softwareheritage.org/job/DLDBASE/job/tox/355/ for more details.

Harbormaster completed remote builds in B11097: Diff 10025.Mar 12 2020, 3:07 PM

I actually don't know how should I manage errors. It seems exception are generally just emitted, without a dedicated log message.
How do you catch errors on production?

For instance, when a source is not a archive, an exception is raised but there is not a specific error message. I'm wondering how we could get metrics on these errors once on the production line.

In D2792#67565, @lewo wrote:

I actually don't know how should I manage errors. It seems exception are generally just emitted, without a dedicated log message.

If you raise a ValueError with a significant error message, it should be fine.
Error will be caught within the package loader's run method and the task will have its status failed.

How do you catch errors on production?

sentry does now [1]
(going on the link, you can ask for an invite with the form ;)

For instance, when a source is not a archive, an exception is raised but there is not a specific error message. I'm wondering how we could get metrics on these errors once on the production line.

I recall the webapp captures specifically exception and send them to sentry so we may want to do that as well [2]

[1] https://sentry.softwareheritage.org

[2] https://forge.softwareheritage.org/source/swh-web/browse/master/swh/web/browse/utils.py$138

Almost ready to land.

As a general rule of thumb, you need to add test scenarios around the functionality added.
So here, i'm missing:

loading scenario which test the incremental nature of the visit (that'd test resolve_revision_from)
edge cases:
- e.g you added the case where there is a failure when uncompressing, that means something for the task and the snapshot, this needs to be tested :)

Oh and you should test the resolution of the origin_visit to check its status within the test.

Cheers,

swh/loader/package/functional/loader.py
32	Well, given that we sayed we'd incrementally merge the incomplete functionality. It's not a shocker those happened. It'd be annoying the fixme stayed forever though ;)
39	I'd make this a function with the url as parameter. This way, this can be tested independently of the loader instantiation.
84	Even though the main functionality is incomplete, this feels pretty empty. @douardda what do you think, couldn't be add more stuff in there?

ardumont requested changes to this revision.Mar 13 2020, 10:47 AM

This revision now requires changes to proceed.Mar 13 2020, 10:47 AM

functional: improve test_loader_two_visits
functional: add test_loader_incremental
functional: move retrieve_sources out of the FunctionalLoader class
functional: test the origin_visit
functional: add test_uncompress_failure

Build is green
See https://jenkins.softwareheritage.org/job/DLDBASE/job/tox/356/ for more details.

Harbormaster completed remote builds in B11127: Diff 10056.Mar 13 2020, 5:15 PM

So here, i'm missing:

loading scenario which test the incremental nature of the visit (that'd test resolve_revision_from)

Done.

edge cases:

e.g you added the case where there is a failure when uncompressing, that means something for the task and the snapshot, this needs to be tested :)

Done in the test_functional.py file

Oh and you should test the resolution of the origin_visit to check its status within the test.

Done as part of the test test_loader_one_visit.

ardumont added inline comments.Mar 13 2020, 6:12 PM

swh/loader/package/functional/tests/test_functional.py
92	as we are instantiating another loader anyway, it's not useful. Plus it's at least pypi specific :)
120	+1 (for that test scenario ;)
162	This should be removed as per my previous comment.
swh/loader/package/loader.py
47	Please move the type in the constructor.

package.loader: add origin argument
Add the functional loader
cli: add the functional loader in the cli tests
functional: add test for non json sources files
functional: improve test_loader_two_visits
functional: add test_loader_incremental
functional: move retrieve_sources out of the FunctionalLoader class
functional: test the origin_visit
functional: add test_uncompress_failure
functional: minor fixes in tests

swh/loader/package/functional/tests/test_functional.py
92	Removed.
162	Done

Build is green
See https://jenkins.softwareheritage.org/job/DLDBASE/job/tox/357/ for more details.

Harbormaster completed remote builds in B11128: Diff 10057.Mar 13 2020, 11:59 PM

ardumont accepted this revision.Mar 14 2020, 10:00 AM

This revision is now accepted and ready to land.Mar 14 2020, 10:00 AM

package.loader: ignore non tarball source
package.loader: add origin argument
Add the functional loader
cli: add the functional loader in the cli tests
functional: add test for non json sources files
functional: improve test_loader_two_visits
functional: add test_loader_incremental
functional: move retrieve_sources out of the FunctionalLoader class
functional: test the origin_visit
functional: add test_uncompress_failure
functional: minor fixes in tests

Build is green
See https://jenkins.softwareheritage.org/job/DLDBASE/job/tox/361/ for more details.

Harbormaster completed remote builds in B11158: Diff 10089.Mar 16 2020, 5:14 PM

I think I need to implement a test for the task also. WDYT?

I think I need to implement a test for the task also. WDYT?

right

as said on irc, you need to register the task in the conftest fixture [1]

[1] https://forge.softwareheritage.org/source/swh-loader-core/browse/master/conftest.py$75

Also, if you could rewrite a bit into logical commits whose build is fine, that'd be great.

Add the functional loader

Build is green
See https://jenkins.softwareheritage.org/job/DLDBASE/job/tox/363/ for more details.

Harbormaster completed remote builds in B11160: Diff 10091.Mar 16 2020, 6:42 PM

@ardumont I squashed almost all commits and added a task test.

@ardumont I squashed almost all commits and added a task test.

Thanks.

I'm not so sure about refactoring the base package loader to have two url and origin arguments that are the same most of the time but not all the time; In concrete terms, this means that we should refactor all loaders to use self.origin in the places where they currently use self.url.

For instance, we have the same issue with the pypi loader, where we generate an api url (which we're downloading) from the url of the origin (which is just a user-facing link).

All in all, I think we need to think about doing a refactoring to do this consistently, but I don't think this diff is the right place to do this refactoring; For now, we can just have a hardcoded map from origin url to index url.

swh/loader/package/functional/tests/test_functional.py
121–122	Comments should probably be more explicit about what changes between the two visits. I guess that the first visit only manages to fetch one tarball, and the second one manages to fetch both?

I'm not so sure about refactoring the base package loader to have two url and origin arguments that are the same most of the time but not all the time; In concrete terms, this means that we should refactor all loaders to use self.origin in the places where they currently use self.url.

For instance, we have the same issue with the pypi loader, where we generate an api url (which we're downloading) from the url of the origin (which is just a user-facing link).

All in all, I think we need to think about doing a refactoring to do this consistently, but I don't think this diff is the right place to do this refactoring; For now, we can just have a hardcoded map from origin url to index url.

For now, I just removed the origin parameter in order to move forward. I then use the url of the sources.json as origin "name".
But I'll create a task to discuss about this origin name.

Add the functional loader

@olasd Thanks for your comments. I addressed all of them.

Build is green
See https://jenkins.softwareheritage.org/job/DLDBASE/job/tox/365/ for more details.

Harbormaster completed remote builds in B11172: Diff 10104.Mar 17 2020, 2:56 PM

ardumont accepted this revision.Mar 18 2020, 10:23 AM

ardumont mentioned this in D2843: Deploy functional loader in staging area.

Closed by commit rDLDBASE09373d23b2fe: package.loader: ignore non tarball source (authored by lewo). · Explain WhyMar 18 2020, 11:11 AM

This revision was automatically updated to reflect the committed changes.

lewo added a commit: rDLDBASE09373d23b2fe: package.loader: ignore non tarball source.

lewo added a commit: rDLDBASE03d7dbf69279: Add the functional loader.

Functional Package Loader
ClosedPublic
Actions

Details

Diff Detail

Event Timeline

Revision Contents
Changeset List

Diff 10056

CONTRIBUTORS

setup.py

swh/loader/package/functional/init.py

swh/loader/package/functional/loader.py

swh/loader/package/functional/tasks.py

swh/loader/package/functional/tests/data/https_example.com/file.txt

swh/loader/package/functional/tests/data/https_github.com/owner-1_repository-1_revision-1.tgz

swh/loader/package/functional/tests/data/https_github.com/owner-2_repository-1_revision-1.tgz

swh/loader/package/functional/tests/data/https_nix-community.github.io/nixpkgs-swh_sources.json

swh/loader/package/functional/tests/data/https_nix-community.github.io/nixpkgs-swh_sources.json_visit1

swh/loader/package/functional/tests/test_functional.py

swh/loader/package/loader.py

swh/loader/tests/test_cli.py

Functional Package LoaderClosedPublicActions

Details

Diff Detail

Event Timeline

Revision ContentsChangeset List

Diff 10056

CONTRIBUTORS

setup.py

swh/loader/package/functional/__init__.py

swh/loader/package/functional/loader.py

swh/loader/package/functional/tasks.py

swh/loader/package/functional/tests/data/https_example.com/file.txt

swh/loader/package/functional/tests/data/https_github.com/owner-1_repository-1_revision-1.tgz

swh/loader/package/functional/tests/data/https_github.com/owner-2_repository-1_revision-1.tgz

swh/loader/package/functional/tests/data/https_nix-community.github.io/nixpkgs-swh_sources.json

swh/loader/package/functional/tests/data/https_nix-community.github.io/nixpkgs-swh_sources.json_visit1

swh/loader/package/functional/tests/test_functional.py

swh/loader/package/loader.py

swh/loader/tests/test_cli.py

Functional Package Loader
ClosedPublic
Actions

Revision Contents
Changeset List

swh/loader/package/functional/init.py