Details

swh/indexer/origin_head.py
46–49	why changing the behavior of head selection mechanism here? why cannot we know at this point which type the processed origin is? In all cases, if this try-based solution is now mandatory, it would be nice to have it encapsulated in a generic self.get_head() method IMHO.

This revision now requires changes to proceed.Jun 13 2019, 2:26 PM

vlorentz planned changes to this revision.Jun 13 2019, 2:41 PM

vlorentz added inline comments.

swh/indexer/origin_head.py
46–49	why cannot we know at this point which type the processed origin is? That requires a new API endpoint in the storage, but indeed, we could (and should).

Build is green
See https://jenkins.softwareheritage.org/job/DCIDX/job/tox/546/ for more details.

Harbormaster completed remote builds in B6126: Diff 5165.Jun 13 2019, 2:56 PM

vlorentz added inline comments.Jun 14 2019, 10:34 AM

swh/indexer/origin_head.py
46–49	D1581

Drop origin ids from tests as well
Use new-style snapshot_add in the tests (long overdue!)
Use origin_visit_get_latest instead of snapshot_get_latest (which is deprecated), in order to know the visit type.

add missing aliases.

Build has FAILED

Link to build: https://jenkins.softwareheritage.org/job/DCIDX/job/tox/548/
See console output for more information: https://jenkins.softwareheritage.org/job/DCIDX/job/tox/548/console

Harbormaster failed remote builds in B6220: Diff 5247!Jun 14 2019, 3:10 PM

Build has FAILED

Link to build: https://jenkins.softwareheritage.org/job/DCIDX/job/tox/549/
See console output for more information: https://jenkins.softwareheritage.org/job/DCIDX/job/tox/549/console

Harbormaster failed remote builds in B6221: Diff 5248!Jun 14 2019, 3:15 PM

vlorentz mentioned this in T1816: Make all clients of swh-storage use origin URL as identifier and use visit types instead of origin types.Jun 17 2019, 4:20 PM

also patch the storage.

Harbormaster failed remote builds in B6303: Diff 5320!Jun 18 2019, 5:35 PM

Build has FAILED

Link to build: https://jenkins.softwareheritage.org/job/DCIDX/job/tox/551/
See console output for more information: https://jenkins.softwareheritage.org/job/DCIDX/job/tox/551/console

bump dependency version

Harbormaster failed remote builds in B6304: Diff 5321!Jun 18 2019, 5:36 PM

Build has FAILED

Link to build: https://jenkins.softwareheritage.org/job/DCIDX/job/tox/552/
See console output for more information: https://jenkins.softwareheritage.org/job/DCIDX/job/tox/552/console

rebase

vlorentz edited the summary of this revision. (Show Details)Jun 18 2019, 5:38 PM

Build has FAILED

Link to build: https://jenkins.softwareheritage.org/job/DCIDX/job/tox/553/
See console output for more information: https://jenkins.softwareheritage.org/job/DCIDX/job/tox/553/console

Harbormaster failed remote builds in B6305: Diff 5322!Jun 18 2019, 5:38 PM

Sounds good.

I'm missing a migration script for the actual data to backfill the new origin-url from the existing rows.
Or am i missing something?

Request changes for the sake of discussion on that point.

Cheers,

swh/indexer/sql/40-swh-func.sql
426	Did not read the rest yet... how are we backfilling the existing rows in the indexer db? Is there a script for that?
swh/indexer/storage/__init__.py
771	Unify with the other docstring one way (append `: Url of the origin`) or the other (drop the redundant definition)... /me singing `You've got the power! tududu du du du tududu du du du` ;) (~> lookup `snap music` if you don't grok that ;)

This revision now requires changes to proceed.Jun 19 2019, 10:32 AM

unify docstrings

Build has FAILED

Link to build: https://jenkins.softwareheritage.org/job/DCIDX/job/tox/554/
See console output for more information: https://jenkins.softwareheritage.org/job/DCIDX/job/tox/554/console

Harbormaster failed remote builds in B6314: Diff 5331!Jun 19 2019, 10:50 AM

vlorentz added inline comments.Jun 19 2019, 11:03 AM

swh/indexer/sql/40-swh-func.sql
426	deploy indexers with that patch run a full pass on all origins (which I planned on doing anyway, since we had to drop the task queue) delete rows without an origin_url if any (these would be origins that had metadata but no longer do)

vlorentz marked an inline comment as done.Jun 19 2019, 11:04 AM

Build has FAILED

Link to build: https://jenkins.softwareheritage.org/job/DCIDX/job/tox/555/
See console output for more information: https://jenkins.softwareheritage.org/job/DCIDX/job/tox/555/console

Don't know why the build now.

swh/indexer/sql/40-swh-func.sql
426	Sounds fine to me, thanks.

Because Jenkins timeouted while compiling the package so it did not send it to PyPI the first time. I'm triggering a rebuild

Build is green
See https://jenkins.softwareheritage.org/job/DCIDX/job/tox/556/ for more details.

Harbormaster completed remote builds in B6314: Diff 5331.Jun 19 2019, 11:18 AM

ardumont accepted this revision.Jun 19 2019, 11:43 AM

douardda accepted this revision.Jun 24 2019, 3:23 PM

This revision is now accepted and ready to land.Jun 24 2019, 3:23 PM

rebase

This revision was landed with ongoing or failed builds.Jun 24 2019, 4:36 PM

Closed by commit rDCIDX8887a7d51d83: Manipulate origin URLs instead of origin ids. (authored by vlorentz). · Explain Why

This revision was automatically updated to reflect the committed changes.

Harbormaster failed remote builds in B6449: Diff 5462!Jun 24 2019, 4:36 PM

Build has FAILED

Link to build: https://jenkins.softwareheritage.org/job/DCIDX/job/tox/557/
See console output for more information: https://jenkins.softwareheritage.org/job/DCIDX/job/tox/557/console

Manipulate origin URLs instead of origin ids.
ClosedPublic
Actions

Details

Diff Detail

Event Timeline

Revision Contents
Changeset List

Diff 5463

requirements-swh.txt

sql/upgrades/125.sql

swh/indexer/indexer.py

swh/indexer/metadata.py

swh/indexer/origin_head.py

swh/indexer/sql/30-swh-schema.sql

swh/indexer/sql/40-swh-func.sql

swh/indexer/storage/init.py

swh/indexer/storage/db.py

swh/indexer/storage/in_memory.py

swh/indexer/tests/storage/test_storage.py

swh/indexer/tests/test_cli.py

swh/indexer/tests/test_origin_head.py

swh/indexer/tests/test_origin_metadata.py

swh/indexer/tests/utils.py

Manipulate origin URLs instead of origin ids.ClosedPublicActions

Details

Diff Detail

Event Timeline

Revision ContentsChangeset List

Diff 5463

requirements-swh.txt

sql/upgrades/125.sql

swh/indexer/indexer.py

swh/indexer/metadata.py

swh/indexer/origin_head.py

swh/indexer/sql/30-swh-schema.sql

swh/indexer/sql/40-swh-func.sql

swh/indexer/storage/__init__.py

swh/indexer/storage/db.py

swh/indexer/storage/in_memory.py

swh/indexer/tests/storage/test_storage.py

swh/indexer/tests/test_cli.py

swh/indexer/tests/test_origin_head.py

swh/indexer/tests/test_origin_metadata.py

swh/indexer/tests/utils.py

Manipulate origin URLs instead of origin ids.
ClosedPublic
Actions

Revision Contents
Changeset List

swh/indexer/storage/init.py