Page Menu
Home
Software Heritage
Search
Configure Global Search
Log In
Files
F9342204
D8789.id31682.diff
No One
Temporary
Actions
View File
Edit File
Delete File
View Transforms
Subscribe
Mute Notifications
Award Token
Flag For Later
Size
1 KB
Subscribers
None
D8789.id31682.diff
View Options
diff --git a/swh/lister/pubdev/__init__.py b/swh/lister/pubdev/__init__.py
--- a/swh/lister/pubdev/__init__.py
+++ b/swh/lister/pubdev/__init__.py
@@ -17,21 +17,21 @@
Origins retrieving strategy
---------------------------
-To get a list of all package names we call `https://pub.dev/api/packages` endpoint.
+To get a list of all package names we call `https://pub.dev/api/package-names` endpoint.
There is no other way for discovery (no archive index, no database dump, no dvcs repository).
-Page listing
-------------
-
-There is only one page that list all origins url based
-on `https://pub.dev/api/packages/{pkgname}`.
-The origin url corresponds to the http api endpoint that returns complete information
-about the package versions (name, version, author, description, release date).
-
Origins from page
-----------------
-The lister yields all origins url from one page.
+The lister yields all origin urls from a single page.
+
+Getting last update date for each package
+-----------------------------------------
+
+Before sending a listed pubdev origin to the scheduler, we query the
+`https://pub.dev/api/packages/{pkgname}` endpoint to get the last update date
+for a package (date of its latest release). It enables Software Heritage to create
+new loading task for a package only if it has new releases since last visit.
Running tests
-------------
File Metadata
Details
Attached
Mime Type
text/plain
Expires
Thu, Jul 3, 12:33 PM (2 w, 4 d ago)
Storage Engine
blob
Storage Format
Raw Data
Storage Handle
3219992
Attached To
D8789: pubdev: Update outdated lister documentation
Event Timeline
Log In to Comment