[ it is not yet clear whether the best way to remain up-to-date wrt PyPI is to have a local mirror, or rather follow a list of changes there, but in the meantime here are some information about how to mirror PyPI, which I've tracked down for unrelated reasons ]
PyPI is easy to mirror, and a network of public mirrors already exist. The Python package bandersnatch automate the task of setting up an initial mirror and keeping it up to date.
At the time of writing, time/space figures about mirroring PyPI:
- a full mirror took about 1 day to retrieve
- a subsequent update (to catchup with new packages arrived over the mirroring day) took about 10 minutes
- on disk space (for compressed packages in various formats) is about 290 GB