Page MenuHomeSoftware Heritage

Add robots.txt to archive.softwareheritage.org to avoid crawlers
Closed, MigratedEdits Locked

Description

We don't want robots crawling archive.softwareheritage.org using API endpoints.

We need to add a robots.txt to that effect.

Event Timeline

zack moved this task from Restricted Project Column to Restricted Project Column on the Restricted Project board.Feb 1 2017, 12:52 PM
zack moved this task from Restricted Project Column to Restricted Project Column on the Restricted Project board.

I've written the trivial robots.txt, but as is it will be deployed as /static/robots.txt.
We need HTTP trickery to that end.