26 May 2022

Morsel #4 - bulk upload to Internet Archive with waybackpy and advertools

My fourth morsel is a way to back up your site to the Internet Archive.


GitHub link

Credit to Koray Tuğberk GÜBÜR for the code and the idea.

I only made a slight variation: I use advertools to pull the pages from a sitemap, and I extract the URLs to iterate over with to_list() rather than apply() and a lambda function. Purely a preference thing.
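For anyone who wants the gist without opening the repo, here is a minimal sketch of the approach, not the exact code from the repo. It assumes advertools and waybackpy 3.x are installed, that the sitemap DataFrame has the usual "loc" column, and that a short pause between saves is enough to stay polite to the Internet Archive. The function names, the user agent string and the example sitemap URL are all hypothetical.

```python
import time
from typing import Callable, Dict, Iterable, List

# Hypothetical user agent; swap in something that identifies you.
USER_AGENT = "Mozilla/5.0 (compatible; sitemap-archiver-sketch)"


def load_sitemap_urls(sitemap_url: str) -> List[str]:
    """Pull every page URL out of a sitemap using advertools."""
    import advertools as adv  # imported lazily so the rest runs without it

    sitemap_df = adv.sitemap_to_df(sitemap_url)
    # to_list() on the "loc" column instead of apply() with a lambda
    return sitemap_df["loc"].dropna().to_list()


def save_to_wayback(url: str) -> str:
    """Ask the Wayback Machine to save one URL; returns the archive URL."""
    from waybackpy import WaybackMachineSaveAPI  # assumes waybackpy 3.x

    return WaybackMachineSaveAPI(url, USER_AGENT).save()


def archive_urls(
    urls: Iterable[str],
    save: Callable[[str], str] = save_to_wayback,
    delay_seconds: float = 5.0,
) -> Dict[str, str]:
    """Save each URL in turn and record the archive URL or the error."""
    results: Dict[str, str] = {}
    for url in urls:
        try:
            results[url] = save(url)
        except Exception as exc:  # keep going if a single page fails
            results[url] = f"failed: {exc}"
        time.sleep(delay_seconds)  # pause between requests
    return results


# Example usage (hypothetical sitemap URL, needs network access):
# urls = load_sitemap_urls("https://example.com/sitemap.xml")
# results = archive_urls(urls)
```

Passing the save function in as an argument is just a convenience in this sketch: it makes it easy to dry-run the loop with a stand-in before sending real requests.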

Also, a shout-out to Elias Dabbas for the advertools library, which has made sitemap handling in Python so much easier.