who is saving my website to the way back machine internet archive?

159 viewsOtherTechnology

I don’t really understand very well how it works. Is people actually saving my website or is it automatic? And if so why is mine being saved but not my friend’s?

In: Technology

3 Answers

Anonymous 0 Comments

> Is people actually saving my website or is it automatic?

Bots do so. It’d be quite an undertaking otherwise.

> And if so why is mine being saved but not my friend’s?

There are a couple of possible reasons. The Occam’s Razor calls for your website being just more popular than your friends’, but another possible plausible reason is that you didn’t make a robots.txt file – which is basically a file telling bots what’s cool to do on your website and what’s not. It *doesn’t* have to be respected, but reputable services will generally obey by it.

Anonymous 0 Comments

Thank you for the explanation! I still consider it witchcraft though.

Anonymous 0 Comments

Both. There are bots that crawl popular pages and save them regularly. For other pages, you can manually save it. If you try pulling up an un-archived page on the Wayback Machine (like this post, as of January 24, 2024 00:39 Neopia Standard Time) it’ll tell you

“Hrm. Wayback Machine has not archived that URL.”

followed by

“This page is available on the web! Help make the Wayback Machine more complete!”

and a button you can click that says “Save this URL in the Wayback Machine”.

Any rando out there can plug in a URL and click that button. Maybe someone saw something on your site that they wanted to make sure was recorded for posterity.

I highly recommend their FAQ page: https://help.archive.org/help/category/the-wayback-machine/

There are just a buncha giant servers with pretty blinking lights hanging out in San Francisco with all this info. There are pictures of them and some more technical bits here: https://blog.archive.org/2020/11/16/where-your-donation-goes/