This_There

First comment on Voat and it seems right to use it here. Nice work, OP.

Didot

Archive.org's history is 'curated'.

I suppose it could be seen like this to a degree, though it comes with the nature of being able to stay up for so long I'd imagine. The things affecting pages not visible on archive.org are robots.txt, webmaster requests for removal (which again don't remove the content but make it publicly inaccessible), and DMCA requests (though I've only ever seen this issued for their non-web page archives like videos, and they still keep it). Everything else is crawled automatically. If anything archive.is is a more curated form of archiving since it relies on users manually and individually archiving copies via the site, the only advantage is it ignores robots.txt.

It'd be wise to not rely on any archive websites alone, keep multiple backups of anything you value.

Absolutely. Trouble is how does a user upload and link others to this as well as proving its validity should it be questioned? Most will be just saving the pages as either browser HTML or MHT (which is better) rather than WARC which is used by archive sites.

gateaccount

Anyone else getting a 504 on archive.is? Hopefully it's just a traffic issue and not something more...

Didot

How were these archived, via a scraper script? Wondering if every submission and comment was archived.

MichaelWesten

Here is one that is almost fully complete, and will run locally/offline https://voat.co/v/pizzagate/1428191

Marou

Looks like someone threw up a quick forum after the reddit ban and did this super post to get everyone up to speed. In addition they've archived (everything) and linked it to the post.

http://archive.is/JmjuT

derram

https://archive.fo/https://www.ceddit.com/r/pizzagate/comments/ * :

[https://www.reddit.com/r/pizzagate/comments/](https://www.reddit.com/r/pizzagate/comments/)*: I'm @Eclipse_OW , the person behind the massive pedophile account database. AMA? : pizzagate

This has been an automated message.

PedePizza

Can someone eli5 archives for me? I've been archive.is what I could, where do these pages go/how can I put these to use. Most pages have been archived, thanks to all who know what they're doing.

Didot

Web archiving sites are used to make copies of an original web page, they're a bit like photocopiers and take a 'scan' of the page at a single point in time. So for example an archive of a web page scanned last year can be compared to one scanned today.

The purpose is so that if the original web page is edited, moved, or deleted people can refer to the archived copy. Unlike taking screenshots these archived copies of web pages contain the original page code, making them useful for proper citations since they're not open to user tampering like screenshots are.

The most well-known web archive site, archive.org is run by the Internet Archive which has been saving copies of web pages periodically and automatically since 1996.

There is another, separate archiving site: archive.is . Instead of automatically archiving web pages it relies on users to manually archive pages at will. The site was created by an individual who wanted the archives more censorship-proof. To explain: sites can tell archiving sites at any time ' sorry, I don't want you to archive this site or page ', and the archiving site can either respect the wish or ignore it. Archive.org respects it*, archive.is ignores it. So in that way archive.is is more censorship-proof.

To find an archived page users can go to an archive site and paste in the original web address and check for archived copies of that page. So for example if I wanted to look for an older version of the Google.com home page I could paste in the address to one of these archiving sites and check what it looked like at an earlier point in time, and also link others to it.

Hope that explained it well enough.

*From what I understand Archive.org will still archive such sites in the background just not make the pages publicly accessible.

PedePizza

Thanks boss

AnonCanuck

Nice work