trumpisPizza

Yes, I downloaded a few months ago, you need to use wget the unix command with the -c ( continue ) option for retry, as it will almost always fail.

The file is 6gb, and when you decompress it you end with 10gb, its all in .eml format, easy to use grep, or you can write a python script using 'mail' to process as you wish.

Lots of fun,

Again don't click on the browser that will fail, use

wget -c http://filepath

Run's for awhile and will fail, when it fails then run the cmd again it will pick up again, took me 3 days to download all 6gb podest 1-60,000

https://file.wikileaks.org/file/podesta-emails/podesta-emails.mbox-2016-11-06.gz

That's the link, but you must use "WGET -C" ... on the cmd line old school


Anybody wants remind me here I can share my python I wrote to process the headers, very interesting stuff.

My favorite is Alex jones said "There are 1,000's of pizza references", ... no there are not, there are 18 pizza references in 55,000 emails involving millions of words of text.

Like somebody said should have been called coffee-gate as "Lets meet for coffee" was said over 1,000 times :)


Post this OP as a sticky over at /v/pizzagateopen , as it will get deleted here, and this question does come up frequently.

It would be great to provide a tool site so researchers could be handed the python tools to do their own original work.

Godwillwin

I think you can click on the podesta email dump in wiki and download to your pc