• PDFid.py Output to CSV

    by  • December 2, 2010

    A lot of my initial data storage and collections have used pdfid.py from Didier Stevens. The tools is simple, quick and provides a lot of useful information in a single pass. After going through the code I saw that the output was being parsed out ...

    Read more →

    15K Random Dataset

    by  • December 1, 2010

    To gain an understand of what PDF files looked like from Google, I needed to gather a pretty large dataset programatically. Using a quick tool I wrote called Bighands, I was able to use the Google AJAX Search API with a random search query to down...

    Read more →

    PDF X-Ray

    by  • December 1, 2010

    Throughout this year we have seen a rise of attacks using PDFs as a delivery or exploit mechanism. One of the things I feel is lacking is a way to identify or distinguish between a malicious PDF and known good PDF. Tools like Virus Total or Wepawe...

    Read more →

    Gripes with the Security Community

    by  • December 1, 2010

    I know this is the first post (of many) and it should contain information, but it is a rant. I began working on a project that I will introduce in the next few postings and have taken notice of something within the security community. Sites that c...

    Read more →