• Posts Tagged ‘code’

    Releasing the malpdfobj Tool (beta)

    January 1, 2011 • Uncategorized

    Building on my results from yesterday, I was able to get most of the data I cared about into JSON format. Having the JSON for each grouping of data was great, but it didn't really do me any good because I could never get it into MongoDB t...

    Read more →

    PDFiD.py Output to JSON

    December 10, 2010 • Uncategorized

    I want to store as much data as possible about the malware being collected, and I realized that a database would be the best way to store it. One of the things I was playing around with in my head was taking these detailed PDFiD scans an...

    Read more →

    Bighands and Dirtyhands

    December 2, 2010 • Uncategorized

    I needed a quick way to get PDF data onto my local machine without having to search for documents individually. I decided the best way to get the data (hoping that it was mostly clean) was to use the Google AJAX Search API to randomly query for...

    Read more →

    PDFid.py Output to CSV

    December 2, 2010 • Uncategorized

    A lot of my initial data storage and collection has used pdfid.py from Didier Stevens. The tool is simple, quick, and provides a lot of useful information in a single pass. After going through the code I saw that the output was being parsed out ...

    Read more →