• Posts Tagged ‘project’

    RSS Keyword Summaries Using Bookworm

    by  • November 25, 2012 • Uncategorized

    A few weeks ago I needed a way of grabbing a bunch of different articles based on a set of basic keywords for one of my projects. Essentially what I wanted was a way to feed a whole bunch of different RSS feeds into a program, have it download the...

    Read more →

    ClassyPDF Tool Up for Grabs

    by  • July 31, 2012 • Uncategorized

    Back at the tail end of April I had posted about data mining PDF data in order to classify whether or not a document were malicious. In the post I had talked about data and an API, but never released the tool out to the public. It has been a few m...

    Read more →

    15K Random Dataset

    by  • December 1, 2010 • Uncategorized

    To gain an understand of what PDF files looked like from Google, I needed to gather a pretty large dataset programatically. Using a quick tool I wrote called Bighands, I was able to use the Google AJAX Search API with a random search query to down...

    Read more →

    PDF X-Ray

    by  • December 1, 2010 • Uncategorized

    Throughout this year we have seen a rise of attacks using PDFs as a delivery or exploit mechanism. One of the things I feel is lacking is a way to identify or distinguish between a malicious PDF and known good PDF. Tools like Virus Total or Wepawe...

    Read more →