A few days ago I posted the 15K dump of PDF statistics data that you could download and import. I figured most people just want to mess around and see the data, so I wrote a quick web front end that lets you filter the results by PDF size. I have no plans to add any more filters, but if I get requests or comments asking for them, I am all for it. The next step in the plan for this data is to collect results for both good and bad PDF files and give them the same web front end. Once those are collected, I will add the ability to compare the datasets to identify commonalities and differences.
Play around with the data and email me if you have any questions or want to see more.