Http.//pybrary.net/pyPdf/ is a pure python pdf processing library. Given that processing numeric data is almost the same in all languages, I'd suggest pyPdf.
And I'd suggest the documentation is not complete, though there are good examples. My tests have been pretty impressive. PPDF.read('A1) — A2' The last test, the most complex, is the ability for the reader to create an array of numbers, or even a list of them. Here's a couple of files. If a user sends a file with a text field (in this case the title of the book) in their program they can enter the number, and it will be added to the array. There are also some examples. (I'll be working on all of this more and more, so be patient.) PPDF is a fork from PDF, which is also from scout. In addition to the basic functions, PDF has several additional ones The data format can be selected from the dropdown. There are a couple of functions for extracting files from the file size (including one that saves the file.