A tool and library for extracting various regions of text from a PDF, particularly a scholarly article.