Search results for "dist:Text-PDF2XML inline-java"
pdf2xml - extracts text from PDF files and wraps it in XML
pdf2xml tries to combine the output of several conversion tools in order to improve the extraction of text from PDF documents. Currently, it uses pdftotext, Apache Tika and pdfxtk. In the default mode, it calls all tools to extract text and pdfxtk is...
TIEDEMANN/Text-PDF2XML-0.3.3 - 11 Feb 2019 14:54:41 UTC