public class PDFToString
extends Object
convert()
method that converts the text in a PDF file into an XML string that can be
pre-filtered and added to a Lucene database by the
XMLTextProcessor
class. Modifier and Type | Field and Description |
---|---|
(package private) static boolean |
mustConfigureLogger |
(package private) static PDFTextStripper |
stripper
PDFBox text stripper.
|
Constructor and Description |
---|
PDFToString() |
Modifier and Type | Method and Description |
---|---|
(package private) static String |
convert(InputStream PDFInputStream)
Convert a PDF file into an XML string.
|
static boolean mustConfigureLogger
static PDFTextStripper stripper
static String convert(InputStream PDFInputStream) throws IOException
PDFInputStream
- The stream of PDF data to convert to an
XML string.null
.IOException