Object
  PDFToString
public class PDFToString
This class provides a single static convert() method that converts the text in a PDF file into an XML string that can be pre-filtered and added to a Lucene database by the XMLTextProcessor class.
Internally, the text of the PDF file is extracted using the PDFBox library.
Field Summary | |
---|---|
(package private) static boolean | mustConfigureLogger |
(package private) static PDFTextStripper | stripper: PDFBox text stripper. |
Constructor Summary | |
---|---|
PDFToString() | |
Method Summary | |
---|---|
(package private) static String | convert(InputStream PDFInputStream): Convert a PDF file into an XML string. |
Methods inherited from class Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
static boolean mustConfigureLogger
static PDFTextStripper stripper
Constructor Detail |
---|
public PDFToString()
Method Detail |
---|
static String convert(InputStream PDFInputStream) throws IOException

Parameters:
PDFInputStream - The stream of PDF data to convert to an XML string.

Returns:
... null.

Throws:
IOException
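
The following is a minimal sketch of how a caller might handle the two outcomes documented above: a thrown IOException and a null return value. Because convert() is package private, it can only be called from code in the same package, so the sketch uses a hypothetical PdfConverter interface with the same signature as a stand-in. The names ConvertCallSketch, PdfConverter, and convertOrEmpty are illustrative assumptions and are not part of the documented API. Mapping null to an empty string is also only one possible policy, since the conditions under which null is returned are not stated here.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

class ConvertCallSketch {

    /** Hypothetical stand-in that mirrors the signature of PDFToString.convert(). */
    interface PdfConverter {
        String convert(InputStream pdfInputStream) throws IOException;
    }

    /**
     * Runs the converter, closes the stream afterwards, and maps a null
     * result to an empty string. An IOException is passed on to the caller.
     */
    static String convertOrEmpty(PdfConverter converter, InputStream pdfInputStream)
            throws IOException {
        try (InputStream in = pdfInputStream) {
            String xml = converter.convert(in);
            return (xml == null) ? "" : xml;
        }
    }

    public static void main(String[] args) throws IOException {
        // Placeholder bytes, not a real PDF; the fake converter never parses them.
        byte[] fakePdfBytes = "not a real PDF".getBytes(StandardCharsets.US_ASCII);

        // Fake converter for demonstration only. Code living in the same package
        // as PDFToString could pass PDFToString::convert here instead.
        PdfConverter fake = in -> "<doc>" + in.available() + " bytes</doc>";

        System.out.println(convertOrEmpty(fake, new ByteArrayInputStream(fakePdfBytes)));
    }
}
```

Passing the converter as a parameter keeps the null and exception handling in one place and lets it be exercised without a real PDF file.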