Modifier and Type | Method | Description |
---|---|---|
static POIOLE2TextExtractor | ExtractorFactory.createExtractor(POIFSFileSystem fs) | |
Modifier and Type | Method | Description |
---|---|---|
static POITextExtractor[] | ExtractorFactory.getEmbededDocsTextExtractors(POIOLE2TextExtractor ext) | Returns an array of text extractors, one for each of the embedded documents in the file (if there are any). |
Modifier and Type | Class | Description |
---|---|---|
class | VisioTextExtractor | Class to find all the text in a Visio file, and return it. |
Modifier and Type | Class | Description |
---|---|---|
class | PublisherTextExtractor | Extract text from HPBF Publisher files. |
Modifier and Type | Class | Description |
---|---|---|
class | PowerPointExtractor | This class can be used to extract text from a PowerPoint file. |
Modifier and Type | Class | Description |
---|---|---|
class | OutlookTextExtactor | A text extractor for HSMF (Outlook) .msg files. |
Modifier and Type | Class | Description |
---|---|---|
class | EventBasedExcelExtractor | A text extractor for Excel files that is based on the HSSF eventusermodel API. |
class | ExcelExtractor | A text extractor for Excel files. |
Modifier and Type | Class | Description |
---|---|---|
class | Word6Extractor | Class to extract the text from old (Word 6 / Word 95) Word documents. |
class | WordExtractor | Class to extract the text from a Word document. |
Copyright 2018 The Apache Software Foundation or its licensors, as applicable.