| Package | Description |
|---|---|
| net.sf.mmm.content.parser.api | Provides the API for parsers to extract metadata from content. |
| net.sf.mmm.content.parser.base | Contains the base-implementation of the content-parser API. |
| net.sf.mmm.content.parser.impl | Contains the implementation of the content-parser API. |
| net.sf.mmm.content.parser.impl.html | Contains ContentParser-implementations for HTML-files. |
| net.sf.mmm.content.parser.impl.opendoc | Contains ContentParser-implementations for OO-XML files. |
| net.sf.mmm.content.parser.impl.pdf | Contains ContentParser-implementations for PDF files. |
| net.sf.mmm.content.parser.impl.poi | Contains ContentParser-implementations for binary MS-Office files using POI. |
| net.sf.mmm.content.parser.impl.text | Contains ContentParser-implementations for text-files. |
| net.sf.mmm.content.parser.impl.xml | Contains ContentParser-implementations for XML files. |
| net.sf.mmm.search.indexer.base | Contains the base-implementation of the Search-Indexer API. |

| Modifier and Type | Method and Description |
|---|---|
| GenericContext | ContentParser.parse(InputStream inputStream, long filesize, ContentParserOptions options)<br>This method parses the document given as inputStream and extracts text and metadata returned as GenericContext. |
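
The call shape of `ContentParser.parse` can be illustrated with a minimal sketch. The real types live in the net.sf.mmm packages and are not reproduced here; the nested interfaces below are simplified stand-ins that mirror only the signature listed above. The `getVariable` accessor, the variable names `"text"` and `"filesize"`, and the `ToyParser` class are assumptions made for illustration, not the library's behavior.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;

public class ParseCallSketch {

  // Stand-in for ContentParserOptions; its real members are not listed in this section.
  interface ContentParserOptions { }

  // Stand-in for GenericContext with a single hypothetical accessor.
  interface GenericContext {
    Object getVariable(String name);
  }

  // Mirrors the listed signature: parse(InputStream, long, ContentParserOptions).
  interface ContentParser {
    GenericContext parse(InputStream inputStream, long filesize, ContentParserOptions options);
  }

  // Hypothetical toy parser: stores the decoded stream as "text" and the given size as "filesize".
  static class ToyParser implements ContentParser {
    @Override
    public GenericContext parse(InputStream inputStream, long filesize, ContentParserOptions options) {
      final Map<String, Object> variables = new HashMap<>();
      try {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        byte[] chunk = new byte[4096];
        int n;
        while ((n = inputStream.read(chunk)) != -1) {
          bytes.write(chunk, 0, n);
        }
        variables.put("text", new String(bytes.toByteArray(), StandardCharsets.UTF_8));
      } catch (IOException e) {
        throw new UncheckedIOException(e);
      }
      variables.put("filesize", filesize);
      return name -> variables.get(name);
    }
  }

  // Opens a stream over the document, passes its byte length as filesize, and returns the context.
  static GenericContext parseString(ContentParser parser, String document) {
    byte[] data = document.getBytes(StandardCharsets.UTF_8);
    ContentParserOptions options = new ContentParserOptions() { };
    try (InputStream in = new ByteArrayInputStream(data)) {
      return parser.parse(in, data.length, options);
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
  }

  public static void main(String[] args) {
    GenericContext context = parseString(new ToyParser(), "hello");
    System.out.println(context.getVariable("text"));
  }
}
```

The caller owns the stream in this sketch and closes it after `parse` returns; whether the real implementations close the stream themselves is not stated on this page.
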

| Modifier and Type | Class and Description |
|---|---|
| class | ContentParserOptionsBean<br>This is the implementation of ContentParserOptions as a simple Java bean. |

| Modifier and Type | Method and Description |
|---|---|
| GenericContext | AbstractContentParser.parse(InputStream inputStream, long filesize, ContentParserOptions options)<br>This method parses the document given as inputStream and extracts text and metadata returned as GenericContext. |
| protected abstract void | AbstractContentParser.parse(InputStream inputStream, long filesize, ContentParserOptions options, MutableGenericContext context) |
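
The pairing of a public `parse(...)` that returns a GenericContext with a protected abstract `parse(...)` that receives a MutableGenericContext suggests a template-method arrangement. The following is a minimal sketch of that pattern, not the actual AbstractContentParser code: all nested types are stand-ins, and `MapContext`, `AbstractParserSketch`, `FixedTitleParser`, and the `getVariable`/`setVariable` names are hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.util.HashMap;
import java.util.Map;

public class TemplateMethodSketch {

  // Stand-ins for the library types named in the table above.
  interface ContentParserOptions { }

  interface GenericContext {
    Object getVariable(String name);
  }

  interface MutableGenericContext extends GenericContext {
    void setVariable(String name, Object value);
  }

  // Hypothetical map-backed context.
  static class MapContext implements MutableGenericContext {
    private final Map<String, Object> variables = new HashMap<>();

    @Override
    public Object getVariable(String name) {
      return variables.get(name);
    }

    @Override
    public void setVariable(String name, Object value) {
      variables.put(name, value);
    }
  }

  abstract static class AbstractParserSketch {

    // Public entry point: creates the context, delegates to the subclass hook, returns the result.
    public final GenericContext parse(InputStream inputStream, long filesize,
        ContentParserOptions options) {
      MutableGenericContext context = new MapContext();
      parse(inputStream, filesize, options, context);
      return context;
    }

    // Format-specific hook: subclasses write extracted values into the context.
    protected abstract void parse(InputStream inputStream, long filesize,
        ContentParserOptions options, MutableGenericContext context);
  }

  // Hypothetical subclass that ignores the stream and records two fixed variables.
  static class FixedTitleParser extends AbstractParserSketch {
    @Override
    protected void parse(InputStream inputStream, long filesize,
        ContentParserOptions options, MutableGenericContext context) {
      context.setVariable("title", "untitled");
      context.setVariable("filesize", filesize);
    }
  }

  public static void main(String[] args) {
    GenericContext context = new FixedTitleParser().parse(
        new ByteArrayInputStream(new byte[0]), 0L, new ContentParserOptions() { });
    System.out.println(context.getVariable("title"));
  }
}
```

With this split, each format-specific parser only fills the context, while context creation stays in one place.
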

| Modifier and Type | Method and Description |
|---|---|
| void | ContentParserGeneric.parse(InputStream inputStream, long filesize, ContentParserOptions options, MutableGenericContext context) |

| Modifier and Type | Method and Description |
|---|---|
| void | ContentParserHtml.parse(InputStream inputStream, long filesize, ContentParserOptions options, MutableGenericContext context) |

| Modifier and Type | Method and Description |
|---|---|
| void | AbstractContentParserOpenDoc.parse(InputStream inputStream, long filesize, ContentParserOptions options, MutableGenericContext context) |
| protected void | AbstractContentParserOpenDoc.parseContent(InputStream inputStream, ContentParserOptions options, MutableGenericContext context, StringBuilder buffer)<br>This method parses the content of the document. |
| protected void | AbstractContentParserOpenDoc.parseMetadata(InputStream inputStream, ContentParserOptions options, MutableGenericContext context)<br>This method parses the metadata of the document. |
| protected void | AbstractContentParserOpenDoc.parseStyles(InputStream inputStream, ContentParserOptions options, MutableGenericContext context, StringBuilder buffer)<br>This method parses the content of the document. |

| Modifier and Type | Method and Description |
|---|---|
| void | ContentParserPdf.parse(InputStream inputStream, long filesize, ContentParserOptions options, MutableGenericContext context) |

| Modifier and Type | Method and Description |
|---|---|
| protected String | ContentParserXls.extractText(org.apache.poi.poifs.filesystem.POIFSFileSystem poiFs, long filesize, ContentParserOptions options)<br>This method extracts the text from the office document given by poiFs. |
| protected String | ContentParserPpt.extractText(org.apache.poi.poifs.filesystem.POIFSFileSystem poiFs, long filesize, ContentParserOptions options)<br>This method extracts the text from the office document given by poiFs. |
| protected String | ContentParserDoc.extractText(org.apache.poi.poifs.filesystem.POIFSFileSystem poiFs, long filesize, ContentParserOptions options)<br>This method extracts the text from the office document given by poiFs. |
| protected abstract String | AbstractContentParserPoi.extractText(org.apache.poi.poifs.filesystem.POIFSFileSystem poiFs, long filesize, ContentParserOptions options)<br>This method extracts the text from the office document given by poiFs. |
| void | AbstractContentParserPoi.parse(InputStream inputStream, long filesize, ContentParserOptions options, MutableGenericContext context) |
| void | AbstractContentParserOoxml.parse(InputStream inputStream, long filesize, ContentParserOptions options, MutableGenericContext context) |

| Modifier and Type | Method and Description |
|---|---|
| void | AbstractContentParserTextMarkupAware.parse(BufferedReader bufferedReader, ContentParserOptions options, MutableGenericContext context, StringBuilder textBuffer)<br>This method parses the content of the given bufferedReader and appends the textual content to the textBuffer. |
| void | AbstractContentParserText.parse(BufferedReader bufferedReader, ContentParserOptions options, MutableGenericContext context, StringBuilder textBuffer)<br>This method parses the content of the given bufferedReader and appends the textual content to the textBuffer. |
| void | AbstractContentParserText.parse(InputStream inputStream, long filesize, ContentParserOptions options, MutableGenericContext context) |
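
The two `parse` overloads of the text parsers can plausibly cooperate as follows: the InputStream variant wraps the stream in a BufferedReader, hands it to the reader variant together with a fresh textBuffer, and stores the collected text in the context. This is a sketch under assumptions, not the library code: the UTF-8 decoding, the `"text"` variable name, the stand-in interfaces, and the classes `AbstractTextParserSketch` and `PlainLineParser` are all hypothetical.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;

public class TextParserSketch {

  // Stand-ins for the library types named in the table above.
  interface ContentParserOptions { }

  interface MutableGenericContext {
    Object getVariable(String name);
    void setVariable(String name, Object value);
  }

  abstract static class AbstractTextParserSketch {

    // Stream variant: decodes the bytes (UTF-8 is an assumption) and delegates to the reader variant.
    public void parse(InputStream inputStream, long filesize, ContentParserOptions options,
        MutableGenericContext context) throws IOException {
      BufferedReader reader =
          new BufferedReader(new InputStreamReader(inputStream, StandardCharsets.UTF_8));
      StringBuilder textBuffer = new StringBuilder();
      parse(reader, options, context, textBuffer);
      context.setVariable("text", textBuffer.toString());
    }

    // Reader variant: appends the textual content of the reader to textBuffer.
    public abstract void parse(BufferedReader bufferedReader, ContentParserOptions options,
        MutableGenericContext context, StringBuilder textBuffer) throws IOException;
  }

  // Hypothetical plain-text parser: copies every line and terminates it with '\n'.
  static class PlainLineParser extends AbstractTextParserSketch {
    @Override
    public void parse(BufferedReader bufferedReader, ContentParserOptions options,
        MutableGenericContext context, StringBuilder textBuffer) throws IOException {
      String line;
      while ((line = bufferedReader.readLine()) != null) {
        textBuffer.append(line).append('\n');
      }
    }
  }

  // Runs the stream variant on an in-memory document and returns the stored text.
  static String extract(String document) {
    final Map<String, Object> variables = new HashMap<>();
    MutableGenericContext context = new MutableGenericContext() {
      @Override
      public Object getVariable(String name) {
        return variables.get(name);
      }

      @Override
      public void setVariable(String name, Object value) {
        variables.put(name, value);
      }
    };
    byte[] data = document.getBytes(StandardCharsets.UTF_8);
    try {
      new PlainLineParser().parse(new ByteArrayInputStream(data), data.length,
          new ContentParserOptions() { }, context);
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
    return (String) context.getVariable("text");
  }

  public static void main(String[] args) {
    System.out.print(extract("first line\nsecond line"));
  }
}
```

A markup-aware subclass would override only the reader variant and filter tags before appending to textBuffer.
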

| Modifier and Type | Method and Description |
|---|---|
| void | ContentParserXml.parse(BufferedReader bufferedReader, ContentParserOptions options, MutableGenericContext context, StringBuilder textBuffer)<br>This method parses the content of the given bufferedReader and appends the textual content to the textBuffer. |
| void | ContentParserXml.parse(InputStream inputStream, long filesize, ContentParserOptions options, MutableGenericContext context) |

| Modifier and Type | Method and Description |
|---|---|
| protected MutableSearchEntry | ResourceSearchIndexerImpl.createEntry(SearchIndexer indexer, DataResource resource, String resourceUri, ContentParserOptions options) |

Copyright © 2001–2016 mmm-Team. All rights reserved.