Package | Description |
---|---|
net.sf.mmm.content.parser.api |
Provides the API for parsers to extract metadata from content.
|
net.sf.mmm.content.parser.base |
Contains the base-implementation of the content-parser API.
|
net.sf.mmm.content.parser.impl |
Contains the implementation of the
content-parser API . |
net.sf.mmm.content.parser.impl.html |
Contains
ContentParser -implementations
for HTML-files. |
net.sf.mmm.content.parser.impl.opendoc |
Contains
ContentParser -implementations
for OO-XML files. |
net.sf.mmm.content.parser.impl.pdf |
Contains
ContentParser -implementations
for PDF files. |
net.sf.mmm.content.parser.impl.poi |
Contains
ContentParser -implementations
for binary MS-Office files using POI. |
net.sf.mmm.content.parser.impl.text |
Contains
ContentParser -implementations
for text-files. |
net.sf.mmm.content.parser.impl.xml |
Contains
ContentParser -implementations
for XML files. |
net.sf.mmm.search.indexer.base |
Contains the base-implementation of the
Search-Indexer API . |
Modifier and Type | Method and Description |
---|---|
GenericContext |
ContentParser.parse(InputStream inputStream,
long filesize,
ContentParserOptions options)
This method parses the document given as
inputStream and
extracts text and metadata returned as
GenericContext . |
Modifier and Type | Class and Description |
---|---|
class |
ContentParserOptionsBean
This is the implementation of
ContentParserOptions as simple java
bean. |
Modifier and Type | Method and Description |
---|---|
GenericContext |
AbstractContentParser.parse(InputStream inputStream,
long filesize,
ContentParserOptions options)
This method parses the document given as
inputStream and
extracts text and metadata returned as
GenericContext . |
protected abstract void |
AbstractContentParser.parse(InputStream inputStream,
long filesize,
ContentParserOptions options,
MutableGenericContext context) |
Modifier and Type | Method and Description |
---|---|
void |
ContentParserGeneric.parse(InputStream inputStream,
long filesize,
ContentParserOptions options,
MutableGenericContext context) |
Modifier and Type | Method and Description |
---|---|
void |
ContentParserHtml.parse(InputStream inputStream,
long filesize,
ContentParserOptions options,
MutableGenericContext context) |
Modifier and Type | Method and Description |
---|---|
void |
AbstractContentParserOpenDoc.parse(InputStream inputStream,
long filesize,
ContentParserOptions options,
MutableGenericContext context) |
protected void |
AbstractContentParserOpenDoc.parseContent(InputStream inputStream,
ContentParserOptions options,
MutableGenericContext context,
StringBuilder buffer)
This method parses the content of the document.
|
protected void |
AbstractContentParserOpenDoc.parseMetadata(InputStream inputStream,
ContentParserOptions options,
MutableGenericContext context)
This method parses the metadata of the document.
|
protected void |
AbstractContentParserOpenDoc.parseStyles(InputStream inputStream,
ContentParserOptions options,
MutableGenericContext context,
StringBuilder buffer)
This method parses the content of the document.
|
Modifier and Type | Method and Description |
---|---|
void |
ContentParserPdf.parse(InputStream inputStream,
long filesize,
ContentParserOptions options,
MutableGenericContext context) |
Modifier and Type | Method and Description |
---|---|
protected String |
ContentParserXls.extractText(org.apache.poi.poifs.filesystem.POIFSFileSystem poiFs,
long filesize,
ContentParserOptions options)
This method extracts the text from the office document given by
poiFs . |
protected String |
ContentParserPpt.extractText(org.apache.poi.poifs.filesystem.POIFSFileSystem poiFs,
long filesize,
ContentParserOptions options)
This method extracts the text from the office document given by
poiFs . |
protected String |
ContentParserDoc.extractText(org.apache.poi.poifs.filesystem.POIFSFileSystem poiFs,
long filesize,
ContentParserOptions options)
This method extracts the text from the office document given by
poiFs . |
protected abstract String |
AbstractContentParserPoi.extractText(org.apache.poi.poifs.filesystem.POIFSFileSystem poiFs,
long filesize,
ContentParserOptions options)
This method extracts the text from the office document given by
poiFs . |
void |
AbstractContentParserPoi.parse(InputStream inputStream,
long filesize,
ContentParserOptions options,
MutableGenericContext context) |
void |
AbstractContentParserOoxml.parse(InputStream inputStream,
long filesize,
ContentParserOptions options,
MutableGenericContext context) |
Modifier and Type | Method and Description |
---|---|
void |
AbstractContentParserTextMarkupAware.parse(BufferedReader bufferedReader,
ContentParserOptions options,
MutableGenericContext context,
StringBuilder textBuffer)
This method parses the content of the given
bufferedReader and
appends the textual content to the textBuffer . |
void |
AbstractContentParserText.parse(BufferedReader bufferedReader,
ContentParserOptions options,
MutableGenericContext context,
StringBuilder textBuffer)
This method parses the content of the given
bufferedReader and
appends the textual content to the textBuffer . |
void |
AbstractContentParserText.parse(InputStream inputStream,
long filesize,
ContentParserOptions options,
MutableGenericContext context) |
Modifier and Type | Method and Description |
---|---|
void |
ContentParserXml.parse(BufferedReader bufferedReader,
ContentParserOptions options,
MutableGenericContext context,
StringBuilder textBuffer)
This method parses the content of the given
bufferedReader and
appends the textual content to the textBuffer . |
void |
ContentParserXml.parse(InputStream inputStream,
long filesize,
ContentParserOptions options,
MutableGenericContext context) |
Modifier and Type | Method and Description |
---|---|
protected MutableSearchEntry |
ResourceSearchIndexerImpl.createEntry(SearchIndexer indexer,
DataResource resource,
String resourceUri,
ContentParserOptions options) |
Copyright © 2001–2016 mmm-Team. All rights reserved.