Interface | Description |
---|---|
MetadataEmbedder | Interface for writing metadata properties back into the content file. |
MetadataExtracter | Interface for document property extracters. |
MetadataExtracterConfig | Interface that allows Alfresco to provide configuration properties to metadata extracters such as the Tika parser. |
OpenOfficeMetadataWorker | An interface that allows separation between the metadata extractor registry and the third-party subsystem owning the OpenOffice connection. |
Class | Description |
---|---|
AbstractMappingMetadataExtracter | Support class for metadata extracters that support dynamic and config-driven mapping between extracted values and model properties. |
DefaultOpenOfficeMetadataWorker | The class doing the actual work of the OpenOfficeMetadataExtracter, based around an OpenOffice connection. |
DWGMetadataExtracter | Metadata extractor for the MIMETYPE_APP_DWG and MIMETYPE_IMG_DWG mimetypes. |
HtmlMetadataExtracter | Extracts the following values from HTML documents: |
MailMetadataExtracter | Outlook MAPI format email metadata extractor extracting the following values: |
MediaTypeDisablingDocumentSelector | Tika 1.6 has the ability to parse embedded artifacts, such as images in a PDF, but this can be very resource intensive. Adding this selector to parsers and transformers that handle formats with embedded artifacts disables parsing of the specified content types. |
MetadataExtracterConfigImpl | Default implementation of MetadataExtracterConfig. |
MetadataExtracterLimits | Represents maximum values (that result in exceptions if exceeded) or limits on values (that result in EOF (End Of File) being returned early). |
MetadataExtracterRegistry | Holds and provides the most appropriate metadata extracter for a particular mimetype. |
MP3MetadataExtracter | Extracts the following values from MP3 files: |
OfficeMetadataExtracter | Office file format Metadata Extracter. |
OpenDocumentMetadataExtracter | Metadata extractor for the MIMETYPE_OPENDOCUMENT_XXX mimetypes. |
OpenOfficeMetadataExtracter | Extracts values from Star Office documents into the following: |
PdfBoxMetadataExtracter | Metadata extractor for PDF documents. |
PoiMetadataExtracter | POI-based metadata extractor for Office 07 documents. |
RFC822MetadataExtracter | Metadata extractor for RFC822 MIME emails. |
TikaAudioMetadataExtracter | A Metadata Extractor which makes use of the Apache Tika Audio Parsers to extract metadata from your media files. |
TikaAutoMetadataExtracter | A Metadata Extractor which makes use of the Apache Tika auto-detection to select the best parser to extract the metadata from your document. |
TikaPoweredMetadataExtracter | The parent of all Metadata Extractors which use Apache Tika under the hood. |
TikaPoweredMetadataExtracter.HeadContentHandler | This content handler will capture entries from within the header of the Tika content XHTML, but ignore the rest. |
TikaPoweredMetadataExtracter.MapCaptureContentHandler | This content handler will grab all tags and attributes, and record the textual content of the last one seen. |
TikaPoweredMetadataExtracter.NullContentHandler | A content handler that ignores all the content it finds. |
TikaSpringConfiguredMetadataExtracter | A Metadata Extractor which makes use of Apache Tika, and allows the Tika parser to be selected through Spring configuration to extract the metadata from your document. |
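
The head-only capture described for TikaPoweredMetadataExtracter.HeadContentHandler can be illustrated with a minimal sketch built on the JDK SAX API alone. This is not the Alfresco class: the name `HeadOnlyHandlerSketch`, the `parse` helper, and the decision to record `meta` name/content pairs are all assumptions made for the example.

```java
import java.io.StringReader;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical sketch, not part of Alfresco: records <meta> entries that
// appear inside <head> and ignores every other part of the document.
class HeadOnlyHandlerSketch extends DefaultHandler {
    private final Map<String, String> headMeta = new LinkedHashMap<>();
    private boolean insideHead = false;

    @Override
    public void startElement(String uri, String localName, String qName, Attributes atts) {
        if ("head".equals(qName)) {
            insideHead = true;
        } else if (insideHead && "meta".equals(qName)) {
            String name = atts.getValue("name");
            String content = atts.getValue("content");
            if (name != null && content != null) {
                headMeta.put(name, content);
            }
        }
    }

    @Override
    public void endElement(String uri, String localName, String qName) {
        if ("head".equals(qName)) {
            insideHead = false; // everything after </head> is skipped
        }
    }

    Map<String, String> getHeadMeta() {
        return headMeta;
    }

    // Assumed helper: parses a well-formed XHTML string with the default
    // (non-namespace-aware) JDK SAX parser and returns the captured entries.
    static Map<String, String> parse(String xhtml) {
        try {
            HeadOnlyHandlerSketch handler = new HeadOnlyHandlerSketch();
            SAXParserFactory.newInstance().newSAXParser()
                    .parse(new InputSource(new StringReader(xhtml)), handler);
            return handler.getHeadMeta();
        } catch (Exception e) {
            throw new IllegalStateException("Could not parse XHTML", e);
        }
    }

    public static void main(String[] args) {
        String xhtml = "<html><head><meta name=\"author\" content=\"Jane\"/></head>"
                + "<body><meta name=\"ignored\" content=\"x\"/><p>text</p></body></html>";
        System.out.println(parse(xhtml));
    }
}
```

Tracking a single `insideHead` flag keeps the handler stateless apart from the captured map, which is enough when only header entries matter and body content can be discarded.
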
Enum | Description |
---|---|
MetadataExtracter.OverwritePolicy | An enumeration of functional property overwrite policies. |
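
To show what a "functional" enumeration of overwrite policies can look like, here is a minimal sketch in which each constant carries its own merge behaviour. It is an illustration only: the enum name `OverwritePolicySketch`, the constants `ALWAYS` and `IF_ABSENT`, and the `apply` signature are hypothetical and are not taken from MetadataExtracter.OverwritePolicy.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch, not the Alfresco enum: each constant implements its
// own rule for merging extracted values into existing properties.
enum OverwritePolicySketch {
    // Extracted values replace existing ones.
    ALWAYS {
        @Override
        Map<String, String> apply(Map<String, String> extracted, Map<String, String> existing) {
            Map<String, String> result = new HashMap<>(existing);
            result.putAll(extracted);
            return result;
        }
    },
    // Extracted values are used only where no property exists yet.
    IF_ABSENT {
        @Override
        Map<String, String> apply(Map<String, String> extracted, Map<String, String> existing) {
            Map<String, String> result = new HashMap<>(existing);
            for (Map.Entry<String, String> entry : extracted.entrySet()) {
                result.putIfAbsent(entry.getKey(), entry.getValue());
            }
            return result;
        }
    };

    // Returns a new map; neither input map is modified.
    abstract Map<String, String> apply(Map<String, String> extracted, Map<String, String> existing);

    public static void main(String[] args) {
        Map<String, String> existing = new HashMap<>();
        existing.put("title", "Old");
        Map<String, String> extracted = new HashMap<>();
        extracted.put("title", "New");
        extracted.put("author", "Jane");
        for (OverwritePolicySketch policy : values()) {
            System.out.println(policy + " -> " + policy.apply(extracted, existing));
        }
    }
}
```

Putting the merge rule on the constant itself means callers select a policy by name and never branch on it, which is one common reason to model policies as an enum with behaviour.
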
Copyright © 2005–2017 Alfresco Software. All rights reserved.