org.alfresco.repo.content.metadata (Alfresco 5.3.a-SNAPSHOT API)

Interface Summary
Interface	Description
MetadataEmbedder	Interface for writing metadata properties back into the content file.
MetadataExtracter	Interface for document property extracters.
MetadataExtracterConfig	Interface for allowing Alfresco to provide configuration properties to metadata extracters like Tika parser;
OpenOfficeMetadataWorker	An interface that allows separation between the metadata extractor registry and the third party subsystem owning the open office connection.

Class Summary
Class	Description
AbstractMappingMetadataExtracter	Support class for metadata extracters that support dynamic and config-driven mapping between extracted values and model properties.
DefaultOpenOfficeMetadataWorker	The class doing the actual work of the OpenOfficeMetadataExtracter, based around an OpenOffice connection.
DWGMetadataExtracter	Metadata extractor for the `MIMETYPE_APP_DWG` and `MIMETYPE_IMG_DWG` mimetypes.
HtmlMetadataExtracter	Extracts the following values from HTML documents:
MailMetadataExtracter	Outlook MAPI format email meta-data extractor extracting the following values:
MediaTypeDisablingDocumentSelector	Tika 1.6 has the ability to parse embedded artifacts, such as images in a PDF, but this can be very resource intensive so adding this selector to parsers and transformers that handle formats with embedded artifacts will disable parsing of the specified content types.
MetadataExtracterConfigImpl	Default implementation for the MetadataExtracterConfig;
MetadataExtracterLimits	Represents maximum values (that result in exceptions if exceeded) or limits on values (that result in EOF (End Of File) being returned early).
MetadataExtracterRegistry	Holds and provides the most appropriate metadate extracter for a particular mimetype.
MP3MetadataExtracter	Extracts the following values from MP3 files:
OfficeMetadataExtracter	Office file format Metadata Extracter.
OpenDocumentMetadataExtracter	Metadata extractor for the `MIMETYPE_OPENDOCUMENT_XXX` mimetypes.
OpenOfficeMetadataExtracter	Extracts values from Star Office documents into the following:
PdfBoxMetadataExtracter	Metadata extractor for the PDF documents.
PoiMetadataExtracter	POI-based metadata extractor for Office 07 documents.
RFC822MetadataExtracter	Metadata extractor for RFC822 mime emails.
TikaAudioMetadataExtracter	A Metadata Extractor which makes use of the Apache Tika Audio Parsers to extract metadata from your media files.
TikaAutoMetadataExtracter	A Metadata Extractor which makes use of the Apache Tika auto-detection to select the best parser to extract the metadata from your document.
TikaPoweredMetadataExtracter	The parent of all Metadata Extractors which use Apache Tika under the hood.
TikaPoweredMetadataExtracter.HeadContentHandler	This content handler will capture entries from within the header of the Tika content XHTML, but ignore the rest.
TikaPoweredMetadataExtracter.MapCaptureContentHandler	This content handler will grab all tags and attributes, and record the textual content of the last seen one of them.
TikaPoweredMetadataExtracter.NullContentHandler	A content handler that ignores all the content it finds.
TikaSpringConfiguredMetadataExtracter	A Metadata Extractor which makes use of Apache Tika, and allows the selection of the Tika parser to be sprung-in to extract the metadata from your document.

Enum Summary
Enum Description

MetadataExtracter.OverwritePolicy
A enumeration of functional property overwrite policies.

Package org.alfresco.repo.content.metadata