public class TikaAutoMetadataExtracter extends TikaPoweredMetadataExtracter
author: -- cm:author title: -- cm:title subject: -- cm:description created: -- cm:created comments: geo:lat: -- cm:latitude geo:long: -- cm:longitude
TikaPoweredMetadataExtracter.HeadContentHandler, TikaPoweredMetadataExtracter.MapCaptureContentHandler, TikaPoweredMetadataExtracter.NullContentHandler
MetadataExtracter.OverwritePolicy
Modifier and Type | Field and Description |
---|---|
protected static org.apache.commons.logging.Log |
logger |
static ArrayList<String> |
SUPPORTED_MIMETYPES |
documentSelector, KEY_AUTHOR, KEY_COMMENTS, KEY_CREATED, KEY_DESCRIPTION, KEY_SUBJECT, KEY_TAGS, KEY_TITLE
MEGABYTE_SIZE, metadataExtracterConfig, NAMESPACE_PROPERTY_PREFIX, PROPERTY_COMPONENT_EMBED, PROPERTY_COMPONENT_EXTRACT, PROPERTY_PREFIX_METADATA
Constructor and Description |
---|
TikaAutoMetadataExtracter(org.apache.tika.config.TikaConfig tikaConfig) |
Modifier and Type | Method and Description |
---|---|
protected Map<String,Serializable> |
extractSpecific(org.apache.tika.metadata.Metadata metadata,
Map<String,Serializable> properties,
Map<String,String> headers)
Because some editors use JPEG_IMAGE_HEIGHT_TAG when
saving JPEG images , a more reliable source for
image size are the values provided by Tika
and not the exif/tiff metadata read from the file
This will override the tiff:Image size
which gets embedded into the alfresco node properties
for jpeg files that contain such exif information
|
protected org.apache.tika.parser.Parser |
getParser()
Does auto-detection to select the best Tika
Parser.
|
buildParseContext, buildSupportedMimetypes, embedInternal, extractRaw, extractSize, getDocumentSelector, getEmbedder, getExtractorContext, getInputStream, getMetadataSeparator, makeDate, needHeaderContents, setDocumentSelector, setMetadataSeparator
checkIsEmbedSupported, checkIsSupported, embed, extract, extract, extract, filterSystemProperties, getBeanName, getDefaultEmbedMapping, getDefaultMapping, getEmbedMapping, getExecutorService, getLimits, getMapping, getMimetypeService, init, isEmbeddingSupported, isSupported, newRawMap, putRawValue, readEmbedMappingProperties, readEmbedMappingProperties, readGlobalEmbedMappingProperties, readGlobalExtractMappingProperties, readMappingProperties, readMappingProperties, register, setApplicationContext, setBeanName, setDictionaryService, setEmbedMapping, setEmbedMappingProperties, setEnableStringTagging, setExecutorService, setFailOnTypeConversion, setInheritDefaultEmbedMapping, setInheritDefaultMapping, setMapping, setMappingProperties, setMetadataExtracterConfig, setMimetypeLimits, setMimetypeService, setOverwritePolicy, setProperties, setRegistry, setSupportedDateFormats, setSupportedEmbedMimetypes, setSupportedMimetypes
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
embed, isEmbeddingSupported
public TikaAutoMetadataExtracter(org.apache.tika.config.TikaConfig tikaConfig)
protected org.apache.tika.parser.Parser getParser()
getParser
in class TikaPoweredMetadataExtracter
protected Map<String,Serializable> extractSpecific(org.apache.tika.metadata.Metadata metadata, Map<String,Serializable> properties, Map<String,String> headers)
extractSpecific
in class TikaPoweredMetadataExtracter
Copyright © 2005–2017 Alfresco Software. All rights reserved.