public class PoiMetadataExtracter extends TikaPoweredMetadataExtracter implements org.springframework.beans.factory.InitializingBean
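author:              -- cm:author
title:               -- cm:title
subject:             -- cm:description
created:             -- cm:created
Any custom property: -- [not mapped]

Uses Apache Tika, and initializes AlfrescoPoiPatchUtils properties to resolve the following issues:
MNT-577: Alfresco is running 100% CPU for over 10 minutes while extracting metadata for Word office document
MNT-11823: Upload of PPTX causes very high memory usage leading to system instability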
TikaPoweredMetadataExtracter.HeadContentHandler, TikaPoweredMetadataExtracter.MapCaptureContentHandler, TikaPoweredMetadataExtracter.NullContentHandler
MetadataExtracter.OverwritePolicy
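The default mapping given in the class description (author, title, subject and created map to cm: properties, while custom properties are not mapped) can be illustrated with a small lookup. This is only a sketch: `DefaultMappingSketch`, `DEFAULT_MAPPING` and `applyMapping` are hypothetical names, plain strings stand in for Alfresco's qualified property names, and the real extracter loads its mapping through the inherited mapping methods rather than a hard-coded map.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Illustrative stand-in only; not part of the Alfresco API. */
final class DefaultMappingSketch {

    /** Raw extracted key -> content-model property, as listed in the class description. */
    static final Map<String, String> DEFAULT_MAPPING = new LinkedHashMap<>();
    static {
        DEFAULT_MAPPING.put("author", "cm:author");
        DEFAULT_MAPPING.put("title", "cm:title");
        DEFAULT_MAPPING.put("subject", "cm:description");
        DEFAULT_MAPPING.put("created", "cm:created");
    }

    /** Keeps only raw values whose key has a mapping; unmapped (custom) keys are dropped. */
    static Map<String, String> applyMapping(Map<String, String> raw) {
        Map<String, String> mapped = new LinkedHashMap<>();
        for (Map.Entry<String, String> entry : raw.entrySet()) {
            String target = DEFAULT_MAPPING.get(entry.getKey());
            if (target != null) {
                mapped.put(target, entry.getValue());
            }
        }
        return mapped;
    }

    public static void main(String[] args) {
        Map<String, String> raw = new LinkedHashMap<>();
        raw.put("author", "Jane Doe");
        raw.put("subject", "Quarterly report");
        raw.put("myCustomProperty", "ignored"); // custom property: not mapped
        System.out.println(applyMapping(raw));
    }
}
```

In this sketch the custom property never reaches the result, which mirrors the "[not mapped]" entry in the list above.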
Modifier and Type | Field and Description |
---|---|
protected static org.apache.commons.logging.Log | logger |
static ArrayList<String> | SUPPORTED_MIMETYPES |
documentSelector, KEY_AUTHOR, KEY_COMMENTS, KEY_CREATED, KEY_DESCRIPTION, KEY_SUBJECT, KEY_TAGS, KEY_TITLE
MEGABYTE_SIZE, metadataExtracterConfig, NAMESPACE_PROPERTY_PREFIX, PROPERTY_COMPONENT_EMBED, PROPERTY_COMPONENT_EXTRACT, PROPERTY_PREFIX_METADATA
Constructor and Description |
---|
PoiMetadataExtracter() |
Modifier and Type | Method and Description |
---|---|
void | afterPropertiesSet() MNT-11823: Upload of PPTX causes very high memory usage leading to system instability. Initialization of AlfrescoPoiPatchUtils properties for the TikaPoweredMetadataExtracter.getExtractorContext() context. |
protected org.apache.tika.parser.Parser | getParser() Returns the correct Tika Parser to process the document. |
Set<String> | getPoiAllowableXslfRelationshipTypes() |
Boolean | isPoiExtractPropertiesOnly() |
void | setPoiAllowableXslfRelationshipTypes(Set<String> poiAllowableXslfRelationshipTypes) MNT-11823: Upload of PPTX causes very high memory usage leading to system instability |
void | setPoiExtractPropertiesOnly(Boolean poiExtractPropertiesOnly) MNT-11823: Upload of PPTX causes very high memory usage leading to system instability |
void | setPoiFootnotesLimit(Integer poiFootnotesLimit) MNT-577: Alfresco is running 100% CPU for over 10 minutes while extracting metadata for Word office document |
buildParseContext, buildSupportedMimetypes, embedInternal, extractRaw, extractSize, extractSpecific, getDocumentSelector, getEmbedder, getExtractorContext, getInputStream, getMetadataSeparator, makeDate, needHeaderContents, setDocumentSelector, setMetadataSeparator
checkIsEmbedSupported, checkIsSupported, embed, extract, extract, extract, filterSystemProperties, getBeanName, getDefaultEmbedMapping, getDefaultMapping, getEmbedMapping, getExecutorService, getLimits, getMapping, getMimetypeService, init, isEmbeddingSupported, isSupported, newRawMap, putRawValue, readEmbedMappingProperties, readEmbedMappingProperties, readGlobalEmbedMappingProperties, readGlobalExtractMappingProperties, readMappingProperties, readMappingProperties, register, setApplicationContext, setBeanName, setDictionaryService, setEmbedMapping, setEmbedMappingProperties, setEnableStringTagging, setExecutorService, setFailOnTypeConversion, setInheritDefaultEmbedMapping, setInheritDefaultMapping, setMapping, setMappingProperties, setMetadataExtracterConfig, setMimetypeLimits, setMimetypeService, setOverwritePolicy, setProperties, setRegistry, setSupportedDateFormats, setSupportedEmbedMimetypes, setSupportedMimetypes
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
embed, isEmbeddingSupported
protected org.apache.tika.parser.Parser getParser()
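Description copied from class: TikaPoweredMetadataExtracter
Returns the correct Tika Parser to process the document. An alternative is TikaAutoMetadataExtracter, which makes use of the Tika auto-detection.
Specified by:
getParser in class TikaPoweredMetadataExtracter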
public void setPoiFootnotesLimit(Integer poiFootnotesLimit)
Parameters:
poiFootnotesLimit - Integer value which specifies the limit on the number of footnotes in XWPF documents
public void setPoiExtractPropertiesOnly(Boolean poiExtractPropertiesOnly)
Parameters:
poiExtractPropertiesOnly - Boolean value which indicates that the POI extractor must avoid building the full document parts hierarchy and reading the content of the parts
public Boolean isPoiExtractPropertiesOnly()
public void setPoiAllowableXslfRelationshipTypes(Set<String> poiAllowableXslfRelationshipTypes)
public void afterPropertiesSet() throws Exception
Initialization of AlfrescoPoiPatchUtils properties for the TikaPoweredMetadataExtracter.getExtractorContext() context.
Specified by:
afterPropertiesSet in interface org.springframework.beans.factory.InitializingBean
Throws:
Exception
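The setter and afterPropertiesSet() contract described above follows the usual InitializingBean pattern: properties are injected first, then published in one step once all of them are set. The following is a minimal, self-contained sketch of that pattern and not the Alfresco implementation. `ExtracterLifecycleSketch` and `PatchSettingsSketch` are hypothetical stand-ins, the per-context maps are an assumption about how such settings could be stored, and the limit of 50 and the relationship type are illustrative values only.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

/** Hypothetical stand-in mirroring the setters and afterPropertiesSet() of this class. */
final class ExtracterLifecycleSketch {
    private Integer poiFootnotesLimit;
    private Boolean poiExtractPropertiesOnly;
    private Set<String> poiAllowableXslfRelationshipTypes;

    void setPoiFootnotesLimit(Integer limit) { this.poiFootnotesLimit = limit; }
    void setPoiExtractPropertiesOnly(Boolean flag) { this.poiExtractPropertiesOnly = flag; }
    Boolean isPoiExtractPropertiesOnly() { return poiExtractPropertiesOnly; }
    void setPoiAllowableXslfRelationshipTypes(Set<String> types) { this.poiAllowableXslfRelationshipTypes = types; }
    Set<String> getPoiAllowableXslfRelationshipTypes() { return poiAllowableXslfRelationshipTypes; }

    /** Assumed context key; the real value comes from TikaPoweredMetadataExtracter. */
    String getExtractorContext() { return "sketch-context"; }

    /** Publishes only the properties that were actually configured. */
    void afterPropertiesSet() {
        String context = getExtractorContext();
        if (poiFootnotesLimit != null) {
            PatchSettingsSketch.FOOTNOTES_LIMIT.put(context, poiFootnotesLimit);
        }
        if (poiExtractPropertiesOnly != null) {
            PatchSettingsSketch.PROPERTIES_ONLY.put(context, poiExtractPropertiesOnly);
        }
        if (poiAllowableXslfRelationshipTypes != null) {
            PatchSettingsSketch.XSLF_RELATIONSHIP_TYPES.put(context, poiAllowableXslfRelationshipTypes);
        }
    }

    public static void main(String[] args) {
        ExtracterLifecycleSketch extracter = new ExtracterLifecycleSketch();
        extracter.setPoiFootnotesLimit(50);          // illustrative limit
        extracter.setPoiExtractPropertiesOnly(true);
        Set<String> types = new HashSet<>();
        types.add("http://schemas.openxmlformats.org/officeDocument/2006/relationships/slide");
        extracter.setPoiAllowableXslfRelationshipTypes(types);
        extracter.afterPropertiesSet();              // a Spring container would call this after injection
        System.out.println(PatchSettingsSketch.FOOTNOTES_LIMIT.get(extracter.getExtractorContext()));
    }
}

/** Hypothetical per-context settings holder; not AlfrescoPoiPatchUtils itself. */
final class PatchSettingsSketch {
    static final Map<String, Integer> FOOTNOTES_LIMIT = new HashMap<>();
    static final Map<String, Boolean> PROPERTIES_ONLY = new HashMap<>();
    static final Map<String, Set<String>> XSLF_RELATIONSHIP_TYPES = new HashMap<>();
}
```

Deferring publication to afterPropertiesSet() means a half-configured bean never exposes partial settings, and properties left unset simply keep whatever default the consumer applies.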
Copyright © 2005–2017 Alfresco Software. All rights reserved.