public class PdfBoxMetadataExtracter extends TikaPoweredMetadataExtracter
author: -- cm:author title: -- cm:title subject: -- cm:description created: -- cm:created (custom metadata): --Uses Apache Tika
TikaPoweredMetadataExtracter.HeadContentHandler, TikaPoweredMetadataExtracter.MapCaptureContentHandler, TikaPoweredMetadataExtracter.NullContentHandler
MetadataExtracter.OverwritePolicy
Modifier and Type | Field and Description |
---|---|
protected static org.apache.commons.logging.Log |
pdfLogger |
static ArrayList<String> |
SUPPORTED_MIMETYPES |
documentSelector, KEY_AUTHOR, KEY_COMMENTS, KEY_CREATED, KEY_DESCRIPTION, KEY_SUBJECT, KEY_TAGS, KEY_TITLE, logger
MEGABYTE_SIZE, metadataExtracterConfig, NAMESPACE_PROPERTY_PREFIX, PROPERTY_COMPONENT_EMBED, PROPERTY_COMPONENT_EXTRACT, PROPERTY_PREFIX_METADATA
Constructor and Description |
---|
PdfBoxMetadataExtracter() |
Modifier and Type | Method and Description |
---|---|
protected org.apache.tika.parser.Parser |
getParser()
Returns the correct Tika Parser to process the document.
|
buildParseContext, buildSupportedMimetypes, embedInternal, extractRaw, extractSize, extractSpecific, getDocumentSelector, getEmbedder, getExtractorContext, getInputStream, getMetadataSeparator, makeDate, needHeaderContents, setDocumentSelector, setMetadataSeparator
checkIsEmbedSupported, checkIsSupported, embed, extract, extract, extract, filterSystemProperties, getBeanName, getDefaultEmbedMapping, getDefaultMapping, getEmbedMapping, getExecutorService, getLimits, getMapping, getMimetypeService, init, isEmbeddingSupported, isSupported, newRawMap, putRawValue, readEmbedMappingProperties, readEmbedMappingProperties, readGlobalEmbedMappingProperties, readGlobalExtractMappingProperties, readMappingProperties, readMappingProperties, register, setApplicationContext, setBeanName, setDictionaryService, setEmbedMapping, setEmbedMappingProperties, setEnableStringTagging, setExecutorService, setFailOnTypeConversion, setInheritDefaultEmbedMapping, setInheritDefaultMapping, setMapping, setMappingProperties, setMetadataExtracterConfig, setMimetypeLimits, setMimetypeService, setOverwritePolicy, setProperties, setRegistry, setSupportedDateFormats, setSupportedEmbedMimetypes, setSupportedMimetypes
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
embed, isEmbeddingSupported
protected org.apache.tika.parser.Parser getParser()
TikaPoweredMetadataExtracter
TikaAutoMetadataExtracter
which makes use of the Tika auto-detection.getParser
in class TikaPoweredMetadataExtracter
Copyright © 2005–2017 Alfresco Software. All rights reserved.