Skip navigation links
A B C D E F G H I J K L M N O P Q R S T U V W X _ 

A

a - Variable in class org.apache.tika.sax.xpath.CompositeMatcher
 
ABS_PEAK_AUDIO_FILE_PATH - Static variable in interface org.apache.tika.metadata.XMPDM
"The absolute path to the file's peak audio file.
ABSOLUTE_MAX_NGRAM_LENGTH - Static variable in class org.apache.tika.language.LanguageProfilerBuilder
The maximum length allowed for a ngram.
ABSOLUTE_MIN_NGRAM_LENGTH - Static variable in class org.apache.tika.language.LanguageProfilerBuilder
The minimum length allowed for a ngram.
ABSTRACT_PATH - Static variable in class org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor
 
AbstractMetadataHandler - Class in org.apache.tika.parser.xml
Base class for SAX handlers that map SAX events into document metadata.
AbstractMetadataHandler(Metadata, String) - Constructor for class org.apache.tika.parser.xml.AbstractMetadataHandler
 
AbstractMetadataHandler(Metadata, Property) - Constructor for class org.apache.tika.parser.xml.AbstractMetadataHandler
 
AbstractParser - Class in org.apache.tika.parser
Abstract base class for new parsers.
AbstractParser() - Constructor for class org.apache.tika.parser.AbstractParser
 
AccessPermissionException - Exception in org.apache.tika.exception
Exception to be thrown when a document does not allow content extraction.
AccessPermissionException() - Constructor for exception org.apache.tika.exception.AccessPermissionException
 
AccessPermissionException(Throwable) - Constructor for exception org.apache.tika.exception.AccessPermissionException
 
AccessPermissionException(String) - Constructor for exception org.apache.tika.exception.AccessPermissionException
 
AccessPermissionException(String, Throwable) - Constructor for exception org.apache.tika.exception.AccessPermissionException
 
AccessPermissions - Interface in org.apache.tika.metadata
Until we can find a common standard, we'll use these options.
ACKNOWLEDGEMENT - Static variable in interface org.apache.tika.metadata.ClimateForcast
 
acquireClient() - Method in class org.apache.tika.fork.ForkParser
 
acronym - Variable in class org.apache.tika.mime.MimeType
The MimeType acronym
ACRONYM_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
 
Activator - Class in org.apache.tika.parser.internal
 
Activator() - Constructor for class org.apache.tika.parser.internal.Activator
 
active - Variable in class org.apache.tika.fork.ForkServer
 
activeRow - Variable in class org.apache.tika.parser.iwork.PagesContentHandler
 
activeTableId - Variable in class org.apache.tika.parser.iwork.PagesContentHandler
 
add(String) - Method in class org.apache.tika.language.LanguageProfile
Adds a single occurrence of the given ngram to this profile.
add(String, long) - Method in class org.apache.tika.language.LanguageProfile
Adds multiple occurrences of the given ngram to this profile.
add(StringBuffer) - Method in class org.apache.tika.language.LanguageProfilerBuilder
Adds ngrams from a single word to this profile
add(LanguageProfilerBuilder.QuickStringBuffer) - Method in class org.apache.tika.language.LanguageProfilerBuilder
Adds the last NGrams from the specified word.
add(CharSequence) - Method in class org.apache.tika.language.LanguageProfilerBuilder
Adds ngrams from a single word in this profile
add(StringBuffer, int) - Method in class org.apache.tika.language.LanguageProfilerBuilder
 
add(String, String) - Method in class org.apache.tika.metadata.Metadata
Add a metadata name/value mapping.
add(Property, String) - Method in class org.apache.tika.metadata.Metadata
Add a metadata property/value mapping.
add(MimeType) - Method in class org.apache.tika.mime.MimeTypes
Add the specified mime-type in the repository.
add(String, MimeType) - Method in class org.apache.tika.mime.Patterns
 
add(String, boolean, MimeType) - Method in class org.apache.tika.mime.Patterns
 
addAlias(MediaType, MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
 
addAlias(MimeType, MediaType) - Method in class org.apache.tika.mime.MimeTypes
Adds an alias for the given media type.
addBoundingBoxPattern(String, Map<Pattern, String>) - Method in class org.apache.tika.parser.gdal.GDALParser
 
addContent(ContentHandler, Metadata) - Method in class org.apache.tika.parser.RecursiveParserWrapper
 
addData(byte[], int, int) - Method in class org.apache.tika.detect.TextStatistics
 
addExtension(String) - Method in class org.apache.tika.mime.MimeType
Adds a known file extension to this type.
addExtension(String, MimeType) - Method in class org.apache.tika.mime.Patterns
 
addGlob(String, MimeType) - Method in class org.apache.tika.mime.Patterns
 
addingService(ServiceReference) - Method in class org.apache.tika.config.TikaActivator
 
ADDITIONAL_MODEL_INFO - Static variable in interface org.apache.tika.metadata.IPTC
Information about the ethnicity and other facets of the model(s) in a model-released image.
addLetter(char) - Method in class org.apache.tika.language.ProfilingWriter
 
addLink(URI) - Method in class org.apache.tika.mime.MimeType
Add a link to this mime type
addMagic(Magic) - Method in class org.apache.tika.mime.MimeType
 
addMetadata(Metadata, Map<String, Object>) - Method in class org.apache.tika.parser.audio.AudioParser
 
addMetadata(String) - Method in class org.apache.tika.parser.xml.AbstractMetadataHandler
Adds the given metadata value.
addMetadata(String) - Method in class org.apache.tika.parser.xml.AttributeDependantMetadataHandler
 
addMetadata(String) - Method in class org.apache.tika.parser.xml.ElementMetadataHandler
 
addMetadata(String) - Method in class org.apache.tika.parser.xml.MetadataHandler
Deprecated.
 
addName(String, MimeType) - Method in class org.apache.tika.mime.Patterns
 
addPattern(MimeType, String) - Method in class org.apache.tika.mime.MimeTypes
Adds a file name pattern for the given media type.
addPattern(MimeType, String, boolean) - Method in class org.apache.tika.mime.MimeTypes
Adds a file name pattern for the given media type.
addPatternWithColon(String, Map<Pattern, String>) - Method in class org.apache.tika.parser.gdal.GDALParser
 
addPatternWithIs(String, Map<Pattern, String>) - Method in class org.apache.tika.parser.gdal.GDALParser
 
addPrefix(String, String) - Method in class org.apache.tika.sax.xpath.XPathParser
 
addProfile(String) - Static method in class org.apache.tika.language.LanguageIdentifier
 
addProfile(String, LanguageProfile) - Static method in class org.apache.tika.language.LanguageIdentifier
Adds a single language profile
addResource(Closeable) - Method in class org.apache.tika.io.TemporaryResources
Adds a new resource to the set of tracked resources that will all be closed when the TemporaryResources.close() method is called.
addRootXML(String, String) - Method in class org.apache.tika.mime.MimeType
Add some rootXML info to this mime-type
addSeparator() - Method in class org.apache.tika.language.ProfilingWriter
 
addService(Object, Object, int) - Static method in class org.apache.tika.config.ServiceLoader
 
addSpaceBetweenElements - Variable in class org.apache.tika.sax.TextContentHandler
 
addSuperType(MediaType, MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
 
addType(MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
 
advance(int) - Method in class org.apache.tika.sax.SecureContentHandler
Records the given number of output characters (or more accurately UTF-16 code units).
afterRead(int) - Method in class org.apache.tika.io.ProxyInputStream
Invoked by the read methods after the proxied call has returned successfully.
afterRead(int) - Method in class org.apache.tika.io.TikaInputStream
 
ALBUM - Static variable in interface org.apache.tika.metadata.XMPDM
"The name of the album."
ALBUM_ARTIST - Static variable in interface org.apache.tika.metadata.XMPDM
"The name of the album artist or group for compilation albums."
algorithms - Variable in class org.apache.tika.parser.utils.CommonsDigester
 
ALIAS_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
 
ALIAS_TYPE_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
 
allowDuplicateValues - Variable in class org.apache.tika.parser.xml.ElementMetadataHandler
 
allowEmptyValues - Variable in class org.apache.tika.parser.xml.ElementMetadataHandler
 
ALPHABET - Static variable in class org.apache.tika.parser.iwork.AutoPageNumberUtils
 
alphabet - Static variable in class org.apache.tika.parser.iwork.PagesContentHandler
 
ALT_TAPE_NAME - Static variable in interface org.apache.tika.metadata.XMPDM
"An alternative tape name, set via the project window or timecode dialog in Premiere.
ALTITUDE - Static variable in interface org.apache.tika.metadata.Geographic
The WGS84 Altitude of the Point
ALTITUDE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 
analyze(StringBuilder) - Method in class org.apache.tika.language.LanguageProfilerBuilder
Analyzes a piece of text
AndClause - Class in org.apache.tika.mime
 
AndClause(Clause...) - Constructor for class org.apache.tika.mime.AndClause
 
annotations - Variable in class org.apache.tika.parser.iwork.PagesContentHandler
 
Annotations() - Constructor for class org.apache.tika.parser.iwork.PagesContentHandler.Annotations
 
annotations - Variable in class org.apache.tika.parser.iwork.PagesContentHandler.Annotations
ID -> Text
APP_VERSION - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
 
append(String) - Method in class org.apache.tika.language.LanguageProfilerBuilder.QuickStringBuffer
 
append(char) - Method in class org.apache.tika.language.LanguageProfilerBuilder.QuickStringBuffer
 
appendBuf(byte[], int, int) - Method in class org.apache.tika.io.TailStream
Adds the content of the given buffer to the internal tail buffer.
appendByte(byte) - Method in class org.apache.tika.io.TailStream
Adds the given byte to the internal tail buffer.
appendedValues(String[], String) - Method in class org.apache.tika.metadata.Metadata
 
APPLICATION - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
 
application(String) - Static method in class org.apache.tika.mime.MediaType
 
APPLICATION_NAME - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
APPLICATION_VERSION - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
APPLICATION_XML - Static variable in class org.apache.tika.mime.MediaType
 
APPLICATION_ZIP - Static variable in class org.apache.tika.mime.MediaType
 
applyHint(List<MimeType>, MimeType) - Method in class org.apache.tika.mime.MimeTypes
Use the MimeType hint to try to clarify or specialise the current possible types list.
applyPatternsToOutput(String, Metadata, Map<Pattern, String>) - Method in class org.apache.tika.parser.gdal.GDALParser
 
applyProbilities(List<MimeType>, MimeType, MimeType) - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
 
ARTIST - Static variable in interface org.apache.tika.metadata.XMPDM
"The name of the artist or artists."
ARTWORK_OR_OBJECT - Static variable in interface org.apache.tika.metadata.IPTC
A set of metadata about artwork or an object in the item
ARTWORK_OR_OBJECT_DETAIL_COPYRIGHT_NOTICE - Static variable in interface org.apache.tika.metadata.IPTC
Contains any necessary copyright notice for claiming the intellectual property for artwork or an object in the image and should identify the current owner of the copyright of this work with associated intellectual property rights.
ARTWORK_OR_OBJECT_DETAIL_CREATOR - Static variable in interface org.apache.tika.metadata.IPTC
Contains the name of the artist who has created artwork or an object in the image.
ARTWORK_OR_OBJECT_DETAIL_DATE_CREATED - Static variable in interface org.apache.tika.metadata.IPTC
Designates the date and optionally the time the artwork or object in the image was created.
ARTWORK_OR_OBJECT_DETAIL_SOURCE - Static variable in interface org.apache.tika.metadata.IPTC
The organisation or body holding and registering the artwork or object in the image for inventory purposes.
ARTWORK_OR_OBJECT_DETAIL_SOURCE_INVENTORY_NUMBER - Static variable in interface org.apache.tika.metadata.IPTC
The inventory number issued by the organisation or body holding and registering the artwork or object in the image.
ARTWORK_OR_OBJECT_DETAIL_TITLE - Static variable in interface org.apache.tika.metadata.IPTC
A reference for the artwork or object in the image.
asAlphaNumeric(int) - Static method in class org.apache.tika.parser.iwork.AutoPageNumberUtils
 
asAlphaNumericLower(int) - Static method in class org.apache.tika.parser.iwork.AutoPageNumberUtils
 
asInputSource() - Method in class org.apache.tika.detect.AutoDetectReader
 
asRomanNumerals(int) - Static method in class org.apache.tika.parser.iwork.AutoPageNumberUtils
 
asRomanNumeralsLower(int) - Static method in class org.apache.tika.parser.iwork.AutoPageNumberUtils
 
ASSEMBLE_DOCUMENT - Static variable in interface org.apache.tika.metadata.AccessPermissions
Can the user insert/rotate/delete pages.
assignableFrom(Collection<Class<? extends Detector>>, Class<? extends Detector>) - Method in class org.apache.tika.detect.CompositeDetector
 
assignableFrom(Collection<Class<? extends Parser>>, Class<? extends Parser>) - Method in class org.apache.tika.parser.CompositeParser
 
attachExternalParsers(TikaConfig) - Static method in class org.apache.tika.parser.external.ExternalParsersFactory
 
attachExternalParsers(List<ExternalParser>, TikaConfig) - Static method in class org.apache.tika.parser.external.ExternalParsersFactory
 
ATTRIBUTE_CONTENT_TYPE - Static variable in class org.apache.tika.parser.xml.FictionBookParser.BinaryElementsDataHandler
 
ATTRIBUTE_ID - Static variable in class org.apache.tika.parser.xml.FictionBookParser.BinaryElementsDataHandler
 
AttributeDependantMetadataHandler - Class in org.apache.tika.parser.xml
This adds a Metadata entry for a given node.
AttributeDependantMetadataHandler(Metadata, String, String) - Constructor for class org.apache.tika.parser.xml.AttributeDependantMetadataHandler
 
AttributeMatcher - Class in org.apache.tika.sax.xpath
Final evaluation state of a .../@* XPath expression.
AttributeMatcher() - Constructor for class org.apache.tika.sax.xpath.AttributeMatcher
 
AttributeMetadataHandler - Class in org.apache.tika.parser.xml
SAX event handler that maps the contents of an XML attribute into a metadata field.
AttributeMetadataHandler(String, String, Metadata, String) - Constructor for class org.apache.tika.parser.xml.AttributeMetadataHandler
 
AttributeMetadataHandler(String, String, Metadata, Property) - Constructor for class org.apache.tika.parser.xml.AttributeMetadataHandler
 
attributesMapping - Variable in class org.apache.tika.sax.ElementMappingContentHandler.TargetElement
 
audio(String) - Static method in class org.apache.tika.mime.MediaType
 
AUDIO_CHANNEL_TYPE - Static variable in interface org.apache.tika.metadata.XMPDM
"The audio channel type."
AUDIO_COMPRESSOR - Static variable in interface org.apache.tika.metadata.XMPDM
"The audio compression used.
AUDIO_MOD_DATE - Static variable in interface org.apache.tika.metadata.XMPDM
"The date and time when the audio was last modified."
AUDIO_SAMPLE_RATE - Static variable in interface org.apache.tika.metadata.XMPDM
"The audio sample rate.
AUDIO_SAMPLE_TYPE - Static variable in interface org.apache.tika.metadata.XMPDM
"The audio sample type."
AudioParser - Class in org.apache.tika.parser.audio
 
AudioParser() - Constructor for class org.apache.tika.parser.audio.AudioParser
 
AUTHOR - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
AUTHOR - Static variable in interface org.apache.tika.metadata.Office
Name of the principal author(s) of a document
AUTHORS_POSITION - Static variable in interface org.apache.tika.metadata.Photoshop
 
AUTO - Static variable in class org.apache.tika.sax.XHTMLContentHandler
The elements that are automatically emitted by lazyStartHead, so skip them if they get sent to startElement/endElement by mistake.
AutoDetectParser - Class in org.apache.tika.parser
 
AutoDetectParser() - Constructor for class org.apache.tika.parser.AutoDetectParser
Creates an auto-detecting parser instance using the default Tika configuration.
AutoDetectParser(Detector) - Constructor for class org.apache.tika.parser.AutoDetectParser
 
AutoDetectParser(Parser...) - Constructor for class org.apache.tika.parser.AutoDetectParser
Creates an auto-detecting parser instance using the specified set of parser.
AutoDetectParser(Detector, Parser...) - Constructor for class org.apache.tika.parser.AutoDetectParser
 
AutoDetectParser(TikaConfig) - Constructor for class org.apache.tika.parser.AutoDetectParser
 
AutoDetectReader - Class in org.apache.tika.detect
An input stream reader that automatically detects the character encoding to be used for converting bytes to characters.
AutoDetectReader(InputStream, Charset) - Constructor for class org.apache.tika.detect.AutoDetectReader
 
AutoDetectReader(BufferedInputStream, Metadata, List<EncodingDetector>, LoadErrorHandler) - Constructor for class org.apache.tika.detect.AutoDetectReader
 
AutoDetectReader(InputStream, Metadata, ServiceLoader) - Constructor for class org.apache.tika.detect.AutoDetectReader
 
AutoDetectReader(InputStream, Metadata) - Constructor for class org.apache.tika.detect.AutoDetectReader
 
AutoDetectReader(InputStream) - Constructor for class org.apache.tika.detect.AutoDetectReader
 
autoPageNumberFormat - Variable in class org.apache.tika.parser.iwork.PagesContentHandler.HeaderFooter
 
AutoPageNumberUtils - Class in org.apache.tika.parser.iwork
Utility class to allow for conversion from an integer to Roman numerals or alpha-numeric symbols in line with Pages auto numbering formats.
AutoPageNumberUtils() - Constructor for class org.apache.tika.parser.iwork.AutoPageNumberUtils
 
available() - Method in class org.apache.tika.io.LookaheadInputStream
 
available() - Method in class org.apache.tika.io.NullInputStream
Return the number of bytes that can be read.
available() - Method in class org.apache.tika.io.ProxyInputStream
Invokes the delegate's available() method.

B

b - Variable in class org.apache.tika.sax.xpath.CompositeMatcher
 
BasicContentHandlerFactory - Class in org.apache.tika.sax
Basic factory for creating common types of ContentHandlers
BasicContentHandlerFactory(BasicContentHandlerFactory.HANDLER_TYPE, int) - Constructor for class org.apache.tika.sax.BasicContentHandlerFactory
 
BasicContentHandlerFactory.HANDLER_TYPE - Enum in org.apache.tika.sax
Common handler types for content.
beforeRead(int) - Method in class org.apache.tika.io.ProxyInputStream
Invoked by the read methods before the call is proxied.
binaryData - Variable in class org.apache.tika.parser.xml.FictionBookParser.BinaryElementsDataHandler
 
BinaryElementsDataHandler(EmbeddedDocumentExtractor, ContentHandler) - Constructor for class org.apache.tika.parser.xml.FictionBookParser.BinaryElementsDataHandler
 
binaryMode - Variable in class org.apache.tika.parser.xml.FictionBookParser.BinaryElementsDataHandler
 
BITS_PER_SAMPLE - Static variable in interface org.apache.tika.metadata.TIFF
"Number of bits per component in each channel."
BLOB_SIZE_THRESHOLD - Static variable in class org.apache.tika.io.TikaInputStream
Blob size threshold that limits the largest BLOB size to be buffered fully in memory by the TikaInputStream.get(Blob, Metadata) method.
BodyContentHandler - Class in org.apache.tika.sax
Content handler decorator that only passes everything inside the XHTML <body/> tag to the underlying handler.
BodyContentHandler(ContentHandler) - Constructor for class org.apache.tika.sax.BodyContentHandler
Creates a content handler that passes all XHTML body events to the given underlying content handler.
BodyContentHandler(Writer) - Constructor for class org.apache.tika.sax.BodyContentHandler
Creates a content handler that writes XHTML body character events to the given writer.
BodyContentHandler(OutputStream) - Constructor for class org.apache.tika.sax.BodyContentHandler
Creates a content handler that writes XHTML body character events to the given output stream using the default encoding.
BodyContentHandler(int) - Constructor for class org.apache.tika.sax.BodyContentHandler
Creates a content handler that writes XHTML body character events to an internal string buffer.
BodyContentHandler() - Constructor for class org.apache.tika.sax.BodyContentHandler
Creates a content handler that writes XHTML body character events to an internal string buffer.
BS - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
BUF_SIZE - Static variable in class org.apache.tika.parser.strings.Latin1StringsParser
The size of the internal buffers.
buffer - Variable in class org.apache.tika.io.LookaheadInputStream
 
buffer - Variable in class org.apache.tika.language.ProfilingWriter
 
buffer - Variable in class org.apache.tika.parser.xml.AttributeDependantMetadataHandler
 
buffer - Variable in class org.apache.tika.parser.xml.MetadataHandler
Deprecated.
 
bufferBagged - Variable in class org.apache.tika.parser.xml.ElementMetadataHandler
The buffer used to capture characters when inside a bag li element.
bufferBagless - Variable in class org.apache.tika.parser.xml.ElementMetadataHandler
The buffer used to capture characters inside standard elements.
buffered - Variable in class org.apache.tika.io.LookaheadInputStream
 
bufferIsInFile - Variable in class org.apache.tika.utils.RereadableInputStream
Whether or not the stream's contents are being stored in a file as opposed to memory.
BufferUnderrunException() - Constructor for exception org.apache.tika.io.EndianUtils.BufferUnderrunException
 
build() - Method in class org.apache.tika.detect.NNTrainedModelBuilder
 
build2() - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
Initialize the MimeTypes with this builder instance
Builder() - Constructor for class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
 
builder - Variable in class org.apache.tika.sax.SafeContentHandler.StringOutput
 
builderStack - Variable in class org.apache.tika.sax.LinkContentHandler
Stack of link builders, one for each level of nested links currently being processed.
bundleContext - Variable in class org.apache.tika.config.TikaActivator
 
byteBuffer - Variable in class org.apache.tika.utils.RereadableInputStream
The buffer used to store the stream's content; this storage is moved to a file when the stored data's size exceeds maxBytesInMemory.
bytesRead - Variable in class org.apache.tika.io.TailStream
The number of bytes that have been read so far.
bytesToTest - Variable in class org.apache.tika.detect.TextDetector
 

C

call(String, Object...) - Method in class org.apache.tika.fork.ForkClient
 
CALL - Static variable in class org.apache.tika.fork.ForkServer
 
call(ClassLoader, Object) - Method in class org.apache.tika.fork.ForkServer
 
CAN_MODIFY - Static variable in interface org.apache.tika.metadata.AccessPermissions
Can any modifications be made to the document
CAN_MODIFY_ANNOTATIONS - Static variable in interface org.apache.tika.metadata.AccessPermissions
Can the user modify annotations
CAN_PRINT - Static variable in interface org.apache.tika.metadata.AccessPermissions
Can the user print the document
CAN_PRINT_DEGRADED - Static variable in interface org.apache.tika.metadata.AccessPermissions
Can the user print an image-degraded version of the document.
CAPTION_WRITER - Static variable in interface org.apache.tika.metadata.Photoshop
 
cast(InputStream) - Static method in class org.apache.tika.io.TikaInputStream
Returns the given stream casts to a TikaInputStream, or null if the stream is not a TikaInputStream.
catchEmbeddedExceptions - Variable in class org.apache.tika.parser.RecursiveParserWrapper
 
CATEGORY - Static variable in interface org.apache.tika.metadata.IPTC
Deprecated. 
CATEGORY - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
CATEGORY - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLCore
A categorization of the content of this package.
CATEGORY - Static variable in interface org.apache.tika.metadata.Photoshop
 
CERTAINTY_LIMIT - Static variable in class org.apache.tika.language.LanguageIdentifier
 
CERTIFICATE - Static variable in interface org.apache.tika.metadata.XMPRights
A Web URL for a rights management certificate.
changeRate - Variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
 
ChannelTypePropertyConverter() - Constructor for class org.apache.tika.metadata.XMPDM.ChannelTypePropertyConverter
Deprecated.
 
CHARACTER_COUNT - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
CHARACTER_COUNT - Static variable in interface org.apache.tika.metadata.Office
The number of Characters in the document
CHARACTER_COUNT_WITH_SPACES - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
CHARACTER_COUNT_WITH_SPACES - Static variable in interface org.apache.tika.metadata.Office
The number of Characters in the document, including spaces
characterCount - Variable in class org.apache.tika.sax.SecureContentHandler
Number of output characters that Tika has produced so far.
CHARACTERS - Static variable in class org.apache.tika.fork.ContentHandlerProxy
 
characters(char[], int, int) - Method in class org.apache.tika.fork.ContentHandlerProxy
 
characters - Variable in class org.apache.tika.mime.MimeTypesReader
 
characters(char[], int, int) - Method in class org.apache.tika.mime.MimeTypesReader
 
characters(char[], int, int) - Method in class org.apache.tika.parser.iwork.KeynoteContentHandler
 
characters(char[], int, int) - Method in class org.apache.tika.parser.iwork.NumbersContentHandler
 
characters(char[], int, int) - Method in class org.apache.tika.parser.iwork.PagesContentHandler
 
characters(char[], int, int) - Method in class org.apache.tika.parser.xml.AttributeDependantMetadataHandler
 
characters(char[], int, int) - Method in class org.apache.tika.parser.xml.ElementMetadataHandler
 
characters(char[], int, int) - Method in class org.apache.tika.parser.xml.FictionBookParser.BinaryElementsDataHandler
 
characters(char[], int, int) - Method in class org.apache.tika.parser.xml.MetadataHandler
Deprecated.
 
characters(char[], int, int) - Method in class org.apache.tika.sax.ContentHandlerDecorator
 
characters(char[], int, int) - Method in class org.apache.tika.sax.DIFContentHandler
 
characters(char[], int, int) - Method in class org.apache.tika.sax.ExpandedTitleContentHandler
 
characters(char[], int, int) - Method in class org.apache.tika.sax.LinkBuilder
 
characters(char[], int, int) - Method in class org.apache.tika.sax.LinkContentHandler
 
characters(char[], int, int) - Method in class org.apache.tika.sax.PhoneExtractingContentHandler
The characters method is called whenever a Parser wants to pass raw...
characters(char[], int, int) - Method in class org.apache.tika.sax.SafeContentHandler
 
characters(char[], int, int) - Method in class org.apache.tika.sax.SecureContentHandler
 
characters(char[], int, int) - Method in class org.apache.tika.sax.TeeContentHandler
 
characters(char[], int, int) - Method in class org.apache.tika.sax.TextContentHandler
 
characters(char[], int, int) - Method in class org.apache.tika.sax.ToTextContentHandler
Writes the given characters to the given character stream.
characters(char[], int, int) - Method in class org.apache.tika.sax.ToXMLContentHandler
 
characters(char[], int, int) - Method in class org.apache.tika.sax.WriteOutContentHandler
Writes the given characters to the given character stream.
characters(char[], int, int) - Method in class org.apache.tika.sax.XHTMLContentHandler
 
characters(String) - Method in class org.apache.tika.sax.XHTMLContentHandler
 
characters(char[], int, int) - Method in class org.apache.tika.sax.xpath.MatchingContentHandler
 
charactersOutput - Variable in class org.apache.tika.sax.SafeContentHandler
Output through the ContentHandler.characters(char[], int, int) method of the decorated content handler.
charAt(int) - Method in class org.apache.tika.language.LanguageProfilerBuilder.QuickStringBuffer
 
charset - Variable in class org.apache.tika.detect.AutoDetectReader
 
CHARSET_FIRST_PATTERN - Static variable in class org.apache.tika.mime.MediaType
 
CHARSET_NAME_PATTERN - Static variable in class org.apache.tika.utils.CharsetUtils
 
CharsetUtils - Class in org.apache.tika.utils
 
CharsetUtils() - Constructor for class org.apache.tika.utils.CharsetUtils
 
check(String, int...) - Static method in class org.apache.tika.embedder.ExternalEmbedder
Checks to see if the command can be run.
check(String[], int...) - Static method in class org.apache.tika.embedder.ExternalEmbedder
Checks to see if the command can be run.
check(String, int...) - Static method in class org.apache.tika.parser.external.ExternalParser
Checks to see if the command can be run.
check(String[], int...) - Static method in class org.apache.tika.parser.external.ExternalParser
 
CHECK_TAG - Static variable in interface org.apache.tika.parser.external.ExternalParsersConfigReaderMetKeys
 
checkSignature(DataInputStream) - Method in class org.apache.tika.parser.video.FLVParser
 
ChildMatcher - Class in org.apache.tika.sax.xpath
Intermediate evaluation state of a .../*... XPath expression.
ChildMatcher(Matcher) - Constructor for class org.apache.tika.sax.xpath.ChildMatcher
 
choices - Variable in class org.apache.tika.metadata.Property
The available choices for the open and closed choice value types.
CITY - Static variable in interface org.apache.tika.metadata.IPTC
Name of the city the content is focussing on -- either the place shown in visual media or referenced by text or audio media.
CITY - Static variable in interface org.apache.tika.metadata.Photoshop
 
CLASSLOADER_SPECIFIC_DEFAULT_TYPES - Static variable in class org.apache.tika.mime.MimeTypes
 
ClassLoaderProxy - Class in org.apache.tika.fork
 
ClassLoaderProxy(int) - Constructor for class org.apache.tika.fork.ClassLoaderProxy
 
ClassLoaderResource - Class in org.apache.tika.fork
 
ClassLoaderResource(ClassLoader) - Constructor for class org.apache.tika.fork.ClassLoaderResource
 
Clause - Interface in org.apache.tika.mime
Defines a clause to be evaluated.
clause - Variable in class org.apache.tika.mime.Magic
 
clause - Variable in class org.apache.tika.mime.MimeTypesReader.ClauseRecord
 
ClauseRecord(Clause) - Constructor for class org.apache.tika.mime.MimeTypesReader.ClauseRecord
 
clauses - Variable in class org.apache.tika.mime.AndClause
 
clauses - Variable in class org.apache.tika.mime.OrClause
 
clean(String) - Method in class org.apache.tika.parser.iptc.IptcAnpaParser
 
clean(String) - Static method in class org.apache.tika.sax.CleanPhoneText
 
clean(String) - Static method in class org.apache.tika.utils.CharsetUtils
Handle various common charset name errors, and return something that will be considered valid (and is normalized)
cleanPhoneRegex - Static variable in class org.apache.tika.sax.CleanPhoneText
 
CleanPhoneText - Class in org.apache.tika.sax
Class to help de-obfuscate phone numbers in text.
CleanPhoneText() - Constructor for class org.apache.tika.sax.CleanPhoneText
 
cleanSubstitutions - Static variable in class org.apache.tika.sax.CleanPhoneText
 
clear() - Method in class org.apache.tika.language.LanguageProfilerBuilder.QuickStringBuffer
 
clearProfiles() - Static method in class org.apache.tika.language.LanguageIdentifier
Clears the current map of language profiles
ClimateForcast - Interface in org.apache.tika.metadata
Met keys from NCAR CCSM files in the Climate Forecast Convention.
close() - Method in class org.apache.tika.fork.ForkClient
 
close() - Method in class org.apache.tika.fork.ForkParser
 
close() - Method in class org.apache.tika.io.CloseShieldInputStream
Replaces the underlying input stream with a ClosedInputStream sentinel.
close() - Method in class org.apache.tika.io.LookaheadInputStream
 
close() - Method in class org.apache.tika.io.NullInputStream
Close this input stream - resets the internal state to the initial values.
close() - Method in class org.apache.tika.io.ProxyInputStream
Invokes the delegate's close() method.
close() - Method in class org.apache.tika.io.TemporaryResources
Closes all tracked resources.
close() - Method in class org.apache.tika.io.TikaInputStream
 
close() - Method in class org.apache.tika.language.ProfilingWriter
 
close() - Method in class org.apache.tika.parser.ParsingReader
Closes the read end of the pipe.
close() - Method in class org.apache.tika.utils.RereadableInputStream
Closes the input stream and removes the temporary file if one was created.
ClosedInputStream - Class in org.apache.tika.io
Closed input stream.
ClosedInputStream() - Constructor for class org.apache.tika.io.ClosedInputStream
 
closeOriginalStreamOnClose - Variable in class org.apache.tika.utils.RereadableInputStream
Specifies whether or not to close the original input stream when close() is called.
closeQuietly(Reader) - Static method in class org.apache.tika.io.IOUtils
Unconditionally close an Reader.
closeQuietly(Channel) - Static method in class org.apache.tika.io.IOUtils
Unconditionally close a Channel.
closeQuietly(Writer) - Static method in class org.apache.tika.io.IOUtils
Unconditionally close a Writer.
closeQuietly(InputStream) - Static method in class org.apache.tika.io.IOUtils
Unconditionally close an InputStream.
closeQuietly(OutputStream) - Static method in class org.apache.tika.io.IOUtils
Unconditionally close an OutputStream.
CloseShieldInputStream - Class in org.apache.tika.io
Proxy stream that prevents the underlying input stream from being closed.
CloseShieldInputStream(InputStream) - Constructor for class org.apache.tika.io.CloseShieldInputStream
Creates a proxy that shields the given input stream from being closed.
closeStream() - Method in class org.apache.tika.utils.RereadableInputStream
Closes the input stream currently used for reading (may either be the original stream or a memory or file stream after the first pass).
collapseWhitespaceInAnchor - Variable in class org.apache.tika.sax.LinkContentHandler
Whether to collapse whitespace in anchor text
collectServiceClassNames(URL, Collection<String>) - Method in class org.apache.tika.config.ServiceLoader
 
COLOR_MODE - Static variable in interface org.apache.tika.metadata.Photoshop
 
COLUMN_COUNT - Static variable in interface org.apache.tika.metadata.Database
 
COLUMN_NAME - Static variable in interface org.apache.tika.metadata.Database
 
command - Variable in class org.apache.tika.embedder.ExternalEmbedder
The external command to invoke.
command - Variable in class org.apache.tika.parser.external.ExternalParser
The external command to invoke.
command - Variable in class org.apache.tika.parser.gdal.GDALParser
 
COMMAND_LINE - Static variable in interface org.apache.tika.metadata.ClimateForcast
 
COMMAND_TAG - Static variable in interface org.apache.tika.parser.external.ExternalParsersConfigReaderMetKeys
 
commandAppendOperator - Variable in class org.apache.tika.embedder.ExternalEmbedder
 
commandAssignmentDelimeter - Variable in class org.apache.tika.embedder.ExternalEmbedder
 
commandAssignmentOperator - Variable in class org.apache.tika.embedder.ExternalEmbedder
 
COMMENT - Static variable in class org.apache.tika.config.ServiceLoader
 
COMMENT - Static variable in interface org.apache.tika.metadata.ClimateForcast
 
COMMENT_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
 
COMMENTS - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
COMMENTS - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
 
COMMENTS - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 
COMMON_CHARSETS - Static variable in class org.apache.tika.utils.CharsetUtils
 
CommonsDigester - Class in org.apache.tika.parser.utils
Implementation of DigestingParser.Digester that relies on commons.codec.digest.DigestUtils to calculate digest hashes.
CommonsDigester(int, CommonsDigester.DigestAlgorithm...) - Constructor for class org.apache.tika.parser.utils.CommonsDigester
 
CommonsDigester.DigestAlgorithm - Enum in org.apache.tika.parser.utils
 
CommonsDigester.SimpleBoundedInputStream - Class in org.apache.tika.parser.utils
Very slight modification of Commons' BoundedInputStream so that we can figure out if this hit the bound or not.
COMPANY - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
COMPANY - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
 
compare(String, String) - Method in class org.apache.tika.metadata.serialization.PrettyMetadataKeyComparator
 
compare(String, String) - Method in class org.apache.tika.mime.Patterns.LengthComparator
 
compareTo(ServiceLoader.RankedService) - Method in class org.apache.tika.config.ServiceLoader.RankedService
 
compareTo(LanguageProfile.Interleaved.Entry) - Method in class org.apache.tika.language.LanguageProfile.Interleaved.Entry
 
compareTo(LanguageProfilerBuilder.NGramEntry) - Method in class org.apache.tika.language.LanguageProfilerBuilder.NGramEntry
 
compareTo(Property) - Method in class org.apache.tika.metadata.Property
 
compareTo(Magic) - Method in class org.apache.tika.mime.Magic
 
compareTo(MediaType) - Method in class org.apache.tika.mime.MediaType
 
compareTo(MimeType) - Method in class org.apache.tika.mime.MimeType
 
COMPILATION - Static variable in interface org.apache.tika.metadata.XMPDM
"An album created by various artists."
compile(String) - Method in class org.apache.tika.mime.Patterns
 
COMPOSER - Static variable in interface org.apache.tika.metadata.XMPDM
"The composer's name."
composite(Property, Property[]) - Static method in class org.apache.tika.metadata.Property
Constructs a new composite property from the given primary and array of secondary properties.
CompositeDetector - Class in org.apache.tika.detect
Content type detector that combines multiple different detection mechanisms.
CompositeDetector(MediaTypeRegistry, List<Detector>, Collection<Class<? extends Detector>>) - Constructor for class org.apache.tika.detect.CompositeDetector
 
CompositeDetector(MediaTypeRegistry, List<Detector>) - Constructor for class org.apache.tika.detect.CompositeDetector
 
CompositeDetector(List<Detector>) - Constructor for class org.apache.tika.detect.CompositeDetector
 
CompositeDetector(Detector...) - Constructor for class org.apache.tika.detect.CompositeDetector
 
CompositeExternalParser - Class in org.apache.tika.parser.external
A Composite Parser that wraps up all the available External Parsers, and provides an easy way to access them.
CompositeExternalParser() - Constructor for class org.apache.tika.parser.external.CompositeExternalParser
 
CompositeExternalParser(MediaTypeRegistry) - Constructor for class org.apache.tika.parser.external.CompositeExternalParser
 
CompositeMatcher - Class in org.apache.tika.sax.xpath
Composite XPath evaluation state.
CompositeMatcher(Matcher, Matcher) - Constructor for class org.apache.tika.sax.xpath.CompositeMatcher
 
CompositeParser - Class in org.apache.tika.parser
Composite parser that delegates parsing tasks to a component parser based on the declared content type of the incoming document.
CompositeParser(MediaTypeRegistry, List<Parser>, Collection<Class<? extends Parser>>) - Constructor for class org.apache.tika.parser.CompositeParser
 
CompositeParser(MediaTypeRegistry, List<Parser>) - Constructor for class org.apache.tika.parser.CompositeParser
 
CompositeParser(MediaTypeRegistry, Parser...) - Constructor for class org.apache.tika.parser.CompositeParser
 
CompositeParser() - Constructor for class org.apache.tika.parser.CompositeParser
 
ConcurrentUtils - Class in org.apache.tika.utils
Utility Class for Concurrency in Tika
ConcurrentUtils() - Constructor for class org.apache.tika.utils.ConcurrentUtils
 
ConfigurableThreadPoolExecutor - Interface in org.apache.tika.concurrent
Allows Thread Pool to be Configurable.
connect() - Method in class org.apache.tika.fork.MemoryURLConnection
 
consumeErrorStream() - Method in class org.apache.tika.fork.ForkClient
Consumes all pending bytes from the standard error stream of the forked server process, and prints them out to the standard error stream of this process.
CONTACT - Static variable in interface org.apache.tika.metadata.ClimateForcast
 
CONTACT_INFO_ADDRESS - Static variable in interface org.apache.tika.metadata.IPTC
The contact information address part.
CONTACT_INFO_CITY - Static variable in interface org.apache.tika.metadata.IPTC
The contact information city part.
CONTACT_INFO_COUNTRY - Static variable in interface org.apache.tika.metadata.IPTC
The contact information country part.
CONTACT_INFO_EMAIL - Static variable in interface org.apache.tika.metadata.IPTC
The contact information email address part.
CONTACT_INFO_PHONE - Static variable in interface org.apache.tika.metadata.IPTC
The contact information phone number part.
CONTACT_INFO_POSTAL_CODE - Static variable in interface org.apache.tika.metadata.IPTC
The contact information part denoting the local postal code.
CONTACT_INFO_STATE_PROVINCE - Static variable in interface org.apache.tika.metadata.IPTC
The contact information part denoting regional information such as state or province.
CONTACT_INFO_WEB_URL - Static variable in interface org.apache.tika.metadata.IPTC
The contact information web address part.
ContainerExtractor - Interface in org.apache.tika.extractor
Tika container extractor interface.
content - Variable in class org.apache.tika.parser.epub.EpubParser
 
CONTENT_DISPOSITION - Static variable in interface org.apache.tika.metadata.HttpHeaders
 
CONTENT_ENCODING - Static variable in interface org.apache.tika.metadata.HttpHeaders
 
CONTENT_LANGUAGE - Static variable in interface org.apache.tika.metadata.HttpHeaders
 
CONTENT_LENGTH - Static variable in interface org.apache.tika.metadata.HttpHeaders
 
CONTENT_LOCATION - Static variable in interface org.apache.tika.metadata.HttpHeaders
 
CONTENT_MD5 - Static variable in interface org.apache.tika.metadata.HttpHeaders
 
CONTENT_STATUS - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
CONTENT_STATUS - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLCore
The status of the content.
CONTENT_TYPE - Static variable in interface org.apache.tika.metadata.HttpHeaders
 
CONTENT_TYPE_HINT - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
This is currently used to identify Content-Type that may be included within a document, such as in html documents (e.g.
contentEquals(InputStream, InputStream) - Static method in class org.apache.tika.io.IOUtils
Compare the contents of two Streams to determine if they are equal or not.
contentEquals(Reader, Reader) - Static method in class org.apache.tika.io.IOUtils
Compare the contents of two Readers to determine if they are equal or not.
ContentHandlerDecorator - Class in org.apache.tika.sax
Decorator base class for the ContentHandler interface.
ContentHandlerDecorator(ContentHandler) - Constructor for class org.apache.tika.sax.ContentHandlerDecorator
Creates a decorator for the given SAX event handler.
ContentHandlerDecorator() - Constructor for class org.apache.tika.sax.ContentHandlerDecorator
Creates a decorator that by default forwards incoming SAX events to a dummy content handler that simply ignores all the events.
contentHandlerFactory - Variable in class org.apache.tika.parser.RecursiveParserWrapper
 
ContentHandlerFactory - Interface in org.apache.tika.sax
Interface to allow easier injection of code for getting a new ContentHandler
ContentHandlerProxy - Class in org.apache.tika.fork
 
ContentHandlerProxy(int) - Constructor for class org.apache.tika.fork.ContentHandlerProxy
 
ContentHandlerResource - Class in org.apache.tika.fork
 
ContentHandlerResource(ContentHandler) - Constructor for class org.apache.tika.fork.ContentHandlerResource
 
context - Variable in class org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor
 
context - Variable in class org.apache.tika.parser.ParseContext
Map of objects in this context
context - Variable in class org.apache.tika.parser.ParsingReader
The parse context.
contextClassLoader - Static variable in class org.apache.tika.config.ServiceLoader
The default context class loader to use for all threads, or null to automatically select the context class loader.
CONTRIBUTOR - Static variable in interface org.apache.tika.metadata.DublinCore
An entity responsible for making contributions to the content of the resource.
CONTRIBUTOR - Static variable in class org.apache.tika.metadata.Metadata
Deprecated.
use TikaCoreProperties#CONTRIBUTOR
CONTRIBUTOR - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 
CONTROLLED_VOCABULARY_TERM - Static variable in interface org.apache.tika.metadata.IPTC
A term to describe the content of the image by a value from a Controlled Vocabulary.
CONVENTIONS - Static variable in interface org.apache.tika.metadata.ClimateForcast
 
convert(Object) - Static method in class org.apache.tika.metadata.XMPDM.ChannelTypePropertyConverter
Deprecated.
How a standalone converter might work
convertAndSet(Metadata, Object) - Static method in class org.apache.tika.metadata.XMPDM.ChannelTypePropertyConverter
Deprecated.
How convert+set might work
copy(InputStream, OutputStream) - Static method in class org.apache.tika.io.IOUtils
Copy bytes from an InputStream to an OutputStream.
copy(InputStream, Writer) - Static method in class org.apache.tika.io.IOUtils
Copy bytes from an InputStream to chars on a Writer using the default character encoding of the platform.
copy(InputStream, Writer, String) - Static method in class org.apache.tika.io.IOUtils
Copy bytes from an InputStream to chars on a Writer using the specified character encoding.
copy(Reader, Writer) - Static method in class org.apache.tika.io.IOUtils
Copy chars from a Reader to a Writer.
copy(Reader, OutputStream) - Static method in class org.apache.tika.io.IOUtils
Copy chars from a Reader to bytes on an OutputStream using the default character encoding of the platform, and calling flush.
copy(Reader, OutputStream, String) - Static method in class org.apache.tika.io.IOUtils
Copy chars from a Reader to bytes on an OutputStream using the specified character encoding, and calling flush.
copyLarge(InputStream, OutputStream) - Static method in class org.apache.tika.io.IOUtils
Copy bytes from a large (over 2GB) InputStream to an OutputStream.
copyLarge(Reader, Writer) - Static method in class org.apache.tika.io.IOUtils
Copy chars from a large (over 2GB) Reader to a Writer.
COPYRIGHT - Static variable in interface org.apache.tika.metadata.XMPDM
"The copyright information."
COPYRIGHT_NOTICE - Static variable in interface org.apache.tika.metadata.IPTC
Contains any necessary copyright notice for claiming the intellectual property for this item and should identify the current owner of the copyright for the item.
COPYRIGHT_OWNER - Static variable in interface org.apache.tika.metadata.IPTC
Owner or owners of the copyright in the licensed image.
COPYRIGHT_OWNER_ID - Static variable in interface org.apache.tika.metadata.IPTC
The ID of the owner or owners of the copyright in the licensed image.
COPYRIGHT_OWNER_ID_WRONG_CASE - Static variable in interface org.apache.tika.metadata.IPTC
Deprecated.
COPYRIGHT_OWNER_NAME - Static variable in interface org.apache.tika.metadata.IPTC
The name of the owner or owners of the copyright in the licensed image.
copyToTailBuffer(byte[], int, int) - Method in class org.apache.tika.io.TailStream
Copies the given buffer into the internal tail buffer at the current position.
count() - Method in class org.apache.tika.detect.TextStatistics
Returns the total number of bytes seen so far.
count(int) - Method in class org.apache.tika.detect.TextStatistics
Returns the number of occurrences of the given byte.
count(int, int) - Method in class org.apache.tika.detect.TextStatistics
 
count - Variable in class org.apache.tika.io.CountingInputStream
The count of bytes that have passed.
count - Variable in class org.apache.tika.language.LanguageProfile
The sum of all ngram counts in this profile.
count - Variable in class org.apache.tika.language.LanguageProfile.Counter
 
count - Variable in class org.apache.tika.language.LanguageProfile.Interleaved.Entry
 
count - Variable in class org.apache.tika.language.LanguageProfilerBuilder.NGramEntry
The number of occurences of this ngram in its profile
count - Variable in class org.apache.tika.language.LanguageProfilerBuilder.QuickStringBuffer
 
countControl() - Method in class org.apache.tika.detect.TextStatistics
Counts control characters (i.e.
countEightBit() - Method in class org.apache.tika.detect.TextStatistics
Counts eight bit characters, i.e.
counter - Static variable in class org.apache.tika.fork.MemoryURLStreamHandler
 
Counter() - Constructor for class org.apache.tika.language.LanguageProfile.Counter
 
CountingInputStream - Class in org.apache.tika.io
A decorating input stream that counts the number of bytes that have passed through the stream so far.
CountingInputStream(InputStream) - Constructor for class org.apache.tika.io.CountingInputStream
Constructs a new CountingInputStream.
COUNTRY - Static variable in interface org.apache.tika.metadata.IPTC
Full name of the country the content is focussing on -- either the country shown in visual media or referenced in text or audio media.
COUNTRY - Static variable in interface org.apache.tika.metadata.Photoshop
 
COUNTRY_CODE - Static variable in interface org.apache.tika.metadata.IPTC
Code of the country the content is focussing on -- either the country shown in visual media or referenced in text or audio media.
counts - Variable in class org.apache.tika.detect.TextStatistics
 
countSafeAscii() - Method in class org.apache.tika.detect.TextStatistics
Counts "safe" (i.e.
countSafeControl() - Method in class org.apache.tika.detect.TextStatistics
 
COVERAGE - Static variable in interface org.apache.tika.metadata.DublinCore
The extent or scope of the content of the resource.
COVERAGE - Static variable in class org.apache.tika.metadata.Metadata
Deprecated.
use TikaCoreProperties#COVERAGE
COVERAGE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 
CP_NAME_PATTERN - Static variable in class org.apache.tika.utils.CharsetUtils
 
CR - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
create(String, InputStream, String) - Static method in class org.apache.tika.language.LanguageProfilerBuilder
Creates a new Language profile from (preferably quite large - 5-10k of lines) text file
create() - Static method in class org.apache.tika.mime.MimeTypesFactory
Creates an empty instance; same as calling new MimeTypes().
create(Document) - Static method in class org.apache.tika.mime.MimeTypesFactory
Creates and returns a MimeTypes instance from the specified document.
create(InputStream...) - Static method in class org.apache.tika.mime.MimeTypesFactory
Creates and returns a MimeTypes instance from the specified input stream.
create(InputStream) - Static method in class org.apache.tika.mime.MimeTypesFactory
 
create(URL...) - Static method in class org.apache.tika.mime.MimeTypesFactory
Creates and returns a MimeTypes instance from the resource at the location specified by the URL.
create(URL) - Static method in class org.apache.tika.mime.MimeTypesFactory
 
create(String) - Static method in class org.apache.tika.mime.MimeTypesFactory
Creates and returns a MimeTypes instance from the specified file path, as interpreted by the class loader in getResource().
create(String, String) - Static method in class org.apache.tika.mime.MimeTypesFactory
Creates and returns a MimeTypes instance.
create(String, String, ClassLoader) - Static method in class org.apache.tika.mime.MimeTypesFactory
Creates and returns a MimeTypes instance.
create() - Static method in class org.apache.tika.parser.external.ExternalParsersFactory
 
create(ServiceLoader) - Static method in class org.apache.tika.parser.external.ExternalParsersFactory
 
create(String, ServiceLoader) - Static method in class org.apache.tika.parser.external.ExternalParsersFactory
 
create(URL...) - Static method in class org.apache.tika.parser.external.ExternalParsersFactory
 
CREATE_DATE - Static variable in interface org.apache.tika.metadata.XMP
The date and time the resource was created.
createBootstrapJar() - Static method in class org.apache.tika.fork.ForkClient
Creates a temporary jar file that can be used to bootstrap the forked server process.
createComposite(List<Detector>, MimeTypes, ServiceLoader) - Method in class org.apache.tika.config.TikaConfig.DetectorXmlLoader
 
createComposite(Class<? extends Detector>, List<Detector>, Set<Class<? extends Detector>>, MimeTypes, ServiceLoader) - Method in class org.apache.tika.config.TikaConfig.DetectorXmlLoader
 
createComposite(Class<? extends ConfigurableThreadPoolExecutor>, List<ConfigurableThreadPoolExecutor>, Set<Class<? extends ConfigurableThreadPoolExecutor>>, MimeTypes, ServiceLoader) - Method in class org.apache.tika.config.TikaConfig.ExecutorServiceXmlLoader
 
createComposite(List<ConfigurableThreadPoolExecutor>, MimeTypes, ServiceLoader) - Method in class org.apache.tika.config.TikaConfig.ExecutorServiceXmlLoader
 
createComposite(List<Parser>, MimeTypes, ServiceLoader) - Method in class org.apache.tika.config.TikaConfig.ParserXmlLoader
 
createComposite(Class<? extends Parser>, List<Parser>, Set<Class<? extends Parser>>, MimeTypes, ServiceLoader) - Method in class org.apache.tika.config.TikaConfig.ParserXmlLoader
 
createComposite(List<Translator>, MimeTypes, ServiceLoader) - Method in class org.apache.tika.config.TikaConfig.TranslatorXmlLoader
 
createComposite(Class<? extends Translator>, List<Translator>, Set<Class<? extends Translator>>, MimeTypes, ServiceLoader) - Method in class org.apache.tika.config.TikaConfig.TranslatorXmlLoader
 
createComposite(List<T>, MimeTypes, ServiceLoader) - Method in class org.apache.tika.config.TikaConfig.XmlLoader
 
createComposite(Class<? extends T>, List<T>, Set<Class<? extends T>>, MimeTypes, ServiceLoader) - Method in class org.apache.tika.config.TikaConfig.XmlLoader
 
CREATED - Static variable in interface org.apache.tika.metadata.DublinCore
Date of creation of the resource.
CREATED - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 
createDateFormat(String, TimeZone) - Static method in class org.apache.tika.metadata.Metadata
 
createDefault(MimeTypes, ServiceLoader) - Method in class org.apache.tika.config.TikaConfig.DetectorXmlLoader
 
createDefault(MimeTypes, ServiceLoader) - Method in class org.apache.tika.config.TikaConfig.ExecutorServiceXmlLoader
 
createDefault(MimeTypes, ServiceLoader) - Method in class org.apache.tika.config.TikaConfig.ParserXmlLoader
 
createDefault(MimeTypes, ServiceLoader) - Method in class org.apache.tika.config.TikaConfig.TranslatorXmlLoader
 
createDefault(MimeTypes, ServiceLoader) - Method in class org.apache.tika.config.TikaConfig.XmlLoader
 
createTempFile() - Method in class org.apache.tika.io.TemporaryResources
Creates a temporary file that will automatically be deleted when the TemporaryResources.close() method is called, returning its path.
createTemporaryFile() - Method in class org.apache.tika.io.TemporaryResources
Creates and returns a temporary file that will automatically be deleted when the TemporaryResources.close() method is called.
createURL(byte[]) - Static method in class org.apache.tika.fork.MemoryURLStreamHandler
 
createURLStreamHandler(String) - Method in class org.apache.tika.fork.MemoryURLStreamHandlerFactory
 
CREATION_DATE - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
CREATION_DATE - Static variable in interface org.apache.tika.metadata.Office
When was the document created?
CreativeCommons - Interface in org.apache.tika.metadata
A collection of Creative Commons properties names.
CREATOR - Static variable in interface org.apache.tika.metadata.DublinCore
An entity primarily responsible for making the content of the resource.
CREATOR - Static variable in interface org.apache.tika.metadata.IPTC
Contains the name of the person who created the content of this item, a photographer for photos, a graphic artist for graphics, or a writer for textual news, but in cases where the photographer should not be identified the name of a company or organisation may be appropriate.
CREATOR - Static variable in class org.apache.tika.metadata.Metadata
Deprecated.
use TikaCoreProperties#CREATOR
CREATOR - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 
CREATOR_TOOL - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 
CREATOR_TOOL - Static variable in interface org.apache.tika.metadata.XMP
The name of the first known tool used to create the resource.
CREATORS_CONTACT_INFO - Static variable in interface org.apache.tika.metadata.IPTC
The creator's contact information provides all necessary information to get in contact with the creator of this item and comprises a set of sub-properties for proper addressing.
CREATORS_JOB_TITLE - Static variable in interface org.apache.tika.metadata.IPTC
Contains the job title of the person who created the content of this item.
CREDIT - Static variable in interface org.apache.tika.metadata.Photoshop
 
CREDIT_LINE - Static variable in interface org.apache.tika.metadata.IPTC
The credit to person(s) and/or organisation(s) required by the supplier of the item to be used when published.
CryptoParser - Class in org.apache.tika.parser
Decrypts the incoming document stream and delegates further parsing to another parser instance.
CryptoParser(String, Provider, Set<MediaType>) - Constructor for class org.apache.tika.parser.CryptoParser
 
CryptoParser(String, Set<MediaType>) - Constructor for class org.apache.tika.parser.CryptoParser
 
CT - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
current - Variable in class org.apache.tika.mime.MimeTypesReader
 
currentColumn - Variable in class org.apache.tika.parser.iwork.KeynoteContentHandler
 
currentColumn - Variable in class org.apache.tika.parser.iwork.NumbersContentHandler
 
currentDepth - Variable in class org.apache.tika.sax.SecureContentHandler
The current XML element depth.
currentElement - Variable in class org.apache.tika.sax.ToXMLContentHandler
 
currentID - Variable in class org.apache.tika.parser.iwork.PagesContentHandler.Annotations
 
currentIndex - Variable in class org.apache.tika.io.TailStream
The current index into the tail buffer.
currentlyInUse - Variable in class org.apache.tika.fork.ForkParser
 
currentMenuItemId - Variable in class org.apache.tika.parser.iwork.NumbersContentHandler
 
currentText - Variable in class org.apache.tika.parser.iwork.PagesContentHandler.Annotations
 

D

data - Variable in class org.apache.tika.fork.MemoryURLConnection
 
data - Variable in class org.apache.tika.fork.MemoryURLStreamRecord
 
Database - Interface in org.apache.tika.metadata
 
dataStack - Variable in class org.apache.tika.sax.DIFContentHandler
 
DATE - Static variable in interface org.apache.tika.metadata.DublinCore
A date associated with an event in the life cycle of the resource.
DATE - Static variable in class org.apache.tika.metadata.Metadata
Deprecated.
use TikaCoreProperties#CREATED
DATE_CREATED - Static variable in interface org.apache.tika.metadata.IPTC
Designates the date and optionally the time the intellectual content was created rather than the date of the creation of the physical representation.
DATE_CREATED - Static variable in interface org.apache.tika.metadata.Photoshop
 
DateUtils - Class in org.apache.tika.utils
Date related utility methods and constants
DateUtils() - Constructor for class org.apache.tika.utils.DateUtils
 
DcXMLParser - Class in org.apache.tika.parser.xml
Dublin Core metadata parser
DcXMLParser() - Constructor for class org.apache.tika.parser.xml.DcXMLParser
 
decode(String) - Static method in class org.apache.tika.mime.HexCoDec
Decode a hex string
decode(char[]) - Static method in class org.apache.tika.mime.HexCoDec
Decode an array of hex chars
decode(char[], int, int) - Static method in class org.apache.tika.mime.HexCoDec
Decode an array of hex chars.
decodeString(String, String) - Static method in class org.apache.tika.detect.MagicDetector
 
decodeValue(String, String) - Static method in class org.apache.tika.detect.MagicDetector
 
decorate(Detector, Element) - Method in class org.apache.tika.config.TikaConfig.DetectorXmlLoader
 
decorate(ConfigurableThreadPoolExecutor, Element) - Method in class org.apache.tika.config.TikaConfig.ExecutorServiceXmlLoader
 
decorate(Parser, Element) - Method in class org.apache.tika.config.TikaConfig.ParserXmlLoader
 
decorate(Translator, Element) - Method in class org.apache.tika.config.TikaConfig.TranslatorXmlLoader
 
decorate(T, Element) - Method in class org.apache.tika.config.TikaConfig.XmlLoader
 
deepCopy(Metadata) - Method in class org.apache.tika.parser.RecursiveParserWrapper
 
DEFAULT_BUFFER_SIZE - Static variable in class org.apache.tika.io.IOUtils
The default buffer size to use.
DEFAULT_EXTENSION_TRUST - Static variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
 
DEFAULT_FILE_CONFIG - Static variable in class org.apache.tika.parser.strings.StringsParser
 
DEFAULT_LOADER - Static variable in class org.apache.tika.detect.AutoDetectReader
 
DEFAULT_MAGIC_TRUST - Static variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
probability parameters default value
DEFAULT_MAX_NGRAM_LENGTH - Static variable in class org.apache.tika.language.LanguageProfilerBuilder
The default max length of ngram
DEFAULT_META_TRUST - Static variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
 
DEFAULT_MIN_NGRAM_LENGTH - Static variable in class org.apache.tika.language.LanguageProfilerBuilder
The default min length of ngram
DEFAULT_NGRAM_LENGTH - Static variable in class org.apache.tika.language.LanguageProfile
 
DEFAULT_NUMBER_OF_BYTES_TO_TEST - Static variable in class org.apache.tika.detect.TextDetector
The number of bytes from the beginning of the document stream to test for control bytes.
DEFAULT_STRINGS_CONFIG - Static variable in class org.apache.tika.parser.strings.StringsParser
 
DEFAULT_TYPES - Static variable in class org.apache.tika.mime.MimeTypes
 
DefaultDetector - Class in org.apache.tika.detect
A composite detector based on all the Detector implementations available through the service provider mechanism.
DefaultDetector(MimeTypes, ServiceLoader, Collection<Class<? extends Detector>>) - Constructor for class org.apache.tika.detect.DefaultDetector
 
DefaultDetector(MimeTypes, ServiceLoader) - Constructor for class org.apache.tika.detect.DefaultDetector
 
DefaultDetector(MimeTypes, ClassLoader) - Constructor for class org.apache.tika.detect.DefaultDetector
 
DefaultDetector(ClassLoader) - Constructor for class org.apache.tika.detect.DefaultDetector
 
DefaultDetector(MimeTypes) - Constructor for class org.apache.tika.detect.DefaultDetector
 
DefaultDetector() - Constructor for class org.apache.tika.detect.DefaultDetector
 
defaultEven - Variable in class org.apache.tika.parser.iwork.PagesContentHandler.HeaderFooter
 
defaultFirst - Variable in class org.apache.tika.parser.iwork.PagesContentHandler.HeaderFooter
 
defaultInit() - Static method in class org.apache.tika.metadata.serialization.JsonMetadataBase
 
defaultOdd - Variable in class org.apache.tika.parser.iwork.PagesContentHandler.HeaderFooter
 
DefaultParser - Class in org.apache.tika.parser
A composite parser based on all the Parser implementations available through the service provider mechanism.
DefaultParser(MediaTypeRegistry, ServiceLoader, Collection<Class<? extends Parser>>) - Constructor for class org.apache.tika.parser.DefaultParser
 
DefaultParser(MediaTypeRegistry, ServiceLoader) - Constructor for class org.apache.tika.parser.DefaultParser
 
DefaultParser(MediaTypeRegistry, ClassLoader) - Constructor for class org.apache.tika.parser.DefaultParser
 
DefaultParser(ClassLoader) - Constructor for class org.apache.tika.parser.DefaultParser
 
DefaultParser(MediaTypeRegistry) - Constructor for class org.apache.tika.parser.DefaultParser
 
DefaultParser() - Constructor for class org.apache.tika.parser.DefaultParser
 
DefaultProbDetector - Class in org.apache.tika.detect
A version of DefaultDetector for probabilistic mime detectors, which use statistical techniques to blend the results of differing underlying detectors when attempting to detect the type of a given file.
DefaultProbDetector(ProbabilisticMimeDetectionSelector, ServiceLoader) - Constructor for class org.apache.tika.detect.DefaultProbDetector
 
DefaultProbDetector(ProbabilisticMimeDetectionSelector, ClassLoader) - Constructor for class org.apache.tika.detect.DefaultProbDetector
 
DefaultProbDetector(ClassLoader) - Constructor for class org.apache.tika.detect.DefaultProbDetector
 
DefaultProbDetector(MimeTypes) - Constructor for class org.apache.tika.detect.DefaultProbDetector
 
DefaultProbDetector() - Constructor for class org.apache.tika.detect.DefaultProbDetector
 
DefaultTranslator - Class in org.apache.tika.language.translate
A translator which picks the first available Translator implementations available through the service provider mechanism.
DefaultTranslator(ServiceLoader) - Constructor for class org.apache.tika.language.translate.DefaultTranslator
 
DefaultTranslator() - Constructor for class org.apache.tika.language.translate.DefaultTranslator
 
delegate - Variable in class org.apache.tika.sax.DIFContentHandler
 
delegate - Variable in class org.apache.tika.sax.TextContentHandler
 
DELEGATING_PARSER - Static variable in class org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor
 
DelegatingParser - Class in org.apache.tika.parser
Base class for parser implementations that want to delegate parts of the task of parsing an input document to another parser.
DelegatingParser() - Constructor for class org.apache.tika.parser.DelegatingParser
 
descend(String, String) - Method in class org.apache.tika.sax.xpath.ChildMatcher
 
descend(String, String) - Method in class org.apache.tika.sax.xpath.CompositeMatcher
 
descend(String, String) - Method in class org.apache.tika.sax.xpath.Matcher
Returns the XPath evaluation state that results from descending to a child element with the given name.
descend(String, String) - Method in class org.apache.tika.sax.xpath.NamedElementMatcher
 
descend(String, String) - Method in class org.apache.tika.sax.xpath.SubtreeMatcher
 
DESCRIPTION - Static variable in interface org.apache.tika.metadata.DublinCore
An account of the content of the resource.
DESCRIPTION - Static variable in interface org.apache.tika.metadata.IPTC
A textual description, including captions, of the item's content, particularly used where the object is not text.
DESCRIPTION - Static variable in class org.apache.tika.metadata.Metadata
Deprecated.
use TikaCoreProperties#DESCRIPTION
DESCRIPTION - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 
description - Variable in class org.apache.tika.mime.MimeType
Description of this media type.
description(Metadata, String, String) - Method in class org.apache.tika.sax.XMPContentHandler
 
DESCRIPTION_WRITER - Static variable in interface org.apache.tika.metadata.IPTC
Identifier or the name of the person involved in writing, editing or correcting the description of the content.
deserialize(JsonElement, Type, JsonDeserializationContext) - Method in class org.apache.tika.metadata.serialization.JsonMetadataDeserializer
Deserializes a json object (equivalent to: Map) into a Metadata object.
detect(InputStream, Metadata, List<EncodingDetector>, LoadErrorHandler) - Static method in class org.apache.tika.detect.AutoDetectReader
 
detect(InputStream, Metadata) - Method in class org.apache.tika.detect.CompositeDetector
 
detect(InputStream, Metadata) - Method in interface org.apache.tika.detect.Detector
Detects the content type of the given input document.
detect(InputStream, Metadata) - Method in class org.apache.tika.detect.EmptyDetector
 
detect(InputStream, Metadata) - Method in interface org.apache.tika.detect.EncodingDetector
Detects the character encoding of the given text document, or null if the encoding of the document can not be detected.
detect(InputStream, Metadata) - Method in class org.apache.tika.detect.MagicDetector
 
detect(InputStream, Metadata) - Method in class org.apache.tika.detect.NameDetector
Detects the content type of an input document based on the document name given in the input metadata.
detect(InputStream, Metadata) - Method in class org.apache.tika.detect.TextDetector
Looks at the beginning of the document input stream to determine whether the document is text or not.
detect(InputStream, Metadata) - Method in class org.apache.tika.detect.TrainedModelDetector
 
detect(InputStream, Metadata) - Method in class org.apache.tika.detect.TypeDetector
Detects the content type of an input document based on a type hint given in the input metadata.
detect(InputStream, Metadata) - Method in class org.apache.tika.mime.MimeTypes
Automatically detects the MIME type of a document based on magic markers in the stream prefix and any given metadata hints.
detect(InputStream, Metadata) - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
 
detect(InputStream, Metadata) - Method in class org.apache.tika.Tika
Detects the media type of the given document.
detect(InputStream, String) - Method in class org.apache.tika.Tika
Detects the media type of the given document.
detect(InputStream) - Method in class org.apache.tika.Tika
Detects the media type of the given document.
detect(byte[], String) - Method in class org.apache.tika.Tika
Detects the media type of the given document.
detect(byte[]) - Method in class org.apache.tika.Tika
Detects the media type of the given document.
detect(Path) - Method in class org.apache.tika.Tika
Detects the media type of the file at the given path.
detect(File) - Method in class org.apache.tika.Tika
Detects the media type of the given file.
detect(URL) - Method in class org.apache.tika.Tika
Detects the media type of the resource at the given URL.
detect(String) - Method in class org.apache.tika.Tika
Detects the media type of a document with the given file name.
detector - Variable in class org.apache.tika.config.TikaConfig
 
Detector - Interface in org.apache.tika.detect
Content type detector.
detector - Variable in class org.apache.tika.extractor.ParserContainerExtractor
 
detector - Variable in class org.apache.tika.mime.MagicMatch
 
detector - Variable in class org.apache.tika.parser.AutoDetectParser
The type detector used by this parser to auto-detect the type of a document.
detector - Variable in class org.apache.tika.Tika
The detector instance used by this facade.
detectors - Variable in class org.apache.tika.detect.CompositeDetector
 
detectorService - Variable in class org.apache.tika.parser.internal.Activator
 
detectorTracker - Variable in class org.apache.tika.config.TikaActivator
 
DetectorXmlLoader() - Constructor for class org.apache.tika.config.TikaConfig.DetectorXmlLoader
 
detectType(ZipArchiveEntry, ZipFile) - Static method in enum org.apache.tika.parser.iwork.IWorkPackageParser.IWORKDocumentType
 
detectType(ZipArchiveEntry, ZipArchiveInputStream) - Static method in enum org.apache.tika.parser.iwork.IWorkPackageParser.IWORKDocumentType
 
detectType(InputStream) - Static method in enum org.apache.tika.parser.iwork.IWorkPackageParser.IWORKDocumentType
 
DIFContentHandler - Class in org.apache.tika.sax
 
DIFContentHandler(ContentHandler, Metadata) - Constructor for class org.apache.tika.sax.DIFContentHandler
 
digest(InputStream, Metadata, ParseContext) - Method in interface org.apache.tika.parser.DigestingParser.Digester
Digests an InputStream and sets the appropriate value(s) in the metadata.
digest(InputStream, Metadata, ParseContext) - Method in class org.apache.tika.parser.utils.CommonsDigester
 
DigestAlgorithm() - Constructor for enum org.apache.tika.parser.utils.CommonsDigester.DigestAlgorithm
 
digestEach(CommonsDigester.DigestAlgorithm, InputStream, Metadata) - Method in class org.apache.tika.parser.utils.CommonsDigester
 
digester - Variable in class org.apache.tika.parser.DigestingParser
 
digestFile(File, Metadata) - Method in class org.apache.tika.parser.utils.CommonsDigester
 
DigestingParser - Class in org.apache.tika.parser
 
DigestingParser(Parser, DigestingParser.Digester) - Constructor for class org.apache.tika.parser.DigestingParser
Creates a decorator for the given parser.
DigestingParser.Digester - Interface in org.apache.tika.parser
Interface for optional digester, if specified during construction.
DIGITAL_IMAGE_GUID - Static variable in interface org.apache.tika.metadata.IPTC
Globally unique identifier for the item.
DIGITAL_SOURCE_FILE_TYPE - Static variable in interface org.apache.tika.metadata.IPTC
Deprecated. 
DIGITAL_SOURCE_TYPE - Static variable in interface org.apache.tika.metadata.IPTC
The type of the source of this digital image
DISC_NUMBER - Static variable in interface org.apache.tika.metadata.XMPDM
"The disc number for part of an album set."
dispose() - Method in class org.apache.tika.io.TemporaryResources
Calls the TemporaryResources.close() method and wraps the potential IOException into a TikaException for convenience when used within Tika.
distance - Variable in class org.apache.tika.language.LanguageIdentifier
 
distance(LanguageProfile) - Method in class org.apache.tika.language.LanguageProfile
Calculates the geometric distance between this and the given other language profile.
distanceInterleaved(LanguageProfile) - Method in class org.apache.tika.language.LanguageProfile
 
distanceStandard(LanguageProfile) - Method in class org.apache.tika.language.LanguageProfile
 
DL - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
DOC_SECURITY - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
 
DOCUMENTID - Static variable in interface org.apache.tika.metadata.XMPMM
The common identifier for all versions and renditions of a resource.
DocumentPart() - Constructor for enum org.apache.tika.parser.iwork.PagesContentHandler.DocumentPart
 
DocumentSelector - Interface in org.apache.tika.extractor
Interface for different document selection strategies for purposes like embedded document extraction by a ContainerExtractor instance.
documentStarted - Variable in class org.apache.tika.sax.XHTMLContentHandler
Flag to indicate whether the document has been started.
doEndOfFile() - Method in class org.apache.tika.io.NullInputStream
Handle End of File.
doFile(File, FileConfig) - Method in class org.apache.tika.parser.strings.StringsParser
Runs the "file" command on the given file that aims at providing an alternative way to determine the file type.
doFooter() - Method in class org.apache.tika.parser.iwork.PagesContentHandler
 
doFormatDate(Calendar) - Static method in class org.apache.tika.utils.DateUtils
 
doHeader() - Method in class org.apache.tika.parser.iwork.PagesContentHandler
 
DONE - Static variable in class org.apache.tika.fork.ForkServer
 
doneSending() - Method in class org.apache.tika.fork.ContentHandlerProxy
 
doParse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.strings.Latin1StringsParser
Does a best effort to extract Latin1 strings encoded with ISO-8859-1, UTF-8 or UTF-16.
doStrings(File, StringsConfig, XHTMLContentHandler) - Method in class org.apache.tika.parser.strings.StringsParser
Runs the "strings" command on the given file.
DR - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
DublinCore - Interface in org.apache.tika.metadata
A collection of Dublin Core metadata names.
DURATION - Static variable in interface org.apache.tika.metadata.XMPDM
"The duration of the media file."
dynamic - Variable in class org.apache.tika.config.ServiceLoader
 

E

EDIT_TIME - Static variable in interface org.apache.tika.metadata.MSOffice
How long has been spent editing the document?
element(String, String) - Method in class org.apache.tika.sax.XHTMLContentHandler
Emits an XHTML element with the given text content.
ELEMENT_BINARY - Static variable in class org.apache.tika.parser.xml.FictionBookParser.BinaryElementsDataHandler
 
ElementInfo(ToXMLContentHandler.ElementInfo, Map<String, String>) - Constructor for class org.apache.tika.sax.ToXMLContentHandler.ElementInfo
 
ElementMappingContentHandler - Class in org.apache.tika.sax
Content handler decorator that maps element QNames using a Map.
ElementMappingContentHandler(ContentHandler, Map<QName, ElementMappingContentHandler.TargetElement>) - Constructor for class org.apache.tika.sax.ElementMappingContentHandler
 
ElementMappingContentHandler.TargetElement - Class in org.apache.tika.sax
 
ElementMatcher - Class in org.apache.tika.sax.xpath
Final evaluation state of an XPath expression that targets an element.
ElementMatcher() - Constructor for class org.apache.tika.sax.xpath.ElementMatcher
 
ElementMetadataHandler - Class in org.apache.tika.parser.xml
SAX event handler that maps the contents of an XML element into a metadata field.
ElementMetadataHandler(String, String, Metadata, String) - Constructor for class org.apache.tika.parser.xml.ElementMetadataHandler
Constructor for string metadata keys.
ElementMetadataHandler(String, String, Metadata, String, boolean, boolean) - Constructor for class org.apache.tika.parser.xml.ElementMetadataHandler
Constructor for string metadata keys which allows change of behavior for duplicate and empty entry values.
ElementMetadataHandler(String, String, Metadata, Property) - Constructor for class org.apache.tika.parser.xml.ElementMetadataHandler
Constructor for Property metadata keys.
ElementMetadataHandler(String, String, Metadata, Property, boolean, boolean) - Constructor for class org.apache.tika.parser.xml.ElementMetadataHandler
Constructor for Property metadata keys which allows change of behavior for duplicate and empty entry values.
EMB_APP_VERSION - Static variable in interface org.apache.tika.metadata.RTFMetadata
if an application and version is given as part of the embedded object, this is the literal string
EMB_CLASS - Static variable in interface org.apache.tika.metadata.RTFMetadata
 
EMB_ITEM - Static variable in interface org.apache.tika.metadata.RTFMetadata
 
EMB_TOPIC - Static variable in interface org.apache.tika.metadata.RTFMetadata
 
embed(Metadata, InputStream, OutputStream, ParseContext) - Method in interface org.apache.tika.embedder.Embedder
Embeds related document metadata from the given metadata object into the given output stream.
embed(Metadata, InputStream, OutputStream, ParseContext) - Method in class org.apache.tika.embedder.ExternalEmbedder
Executes the configured external command and passes the given document stream as a simple XHTML document to the given SAX content handler.
EMBEDDED_EXCEPTION - Static variable in class org.apache.tika.parser.RecursiveParserWrapper
 
EMBEDDED_RELATIONSHIP_ID - Static variable in interface org.apache.tika.metadata.TikaMetadataKeys
 
EMBEDDED_RESOURCE_LIMIT_REACHED - Static variable in class org.apache.tika.parser.RecursiveParserWrapper
 
EMBEDDED_RESOURCE_PATH - Static variable in class org.apache.tika.parser.RecursiveParserWrapper
 
EMBEDDED_RESOURCE_TYPE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
EMBEDDED_RESOURCE_TYPE - Static variable in interface org.apache.tika.metadata.TikaMetadataKeys
 
EmbeddedContentHandler - Class in org.apache.tika.sax
Content handler decorator that prevents the EmbeddedContentHandler.startDocument() and EmbeddedContentHandler.endDocument() events from reaching the decorated handler.
EmbeddedContentHandler(ContentHandler) - Constructor for class org.apache.tika.sax.EmbeddedContentHandler
Created a decorator that prevents the given handler from receiving EmbeddedContentHandler.startDocument() and EmbeddedContentHandler.endDocument() events.
EmbeddedDocumentExtractor - Interface in org.apache.tika.extractor
 
EmbeddedParserDecorator(String) - Constructor for class org.apache.tika.parser.RecursiveParserWrapper.EmbeddedParserDecorator
 
EmbeddedResourceHandler - Interface in org.apache.tika.extractor
Tika container extractor callback interface.
EmbeddedResourceType() - Constructor for enum org.apache.tika.metadata.TikaCoreProperties.EmbeddedResourceType
 
Embedder - Interface in org.apache.tika.embedder
Tika embedder interface
EMPTY_ATTRIBUTES - Static variable in class org.apache.tika.sax.DIFContentHandler
 
EMPTY_ATTRIBUTES - Static variable in class org.apache.tika.sax.XHTMLContentHandler
 
EMPTY_ATTRIBUTES - Static variable in class org.apache.tika.sax.XMPContentHandler
 
EMPTY_ELEMENTS - Static variable in class org.apache.tika.sax.ToHTMLContentHandler
 
EmptyDetector - Class in org.apache.tika.detect
Dummy detector that returns application/octet-stream for all documents.
EmptyDetector() - Constructor for class org.apache.tika.detect.EmptyDetector
 
EmptyParser - Class in org.apache.tika.parser
Dummy parser that always produces an empty XHTML document without even attempting to parse the given document stream.
EmptyParser() - Constructor for class org.apache.tika.parser.EmptyParser
 
EmptyTranslator - Class in org.apache.tika.language.translate
Dummy translator that always declines to give any text.
EmptyTranslator() - Constructor for class org.apache.tika.language.translate.EmptyTranslator
 
encode(byte[]) - Static method in class org.apache.tika.mime.HexCoDec
Hex encode an array of bytes
encode(byte[], int, int) - Static method in class org.apache.tika.mime.HexCoDec
Hex encode an array of bytes
encoding - Variable in class org.apache.tika.parser.strings.StringsConfig
 
encoding - Variable in enum org.apache.tika.parser.strings.StringsEncoding
 
encoding - Variable in class org.apache.tika.sax.ToXMLContentHandler
 
EncodingDetector - Interface in org.apache.tika.detect
Character encoding detector.
EncryptedDocumentException - Exception in org.apache.tika.exception
 
EncryptedDocumentException() - Constructor for exception org.apache.tika.exception.EncryptedDocumentException
 
EncryptedDocumentException(Throwable) - Constructor for exception org.apache.tika.exception.EncryptedDocumentException
 
EncryptedDocumentException(String) - Constructor for exception org.apache.tika.exception.EncryptedDocumentException
 
EncryptedDocumentException(String, Throwable) - Constructor for exception org.apache.tika.exception.EncryptedDocumentException
 
end() - Method in class org.apache.tika.parser.iwork.PagesContentHandler.Annotations
 
END_DOCUMENT - Static variable in class org.apache.tika.fork.ContentHandlerProxy
 
END_ELEMENT - Static variable in class org.apache.tika.fork.ContentHandlerProxy
 
END_PREFIX_MAPPING - Static variable in class org.apache.tika.fork.ContentHandlerProxy
 
endDescription() - Method in class org.apache.tika.sax.XMPContentHandler
 
endDocument() - Method in class org.apache.tika.fork.ContentHandlerProxy
 
endDocument() - Method in class org.apache.tika.parser.iwork.KeynoteContentHandler
 
endDocument() - Method in class org.apache.tika.parser.iwork.NumbersContentHandler
 
endDocument() - Method in class org.apache.tika.parser.iwork.PagesContentHandler
 
endDocument() - Method in class org.apache.tika.sax.ContentHandlerDecorator
 
endDocument() - Method in class org.apache.tika.sax.DIFContentHandler
 
endDocument() - Method in class org.apache.tika.sax.EmbeddedContentHandler
Ignored.
endDocument() - Method in class org.apache.tika.sax.EndDocumentShieldingContentHandler
 
endDocument() - Method in class org.apache.tika.sax.PhoneExtractingContentHandler
This method is called whenever the Parser is done parsing the file.
endDocument() - Method in class org.apache.tika.sax.SafeContentHandler
 
endDocument() - Method in class org.apache.tika.sax.TeeContentHandler
 
endDocument() - Method in class org.apache.tika.sax.TextContentHandler
 
endDocument() - Method in class org.apache.tika.sax.ToTextContentHandler
Flushes the character stream so that no characters are forgotten in internal buffers.
endDocument() - Method in class org.apache.tika.sax.XHTMLContentHandler
Ends the XHTML document by writing the following footer and clearing the namespace mappings:
endDocument() - Method in class org.apache.tika.sax.XMPContentHandler
Ends the XMP document by writing the following footer and clearing the namespace mappings:
endDocumentCalled - Variable in class org.apache.tika.sax.EndDocumentShieldingContentHandler
 
EndDocumentShieldingContentHandler - Class in org.apache.tika.sax
A wrapper around a ContentHandler which will ignore normal SAX calls to EndDocumentShieldingContentHandler.endDocument(), and only fire them later.
EndDocumentShieldingContentHandler(ContentHandler) - Constructor for class org.apache.tika.sax.EndDocumentShieldingContentHandler
Creates a decorator for the given SAX event handler.
endElement(String, String, String) - Method in class org.apache.tika.fork.ContentHandlerProxy
 
endElement(String, String, String) - Method in class org.apache.tika.mime.MimeTypesReader
 
endElement(String, String, String) - Method in class org.apache.tika.parser.iwork.KeynoteContentHandler
 
endElement(String, String, String) - Method in class org.apache.tika.parser.iwork.NumbersContentHandler
 
endElement(String, String, String) - Method in class org.apache.tika.parser.iwork.PagesContentHandler
 
endElement(String, String, String) - Method in class org.apache.tika.parser.xml.AttributeDependantMetadataHandler
 
endElement(String, String, String) - Method in class org.apache.tika.parser.xml.ElementMetadataHandler
 
endElement(String, String, String) - Method in class org.apache.tika.parser.xml.FictionBookParser.BinaryElementsDataHandler
 
endElement(String, String, String) - Method in class org.apache.tika.parser.xml.MetadataHandler
Deprecated.
 
endElement(String, String, String) - Method in class org.apache.tika.sax.ContentHandlerDecorator
 
endElement(String, String, String) - Method in class org.apache.tika.sax.DIFContentHandler
 
endElement(String, String, String) - Method in class org.apache.tika.sax.ElementMappingContentHandler
 
endElement(String, String, String) - Method in class org.apache.tika.sax.ExpandedTitleContentHandler
 
endElement(String, String, String) - Method in class org.apache.tika.sax.LinkContentHandler
 
endElement(String, String, String) - Method in class org.apache.tika.sax.SafeContentHandler
 
endElement(String, String, String) - Method in class org.apache.tika.sax.SecureContentHandler
 
endElement(String, String, String) - Method in class org.apache.tika.sax.TeeContentHandler
 
endElement(String, String, String) - Method in class org.apache.tika.sax.ToHTMLContentHandler
 
endElement(String, String, String) - Method in class org.apache.tika.sax.ToXMLContentHandler
 
endElement(String, String, String) - Method in class org.apache.tika.sax.XHTMLContentHandler
Ends the given element.
endElement(String) - Method in class org.apache.tika.sax.XHTMLContentHandler
 
endElement(String, String, String) - Method in class org.apache.tika.sax.xpath.MatchingContentHandler
 
EndianUtils - Class in org.apache.tika.io
General Endian Related Utilties.
EndianUtils() - Constructor for class org.apache.tika.io.EndianUtils
 
EndianUtils.BufferUnderrunException - Exception in org.apache.tika.io
 
ENDLINE - Static variable in class org.apache.tika.sax.XHTMLContentHandler
The elements that get appended with the XHTMLContentHandler.NL character.
endPrefixMapping(String) - Method in class org.apache.tika.fork.ContentHandlerProxy
 
endPrefixMapping(String) - Method in class org.apache.tika.sax.ContentHandlerDecorator
 
endPrefixMapping(String) - Method in class org.apache.tika.sax.TeeContentHandler
 
ENGINEER - Static variable in interface org.apache.tika.metadata.XMPDM
"The engineer's name."
entries - Variable in class org.apache.tika.language.LanguageProfile.Interleaved
 
entriesGeneratedAtCount - Variable in class org.apache.tika.language.LanguageProfile.Interleaved
 
Entry() - Constructor for class org.apache.tika.language.LanguageProfile.Interleaved.Entry
 
ENVI_MIME_TYPE - Static variable in class org.apache.tika.parser.envi.EnviHeaderParser
 
EnviHeaderParser - Class in org.apache.tika.parser.envi
 
EnviHeaderParser() - Constructor for class org.apache.tika.parser.envi.EnviHeaderParser
 
eof - Variable in class org.apache.tika.io.NullInputStream
 
EOF - Static variable in class org.apache.tika.parser.utils.CommonsDigester.SimpleBoundedInputStream
 
EOT - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
EpubContentParser - Class in org.apache.tika.parser.epub
Parser for EPUB OPS *.html files.
EpubContentParser() - Constructor for class org.apache.tika.parser.epub.EpubContentParser
 
EpubParser - Class in org.apache.tika.parser.epub
Epub parser
EpubParser() - Constructor for class org.apache.tika.parser.epub.EpubParser
 
EQ - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
equals(Object) - Method in class org.apache.tika.language.LanguageProfilerBuilder.NGramEntry
 
equals(Object) - Method in class org.apache.tika.metadata.Metadata
 
equals(Object) - Method in class org.apache.tika.metadata.Property
 
equals(Object) - Method in class org.apache.tika.mime.Magic
 
equals(Object) - Method in class org.apache.tika.mime.MediaType
 
equals(Object) - Method in class org.apache.tika.mime.MimeType
 
equals(String, String) - Static method in class org.apache.tika.sax.xpath.NamedAttributeMatcher
 
equals(String, String) - Static method in class org.apache.tika.sax.xpath.NamedElementMatcher
 
EQUIPMENT_MAKE - Static variable in interface org.apache.tika.metadata.TIFF
"Manufacturer of the recording equipment."
EQUIPMENT_MODEL - Static variable in interface org.apache.tika.metadata.TIFF
"Model name or number of the recording equipment."
error - Variable in class org.apache.tika.fork.ForkClient
 
ERROR - Static variable in class org.apache.tika.fork.ForkServer
 
ERROR_CODES_TAG - Static variable in interface org.apache.tika.parser.external.ExternalParsersConfigReaderMetKeys
 
ErrorParser - Class in org.apache.tika.parser
Dummy parser that always throws a TikaException without even attempting to parse the given document stream.
ErrorParser() - Constructor for class org.apache.tika.parser.ErrorParser
 
errors - Static variable in class org.apache.tika.language.LanguageIdentifier
 
ETX - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
eval(byte[]) - Method in class org.apache.tika.mime.AndClause
 
eval(byte[]) - Method in interface org.apache.tika.mime.Clause
Evaluates this clause with the specified chunk of data.
eval(byte[]) - Method in class org.apache.tika.mime.Magic
 
eval(byte[]) - Method in class org.apache.tika.mime.MagicMatch
 
eval(byte[]) - Method in class org.apache.tika.mime.OrClause
 
EVENT - Static variable in interface org.apache.tika.metadata.IPTC
Names or describes the specific event the content relates to.
EXAMPLE_NNMODEL_FILE - Static variable in class org.apache.tika.detect.NNExampleModelDetector
 
exception - Variable in class org.apache.tika.parser.NetworkParser.ParsingTask
 
ExceptionUtils - Class in org.apache.tika.utils
 
ExceptionUtils() - Constructor for class org.apache.tika.utils.ExceptionUtils
 
execCommand(String[]) - Method in class org.apache.tika.parser.gdal.GDALParser
 
execute(ParseContext, Runnable) - Static method in class org.apache.tika.utils.ConcurrentUtils
Execute a runnable using an ExecutorService from the ParseContext if possible.
executorService - Variable in class org.apache.tika.config.TikaConfig
 
ExecutorServiceXmlLoader() - Constructor for class org.apache.tika.config.TikaConfig.ExecutorServiceXmlLoader
 
expandCapacity(int) - Method in class org.apache.tika.language.LanguageProfilerBuilder.QuickStringBuffer
 
ExpandedTitleContentHandler - Class in org.apache.tika.sax
Content handler decorator which wraps a TransformerHandler in order to allow the TITLE tag to render as <title></title> rather than <title/> which is accomplished by calling the ContentHandler.characters(char[], int, int) method with a length of 1 but a zero length char array.
ExpandedTitleContentHandler() - Constructor for class org.apache.tika.sax.ExpandedTitleContentHandler
 
ExpandedTitleContentHandler(ContentHandler) - Constructor for class org.apache.tika.sax.ExpandedTitleContentHandler
 
EXPERIMENT_ID - Static variable in interface org.apache.tika.metadata.ClimateForcast
 
EXPOSURE_TIME - Static variable in interface org.apache.tika.metadata.TIFF
"Exposure time in seconds."
extension_neg - Variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
 
extension_neg(float) - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
 
extension_neg - Variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
 
extension_trust - Variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
 
extension_trust(float) - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
 
extension_trust - Variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
 
extensions - Variable in class org.apache.tika.mime.MimeType
All known file extensions of this type, in order of preference (best first).
extensions - Variable in class org.apache.tika.mime.Patterns
Index of extension patterns of the form "*extension".
EXTERNAL_PARSERS_TAG - Static variable in interface org.apache.tika.parser.external.ExternalParsersConfigReaderMetKeys
 
externalBoolean(String) - Static method in class org.apache.tika.metadata.Property
 
externalClosedChoise(String, String...) - Static method in class org.apache.tika.metadata.Property
 
externalDate(String) - Static method in class org.apache.tika.metadata.Property
 
ExternalEmbedder - Class in org.apache.tika.embedder
Embedder that uses an external program (like sed or exiftool) to embed text content and metadata into a given document.
ExternalEmbedder() - Constructor for class org.apache.tika.embedder.ExternalEmbedder
 
externalInteger(String) - Static method in class org.apache.tika.metadata.Property
 
externalOpenChoise(String, String...) - Static method in class org.apache.tika.metadata.Property
 
ExternalParser - Class in org.apache.tika.parser.external
Parser that uses an external program (like catdoc or pdf2txt) to extract text content and metadata from a given document.
ExternalParser() - Constructor for class org.apache.tika.parser.external.ExternalParser
 
ExternalParsersConfigReader - Class in org.apache.tika.parser.external
Builds up ExternalParser instances based on XML file(s) which define what to run, for what, and how to process any output metadata.
ExternalParsersConfigReader() - Constructor for class org.apache.tika.parser.external.ExternalParsersConfigReader
 
ExternalParsersConfigReaderMetKeys - Interface in org.apache.tika.parser.external
Met Keys used by the ExternalParsersConfigReader.
ExternalParsersFactory - Class in org.apache.tika.parser.external
Creates instances of ExternalParser based on XML configuration files.
ExternalParsersFactory() - Constructor for class org.apache.tika.parser.external.ExternalParsersFactory
 
externalReal(String) - Static method in class org.apache.tika.metadata.Property
 
externalText(String) - Static method in class org.apache.tika.metadata.Property
 
externalTextBag(String) - Static method in class org.apache.tika.metadata.Property
 
extract(TikaInputStream, ContainerExtractor, EmbeddedResourceHandler) - Method in interface org.apache.tika.extractor.ContainerExtractor
Processes a container file, and extracts all the embedded resources from within it.
extract(TikaInputStream, ContainerExtractor, EmbeddedResourceHandler) - Method in class org.apache.tika.extractor.ParserContainerExtractor
 
EXTRACT_CONTENT - Static variable in interface org.apache.tika.metadata.AccessPermissions
Should content be extracted, generally.
EXTRACT_FOR_ACCESSIBILITY - Static variable in interface org.apache.tika.metadata.AccessPermissions
Should content be extracted for the purposes of accessibility.
extractLinks(String) - Static method in class org.apache.tika.utils.RegexUtils
Extract urls from plain text.
extractMetadata(InputStream, Metadata) - Method in class org.apache.tika.parser.external.ExternalParser
 
extractMetFromOutput(String, Metadata) - Method in class org.apache.tika.parser.gdal.GDALParser
 
extractor - Variable in class org.apache.tika.extractor.ParserContainerExtractor.RecursiveParser
 
ExtractorHandler() - Constructor for class org.apache.tika.detect.XmlRootExtractor.ExtractorHandler
 
extractOutput(InputStream, XHTMLContentHandler) - Method in class org.apache.tika.parser.external.ExternalParser
Starts a thread that extracts the contents of the standard output stream of the given process to the given XHTML content handler.
extractOutput(InputStream) - Method in class org.apache.tika.parser.gdal.GDALParser
 
extractOutput(InputStream, XHTMLContentHandler) - Method in class org.apache.tika.parser.strings.StringsParser
Extracts ASCII strings using the "strings" command.
extractPhoneNumbers(String) - Static method in class org.apache.tika.sax.CleanPhoneText
 
extractRootElement(byte[]) - Method in class org.apache.tika.detect.XmlRootExtractor
 
extractRootElement(InputStream) - Method in class org.apache.tika.detect.XmlRootExtractor
 

F

F_NUMBER - Static variable in interface org.apache.tika.metadata.TIFF
"F-Number." The f-number is the focal length divided by the "effective" aperture diameter.
FAIL - Static variable in class org.apache.tika.sax.xpath.Matcher
State of a failed XPath evaluation, where nothing is matched.
fallback - Variable in class org.apache.tika.parser.CompositeParser
The fallback parser, used when no better parser is available.
FeedParser - Class in org.apache.tika.parser.feed
Feed parser.
FeedParser() - Constructor for class org.apache.tika.parser.feed.FeedParser
 
FF - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
FictionBookParser - Class in org.apache.tika.parser.xml
 
FictionBookParser() - Constructor for class org.apache.tika.parser.xml.FictionBookParser
 
FictionBookParser.BinaryElementsDataHandler - Class in org.apache.tika.parser.xml
 
FILE_DATA_RATE - Static variable in interface org.apache.tika.metadata.XMPDM
"The file data rate in megabytes per second.
FILE_EXTENSION - Static variable in class org.apache.tika.language.LanguageProfilerBuilder
The ngram profile file extension
FileConfig - Class in org.apache.tika.parser.strings
Configuration for the "file" (or file-alternative) command.
FileConfig() - Constructor for class org.apache.tika.parser.strings.FileConfig
Default constructor.
FilenameUtils - Class in org.apache.tika.io
 
FilenameUtils() - Constructor for class org.apache.tika.io.FilenameUtils
 
filePath - Variable in class org.apache.tika.parser.strings.FileConfig
 
fill() - Method in class org.apache.tika.io.LookaheadInputStream
 
FILL_IN_FORM - Static variable in interface org.apache.tika.metadata.AccessPermissions
Can the user fill in a form
fillBootstrapJar(File) - Static method in class org.apache.tika.fork.ForkClient
Fills in the jar file used to bootstrap the forked server process.
filter(char[], int, int, SafeContentHandler.Output) - Method in class org.apache.tika.sax.SafeContentHandler
Filters and outputs the contents of the given input buffer.
findClass(String) - Method in class org.apache.tika.fork.ClassLoaderProxy
 
findDuplicateParsers(ParseContext) - Method in class org.apache.tika.parser.CompositeParser
Utility method that goes through all the component parsers and finds all media types for which more than one parser declares support.
findResource(String) - Method in class org.apache.tika.fork.ClassLoaderProxy
 
findResources(String) - Method in class org.apache.tika.fork.ClassLoaderProxy
 
findServiceResources(String) - Method in class org.apache.tika.config.ServiceLoader
Returns all the available service resources matching the given pattern, such as all instances of tika-mimetypes.xml on the classpath, or all org.apache.tika.parser.Parser service files.
firstEntry() - Method in class org.apache.tika.language.LanguageProfile.Interleaved
 
firstPass - Variable in class org.apache.tika.utils.RereadableInputStream
True when the original stream is being read; set to false when reading is set to use the stored data instead.
FLASH_FIRED - Static variable in interface org.apache.tika.metadata.TIFF
Did the Flash fire when taking this image?
flush() - Method in class org.apache.tika.language.ProfilingWriter
Ignored.
flushBuffer() - Method in class org.apache.tika.parser.strings.Latin1StringsParser
Flushes the internal output buffer to the content handler.
FLVParser - Class in org.apache.tika.parser.video
Parser for metadata contained in Flash Videos (.flv).
FLVParser() - Constructor for class org.apache.tika.parser.video.FLVParser
 
FMT_ANPA_1312 - Variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
FMT_ANPA_UPI - Variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
FMT_ANPA_UPI_DL - Variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
FMT_IPTC_7901 - Variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
FMT_IPTC_AP - Variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
FMT_IPTC_BLM - Variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
FMT_IPTC_CHAR - Variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
FMT_IPTC_NYT - Variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
FMT_IPTC_PHOTO - Variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
FMT_IPTC_RTR - Variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
FMT_NITF - Variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
FMT_NITF_RB - Variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
FMT_NITF_TT - Variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
FOCAL_LENGTH - Static variable in interface org.apache.tika.metadata.TIFF
"Focal length of the lens, in millimeters."
footers - Variable in class org.apache.tika.parser.iwork.PagesContentHandler
 
footnotes - Variable in class org.apache.tika.parser.iwork.PagesContentHandler
 
Footnotes() - Constructor for class org.apache.tika.parser.iwork.PagesContentHandler.Footnotes
 
footnotes - Variable in class org.apache.tika.parser.iwork.PagesContentHandler.Footnotes
Mark -> Text
ForkClient - Class in org.apache.tika.fork
 
ForkClient(ClassLoader, Object, List<String>) - Constructor for class org.apache.tika.fork.ForkClient
 
ForkObjectInputStream - Class in org.apache.tika.fork
An object input stream that uses a given class loader when deserializing objects.
ForkObjectInputStream(InputStream, ClassLoader) - Constructor for class org.apache.tika.fork.ForkObjectInputStream
Creates a new object input stream that uses the given class loader when deserializing objects.
ForkParser - Class in org.apache.tika.fork
 
ForkParser(ClassLoader, Parser) - Constructor for class org.apache.tika.fork.ForkParser
 
ForkParser(ClassLoader) - Constructor for class org.apache.tika.fork.ForkParser
 
ForkParser() - Constructor for class org.apache.tika.fork.ForkParser
 
ForkProxy - Interface in org.apache.tika.fork
 
ForkResource - Interface in org.apache.tika.fork
 
ForkServer - Class in org.apache.tika.fork
 
ForkServer(InputStream, OutputStream) - Constructor for class org.apache.tika.fork.ForkServer
Sets up a forked server instance using the given stdin/out communication channel.
FORMAT - Static variable in interface org.apache.tika.metadata.DublinCore
Typically, Format may include the media-type or dimensions of the resource.
FORMAT - Static variable in class org.apache.tika.metadata.Metadata
Deprecated.
use TikaCoreProperties#FORMAT
FORMAT - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 
FORMAT - Variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
formatDate(Date) - Static method in class org.apache.tika.utils.DateUtils
Returns a ISO 8601 representation of the given date.
formatDate(Calendar) - Static method in class org.apache.tika.utils.DateUtils
Returns a ISO 8601 representation of the given date.
formatDateUnknownTimezone(Date) - Static method in class org.apache.tika.utils.DateUtils
Returns a ISO 8601 representation of the given date, which is in an unknown timezone.
forName(String) - Method in class org.apache.tika.mime.MimeTypes
Returns the registered media type with the given name (or alias).
forName(String) - Static method in class org.apache.tika.utils.CharsetUtils
Returns Charset impl, if one exists.
frequency - Variable in class org.apache.tika.language.LanguageProfilerBuilder.NGramEntry
The frequency of this ngram in its profile
fromJson(Reader) - Static method in class org.apache.tika.metadata.serialization.JsonMetadata
Read metadata from reader.
fromJson(Reader) - Static method in class org.apache.tika.metadata.serialization.JsonMetadataList
Read metadata from reader.
FS - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 

G

GDALParser - Class in org.apache.tika.parser.gdal
Wraps execution of the Geospatial Data Abstraction Library (GDAL) gdalinfo tool used to extract geospatial information out of hundreds of geo file formats.
GDALParser() - Constructor for class org.apache.tika.parser.gdal.GDALParser
 
GENRE - Static variable in interface org.apache.tika.metadata.XMPDM
"The name of the genre."
Geographic - Interface in org.apache.tika.metadata
Geographic schema.
get(InputStream) - Static method in class org.apache.tika.io.TaggedInputStream
Casts or wraps the given stream to a TaggedInputStream instance.
get(InputStream, TemporaryResources) - Static method in class org.apache.tika.io.TikaInputStream
Casts or wraps the given stream to a TikaInputStream instance.
get(InputStream) - Static method in class org.apache.tika.io.TikaInputStream
Casts or wraps the given stream to a TikaInputStream instance.
get(byte[]) - Static method in class org.apache.tika.io.TikaInputStream
Creates a TikaInputStream from the given array of bytes.
get(byte[], Metadata) - Static method in class org.apache.tika.io.TikaInputStream
Creates a TikaInputStream from the given array of bytes.
get(Path) - Static method in class org.apache.tika.io.TikaInputStream
Creates a TikaInputStream from the file at the given path.
get(Path, Metadata) - Static method in class org.apache.tika.io.TikaInputStream
Creates a TikaInputStream from the file at the given path.
get(File) - Static method in class org.apache.tika.io.TikaInputStream
Deprecated.
use TikaInputStream.get(Path). In Tika 2.0, this will be removed or modified to throw an IOException.
get(File, Metadata) - Static method in class org.apache.tika.io.TikaInputStream
Deprecated.
use TikaInputStream.get(Path, Metadata). In Tika 2.0, this will be removed or modified to throw an IOException.
get(Blob) - Static method in class org.apache.tika.io.TikaInputStream
Creates a TikaInputStream from the given database BLOB.
get(Blob, Metadata) - Static method in class org.apache.tika.io.TikaInputStream
Creates a TikaInputStream from the given database BLOB.
get(URI) - Static method in class org.apache.tika.io.TikaInputStream
Creates a TikaInputStream from the resource at the given URI.
get(URI, Metadata) - Static method in class org.apache.tika.io.TikaInputStream
Creates a TikaInputStream from the resource at the given URI.
get(URL) - Static method in class org.apache.tika.io.TikaInputStream
Creates a TikaInputStream from the resource at the given URL.
get(URL, Metadata) - Static method in class org.apache.tika.io.TikaInputStream
Creates a TikaInputStream from the resource at the given URL.
get(String) - Method in class org.apache.tika.metadata.Metadata
Get the value associated to a metadata name.
get(Property) - Method in class org.apache.tika.metadata.Metadata
Returns the value (if any) of the identified metadata property.
get(String) - Static method in class org.apache.tika.metadata.Property
Retrieve the property object that corresponds to the given key
get(Class<T>) - Method in class org.apache.tika.parser.ParseContext
Returns the object in this context that implements the given interface.
get(Class<T>, T) - Method in class org.apache.tika.parser.ParseContext
Returns the object in this context that implements the given interface, or the given default value if such an object is not found.
get() - Method in enum org.apache.tika.parser.strings.StringsEncoding
 
getAcronym() - Method in class org.apache.tika.mime.MimeType
Returns an acronym for this mime type.
getAliases(MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
Returns the set of known aliases of the given canonical media type.
getAllComponentParsers() - Method in class org.apache.tika.parser.CompositeParser
Returns all parsers registered with the Composite Parser, including ones which may not currently be active.
getAllComponentParsers() - Method in class org.apache.tika.parser.DefaultParser
 
getAttributesMapping() - Method in class org.apache.tika.sax.ElementMappingContentHandler.TargetElement
 
getBaseType() - Method in class org.apache.tika.mime.MediaType
Returns the base form of the MediaType, excluding any parameters, such as "text/plain" for "text/plain; charset=utf-8"
getBuilder() - Static method in class org.apache.tika.config.TikaConfig
 
getByteCount() - Method in class org.apache.tika.io.CountingInputStream
The number of bytes that have passed through this stream.
getByteCount() - Method in class org.apache.tika.sax.SecureContentHandler
 
getCause() - Method in exception org.apache.tika.io.TaggedIOException
Returns the wrapped exception.
getCause() - Method in exception org.apache.tika.sax.TaggedSAXException
Returns the wrapped exception.
getCharMap() - Static method in class org.apache.tika.parser.strings.Latin1StringsParser
Populates the valid ISO-8859-1 character map.
getCharset() - Method in class org.apache.tika.detect.AutoDetectReader
 
getCharsetICU - Static variable in class org.apache.tika.utils.CharsetUtils
 
getChild(Element, String) - Static method in class org.apache.tika.config.TikaConfig
 
getChildTypes(MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
Returns the set of known children of the given canonical media type
getChoices() - Method in class org.apache.tika.metadata.Property
Returns the (immutable) set of choices for the values of this property.
getClauses() - Method in class org.apache.tika.mime.MimeTypesReader.ClauseRecord
 
getCommand() - Method in class org.apache.tika.embedder.ExternalEmbedder
Gets the command to be run.
getCommand() - Method in class org.apache.tika.parser.external.ExternalParser
 
getCommand() - Method in class org.apache.tika.parser.gdal.GDALParser
 
getCommandAppendOperator() - Method in class org.apache.tika.embedder.ExternalEmbedder
Gets the operator to append rather than replace a value for the command line tool, i.e.
getCommandAssignmentDelimeter() - Method in class org.apache.tika.embedder.ExternalEmbedder
Gets the delimiter for multiple assignments for the command line tool, i.e.
getCommandAssignmentOperator() - Method in class org.apache.tika.embedder.ExternalEmbedder
Gets the assignment operator for the command line tool, i.e.
getCommandMetadataSegments(Metadata) - Method in class org.apache.tika.embedder.ExternalEmbedder
Constructs a collection of command line arguments responsible for setting individual metadata fields based on the given metadata.
getConfigInputStream(String, ServiceLoader) - Static method in class org.apache.tika.config.TikaConfig
 
getContentHandler(ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.xml.DcXMLParser
 
getContentHandler(ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.xml.FictionBookParser
 
getContentHandler(ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.xml.XMLParser
 
getContentParser() - Method in class org.apache.tika.parser.epub.EpubParser
 
getContextClassLoader() - Static method in class org.apache.tika.config.ServiceLoader
Returns the context class loader of the current thread.
getCount() - Method in class org.apache.tika.io.CountingInputStream
The number of bytes that have passed through this stream.
getCount() - Method in class org.apache.tika.language.LanguageProfile
 
getCount(String) - Method in class org.apache.tika.language.LanguageProfile
 
getCount() - Method in class org.apache.tika.language.LanguageProfilerBuilder.NGramEntry
Returns the number of occurrences of this ngram in its profile
getDate(Property) - Method in class org.apache.tika.metadata.Metadata
Returns the value of the identified Date based metadata property.
getDecorationName() - Method in class org.apache.tika.parser.ParserDecorator
 
getDefaultConfig() - Static method in class org.apache.tika.config.TikaConfig
Provides a default configuration (TikaConfig).
getDefaultDetector(MimeTypes, ServiceLoader) - Static method in class org.apache.tika.config.TikaConfig
 
getDefaultDetectors(MimeTypes, ServiceLoader) - Static method in class org.apache.tika.detect.DefaultDetector
Finds all statically loadable detectors and sort the list by name, rather than discovery order.
getDefaultDetectors(ProbabilisticMimeDetectionSelector, ServiceLoader) - Static method in class org.apache.tika.detect.DefaultProbDetector
 
getDefaultExecutorService() - Static method in class org.apache.tika.config.TikaConfig
 
getDefaultMimeTypes(ClassLoader) - Static method in class org.apache.tika.config.TikaConfig
 
getDefaultMimeTypes() - Static method in class org.apache.tika.mime.MimeTypes
Get the default MimeTypes.
getDefaultMimeTypes(ClassLoader) - Static method in class org.apache.tika.mime.MimeTypes
Get the default MimeTypes.
getDefaultParser(MimeTypes, ServiceLoader) - Static method in class org.apache.tika.config.TikaConfig
 
getDefaultParsers(ServiceLoader) - Static method in class org.apache.tika.parser.DefaultParser
Finds all statically loadable parsers and sort the list by name, rather than discovery order.
getDefaultRegistry() - Static method in class org.apache.tika.mime.MediaTypeRegistry
Returns the built-in media type registry included in Tika.
getDefaultTranslator(ServiceLoader) - Static method in class org.apache.tika.config.TikaConfig
 
getDefaultTranslators(ServiceLoader) - Static method in class org.apache.tika.language.translate.DefaultTranslator
Finds all statically loadable translators and sort the list by name, rather than discovery order.
getDelegateParser(ParseContext) - Method in class org.apache.tika.parser.DelegatingParser
Returns the parser instance to which parsing tasks should be delegated.
getDescription() - Method in class org.apache.tika.mime.MimeType
Returns the description of this media type.
getDetector() - Method in class org.apache.tika.config.TikaConfig
Returns the configured detector instance.
getDetector() - Method in class org.apache.tika.mime.MagicMatch
 
getDetector() - Method in class org.apache.tika.parser.AutoDetectParser
Returns the type detector used by this parser to auto-detect the type of a document.
getDetector() - Method in class org.apache.tika.Tika
Returns the detector instance used by this facade.
getDetectors() - Method in class org.apache.tika.detect.CompositeDetector
Returns the component detectors.
getDetectors() - Method in class org.apache.tika.detect.DefaultDetector
 
getDetectors() - Method in class org.apache.tika.detect.DefaultProbDetector
 
getDublinCoreHandler(Metadata, Property, String) - Static method in class org.apache.tika.parser.xml.DcXMLParser
 
getEncoding() - Method in class org.apache.tika.parser.strings.StringsConfig
Returns the character encoding of the strings that are to be found.
getEndDocumentWasCalled() - Method in class org.apache.tika.sax.EndDocumentShieldingContentHandler
 
getErrors() - Static method in class org.apache.tika.language.LanguageIdentifier
Returns a string of error messages related to initializing langauge profiles
getExecutorService() - Method in class org.apache.tika.config.TikaConfig
 
getExtension() - Method in class org.apache.tika.mime.MimeType
Returns the preferred file extension of this type, or an empty string if no extensions are known.
getExtensions() - Method in class org.apache.tika.mime.MimeType
Returns the list of all known file extensions of this media type.
getFallback() - Method in class org.apache.tika.parser.CompositeParser
Returns the fallback parser.
getFile() - Method in class org.apache.tika.io.TikaInputStream
 
getFileChannel() - Method in class org.apache.tika.io.TikaInputStream
 
getFilePath() - Method in class org.apache.tika.parser.strings.FileConfig
Returns the "file" installation folder.
getFileProg() - Static method in class org.apache.tika.parser.strings.StringsParser
 
getFilteredStackTrace(Throwable) - Static method in class org.apache.tika.utils.ExceptionUtils
Simple util to get stack trace.
getFirstAvailable(ServiceLoader) - Static method in class org.apache.tika.language.translate.DefaultTranslator
Returns the first available translator, or null if none are
getFormatName() - Method in class org.apache.tika.parser.iptc.IptcAnpaParser
 
getFrequency() - Method in class org.apache.tika.language.LanguageProfilerBuilder.NGramEntry
Returns the frequency of this ngram in its profile
getInputStream() - Method in class org.apache.tika.fork.MemoryURLConnection
 
getInt(Property) - Method in class org.apache.tika.metadata.Metadata
Returns the value of the identified Integer based metadata property.
getIntBE(byte[]) - Static method in class org.apache.tika.io.EndianUtils
Get a BE int value from the beginning of a byte array
getIntBE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
Get a BE int value from a byte array
getIntLE(byte[]) - Static method in class org.apache.tika.io.EndianUtils
Get a LE int value from the beginning of a byte array
getIntLE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
Get a LE int value from a byte array
getJavaCommand() - Method in class org.apache.tika.fork.ForkParser
Deprecated.
since 1.8
getJavaCommandAsList() - Method in class org.apache.tika.fork.ForkParser
Returns the command used to start the forked server process.
getLanguage() - Method in class org.apache.tika.language.LanguageIdentifier
Gets the identified language
getLanguage() - Method in class org.apache.tika.language.ProfilingHandler
Returns the language that best matches the current state of the language profile.
getLanguage() - Method in class org.apache.tika.language.ProfilingWriter
Returns the language that best matches the current state of the language profile.
getLength() - Method in class org.apache.tika.detect.MagicDetector
 
getLength() - Method in class org.apache.tika.io.TikaInputStream
Returns the length (in bytes) of this stream.
getLink() - Method in class org.apache.tika.sax.LinkBuilder
 
getLink(boolean) - Method in class org.apache.tika.sax.LinkBuilder
 
getLinks() - Method in class org.apache.tika.mime.MimeType
Get a list of links to help document this mime type
getLinks() - Method in class org.apache.tika.sax.LinkContentHandler
Returns the list of collected links.
getLoaderClass() - Method in class org.apache.tika.config.TikaConfig.DetectorXmlLoader
 
getLoaderClass() - Method in class org.apache.tika.config.TikaConfig.ExecutorServiceXmlLoader
 
getLoaderClass() - Method in class org.apache.tika.config.TikaConfig.ParserXmlLoader
 
getLoaderClass() - Method in class org.apache.tika.config.TikaConfig.TranslatorXmlLoader
 
getLoaderClass() - Method in class org.apache.tika.config.TikaConfig.XmlLoader
 
getLoadErrorHandler() - Method in class org.apache.tika.config.ServiceLoader
Returns the load error handler used by this loader.
getLoaderTagName() - Method in class org.apache.tika.config.TikaConfig.DetectorXmlLoader
 
getLoaderTagName() - Method in class org.apache.tika.config.TikaConfig.ExecutorServiceXmlLoader
 
getLoaderTagName() - Method in class org.apache.tika.config.TikaConfig.ParserXmlLoader
 
getLoaderTagName() - Method in class org.apache.tika.config.TikaConfig.TranslatorXmlLoader
 
getLoaderTagName() - Method in class org.apache.tika.config.TikaConfig.XmlLoader
 
getLocalName() - Method in class org.apache.tika.mime.MimeType.RootXML
 
getLongLE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
Get a LE long value from a byte array
getMagics() - Method in class org.apache.tika.mime.MimeType
 
getMappedTagName() - Method in class org.apache.tika.sax.ElementMappingContentHandler.TargetElement
 
getMaximumCompressionRatio() - Method in class org.apache.tika.sax.SecureContentHandler
Returns the maximum compression ratio.
getMaximumDepth() - Method in class org.apache.tika.sax.SecureContentHandler
Returns the maximum XML element nesting level.
getMaximumPackageEntryDepth() - Method in class org.apache.tika.sax.SecureContentHandler
Returns the maximum package entry nesting level.
getMaxStringLength() - Method in class org.apache.tika.Tika
Returns the maximum length of strings returned by the parseToString methods.
getMediaTypeRegistry() - Method in class org.apache.tika.config.TikaConfig
 
getMediaTypeRegistry() - Method in class org.apache.tika.mime.MimeTypes
 
getMediaTypeRegistry() - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
 
getMediaTypeRegistry() - Method in class org.apache.tika.parser.CompositeParser
Returns the media type registry used to infer type relationships.
getMetadata(String) - Static method in class org.apache.tika.parser.ParsingReader
Utility method that returns a Metadata instance for a document with the given name.
getMetadata() - Method in class org.apache.tika.parser.RecursiveParserWrapper
The first element in the returned list represents the data from the outer container file.
getMetadataCommandArguments() - Method in class org.apache.tika.embedder.ExternalEmbedder
Gets the map of Metadata keys to command line parameters.
getMetadataExtractionPatterns() - Method in class org.apache.tika.parser.external.ExternalParser
 
getMetadataKey() - Method in enum org.apache.tika.parser.utils.CommonsDigester.DigestAlgorithm
 
getMetaParser() - Method in class org.apache.tika.parser.epub.EpubParser
 
getMethod(Object, String) - Method in class org.apache.tika.fork.ForkServer
 
getMimeRepository() - Method in class org.apache.tika.config.TikaConfig
 
getMimeType(String) - Method in class org.apache.tika.mime.MimeTypes
Deprecated.
getMimeType(File) - Method in class org.apache.tika.mime.MimeTypes
Deprecated.
Use Tika.detect(File) instead
getMimeType(byte[]) - Method in class org.apache.tika.mime.MimeTypes
Returns the MIME type that best matches the given first few bytes of a document stream.
getMinLength() - Method in class org.apache.tika.detect.TrainedModelDetector
 
getMinLength() - Method in class org.apache.tika.mime.MimeType
 
getMinLength() - Method in class org.apache.tika.mime.MimeTypes
Return the minimum length of data to provide to analyzing methods based on the document's content in order to check all the known MimeTypes.
getMinLength() - Method in class org.apache.tika.parser.strings.StringsConfig
Returns the minimum sequence length (characters) to print.
getMinSize() - Method in class org.apache.tika.parser.strings.Latin1StringsParser
Returns the minimum size of a character sequence to be extracted.
getName(String) - Static method in class org.apache.tika.io.FilenameUtils
This is a duplication of the algorithm and functionality available in commons io FilenameUtils.
getName() - Method in class org.apache.tika.language.LanguageProfilerBuilder
 
getName() - Method in class org.apache.tika.metadata.Property
 
getName() - Method in class org.apache.tika.mime.MimeType
Returns the name of this media type.
getNames(Metadata) - Method in class org.apache.tika.metadata.serialization.JsonMetadataBase.SortedJsonMetadataSerializer
 
getNames(Metadata) - Method in class org.apache.tika.metadata.serialization.JsonMetadataSerializer
Override to get a custom sort order or to filter names.
getNamespace() - Method in enum org.apache.tika.parser.iwork.IWorkPackageParser.IWORKDocumentType
 
getNameSpaceURI() - Method in class org.apache.tika.mime.MimeType.RootXML
 
getNewContentHandler() - Method in class org.apache.tika.sax.BasicContentHandlerFactory
 
getNewContentHandler(OutputStream, String) - Method in class org.apache.tika.sax.BasicContentHandlerFactory
 
getNewContentHandler() - Method in interface org.apache.tika.sax.ContentHandlerFactory
 
getNewContentHandler(OutputStream, String) - Method in interface org.apache.tika.sax.ContentHandlerFactory
 
getNumOfHidden() - Method in class org.apache.tika.detect.NNTrainedModelBuilder
 
getNumOfInputs() - Method in class org.apache.tika.detect.NNTrainedModelBuilder
 
getNumOfOutputs() - Method in class org.apache.tika.detect.NNTrainedModelBuilder
 
getOpenContainer() - Method in class org.apache.tika.io.TikaInputStream
Returns the open container object, such as a POIFS FileSystem in the event of an OLE2 document being detected and processed by the OLE2 detector.
getOutputThreshold() - Method in class org.apache.tika.sax.SecureContentHandler
Returns the configured output threshold.
getParameters() - Method in class org.apache.tika.mime.MediaType
Returns an immutable sorted map of the parameters of this media type.
getParams() - Method in class org.apache.tika.detect.NNTrainedModelBuilder
 
getParentTagName() - Method in class org.apache.tika.config.TikaConfig.DetectorXmlLoader
 
getParentTagName() - Method in class org.apache.tika.config.TikaConfig.ExecutorServiceXmlLoader
 
getParentTagName() - Method in class org.apache.tika.config.TikaConfig.ParserXmlLoader
 
getParentTagName() - Method in class org.apache.tika.config.TikaConfig.TranslatorXmlLoader
 
getParentTagName() - Method in class org.apache.tika.config.TikaConfig.XmlLoader
 
getParser(MediaType) - Method in class org.apache.tika.config.TikaConfig
Deprecated.
Use the TikaConfig.getParser() method instead
getParser() - Method in class org.apache.tika.config.TikaConfig
Returns the configured parser instance.
getParser(Metadata) - Method in class org.apache.tika.parser.CompositeParser
Returns the parser that best matches the given metadata.
getParser(Metadata, ParseContext) - Method in class org.apache.tika.parser.CompositeParser
 
getParser() - Method in class org.apache.tika.Tika
Returns the parser instance used by this facade.
getParsers(ParseContext) - Method in class org.apache.tika.parser.CompositeParser
 
getParsers() - Method in class org.apache.tika.parser.CompositeParser
Returns the component parsers.
getParsers(ParseContext) - Method in class org.apache.tika.parser.DefaultParser
 
getPart() - Method in enum org.apache.tika.parser.iwork.IWorkPackageParser.IWORKDocumentType
 
getPassword(Metadata) - Method in interface org.apache.tika.parser.PasswordProvider
Looks up the password for a document with the given metadata, and returns it for the Parser.
getPath() - Method in class org.apache.tika.io.TikaInputStream
 
getPatterns() - Method in class org.apache.tika.parser.gdal.GDALParser
 
getPoolSize() - Method in class org.apache.tika.fork.ForkParser
Returns the size of the process pool.
getPosition() - Method in class org.apache.tika.io.NullInputStream
Return the current position.
getPosition() - Method in class org.apache.tika.io.TikaInputStream
Returns the current position within the stream.
getPrefix(String) - Method in class org.apache.tika.sax.ToXMLContentHandler.ElementInfo
 
getPrimaryProperty() - Method in class org.apache.tika.metadata.Property
Gets the primary property for a composite property
getPriority() - Method in class org.apache.tika.mime.Magic
 
getProfile() - Method in class org.apache.tika.language.LanguageProfilerBuilder.NGramEntry
Returns the profile associated to this ngram
getProfile() - Method in class org.apache.tika.language.ProfilingHandler
Returns the language profile being built by this content handler.
getProfile() - Method in class org.apache.tika.language.ProfilingWriter
Returns the language profile being built by this writer.
getProperties(String) - Static method in class org.apache.tika.metadata.Property
 
getPropertyType(String) - Static method in class org.apache.tika.metadata.Property
Get the type of a property
getPropertyType() - Method in class org.apache.tika.metadata.Property
 
getQName(String, String) - Method in class org.apache.tika.sax.ToXMLContentHandler.ElementInfo
 
getQNameAsString(QName) - Static method in class org.apache.tika.sax.ElementMappingContentHandler
 
getRegisteredMimeType(String) - Method in class org.apache.tika.mime.MimeTypes
Returns the registered, normalised media type with the given name (or alias).
getRel() - Method in class org.apache.tika.sax.Link
 
getResource(Class<T>) - Method in class org.apache.tika.io.TemporaryResources
Returns the latest of the tracked resources that implements or extends the given interface or class.
getResourceAsStream(String) - Method in class org.apache.tika.config.ServiceLoader
Returns an input stream for reading the specified resource from the configured class loader.
getResourceName(Metadata) - Method in class org.apache.tika.parser.RecursiveParserWrapper
 
getSAXParser() - Method in class org.apache.tika.parser.ParseContext
Returns the SAX parser specified in this parsing context.
getSAXParserFactory() - Method in class org.apache.tika.parser.ParseContext
Returns the SAX parser factory specified in this parsing context.
getSecondaryExtractProperties() - Method in class org.apache.tika.metadata.Property
Gets the secondary properties for a composite property
getSection(InputStream, String) - Method in class org.apache.tika.parser.iptc.IptcAnpaParser
 
getSection(InputStream, int, byte, byte, boolean) - Method in class org.apache.tika.parser.iptc.IptcAnpaParser
 
getSeq() - Method in class org.apache.tika.language.LanguageProfilerBuilder.NGramEntry
Returns the sequence of characters of this ngram
getServiceClass(Class<T>, String) - Method in class org.apache.tika.config.ServiceLoader
Loads and returns the named service class that's expected to implement the given interface.
getServiceLoader() - Method in class org.apache.tika.config.TikaConfig
 
getShortBE(byte[]) - Static method in class org.apache.tika.io.EndianUtils
Get a BE short value from the beginning of a byte array
getShortBE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
Get a BE short value from a byte array
getShortLE(byte[]) - Static method in class org.apache.tika.io.EndianUtils
Get a LE short value from the beginning of a byte array
getShortLE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
Get a LE short value from a byte array
getSimilarity(LanguageProfilerBuilder) - Method in class org.apache.tika.language.LanguageProfilerBuilder
Calculates a score how well NGramProfiles match each other
getSize() - Method in class org.apache.tika.io.NullInputStream
Return the size this InputStream emulates.
getSize() - Method in class org.apache.tika.utils.RereadableInputStream
Returns the number of bytes read from the original stream.
getSorted() - Method in class org.apache.tika.language.LanguageProfilerBuilder
Returns a sorted list of ngrams (sort done by 1.
getSortedNgrams() - Method in class org.apache.tika.language.LanguageProfile.Interleaved
 
getStackTrace(Throwable) - Static method in class org.apache.tika.utils.ExceptionUtils
Get the full stacktrace as a string
getString(Element) - Static method in class org.apache.tika.parser.external.ExternalParsersConfigReader
 
getStringsPath() - Method in class org.apache.tika.parser.strings.StringsConfig
Returns the "strings" installation folder.
getStringsProg() - Static method in class org.apache.tika.parser.strings.StringsParser
 
getSubtype() - Method in class org.apache.tika.mime.MediaType
Return the Sub-Type of the MediaType, such as "plain" for "text/plain"
getSupertype(MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
Returns the supertype of the given type.
getSupportedEmbedTypes(ParseContext) - Method in interface org.apache.tika.embedder.Embedder
Returns the set of media types supported by this embedder when used with the given parse context.
getSupportedEmbedTypes(ParseContext) - Method in class org.apache.tika.embedder.ExternalEmbedder
 
getSupportedEmbedTypes() - Method in class org.apache.tika.embedder.ExternalEmbedder
 
getSupportedLanguages() - Static method in class org.apache.tika.language.LanguageIdentifier
Returns what languages are supported for language identification
getSupportedTypes(ParseContext) - Method in class org.apache.tika.extractor.ParserContainerExtractor.RecursiveParser
 
getSupportedTypes(ParseContext) - Method in class org.apache.tika.fork.ForkParser
 
getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.audio.AudioParser
 
getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.audio.MidiParser
 
getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.CompositeParser
 
getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.CryptoParser
 
getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.DelegatingParser
 
getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.EmptyParser
 
getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.envi.EnviHeaderParser
 
getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.epub.EpubContentParser
 
getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.epub.EpubParser
 
getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.ErrorParser
 
getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.external.ExternalParser
 
getSupportedTypes() - Method in class org.apache.tika.parser.external.ExternalParser
 
getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.feed.FeedParser
 
getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.gdal.GDALParser
 
getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.iptc.IptcAnpaParser
 
getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.iwork.IWorkPackageParser
 
getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.NetworkParser
 
getSupportedTypes(ParseContext) - Method in interface org.apache.tika.parser.Parser
Returns the set of media types supported by this parser when used with the given parse context.
getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.ParserDecorator
Delegates the method call to the decorated parser.
getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.RecursiveParserWrapper
 
getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.strings.Latin1StringsParser
 
getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.strings.StringsParser
 
getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.video.FLVParser
 
getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.xml.FictionBookParser
 
getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.xml.XMLParser
 
getTag() - Method in exception org.apache.tika.io.TaggedIOException
Returns the object reference used as the tag this exception.
getTag() - Method in exception org.apache.tika.sax.TaggedSAXException
Returns the object reference used as the tag this exception.
getTail() - Method in class org.apache.tika.io.TailStream
Returns an array with the last data read from the underlying stream.
getText(Node) - Static method in class org.apache.tika.config.TikaConfig
 
getText() - Method in class org.apache.tika.sax.Link
 
getTimeout() - Method in class org.apache.tika.parser.strings.StringsConfig
Returns the maximum time (in seconds) to wait for the "strings" command to terminate.
getTitle() - Method in class org.apache.tika.sax.Link
 
getTopLevelElementChildren(Element, String, String) - Static method in class org.apache.tika.config.TikaConfig
 
getTranslator() - Method in class org.apache.tika.config.TikaConfig
Returns the configured translator instance.
getTranslator() - Method in class org.apache.tika.language.translate.DefaultTranslator
Returns the current translator
getTranslator() - Method in class org.apache.tika.Tika
Returns the translator instance used by this facade.
getTranslators() - Method in class org.apache.tika.language.translate.DefaultTranslator
Returns all available translators
getType() - Method in class org.apache.tika.detect.NNTrainedModelBuilder
 
getType() - Method in class org.apache.tika.mime.Magic
 
getType() - Method in class org.apache.tika.mime.MediaType
Return the Type of the MediaType, such as "text" for "text/plain"
getType() - Method in class org.apache.tika.mime.MimeType
Returns the normalized media type name.
getType() - Method in class org.apache.tika.mime.MimeType.RootXML
 
getType() - Method in enum org.apache.tika.parser.iwork.IWorkPackageParser.IWORKDocumentType
 
getType() - Method in class org.apache.tika.sax.Link
 
getTypes() - Method in class org.apache.tika.mime.MediaTypeRegistry
Returns the set of all known canonical media types.
getTypes() - Static method in class org.apache.tika.parser.strings.Latin1StringsParser
Returns the set of supported types.
getUByte(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
get the unsigned value of a byte.
getUIntBE(byte[]) - Static method in class org.apache.tika.io.EndianUtils
Get a BE unsigned int value from a byte array
getUIntBE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
Get a BE unsigned int value from a byte array
getUIntLE(byte[]) - Static method in class org.apache.tika.io.EndianUtils
Get a LE unsigned int value from a byte array
getUIntLE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
Get a LE unsigned int value from a byte array
getUniformTypeIdentifier() - Method in class org.apache.tika.mime.MimeType
Get the UTI for this mime type.
getUri() - Method in class org.apache.tika.sax.Link
 
getUShortBE(byte[]) - Static method in class org.apache.tika.io.EndianUtils
Get a BE unsigned short value from the beginning of a byte array
getUShortBE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
Get a BE unsigned short value from a byte array
getUShortLE(byte[]) - Static method in class org.apache.tika.io.EndianUtils
Get a LE unsigned short value from the beginning of a byte array
getUShortLE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
Get a LE unsigned short value from a byte array
getValue() - Method in class org.apache.tika.fork.ForkServer
 
getValues(Property) - Method in class org.apache.tika.metadata.Metadata
Get the values associated to a metadata name.
getValues(String) - Method in class org.apache.tika.metadata.Metadata
Get the values associated to a metadata name.
getValueType() - Method in class org.apache.tika.metadata.Property
 
getWrappedParser() - Method in class org.apache.tika.parser.ParserDecorator
Gets the parser wrapped by this ParserDecorator
ghostText - Variable in class org.apache.tika.parser.iwork.PagesContentHandler
 
GLOB_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
 
globs - Variable in class org.apache.tika.mime.Patterns
Index of generic glob patterns, sorted by length.
GSON - Static variable in class org.apache.tika.metadata.serialization.JsonMetadata
 
GSON - Static variable in class org.apache.tika.metadata.serialization.JsonMetadataList
 

H

handle(String, MediaType, InputStream) - Method in interface org.apache.tika.extractor.EmbeddedResourceHandler
Called to process an embedded resource within the container.
handleException(SAXException) - Method in class org.apache.tika.sax.ContentHandlerDecorator
Handle any exceptions thrown by methods in this class.
handleException(SAXException) - Method in class org.apache.tika.sax.TaggedContentHandler
Tags any SAXExceptions thrown, wrapping and re-throwing.
handleGlobError(MimeType, String, MimeTypeException, String, Attributes) - Method in class org.apache.tika.mime.MimeTypesReader
 
handleIOException(IOException) - Method in class org.apache.tika.io.ProxyInputStream
Handle any IOExceptions thrown.
handleIOException(IOException) - Method in class org.apache.tika.io.TaggedInputStream
Tags any IOExceptions thrown, wrapping and re-throwing.
handleLoadError(String, Throwable) - Method in interface org.apache.tika.config.LoadErrorHandler
Handles a problem encountered when trying to load the specified service class.
handleMimeError(String, MimeTypeException, String, Attributes) - Method in class org.apache.tika.mime.MimeTypesReader
 
handler - Variable in class org.apache.tika.config.ServiceLoader
 
handler - Variable in class org.apache.tika.extractor.ParserContainerExtractor.RecursiveParser
 
handler - Variable in class org.apache.tika.fork.ContentHandlerResource
 
handler - Variable in class org.apache.tika.parser.xml.FictionBookParser.BinaryElementsDataHandler
 
handler - Variable in class org.apache.tika.sax.ContentHandlerDecorator
Decorated SAX event handler.
HANDLER_TYPE() - Constructor for enum org.apache.tika.sax.BasicContentHandlerFactory.HANDLER_TYPE
 
handlers - Variable in class org.apache.tika.sax.TeeContentHandler
 
hasAutoPageNumber - Variable in class org.apache.tika.parser.iwork.PagesContentHandler.HeaderFooter
 
hasErrors() - Static method in class org.apache.tika.language.LanguageIdentifier
Tests whether there were errors initializing language config
hasFile() - Method in class org.apache.tika.io.TikaInputStream
 
hasFile(FileConfig) - Method in class org.apache.tika.parser.strings.StringsParser
Checks if the "file" command is supported.
hashCode() - Method in class org.apache.tika.language.LanguageProfilerBuilder.NGramEntry
 
hashCode() - Method in class org.apache.tika.metadata.Property
 
hashCode() - Method in class org.apache.tika.mime.Magic
 
hashCode() - Method in class org.apache.tika.mime.MediaType
 
hashCode() - Method in class org.apache.tika.mime.MimeType
 
hasHeadings(String, String[]) - Method in class org.apache.tika.parser.gdal.GDALParser
 
hasHitBound() - Method in class org.apache.tika.parser.utils.CommonsDigester.SimpleBoundedInputStream
 
hasLength() - Method in class org.apache.tika.io.TikaInputStream
 
hasMagic() - Method in class org.apache.tika.mime.MimeType
 
hasNext() - Method in class org.apache.tika.language.LanguageProfile.Interleaved.Entry
 
hasNgram() - Method in class org.apache.tika.language.LanguageProfile.Interleaved.Entry
 
hasParameters() - Method in class org.apache.tika.mime.MediaType
Checks whether this media type contains parameters.
hasRootXML() - Method in class org.apache.tika.mime.MimeType
 
hasStrings(StringsConfig) - Method in class org.apache.tika.parser.strings.StringsParser
Checks if the "strings" command is supported.
HEAD - Static variable in class org.apache.tika.sax.XHTMLContentHandler
The elements that are in the section.
headEnded - Variable in class org.apache.tika.sax.XHTMLContentHandler
 
HeaderFooter(String) - Constructor for class org.apache.tika.parser.iwork.PagesContentHandler.HeaderFooter
 
headers - Variable in class org.apache.tika.parser.iwork.PagesContentHandler
 
HEADLINE - Static variable in interface org.apache.tika.metadata.IPTC
A brief synopsis of the caption.
HEADLINE - Static variable in interface org.apache.tika.metadata.Photoshop
 
headStarted - Variable in class org.apache.tika.sax.XHTMLContentHandler
Flags to indicate whether the document head element has been started/ended.
HEX_CHARS - Static variable in class org.apache.tika.mime.HexCoDec
 
hexCharToNibble(char) - Static method in class org.apache.tika.mime.HexCoDec
Internal method to turn a hex char into a nibble.
HexCoDec - Class in org.apache.tika.mime
A set of Hex encoding and decoding utility methods.
HexCoDec() - Constructor for class org.apache.tika.mime.HexCoDec
 
HIDDEN_SLIDES - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
 
HISTORY - Static variable in interface org.apache.tika.metadata.ClimateForcast
 
hitBound - Variable in class org.apache.tika.parser.utils.CommonsDigester.SimpleBoundedInputStream
 
hitMaxEmbeddedResources - Variable in class org.apache.tika.parser.RecursiveParserWrapper
 
HttpHeaders - Interface in org.apache.tika.metadata
A collection of HTTP header names.
HY - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 

I

i2r(StringBuffer, int, String, int, String, int, String, int, String, int) - Static method in class org.apache.tika.parser.iwork.AutoPageNumberUtils
 
IDENTIFIER - Static variable in interface org.apache.tika.metadata.DublinCore
Recommended best practice is to identify the resource by means of a string or number conforming to a formal identification system.
IDENTIFIER - Static variable in class org.apache.tika.metadata.Metadata
Deprecated.
use TikaCoreProperties#IDENTIFIER
IDENTIFIER - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 
IDENTIFIER - Static variable in interface org.apache.tika.metadata.XMP
An unordered array of text strings that unambiguously identify the resource within a given context.
identifyPart(String) - Method in class org.apache.tika.parser.iwork.PagesContentHandler.HeaderFooter
 
identifyStaticServiceProviders(Class<T>) - Method in class org.apache.tika.config.ServiceLoader
Returns the defined static service providers of the given type, without attempting to load them.
IGNORABLE_WHITESPACE - Static variable in class org.apache.tika.fork.ContentHandlerProxy
 
ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.fork.ContentHandlerProxy
 
ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.parser.xml.ElementMetadataHandler
 
ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.parser.xml.FictionBookParser.BinaryElementsDataHandler
 
ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.ContentHandlerDecorator
 
ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.DIFContentHandler
 
ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.LinkContentHandler
 
ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.SafeContentHandler
 
ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.SecureContentHandler
 
ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.TeeContentHandler
 
ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.TextContentHandler
 
ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.ToTextContentHandler
Writes the given ignorable characters to the given character stream.
ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.WriteOutContentHandler
 
ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.xpath.MatchingContentHandler
 
ignorableWhitespaceOutput - Variable in class org.apache.tika.sax.SafeContentHandler
Output through the ContentHandler.ignorableWhitespace(char[], int, int) method of the decorated content handler.
IGNORE - Static variable in interface org.apache.tika.config.LoadErrorHandler
Strategy that simply ignores all problems.
ignoreStream(InputStream) - Method in class org.apache.tika.parser.external.ExternalParser
Starts a thread that reads and discards the contents of the standard stream of the given process.
image(String) - Static method in class org.apache.tika.mime.MediaType
 
IMAGE_COUNT - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
IMAGE_COUNT - Static variable in interface org.apache.tika.metadata.Office
The number of Images in the document
IMAGE_CREATOR - Static variable in interface org.apache.tika.metadata.IPTC
Creator or creators of the image.
IMAGE_CREATOR_ID - Static variable in interface org.apache.tika.metadata.IPTC
The ID of the creator or creators of the image.
IMAGE_CREATOR_ID_WRONG_CASE - Static variable in interface org.apache.tika.metadata.IPTC
Deprecated.
IMAGE_CREATOR_NAME - Static variable in interface org.apache.tika.metadata.IPTC
The name of the creator or creators of the image.
IMAGE_LENGTH - Static variable in interface org.apache.tika.metadata.TIFF
"Image height in pixels."
IMAGE_REGISTRY_ENTRY - Static variable in interface org.apache.tika.metadata.IPTC
Both a Registry Item Id and a Registry Organisation Id to record any registration of this item with a registry.
IMAGE_SUPPLIER - Static variable in interface org.apache.tika.metadata.IPTC
Identifies the most recent supplier of the item, who is not necessarily its owner or creator.
IMAGE_SUPPLIER_ID - Static variable in interface org.apache.tika.metadata.IPTC
Identifies the most recent supplier of the item, who is not necessarily its owner or creator.
IMAGE_SUPPLIER_ID_WRONG_CASE - Static variable in interface org.apache.tika.metadata.IPTC
Deprecated.
IMAGE_SUPPLIER_IMAGE_ID - Static variable in interface org.apache.tika.metadata.IPTC
Optional identifier assigned by the Image Supplier to the image.
IMAGE_SUPPLIER_NAME - Static variable in interface org.apache.tika.metadata.IPTC
Identifies the most recent supplier of the item, who is not necessarily its owner or creator.
IMAGE_WIDTH - Static variable in interface org.apache.tika.metadata.TIFF
"Image width in pixels."
in - Variable in class org.apache.tika.parser.utils.CommonsDigester.SimpleBoundedInputStream
 
inBody - Variable in class org.apache.tika.parser.iwork.KeynoteContentHandler
 
inc() - Method in class org.apache.tika.language.LanguageProfilerBuilder.NGramEntry
Increments the number of occurrences of this ngram.
INDENT - Static variable in class org.apache.tika.sax.XHTMLContentHandler
The elements that get prepended with the XHTMLContentHandler.TAB character.
inheritance - Variable in class org.apache.tika.mime.MediaTypeRegistry
Known type inheritance relationships.
init(DataInputStream, DataOutputStream) - Method in class org.apache.tika.fork.ClassLoaderProxy
 
init(DataInputStream, DataOutputStream) - Method in class org.apache.tika.fork.ContentHandlerProxy
 
init(DataInputStream, DataOutputStream) - Method in interface org.apache.tika.fork.ForkProxy
 
init(DataInputStream, DataOutputStream) - Method in class org.apache.tika.fork.InputStreamProxy
 
init() - Method in class org.apache.tika.mime.MimeTypes
Called after all configured types have been loaded.
init(InputStream) - Method in class org.apache.tika.parser.strings.StringsConfig
Initializes attributes.
initCommonCharsets(String...) - Static method in class org.apache.tika.utils.CharsetUtils
 
INITIAL_AUTHOR - Static variable in interface org.apache.tika.metadata.Office
Name of the initial creator/author of a document
initializeDefaultProbabilityParameters() - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
Initilize probability parameters with default values;
initProfiles() - Static method in class org.apache.tika.language.LanguageIdentifier
Builds the language profiles.
initProfiles(Map<String, LanguageProfile>) - Static method in class org.apache.tika.language.LanguageIdentifier
Initializes the language profiles from a user supplied initialized Map.
inMetadata - Variable in class org.apache.tika.parser.iwork.KeynoteContentHandler
 
inMetadata - Variable in class org.apache.tika.parser.iwork.NumbersContentHandler
 
inMetaDataAuthors - Variable in class org.apache.tika.parser.iwork.KeynoteContentHandler
 
inMetaDataTitle - Variable in class org.apache.tika.parser.iwork.KeynoteContentHandler
 
inParsableText - Variable in class org.apache.tika.parser.iwork.KeynoteContentHandler
 
inPart - Variable in class org.apache.tika.parser.iwork.PagesContentHandler
 
inPos - Variable in class org.apache.tika.parser.strings.Latin1StringsParser
The position into the input buffer.
input - Variable in class org.apache.tika.fork.ClassLoaderProxy
 
input - Variable in class org.apache.tika.fork.ForkClient
 
input - Variable in class org.apache.tika.fork.ForkServer
Input stream for reading from the parent process
input - Variable in class org.apache.tika.fork.InputStreamProxy
 
input - Variable in class org.apache.tika.parser.NetworkParser.ParsingTask
 
input - Variable in class org.apache.tika.parser.strings.Latin1StringsParser
The input buffer.
INPUT_FILE_TOKEN - Static variable in class org.apache.tika.parser.external.ExternalParser
The token, which if present in the Command string, will be replaced with the input filename.
inputStream - Variable in class org.apache.tika.utils.RereadableInputStream
The inputStream currently being used by this object to read contents; may be the original stream passed in, or a stream that reads the saved copy.
InputStreamProxy - Class in org.apache.tika.fork
 
InputStreamProxy(int) - Constructor for class org.apache.tika.fork.InputStreamProxy
 
InputStreamResource - Class in org.apache.tika.fork
 
InputStreamResource(InputStream) - Constructor for class org.apache.tika.fork.InputStreamResource
 
inSheet - Variable in class org.apache.tika.parser.iwork.NumbersContentHandler
 
inSize - Variable in class org.apache.tika.parser.strings.Latin1StringsParser
The number of bytes into the input buffer.
inSlide - Variable in class org.apache.tika.parser.iwork.KeynoteContentHandler
 
INSTANCE - Static variable in class org.apache.tika.detect.EmptyDetector
Singleton instance of this class.
INSTANCE - Static variable in class org.apache.tika.parser.EmptyParser
Singleton instance of this class.
INSTANCE - Static variable in class org.apache.tika.parser.ErrorParser
Singleton instance of this class.
INSTANCE - Static variable in class org.apache.tika.sax.xpath.AttributeMatcher
 
INSTANCE - Static variable in class org.apache.tika.sax.xpath.ElementMatcher
 
INSTANCE - Static variable in class org.apache.tika.sax.xpath.NodeMatcher
 
INSTANCE - Static variable in class org.apache.tika.sax.xpath.TextMatcher
 
INSTANCEID - Static variable in interface org.apache.tika.metadata.XMPMM
An identifier for a specific incarnation of a resource, updated each time a file is saved.
inStartElement - Variable in class org.apache.tika.sax.ToXMLContentHandler
 
INSTITUTION - Static variable in interface org.apache.tika.metadata.ClimateForcast
 
INSTRUCTIONS - Static variable in interface org.apache.tika.metadata.IPTC
Any of a number of instructions from the provider or creator to the receiver of the item.
INSTRUCTIONS - Static variable in interface org.apache.tika.metadata.Photoshop
 
INSTRUMENT - Static variable in interface org.apache.tika.metadata.XMPDM
"The musical instrument."
inTable - Variable in class org.apache.tika.parser.iwork.NumbersContentHandler
 
INTELLECTUAL_GENRE - Static variable in interface org.apache.tika.metadata.IPTC
Describes the nature, intellectual, artistic or journalistic characteristic of a item, not specifically its content.
interleaved - Variable in class org.apache.tika.language.LanguageProfile
Sorted ngram cache for faster distance calculation.
Interleaved() - Constructor for class org.apache.tika.language.LanguageProfile.Interleaved
 
internal - Variable in class org.apache.tika.metadata.Property
 
internalBoolean(String) - Static method in class org.apache.tika.metadata.Property
 
internalClosedChoise(String, String...) - Static method in class org.apache.tika.metadata.Property
 
internalDate(String) - Static method in class org.apache.tika.metadata.Property
 
internalInteger(String) - Static method in class org.apache.tika.metadata.Property
 
internalIntegerSequence(String) - Static method in class org.apache.tika.metadata.Property
 
internalOpenChoise(String, String...) - Static method in class org.apache.tika.metadata.Property
 
internalProcess(DataInputStream) - Method in class org.apache.tika.fork.ContentHandlerResource
 
internalRational(String) - Static method in class org.apache.tika.metadata.Property
 
internalReal(String) - Static method in class org.apache.tika.metadata.Property
 
internalText(String) - Static method in class org.apache.tika.metadata.Property
 
internalTextBag(String) - Static method in class org.apache.tika.metadata.Property
 
internalURI(String) - Static method in class org.apache.tika.metadata.Property
 
inText - Variable in class org.apache.tika.parser.iwork.NumbersContentHandler
 
inTheme - Variable in class org.apache.tika.parser.iwork.KeynoteContentHandler
 
inTitle - Variable in class org.apache.tika.parser.iwork.KeynoteContentHandler
 
IOExceptionWithCause - Exception in org.apache.tika.io
Subclasses IOException with the Throwable constructors missing before Java 6.
IOExceptionWithCause(String, Throwable) - Constructor for exception org.apache.tika.io.IOExceptionWithCause
Constructs a new instance with the given message and cause.
IOExceptionWithCause(Throwable) - Constructor for exception org.apache.tika.io.IOExceptionWithCause
Constructs a new instance with the given cause.
IOUtils - Class in org.apache.tika.io
General IO stream manipulation utilities.
IOUtils() - Constructor for class org.apache.tika.io.IOUtils
Instances should NOT be constructed in standard programming.
IPTC - Interface in org.apache.tika.metadata
IPTC photo metadata schema.
IPTC_LAST_EDITED - Static variable in interface org.apache.tika.metadata.IPTC
The date and optionally time when any of the IPTC photo metadata fields has been last edited
IptcAnpaParser - Class in org.apache.tika.parser.iptc
Parser for IPTC ANPA New Wire Feeds
IptcAnpaParser() - Constructor for class org.apache.tika.parser.iptc.IptcAnpaParser
 
IS_CONTROL_BYTE - Static variable in class org.apache.tika.detect.TextDetector
Lookup table for all the ASCII/ISO-Latin/UTF-8/etc.
isAnchor() - Method in class org.apache.tika.sax.Link
 
isAvailable() - Method in class org.apache.tika.language.translate.DefaultTranslator
 
isAvailable() - Method in class org.apache.tika.language.translate.EmptyTranslator
 
isAvailable() - Method in interface org.apache.tika.language.translate.Translator
 
isBagless - Variable in class org.apache.tika.parser.xml.ElementMetadataHandler
Whether or not the value was found in a standard element structure or inside a bag.
isCausedBy(SecureContentHandler) - Method in exception org.apache.tika.sax.SecureContentHandler.SecureSAXException
 
isCauseOf(IOException) - Method in class org.apache.tika.io.TaggedInputStream
Tests if the given exception was caused by this stream.
isCauseOf(SAXException) - Method in class org.apache.tika.sax.TaggedContentHandler
Tests if the given exception was caused by this handler.
isChar - Static variable in class org.apache.tika.parser.strings.Latin1StringsParser
The valid ISO-8859-1 character map.
isChar(byte) - Static method in class org.apache.tika.parser.strings.Latin1StringsParser
Tests if the byte is a ISO-8859-1 char.
isComposite(Detector) - Method in class org.apache.tika.config.TikaConfig.DetectorXmlLoader
 
isComposite(Class<? extends Detector>) - Method in class org.apache.tika.config.TikaConfig.DetectorXmlLoader
 
isComposite(ConfigurableThreadPoolExecutor) - Method in class org.apache.tika.config.TikaConfig.ExecutorServiceXmlLoader
 
isComposite(Class<? extends ConfigurableThreadPoolExecutor>) - Method in class org.apache.tika.config.TikaConfig.ExecutorServiceXmlLoader
 
isComposite(Parser) - Method in class org.apache.tika.config.TikaConfig.ParserXmlLoader
 
isComposite(Class<? extends Parser>) - Method in class org.apache.tika.config.TikaConfig.ParserXmlLoader
 
isComposite(Translator) - Method in class org.apache.tika.config.TikaConfig.TranslatorXmlLoader
 
isComposite(Class<? extends Translator>) - Method in class org.apache.tika.config.TikaConfig.TranslatorXmlLoader
 
isComposite(T) - Method in class org.apache.tika.config.TikaConfig.XmlLoader
 
isComposite(Class<? extends T>) - Method in class org.apache.tika.config.TikaConfig.XmlLoader
 
isDynamic() - Method in class org.apache.tika.config.ServiceLoader
Returns if the service loader is static or dynamic
isEmpty(String) - Method in class org.apache.tika.mime.MimeType.RootXML
Checks if a string is null or empty.
isExcluded(Collection<Class<? extends Detector>>, Class<? extends Detector>) - Method in class org.apache.tika.detect.CompositeDetector
 
isExcluded(Collection<Class<? extends Parser>>, Class<? extends Parser>) - Method in class org.apache.tika.parser.CompositeParser
 
isExternal() - Method in class org.apache.tika.metadata.Property
 
isImage() - Method in class org.apache.tika.sax.Link
 
isInstanceOf(Class<?>) - Method in class org.apache.tika.config.ServiceLoader.RankedService
 
isInstanceOf(MediaType, MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
Checks whether the given media type equals the given base type or is a specialization of it.
isInstanceOf(String, MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
Parses and normalises the given media type string and checks whether the result equals the given base type or is a specialization of it.
isInternal() - Method in class org.apache.tika.metadata.Property
 
isInvalid(String) - Method in class org.apache.tika.sax.SafeContentHandler
Checks if the given string contains any invalid XML characters.
isInvalid(int) - Method in class org.apache.tika.sax.SafeContentHandler
Checks whether the given Unicode character is an invalid XML character and should be replaced for output.
isLeaf - Variable in class org.apache.tika.sax.DIFContentHandler
 
isMatchingElement(String, String) - Method in class org.apache.tika.parser.xml.ElementMetadataHandler
 
isMatchingParentElement(String, String) - Method in class org.apache.tika.parser.xml.ElementMetadataHandler
 
isMimetype() - Method in class org.apache.tika.parser.strings.FileConfig
Returns true if the mime option is enabled.
isMostlyAscii() - Method in class org.apache.tika.detect.TextStatistics
Checks whether at least one byte was seen and that the bytes that were seen were mostly plain text (i.e.
isMultiValued(Property) - Method in class org.apache.tika.metadata.Metadata
Returns true if named value is multivalued.
isMultiValued(String) - Method in class org.apache.tika.metadata.Metadata
Returns true if named value is multivalued.
isMultiValuePermitted() - Method in class org.apache.tika.metadata.Property
Is the PropertyType one which accepts multiple values?
iso8601InputFormats - Static variable in class org.apache.tika.metadata.Metadata
Some parsers will have the date as a ISO-8601 string already, and will set that into the Metadata object.
ISO_NAME_PATTERN - Static variable in class org.apache.tika.utils.CharsetUtils
 
ISO_SPEED_RATINGS - Static variable in interface org.apache.tika.metadata.TIFF
"ISO Speed and ISO Latitude of the input device as specified in ISO 12232"
isQuoteAssignmentValues() - Method in class org.apache.tika.embedder.ExternalEmbedder
Gets whether or not to quote assignment values, i.e.
isReasonablyCertain() - Method in class org.apache.tika.language.LanguageIdentifier
Tries to judge whether the identification is certain enough to be trusted.
isRegex - Variable in class org.apache.tika.detect.MagicDetector
True if pattern is a regular expression, false otherwise.
ISREGEX_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
 
isSimpleName(String) - Static method in class org.apache.tika.mime.MediaType
 
isSpecializationOf(MediaType, MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
Checks whether the given media type a is a specialization of a more generic type b.
isStringIgnoreCase - Variable in class org.apache.tika.detect.MagicDetector
True if we're doing a case-insensitive string match, false otherwise.
isSupported(TikaInputStream) - Method in interface org.apache.tika.extractor.ContainerExtractor
Is this Container Extractor able to process the supplied container?
isSupported(TikaInputStream) - Method in class org.apache.tika.extractor.ParserContainerExtractor
 
isSupported(String) - Static method in class org.apache.tika.utils.CharsetUtils
Safely return whether is supported, without throwing exceptions
isSupportedICU - Static variable in class org.apache.tika.utils.CharsetUtils
 
isTikaInputStream(InputStream) - Static method in class org.apache.tika.io.TikaInputStream
Checks whether the given stream is a TikaInputStream instance.
isTitleTagOpen - Variable in class org.apache.tika.sax.ExpandedTitleContentHandler
 
isValid(String) - Static method in class org.apache.tika.mime.MimeType
Checks that the given string is a valid Internet media type name based on rules from RFC 2054 section 5.3.
isWriteLimitReached(Throwable) - Method in class org.apache.tika.parser.RecursiveParserWrapper
Copied/modified from WriteOutContentHandler.
isWriteLimitReached(Throwable) - Method in class org.apache.tika.sax.WriteOutContentHandler
Checks whether the given exception (or any of it's root causes) was thrown by this handler as a signal of reaching the write limit.
IWORK_COMMON_ENTRY - Static variable in class org.apache.tika.parser.iwork.IWorkPackageParser
All iWork files contain one of these, so we can detect based on it
IWORK_CONTENT_ENTRIES - Static variable in class org.apache.tika.parser.iwork.IWorkPackageParser
Which files within an iWork file contain the actual content?
IWORKDocumentType(String, String, MediaType) - Constructor for enum org.apache.tika.parser.iwork.IWorkPackageParser.IWORKDocumentType
 
IWorkPackageParser - Class in org.apache.tika.parser.iwork
A parser for the IWork container files.
IWorkPackageParser() - Constructor for class org.apache.tika.parser.iwork.IWorkPackageParser
 
IWorkPackageParser.IWORKDocumentType - Enum in org.apache.tika.parser.iwork
 

J

jar - Variable in class org.apache.tika.fork.ForkClient
 
java - Variable in class org.apache.tika.fork.ForkParser
Java command line
JOB_ID - Static variable in interface org.apache.tika.metadata.IPTC
Number or identifier for the purpose of improved workflow handling.
JsonMetadata - Class in org.apache.tika.metadata.serialization
 
JsonMetadata() - Constructor for class org.apache.tika.metadata.serialization.JsonMetadata
 
JsonMetadataBase - Class in org.apache.tika.metadata.serialization
 
JsonMetadataBase() - Constructor for class org.apache.tika.metadata.serialization.JsonMetadataBase
 
JsonMetadataBase.SortedJsonMetadataSerializer - Class in org.apache.tika.metadata.serialization
 
JsonMetadataDeserializer - Class in org.apache.tika.metadata.serialization
Deserializer for Metadata If overriding this, remember that this is called from a static context.
JsonMetadataDeserializer() - Constructor for class org.apache.tika.metadata.serialization.JsonMetadataDeserializer
 
JsonMetadataList - Class in org.apache.tika.metadata.serialization
 
JsonMetadataList() - Constructor for class org.apache.tika.metadata.serialization.JsonMetadataList
 
JsonMetadataSerializer - Class in org.apache.tika.metadata.serialization
Serializer for Metadata If overriding this, remember that this is called from a static context.
JsonMetadataSerializer() - Constructor for class org.apache.tika.metadata.serialization.JsonMetadataSerializer
 

K

KEY - Static variable in interface org.apache.tika.metadata.XMPDM
"The audio's musical key."
KeynoteContentHandler - Class in org.apache.tika.parser.iwork
 
KeynoteContentHandler(XHTMLContentHandler, Metadata) - Constructor for class org.apache.tika.parser.iwork.KeynoteContentHandler
 
KEYWORDS - Static variable in interface org.apache.tika.metadata.IPTC
Keywords to express the subject of the content.
KEYWORDS - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
KEYWORDS - Static variable in interface org.apache.tika.metadata.Office
Keywords pertaining to a document.
KEYWORDS - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 

L

LABEL - Static variable in interface org.apache.tika.metadata.XMP
A word or short phrase that identifies a resource as a member of a userdefined collection.
language - Variable in class org.apache.tika.language.LanguageIdentifier
 
LANGUAGE - Static variable in interface org.apache.tika.metadata.DublinCore
A language of the intellectual content of the resource.
LANGUAGE - Static variable in class org.apache.tika.metadata.Metadata
Deprecated.
use TikaCoreProperties#LANGUAGE
LANGUAGE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 
LanguageIdentifier - Class in org.apache.tika.language
Identifier of the language that best matches a given content profile.
LanguageIdentifier(LanguageProfile) - Constructor for class org.apache.tika.language.LanguageIdentifier
Constructs a language identifier based on a LanguageProfile
LanguageIdentifier(String) - Constructor for class org.apache.tika.language.LanguageIdentifier
Constructs a language identifier based on a String of text content
LanguageProfile - Class in org.apache.tika.language
Language profile based on ngram counts.
LanguageProfile(int) - Constructor for class org.apache.tika.language.LanguageProfile
 
LanguageProfile() - Constructor for class org.apache.tika.language.LanguageProfile
 
LanguageProfile(String, int) - Constructor for class org.apache.tika.language.LanguageProfile
 
LanguageProfile(String) - Constructor for class org.apache.tika.language.LanguageProfile
 
LanguageProfile.Counter - Class in org.apache.tika.language
 
LanguageProfile.Interleaved - Class in org.apache.tika.language
 
LanguageProfile.Interleaved.Entry - Class in org.apache.tika.language
 
LanguageProfilerBuilder - Class in org.apache.tika.language
This class runs a ngram analysis over submitted text, results might be used for automatic language identification.
LanguageProfilerBuilder(String, int, int) - Constructor for class org.apache.tika.language.LanguageProfilerBuilder
Constructs a new ngram profile
LanguageProfilerBuilder(String) - Constructor for class org.apache.tika.language.LanguageProfilerBuilder
Constructs a new ngram profile where minlen=3, maxlen=3
LanguageProfilerBuilder.NGramEntry - Class in org.apache.tika.language
Inner class that describes a NGram
LanguageProfilerBuilder.QuickStringBuffer - Class in org.apache.tika.language
 
LANGUAGES_KEY - Static variable in class org.apache.tika.language.LanguageIdentifier
 
LAST_AUTHOR - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
LAST_AUTHOR - Static variable in interface org.apache.tika.metadata.Office
Name of the last (most recent) author of a document
LAST_MODIFIED - Static variable in interface org.apache.tika.metadata.HttpHeaders
 
LAST_MODIFIED_BY - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLCore
The user who performed the last modification.
LAST_PRINTED - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
LAST_PRINTED - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLCore
The date and time of the last printing.
LAST_SAVED - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
lastSeenMark - Variable in class org.apache.tika.parser.iwork.PagesContentHandler.Footnotes
 
Latin1StringsParser - Class in org.apache.tika.parser.strings
Parser to extract printable Latin1 strings from arbitrary files with pure java.
Latin1StringsParser() - Constructor for class org.apache.tika.parser.strings.Latin1StringsParser
 
LATITUDE - Static variable in interface org.apache.tika.metadata.Geographic
The WGS84 Latitude of the Point
LATITUDE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 
lazyCloseStartElement() - Method in class org.apache.tika.sax.ToXMLContentHandler
 
lazyEndHead(boolean) - Method in class org.apache.tika.sax.XHTMLContentHandler
Generates the following XHTML prefix when called for the first time:
lazyStartHead() - Method in class org.apache.tika.sax.XHTMLContentHandler
Generates the following XHTML prefix when called for the first time:
length - Variable in class org.apache.tika.detect.MagicDetector
Length of the comparison window.
length - Variable in class org.apache.tika.io.TikaInputStream
Total length of the stream, or -1 if unknown.
length - Variable in class org.apache.tika.language.LanguageProfile
 
length() - Method in class org.apache.tika.language.LanguageProfilerBuilder.QuickStringBuffer
 
LengthComparator() - Constructor for class org.apache.tika.mime.Patterns.LengthComparator
 
LF - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
LICENSE_LOCATION - Static variable in interface org.apache.tika.metadata.CreativeCommons
 
LICENSE_URL - Static variable in interface org.apache.tika.metadata.CreativeCommons
 
LICENSOR - Static variable in interface org.apache.tika.metadata.IPTC
A person or company that should be contacted to obtain a licence for using the item or who has licensed the item.
LICENSOR_CITY - Static variable in interface org.apache.tika.metadata.IPTC
The city of a person or company that should be contacted to obtain a licence for using the item or who has licensed the item.
LICENSOR_COUNTRY - Static variable in interface org.apache.tika.metadata.IPTC
The country of a person or company that should be contacted to obtain a licence for using the item or who has licensed the item.
LICENSOR_EMAIL - Static variable in interface org.apache.tika.metadata.IPTC
The email of a person or company that should be contacted to obtain a licence for using the item or who has licensed the item.
LICENSOR_EXTENDED_ADDRESS - Static variable in interface org.apache.tika.metadata.IPTC
The extended address of a person or company that should be contacted to obtain a licence for using the item or who has licensed the item.
LICENSOR_ID - Static variable in interface org.apache.tika.metadata.IPTC
The ID of the person or company that should be contacted to obtain a licence for using the item or who has licensed the item.
LICENSOR_ID_WRONG_CASE - Static variable in interface org.apache.tika.metadata.IPTC
Deprecated.
LICENSOR_NAME - Static variable in interface org.apache.tika.metadata.IPTC
The name of the person or company that should be contacted to obtain a licence for using the item or who has licensed the item.
LICENSOR_POSTAL_CODE - Static variable in interface org.apache.tika.metadata.IPTC
The postal code of a person or company that should be contacted to obtain a licence for using the item or who has licensed the item.
LICENSOR_REGION - Static variable in interface org.apache.tika.metadata.IPTC
The region of a person or company that should be contacted to obtain a licence for using the item or who has licensed the item.
LICENSOR_STREET_ADDRESS - Static variable in interface org.apache.tika.metadata.IPTC
The street address of a person or company that should be contacted to obtain a licence for using the item or who has licensed the item.
LICENSOR_TELEPHONE_1 - Static variable in interface org.apache.tika.metadata.IPTC
The phone number of a person or company that should be contacted to obtain a licence for using the item or who has licensed the item.
LICENSOR_TELEPHONE_2 - Static variable in interface org.apache.tika.metadata.IPTC
The phone number of a person or company that should be contacted to obtain a licence for using the item or who has licensed the item.
LICENSOR_URL - Static variable in interface org.apache.tika.metadata.IPTC
The URL of a person or company that should be contacted to obtain a licence for using the item or who has licensed the item.
LINE_COUNT - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
LINE_COUNT - Static variable in interface org.apache.tika.metadata.Office
The number of lines in the document
Link - Class in org.apache.tika.sax
 
Link(String, String, String, String) - Constructor for class org.apache.tika.sax.Link
 
Link(String, String, String, String, String) - Constructor for class org.apache.tika.sax.Link
 
LinkBuilder - Class in org.apache.tika.sax
 
LinkBuilder(String) - Constructor for class org.apache.tika.sax.LinkBuilder
 
LinkContentHandler - Class in org.apache.tika.sax
Content handler that collects links from an XHTML document.
LinkContentHandler() - Constructor for class org.apache.tika.sax.LinkContentHandler
Default constructor
LinkContentHandler(boolean) - Constructor for class org.apache.tika.sax.LinkContentHandler
Default constructor
links - Variable in class org.apache.tika.mime.MimeType
Documentation Links
links - Variable in class org.apache.tika.sax.LinkContentHandler
Collected links
LINKS_PATTERN - Static variable in class org.apache.tika.utils.RegexUtils
 
LINKS_REGEX - Static variable in class org.apache.tika.utils.RegexUtils
Regex pattern to get URLs within a plain text.
listType - Static variable in class org.apache.tika.metadata.serialization.JsonMetadataList
 
load(InputStream) - Method in class org.apache.tika.language.LanguageProfilerBuilder
Loads a ngram profile from an InputStream (assumes UTF-8 encoded content)
loadDefaultModels(InputStream) - Method in class org.apache.tika.detect.NNExampleModelDetector
 
loadDefaultModels(ClassLoader) - Method in class org.apache.tika.detect.NNExampleModelDetector
this method gets overwritten to register load neural network models
loadDefaultModels(Path) - Method in class org.apache.tika.detect.TrainedModelDetector
 
loadDefaultModels(File) - Method in class org.apache.tika.detect.TrainedModelDetector
 
loadDefaultModels(InputStream) - Method in class org.apache.tika.detect.TrainedModelDetector
 
loadDefaultModels(ClassLoader) - Method in class org.apache.tika.detect.TrainedModelDetector
 
loadDynamicServiceProviders(Class<T>) - Method in class org.apache.tika.config.ServiceLoader
Returns the available dynamic service providers of the given type.
loader - Variable in class org.apache.tika.config.ServiceLoader
 
loader - Variable in class org.apache.tika.detect.DefaultDetector
 
loader - Variable in class org.apache.tika.detect.DefaultProbDetector
 
loader - Variable in class org.apache.tika.fork.ClassLoaderResource
 
loader - Variable in class org.apache.tika.fork.ForkClient
 
loader - Variable in class org.apache.tika.fork.ForkObjectInputStream
The class loader used when deserializing objects.
loader - Variable in class org.apache.tika.fork.ForkParser
 
loader - Variable in class org.apache.tika.language.translate.DefaultTranslator
 
loader - Variable in class org.apache.tika.parser.DefaultParser
 
LoadErrorHandler - Interface in org.apache.tika.config
Interface for error handling strategies in service class loading.
loadOne(Element, MimeTypes, ServiceLoader) - Method in class org.apache.tika.config.TikaConfig.ExecutorServiceXmlLoader
 
loadOne(Element, MimeTypes, ServiceLoader) - Method in class org.apache.tika.config.TikaConfig.XmlLoader
 
loadOverall(Element, MimeTypes, ServiceLoader) - Method in class org.apache.tika.config.TikaConfig.XmlLoader
 
loadProperties(InputStream) - Method in class org.apache.tika.parser.iptc.IptcAnpaParser
scan the news messsage and store the metadata and data into a map
loadServiceProviders(Class<T>) - Method in class org.apache.tika.config.ServiceLoader
Returns all the available service providers of the given type.
loadStaticServiceProviders(Class<T>) - Method in class org.apache.tika.config.ServiceLoader
Returns the available static service providers of the given type.
LOCAL_NAME_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
 
LOCAL_NAME_RDF_BAG - Static variable in class org.apache.tika.parser.xml.ElementMetadataHandler
 
LOCAL_NAME_RDF_LI - Static variable in class org.apache.tika.parser.xml.ElementMetadataHandler
 
localName - Variable in class org.apache.tika.mime.MimeType.RootXML
 
localName - Variable in class org.apache.tika.parser.xml.AttributeMetadataHandler
 
localName - Variable in class org.apache.tika.parser.xml.ElementMetadataHandler
 
LOCATION - Static variable in interface org.apache.tika.metadata.HttpHeaders
 
location - Variable in class org.apache.tika.parser.RecursiveParserWrapper.EmbeddedParserDecorator
 
LOCATION_CREATED - Static variable in interface org.apache.tika.metadata.IPTC
The location the content of the item was created.
LOCATION_CREATED_CITY - Static variable in interface org.apache.tika.metadata.IPTC
Name of the city of a location.
LOCATION_CREATED_COUNTRY_CODE - Static variable in interface org.apache.tika.metadata.IPTC
The ISO code of a country of a location.
LOCATION_CREATED_COUNTRY_NAME - Static variable in interface org.apache.tika.metadata.IPTC
The name of a country of a location.
LOCATION_CREATED_PROVINCE_OR_STATE - Static variable in interface org.apache.tika.metadata.IPTC
The name of a subregion of a country - a province or state - of a location.
LOCATION_CREATED_SUBLOCATION - Static variable in interface org.apache.tika.metadata.IPTC
Name of a sublocation.
LOCATION_CREATED_WORLD_REGION - Static variable in interface org.apache.tika.metadata.IPTC
The name of a world region of a location.
LOCATION_SHOWN - Static variable in interface org.apache.tika.metadata.IPTC
A location the content of the item is about.
LOCATION_SHOWN_CITY - Static variable in interface org.apache.tika.metadata.IPTC
Name of the city of a location.
LOCATION_SHOWN_COUNTRY_CODE - Static variable in interface org.apache.tika.metadata.IPTC
The ISO code of a country of a location.
LOCATION_SHOWN_COUNTRY_NAME - Static variable in interface org.apache.tika.metadata.IPTC
The name of a country of a location.
LOCATION_SHOWN_PROVINCE_OR_STATE - Static variable in interface org.apache.tika.metadata.IPTC
The name of a subregion of a country - a province or state - of a location.
LOCATION_SHOWN_SUBLOCATION - Static variable in interface org.apache.tika.metadata.IPTC
Name of a sublocation.
LOCATION_SHOWN_WORLD_REGION - Static variable in interface org.apache.tika.metadata.IPTC
The name of a world region of a location.
log - Static variable in class org.apache.tika.detect.NNExampleModelDetector
 
LOG_COMMENT - Static variable in interface org.apache.tika.metadata.XMPDM
"User's log comments."
logger - Static variable in class org.apache.tika.parser.xml.ElementMetadataHandler
Logger for this class
LONG_SIZE - Static variable in class org.apache.tika.io.EndianUtils
 
LONGITUDE - Static variable in interface org.apache.tika.metadata.Geographic
The WGS84 Longitude of the Point
LONGITUDE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 
LookaheadInputStream - Class in org.apache.tika.io
Stream wrapper that make it easy to read up to n bytes ahead from a stream that supports the mark feature.
LookaheadInputStream(InputStream, int) - Constructor for class org.apache.tika.io.LookaheadInputStream
Creates a lookahead wrapper for the given input stream.
looksLikeUTF8() - Method in class org.apache.tika.detect.TextStatistics
Checks whether the observed byte stream looks like UTF-8 encoded text.
LOOP - Static variable in interface org.apache.tika.metadata.XMPDM
"When true, the clip can be looped seamlessly."
LT - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 

M

Magic - Class in org.apache.tika.mime
Defines a magic for a MimeType.
Magic(MimeType, int, Clause) - Constructor for class org.apache.tika.mime.Magic
 
magic_neg - Variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
 
magic_neg(float) - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
 
magic_neg - Variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
 
MAGIC_PRIORITY_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
 
MAGIC_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
 
magic_trust - Variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
 
magic_trust(float) - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
 
magic_trust - Variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
 
MagicDetector - Class in org.apache.tika.detect
Content type detection based on magic bytes, i.e.
MagicDetector(MediaType, byte[]) - Constructor for class org.apache.tika.detect.MagicDetector
Creates a detector for input documents that have the exact given byte pattern at the beginning of the document stream.
MagicDetector(MediaType, byte[], int) - Constructor for class org.apache.tika.detect.MagicDetector
Creates a detector for input documents that have the exact given byte pattern at the given offset of the document stream.
MagicDetector(MediaType, byte[], byte[], int, int) - Constructor for class org.apache.tika.detect.MagicDetector
Creates a detector for input documents that meet the specified magic match.
MagicDetector(MediaType, byte[], byte[], boolean, int, int) - Constructor for class org.apache.tika.detect.MagicDetector
Creates a detector for input documents that meet the specified magic match.
MagicDetector(MediaType, byte[], byte[], boolean, boolean, int, int) - Constructor for class org.apache.tika.detect.MagicDetector
Creates a detector for input documents that meet the specified magic match.
MagicMatch - Class in org.apache.tika.mime
Defines a magic match.
MagicMatch(MediaType, String, String, String, String) - Constructor for class org.apache.tika.mime.MagicMatch
 
magics - Variable in class org.apache.tika.mime.MimeType
The magics associated to this Mime-Type
magics - Variable in class org.apache.tika.mime.MimeTypes
Sorted list of all registered magics
main(String[]) - Static method in class org.apache.tika.fork.ForkServer
Starts a forked server process using the standard input and output streams for communication with the parent process.
main(String[]) - Static method in class org.apache.tika.language.LanguageProfilerBuilder
main method used for testing only
MANAGER - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
MANAGER - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
 
mapAttributes(Attributes) - Method in class org.apache.tika.sax.ElementMappingContentHandler.TargetElement
 
mappedTagName - Variable in class org.apache.tika.sax.ElementMappingContentHandler.TargetElement
 
mappings - Variable in class org.apache.tika.sax.ElementMappingContentHandler
 
mark - Variable in class org.apache.tika.io.LookaheadInputStream
 
mark(int) - Method in class org.apache.tika.io.LookaheadInputStream
 
mark - Variable in class org.apache.tika.io.NullInputStream
 
mark(int) - Method in class org.apache.tika.io.NullInputStream
Mark the current position.
mark(int) - Method in class org.apache.tika.io.ProxyInputStream
Invokes the delegate's mark(int) method.
mark(int) - Method in class org.apache.tika.io.TailStream
This implementation saves the internal state including the content of the tail buffer so that it can be restored when ''reset()'' is called later.
mark - Variable in class org.apache.tika.io.TikaInputStream
Marked position, or -1 if there is no current mark.
mark(int) - Method in class org.apache.tika.io.TikaInputStream
 
mark(int) - Method in class org.apache.tika.parser.utils.CommonsDigester.SimpleBoundedInputStream
 
markBuffer - Variable in class org.apache.tika.io.TailStream
A copy of the internal tail buffer used for mark() operations.
markBytesRead - Variable in class org.apache.tika.io.TailStream
The number of bytes read at the last mark() operation.
MARKED - Static variable in interface org.apache.tika.metadata.XMPRights
When true, indicates that this is a rights-managed resource.
markIndex - Variable in class org.apache.tika.io.TailStream
A copy of the current index used for mark() operations.
markLimit - Variable in class org.apache.tika.parser.utils.CommonsDigester
 
markSupported() - Method in class org.apache.tika.io.LookaheadInputStream
 
markSupported - Variable in class org.apache.tika.io.NullInputStream
 
markSupported() - Method in class org.apache.tika.io.NullInputStream
Indicates whether mark is supported.
markSupported() - Method in class org.apache.tika.io.ProxyInputStream
Invokes the delegate's markSupported() method.
markSupported() - Method in class org.apache.tika.io.TikaInputStream
 
mask - Variable in class org.apache.tika.detect.MagicDetector
Bit mask that is applied to the source bytes before pattern matching.
mask - Variable in class org.apache.tika.mime.MagicMatch
 
MASK_AUDIO - Static variable in class org.apache.tika.parser.video.FLVParser
 
MASK_VIDEO - Static variable in class org.apache.tika.parser.video.FLVParser
 
MATCH_MASK_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
 
MATCH_OFFSET_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
 
MATCH_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
 
MATCH_TYPE_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
 
MATCH_VALUE_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
 
MATCHER - Static variable in class org.apache.tika.sax.BodyContentHandler
The XPath matcher used to select the XHTML body contents.
Matcher - Class in org.apache.tika.sax.xpath
XPath element matcher.
Matcher() - Constructor for class org.apache.tika.sax.xpath.Matcher
 
matcher - Variable in class org.apache.tika.sax.xpath.MatchingContentHandler
 
matchers - Variable in class org.apache.tika.sax.xpath.MatchingContentHandler
 
matches(byte[]) - Method in class org.apache.tika.mime.MimeType
 
matches(String, String) - Method in class org.apache.tika.mime.MimeType.RootXML
 
matches(String) - Method in class org.apache.tika.mime.Patterns
Find the MimeType corresponding to a resource name.
matchesAttribute(String, String) - Method in class org.apache.tika.sax.xpath.AttributeMatcher
 
matchesAttribute(String, String) - Method in class org.apache.tika.sax.xpath.CompositeMatcher
 
matchesAttribute(String, String) - Method in class org.apache.tika.sax.xpath.Matcher
Returns true if the XPath expression matches the named attribute of the element associated with this evaluation state.
matchesAttribute(String, String) - Method in class org.apache.tika.sax.xpath.NamedAttributeMatcher
 
matchesAttribute(String, String) - Method in class org.apache.tika.sax.xpath.NodeMatcher
 
matchesAttribute(String, String) - Method in class org.apache.tika.sax.xpath.SubtreeMatcher
 
matchesElement() - Method in class org.apache.tika.sax.xpath.CompositeMatcher
 
matchesElement() - Method in class org.apache.tika.sax.xpath.ElementMatcher
 
matchesElement() - Method in class org.apache.tika.sax.xpath.Matcher
Returns true if the XPath expression matches the element associated with this evaluation state.
matchesElement() - Method in class org.apache.tika.sax.xpath.NodeMatcher
 
matchesElement() - Method in class org.apache.tika.sax.xpath.SubtreeMatcher
 
matchesMagic(byte[]) - Method in class org.apache.tika.mime.MimeType
 
matchesText() - Method in class org.apache.tika.sax.xpath.CompositeMatcher
 
matchesText() - Method in class org.apache.tika.sax.xpath.Matcher
Returns true if the XPath expression matches all text nodes whose parent is the element associated with this evaluation state.
matchesText() - Method in class org.apache.tika.sax.xpath.NodeMatcher
 
matchesText() - Method in class org.apache.tika.sax.xpath.SubtreeMatcher
 
matchesText() - Method in class org.apache.tika.sax.xpath.TextMatcher
 
matchesXML(String, String) - Method in class org.apache.tika.mime.MimeType
 
MatchingContentHandler - Class in org.apache.tika.sax.xpath
Content handler decorator that only passes the elements, attributes, and text nodes that match the given XPath expression.
MatchingContentHandler(ContentHandler, Matcher) - Constructor for class org.apache.tika.sax.xpath.MatchingContentHandler
 
matchLevel - Variable in class org.apache.tika.parser.xml.ElementMetadataHandler
 
MAX - Static variable in class org.apache.tika.parser.iwork.AutoPageNumberUtils
 
max - Variable in class org.apache.tika.parser.utils.CommonsDigester.SimpleBoundedInputStream
 
MAX_AVAIL_HEIGHT - Static variable in interface org.apache.tika.metadata.IPTC
The maximum available height in pixels of the original photo from which this photo has been derived by downsizing.
MAX_AVAIL_WIDTH - Static variable in interface org.apache.tika.metadata.IPTC
The maximum available width in pixels of the original photo from which this photo has been derived by downsizing.
MAX_SIZE - Static variable in class org.apache.tika.language.LanguageProfilerBuilder
The profile max size (number of ngrams of the same size)
maxBytesInMemory - Variable in class org.apache.tika.utils.RereadableInputStream
Maximum number of bytes that can be stored in memory before storage will be moved to a temporary file.
maxDepth - Variable in class org.apache.tika.sax.SecureContentHandler
Maximum XML element nesting level.
maxEmbeddedResources - Variable in class org.apache.tika.parser.RecursiveParserWrapper
 
maxExtensionLength - Variable in class org.apache.tika.mime.Patterns
 
maxLength - Variable in class org.apache.tika.language.LanguageProfilerBuilder
The max length of ngram
maxPackageEntryDepth - Variable in class org.apache.tika.sax.SecureContentHandler
Maximum package entry nesting level.
maxStringLength - Variable in class org.apache.tika.Tika
Maximum length of the strings returned by the parseToString methods.
mediaType - Variable in class org.apache.tika.mime.MagicMatch
 
MediaType - Class in org.apache.tika.mime
Internet media type.
MediaType(String, String, Map<String, String>) - Constructor for class org.apache.tika.mime.MediaType
 
MediaType(String, String) - Constructor for class org.apache.tika.mime.MediaType
 
MediaType(String, int) - Constructor for class org.apache.tika.mime.MediaType
 
MediaType(MediaType, Map<String, String>) - Constructor for class org.apache.tika.mime.MediaType
 
MediaType(MediaType, String, String) - Constructor for class org.apache.tika.mime.MediaType
Creates a media type by adding a parameter to a base type.
MediaType(MediaType, Charset) - Constructor for class org.apache.tika.mime.MediaType
Creates a media type by adding the "charset" parameter to a base type.
MediaTypeRegistry - Class in org.apache.tika.mime
Registry of known Internet media types.
MediaTypeRegistry() - Constructor for class org.apache.tika.mime.MediaTypeRegistry
 
mediaTypesListFromDomElement(Element, String) - Static method in class org.apache.tika.config.TikaConfig
 
MemoryURLConnection - Class in org.apache.tika.fork
 
MemoryURLConnection(URL, byte[]) - Constructor for class org.apache.tika.fork.MemoryURLConnection
 
MemoryURLStreamHandler - Class in org.apache.tika.fork
 
MemoryURLStreamHandler() - Constructor for class org.apache.tika.fork.MemoryURLStreamHandler
 
MemoryURLStreamHandlerFactory - Class in org.apache.tika.fork
 
MemoryURLStreamHandlerFactory() - Constructor for class org.apache.tika.fork.MemoryURLStreamHandlerFactory
 
MemoryURLStreamRecord - Class in org.apache.tika.fork
 
MemoryURLStreamRecord() - Constructor for class org.apache.tika.fork.MemoryURLStreamRecord
 
menuItems - Variable in class org.apache.tika.parser.iwork.NumbersContentHandler
 
Message - Interface in org.apache.tika.metadata
A collection of Message related property names.
MESSAGE_BCC - Static variable in interface org.apache.tika.metadata.Message
 
MESSAGE_CC - Static variable in interface org.apache.tika.metadata.Message
 
MESSAGE_FROM - Static variable in interface org.apache.tika.metadata.Message
 
MESSAGE_RECIPIENT_ADDRESS - Static variable in interface org.apache.tika.metadata.Message
 
MESSAGE_TO - Static variable in interface org.apache.tika.metadata.Message
 
meta - Variable in class org.apache.tika.parser.epub.EpubParser
 
meta_neg - Variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
 
meta_neg(float) - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
 
meta_neg - Variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
 
meta_trust - Variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
 
meta_trust(float) - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
 
meta_trust - Variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
 
metadata - Variable in class org.apache.tika.fork.MetadataContentHandler
 
Metadata - Class in org.apache.tika.metadata
A multi-valued metadata container.
Metadata() - Constructor for class org.apache.tika.metadata.Metadata
Constructs a new, empty metadata.
metadata - Variable in class org.apache.tika.metadata.Metadata
A map of all metadata attributes.
metadata - Variable in class org.apache.tika.parser.iwork.KeynoteContentHandler
 
metadata - Variable in class org.apache.tika.parser.iwork.NumbersContentHandler
 
metadata - Variable in class org.apache.tika.parser.iwork.PagesContentHandler
 
metadata - Variable in class org.apache.tika.parser.NetworkParser.MetaHandler
 
metadata - Variable in class org.apache.tika.parser.ParsingReader
Metadata associated with the document being parsed.
metadata - Variable in class org.apache.tika.parser.xml.AbstractMetadataHandler
 
metadata - Variable in class org.apache.tika.parser.xml.AttributeDependantMetadataHandler
 
metadata - Variable in class org.apache.tika.parser.xml.ElementMetadataHandler
 
metadata - Variable in class org.apache.tika.parser.xml.FictionBookParser.BinaryElementsDataHandler
 
metadata - Variable in class org.apache.tika.parser.xml.MetadataHandler
Deprecated.
 
metadata - Variable in class org.apache.tika.sax.DIFContentHandler
 
metadata - Variable in class org.apache.tika.sax.PhoneExtractingContentHandler
 
metadata - Variable in class org.apache.tika.sax.XHTMLContentHandler
Metadata associated with the document.
metadata(Metadata) - Method in class org.apache.tika.sax.XMPContentHandler
 
METADATA_COMMAND_ARGUMENTS_SERIALIZED_TOKEN - Static variable in class org.apache.tika.embedder.ExternalEmbedder
Token to be replaced with a String array of metadata assignment command arguments
METADATA_COMMAND_ARGUMENTS_TOKEN - Static variable in class org.apache.tika.embedder.ExternalEmbedder
Token to be replaced with a String array of metadata assignment command arguments
METADATA_DATE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 
METADATA_DATE - Static variable in interface org.apache.tika.metadata.XMP
The date and time that any metadata for this resource was last changed.
METADATA_KEY_ATTR - Static variable in interface org.apache.tika.parser.external.ExternalParsersConfigReaderMetKeys
 
METADATA_MATCH_TAG - Static variable in interface org.apache.tika.parser.external.ExternalParsersConfigReaderMetKeys
 
METADATA_MOD_DATE - Static variable in interface org.apache.tika.metadata.XMPDM
"The date and time when the metadata was last modified."
METADATA_TAG - Static variable in interface org.apache.tika.parser.external.ExternalParsersConfigReaderMetKeys
 
metadataCommandArguments - Variable in class org.apache.tika.embedder.ExternalEmbedder
Mapping of Tika metadata to command line parameters.
MetadataContentHandler - Class in org.apache.tika.fork
 
MetadataContentHandler(Metadata) - Constructor for class org.apache.tika.fork.MetadataContentHandler
 
MetadataHandler - Class in org.apache.tika.parser.xml
Deprecated.
MetadataHandler(Metadata, String) - Constructor for class org.apache.tika.parser.xml.MetadataHandler
Deprecated.
 
MetadataHandler(Metadata, Property) - Constructor for class org.apache.tika.parser.xml.MetadataHandler
Deprecated.
 
metadataKey - Variable in class org.apache.tika.parser.iwork.NumbersContentHandler
 
metaDataLocalName - Variable in class org.apache.tika.parser.iwork.PagesContentHandler
 
metadataPatterns - Variable in class org.apache.tika.parser.external.ExternalParser
Regular Expressions to run over STDOUT to extract Metadata.
metadataPropertyQName - Variable in class org.apache.tika.parser.iwork.NumbersContentHandler
 
metaDataQName - Variable in class org.apache.tika.parser.iwork.PagesContentHandler
 
metadatas - Variable in class org.apache.tika.parser.RecursiveParserWrapper
 
MetaHandler(Metadata) - Constructor for class org.apache.tika.parser.NetworkParser.MetaHandler
 
MIDDAY - Static variable in class org.apache.tika.utils.DateUtils
Custom time zone used to interpret date values without a time component in a way that most likely falls within the same day regardless of in which time zone it is later interpreted.
MidiParser - Class in org.apache.tika.parser.audio
 
MidiParser() - Constructor for class org.apache.tika.parser.audio.MidiParser
 
MIME_INFO_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
 
MIME_TYPE_MAGIC - Static variable in interface org.apache.tika.metadata.TikaMimeKeys
 
MIME_TYPE_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
 
MIME_TYPE_TYPE_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
 
MimeType - Class in org.apache.tika.mime
Internet media type.
MimeType(MediaType) - Constructor for class org.apache.tika.mime.MimeType
Creates a media type with the give name and containing media type registry.
mimetype - Variable in class org.apache.tika.parser.strings.FileConfig
 
MimeType.RootXML - Class in org.apache.tika.mime
Defines a RootXML description.
MIMETYPE_TAG - Static variable in interface org.apache.tika.parser.external.ExternalParsersConfigReaderMetKeys
 
MimeTypeException - Exception in org.apache.tika.mime
A class to encapsulate MimeType related exceptions.
MimeTypeException(String) - Constructor for exception org.apache.tika.mime.MimeTypeException
Constructs a MimeTypeException with the specified detail message.
MimeTypeException(String, Throwable) - Constructor for exception org.apache.tika.mime.MimeTypeException
Constructs a MimeTypeException with the specified detail message and root cause.
mimeTypes - Variable in class org.apache.tika.config.TikaConfig
 
MimeTypes - Class in org.apache.tika.mime
This class is a MimeType repository.
MimeTypes() - Constructor for class org.apache.tika.mime.MimeTypes
 
mimeTypes - Variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
 
MIMETYPES_TAG - Static variable in interface org.apache.tika.parser.external.ExternalParsersConfigReaderMetKeys
 
MimeTypesFactory - Class in org.apache.tika.mime
Creates instances of MimeTypes.
MimeTypesFactory() - Constructor for class org.apache.tika.mime.MimeTypesFactory
 
MimeTypesReader - Class in org.apache.tika.mime
A reader for XML files compliant with the freedesktop MIME-info DTD.
MimeTypesReader(MimeTypes) - Constructor for class org.apache.tika.mime.MimeTypesReader
 
MimeTypesReader.ClauseRecord - Class in org.apache.tika.mime
 
MimeTypesReaderMetKeys - Interface in org.apache.tika.mime
Met Keys used by the MimeTypesReader.
minExtensionLength - Variable in class org.apache.tika.mime.Patterns
 
minLength - Variable in class org.apache.tika.language.LanguageProfilerBuilder
The min length of ngram
minLength - Variable in class org.apache.tika.mime.MimeType
The minimum length of data to provides for magic analyzis
minLength - Variable in class org.apache.tika.parser.strings.StringsConfig
 
MINOR_MODEL_AGE_DISCLOSURE - Static variable in interface org.apache.tika.metadata.IPTC
Age of the youngest model pictured in the image, at the time that the image was made.
minSize - Variable in class org.apache.tika.parser.strings.Latin1StringsParser
The minimum size of a character sequence to be extracted.
MODEL_AGE - Static variable in interface org.apache.tika.metadata.IPTC
Age of the human model(s) at the time this image was taken in a model released image.
MODEL_MAP - Variable in class org.apache.tika.detect.TrainedModelDetector
 
MODEL_NAME_ENGLISH - Static variable in interface org.apache.tika.metadata.ClimateForcast
 
MODEL_RELEASE_ID - Static variable in interface org.apache.tika.metadata.IPTC
Optional identifier associated with each Model Release.
MODEL_RELEASE_STATUS - Static variable in interface org.apache.tika.metadata.IPTC
Summarizes the availability and scope of model releases authorizing usage of the likenesses of persons appearing in the photograph.
MODIFIED - Static variable in interface org.apache.tika.metadata.DublinCore
Date on which the resource was changed.
MODIFIED - Static variable in class org.apache.tika.metadata.Metadata
Deprecated.
use TikaCoreProperties#MODIFIED
MODIFIED - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 
modifiedService(ServiceReference, Object) - Method in class org.apache.tika.config.TikaActivator
 
MODIFIER - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 
MODIFY_DATE - Static variable in interface org.apache.tika.metadata.XMP
The date and time the resource was last modified.
MSG_PATTERN - Static variable in class org.apache.tika.utils.ExceptionUtils
 
MSOffice - Interface in org.apache.tika.metadata
A collection of Microsoft Office and Open Document property names.
multiThreadedStreamCopy(InputStream, OutputStream) - Method in class org.apache.tika.embedder.ExternalEmbedder
Creates a new thread for copying a given input stream to a given output stream.

N

n - Variable in class org.apache.tika.language.ProfilingWriter
 
N_PAGES - Static variable in interface org.apache.tika.metadata.PagedText
"The number of pages in the document (including any in contained documents)."
name - Variable in class org.apache.tika.language.LanguageProfilerBuilder
The profile's name
name - Variable in class org.apache.tika.metadata.Property
 
name - Variable in class org.apache.tika.parser.xml.AbstractMetadataHandler
 
name - Variable in class org.apache.tika.parser.xml.AttributeDependantMetadataHandler
 
name - Variable in class org.apache.tika.parser.xml.ElementMetadataHandler
 
name - Variable in class org.apache.tika.parser.xml.MetadataHandler
Deprecated.
 
name - Variable in class org.apache.tika.sax.xpath.NamedAttributeMatcher
 
name - Variable in class org.apache.tika.sax.xpath.NamedElementMatcher
 
NamedAttributeMatcher - Class in org.apache.tika.sax.xpath
Final evaluation state of a .../@name XPath expression.
NamedAttributeMatcher(String, String) - Constructor for class org.apache.tika.sax.xpath.NamedAttributeMatcher
 
NamedElementMatcher - Class in org.apache.tika.sax.xpath
Intermediate evaluation state of a .../name... XPath expression.
NamedElementMatcher(String, String, Matcher) - Constructor for class org.apache.tika.sax.xpath.NamedElementMatcher
 
NameDetector - Class in org.apache.tika.detect
Content type detection based on the resource name.
NameDetector(Map<Pattern, MediaType>) - Constructor for class org.apache.tika.detect.NameDetector
Creates a new content type detector based on the given name patterns.
nameHoldingAttribute - Variable in class org.apache.tika.parser.xml.AttributeDependantMetadataHandler
 
namePrefix - Variable in class org.apache.tika.parser.xml.AttributeDependantMetadataHandler
 
names() - Method in class org.apache.tika.metadata.Metadata
Returns an array of the names contained in the metadata.
names - Variable in class org.apache.tika.mime.Patterns
Index of exact name patterns.
namespace - Variable in enum org.apache.tika.parser.iwork.IWorkPackageParser.IWORKDocumentType
 
namespace - Variable in class org.apache.tika.sax.xpath.NamedAttributeMatcher
 
namespace - Variable in class org.apache.tika.sax.xpath.NamedElementMatcher
 
NAMESPACE_PREFIX_DELIMITER - Static variable in class org.apache.tika.metadata.Metadata
The common delimiter used between the namespace abbreviation and the property name
NAMESPACE_URI - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLCore
 
NAMESPACE_URI - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
 
NAMESPACE_URI - Static variable in interface org.apache.tika.metadata.XMP
 
NAMESPACE_URI - Static variable in interface org.apache.tika.metadata.XMPIdq
 
NAMESPACE_URI - Static variable in interface org.apache.tika.metadata.XMPMM
 
NAMESPACE_URI_DC - Static variable in interface org.apache.tika.metadata.DublinCore
 
NAMESPACE_URI_DC_TERMS - Static variable in interface org.apache.tika.metadata.DublinCore
 
NAMESPACE_URI_DOC_META - Static variable in interface org.apache.tika.metadata.Office
 
NAMESPACE_URI_IPTC_CORE - Static variable in interface org.apache.tika.metadata.IPTC
 
NAMESPACE_URI_IPTC_EXT - Static variable in interface org.apache.tika.metadata.IPTC
 
NAMESPACE_URI_PHOTOSHOP - Static variable in interface org.apache.tika.metadata.Photoshop
 
NAMESPACE_URI_PLUS - Static variable in interface org.apache.tika.metadata.IPTC
 
NAMESPACE_URI_XMP_RIGHTS - Static variable in interface org.apache.tika.metadata.XMPRights
 
namespaces - Variable in class org.apache.tika.sax.ToXMLContentHandler.ElementInfo
 
namespaces - Variable in class org.apache.tika.sax.ToXMLContentHandler
 
namespaceURI - Variable in class org.apache.tika.mime.MimeType.RootXML
 
NetworkParser - Class in org.apache.tika.parser
 
NetworkParser(URI, Set<MediaType>) - Constructor for class org.apache.tika.parser.NetworkParser
 
NetworkParser(URI) - Constructor for class org.apache.tika.parser.NetworkParser
 
NetworkParser.MetaHandler - Class in org.apache.tika.parser
 
NetworkParser.ParsingTask - Class in org.apache.tika.parser
 
NEWLINE - Static variable in class org.apache.tika.sax.DIFContentHandler
 
newline() - Method in class org.apache.tika.sax.XHTMLContentHandler
 
next() - Method in class org.apache.tika.language.LanguageProfile.Interleaved.Entry
 
ngram - Variable in class org.apache.tika.language.LanguageProfile.Interleaved.Entry
 
ngramcounts - Variable in class org.apache.tika.language.LanguageProfilerBuilder
The total number of ngrams occurences
NGramEntry(CharSequence) - Constructor for class org.apache.tika.language.LanguageProfilerBuilder.NGramEntry
Constructs a new NGramEntry
NGramEntry(String, int) - Constructor for class org.apache.tika.language.LanguageProfilerBuilder.NGramEntry
Constructs a new NGramEntry
ngrams - Variable in class org.apache.tika.language.LanguageProfile
The ngrams that make up this profile.
ngrams - Variable in class org.apache.tika.language.LanguageProfilerBuilder
An index of the ngrams of the profile
NL - Static variable in class org.apache.tika.sax.XHTMLContentHandler
The newline character that gets inserted after block elements.
NNExampleModelDetector - Class in org.apache.tika.detect
 
NNExampleModelDetector() - Constructor for class org.apache.tika.detect.NNExampleModelDetector
 
NNExampleModelDetector(Path) - Constructor for class org.apache.tika.detect.NNExampleModelDetector
 
NNExampleModelDetector(File) - Constructor for class org.apache.tika.detect.NNExampleModelDetector
 
NNTrainedModel - Class in org.apache.tika.detect
 
NNTrainedModel(int, int, int, float[]) - Constructor for class org.apache.tika.detect.NNTrainedModel
 
NNTrainedModelBuilder - Class in org.apache.tika.detect
 
NNTrainedModelBuilder() - Constructor for class org.apache.tika.detect.NNTrainedModelBuilder
 
NodeMatcher - Class in org.apache.tika.sax.xpath
Final evaluation state of a .../node() XPath expression.
NodeMatcher() - Constructor for class org.apache.tika.sax.xpath.NodeMatcher
 
normalize(String) - Static method in class org.apache.tika.io.FilenameUtils
Scans the given file name for reserved characters on different OSs and file systems and returns a sanitized version of the name with the reserved chars replaced by their hexadecimal value.
normalize() - Method in class org.apache.tika.language.LanguageProfilerBuilder
Normalizes the profile (calculates the ngrams frequencies)
normalize(MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
 
NOTES - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
NOTES - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
 
notFound - Variable in class org.apache.tika.fork.ClassLoaderProxy
Names of resources that could not be found.
NS_URI_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
 
NULL_OUTPUT_STREAM - Static variable in class org.apache.tika.io.NullOutputStream
A singleton.
NullInputStream - Class in org.apache.tika.io
A functional, light weight InputStream that emulates a stream of a specified size.
NullInputStream(long) - Constructor for class org.apache.tika.io.NullInputStream
Create an InputStream that emulates a specified size which supports marking and does not throw EOFException.
NullInputStream(long, boolean, boolean) - Constructor for class org.apache.tika.io.NullInputStream
Create an InputStream that emulates a specified size with option settings.
NullOutputStream - Class in org.apache.tika.io
This OutputStream writes all data to the famous /dev/null.
NullOutputStream() - Constructor for class org.apache.tika.io.NullOutputStream
 
NUMBER_OF_BEATS - Static variable in interface org.apache.tika.metadata.XMPDM
"The number of beats."
numberOfColumns - Variable in class org.apache.tika.parser.iwork.KeynoteContentHandler
 
numberOfColumns - Variable in class org.apache.tika.parser.iwork.NumbersContentHandler
 
numberOfColumns - Variable in class org.apache.tika.parser.iwork.PagesContentHandler
 
numberOfSheets - Variable in class org.apache.tika.parser.iwork.NumbersContentHandler
 
numberOfSlides - Variable in class org.apache.tika.parser.iwork.KeynoteContentHandler
 
NumbersContentHandler - Class in org.apache.tika.parser.iwork
 
NumbersContentHandler(XHTMLContentHandler, Metadata) - Constructor for class org.apache.tika.parser.iwork.NumbersContentHandler
 
numOfHidden - Variable in class org.apache.tika.detect.NNTrainedModel
 
numOfHidden - Variable in class org.apache.tika.detect.NNTrainedModelBuilder
 
numOfInputs - Variable in class org.apache.tika.detect.NNTrainedModel
 
numOfInputs - Variable in class org.apache.tika.detect.NNTrainedModelBuilder
 
numOfOutputs - Variable in class org.apache.tika.detect.NNTrainedModel
 
numOfOutputs - Variable in class org.apache.tika.detect.NNTrainedModelBuilder
 

O

OBJECT_COUNT - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
OBJECT_COUNT - Static variable in interface org.apache.tika.metadata.Office
The number of Objects in the document.
OCTET_STREAM - Static variable in class org.apache.tika.mime.MediaType
 
OCTET_STREAM - Static variable in class org.apache.tika.mime.MimeTypes
Name of the root type, application/octet-stream.
Office - Interface in org.apache.tika.metadata
Office Document properties collection.
OfficeOpenXMLCore - Interface in org.apache.tika.metadata
Core properties as defined in the Office Open XML specification part Two that are not in the DublinCore namespace.
OfficeOpenXMLExtended - Interface in org.apache.tika.metadata
Extended properties as defined in the Office Open XML specification part Four.
OfflineContentHandler - Class in org.apache.tika.sax
Content handler decorator that always returns an empty stream from the OfflineContentHandler.resolveEntity(String, String) method to prevent potential network or other external resources from being accessed by an XML parser.
OfflineContentHandler(ContentHandler) - Constructor for class org.apache.tika.sax.OfflineContentHandler
 
offset - Variable in class org.apache.tika.mime.MagicMatch
 
offsetRangeBegin - Variable in class org.apache.tika.detect.MagicDetector
First offset (inclusive) of the comparison window within the document input stream.
offsetRangeEnd - Variable in class org.apache.tika.detect.MagicDetector
Last offset (inclusive) of the comparison window within the document input stream.
openConnection(URL) - Method in class org.apache.tika.fork.MemoryURLStreamHandler
 
openContainer - Variable in class org.apache.tika.io.TikaInputStream
A opened container, such as a POIFS FileSystem for an OLE2 document, or a Zip file for a zip based (eg ooxml, odf) document.
OrClause - Class in org.apache.tika.mime
 
OrClause(List<Clause>) - Constructor for class org.apache.tika.mime.OrClause
 
org.apache.tika - package org.apache.tika
Apache Tika.
org.apache.tika.concurrent - package org.apache.tika.concurrent
 
org.apache.tika.config - package org.apache.tika.config
Tika configuration tools.
org.apache.tika.detect - package org.apache.tika.detect
Media type detection.
org.apache.tika.embedder - package org.apache.tika.embedder
 
org.apache.tika.exception - package org.apache.tika.exception
Tika exception.
org.apache.tika.extractor - package org.apache.tika.extractor
Extraction of component documents.
org.apache.tika.filetypedetector - package org.apache.tika.filetypedetector
Tika Java-7 FileTypeDetector implementations.
org.apache.tika.fork - package org.apache.tika.fork
Forked parser.
org.apache.tika.io - package org.apache.tika.io
IO utilities.
org.apache.tika.language - package org.apache.tika.language
Language detection.
org.apache.tika.language.translate - package org.apache.tika.language.translate
 
org.apache.tika.metadata - package org.apache.tika.metadata
Multi-valued metadata container, and set of constant metadata fields.
org.apache.tika.metadata.serialization - package org.apache.tika.metadata.serialization
 
org.apache.tika.mime - package org.apache.tika.mime
Media type information.
org.apache.tika.parser - package org.apache.tika.parser
Tika parsers.
org.apache.tika.parser.audio - package org.apache.tika.parser.audio
 
org.apache.tika.parser.envi - package org.apache.tika.parser.envi
 
org.apache.tika.parser.epub - package org.apache.tika.parser.epub
 
org.apache.tika.parser.external - package org.apache.tika.parser.external
External parser process.
org.apache.tika.parser.feed - package org.apache.tika.parser.feed
 
org.apache.tika.parser.gdal - package org.apache.tika.parser.gdal
 
org.apache.tika.parser.internal - package org.apache.tika.parser.internal
 
org.apache.tika.parser.iptc - package org.apache.tika.parser.iptc
 
org.apache.tika.parser.iwork - package org.apache.tika.parser.iwork
 
org.apache.tika.parser.strings - package org.apache.tika.parser.strings
 
org.apache.tika.parser.utils - package org.apache.tika.parser.utils
 
org.apache.tika.parser.video - package org.apache.tika.parser.video
 
org.apache.tika.parser.xml - package org.apache.tika.parser.xml
 
org.apache.tika.sax - package org.apache.tika.sax
SAX utilities.
org.apache.tika.sax.xpath - package org.apache.tika.sax.xpath
XPath utilities
org.apache.tika.utils - package org.apache.tika.utils
Utilities.
ORGANISATION_CODE - Static variable in interface org.apache.tika.metadata.IPTC
A set of metadata about artwork or an object in the item
ORGANISATION_NAME - Static variable in interface org.apache.tika.metadata.IPTC
Name of the organisation or company which is featured in the content.
ORIENTATION - Static variable in interface org.apache.tika.metadata.TIFF
"The Orientation of the image." 1 = 0th row at top, 0th column at left 2 = 0th row at top, 0th column at right 3 = 0th row at bottom, 0th column at right 4 = 0th row at bottom, 0th column at left 5 = 0th row at left, 0th column at top 6 = 0th row at right, 0th column at top 7 = 0th row at right, 0th column at bottom 8 = 0th row at left, 0th column at bottom
ORIGINAL_DATE - Static variable in interface org.apache.tika.metadata.TIFF
"Date and time when original image was generated"
ORIGINAL_DOCUMENTID - Static variable in interface org.apache.tika.metadata.XMPMM
The common identifier for the original resource from which the current resource is derived.
originalInputStream - Variable in class org.apache.tika.utils.RereadableInputStream
Input stream originally passed to the constructor.
outPos - Variable in class org.apache.tika.parser.strings.Latin1StringsParser
The current position into the output buffer.
output - Variable in class org.apache.tika.fork.ClassLoaderProxy
 
output - Variable in class org.apache.tika.fork.ContentHandlerProxy
 
output - Variable in class org.apache.tika.fork.ForkClient
 
output - Variable in class org.apache.tika.fork.ForkServer
Output stream for writing to the parent process
output - Variable in class org.apache.tika.fork.InputStreamProxy
 
output(String) - Method in class org.apache.tika.parser.iwork.PagesContentHandler.HeaderFooter
 
output - Variable in class org.apache.tika.parser.NetworkParser.ParsingTask
 
output - Variable in class org.apache.tika.parser.strings.Latin1StringsParser
The output buffer.
OUTPUT_FILE_TOKEN - Static variable in class org.apache.tika.parser.external.ExternalParser
The token, which if present in the Command string, will be replaced with the output filename.
outputTable(String) - Method in class org.apache.tika.parser.iwork.PagesContentHandler
 
OWNER - Static variable in interface org.apache.tika.metadata.XMPRights
A list of legal owners of the resource.

P

packageEntryDepths - Variable in class org.apache.tika.sax.SecureContentHandler
Current number of nested <div class="package-entr"> elements.
PAGE_COUNT - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
PAGE_COUNT - Static variable in interface org.apache.tika.metadata.Office
The number of Pages are there in the (paged) document
pageCount - Variable in class org.apache.tika.parser.iwork.PagesContentHandler
 
PagedText - Interface in org.apache.tika.metadata
XMP Paged-text schema.
PagesContentHandler - Class in org.apache.tika.parser.iwork
 
PagesContentHandler(XHTMLContentHandler, Metadata) - Constructor for class org.apache.tika.parser.iwork.PagesContentHandler
 
PagesContentHandler.Annotations - Class in org.apache.tika.parser.iwork
Represents Annotations in a document.
PagesContentHandler.DocumentPart - Enum in org.apache.tika.parser.iwork
The (interesting) part of the document we're in.
PagesContentHandler.Footnotes - Class in org.apache.tika.parser.iwork
Represents Footnotes in a document.
PagesContentHandler.HeaderFooter - Class in org.apache.tika.parser.iwork
Represents the Headers or Footers in a document
PARAGRAPH_COUNT - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
PARAGRAPH_COUNT - Static variable in interface org.apache.tika.metadata.Office
The number of individual Paragraphs in the document
parameters - Variable in class org.apache.tika.mime.MediaType
Immutable sorted map of media type parameters.
params - Variable in class org.apache.tika.detect.NNTrainedModelBuilder
 
parent - Variable in class org.apache.tika.mime.MimeTypesReader.ClauseRecord
 
parent - Variable in class org.apache.tika.sax.ToXMLContentHandler.ElementInfo
 
parentMatchLevel - Variable in class org.apache.tika.parser.xml.ElementMetadataHandler
 
parse(MediaType, String, String, String, String) - Static method in class org.apache.tika.detect.MagicDetector
 
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.extractor.ParserContainerExtractor.RecursiveParser
 
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.fork.ForkParser
 
parse(String) - Static method in class org.apache.tika.mime.MediaType
Parses the given string to a media type.
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.AbstractParser
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.audio.AudioParser
 
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.audio.MidiParser
 
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.AutoDetectParser
 
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.AutoDetectParser
 
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.CompositeParser
Delegates the call to the matching component parser.
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.CryptoParser
 
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.DelegatingParser
Looks up the delegate parser from the parsing context and delegates the parse operation to it.
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.DigestingParser
 
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.EmptyParser
 
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.envi.EnviHeaderParser
 
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.epub.EpubContentParser
 
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.epub.EpubParser
 
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.ErrorParser
 
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.external.ExternalParser
Executes the configured external command and passes the given document stream as a simple XHTML document to the given SAX content handler.
parse(TikaInputStream, XHTMLContentHandler, Metadata, TemporaryResources) - Method in class org.apache.tika.parser.external.ExternalParser
 
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.feed.FeedParser
 
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.gdal.GDALParser
 
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.iptc.IptcAnpaParser
 
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.iptc.IptcAnpaParser
Deprecated.
This method will be removed in Apache Tika 1.0.
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.iwork.IWorkPackageParser
 
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.NetworkParser
 
parse(TikaInputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.NetworkParser
 
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.NetworkParser.ParsingTask
 
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in interface org.apache.tika.parser.Parser
Parses a document stream into a sequence of XHTML SAX events.
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.ParserDecorator
Delegates the method call to the decorated parser.
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.ParserPostProcessor
Forwards the call to the delegated parser and post-processes the results as described above.
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.RecursiveParserWrapper.EmbeddedParserDecorator
 
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.RecursiveParserWrapper
Acts like a regular parser except it ignores the ContentHandler and it automatically sets/overwrites the embedded Parser in the ParseContext object.
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.strings.Latin1StringsParser
 
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.strings.StringsParser
 
parse(String) - Static method in class org.apache.tika.parser.utils.CommonsDigester
 
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.video.FLVParser
 
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.xml.XMLParser
 
parse(String) - Method in class org.apache.tika.sax.xpath.XPathParser
Parses the given simple XPath expression to an evaluation state initialized at the document node.
parse(InputStream, Metadata) - Method in class org.apache.tika.Tika
Parses the given document and returns the extracted text content.
parse(InputStream) - Method in class org.apache.tika.Tika
Parses the given document and returns the extracted text content.
parse(Path) - Method in class org.apache.tika.Tika
Parses the file at the given path and returns the extracted text content.
parse(File) - Method in class org.apache.tika.Tika
Parses the given file and returns the extracted text content.
parse(URL) - Method in class org.apache.tika.Tika
Parses the resource at the given URL and returns the extracted text content.
PARSE_TIME_MILLIS - Static variable in class org.apache.tika.parser.RecursiveParserWrapper
 
parseBody(byte[], HashMap<String, String>) - Method in class org.apache.tika.parser.iptc.IptcAnpaParser
 
ParseContext - Class in org.apache.tika.parser
Parse context.
ParseContext() - Constructor for class org.apache.tika.parser.ParseContext
 
parseDate(String) - Static method in class org.apache.tika.metadata.Metadata
Parses the given date string.
parseEmbedded(InputStream, ContentHandler, Metadata, boolean) - Method in interface org.apache.tika.extractor.EmbeddedDocumentExtractor
Processes the supplied embedded resource, calling the delegating parser with the appropriate details.
parseEmbedded(InputStream, ContentHandler, Metadata, boolean) - Method in class org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor
 
parseFooter(byte[], HashMap<String, String>) - Method in class org.apache.tika.parser.iptc.IptcAnpaParser
 
parseHandlerType(String, BasicContentHandlerFactory.HANDLER_TYPE) - Static method in class org.apache.tika.sax.BasicContentHandlerFactory
Tries to parse string into handler type.
parseHeader(byte[], HashMap<String, String>) - Method in class org.apache.tika.parser.iptc.IptcAnpaParser
 
parseParameters(String) - Static method in class org.apache.tika.mime.MediaType
 
parsePrimitiveElementValue(String, Attributes) - Method in class org.apache.tika.parser.iwork.PagesContentHandler
Returns the value of a primitive element e.g.: <sl:number sfa:number="0" sfa:type="f"/> - the number attribute <sl:string sfa:string="en"/> = the string attribute
parseProperty - Variable in class org.apache.tika.parser.iwork.PagesContentHandler
 
parser - Variable in class org.apache.tika.config.TikaConfig
 
parser - Variable in class org.apache.tika.extractor.ParserContainerExtractor
 
parser - Variable in class org.apache.tika.fork.ForkParser
 
Parser - Interface in org.apache.tika.parser
Tika parser interface.
parser - Variable in class org.apache.tika.parser.ParserDecorator
The decorated parser instance.
parser - Variable in class org.apache.tika.parser.ParsingReader
Parser instance used for parsing the given binary stream.
PARSER - Static variable in class org.apache.tika.sax.BodyContentHandler
XHTML XPath parser.
parser - Variable in class org.apache.tika.Tika
The parser instance used by this facade.
PARSER_TAG - Static variable in interface org.apache.tika.parser.external.ExternalParsersConfigReaderMetKeys
 
ParserContainerExtractor - Class in org.apache.tika.extractor
An implementation of ContainerExtractor powered by the regular Parser API.
ParserContainerExtractor() - Constructor for class org.apache.tika.extractor.ParserContainerExtractor
 
ParserContainerExtractor(TikaConfig) - Constructor for class org.apache.tika.extractor.ParserContainerExtractor
 
ParserContainerExtractor(Parser, Detector) - Constructor for class org.apache.tika.extractor.ParserContainerExtractor
 
ParserContainerExtractor.RecursiveParser - Class in org.apache.tika.extractor
 
ParserDecorator - Class in org.apache.tika.parser
Decorator base class for the Parser interface.
ParserDecorator(Parser) - Constructor for class org.apache.tika.parser.ParserDecorator
Creates a decorator for the given parser.
ParserPostProcessor - Class in org.apache.tika.parser
Parser decorator that post-processes the results from a decorated parser.
ParserPostProcessor(Parser) - Constructor for class org.apache.tika.parser.ParserPostProcessor
Creates a post-processing decorator for the given parser.
parsers - Variable in class org.apache.tika.parser.CompositeParser
List of component parsers.
parserService - Variable in class org.apache.tika.parser.internal.Activator
 
parserTracker - Variable in class org.apache.tika.config.TikaActivator
 
ParserXmlLoader() - Constructor for class org.apache.tika.config.TikaConfig.ParserXmlLoader
 
parseTableData(String) - Method in class org.apache.tika.parser.iwork.KeynoteContentHandler
 
parseTableData(String, Attributes) - Method in class org.apache.tika.parser.iwork.PagesContentHandler
 
parseText - Variable in class org.apache.tika.parser.iwork.NumbersContentHandler
 
parseToString(InputStream, Metadata) - Method in class org.apache.tika.Tika
Parses the given document and returns the extracted text content.
parseToString(InputStream, Metadata, int) - Method in class org.apache.tika.Tika
Parses the given document and returns the extracted text content.
parseToString(InputStream) - Method in class org.apache.tika.Tika
Parses the given document and returns the extracted text content.
parseToString(Path) - Method in class org.apache.tika.Tika
Parses the file at the given path and returns the extracted text content.
parseToString(File) - Method in class org.apache.tika.Tika
Parses the given file and returns the extracted text content.
parseToString(URL) - Method in class org.apache.tika.Tika
Parses the resource at the given URL and returns the extracted text content.
ParsingEmbeddedDocumentExtractor - Class in org.apache.tika.extractor
Helper class for parsers of package archives or other compound document formats that support embedded or attached component documents.
ParsingEmbeddedDocumentExtractor(ParseContext) - Constructor for class org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor
 
ParsingReader - Class in org.apache.tika.parser
Reader for the text content from a given binary stream.
ParsingReader(InputStream) - Constructor for class org.apache.tika.parser.ParsingReader
Creates a reader for the text content of the given binary stream.
ParsingReader(InputStream, String) - Constructor for class org.apache.tika.parser.ParsingReader
Creates a reader for the text content of the given binary stream with the given name.
ParsingReader(Path) - Constructor for class org.apache.tika.parser.ParsingReader
Creates a reader for the text content of the file at the given path.
ParsingReader(File) - Constructor for class org.apache.tika.parser.ParsingReader
Creates a reader for the text content of the given file.
ParsingReader(Parser, InputStream, Metadata, ParseContext) - Constructor for class org.apache.tika.parser.ParsingReader
Creates a reader for the text content of the given binary stream with the given document metadata.
ParsingReader(Parser, InputStream, Metadata, ParseContext, Executor) - Constructor for class org.apache.tika.parser.ParsingReader
Creates a reader for the text content of the given binary stream with the given document metadata.
ParsingReader.ParsingTask - Class in org.apache.tika.parser
The background parsing task.
ParsingTask(TikaInputStream, OutputStream) - Constructor for class org.apache.tika.parser.NetworkParser.ParsingTask
 
ParsingTask() - Constructor for class org.apache.tika.parser.ParsingReader.ParsingTask
 
part - Variable in enum org.apache.tika.parser.iwork.IWorkPackageParser.IWORKDocumentType
 
partExtractor - Variable in class org.apache.tika.parser.xml.FictionBookParser.BinaryElementsDataHandler
 
PasswordProvider - Interface in org.apache.tika.parser
Interface for providing a password to a Parser for handling Encrypted and Password Protected Documents.
path - Variable in class org.apache.tika.io.TikaInputStream
The path to the file that contains the contents of this stream.
pattern - Variable in class org.apache.tika.detect.MagicDetector
The magic match pattern.
PATTERN_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
 
patternLength - Variable in class org.apache.tika.detect.MagicDetector
Length of the pattern, which in the case of regular expressions will not be the same as the comparison window length.
patterns - Variable in class org.apache.tika.detect.NameDetector
The regular expression patterns used for type detection.
patterns - Variable in class org.apache.tika.mime.MimeTypes
The patterns matcher
Patterns - Class in org.apache.tika.mime
Defines a MimeType pattern.
Patterns(MediaTypeRegistry) - Constructor for class org.apache.tika.mime.Patterns
 
Patterns.LengthComparator - Class in org.apache.tika.mime
 
peek(byte[]) - Method in class org.apache.tika.io.TikaInputStream
Fills the given buffer with upcoming bytes from this stream without advancing the current stream position.
PERSON - Static variable in interface org.apache.tika.metadata.IPTC
Name of a person the content of the item is about.
PHONE_NUMBERS - Static variable in class org.apache.tika.sax.PhoneExtractingContentHandler
 
PhoneExtractingContentHandler - Class in org.apache.tika.sax
Class used to extract phone numbers while parsing.
PhoneExtractingContentHandler(ContentHandler, Metadata) - Constructor for class org.apache.tika.sax.PhoneExtractingContentHandler
Creates a decorator for the given SAX event handler and Metadata object.
PhoneExtractingContentHandler() - Constructor for class org.apache.tika.sax.PhoneExtractingContentHandler
Creates a decorator that by default forwards incoming SAX events to a dummy content handler that simply ignores all the events.
phoneRegex - Static variable in class org.apache.tika.sax.CleanPhoneText
 
Photoshop - Interface in org.apache.tika.metadata
XMP Photoshop metadata schema.
ping() - Method in class org.apache.tika.fork.ForkClient
 
PING - Static variable in class org.apache.tika.fork.ForkServer
 
PLAIN_TEXT - Static variable in class org.apache.tika.mime.MimeTypes
Name of the text type, text/plain.
PLUS_VERSION - Static variable in interface org.apache.tika.metadata.IPTC
The version number of the PLUS standards in place at the time of the transaction.
pool - Variable in class org.apache.tika.fork.ForkParser
 
poolSize - Variable in class org.apache.tika.fork.ForkParser
Process pool size
populateThetas(float[]) - Method in class org.apache.tika.detect.NNTrainedModel
 
pos - Variable in class org.apache.tika.language.LanguageProfile.Interleaved.Entry
 
pos - Variable in class org.apache.tika.parser.utils.CommonsDigester.SimpleBoundedInputStream
 
position - Variable in class org.apache.tika.io.LookaheadInputStream
 
position - Variable in class org.apache.tika.io.NullInputStream
 
position - Variable in class org.apache.tika.io.TikaInputStream
Current read position within this stream.
predict(double[]) - Method in class org.apache.tika.detect.NNTrainedModel
 
predict(float[]) - Method in class org.apache.tika.detect.NNTrainedModel
The given input vector of unseen is m=(256 + 1) * n= 1 this returns a prediction probability
predict(double[]) - Method in class org.apache.tika.detect.TrainedModel
 
predict(float[]) - Method in class org.apache.tika.detect.TrainedModel
 
PREFIX - Static variable in interface org.apache.tika.metadata.AccessPermissions
 
PREFIX - Static variable in interface org.apache.tika.metadata.Database
 
PREFIX - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLCore
 
PREFIX - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
 
PREFIX - Static variable in interface org.apache.tika.metadata.XMP
 
PREFIX - Static variable in interface org.apache.tika.metadata.XMPIdq
 
PREFIX - Static variable in interface org.apache.tika.metadata.XMPMM
 
prefix - Variable in class org.apache.tika.sax.XMPContentHandler
 
PREFIX_ - Static variable in interface org.apache.tika.metadata.XMP
The xmp prefix followed by the colon delimiter
PREFIX_ - Static variable in interface org.apache.tika.metadata.XMPIdq
The xmpidq prefix followed by the colon delimiter
PREFIX_ - Static variable in interface org.apache.tika.metadata.XMPMM
The xmpMM prefix followed by the colon delimiter
PREFIX_ - Static variable in interface org.apache.tika.metadata.XMPRights
The xmpRights prefix followed by the colon delimiter
PREFIX_DC - Static variable in interface org.apache.tika.metadata.DublinCore
 
PREFIX_DC_TERMS - Static variable in interface org.apache.tika.metadata.DublinCore
 
PREFIX_DOC_META - Static variable in interface org.apache.tika.metadata.Office
 
PREFIX_IPTC_CORE - Static variable in interface org.apache.tika.metadata.IPTC
 
PREFIX_IPTC_EXT - Static variable in interface org.apache.tika.metadata.IPTC
 
PREFIX_PHOTOSHOP - Static variable in interface org.apache.tika.metadata.Photoshop
 
PREFIX_PLUS - Static variable in interface org.apache.tika.metadata.IPTC
 
PREFIX_RTF_META - Static variable in interface org.apache.tika.metadata.RTFMetadata
 
PREFIX_XMP_RIGHTS - Static variable in interface org.apache.tika.metadata.XMPRights
 
prefixes - Variable in class org.apache.tika.sax.xpath.XPathParser
 
preLoadOne(Class<? extends Detector>, String, MimeTypes) - Method in class org.apache.tika.config.TikaConfig.DetectorXmlLoader
 
preLoadOne(Class<? extends ConfigurableThreadPoolExecutor>, String, MimeTypes) - Method in class org.apache.tika.config.TikaConfig.ExecutorServiceXmlLoader
 
preLoadOne(Class<? extends Parser>, String, MimeTypes) - Method in class org.apache.tika.config.TikaConfig.ParserXmlLoader
 
preLoadOne(Class<? extends Translator>, String, MimeTypes) - Method in class org.apache.tika.config.TikaConfig.TranslatorXmlLoader
 
preLoadOne(Class<? extends T>, String, MimeTypes) - Method in class org.apache.tika.config.TikaConfig.XmlLoader
 
PRESENTATION_FORMAT - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
PRESENTATION_FORMAT - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
 
PRESENTATION_HEIGHT - Static variable in class org.apache.tika.parser.iwork.KeynoteContentHandler
 
PRESENTATION_WIDTH - Static variable in class org.apache.tika.parser.iwork.KeynoteContentHandler
 
prettyInit() - Static method in class org.apache.tika.metadata.serialization.JsonMetadataBase
 
PrettyMetadataKeyComparator - Class in org.apache.tika.metadata.serialization
 
PrettyMetadataKeyComparator() - Constructor for class org.apache.tika.metadata.serialization.PrettyMetadataKeyComparator
 
primaryProperty - Variable in class org.apache.tika.metadata.Property
 
PRINT_DATE - Static variable in interface org.apache.tika.metadata.Office
When was the document last printed?
PRINT_DATE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 
priorExtensionFileType - Variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
 
priorExtensionFileType(float) - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
 
priorExtensionFileType - Variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
 
priority - Variable in class org.apache.tika.mime.Magic
 
priority - Variable in class org.apache.tika.mime.MimeTypesReader
 
priorMagicFileType - Variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
 
priorMagicFileType(float) - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
 
priorMagicFileType - Variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
 
priorMetaFileType - Variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
 
priorMetaFileType(float) - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
 
priorMetaFileType - Variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
 
ProbabilisticMimeDetectionSelector - Class in org.apache.tika.mime
Selector for combining different mime detection results based on probability
ProbabilisticMimeDetectionSelector() - Constructor for class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
 
ProbabilisticMimeDetectionSelector(ProbabilisticMimeDetectionSelector.Builder) - Constructor for class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
 
ProbabilisticMimeDetectionSelector(MimeTypes) - Constructor for class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
 
ProbabilisticMimeDetectionSelector(MimeTypes, ProbabilisticMimeDetectionSelector.Builder) - Constructor for class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
 
ProbabilisticMimeDetectionSelector.Builder - Class in org.apache.tika.mime
build class for probability parameters setting
probeContentType(Path) - Method in class org.apache.tika.filetypedetector.TikaFileTypeDetector
 
process(DataInputStream, DataOutputStream) - Method in class org.apache.tika.fork.ClassLoaderResource
Processes a request for one (code 1) or many (code 2) class loader resources.
process(DataInputStream, DataOutputStream) - Method in class org.apache.tika.fork.ContentHandlerResource
 
process - Variable in class org.apache.tika.fork.ForkClient
 
process(DataInputStream, DataOutputStream) - Method in interface org.apache.tika.fork.ForkResource
 
process(DataInputStream, DataOutputStream) - Method in class org.apache.tika.fork.InputStreamResource
 
processByte() - Method in class org.apache.tika.io.NullInputStream
Return a byte value for the read() method.
processBytes(byte[], int, int) - Method in class org.apache.tika.io.NullInputStream
Process the bytes for the read(byte[], offset, length) method.
processCommand(InputStream) - Method in class org.apache.tika.parser.gdal.GDALParser
 
PROCESSING_INSTRUCTION - Static variable in class org.apache.tika.fork.ContentHandlerProxy
 
processingInstruction(String, String) - Method in class org.apache.tika.fork.ContentHandlerProxy
 
processingInstruction(String, String) - Method in class org.apache.tika.sax.ContentHandlerDecorator
 
processingInstruction(String, String) - Method in class org.apache.tika.sax.TeeContentHandler
 
processingInstruction(String, String) - Method in class org.apache.tika.sax.xpath.MatchingContentHandler
 
processOutput(ContentHandler, Metadata, String) - Method in class org.apache.tika.parser.gdal.GDALParser
 
processRequests() - Method in class org.apache.tika.fork.ForkServer
 
profile - Variable in class org.apache.tika.language.LanguageProfilerBuilder.NGramEntry
The NGRamProfile this NGram is related to
profile - Variable in class org.apache.tika.language.ProfilingWriter
 
PROFILE_SUFFIX - Static variable in class org.apache.tika.language.LanguageIdentifier
 
PROFILES - Static variable in class org.apache.tika.language.LanguageIdentifier
The available language profiles.
ProfilingHandler - Class in org.apache.tika.language
SAX content handler that builds a language profile based on all the received character content.
ProfilingHandler(ProfilingWriter) - Constructor for class org.apache.tika.language.ProfilingHandler
 
ProfilingHandler(LanguageProfile) - Constructor for class org.apache.tika.language.ProfilingHandler
 
ProfilingHandler() - Constructor for class org.apache.tika.language.ProfilingHandler
 
ProfilingWriter - Class in org.apache.tika.language
Writer that builds a language profile based on all the written content.
ProfilingWriter(LanguageProfile) - Constructor for class org.apache.tika.language.ProfilingWriter
 
ProfilingWriter() - Constructor for class org.apache.tika.language.ProfilingWriter
 
PROGRAM_ID - Static variable in interface org.apache.tika.metadata.ClimateForcast
 
PROJECT_ID - Static variable in interface org.apache.tika.metadata.ClimateForcast
 
properties - Static variable in class org.apache.tika.metadata.Property
 
PROPERTIES_FILE - Static variable in class org.apache.tika.language.LanguageIdentifier
 
PROPERTIES_OVERRIDE_FILE - Static variable in class org.apache.tika.language.LanguageIdentifier
 
Property - Class in org.apache.tika.metadata
XMP property definition.
Property(String, boolean, Property.PropertyType, Property.ValueType, String[], Property, Property[]) - Constructor for class org.apache.tika.metadata.Property
 
Property(String, boolean, Property.PropertyType, Property.ValueType, String[]) - Constructor for class org.apache.tika.metadata.Property
 
Property(String, boolean, Property.ValueType, String[]) - Constructor for class org.apache.tika.metadata.Property
 
Property(String, boolean, Property.ValueType) - Constructor for class org.apache.tika.metadata.Property
 
Property(String, boolean, Property.PropertyType, Property.ValueType) - Constructor for class org.apache.tika.metadata.Property
 
property - Static variable in class org.apache.tika.metadata.XMPDM.ChannelTypePropertyConverter
Deprecated.
 
property - Variable in class org.apache.tika.parser.xml.AbstractMetadataHandler
 
property - Variable in class org.apache.tika.parser.xml.MetadataHandler
Deprecated.
 
property(String, String) - Method in class org.apache.tika.sax.XMPContentHandler
 
Property.PropertyType - Enum in org.apache.tika.metadata
 
Property.ValueType - Enum in org.apache.tika.metadata
 
PROPERTY_GROUP_IPTC_CORE - Static variable in interface org.apache.tika.metadata.IPTC
 
PROPERTY_GROUP_IPTC_EXT - Static variable in interface org.apache.tika.metadata.IPTC
 
PROPERTY_RELEASE_ID - Static variable in interface org.apache.tika.metadata.IPTC
Optional identifier associated with each Property Release.
PROPERTY_RELEASE_STATUS - Static variable in interface org.apache.tika.metadata.IPTC
Summarises the availability and scope of property releases authorizing usage of the properties appearing in the photograph.
propertyType - Variable in class org.apache.tika.metadata.Property
 
PropertyType() - Constructor for enum org.apache.tika.metadata.Property.PropertyType
 
PropertyTypeException - Exception in org.apache.tika.metadata
XMP property definition violation exception.
PropertyTypeException(String) - Constructor for exception org.apache.tika.metadata.PropertyTypeException
 
PropertyTypeException(Property.PropertyType, Property.PropertyType) - Constructor for exception org.apache.tika.metadata.PropertyTypeException
 
PropertyTypeException(Property.ValueType, Property.ValueType) - Constructor for exception org.apache.tika.metadata.PropertyTypeException
 
PropertyTypeException(Property.PropertyType) - Constructor for exception org.apache.tika.metadata.PropertyTypeException
 
props - Static variable in class org.apache.tika.language.LanguageIdentifier
 
PROTECTED - Static variable in interface org.apache.tika.metadata.TikaMetadataKeys
 
provider - Variable in class org.apache.tika.parser.CryptoParser
 
PROVINCE_OR_STATE - Static variable in interface org.apache.tika.metadata.IPTC
Name of the subregion of a country -- either called province or state or anything else -- the content is focussing on -- either the subregion shown in visual media or referenced by text or audio media.
ProxyInputStream - Class in org.apache.tika.io
A Proxy stream which acts as expected, that is it passes the method calls on to the proxied stream and doesn't change which methods are being called.
ProxyInputStream(InputStream) - Constructor for class org.apache.tika.io.ProxyInputStream
Constructs a new ProxyInputStream.
PUBLISHER - Static variable in interface org.apache.tika.metadata.DublinCore
An entity responsible for making the resource available.
PUBLISHER - Static variable in class org.apache.tika.metadata.Metadata
Deprecated.
use TikaCoreProperties#PUBLISHER
PUBLISHER - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 
PULL_DOWN - Static variable in interface org.apache.tika.metadata.XMPDM
"The sampling phase of film to be converted to video (pull-down)."

Q

QuickStringBuffer() - Constructor for class org.apache.tika.language.LanguageProfilerBuilder.QuickStringBuffer
 
QuickStringBuffer(char[]) - Constructor for class org.apache.tika.language.LanguageProfilerBuilder.QuickStringBuffer
 
QuickStringBuffer(int) - Constructor for class org.apache.tika.language.LanguageProfilerBuilder.QuickStringBuffer
 
QuickStringBuffer(String) - Constructor for class org.apache.tika.language.LanguageProfilerBuilder.QuickStringBuffer
 
quoteAssignmentValues - Variable in class org.apache.tika.embedder.ExternalEmbedder
 

R

rank - Variable in class org.apache.tika.config.ServiceLoader.RankedService
 
RankedService(Object, int) - Constructor for class org.apache.tika.config.ServiceLoader.RankedService
 
RATING - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 
RATING - Static variable in interface org.apache.tika.metadata.XMP
A user-assigned rating for this file.
ratio - Variable in class org.apache.tika.sax.SecureContentHandler
Maximum compression ratio.
RDF - Static variable in class org.apache.tika.sax.XMPContentHandler
The RDF namespace URI
read() - Method in class org.apache.tika.fork.InputStreamProxy
 
read(byte[], int, int) - Method in class org.apache.tika.fork.InputStreamProxy
 
read() - Method in class org.apache.tika.io.ClosedInputStream
Returns -1 to indicate that the stream is closed.
read(byte[]) - Method in class org.apache.tika.io.CountingInputStream
Reads a number of bytes into the byte array, keeping count of the number read.
read(byte[], int, int) - Method in class org.apache.tika.io.CountingInputStream
Reads a number of bytes into the byte array at a specific offset, keeping count of the number read.
read() - Method in class org.apache.tika.io.CountingInputStream
Reads the next byte of data adding to the count of bytes received if a byte is successfully read.
read() - Method in class org.apache.tika.io.LookaheadInputStream
 
read(byte[], int, int) - Method in class org.apache.tika.io.LookaheadInputStream
 
read() - Method in class org.apache.tika.io.NullInputStream
Read a byte.
read(byte[]) - Method in class org.apache.tika.io.NullInputStream
Read some bytes into the specified array.
read(byte[], int, int) - Method in class org.apache.tika.io.NullInputStream
Read the specified number bytes into an array.
read() - Method in class org.apache.tika.io.ProxyInputStream
Invokes the delegate's read() method.
read(byte[]) - Method in class org.apache.tika.io.ProxyInputStream
Invokes the delegate's read(byte[]) method.
read(byte[], int, int) - Method in class org.apache.tika.io.ProxyInputStream
Invokes the delegate's read(byte[], int, int) method.
read() - Method in class org.apache.tika.io.TailStream
This implementation adds the read byte to the internal tail buffer.
read(byte[]) - Method in class org.apache.tika.io.TailStream
This implementation delegates to the underlying stream and then adds the correct portion of the read buffer to the internal tail buffer.
read(byte[], int, int) - Method in class org.apache.tika.io.TailStream
This implementation delegates to the underlying stream and then adds the correct portion of the read buffer to the internal tail buffer.
read(InputStream) - Method in class org.apache.tika.mime.MimeTypesReader
 
read(Document) - Method in class org.apache.tika.mime.MimeTypesReader
 
read(InputStream) - Static method in class org.apache.tika.parser.external.ExternalParsersConfigReader
 
read(Document) - Static method in class org.apache.tika.parser.external.ExternalParsersConfigReader
 
read(Element) - Static method in class org.apache.tika.parser.external.ExternalParsersConfigReader
 
read(char[], int, int) - Method in class org.apache.tika.parser.ParsingReader
Reads parsed text from the pipe connected to the parsing thread.
read() - Method in class org.apache.tika.parser.utils.CommonsDigester.SimpleBoundedInputStream
 
read(byte[]) - Method in class org.apache.tika.parser.utils.CommonsDigester.SimpleBoundedInputStream
Invokes the delegate's read(byte[]) method.
read(byte[], int, int) - Method in class org.apache.tika.parser.utils.CommonsDigester.SimpleBoundedInputStream
Invokes the delegate's read(byte[], int, int) method.
read() - Method in class org.apache.tika.utils.RereadableInputStream
Reads a byte from the stream, saving it in the store if it is being read from the original stream.
readAMFData(DataInputStream, int) - Method in class org.apache.tika.parser.video.FLVParser
 
readAMFEcmaArray(DataInputStream) - Method in class org.apache.tika.parser.video.FLVParser
 
readAMFObject(DataInputStream) - Method in class org.apache.tika.parser.video.FLVParser
 
readAMFStrictArray(DataInputStream) - Method in class org.apache.tika.parser.video.FLVParser
 
readAMFString(DataInputStream) - Method in class org.apache.tika.parser.video.FLVParser
 
readByteFrequencies(InputStream) - Method in class org.apache.tika.detect.TrainedModelDetector
Read the inputstream and build a byte frequency histogram
readCharacters(DataInputStream) - Method in class org.apache.tika.fork.ContentHandlerResource
 
readCheckTagAndCheck(Element) - Static method in class org.apache.tika.parser.external.ExternalParsersConfigReader
 
readDescription(NNTrainedModelBuilder, String) - Method in class org.apache.tika.detect.NNExampleModelDetector
read the comments where the model configuration is written, e.g the number of inputs, hiddens and output please ensure the first char in the given string is # In this example grb model file, there are 4 elements 1) type 2) number of input units 3) number of hidden units.
reader - Variable in class org.apache.tika.parser.ParsingReader
Buffered read end of the pipe.
readIntBE(InputStream) - Static method in class org.apache.tika.io.EndianUtils
Get a BE int value from an InputStream
readIntLE(InputStream) - Static method in class org.apache.tika.io.EndianUtils
Get a LE int value from an InputStream
readlimit - Variable in class org.apache.tika.io.NullInputStream
 
readLines(InputStream) - Static method in class org.apache.tika.io.IOUtils
Get the contents of an InputStream as a list of Strings, one entry per line, using the default character encoding of the platform.
readLines(InputStream, String) - Static method in class org.apache.tika.io.IOUtils
Get the contents of an InputStream as a list of Strings, one entry per line, using the specified character encoding.
readLines(Reader) - Static method in class org.apache.tika.io.IOUtils
Get the contents of a Reader as a list of Strings, one entry per line.
readLongBE(InputStream) - Static method in class org.apache.tika.io.EndianUtils
Get a NE long value from an InputStream
readLongLE(InputStream) - Static method in class org.apache.tika.io.EndianUtils
Get a LE long value from an InputStream
readMagicHeader(InputStream) - Method in class org.apache.tika.mime.MimeTypes
Reads the first MimeTypes.getMinLength() bytes from the given stream.
readMetadataPatterns(Element) - Static method in class org.apache.tika.parser.external.ExternalParsersConfigReader
 
readMimeTypes(Element) - Static method in class org.apache.tika.parser.external.ExternalParsersConfigReader
 
readNNParams(NNTrainedModelBuilder, String) - Method in class org.apache.tika.detect.NNExampleModelDetector
Read the next line for the model parameters and populate the build which later will be used to instantiate the instance of TrainedModel
readObject(DataInputStream, ClassLoader) - Static method in class org.apache.tika.fork.ForkObjectInputStream
Deserializes an object from the given stream.
readObject(ClassLoader) - Method in class org.apache.tika.fork.ForkServer
Deserializes an object from the given stream.
readParser(Element) - Static method in class org.apache.tika.parser.external.ExternalParsersConfigReader
Builds and Returns an ExternalParser, or null if a check command was given that didn't match.
readShortBE(InputStream) - Static method in class org.apache.tika.io.EndianUtils
Get a BE short value from an InputStream
readShortLE(InputStream) - Static method in class org.apache.tika.io.EndianUtils
Get a LE short value from an InputStream
readStream() - Method in class org.apache.tika.fork.ClassLoaderProxy
 
readString(DataInputStream) - Method in class org.apache.tika.fork.ContentHandlerResource
 
readToEndOfStreamOnFirstRewind - Variable in class org.apache.tika.utils.RereadableInputStream
Specifies whether or not to read to the end of stream on first rewind.
readUE7(InputStream) - Static method in class org.apache.tika.io.EndianUtils
Gets the integer value that is stored in UTF-8 like fashion, in Big Endian but with the high bit on each number indicating if it continues or not
readUInt24(DataInputStream) - Method in class org.apache.tika.parser.video.FLVParser
 
readUInt32(DataInputStream) - Method in class org.apache.tika.parser.video.FLVParser
 
readUShortBE(InputStream) - Static method in class org.apache.tika.io.EndianUtils
 
readUShortLE(InputStream) - Static method in class org.apache.tika.io.EndianUtils
 
READY - Static variable in class org.apache.tika.fork.ForkServer
 
REALIZATION - Static variable in interface org.apache.tika.metadata.ClimateForcast
 
reallyEndDocument() - Method in class org.apache.tika.sax.EndDocumentShieldingContentHandler
 
recordMark(String) - Method in class org.apache.tika.parser.iwork.PagesContentHandler.Footnotes
Normally happens before the text of the mark
records - Static variable in class org.apache.tika.fork.MemoryURLStreamHandler
 
RecursiveParser(ContainerExtractor, EmbeddedResourceHandler) - Constructor for class org.apache.tika.extractor.ParserContainerExtractor.RecursiveParser
 
RecursiveParserWrapper - Class in org.apache.tika.parser
This is a helper class that wraps a parser in a recursive handler.
RecursiveParserWrapper(Parser, ContentHandlerFactory) - Constructor for class org.apache.tika.parser.RecursiveParserWrapper
Initialize the wrapper with RecursiveParserWrapper.catchEmbeddedExceptions set to true as default.
RecursiveParserWrapper(Parser, ContentHandlerFactory, boolean) - Constructor for class org.apache.tika.parser.RecursiveParserWrapper
Initialize the wrapper.
RecursiveParserWrapper.EmbeddedParserDecorator - Class in org.apache.tika.parser
 
REFERENCES - Static variable in interface org.apache.tika.metadata.ClimateForcast
 
RegexUtils - Class in org.apache.tika.utils
Inspired from Nutch code class OutlinkExtractor.
RegexUtils() - Constructor for class org.apache.tika.utils.RegexUtils
 
registerModels(MediaType, TrainedModel) - Method in class org.apache.tika.detect.TrainedModelDetector
 
registry - Variable in class org.apache.tika.detect.CompositeDetector
 
registry - Variable in class org.apache.tika.mime.MediaTypeRegistry
Registry of known media types, including type aliases.
registry - Variable in class org.apache.tika.mime.MimeTypes
Registered media types and their aliases.
registry - Variable in class org.apache.tika.mime.Patterns
 
registry - Variable in class org.apache.tika.parser.CompositeParser
Media type registry.
REGISTRY_ENTRY_CREATED_ITEM_ID - Static variable in interface org.apache.tika.metadata.IPTC
A unique identifier created by a registry and applied by the creator of the item.
REGISTRY_ENTRY_CREATED_ORGANISATION_ID - Static variable in interface org.apache.tika.metadata.IPTC
An identifier for the registry which issued the corresponding Registry Image Id.
rel - Variable in class org.apache.tika.sax.Link
 
rel - Variable in class org.apache.tika.sax.LinkBuilder
 
RELATION - Static variable in interface org.apache.tika.metadata.DublinCore
A reference to a related resource.
RELATION - Static variable in class org.apache.tika.metadata.Metadata
Deprecated.
use TikaCoreProperties#RELATION
RELATION - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 
RELATIVE_PEAK_AUDIO_FILE_PATH - Static variable in interface org.apache.tika.metadata.XMPDM
"The relative path to the file's peak audio file.
RELEASE_DATE - Static variable in interface org.apache.tika.metadata.XMPDM
"The date the title was released."
releaseClient(ForkClient, boolean) - Method in class org.apache.tika.fork.ForkParser
 
remove(String) - Method in class org.apache.tika.metadata.Metadata
Remove a metadata and all its associated values.
removedService(ServiceReference, Object) - Method in class org.apache.tika.config.TikaActivator
 
removeService(Object) - Static method in class org.apache.tika.config.ServiceLoader
 
RENDITION_CLASS - Static variable in interface org.apache.tika.metadata.XMPMM
The rendition class name for this resource.
RENDITION_PARAMS - Static variable in interface org.apache.tika.metadata.XMPMM
Can be used to provide additional rendition parameters that are too complex or verbose to encode in xmpMM:RenditionClass
REPLACEMENT - Static variable in class org.apache.tika.sax.SafeContentHandler
Replacement for invalid characters.
replaceTailBuffer(byte[], int, int) - Method in class org.apache.tika.io.TailStream
Replaces the content of the internal tail buffer by the last portion of the given buffer.
RereadableInputStream - Class in org.apache.tika.utils
Wraps an input stream, reading it only once, but making it available for rereading an arbitrary number of times.
RereadableInputStream(InputStream, int, boolean, boolean) - Constructor for class org.apache.tika.utils.RereadableInputStream
Creates a rereadable input stream.
RESERVED - Static variable in class org.apache.tika.io.FilenameUtils
 
RESERVED_FILENAME_CHARACTERS - Static variable in class org.apache.tika.io.FilenameUtils
Reserved characters
reset() - Method in class org.apache.tika.fork.ForkServer
 
reset() - Method in class org.apache.tika.io.LookaheadInputStream
 
reset() - Method in class org.apache.tika.io.NullInputStream
Reset the stream to the point when mark was last called.
reset() - Method in class org.apache.tika.io.ProxyInputStream
Invokes the delegate's reset() method.
reset() - Method in class org.apache.tika.io.TailStream
This implementation restores this stream's state to the state when ''mark()'' was called the last time.
reset() - Method in class org.apache.tika.io.TikaInputStream
 
reset() - Method in class org.apache.tika.parser.RecursiveParserWrapper
reset() - Method in class org.apache.tika.parser.utils.CommonsDigester.SimpleBoundedInputStream
 
resetByteCount() - Method in class org.apache.tika.io.CountingInputStream
Set the byte count back to 0.
resetCount() - Method in class org.apache.tika.io.CountingInputStream
Set the byte count back to 0.
RESOLUTION_HORIZONTAL - Static variable in interface org.apache.tika.metadata.TIFF
"Horizontal resolution in pixels per unit."
RESOLUTION_UNIT - Static variable in interface org.apache.tika.metadata.TIFF
"Units used for Horizontal and Vertical Resolutions." One of "Inch" or "cm"
RESOLUTION_VERTICAL - Static variable in interface org.apache.tika.metadata.TIFF
"Vertical resolution in pixels per unit."
resolveClass(ObjectStreamClass) - Method in class org.apache.tika.fork.ForkObjectInputStream
Loads the identified class from the specified class loader.
resolveEntity(String, String) - Method in class org.apache.tika.mime.MimeTypesReader
 
resolveEntity(String, String) - Method in class org.apache.tika.sax.OfflineContentHandler
Returns an empty stream.
resolveMetadataKey(String) - Method in class org.apache.tika.parser.iwork.NumbersContentHandler
 
resolveMetaDataKey(String) - Method in class org.apache.tika.parser.iwork.PagesContentHandler
Returns a resolved key that is common in other document types or returns the specified metaDataLocalName if no common key could be found.
resource - Variable in class org.apache.tika.fork.ClassLoaderProxy
 
resource - Variable in class org.apache.tika.fork.ContentHandlerProxy
 
RESOURCE - Static variable in class org.apache.tika.fork.ForkServer
 
resource - Variable in class org.apache.tika.fork.InputStreamProxy
 
RESOURCE_NAME_KEY - Static variable in interface org.apache.tika.metadata.TikaMetadataKeys
 
resources - Variable in class org.apache.tika.fork.ForkClient
 
resources - Variable in class org.apache.tika.io.TemporaryResources
Tracked resources in LIFO order.
REVISION - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLCore
The revision number.
REVISION_NUMBER - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
rewind() - Method in class org.apache.tika.utils.RereadableInputStream
"Rewinds" the stream to the beginning for rereading.
RIGHTS - Static variable in interface org.apache.tika.metadata.DublinCore
Information about rights held in and over the resource.
RIGHTS - Static variable in class org.apache.tika.metadata.Metadata
Deprecated.
use TikaCoreProperties#RIGHTS
RIGHTS - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 
RIGHTS_USAGE_TERMS - Static variable in interface org.apache.tika.metadata.IPTC
The licensing parameters of the item expressed in free-text.
ROOT_XML_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
 
rootElement - Variable in class org.apache.tika.detect.XmlRootExtractor.ExtractorHandler
 
rootMediaType - Variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
 
rootMimeType - Variable in class org.apache.tika.mime.MimeTypes
Root type, application/octet-stream.
rootMimeTypeL - Variable in class org.apache.tika.mime.MimeTypes
 
rootXML - Variable in class org.apache.tika.mime.MimeType
The root-XML associated to this Mime-Type
RootXML(MimeType, String, String) - Constructor for class org.apache.tika.mime.MimeType.RootXML
 
RTF_PICT_META_PREFIX - Static variable in interface org.apache.tika.metadata.RTFMetadata
 
RTFMetadata - Interface in org.apache.tika.metadata
 
run() - Method in class org.apache.tika.fork.ForkServer
 
run() - Method in class org.apache.tika.parser.NetworkParser.ParsingTask
 
run() - Method in class org.apache.tika.parser.ParsingReader.ParsingTask
Parses the given binary stream and writes the text content to the write end of the pipe.

S

SafeContentHandler - Class in org.apache.tika.sax
Content handler decorator that makes sure that the character events (SafeContentHandler.characters(char[], int, int) or SafeContentHandler.ignorableWhitespace(char[], int, int)) passed to the decorated content handler contain only valid XML characters.
SafeContentHandler(ContentHandler) - Constructor for class org.apache.tika.sax.SafeContentHandler
 
SafeContentHandler.Output - Interface in org.apache.tika.sax
Internal interface that allows both character and ignorable whitespace content to be filtered the same way.
SafeContentHandler.StringOutput - Class in org.apache.tika.sax
 
SAMPLES_PER_PIXEL - Static variable in interface org.apache.tika.metadata.TIFF
"Number of components per pixel."
save(OutputStream) - Method in class org.apache.tika.language.LanguageProfilerBuilder
Writes NGramProfile content into OutputStream, content is outputted with UTF-8 encoding
SAVE_DATE - Static variable in interface org.apache.tika.metadata.Office
When was the document last saved?
saveByte(int) - Method in class org.apache.tika.utils.RereadableInputStream
Saves the byte read from the original stream to the store.
SCALE_TYPE - Static variable in interface org.apache.tika.metadata.XMPDM
"The musical scale used in the music.
scanFormat(InputStream) - Method in class org.apache.tika.parser.iptc.IptcAnpaParser
 
SCENE - Static variable in interface org.apache.tika.metadata.XMPDM
"The name of the scene."
SCENE_CODE - Static variable in interface org.apache.tika.metadata.IPTC
Describes the scene of a news content.
SCHEME - Static variable in interface org.apache.tika.metadata.XMPIdq
A qualifier providing the name of the formal identification scheme used for an item in the xmp:Identifier array.
secondaryExtractProperties - Variable in class org.apache.tika.metadata.Property
 
SecureContentHandler - Class in org.apache.tika.sax
Content handler decorator that attempts to prevent denial of service attacks against Tika parsers.
SecureContentHandler(ContentHandler, TikaInputStream) - Constructor for class org.apache.tika.sax.SecureContentHandler
Decorates the given content handler with zip bomb prevention based on the count of bytes read from the given counting input stream.
SecureContentHandler.SecureSAXException - Exception in org.apache.tika.sax
Private exception class used to indicate a suspected zip bomb.
SecureSAXException(String) - Constructor for exception org.apache.tika.sax.SecureContentHandler.SecureSAXException
 
SECURITY - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
select(Metadata) - Method in interface org.apache.tika.extractor.DocumentSelector
Checks if a document with the given metadata matches the specified selection criteria.
semicolon - Variable in class org.apache.tika.mime.MediaType
Location of the first ";" character separating the type part of MediaType.string from possible parameters.
sendCharacters(char[], int, int) - Method in class org.apache.tika.fork.ContentHandlerProxy
 
sendInput(Process, InputStream) - Method in class org.apache.tika.parser.external.ExternalParser
Starts a thread that sends the contents of the given input stream to the standard input stream of the given process.
sendInputStreamToStdIn(InputStream, Process) - Method in class org.apache.tika.embedder.ExternalEmbedder
Sends the contents of the given input stream to the standard input of the given process.
sendObject(Object, List<ForkResource>) - Method in class org.apache.tika.fork.ForkClient
Serializes the object first into an in-memory buffer and then writes it to the output stream with a preceding size integer.
sendObject(Object, DataOutputStream) - Static method in class org.apache.tika.fork.ForkObjectInputStream
Serializes the object first into an in-memory buffer and then writes it to the output stream with a preceding size integer.
sendRequest(int) - Method in class org.apache.tika.fork.ContentHandlerProxy
 
sendStdErrToOutputStream(Process, OutputStream) - Method in class org.apache.tika.embedder.ExternalEmbedder
Starts a thread that reads and discards the contents of the standard stream of the given process.
sendStdOutToOutputStream(Process, OutputStream) - Method in class org.apache.tika.embedder.ExternalEmbedder
Sends the standard output of the given process to the given output stream.
sendString(String) - Method in class org.apache.tika.fork.ContentHandlerProxy
 
SEP_CHARSEQ - Static variable in class org.apache.tika.language.LanguageProfilerBuilder
The String form of the separator char
SEPARATOR - Static variable in class org.apache.tika.language.LanguageProfilerBuilder
separator char
seq - Variable in class org.apache.tika.language.LanguageProfilerBuilder.NGramEntry
The sequence of characters of the ngram
serialize(Metadata, Type, JsonSerializationContext) - Method in class org.apache.tika.metadata.serialization.JsonMetadataSerializer
Serializes a Metadata object into effectively Map.
serializeMetadata(List<String>) - Static method in class org.apache.tika.embedder.ExternalEmbedder
Serializes a collection of metadata command line arguments into a single string.
serialVersionUID - Static variable in class org.apache.tika.detect.CompositeDetector
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.detect.DefaultDetector
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.detect.DefaultProbDetector
 
serialVersionUID - Static variable in class org.apache.tika.detect.NNExampleModelDetector
 
serialVersionUID - Static variable in class org.apache.tika.detect.TextDetector
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.detect.TrainedModelDetector
 
serialVersionUID - Static variable in class org.apache.tika.embedder.ExternalEmbedder
 
serialVersionUID - Static variable in class org.apache.tika.extractor.ParserContainerExtractor
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.fork.ClassLoaderProxy
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.fork.ContentHandlerProxy
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.fork.ForkParser
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.fork.InputStreamProxy
Serial version UID
serialVersionUID - Static variable in exception org.apache.tika.io.EndianUtils.BufferUnderrunException
 
serialVersionUID - Static variable in exception org.apache.tika.io.IOExceptionWithCause
Defines the serial version UID.
serialVersionUID - Static variable in class org.apache.tika.metadata.Metadata
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.mime.MediaType
Serial version UID.
serialVersionUID - Static variable in class org.apache.tika.mime.MediaTypeRegistry
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.mime.MimeType.RootXML
Serial version UID.
serialVersionUID - Static variable in class org.apache.tika.mime.MimeType
Serial version UID.
serialVersionUID - Static variable in class org.apache.tika.mime.MimeTypes
Serial version UID.
serialVersionUID - Static variable in class org.apache.tika.mime.Patterns.LengthComparator
Serial version UID.
serialVersionUID - Static variable in class org.apache.tika.mime.Patterns
Serial version UID.
serialVersionUID - Static variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
 
serialVersionUID - Static variable in class org.apache.tika.parser.AbstractParser
Serial version UID.
serialVersionUID - Static variable in class org.apache.tika.parser.audio.AudioParser
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.parser.audio.MidiParser
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.parser.AutoDetectParser
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.parser.CompositeParser
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.parser.CryptoParser
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.parser.DefaultParser
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.parser.EmptyParser
Serial version UID.
serialVersionUID - Static variable in class org.apache.tika.parser.envi.EnviHeaderParser
 
serialVersionUID - Static variable in class org.apache.tika.parser.epub.EpubParser
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.parser.ErrorParser
 
serialVersionUID - Static variable in class org.apache.tika.parser.external.CompositeExternalParser
 
serialVersionUID - Static variable in class org.apache.tika.parser.external.ExternalParser
 
serialVersionUID - Static variable in class org.apache.tika.parser.feed.FeedParser
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.parser.gdal.GDALParser
 
serialVersionUID - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.parser.iwork.IWorkPackageParser
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.parser.ParseContext
Serial version UID.
serialVersionUID - Static variable in class org.apache.tika.parser.ParserDecorator
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.parser.RecursiveParserWrapper.EmbeddedParserDecorator
 
serialVersionUID - Static variable in class org.apache.tika.parser.RecursiveParserWrapper
Generated serial version
serialVersionUID - Static variable in class org.apache.tika.parser.strings.FileConfig
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.parser.strings.Latin1StringsParser
 
serialVersionUID - Static variable in class org.apache.tika.parser.strings.StringsConfig
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.parser.strings.StringsParser
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.parser.video.FLVParser
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.parser.xml.DcXMLParser
Serial version UID
serialVersionUID - Static variable in class org.apache.tika.parser.xml.FictionBookParser
 
serialVersionUID - Static variable in class org.apache.tika.parser.xml.XMLParser
Serial version UID
serialVersionUID - Static variable in exception org.apache.tika.sax.SecureContentHandler.SecureSAXException
Serial version UID.
serialVersionUID - Static variable in exception org.apache.tika.sax.WriteOutContentHandler.WriteLimitReachedException
Serial version UID
service - Variable in class org.apache.tika.config.ServiceLoader.RankedService
 
ServiceLoader - Class in org.apache.tika.config
Internal utility class that Tika uses to look up service providers.
ServiceLoader(ClassLoader, LoadErrorHandler, boolean) - Constructor for class org.apache.tika.config.ServiceLoader
 
ServiceLoader(ClassLoader, LoadErrorHandler) - Constructor for class org.apache.tika.config.ServiceLoader
 
ServiceLoader(ClassLoader) - Constructor for class org.apache.tika.config.ServiceLoader
 
ServiceLoader() - Constructor for class org.apache.tika.config.ServiceLoader
 
serviceLoader - Variable in class org.apache.tika.config.TikaConfig
 
ServiceLoader.RankedService - Class in org.apache.tika.config
 
serviceLoaderFromDomElement(Element, ClassLoader) - Static method in class org.apache.tika.config.TikaConfig
 
ServiceLoaderUtils - Class in org.apache.tika.utils
Service Loading and Ordering related utils
ServiceLoaderUtils() - Constructor for class org.apache.tika.utils.ServiceLoaderUtils
 
services - Static variable in class org.apache.tika.config.ServiceLoader
The dynamic set of services available in an OSGi environment.
set(String, String) - Method in class org.apache.tika.metadata.Metadata
Set metadata name/value.
set(Property, String) - Method in class org.apache.tika.metadata.Metadata
Sets the value of the identified metadata property.
set(Property, String[]) - Method in class org.apache.tika.metadata.Metadata
Sets the values of the identified metadata property.
set(Property, int) - Method in class org.apache.tika.metadata.Metadata
Sets the integer value of the identified metadata property.
set(Property, double) - Method in class org.apache.tika.metadata.Metadata
Sets the real or rational value of the identified metadata property.
set(Property, Date) - Method in class org.apache.tika.metadata.Metadata
Sets the date value of the identified metadata property.
set(Property, Calendar) - Method in class org.apache.tika.metadata.Metadata
Sets the date value of the identified metadata property.
set(MediaType...) - Static method in class org.apache.tika.mime.MediaType
Convenience method that returns an unmodifiable set that contains all the given media types.
set(String...) - Static method in class org.apache.tika.mime.MediaType
Convenience method that parses the given media type strings and returns an unmodifiable set that contains all the parsed types.
set(Class<T>, T) - Method in class org.apache.tika.parser.ParseContext
Adds the given value to the context as an implementation of the given interface.
setAcronym(String) - Method in class org.apache.tika.mime.MimeType
Set an acronym for the mime type
setAll(Properties) - Method in class org.apache.tika.metadata.Metadata
Copy All key-value pairs from properties.
setCommand(String...) - Method in class org.apache.tika.embedder.ExternalEmbedder
Sets the command to be run.
setCommand(String...) - Method in class org.apache.tika.parser.external.ExternalParser
Sets the command to be run.
setCommand(String) - Method in class org.apache.tika.parser.gdal.GDALParser
 
setCommandAppendOperator(String) - Method in class org.apache.tika.embedder.ExternalEmbedder
Sets the operator to append rather than replace a value for the command line tool, i.e.
setCommandAssignmentDelimeter(String) - Method in class org.apache.tika.embedder.ExternalEmbedder
Sets the delimiter for multiple assignments for the command line tool, i.e.
setCommandAssignmentOperator(String) - Method in class org.apache.tika.embedder.ExternalEmbedder
Sets the assignment operator for the command line tool, i.e.
setContentHandler(ContentHandler) - Method in class org.apache.tika.sax.ContentHandlerDecorator
Sets the underlying content handler.
setContentParser(Parser) - Method in class org.apache.tika.parser.epub.EpubParser
 
setContextClassLoader(ClassLoader) - Static method in class org.apache.tika.config.ServiceLoader
Sets the context class loader to use for all threads that access this class.
setCorePoolSize(int) - Method in interface org.apache.tika.concurrent.ConfigurableThreadPoolExecutor
 
setDescription(String) - Method in class org.apache.tika.mime.MimeType
Set the description of this media type.
setDetector(Detector) - Method in class org.apache.tika.parser.AutoDetectParser
Sets the type detector used by this parser to auto-detect the type of a document.
setDocumentLocator(Locator) - Method in class org.apache.tika.fork.ContentHandlerProxy
 
setDocumentLocator(Locator) - Method in class org.apache.tika.sax.ContentHandlerDecorator
 
setDocumentLocator(Locator) - Method in class org.apache.tika.sax.DIFContentHandler
 
setDocumentLocator(Locator) - Method in class org.apache.tika.sax.TeeContentHandler
 
setDocumentLocator(Locator) - Method in class org.apache.tika.sax.TextContentHandler
 
setEncoding(StringsEncoding) - Method in class org.apache.tika.parser.strings.StringsConfig
Sets the character encoding of the strings that are to be found.
setFallback(Parser) - Method in class org.apache.tika.parser.CompositeParser
Sets the fallback parser.
setFilePath(String) - Method in class org.apache.tika.parser.strings.FileConfig
Sets the "file" installation folder.
setFormat(int) - Method in class org.apache.tika.parser.iptc.IptcAnpaParser
 
setGson(Gson) - Static method in class org.apache.tika.metadata.serialization.JsonMetadata
Enables setting custom configurations on Gson.
setGson(Gson) - Static method in class org.apache.tika.metadata.serialization.JsonMetadataList
Enables setting custom configurations on Gson.
setJavaCommand(List<String>) - Method in class org.apache.tika.fork.ForkParser
Sets the command used to start the forked server process.
setJavaCommand(String) - Method in class org.apache.tika.fork.ForkParser
Deprecated.
since 1.8
setMaxEmbeddedResources(int) - Method in class org.apache.tika.parser.RecursiveParserWrapper
Set the maximum number of embedded resources to store.
setMaximumCompressionRatio(long) - Method in class org.apache.tika.sax.SecureContentHandler
Sets the ratio between output characters and input bytes.
setMaximumDepth(int) - Method in class org.apache.tika.sax.SecureContentHandler
Sets the maximum XML element nesting level.
setMaximumPackageEntryDepth(int) - Method in class org.apache.tika.sax.SecureContentHandler
Sets the maximum package entry nesting level.
setMaximumPoolSize(int) - Method in interface org.apache.tika.concurrent.ConfigurableThreadPoolExecutor
 
setMaxStringLength(int) - Method in class org.apache.tika.Tika
Sets the maximum length of strings returned by the parseToString methods.
setMediaTypeRegistry(MediaTypeRegistry) - Method in class org.apache.tika.parser.CompositeParser
Sets the media type registry used to infer type relationships.
setMetadata(Metadata, HashMap<String, String>) - Method in class org.apache.tika.parser.iptc.IptcAnpaParser
 
setMetadataCommandArguments(Map<Property, String[]>) - Method in class org.apache.tika.embedder.ExternalEmbedder
Sets the map of Metadata keys to command line parameters.
setMetadataExtractionPatterns(Map<Pattern, String>) - Method in class org.apache.tika.parser.external.ExternalParser
Sets the map of regular expression patterns and Metadata keys.
setMetaParser(Parser) - Method in class org.apache.tika.parser.epub.EpubParser
 
setMimetype(boolean) - Method in class org.apache.tika.parser.strings.FileConfig
Sets the mime option.
setMinLength(int) - Method in class org.apache.tika.parser.strings.StringsConfig
Sets the minimum sequence length (characters) to print.
setMinSize(int) - Method in class org.apache.tika.parser.strings.Latin1StringsParser
Sets the minimum size of a character sequence to be extracted.
setNumOfHidden(int) - Method in class org.apache.tika.detect.NNTrainedModelBuilder
 
setNumOfInputs(int) - Method in class org.apache.tika.detect.NNTrainedModelBuilder
 
setNumOfOutputs(int) - Method in class org.apache.tika.detect.NNTrainedModelBuilder
 
setOpenContainer(Object) - Method in class org.apache.tika.io.TikaInputStream
Stores the open container object against the stream, eg after a Zip contents detector has loaded the file to decide what it contains.
setOutputThreshold(long) - Method in class org.apache.tika.sax.SecureContentHandler
Sets the threshold for output characters before the zip bomb prevention is activated.
setParams(float[]) - Method in class org.apache.tika.detect.NNTrainedModelBuilder
 
setParsers(Map<MediaType, Parser>) - Method in class org.apache.tika.parser.CompositeParser
Sets the component parsers.
setPoolSize(int) - Method in class org.apache.tika.fork.ForkParser
Sets the size of the process pool.
setPrettyPrinting(boolean) - Static method in class org.apache.tika.metadata.serialization.JsonMetadata
 
setPrettyPrinting(boolean) - Static method in class org.apache.tika.metadata.serialization.JsonMetadataList
 
setProfile(LanguageProfilerBuilder) - Method in class org.apache.tika.language.LanguageProfilerBuilder.NGramEntry
Associated a profile to this ngram
setQuoteAssignmentValues(boolean) - Method in class org.apache.tika.embedder.ExternalEmbedder
Sets whether or not to quote assignment values, i.e.
setRel(String) - Method in class org.apache.tika.sax.LinkBuilder
 
setStringsPath(String) - Method in class org.apache.tika.parser.strings.StringsConfig
Sets the "strings" installation folder.
setSuperType(MimeType, MediaType) - Method in class org.apache.tika.mime.MimeTypes
 
setSupportedEmbedTypes(Set<MediaType>) - Method in class org.apache.tika.embedder.ExternalEmbedder
 
setSupportedTypes(Set<MediaType>) - Method in class org.apache.tika.parser.external.ExternalParser
 
setTemporaryFileDirectory(Path) - Method in class org.apache.tika.io.TemporaryResources
Sets the directory to be used for the temporary files created by the TemporaryResources.createTempFile() method.
setTemporaryFileDirectory(File) - Method in class org.apache.tika.io.TemporaryResources
Sets the directory to be used for the temporary files created by the TemporaryResources.createTempFile() method.
setTimeout(int) - Method in class org.apache.tika.parser.strings.StringsConfig
Sets the maximum time (in seconds) to wait for the "strings" command to terminate.
setTitle(String) - Method in class org.apache.tika.sax.LinkBuilder
 
setType(MediaType) - Method in class org.apache.tika.detect.NNTrainedModelBuilder
 
setUniformTypeIdentifier(String) - Method in class org.apache.tika.mime.MimeType
Set The Uniform Type Identifier
setURI(String) - Method in class org.apache.tika.sax.LinkBuilder
 
SHOT_DATE - Static variable in interface org.apache.tika.metadata.XMPDM
"The date and time when the video was shot."
SHOT_LOCATION - Static variable in interface org.apache.tika.metadata.XMPDM
"The name of the location where the video was shot.
SHOT_NAME - Static variable in interface org.apache.tika.metadata.XMPDM
"The name of the shot or take."
shouldParseEmbedded(Metadata) - Method in interface org.apache.tika.extractor.EmbeddedDocumentExtractor
 
shouldParseEmbedded(Metadata) - Method in class org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor
 
SIMPLE_TYPES - Static variable in class org.apache.tika.mime.MediaType
Set of basic types with normalized "type/subtype" names.
SimpleBoundedInputStream(long, InputStream) - Constructor for class org.apache.tika.parser.utils.CommonsDigester.SimpleBoundedInputStream
 
SimpleThreadPoolExecutor - Class in org.apache.tika.concurrent
Simple Thread Pool Executor
SimpleThreadPoolExecutor() - Constructor for class org.apache.tika.concurrent.SimpleThreadPoolExecutor
 
size - Variable in class org.apache.tika.io.NullInputStream
 
size - Variable in class org.apache.tika.language.LanguageProfile.Interleaved
 
size() - Method in class org.apache.tika.language.LanguageProfilerBuilder.NGramEntry
Returns the size of this ngram
size() - Method in class org.apache.tika.metadata.Metadata
Returns the number of metadata names in this metadata.
size() - Method in class org.apache.tika.mime.AndClause
 
size() - Method in interface org.apache.tika.mime.Clause
Returns the size of this clause.
size() - Method in class org.apache.tika.mime.Magic
 
size() - Method in class org.apache.tika.mime.MagicMatch
 
size() - Method in class org.apache.tika.mime.OrClause
 
size - Variable in class org.apache.tika.utils.RereadableInputStream
The total number of bytes read from the original stream at the time.
skip(long) - Method in class org.apache.tika.io.CountingInputStream
Skips the stream over the specified number of bytes, adding the skipped amount to the count.
skip(long) - Method in class org.apache.tika.io.LookaheadInputStream
 
skip(long) - Method in class org.apache.tika.io.NullInputStream
Skip a specified number of bytes.
skip(long) - Method in class org.apache.tika.io.ProxyInputStream
Invokes the delegate's skip(long) method.
skip(long) - Method in class org.apache.tika.io.TailStream
This implementation delegates to the read() method to ensure that the tail buffer is also filled if data is skipped.
skip(long) - Method in class org.apache.tika.io.TikaInputStream
 
skip(long) - Method in class org.apache.tika.parser.utils.CommonsDigester.SimpleBoundedInputStream
Invokes the delegate's skip(long) method.
SKIP_SIZE - Static variable in class org.apache.tika.io.TailStream
Constant for the default skip buffer size.
SKIPPED_ENTITY - Static variable in class org.apache.tika.fork.ContentHandlerProxy
 
skippedEntity(String) - Method in class org.apache.tika.fork.ContentHandlerProxy
 
skippedEntity(String) - Method in class org.apache.tika.sax.ContentHandlerDecorator
 
skippedEntity(String) - Method in class org.apache.tika.sax.TeeContentHandler
 
skippedEntity(String) - Method in class org.apache.tika.sax.xpath.MatchingContentHandler
 
SL - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
slash - Variable in class org.apache.tika.mime.MediaType
Location of the "/" character separating the type and the subtype tokens in MediaType.string.
SLIDE_COUNT - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
SLIDE_COUNT - Static variable in interface org.apache.tika.metadata.Office
The number of Slides are there in the (presentation) document
slPageCount - Variable in class org.apache.tika.parser.iwork.PagesContentHandler
 
SOFTWARE - Static variable in interface org.apache.tika.metadata.TIFF
"Software or firmware used to generate the image."
SOH - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
sorted - Variable in class org.apache.tika.language.LanguageProfilerBuilder
The NGrams of this profile sorted on the number of occurrences
SortedJsonMetadataSerializer() - Constructor for class org.apache.tika.metadata.serialization.JsonMetadataBase.SortedJsonMetadataSerializer
 
sortLoadedClasses(List<T>) - Static method in class org.apache.tika.utils.ServiceLoaderUtils
Sorts a list of loaded classes, so that non-Tika ones come before Tika ones, and otherwise in reverse alphabetical order
SOURCE - Static variable in interface org.apache.tika.metadata.ClimateForcast
 
SOURCE - Static variable in interface org.apache.tika.metadata.DublinCore
A reference to a resource from which the present resource is derived.
SOURCE - Static variable in interface org.apache.tika.metadata.IPTC
Identifies the original owner of the copyright for the intellectual content of the item.
SOURCE - Static variable in class org.apache.tika.metadata.Metadata
Deprecated.
use TikaCoreProperties#SOURCE
SOURCE - Static variable in interface org.apache.tika.metadata.Photoshop
 
SOURCE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 
SP - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
SPACE - Static variable in class org.apache.tika.sax.TextContentHandler
 
SPEAKER_PLACEMENT - Static variable in interface org.apache.tika.metadata.XMPDM
"A description of the speaker angles from center front in degrees.
SPECIAL - Static variable in class org.apache.tika.mime.MediaType
 
SPECIAL_OR_WHITESPACE - Static variable in class org.apache.tika.mime.MediaType
 
square(double) - Method in class org.apache.tika.language.LanguageProfile
 
SR - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
start(BundleContext) - Method in class org.apache.tika.config.TikaActivator
 
start(BundleContext) - Method in class org.apache.tika.parser.internal.Activator
 
start(String) - Method in class org.apache.tika.parser.iwork.PagesContentHandler.Annotations
 
START_DOCUMENT - Static variable in class org.apache.tika.fork.ContentHandlerProxy
 
START_ELEMENT - Static variable in class org.apache.tika.fork.ContentHandlerProxy
 
START_PREFIX_MAPPING - Static variable in class org.apache.tika.fork.ContentHandlerProxy
 
startDescription(String, String, String) - Method in class org.apache.tika.sax.XMPContentHandler
 
startDocument() - Method in class org.apache.tika.fork.ContentHandlerProxy
 
startDocument() - Method in class org.apache.tika.sax.ContentHandlerDecorator
 
startDocument() - Method in class org.apache.tika.sax.DIFContentHandler
 
startDocument() - Method in class org.apache.tika.sax.EmbeddedContentHandler
Ignored.
startDocument() - Method in class org.apache.tika.sax.ExpandedTitleContentHandler
 
startDocument() - Method in class org.apache.tika.sax.TeeContentHandler
 
startDocument() - Method in class org.apache.tika.sax.TextContentHandler
 
startDocument() - Method in class org.apache.tika.sax.ToHTMLContentHandler
 
startDocument() - Method in class org.apache.tika.sax.ToXMLContentHandler
Writes the XML prefix.
startDocument() - Method in class org.apache.tika.sax.XHTMLContentHandler
Starts an XHTML document by setting up the namespace mappings when called for the first time.
startDocument() - Method in class org.apache.tika.sax.XMPContentHandler
Starts an XMP document by setting up the namespace mappings and writing out the following header:
startElement(String, String, String, Attributes) - Method in class org.apache.tika.detect.XmlRootExtractor.ExtractorHandler
 
startElement(String, String, String, Attributes) - Method in class org.apache.tika.fork.ContentHandlerProxy
 
startElement(String, String, String, Attributes) - Method in class org.apache.tika.fork.MetadataContentHandler
 
startElement(String, String, String, Attributes) - Method in class org.apache.tika.mime.MimeTypesReader
 
startElement(String, String, String, Attributes) - Method in class org.apache.tika.parser.iwork.KeynoteContentHandler
 
startElement(String, String, String, Attributes) - Method in class org.apache.tika.parser.iwork.NumbersContentHandler
 
startElement(String, String, String, Attributes) - Method in class org.apache.tika.parser.iwork.PagesContentHandler
 
startElement(String, String, String, Attributes) - Method in class org.apache.tika.parser.NetworkParser.MetaHandler
 
startElement(String, String, String, Attributes) - Method in class org.apache.tika.parser.xml.AttributeDependantMetadataHandler
 
startElement(String, String, String, Attributes) - Method in class org.apache.tika.parser.xml.AttributeMetadataHandler
 
startElement(String, String, String, Attributes) - Method in class org.apache.tika.parser.xml.ElementMetadataHandler
 
startElement(String, String, String, Attributes) - Method in class org.apache.tika.parser.xml.FictionBookParser.BinaryElementsDataHandler
 
startElement(String, String, String, Attributes) - Method in class org.apache.tika.parser.xml.MetadataHandler
Deprecated.
 
startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.ContentHandlerDecorator
 
startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.DIFContentHandler
 
startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.ElementMappingContentHandler
 
startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.ExpandedTitleContentHandler
 
startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.LinkContentHandler
 
startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.SafeContentHandler
 
startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.SecureContentHandler
 
startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.TeeContentHandler
 
startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.TextContentHandler
 
startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.ToXMLContentHandler
 
startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.XHTMLContentHandler
Starts the given element.
startElement(String) - Method in class org.apache.tika.sax.XHTMLContentHandler
 
startElement(String, String, String) - Method in class org.apache.tika.sax.XHTMLContentHandler
 
startElement(String, AttributesImpl) - Method in class org.apache.tika.sax.XHTMLContentHandler
 
startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.xpath.MatchingContentHandler
 
startPrefixMapping(String, String) - Method in class org.apache.tika.fork.ContentHandlerProxy
 
startPrefixMapping(String, String) - Method in class org.apache.tika.sax.ContentHandlerDecorator
 
startPrefixMapping(String, String) - Method in class org.apache.tika.sax.TeeContentHandler
 
startPrefixMapping(String, String) - Method in class org.apache.tika.sax.ToXMLContentHandler
 
STATE - Static variable in interface org.apache.tika.metadata.Photoshop
 
stop(BundleContext) - Method in class org.apache.tika.config.TikaActivator
 
stop() - Method in class org.apache.tika.mime.MimeTypesReader.ClauseRecord
 
stop(BundleContext) - Method in class org.apache.tika.parser.internal.Activator
 
storeFile - Variable in class org.apache.tika.utils.RereadableInputStream
File used to store the stream's contents; is null until the stored content's size exceeds maxBytesInMemory.
storeOutputStream - Variable in class org.apache.tika.utils.RereadableInputStream
OutputStream used to save the content of the input stream in a temporary file.
stream - Variable in class org.apache.tika.fork.InputStreamResource
 
stream - Variable in class org.apache.tika.io.LookaheadInputStream
 
stream - Variable in class org.apache.tika.parser.ParsingReader
The binary stream being parsed.
stream - Variable in class org.apache.tika.sax.SecureContentHandler
The input stream that Tika is parsing.
STRETCH_MODE - Static variable in interface org.apache.tika.metadata.XMPDM
"The audio stretch mode."
string - Variable in class org.apache.tika.mime.Magic
 
string - Variable in class org.apache.tika.mime.MediaType
Canonical string representation of this media type.
stringBuilder - Variable in class org.apache.tika.sax.PhoneExtractingContentHandler
 
StringOutput() - Constructor for class org.apache.tika.sax.SafeContentHandler.StringOutput
 
STRINGS_PRESENT - Static variable in class org.apache.tika.parser.strings.StringsParser
 
StringsConfig - Class in org.apache.tika.parser.strings
Configuration for the "strings" (or strings-alternative) command.
StringsConfig() - Constructor for class org.apache.tika.parser.strings.StringsConfig
Default contructor.
StringsConfig(InputStream) - Constructor for class org.apache.tika.parser.strings.StringsConfig
Loads properties from InputStream and then tries to close InputStream.
StringsEncoding - Enum in org.apache.tika.parser.strings
Character encoding of the strings that are to be found using the "strings" command.
StringsEncoding(char, String) - Constructor for enum org.apache.tika.parser.strings.StringsEncoding
 
StringsParser - Class in org.apache.tika.parser.strings
Parser that uses the "strings" (or strings-alternative) command to find the printable strings in a object, or other binary, file (application/octet-stream).
StringsParser() - Constructor for class org.apache.tika.parser.strings.StringsParser
 
stringsPath - Variable in class org.apache.tika.parser.strings.StringsConfig
 
stripTags(SyndContent) - Static method in class org.apache.tika.parser.feed.FeedParser
 
STX - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
SUB_CLASS_OF_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
 
SUB_CLASS_TYPE_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
 
subclauses - Variable in class org.apache.tika.mime.MimeTypesReader.ClauseRecord
 
SUBJECT - Static variable in interface org.apache.tika.metadata.DublinCore
The topic of the content of the resource.
SUBJECT - Static variable in class org.apache.tika.metadata.Metadata
Deprecated.
use TikaCoreProperties#KEYWORDS
SUBJECT - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLCore
The document's subject.
SUBJECT_CODE - Static variable in interface org.apache.tika.metadata.IPTC
Specifies one or more Subjects from the IPTC Subject-NewsCodes taxonomy to categorise the content.
SUBLOCATION - Static variable in interface org.apache.tika.metadata.IPTC
Name of a sublocation the content is focussing on -- either the location shown in visual media or referenced by text or audio media.
subSequence(int, int) - Method in class org.apache.tika.language.LanguageProfilerBuilder.QuickStringBuffer
 
SubtreeMatcher - Class in org.apache.tika.sax.xpath
Evaluation state of a ...//... XPath expression.
SubtreeMatcher(Matcher) - Constructor for class org.apache.tika.sax.xpath.SubtreeMatcher
 
SUPPLEMENTAL_CATEGORIES - Static variable in interface org.apache.tika.metadata.IPTC
Deprecated. 
SUPPLEMENTAL_CATEGORIES - Static variable in interface org.apache.tika.metadata.Photoshop
 
SUPPORTED_TYPES - Static variable in class org.apache.tika.parser.audio.AudioParser
 
SUPPORTED_TYPES - Static variable in class org.apache.tika.parser.audio.MidiParser
 
SUPPORTED_TYPES - Static variable in class org.apache.tika.parser.envi.EnviHeaderParser
 
SUPPORTED_TYPES - Static variable in class org.apache.tika.parser.epub.EpubParser
 
SUPPORTED_TYPES - Static variable in class org.apache.tika.parser.feed.FeedParser
 
SUPPORTED_TYPES - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
SUPPORTED_TYPES - Static variable in class org.apache.tika.parser.strings.Latin1StringsParser
The set of supported types
SUPPORTED_TYPES - Static variable in class org.apache.tika.parser.strings.StringsParser
 
SUPPORTED_TYPES - Static variable in class org.apache.tika.parser.video.FLVParser
 
SUPPORTED_TYPES - Static variable in class org.apache.tika.parser.xml.XMLParser
 
supportedEmbedTypes - Variable in class org.apache.tika.embedder.ExternalEmbedder
Media types supported by the external program.
supportedTypes - Variable in class org.apache.tika.parser.external.ExternalParser
Media types supported by the external program.
supportedTypes - Static variable in class org.apache.tika.parser.iwork.IWorkPackageParser
This parser handles all iWorks formats.
supportedTypes - Variable in class org.apache.tika.parser.NetworkParser
 
supportsComposite() - Method in class org.apache.tika.config.TikaConfig.DetectorXmlLoader
 
supportsComposite() - Method in class org.apache.tika.config.TikaConfig.ExecutorServiceXmlLoader
 
supportsComposite() - Method in class org.apache.tika.config.TikaConfig.ParserXmlLoader
 
supportsComposite() - Method in class org.apache.tika.config.TikaConfig.TranslatorXmlLoader
 
supportsComposite() - Method in class org.apache.tika.config.TikaConfig.XmlLoader
 
SYN - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 

T

TAB - Static variable in class org.apache.tika.sax.XHTMLContentHandler
The tab character gets inserted before table cells and list items.
TABLE_COUNT - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
TABLE_COUNT - Static variable in interface org.apache.tika.metadata.Office
The number of Tables in the document
TABLE_ID - Static variable in interface org.apache.tika.metadata.ClimateForcast
 
TABLE_NAME - Static variable in interface org.apache.tika.metadata.Database
 
tableData - Variable in class org.apache.tika.parser.iwork.PagesContentHandler
 
tableId - Variable in class org.apache.tika.parser.iwork.KeynoteContentHandler
 
TABSPACE - Static variable in class org.apache.tika.sax.DIFContentHandler
 
tag - Variable in class org.apache.tika.io.TaggedInputStream
The unique (serializable) tag of this stream.
tag - Variable in exception org.apache.tika.io.TaggedIOException
The object reference used to tag the exception.
tag - Variable in exception org.apache.tika.sax.TaggedSAXException
The object reference used to tag the exception.
tag - Variable in class org.apache.tika.sax.WriteOutContentHandler
The unique tag associated with exceptions from stream.
tag - Variable in exception org.apache.tika.sax.WriteOutContentHandler.WriteLimitReachedException
Serializable tag of the handler that caused this exception
TaggedContentHandler - Class in org.apache.tika.sax
A content handler decorator that tags potential exceptions so that the handler that caused the exception can easily be identified.
TaggedContentHandler(ContentHandler) - Constructor for class org.apache.tika.sax.TaggedContentHandler
Creates a tagging decorator for the given content handler.
TaggedInputStream - Class in org.apache.tika.io
An input stream decorator that tags potential exceptions so that the stream that caused the exception can easily be identified.
TaggedInputStream(InputStream) - Constructor for class org.apache.tika.io.TaggedInputStream
Creates a tagging decorator for the given input stream.
TaggedIOException - Exception in org.apache.tika.io
An IOException wrapper that tags the wrapped exception with a given object reference.
TaggedIOException(IOException, Object) - Constructor for exception org.apache.tika.io.TaggedIOException
Creates a tagged wrapper for the given exception.
TaggedSAXException - Exception in org.apache.tika.sax
A SAXException wrapper that tags the wrapped exception with a given object reference.
TaggedSAXException(SAXException, Object) - Constructor for exception org.apache.tika.sax.TaggedSAXException
Creates a tagged wrapper for the given exception.
tailBuffer - Variable in class org.apache.tika.io.TailStream
The buffer in which the tail data is stored.
tailSize - Variable in class org.apache.tika.io.TailStream
The size of the internal tail buffer.
TailStream - Class in org.apache.tika.io
A specialized input stream implementation which records the last portion read from an underlying stream.
TailStream(InputStream, int) - Constructor for class org.apache.tika.io.TailStream
Creates a new instance of TailStream.
TAPE_NAME - Static variable in interface org.apache.tika.metadata.XMPDM
"The name of the tape from which the clip was captured, as set during the capture process."
TargetElement(QName, Map<QName, QName>) - Constructor for class org.apache.tika.sax.ElementMappingContentHandler.TargetElement
Creates an TargetElement, attributes of this element will be mapped as specified
TargetElement(String, String, Map<QName, QName>) - Constructor for class org.apache.tika.sax.ElementMappingContentHandler.TargetElement
A shortcut that automatically creates the QName object
TargetElement(QName) - Constructor for class org.apache.tika.sax.ElementMappingContentHandler.TargetElement
Creates an TargetElement with no attributes, all attributes will be deleted from SAX stream
TargetElement(String, String) - Constructor for class org.apache.tika.sax.ElementMappingContentHandler.TargetElement
A shortcut that automatically creates the QName object
targetProperty - Variable in class org.apache.tika.parser.xml.ElementMetadataHandler
 
TB - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
TeeContentHandler - Class in org.apache.tika.sax
Content handler proxy that forwards the received SAX events to zero or more underlying content handlers.
TeeContentHandler(ContentHandler...) - Constructor for class org.apache.tika.sax.TeeContentHandler
 
tempFileDir - Variable in class org.apache.tika.io.TemporaryResources
Directory for temporary files, null for the system default.
TEMPLATE - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
TEMPLATE - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
 
TEMPO - Static variable in interface org.apache.tika.metadata.XMPDM
"The audio's tempo."
TemporaryResources - Class in org.apache.tika.io
Utility class for tracking and ultimately closing or otherwise disposing a collection of temporary resources.
TemporaryResources() - Constructor for class org.apache.tika.io.TemporaryResources
 
text(String) - Static method in class org.apache.tika.mime.MediaType
 
text(String) - Method in class org.apache.tika.parser.iwork.PagesContentHandler.Annotations
 
text(String) - Method in class org.apache.tika.parser.iwork.PagesContentHandler.Footnotes
 
text - Variable in class org.apache.tika.sax.Link
 
text - Variable in class org.apache.tika.sax.LinkBuilder
 
TEXT_HTML - Static variable in class org.apache.tika.mime.MediaType
 
TEXT_PLAIN - Static variable in class org.apache.tika.mime.MediaType
 
TextContentHandler - Class in org.apache.tika.sax
TextContentHandler(ContentHandler) - Constructor for class org.apache.tika.sax.TextContentHandler
 
TextContentHandler(ContentHandler, boolean) - Constructor for class org.apache.tika.sax.TextContentHandler
 
TextDetector - Class in org.apache.tika.detect
Content type detection of plain text documents.
TextDetector() - Constructor for class org.apache.tika.detect.TextDetector
Constructs a TextDetector which will look at the default number of bytes from the beginning of the document.
TextDetector(int) - Constructor for class org.apache.tika.detect.TextDetector
Constructs a TextDetector which will look at a given number of bytes from the beginning of the document.
TextMatcher - Class in org.apache.tika.sax.xpath
Final evaluation state of a .../text() XPath expression.
TextMatcher() - Constructor for class org.apache.tika.sax.xpath.TextMatcher
 
textMimeType - Variable in class org.apache.tika.mime.MimeTypes
Text type, text/plain.
TextStatistics - Class in org.apache.tika.detect
Utility class for computing a histogram of the bytes seen in a stream.
TextStatistics() - Constructor for class org.apache.tika.detect.TextStatistics
 
then - Variable in class org.apache.tika.sax.xpath.ChildMatcher
 
then - Variable in class org.apache.tika.sax.xpath.SubtreeMatcher
 
Theta1 - Variable in class org.apache.tika.detect.NNTrainedModel
 
Theta2 - Variable in class org.apache.tika.detect.NNTrainedModel
 
threshold - Variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
 
threshold(float) - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
 
threshold - Variable in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
 
threshold - Variable in class org.apache.tika.sax.SecureContentHandler
Output threshold.
THROW - Static variable in interface org.apache.tika.config.LoadErrorHandler
Strategy that throws a RuntimeException with the given throwable as the root cause, thus interrupting the entire service loading operation.
throwable - Variable in class org.apache.tika.parser.ParsingReader
An exception (if any) thrown by the parsing thread.
throwEofException - Variable in class org.apache.tika.io.NullInputStream
 
throwIfCauseOf(Exception) - Method in class org.apache.tika.io.TaggedInputStream
Re-throws the original exception thrown by this stream.
throwIfCauseOf(SAXException) - Method in class org.apache.tika.sax.SecureContentHandler
Converts the given SAXException to a corresponding TikaException if it's caused by this instance detecting a zip bomb.
throwIfCauseOf(Exception) - Method in class org.apache.tika.sax.TaggedContentHandler
Re-throws the original exception thrown by this handler.
THUMBNAIL - Static variable in interface org.apache.tika.metadata.RTFMetadata
if set to true, this means that an image file is probably a "thumbnail" any time a pict/emf/wmf is in an object
TIFF - Interface in org.apache.tika.metadata
XMP Exif TIFF schema.
tika - Variable in class org.apache.tika.filetypedetector.TikaFileTypeDetector
 
Tika - Class in org.apache.tika
Facade class for accessing Tika functionality.
Tika(Detector, Parser) - Constructor for class org.apache.tika.Tika
Creates a Tika facade using the given detector and parser instances, but the default Translator.
Tika(Detector, Parser, Translator) - Constructor for class org.apache.tika.Tika
Creates a Tika facade using the given detector, parser, and translator instances.
Tika(TikaConfig) - Constructor for class org.apache.tika.Tika
Creates a Tika facade using the given configuration.
Tika() - Constructor for class org.apache.tika.Tika
Creates a Tika facade using the default configuration.
Tika(Detector) - Constructor for class org.apache.tika.Tika
Creates a Tika facade using the given detector instance, the default parser configuration, and the default Translator.
TIKA_CONTENT - Static variable in class org.apache.tika.parser.RecursiveParserWrapper
 
TIKA_LINK_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
 
TIKA_META_EXCEPTION_PREFIX - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
Use this to store parse exception information in the Metadata object.
TIKA_META_PREFIX - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
Use this to prefix metadata properties that store information about the parsing process.
TIKA_MIME_FILE - Static variable in interface org.apache.tika.metadata.TikaMimeKeys
 
TIKA_UTI_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
 
TikaActivator - Class in org.apache.tika.config
Bundle activator that adjust the class loading mechanism of the ServiceLoader class to work correctly in an OSGi environment.
TikaActivator() - Constructor for class org.apache.tika.config.TikaActivator
 
TikaConfig - Class in org.apache.tika.config
Parse xml config file.
TikaConfig(String) - Constructor for class org.apache.tika.config.TikaConfig
 
TikaConfig(Path) - Constructor for class org.apache.tika.config.TikaConfig
 
TikaConfig(Path, ServiceLoader) - Constructor for class org.apache.tika.config.TikaConfig
 
TikaConfig(File) - Constructor for class org.apache.tika.config.TikaConfig
 
TikaConfig(File, ServiceLoader) - Constructor for class org.apache.tika.config.TikaConfig
 
TikaConfig(URL) - Constructor for class org.apache.tika.config.TikaConfig
 
TikaConfig(URL, ClassLoader) - Constructor for class org.apache.tika.config.TikaConfig
 
TikaConfig(URL, ServiceLoader) - Constructor for class org.apache.tika.config.TikaConfig
 
TikaConfig(InputStream) - Constructor for class org.apache.tika.config.TikaConfig
 
TikaConfig(Document) - Constructor for class org.apache.tika.config.TikaConfig
 
TikaConfig(Document, ServiceLoader) - Constructor for class org.apache.tika.config.TikaConfig
 
TikaConfig(Element) - Constructor for class org.apache.tika.config.TikaConfig
 
TikaConfig(Element, ClassLoader) - Constructor for class org.apache.tika.config.TikaConfig
 
TikaConfig(Element, ServiceLoader) - Constructor for class org.apache.tika.config.TikaConfig
 
TikaConfig(ClassLoader) - Constructor for class org.apache.tika.config.TikaConfig
Creates a Tika configuration from the built-in media type rules and all the Parser implementations available through the service provider mechanism in the given class loader.
TikaConfig() - Constructor for class org.apache.tika.config.TikaConfig
Creates a default Tika configuration.
TikaConfig.DetectorXmlLoader - Class in org.apache.tika.config
 
TikaConfig.ExecutorServiceXmlLoader - Class in org.apache.tika.config
 
TikaConfig.ParserXmlLoader - Class in org.apache.tika.config
 
TikaConfig.TranslatorXmlLoader - Class in org.apache.tika.config
 
TikaConfig.XmlLoader<CT,T> - Class in org.apache.tika.config
 
TikaCoreProperties - Interface in org.apache.tika.metadata
Contains a core set of basic Tika metadata properties, which all parsers will attempt to supply (where the file format permits).
TikaCoreProperties.EmbeddedResourceType - Enum in org.apache.tika.metadata
A file might contain different types of embedded documents.
TikaException - Exception in org.apache.tika.exception
Tika exception
TikaException(String) - Constructor for exception org.apache.tika.exception.TikaException
 
TikaException(String, Throwable) - Constructor for exception org.apache.tika.exception.TikaException
 
TikaFileTypeDetector - Class in org.apache.tika.filetypedetector
 
TikaFileTypeDetector() - Constructor for class org.apache.tika.filetypedetector.TikaFileTypeDetector
 
TikaInputStream - Class in org.apache.tika.io
Input stream with extended capabilities.
TikaInputStream(Path) - Constructor for class org.apache.tika.io.TikaInputStream
Creates a TikaInputStream instance.
TikaInputStream(File) - Constructor for class org.apache.tika.io.TikaInputStream
TikaInputStream(InputStream, TemporaryResources, long) - Constructor for class org.apache.tika.io.TikaInputStream
Creates a TikaInputStream instance.
TikaMetadataKeys - Interface in org.apache.tika.metadata
Contains keys to properties in Metadata instances.
TikaMimeKeys - Interface in org.apache.tika.metadata
A collection of Tika metadata keys used in Mime Type resolution
TIME_SIGNATURE - Static variable in interface org.apache.tika.metadata.XMPDM
"The time signature of the music."
timeout - Variable in class org.apache.tika.parser.strings.StringsConfig
 
TITLE - Static variable in interface org.apache.tika.metadata.DublinCore
A name given to the resource.
TITLE - Static variable in interface org.apache.tika.metadata.IPTC
A shorthand reference for the item.
TITLE - Static variable in class org.apache.tika.metadata.Metadata
Deprecated.
use TikaCoreProperties#TITLE
TITLE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 
title - Variable in class org.apache.tika.sax.Link
 
title - Variable in class org.apache.tika.sax.LinkBuilder
 
TITLE_TAG - Static variable in class org.apache.tika.sax.ExpandedTitleContentHandler
 
tmp - Variable in class org.apache.tika.embedder.ExternalEmbedder
 
tmp - Variable in class org.apache.tika.io.TikaInputStream
Tracker of temporary resources.
tmpPos - Variable in class org.apache.tika.parser.strings.Latin1StringsParser
The temporary position into the output buffer.
toByteArray(InputStream) - Static method in class org.apache.tika.io.IOUtils
Get the contents of an InputStream as a byte[].
toByteArray(Reader) - Static method in class org.apache.tika.io.IOUtils
Get the contents of a Reader as a byte[] using the default character encoding of the platform.
toByteArray(Reader, String) - Static method in class org.apache.tika.io.IOUtils
Get the contents of a Reader as a byte[] using the specified character encoding.
toByteArray(String) - Static method in class org.apache.tika.io.IOUtils
Deprecated.
Use String.getBytes()
toCharArray(InputStream) - Static method in class org.apache.tika.io.IOUtils
Get the contents of an InputStream as a character array using the default character encoding of the platform.
toCharArray(InputStream, String) - Static method in class org.apache.tika.io.IOUtils
Get the contents of an InputStream as a character array using the specified character encoding.
toCharArray(Reader) - Static method in class org.apache.tika.io.IOUtils
Get the contents of a Reader as a character array.
ToHTMLContentHandler - Class in org.apache.tika.sax
SAX event handler that serializes the HTML document to a character stream.
ToHTMLContentHandler(OutputStream, String) - Constructor for class org.apache.tika.sax.ToHTMLContentHandler
 
ToHTMLContentHandler() - Constructor for class org.apache.tika.sax.ToHTMLContentHandler
 
toInputStream(CharSequence) - Static method in class org.apache.tika.io.IOUtils
Convert the specified CharSequence to an input stream, encoded as bytes using the default character encoding of the platform.
toInputStream(CharSequence, String) - Static method in class org.apache.tika.io.IOUtils
Convert the specified CharSequence to an input stream, encoded as bytes using the specified character encoding.
toInputStream(String) - Static method in class org.apache.tika.io.IOUtils
Convert the specified string to an input stream, encoded as bytes using the default character encoding of the platform.
toInputStream(String, String) - Static method in class org.apache.tika.io.IOUtils
Convert the specified string to an input stream, encoded as bytes using the specified character encoding.
toJson(Metadata, Writer) - Static method in class org.apache.tika.metadata.serialization.JsonMetadata
Serializes a Metadata object to Json.
toJson(List<Metadata>, Writer) - Static method in class org.apache.tika.metadata.serialization.JsonMetadataList
Serializes a Metadata object to Json.
toString() - Method in class org.apache.tika.detect.MagicDetector
Returns a string representation of the Detection Rule.
toString() - Method in class org.apache.tika.io.CountingInputStream
 
toString(InputStream) - Static method in class org.apache.tika.io.IOUtils
Get the contents of an InputStream as a String using the default character encoding of the platform.
toString(InputStream, String) - Static method in class org.apache.tika.io.IOUtils
Get the contents of an InputStream as a String using the specified character encoding.
toString(Reader) - Static method in class org.apache.tika.io.IOUtils
Get the contents of a Reader as a String.
toString(byte[]) - Static method in class org.apache.tika.io.IOUtils
Deprecated.
Use String.String(byte[])
toString(byte[], String) - Static method in class org.apache.tika.io.IOUtils
Deprecated.
Use String.String(byte[],String)
toString() - Method in class org.apache.tika.io.TaggedInputStream
 
toString() - Method in class org.apache.tika.io.TikaInputStream
 
toString() - Method in class org.apache.tika.language.LanguageIdentifier
 
toString() - Method in class org.apache.tika.language.LanguageProfile.Counter
 
toString() - Method in class org.apache.tika.language.LanguageProfile.Interleaved.Entry
 
toString() - Method in class org.apache.tika.language.LanguageProfile
 
toString() - Method in class org.apache.tika.language.LanguageProfilerBuilder.NGramEntry
 
toString() - Method in class org.apache.tika.language.LanguageProfilerBuilder.QuickStringBuffer
 
toString() - Method in class org.apache.tika.language.LanguageProfilerBuilder
 
toString() - Method in class org.apache.tika.metadata.Metadata
 
toString() - Method in class org.apache.tika.mime.AndClause
 
toString() - Method in class org.apache.tika.mime.Magic
 
toString() - Method in class org.apache.tika.mime.MagicMatch
 
toString() - Method in class org.apache.tika.mime.MediaType
 
toString() - Method in class org.apache.tika.mime.MimeType.RootXML
 
toString() - Method in class org.apache.tika.mime.MimeType
Returns the name of this media type.
toString() - Method in class org.apache.tika.mime.OrClause
 
toString() - Method in enum org.apache.tika.parser.strings.StringsEncoding
 
toString() - Method in class org.apache.tika.sax.ContentHandlerDecorator
 
toString() - Method in class org.apache.tika.sax.DIFContentHandler
 
toString() - Method in class org.apache.tika.sax.Link
 
toString() - Method in class org.apache.tika.sax.SafeContentHandler.StringOutput
 
toString() - Method in class org.apache.tika.sax.TextContentHandler
 
toString() - Method in class org.apache.tika.sax.ToTextContentHandler
Returns the contents of the internal string buffer where all the received characters have been collected.
toString() - Method in class org.apache.tika.Tika
 
total - Variable in class org.apache.tika.detect.TextStatistics
 
TOTAL_TIME - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
TOTAL_TIME - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
 
ToTextContentHandler - Class in org.apache.tika.sax
SAX event handler that writes all character content out to a character stream.
ToTextContentHandler(Writer) - Constructor for class org.apache.tika.sax.ToTextContentHandler
Creates a content handler that writes character events to the given writer.
ToTextContentHandler(OutputStream) - Constructor for class org.apache.tika.sax.ToTextContentHandler
Creates a content handler that writes character events to the given output stream using the platform default encoding.
ToTextContentHandler(OutputStream, String) - Constructor for class org.apache.tika.sax.ToTextContentHandler
Creates a content handler that writes character events to the given output stream using the given encoding.
ToTextContentHandler() - Constructor for class org.apache.tika.sax.ToTextContentHandler
Creates a content handler that writes character events to an internal string buffer.
ToXMLContentHandler - Class in org.apache.tika.sax
SAX event handler that serializes the XML document to a character stream.
ToXMLContentHandler(OutputStream, String) - Constructor for class org.apache.tika.sax.ToXMLContentHandler
Creates an XML serializer that writes to the given byte stream using the given character encoding.
ToXMLContentHandler(String) - Constructor for class org.apache.tika.sax.ToXMLContentHandler
 
ToXMLContentHandler() - Constructor for class org.apache.tika.sax.ToXMLContentHandler
 
ToXMLContentHandler.ElementInfo - Class in org.apache.tika.sax
 
TRACK_NUMBER - Static variable in interface org.apache.tika.metadata.XMPDM
"A numeric value indicating the order of the audio file within its original recording."
TrainedModel - Class in org.apache.tika.detect
 
TrainedModel() - Constructor for class org.apache.tika.detect.TrainedModel
 
TrainedModelDetector - Class in org.apache.tika.detect
 
TrainedModelDetector() - Constructor for class org.apache.tika.detect.TrainedModelDetector
 
transformation - Variable in class org.apache.tika.parser.CryptoParser
 
TRANSITION_KEYWORDS_TO_DC_SUBJECT - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
Deprecated.
use TikaCoreProperties#KEYWORDS
TRANSITION_SUBJECT_TO_DC_DESCRIPTION - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
Deprecated.
use TikaCoreProperties#DESCRIPTION
TRANSITION_SUBJECT_TO_DC_TITLE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
Deprecated.
use TikaCoreProperties#TITLE
TRANSITION_SUBJECT_TO_OO_SUBJECT - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
Deprecated.
use OfficeOpenXMLCore#SUBJECT
translate(String, String, String) - Method in class org.apache.tika.language.translate.DefaultTranslator
Translate, using the first available service-loaded translator
translate(String, String) - Method in class org.apache.tika.language.translate.DefaultTranslator
Translate, using the first available service-loaded translator
translate(String, String, String) - Method in class org.apache.tika.language.translate.EmptyTranslator
 
translate(String, String) - Method in class org.apache.tika.language.translate.EmptyTranslator
 
translate(String, String, String) - Method in interface org.apache.tika.language.translate.Translator
Translate text between given languages.
translate(String, String) - Method in interface org.apache.tika.language.translate.Translator
Translate text to the given language.
translate(String, String, String) - Method in class org.apache.tika.Tika
Translate the given text String to and from the given languages.
translate(String, String) - Method in class org.apache.tika.Tika
Translate the given text String to the given language, attempting to auto-detect the source language.
translate(InputStream, String, String) - Method in class org.apache.tika.Tika
Translate the given text InputStream to and from the given languages.
translate(InputStream, String) - Method in class org.apache.tika.Tika
Translate the given text InputStream to the given language, attempting to auto-detect the source language.
translator - Variable in class org.apache.tika.config.TikaConfig
 
Translator - Interface in org.apache.tika.language.translate
Interface for Translator services.
translator - Variable in class org.apache.tika.Tika
The Translator instance used by this facade.
TranslatorXmlLoader() - Constructor for class org.apache.tika.config.TikaConfig.TranslatorXmlLoader
 
TRANSMISSION_REFERENCE - Static variable in interface org.apache.tika.metadata.Photoshop
 
treeStack - Variable in class org.apache.tika.sax.DIFContentHandler
 
trimMessage(String) - Static method in class org.apache.tika.utils.ExceptionUtils
Utility method to trim the message from a stack trace string.
type - Variable in class org.apache.tika.detect.MagicDetector
The matching media type.
type - Variable in class org.apache.tika.detect.NNTrainedModelBuilder
 
TYPE - Static variable in interface org.apache.tika.metadata.DublinCore
The nature or genre of the content of the resource.
TYPE - Static variable in class org.apache.tika.metadata.Metadata
Deprecated.
use TikaCoreProperties#TYPE
TYPE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
 
type - Variable in class org.apache.tika.mime.Magic
 
type - Variable in class org.apache.tika.mime.MagicMatch
 
type - Variable in class org.apache.tika.mime.MimeType.RootXML
 
type - Variable in class org.apache.tika.mime.MimeType
The normalized media type name.
type - Variable in class org.apache.tika.mime.MimeTypesReader
Current type
TYPE - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
type - Variable in enum org.apache.tika.parser.iwork.IWorkPackageParser.IWORKDocumentType
 
type - Variable in class org.apache.tika.parser.iwork.PagesContentHandler.HeaderFooter
 
type - Variable in class org.apache.tika.sax.BasicContentHandlerFactory
 
type - Variable in class org.apache.tika.sax.Link
 
type - Variable in class org.apache.tika.sax.LinkBuilder
 
TYPE_METADATA - Static variable in class org.apache.tika.parser.video.FLVParser
 
TYPE_PATTERN - Static variable in class org.apache.tika.mime.MediaType
 
TypeDetector - Class in org.apache.tika.detect
Content type detection based on a content type hint.
TypeDetector() - Constructor for class org.apache.tika.detect.TypeDetector
 
types - Variable in class org.apache.tika.mime.MimeTypes
All the registered MimeTypes indexed on their canonical names
types - Variable in class org.apache.tika.mime.MimeTypesReader
 
types - Variable in class org.apache.tika.parser.CryptoParser
 
typesFromDomElement(Element) - Static method in class org.apache.tika.config.TikaConfig
 

U

ubyteToInt(byte) - Static method in class org.apache.tika.io.EndianUtils
Convert an 'unsigned' byte to an integer.
union(Map<String, String>, Map<String, String>) - Static method in class org.apache.tika.mime.MediaType
 
unknownCount - Variable in class org.apache.tika.parser.RecursiveParserWrapper
 
unmodifiableSet(String...) - Static method in class org.apache.tika.sax.XHTMLContentHandler
 
unquote(String) - Static method in class org.apache.tika.mime.MediaType
Fuzzy unquoting mechanism that works also with somewhat malformed quotes.
update(int) - Method in class org.apache.tika.fork.ForkServer
 
update(byte[], int, int) - Method in class org.apache.tika.fork.ForkServer
 
update(int) - Method in class org.apache.tika.language.LanguageProfile.Interleaved.Entry
 
update() - Method in class org.apache.tika.language.LanguageProfile.Interleaved
Ensure that the entries array is in sync with the ngrams.
updateInterleaved() - Method in class org.apache.tika.language.LanguageProfile
 
URGENCY - Static variable in interface org.apache.tika.metadata.IPTC
Deprecated. 
URGENCY - Static variable in interface org.apache.tika.metadata.Photoshop
 
uri - Variable in class org.apache.tika.parser.NetworkParser
 
uri - Variable in class org.apache.tika.parser.xml.AttributeMetadataHandler
 
uri - Variable in class org.apache.tika.parser.xml.ElementMetadataHandler
 
uri - Variable in class org.apache.tika.sax.Link
 
uri - Variable in class org.apache.tika.sax.LinkBuilder
 
uri - Variable in class org.apache.tika.sax.XMPContentHandler
 
URI_RDF - Static variable in class org.apache.tika.parser.xml.ElementMetadataHandler
 
url - Variable in class org.apache.tika.fork.MemoryURLStreamRecord
 
USAGE_TERMS - Static variable in interface org.apache.tika.metadata.XMPRights
A word or short phrase that identifies a resource as a member of a userdefined collection.
useFrameset - Variable in class org.apache.tika.sax.XHTMLContentHandler
 
useInterleaved - Static variable in class org.apache.tika.language.LanguageProfile
 
USER_DEFINED_METADATA_NAME_PREFIX - Static variable in interface org.apache.tika.metadata.MSOffice
For user defined metadata entries in the document, what prefix should be attached to the key names.
USER_DEFINED_METADATA_NAME_PREFIX - Static variable in interface org.apache.tika.metadata.Office
For user defined metadata entries in the document, what prefix should be attached to the key names.
UTC - Static variable in class org.apache.tika.utils.DateUtils
The UTC time zone.
UTF_8 - Static variable in class org.apache.tika.io.IOUtils
 
uti - Variable in class org.apache.tika.mime.MimeType
The http://en.wikipedia.org/wiki/Uniform_Type_Identifier

V

VALID_CHARS - Static variable in class org.apache.tika.mime.MediaType
See http://www.ietf.org/rfc/rfc2045.txt for valid mime-type characters.
value - Variable in class org.apache.tika.language.LanguageProfilerBuilder.QuickStringBuffer
 
value - Variable in class org.apache.tika.mime.MagicMatch
 
value - Variable in enum org.apache.tika.parser.strings.StringsEncoding
 
valueOf(String) - Static method in enum org.apache.tika.metadata.Property.PropertyType
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.tika.metadata.Property.ValueType
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.tika.metadata.TikaCoreProperties.EmbeddedResourceType
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.tika.parser.iwork.IWorkPackageParser.IWORKDocumentType
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.tika.parser.iwork.PagesContentHandler.DocumentPart
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.tika.parser.strings.StringsEncoding
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.tika.parser.utils.CommonsDigester.DigestAlgorithm
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.tika.sax.BasicContentHandlerFactory.HANDLER_TYPE
Returns the enum constant of this type with the specified name.
values() - Static method in enum org.apache.tika.metadata.Property.PropertyType
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.tika.metadata.Property.ValueType
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.tika.metadata.TikaCoreProperties.EmbeddedResourceType
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.tika.parser.iwork.IWorkPackageParser.IWORKDocumentType
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.tika.parser.iwork.PagesContentHandler.DocumentPart
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.tika.parser.strings.StringsEncoding
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.tika.parser.utils.CommonsDigester.DigestAlgorithm
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.tika.sax.BasicContentHandlerFactory.HANDLER_TYPE
Returns an array containing the constants of this enum type, in the order they are declared.
valueType - Variable in class org.apache.tika.metadata.Property
 
ValueType() - Constructor for enum org.apache.tika.metadata.Property.ValueType
 
VERSION - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
VERSION - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLCore
The version number.
video(String) - Static method in class org.apache.tika.mime.MediaType
 
VIDEO_ALPHA_MODE - Static variable in interface org.apache.tika.metadata.XMPDM
"The alpha mode."
VIDEO_ALPHA_UNITY_IS_TRANSPARENT - Static variable in interface org.apache.tika.metadata.XMPDM
"When true, unity is clear, when false, it is opaque."
VIDEO_COLOR_SPACE - Static variable in interface org.apache.tika.metadata.XMPDM
"The color space."
VIDEO_COMPRESSOR - Static variable in interface org.apache.tika.metadata.XMPDM
"Video compression used.
VIDEO_FIELD_ORDER - Static variable in interface org.apache.tika.metadata.XMPDM
"The field order for video."
VIDEO_FRAME_RATE - Static variable in interface org.apache.tika.metadata.XMPDM
"The video frame rate."
VIDEO_MOD_DATE - Static variable in interface org.apache.tika.metadata.XMPDM
"The date and time when the video was last modified."
VIDEO_PIXEL_ASPECT_RATIO - Static variable in interface org.apache.tika.metadata.XMPDM
"The aspect ratio, expressed as wd/ht.
VIDEO_PIXEL_DEPTH - Static variable in interface org.apache.tika.metadata.XMPDM
"The size in bits of each color component of a pixel.

W

waitForResponse(List<ForkResource>) - Method in class org.apache.tika.fork.ForkClient
 
waitForStartBeacon() - Method in class org.apache.tika.fork.ForkClient
 
WARN - Static variable in interface org.apache.tika.config.LoadErrorHandler
Strategy that logs warnings of all problems using a Logger created using the given class name.
WEB_STATEMENT - Static variable in interface org.apache.tika.metadata.XMPRights
A Web URL for a statement of the ownership and usage rights for this resource.
WHITESPACE - Static variable in class org.apache.tika.config.ServiceLoader
 
WIN_NAME_PATTERN - Static variable in class org.apache.tika.utils.CharsetUtils
 
withFallbacks(Collection<? extends Parser>, Set<MediaType>) - Static method in class org.apache.tika.parser.ParserDecorator
Deprecated.
Do not use until the TODOs are resolved, see TIKA-1509
withoutTypes(Parser, Set<MediaType>) - Static method in class org.apache.tika.parser.ParserDecorator
Decorates the given parser so that it never claims to support parsing of the given media types, but will work for all others.
withTypes(Parser, Set<MediaType>) - Static method in class org.apache.tika.parser.ParserDecorator
Decorates the given parser so that it always claims to support parsing of the given media types.
word - Variable in class org.apache.tika.language.LanguageProfilerBuilder
A StringBuffer used during analysis
WORD_COUNT - Static variable in interface org.apache.tika.metadata.MSOffice
Deprecated.
WORD_COUNT - Static variable in interface org.apache.tika.metadata.Office
The number of Words in the document
WORD_PROCESSING_NAMESPACE_URI - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
 
WORD_PROCESSING_PREFIX - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
 
WORK_TYPE - Static variable in interface org.apache.tika.metadata.CreativeCommons
 
wrappedParser - Variable in class org.apache.tika.parser.RecursiveParserWrapper
 
write(byte[], OutputStream) - Static method in class org.apache.tika.io.IOUtils
Writes bytes from a byte[] to an OutputStream.
write(byte[], Writer) - Static method in class org.apache.tika.io.IOUtils
Writes bytes from a byte[] to chars on a Writer using the default character encoding of the platform.
write(byte[], Writer, String) - Static method in class org.apache.tika.io.IOUtils
Writes bytes from a byte[] to chars on a Writer using the specified character encoding.
write(char[], Writer) - Static method in class org.apache.tika.io.IOUtils
Writes chars from a char[] to a Writer using the default character encoding of the platform.
write(char[], OutputStream) - Static method in class org.apache.tika.io.IOUtils
Writes chars from a char[] to bytes on an OutputStream.
write(char[], OutputStream, String) - Static method in class org.apache.tika.io.IOUtils
Writes chars from a char[] to bytes on an OutputStream using the specified character encoding.
write(CharSequence, Writer) - Static method in class org.apache.tika.io.IOUtils
Writes chars from a CharSequence to a Writer.
write(CharSequence, OutputStream) - Static method in class org.apache.tika.io.IOUtils
Writes chars from a CharSequence to bytes on an OutputStream using the default character encoding of the platform.
write(CharSequence, OutputStream, String) - Static method in class org.apache.tika.io.IOUtils
Writes chars from a CharSequence to bytes on an OutputStream using the specified character encoding.
write(String, Writer) - Static method in class org.apache.tika.io.IOUtils
Writes chars from a String to a Writer.
write(String, OutputStream) - Static method in class org.apache.tika.io.IOUtils
Writes chars from a String to bytes on an OutputStream using the default character encoding of the platform.
write(String, OutputStream, String) - Static method in class org.apache.tika.io.IOUtils
Writes chars from a String to bytes on an OutputStream using the specified character encoding.
write(StringBuffer, Writer) - Static method in class org.apache.tika.io.IOUtils
Deprecated.
replaced by write(CharSequence, Writer)
write(StringBuffer, OutputStream) - Static method in class org.apache.tika.io.IOUtils
Deprecated.
replaced by write(CharSequence, OutputStream)
write(StringBuffer, OutputStream, String) - Static method in class org.apache.tika.io.IOUtils
Deprecated.
replaced by write(CharSequence, OutputStream, String)
write(byte[], int, int) - Method in class org.apache.tika.io.NullOutputStream
Does nothing - output to /dev/null.
write(int) - Method in class org.apache.tika.io.NullOutputStream
Does nothing - output to /dev/null.
write(byte[]) - Method in class org.apache.tika.io.NullOutputStream
Does nothing - output to /dev/null.
write(char[], int, int) - Method in class org.apache.tika.language.ProfilingWriter
 
write(char[], int, int) - Method in interface org.apache.tika.sax.SafeContentHandler.Output
 
write(char[], int, int) - Method in class org.apache.tika.sax.SafeContentHandler.StringOutput
 
write(char) - Method in class org.apache.tika.sax.ToXMLContentHandler
Writes the given character as-is.
write(String) - Method in class org.apache.tika.sax.ToXMLContentHandler
Writes the given string of character as-is.
WRITE_LIMIT_REACHED - Static variable in class org.apache.tika.parser.RecursiveParserWrapper
 
writeAndCloseStream(DataOutputStream, InputStream) - Method in class org.apache.tika.fork.ClassLoaderResource
Sends the contents of the given input stream to the given output.
writeCharsAndEntity(char[], int, int, String) - Method in class org.apache.tika.sax.ToXMLContentHandler
Writes the given characters as-is followed by the given entity.
writeCount - Variable in class org.apache.tika.sax.WriteOutContentHandler
Number of characters written so far.
writeEscaped(char[], int, int, boolean) - Method in class org.apache.tika.sax.ToXMLContentHandler
Writes the given characters with XML meta characters escaped.
writeHisto(float[]) - Method in class org.apache.tika.detect.TrainedModelDetector
for testing purposes; this method write the histogram vector to a file.
writeLimit - Variable in class org.apache.tika.sax.BasicContentHandlerFactory
 
writeLimit - Variable in class org.apache.tika.sax.WriteOutContentHandler
The maximum number of characters to write to the character stream.
WriteLimitReachedException(String, Serializable) - Constructor for exception org.apache.tika.sax.WriteOutContentHandler.WriteLimitReachedException
 
WriteOutContentHandler - Class in org.apache.tika.sax
SAX event handler that writes content up to an optional write limit out to a character stream or other decorated handler.
WriteOutContentHandler(ContentHandler, int) - Constructor for class org.apache.tika.sax.WriteOutContentHandler
Creates a content handler that writes content up to the given write limit to the given content handler.
WriteOutContentHandler(Writer, int) - Constructor for class org.apache.tika.sax.WriteOutContentHandler
Creates a content handler that writes content up to the given write limit to the given character stream.
WriteOutContentHandler(Writer) - Constructor for class org.apache.tika.sax.WriteOutContentHandler
Creates a content handler that writes character events to the given writer.
WriteOutContentHandler(OutputStream) - Constructor for class org.apache.tika.sax.WriteOutContentHandler
Creates a content handler that writes character events to the given output stream using the default encoding.
WriteOutContentHandler(int) - Constructor for class org.apache.tika.sax.WriteOutContentHandler
Creates a content handler that writes character events to an internal string buffer.
WriteOutContentHandler() - Constructor for class org.apache.tika.sax.WriteOutContentHandler
Creates a content handler that writes character events to an internal string buffer.
WriteOutContentHandler.WriteLimitReachedException - Exception in org.apache.tika.sax
The exception used as a signal when the write limit has been reached.
writer - Variable in class org.apache.tika.language.ProfilingHandler
 
writer - Variable in class org.apache.tika.parser.ParsingReader
Write end of the pipe.
writer - Variable in class org.apache.tika.sax.ToTextContentHandler
The character stream.
writeReplacement(SafeContentHandler.Output) - Method in class org.apache.tika.sax.SafeContentHandler
Outputs the replacement for an invalid character.

X

xhtml - Variable in class org.apache.tika.parser.iwork.KeynoteContentHandler
 
xhtml - Variable in class org.apache.tika.parser.iwork.NumbersContentHandler
 
xhtml - Variable in class org.apache.tika.parser.iwork.PagesContentHandler
 
xhtml - Variable in class org.apache.tika.parser.strings.Latin1StringsParser
The output content handler.
XHTML - Static variable in class org.apache.tika.sax.XHTMLContentHandler
The XHTML namespace URI
XHTMLContentHandler - Class in org.apache.tika.sax
Content handler decorator that simplifies the task of producing XHTML events for Tika content parsers.
XHTMLContentHandler(ContentHandler, Metadata) - Constructor for class org.apache.tika.sax.XHTMLContentHandler
 
XML - Static variable in class org.apache.tika.mime.MimeTypes
Name of the xml type, application/xml.
XmlLoader() - Constructor for class org.apache.tika.config.TikaConfig.XmlLoader
 
xmlMimeType - Variable in class org.apache.tika.mime.MimeTypes
 
XMLParser - Class in org.apache.tika.parser.xml
XML parser.
XMLParser() - Constructor for class org.apache.tika.parser.xml.XMLParser
 
XmlRootExtractor - Class in org.apache.tika.detect
Utility class that uses a SAXParser to determine the namespace URI and local name of the root element of an XML file.
XmlRootExtractor() - Constructor for class org.apache.tika.detect.XmlRootExtractor
 
XmlRootExtractor.ExtractorHandler - Class in org.apache.tika.detect
 
xmls - Variable in class org.apache.tika.mime.MimeTypes
Sorted list of all registered rootXML
XMP - Interface in org.apache.tika.metadata
 
XMP - Static variable in class org.apache.tika.sax.XMPContentHandler
The XMP namespace URI
XMPContentHandler - Class in org.apache.tika.sax
Content handler decorator that simplifies the task of producing XMP output.
XMPContentHandler(ContentHandler) - Constructor for class org.apache.tika.sax.XMPContentHandler
 
XMPDM - Interface in org.apache.tika.metadata
XMP Dynamic Media schema.
XMPDM.ChannelTypePropertyConverter - Class in org.apache.tika.metadata
Deprecated.
Experimental method, will change shortly
XMPIdq - Interface in org.apache.tika.metadata
 
XMPMM - Interface in org.apache.tika.metadata
 
XMPRights - Interface in org.apache.tika.metadata
XMP Rights management schema.
XPathParser - Class in org.apache.tika.sax.xpath
Parser for a very simple XPath subset.
XPathParser() - Constructor for class org.apache.tika.sax.xpath.XPathParser
 
XPathParser(String, String) - Constructor for class org.apache.tika.sax.xpath.XPathParser
 
XQ - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 
XS - Static variable in class org.apache.tika.parser.iptc.IptcAnpaParser
 

_

_COLOR_MODE_CHOICES_INDEXED - Static variable in interface org.apache.tika.metadata.Photoshop
 
_getValues(String) - Method in class org.apache.tika.metadata.Metadata
 
A B C D E F G H I J K L M N O P Q R S T U V W X _ 
Skip navigation links