Package org.apache.pdfbox.pdfparser
Class PDFStreamParser
- java.lang.Object
-
- org.apache.pdfbox.pdfparser.BaseParser
-
- org.apache.pdfbox.pdfparser.PDFStreamParser
-
public class PDFStreamParser extends BaseParser
This will parse a PDF byte stream and extract operands and such.- Author:
- Ben Litchfield
-
-
Field Summary
-
Fields inherited from class org.apache.pdfbox.pdfparser.BaseParser
A, ASCII_CR, ASCII_LF, B, D, DEF, document, E, ENDOBJ_STRING, ENDSTREAM_STRING, J, M, N, O, R, S, seqSource, STREAM_STRING, T
-
-
Constructor Summary
Constructors Constructor Description PDFStreamParser(byte[] bytes)
Constructor.PDFStreamParser(PDContentStream contentStream)
Constructor.PDFStreamParser(COSStream stream)
Deprecated.UsePDFStreamParser(PDContentStream)
instead.PDFStreamParser(PDStream stream)
Deprecated.UsePDFStreamParser(PDContentStream)
instead.
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description java.util.List<java.lang.Object>
getTokens()
This will get the tokens that were parsed from the stream by theparse()
method.void
parse()
This will parse all the tokens in the stream.java.lang.Object
parseNextToken()
This will parse the next token in the stream.protected java.lang.String
readOperator()
This will read an operator from the stream.-
Methods inherited from class org.apache.pdfbox.pdfparser.BaseParser
isClosing, isClosing, isDigit, isDigit, isEndOfName, isEOL, isEOL, isSpace, isSpace, isWhitespace, isWhitespace, parseBoolean, parseCOSArray, parseCOSDictionary, parseCOSName, parseCOSString, parseDirObject, readExpectedChar, readExpectedString, readExpectedString, readGenerationNumber, readInt, readLine, readLong, readObjectNumber, readString, readString, readStringNumber, skipSpaces, skipWhiteSpaces
-
-
-
-
Constructor Detail
-
PDFStreamParser
@Deprecated public PDFStreamParser(PDStream stream) throws java.io.IOException
Deprecated.UsePDFStreamParser(PDContentStream)
instead.Constructor.- Parameters:
stream
- The stream to parse.- Throws:
java.io.IOException
- If there is an error initializing the stream.
-
PDFStreamParser
@Deprecated public PDFStreamParser(COSStream stream) throws java.io.IOException
Deprecated.UsePDFStreamParser(PDContentStream)
instead.Constructor.- Parameters:
stream
- The stream to parse.- Throws:
java.io.IOException
- If there is an error initializing the stream.
-
PDFStreamParser
public PDFStreamParser(PDContentStream contentStream) throws java.io.IOException
Constructor.- Parameters:
contentStream
- The content stream to parse.- Throws:
java.io.IOException
- If there is an error initializing the stream.
-
PDFStreamParser
public PDFStreamParser(byte[] bytes) throws java.io.IOException
Constructor.- Parameters:
bytes
- the bytes to parse.- Throws:
java.io.IOException
- If there is an error initializing the stream.
-
-
Method Detail
-
parse
public void parse() throws java.io.IOException
This will parse all the tokens in the stream. This will close the stream when it is finished parsing. You can then access these withgetTokens()
.- Throws:
java.io.IOException
- If there is an error while parsing the stream.
-
getTokens
public java.util.List<java.lang.Object> getTokens()
This will get the tokens that were parsed from the stream by theparse()
method.- Returns:
- All of the tokens in the stream.
-
parseNextToken
public java.lang.Object parseNextToken() throws java.io.IOException
This will parse the next token in the stream.- Returns:
- The next token in the stream or null if there are no more tokens in the stream.
- Throws:
java.io.IOException
- If an io error occurs while parsing the stream.
-
readOperator
protected java.lang.String readOperator() throws java.io.IOException
This will read an operator from the stream.- Returns:
- The operator that was read from the stream.
- Throws:
java.io.IOException
- If there is an error reading from the stream.
-
-