java.lang.Object
org.jcodings.Encoding
org.jcodings.MultiByteEncoding
org.jcodings.unicode.UnicodeEncoding
org.jcodings.specific.UTF8Encoding
public final class UTF8Encoding
Field Summary | |
---|---|
protected static CaseFoldCodeItem[] | EMPTY_FOLD_CODES |
static UTF8Encoding | INSTANCE |
Fields inherited from class org.jcodings.MultiByteEncoding |
---|
A, EncLen, F, Trans, TransZero |
Fields inherited from class org.jcodings.Encoding |
---|
CHAR_INVALID, hashCode, isAsciiCompatible, isDummy, isFixedWidth, isSingleByte, maxLength, minLength, name, NEW_LINE |
Constructor Summary | |
---|---|
protected | UTF8Encoding() |
Method Summary | |
---|---|
protected void | asciiApplyAllCaseFold(int flag, ApplyAllCaseFoldFunction fun, Object arg) |
protected CaseFoldCodeItem[] | asciiCaseFoldCodesByString(int flag, byte[] bytes, int p, int end) |
protected int | asciiMbcCaseFold(int flag, byte[] bytes, IntHolder pp, int end, byte[] lower) |
int | codeToMbc(int code, byte[] bytes, int p): Extracts a code point into its multibyte representation. |
int | codeToMbcLength(int code): Returns the character length for a given code point. Oniguruma equivalent: code_to_mbclen |
int[] | ctypeCodeRange(int ctype, IntHolder sbOut): utf8_get_ctype_code_range |
protected boolean | isCodeCTypeInternal(int code, int ctype): ONIGENC_IS_XXXXXX_CODE_CTYPE |
boolean | isNewLine(byte[] bytes, int p, int end): onigenc_is_mbc_newline_0x0a / used also by multibyte encodings |
boolean | isReverseMatchAllowed(byte[] bytes, int p, int end): onigenc_always_true_is_allowed_reverse_match |
int | leftAdjustCharHead(byte[] bytes, int p, int s, int end): utf8_left_adjust_char_head |
int | length(byte[] bytes, int p, int end): Returns the character length given the stream, the character position and the stream end. Returns 1 for singlebyte encodings; for multibyte encodings, performs sanity validations and returns the character length, or otherwise the missing characters in the stream. |
int | mbcCaseFold(int flag, byte[] bytes, IntHolder pp, int end, byte[] fold): onigenc_ascii_mbc_case_fold |
int | mbcToCode(byte[] bytes, int p, int end): Returns the code point for a character. Oniguruma equivalent: mbc_to_code |
Methods inherited from class org.jcodings.unicode.UnicodeEncoding |
---|
applyAllCaseFold, caseFoldCodesByString, ctypeCodeRange, isCodeCType, propertyNameToCType |
Methods inherited from class org.jcodings.MultiByteEncoding |
---|
length, mb2CodeToMbc, mb2CodeToMbcLength, mb2IsCodeCType, mb4CodeToMbc, mb4CodeToMbcLength, mb4IsCodeCType, mbnMbcCaseFold, mbnMbcToCode, missing, missing, safeLengthForUptoFour, safeLengthForUptoThree, safeLengthForUptoTwo, strCodeAt, strLength |
Methods inherited from class org.jcodings.Encoding |
---|
asciiToLower, asciiToUpper, digitVal, equals, getIndex, getName, hashCode, isAlnum, isAlpha, isAscii, isAscii, isAsciiCompatible, isBlank, isCntrl, isDigit, isDummy, isFixedWidth, isGraph, isLower, isMbcAscii, isMbcCrnl, isMbcHead, isMbcWord, isNewLine, isPrint, isPunct, isSbWord, isSingleByte, isSpace, isUpper, isWord, isWordGraphPrint, isXDigit, load, maxLength, maxLengthDistance, mbcodeStartPosition, minLength, odigitVal, prevCharHead, replicate, rightAdjustCharHead, rightAdjustCharHeadWithPrev, setName, setName, step, stepBack, strByteLengthNull, strLengthNull, strNCmp, toLowerCaseTable, toString, xdigitVal |
Methods inherited from class java.lang.Object |
---|
clone, finalize, getClass, notify, notifyAll, wait, wait, wait |
Field Detail |
---|
public static final UTF8Encoding INSTANCE
protected static final CaseFoldCodeItem[] EMPTY_FOLD_CODES
Constructor Detail |
---|
protected UTF8Encoding()
Method Detail |
---|
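public int length(byte[] bytes, int p, int end)
Description copied from class Encoding: Returns the character length given the stream, the character position and the stream end. Returns 1 for singlebyte encodings; for multibyte encodings, performs sanity validations and returns the character length, or otherwise the missing characters in the stream.
length in class Encoding
Oniguruma equivalent: mbc_enc_len, modified for 1.9 purposes
public boolean isNewLine(byte[] bytes, int p, int end)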
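The validation that `length` describes can be illustrated with a minimal, self-contained sketch. It is not the jcodings implementation: the class name `Utf8LengthSketch`, the helper `expectedLength`, and the return convention (0 for an invalid sequence, `-n` when `n` bytes are missing before `end`) are assumptions made for illustration only.

```java
// Illustrative sketch only; names and return conventions are hypothetical,
// not the jcodings API.
final class Utf8LengthSketch {
    /** Expected sequence length for a lead byte, or 0 if the byte cannot start a character. */
    static int expectedLength(byte lead) {
        int b = lead & 0xFF;
        if (b < 0x80) return 1;                // 0xxxxxxx: ASCII
        if (b >= 0xC2 && b <= 0xDF) return 2;  // 110xxxxx (0xC0 and 0xC1 would be overlong)
        if (b >= 0xE0 && b <= 0xEF) return 3;  // 1110xxxx
        if (b >= 0xF0 && b <= 0xF4) return 4;  // 11110xxx, up to U+10FFFF
        return 0;                              // continuation byte or invalid lead
    }

    /**
     * Returns the character length at p, 0 if the bytes are not a valid start
     * of a character, or -n if n more bytes would be needed before end.
     * Second-byte range checks (overlong forms, surrogates) are omitted here.
     */
    static int length(byte[] bytes, int p, int end) {
        if (p >= end) return 0;
        int need = expectedLength(bytes[p]);
        if (need == 0) return 0;
        int available = end - p;
        int have = Math.min(need, available);
        for (int i = 1; i < have; i++) {
            // Every trailing byte must look like 10xxxxxx.
            if ((bytes[p + i] & 0xC0) != 0x80) return 0;
        }
        return available >= need ? need : -(need - available);
    }

    public static void main(String[] args) {
        byte[] euro = {(byte) 0xE2, (byte) 0x82, (byte) 0xAC}; // U+20AC
        System.out.println(length(euro, 0, 3)); // complete character
        System.out.println(length(euro, 0, 2)); // one byte missing
    }
}
```

Signalling "missing bytes" with a negative value keeps the result a single `int`, which is one plausible way to distinguish a truncated character from a malformed one.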
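public int codeToMbcLength(int code)
Description copied from class Encoding: Returns the character length for a given code point. Oniguruma equivalent: code_to_mbclen
codeToMbcLength in class Encoding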
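As a rough illustration of what "character length for a code point" means in UTF-8, the following sketch maps a code point to its encoded byte count. The class `Utf8CodeLengthSketch` and the `-1` error value are hypothetical; the real `codeToMbcLength` may treat out-of-range or special values differently.

```java
// Illustrative sketch only; not the jcodings implementation.
final class Utf8CodeLengthSketch {
    /** Returns the number of UTF-8 bytes for a code point, or -1 if it is not encodable. */
    static int codeToMbcLength(int code) {
        if (code < 0 || code > 0x10FFFF) return -1;       // outside the Unicode range
        if (code >= 0xD800 && code <= 0xDFFF) return -1;  // surrogates are not scalar values
        if (code <= 0x7F) return 1;    // 7 bits fit in one byte
        if (code <= 0x7FF) return 2;   // 11 bits
        if (code <= 0xFFFF) return 3;  // 16 bits
        return 4;                      // 21 bits
    }

    public static void main(String[] args) {
        System.out.println(codeToMbcLength(0x41));    // prints 1
        System.out.println(codeToMbcLength(0x20AC));  // prints 3
        System.out.println(codeToMbcLength(0x1F600)); // prints 4
    }
}
```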
public int mbcToCode(byte[] bytes, int p, int end)
Description copied from class Encoding: Returns the code point for a character. Oniguruma equivalent: mbc_to_code
mbcToCode in class Encoding
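Decoding a multibyte character to a code point can be sketched as follows. This is a simplified stand-in, not the library's code: the class `Utf8DecodeSketch` is hypothetical, and throwing `IllegalArgumentException` on malformed input is an assumption chosen to keep the example short.

```java
// Illustrative sketch only; error handling by exception is an assumption.
final class Utf8DecodeSketch {
    /** Decodes the UTF-8 character starting at p (and ending before end) to a code point. */
    static int mbcToCode(byte[] bytes, int p, int end) {
        if (p >= end) throw new IllegalArgumentException("no bytes at " + p);
        int b0 = bytes[p] & 0xFF;
        if (b0 < 0x80) return b0; // ASCII maps to itself
        int len;
        int code;
        if ((b0 & 0xE0) == 0xC0) { len = 2; code = b0 & 0x1F; }
        else if ((b0 & 0xF0) == 0xE0) { len = 3; code = b0 & 0x0F; }
        else if ((b0 & 0xF8) == 0xF0) { len = 4; code = b0 & 0x07; }
        else throw new IllegalArgumentException("not a lead byte at " + p);
        if (p + len > end) throw new IllegalArgumentException("truncated character at " + p);
        for (int i = 1; i < len; i++) {
            int b = bytes[p + i] & 0xFF;
            if ((b & 0xC0) != 0x80) {
                throw new IllegalArgumentException("bad continuation byte at " + (p + i));
            }
            code = (code << 6) | (b & 0x3F); // append the 6 payload bits
        }
        return code;
    }

    public static void main(String[] args) {
        byte[] euro = {(byte) 0xE2, (byte) 0x82, (byte) 0xAC};
        System.out.println(Integer.toHexString(mbcToCode(euro, 0, euro.length))); // prints 20ac
    }
}
```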
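public int codeToMbc(int code, byte[] bytes, int p)
Description copied from class Encoding: Extracts a code point into its multibyte representation.
codeToMbc in class Encoding
Oniguruma equivalent: code_to_mbc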
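The reverse direction, writing a code point into a byte buffer at offset `p` and returning the number of bytes written, might look like the sketch below. The class `Utf8EncodeSketch` is hypothetical, and the exception on out-of-range input is an assumption rather than documented behaviour of `codeToMbc`.

```java
// Illustrative sketch only; does not reject surrogate code points.
final class Utf8EncodeSketch {
    /** Writes the UTF-8 form of code into bytes starting at p; returns the byte count. */
    static int codeToMbc(int code, byte[] bytes, int p) {
        if (code < 0 || code > 0x10FFFF) {
            throw new IllegalArgumentException("code point out of range: " + code);
        }
        if (code <= 0x7F) {
            bytes[p] = (byte) code;
            return 1;
        }
        if (code <= 0x7FF) {
            bytes[p] = (byte) (0xC0 | (code >> 6));
            bytes[p + 1] = (byte) (0x80 | (code & 0x3F));
            return 2;
        }
        if (code <= 0xFFFF) {
            bytes[p] = (byte) (0xE0 | (code >> 12));
            bytes[p + 1] = (byte) (0x80 | ((code >> 6) & 0x3F));
            bytes[p + 2] = (byte) (0x80 | (code & 0x3F));
            return 3;
        }
        bytes[p] = (byte) (0xF0 | (code >> 18));
        bytes[p + 1] = (byte) (0x80 | ((code >> 12) & 0x3F));
        bytes[p + 2] = (byte) (0x80 | ((code >> 6) & 0x3F));
        bytes[p + 3] = (byte) (0x80 | (code & 0x3F));
        return 4;
    }

    public static void main(String[] args) {
        byte[] buf = new byte[4];
        System.out.println(codeToMbc(0x20AC, buf, 0)); // prints 3
    }
}
```

The caller is responsible for providing a buffer with at least four free bytes after `p`; the sketch performs no bounds check of its own.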
public int mbcCaseFold(int flag, byte[] bytes, IntHolder pp, int end, byte[] fold)
mbcCaseFold in class UnicodeEncoding
flag - case fold flag
pp - an IntHolder that points at the character head
fold - a buffer into which the case-folded character is extracted
Oniguruma equivalent: mbc_case_fold
public int[] ctypeCodeRange(int ctype, IntHolder sbOut)
ctypeCodeRange in class Encoding
public int leftAdjustCharHead(byte[] bytes, int p, int s, int end)
leftAdjustCharHead in class Encoding
bytes - byte stream
p - position
s - stop
end - end
public boolean isReverseMatchAllowed(byte[] bytes, int p, int end)
isReverseMatchAllowed in class Encoding
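The idea behind left-adjusting to a character head, as `leftAdjustCharHead` is named to do, can be sketched by stepping back over UTF-8 continuation bytes (`10xxxxxx`). The class `Utf8CharHeadSketch`, the method `charHeadAtOrBefore`, and its parameters `start` and `pos` are hypothetical names; they are deliberately not mapped onto the `p` and `s` parameters documented above.

```java
// Illustrative sketch only; names and parameter roles are assumptions.
final class Utf8CharHeadSketch {
    /**
     * Returns the index of the first byte of the character that contains pos,
     * never moving before start.
     */
    static int charHeadAtOrBefore(byte[] bytes, int start, int pos) {
        int i = pos;
        // Continuation bytes have the bit pattern 10xxxxxx.
        while (i > start && (bytes[i] & 0xC0) == 0x80) {
            i--;
        }
        return i;
    }

    public static void main(String[] args) {
        // 'A', U+20AC (three bytes), 'B'
        byte[] s = {0x41, (byte) 0xE2, (byte) 0x82, (byte) 0xAC, 0x42};
        System.out.println(charHeadAtOrBefore(s, 0, 3)); // prints 1
    }
}
```

Because UTF-8 lead bytes and continuation bytes are distinguishable by their top bits, the scan needs no knowledge of the preceding characters.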
protected final boolean isCodeCTypeInternal(int code, int ctype)
protected final int asciiMbcCaseFold(int flag, byte[] bytes, IntHolder pp, int end, byte[] lower)
protected final void asciiApplyAllCaseFold(int flag, ApplyAllCaseFoldFunction fun, Object arg)
protected final CaseFoldCodeItem[] asciiCaseFoldCodesByString(int flag, byte[] bytes, int p, int end)