| Interface | Description |
|---|---|
| CharsetListener |
Listener to be notified when Charset is detected.
|
| Class | Description |
|---|---|
| Constants | |
| EncodingDetectorInputStream |
Stream that detect encoding while reading.
|
| EncodingDetectorOutputStream |
Stream that detect encoding while reading.
|
| ReaderFactory |
Create a reader from a file with correct encoding
|
| UnicodeBOMInputStream |
The
UnicodeBOMInputStream class wraps any
InputStream and detects the presence of any Unicode BOM
(Byte Order Mark) at its beginning, as defined by
RFC 3629 - UTF-8, a
transformation format of ISO 10646
The
Unicode FAQ
defines 5 types of BOMs:
00 00 FE FF = UTF-32, big-endian
FF FE 00 00 = UTF-32, little-endian
FE FF = UTF-16, big-endian
FF FE = UTF-16, little-endian
EF BB BF = UTF-8
Use the UnicodeBOMInputStream.getBOM() method to know whether a BOM has been detected
or not. |
| UnicodeBOMInputStream.BOM |
Type safe enumeration class that describes the different types of Unicode
BOMs.
|
| UniversalDetector |
| Enum | Description |
|---|---|
| UniversalDetector.InputState |
Copyright © 2024. All rights reserved.