Skip navigation links

Oracle® Database Globalization Development Kit Java API Reference
11g Release 1 (11.1)

Book Part Number B28299-01


oracle.i18n.lcsd
Class LCSDetectionHTMLReader

java.lang.Object
  extended by java.io.Reader
      extended by oracle.i18n.lcsd.LCSDetectionReader
          extended by oracle.i18n.lcsd.LCSDetectionHTMLReader

All Implemented Interfaces:
Closeable, Readable

public class LCSDetectionHTMLReader
extends LCSDetectionReader

The LCSDetectionHTMLReader class extends the LCSDetectionReader class to support the language/encoding detection for input in HTML format.

You can choose the character set of the HTML meta value or the detected character set value as the input character set with the flag METAVAL or DETECTVAL. The default flag value is DETECTVAL.

The detection sampling length indicates how many bytes of plain text on which the detection feature will perform. The default sampling length is 1K. Generally, LCSD handles the language/encoding detection, and you do not need to set this value. By allowing changes to this value, you can control the detection sampling length.

You can get the detection results from the LCSDResultSet class if needed.

Any read method returns UTFDataFormatException if the source is UTF-8 data and an invalid UTF-8 sequence is found.

Since:
10.2

Field Summary
static int DETECTVAL
          Constant value to represent DETECTVAL flag.
static int METAVAL
          Constant value to represent METAVAL flag.

 

Fields inherited from class oracle.i18n.lcsd.LCSDetectionReader
DEFAULT_SAMPLING_SIZE

 

Fields inherited from class java.io.Reader
lock

 

Constructor Summary
LCSDetectionHTMLReader(InputStream in)
          Creates an LCSDetectionHTMLReader object.
LCSDetectionHTMLReader(InputStream in, int len)
          Creates an LCSDetectionHTMLReader object.
LCSDetectionHTMLReader(InputStream in, int len, int flag)
          Creates an LCSDetectionHTMLReader object.

 

Method Summary

 

Methods inherited from class oracle.i18n.lcsd.LCSDetectionReader
close, getResult, mark, markSupported, read, read, read, ready, reset

 

Methods inherited from class java.io.Reader
read, skip

 

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

 

Field Detail

METAVAL

public static final int METAVAL
Constant value to represent METAVAL flag.
See Also:
Constant Field Values

DETECTVAL

public static final int DETECTVAL
Constant value to represent DETECTVAL flag.
See Also:
Constant Field Values

Constructor Detail

LCSDetectionHTMLReader

public LCSDetectionHTMLReader(InputStream in)
                       throws IOException,
                              UTFDataFormatException
Creates an LCSDetectionHTMLReader object. Use the default sampling length and default profile for detection. The detected character set is used for conversion.
Parameters:
in - input stream that you want to detect
Throws:
IOException - if any I/O error occurs
UTFDataFormatException - if any invalid UTF-8 data sequence is detected. Note this occurs only if the source is UTF-8 data

LCSDetectionHTMLReader

public LCSDetectionHTMLReader(InputStream in,
                              int len)
                       throws IOException,
                              UTFDataFormatException
Creates an LCSDetectionHTMLReader object. Use the specified sampling length and default profile for detection. The detected character set is used for conversion.
Parameters:
in - input stream that you want to detect
len - the sampling length
Throws:
IOException - if any I/O error occurs
UTFDataFormatException - if any invalid UTF-8 data sequence is detected. Note this occurs only if the source is UTF-8 data

LCSDetectionHTMLReader

public LCSDetectionHTMLReader(InputStream in,
                              int len,
                              int flag)
                       throws IOException,
                              UTFDataFormatException
Creates an LCSDetectionHTMLReader object. Use the specified sampling length and default profile for detection. The detected character set is used for conversion if the flag is DETECTVAL or the meta value of the character set is used for conversion if the flag is METAVAL.
Parameters:
in - input stream that you want to detect
len - the sampling length
flag - METAVAL or DETECTVAL
Throws:
IOException - if any I/O error occurs
UTFDataFormatException - if any invalid UTF-8 data sequence is detected. Note this occurs only if the source is UTF-8 data

Skip navigation links

Oracle® Database Globalization Development Kit Java API Reference
11g Release 1 (11.1)

Book Part Number B28299-01


Copyright © 2003, 2007, Oracle. All rights reserved.