org.pentaho.platform.dataaccess.datasource.wizard.csv
Class UnicodeBOMInputStream

java.lang.Object
  extended by java.io.InputStream
      extended by org.pentaho.platform.dataaccess.datasource.wizard.csv.UnicodeBOMInputStream
All Implemented Interfaces:
Closeable

public class UnicodeBOMInputStream
extends InputStream

The UnicodeBOMInputStream class wraps any InputStream and detects the presence of any Unicode BOM (Byte Order Mark) at its beginning, as defined by RFC 3629 - UTF-8, a transformation format of ISO 10646

The Unicode FAQ defines 5 types of BOMs:

Use the getBOM() method to know whether a BOM has been detected or not.

Use the skipBOM() method to remove the detected BOM from the wrapped InputStream object.


Nested Class Summary
static class UnicodeBOMInputStream.BOM
          Type safe enumeration class that describes the different types of Unicode BOMs.
 
Constructor Summary
UnicodeBOMInputStream(InputStream inputStream)
          Constructs a new UnicodeBOMInputStream that wraps the specified InputStream.
 
Method Summary
 int available()
          
 void close()
          
 UnicodeBOMInputStream.BOM getBOM()
          Returns the BOM that was detected in the wrapped InputStream object.
 void mark(int readlimit)
          
 boolean markSupported()
          
 int read()
          
 int read(byte[] b)
          
 int read(byte[] b, int off, int len)
          
 void reset()
          
 long skip(long n)
          
 UnicodeBOMInputStream skipBOM()
          Skips the BOM that was found in the wrapped InputStream object.
 
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

UnicodeBOMInputStream

public UnicodeBOMInputStream(InputStream inputStream)
                      throws NullPointerException,
                             IOException
Constructs a new UnicodeBOMInputStream that wraps the specified InputStream.

Parameters:
inputStream - an InputStream.
Throws:
NullPointerException - when inputStream is null.
IOException - on reading from the specified InputStream when trying to detect the Unicode BOM.
Method Detail

getBOM

public final UnicodeBOMInputStream.BOM getBOM()
Returns the BOM that was detected in the wrapped InputStream object.

Returns:
a BOM value.

skipBOM

public final UnicodeBOMInputStream skipBOM()
                                    throws IOException
Skips the BOM that was found in the wrapped InputStream object.

Returns:
this UnicodeBOMInputStream.
Throws:
IOException - when trying to skip the BOM from the wrapped InputStream object.

read

public int read()
         throws IOException

Specified by:
read in class InputStream
Throws:
IOException

read

public int read(byte[] b)
         throws IOException,
                NullPointerException

Overrides:
read in class InputStream
Throws:
IOException
NullPointerException

read

public int read(byte[] b,
                int off,
                int len)
         throws IOException,
                NullPointerException

Overrides:
read in class InputStream
Throws:
IOException
NullPointerException

skip

public long skip(long n)
          throws IOException

Overrides:
skip in class InputStream
Throws:
IOException

available

public int available()
              throws IOException

Overrides:
available in class InputStream
Throws:
IOException

close

public void close()
           throws IOException

Specified by:
close in interface Closeable
Overrides:
close in class InputStream
Throws:
IOException

mark

public void mark(int readlimit)

Overrides:
mark in class InputStream

reset

public void reset()
           throws IOException

Overrides:
reset in class InputStream
Throws:
IOException

markSupported

public boolean markSupported()

Overrides:
markSupported in class InputStream