Class RepositoryFilenameUtils

java.lang.Object
org.pentaho.platform.repository.RepositoryFilenameUtils

public class RepositoryFilenameUtils extends Object
General filename and filepath manipulation utilities for the Hitachi Vantara Repository. NOTE: these methods work independently of the underlying operating system. Most methods will translate a backslash (\) to a forward slash (/), but callers should not depend on that translation.

This class defines six components within a filename (example /dev/project/file.txt):

  • the prefix - /
  • the path - dev/project/
  • the full path - /dev/project/
  • the name - file.txt
  • the base name - file
  • the extension - txt
Note that this class works best if directory filenames end with a separator. If you omit the last separator, it is impossible to determine if the filename corresponds to a file or a directory. As a result, we have chosen to say it corresponds to a file.
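
The six components can be illustrated with a short sketch in plain JDK code. This is not the class's implementation: the class name FilenameComponentsSketch and the method split are hypothetical, and the sketch assumes forward-slash separators only.

```java
final class FilenameComponentsSketch {
    /** Returns {prefix, path, full path, name, base name, extension}. */
    static String[] split(String filename) {
        String prefix = filename.startsWith("/") ? "/" : "";
        int lastSep = filename.lastIndexOf('/');
        // Everything up to and including the last separator.
        String fullPath = filename.substring(0, lastSep + 1);
        // The path is the full path without the prefix.
        String path = fullPath.substring(prefix.length());
        String name = filename.substring(lastSep + 1);
        int dot = name.lastIndexOf('.');
        String baseName = dot < 0 ? name : name.substring(0, dot);
        String extension = dot < 0 ? "" : name.substring(dot + 1);
        return new String[] {prefix, path, fullPath, name, baseName, extension};
    }

    public static void main(String[] args) {
        // prints: / | dev/project/ | /dev/project/ | file.txt | file | txt
        System.out.println(String.join(" | ", split("/dev/project/file.txt")));
    }
}
```

Because "/dev/project" has no trailing separator, the same sketch would treat "project" as a file name, which matches the convention stated above.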

This class only supports Hitachi Vantara Repository (Unix) style names. Prefixes are matched as follows:

 a/b/c.txt           --> ""          --> relative
 /a/b/c.txt          --> "/"         --> absolute
 

Origin of code: Apache Commons IO 2.1

Since:
Pentaho 5.0
Author:
David M. Kincade
  • Method Details

    • normalize

      public static String normalize(String filename)
      Normalizes a path, removing double and single dot path steps.

      This method normalizes a path to a standard format.

      A trailing slash will be retained. A double slash will be merged to a single slash (but UNC names are handled). A single dot path segment will be removed. A double dot will cause that path segment and the one before to be removed. If the double dot has no parent path segment to work with, null is returned.

      The output will be the same on both Unix and Windows except for the separator character.

       /foo//               -->   /foo/
       /foo/./              -->   /foo/
       /foo/../bar          -->   /bar
       /foo/../bar/         -->   /bar/
       /foo/../bar/../baz   -->   /baz
       //foo//./bar         -->   /foo/bar
       /../                 -->   null
       ../foo               -->   null
       foo/bar/..           -->   foo/
       foo/../../bar        -->   null
       foo/../bar           -->   bar
       
      Parameters:
      filename - the filename to normalize, null returns null
      Returns:
      the normalized filename, or null if invalid
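
The rules above can be sketched with a stack of path steps. This is a minimal illustration in plain JDK code, not the class's actual implementation; the class name NormalizeSketch is hypothetical, and the sketch ignores UNC names entirely.

```java
import java.util.ArrayDeque;
import java.util.Deque;

final class NormalizeSketch {
    /** Returns the normalized path, or null when a ".." step has no parent. */
    static String normalize(String filename) {
        if (filename == null) {
            return null;
        }
        String s = filename.replace('\\', '/');
        boolean absolute = s.startsWith("/");
        boolean lastStepWasDots = false;
        Deque<String> steps = new ArrayDeque<>();
        for (String step : s.split("/")) {
            if (step.isEmpty()) {
                continue;                  // merges "//" into "/"
            }
            if (step.equals(".")) {
                lastStepWasDots = true;    // single dot is dropped
            } else if (step.equals("..")) {
                if (steps.isEmpty()) {
                    return null;           // no parent segment to remove
                }
                steps.removeLast();
                lastStepWasDots = true;
            } else {
                steps.addLast(step);
                lastStepWasDots = false;
            }
        }
        String prefix = absolute ? "/" : "";
        String body = String.join("/", steps);
        if (body.isEmpty()) {
            return prefix;
        }
        // A trailing slash is kept when the input named a directory.
        boolean trailing = s.endsWith("/") || lastStepWasDots;
        return prefix + body + (trailing ? "/" : "");
    }

    public static void main(String[] args) {
        System.out.println(normalize("/foo/../bar/"));   // prints: /bar/
        System.out.println(normalize("foo/../../bar"));  // prints: null
    }
}
```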
    • normalize

      public static String normalize(String filename, boolean leadingSlash)
      Normalizes a path, removing double and single dot path steps.

      This method normalizes a path to a standard format.

      A trailing slash will be retained. A double slash will be merged to a single slash (but UNC names are handled). A single dot path segment will be removed. A double dot will cause that path segment and the one before to be removed. If the double dot has no parent path segment to work with, null is returned.

      The output will be the same on both Unix and Windows except for the separator character.

       /foo//               -->   /foo/
       /foo/./              -->   /foo/
       /foo/../bar          -->   /bar
       /foo/../bar/         -->   /bar/
       /foo/../bar/../baz   -->   /baz
       //foo//./bar         -->   /foo/bar
       /../                 -->   null
       ../foo               -->   null
       foo/bar/..           -->   foo/
       foo/../../bar        -->   null
       foo/../bar           -->   bar
       
      Parameters:
      filename - the filename to normalize, null returns null
      leadingSlash - if true, ensures there is a leading slash on the result
      Returns:
      the normalized filename, or null if invalid
    • normalizeNoEndSeparator

      public static String normalizeNoEndSeparator(String filename)
      Normalizes a path, removing double and single dot path steps, and removing any final directory separator.

      This method normalizes a path to a standard format.

      A trailing slash will be removed. A double slash will be merged to a single slash (but UNC names are handled). A single dot path segment will be removed. A double dot will cause that path segment and the one before to be removed. If the double dot has no parent path segment to work with, null is returned.

      The output will be the same on both Unix and Windows except for the separator character.

       /foo//               -->   /foo
       /foo/./              -->   /foo
       /foo/../bar          -->   /bar
       /foo/../bar/         -->   /bar
       /foo/../bar/../baz   -->   /baz
       //foo//./bar         -->   /foo/bar
       /../                 -->   null
       ../foo               -->   null
       foo/bar/..           -->   foo
       foo/../../bar        -->   null
       foo/../bar           -->   bar
       
      Parameters:
      filename - the filename to normalize, null returns null
      Returns:
      the normalized filename, or null if invalid
    • concat

      public static String concat(String basePath, String fullFilenameToAdd)
      Concatenates a filename to a base path using normal command line style rules.

      The effect is equivalent to the resultant directory after changing directory to the first argument, followed by changing directory to the second argument.

      The first argument is the base path, the second is the path to concatenate. The returned path is always normalized via normalize(String), thus .. is handled.

      If fullFilenameToAdd is absolute (has an absolute prefix), then it will be normalized and returned. Otherwise, the paths will be joined, normalized and returned.

       /foo/ + bar          -->   /foo/bar
       /foo + bar           -->   /foo/bar
       /foo + /bar          -->   /bar
       /foo/a/ + ../bar     -->   /foo/bar
       /foo/ + ../../bar    -->   null
       /foo/ + /bar         -->   /bar
       /foo/.. + /bar       -->   /bar
       /foo + bar/c.txt     -->   /foo/bar/c.txt
       /foo/c.txt + bar     -->   /foo/c.txt/bar (!)
       
      (!) Note that the first parameter must be a path. If it ends with a name, then the name will be built into the concatenated path. If this might be a problem, use getFullPath(String) on the base path argument.
      Parameters:
      basePath - the base path to attach to, always treated as a path
      fullFilenameToAdd - the filename (or path) to attach to the base
      Returns:
      the concatenated path, or null if invalid
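
The join rule can be sketched as follows. This is an illustration in plain JDK code under stated assumptions, not the class's implementation: ConcatSketch is a hypothetical name, and its private normalize helper is a simplified stand-in for normalize(String).

```java
import java.util.ArrayDeque;
import java.util.Deque;

final class ConcatSketch {
    static String concat(String basePath, String fullFilenameToAdd) {
        if (fullFilenameToAdd == null) {
            return null;
        }
        if (fullFilenameToAdd.startsWith("/")) {
            return normalize(fullFilenameToAdd);   // absolute: base is ignored
        }
        if (basePath == null) {
            return null;
        }
        if (basePath.isEmpty()) {
            return normalize(fullFilenameToAdd);
        }
        // The base is always treated as a directory, even if it names a file.
        String separator = basePath.endsWith("/") ? "" : "/";
        return normalize(basePath + separator + fullFilenameToAdd);
    }

    // Simplified stand-in: resolves "." and "..", merges repeated slashes.
    private static String normalize(String s) {
        boolean dirLike = s.endsWith("/");
        Deque<String> steps = new ArrayDeque<>();
        for (String step : s.split("/")) {
            if (step.isEmpty()) {
                continue;
            }
            if (step.equals(".") || step.equals("..")) {
                if (step.equals("..")) {
                    if (steps.isEmpty()) {
                        return null;               // nothing left to step out of
                    }
                    steps.removeLast();
                }
                dirLike = true;
            } else {
                steps.addLast(step);
                dirLike = s.endsWith("/");
            }
        }
        String prefix = s.startsWith("/") ? "/" : "";
        String body = String.join("/", steps);
        return body.isEmpty() ? prefix : prefix + body + (dirLike ? "/" : "");
    }

    public static void main(String[] args) {
        System.out.println(concat("/foo", "bar/c.txt"));   // prints: /foo/bar/c.txt
        System.out.println(concat("/foo/c.txt", "bar"));   // prints: /foo/c.txt/bar
    }
}
```

The second call shows the pitfall marked (!) above: a base path that ends in a file name is still treated as a directory.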
    • separatorsToRepository

      public static String separatorsToRepository(String path)
      Converts all separators to the Repository (Unix) separator of forward slash.
      Parameters:
      path - the path to be changed, null ignored
      Returns:
      the updated path
    • getPrefixLength

      public static int getPrefixLength(String filename)
      Returns the length of the filename prefix.

      The prefix length includes the first slash in the full filename if applicable. Thus, it is possible that the length returned is greater than the length of the input string.

       a/b/c.txt           --> ""          --> relative
       /a/b/c.txt          --> "/"         --> absolute
       

      Parameters:
      filename - the filename to find the prefix in, null returns -1
      Returns:
      the length of the prefix, -1 if invalid or null
    • indexOfLastSeparator

      public static int indexOfLastSeparator(String filename)
      Returns the index of the last directory separator character.

      The position of the last forward or backslash is returned.

      Parameters:
      filename - the filename to find the last path separator in, null returns -1
      Returns:
      the index of the last separator character, or -1 if there is no such character
    • indexOfExtension

      public static int indexOfExtension(String filename)
      Returns the index of the last extension separator character, which is a dot.

      This method also checks that there is no directory separator after the last dot. To do this it uses indexOfLastSeparator(String).

      Parameters:
      filename - the filename to find the last extension separator in, null returns -1
      Returns:
      the index of the last extension separator character, or -1 if there is no such character
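
The check described above amounts to comparing two indexes. A minimal sketch in plain JDK code (the class name ExtensionIndexSketch is hypothetical, and this is not the class's own source):

```java
final class ExtensionIndexSketch {
    static int indexOfLastSeparator(String filename) {
        if (filename == null) {
            return -1;
        }
        return Math.max(filename.lastIndexOf('/'), filename.lastIndexOf('\\'));
    }

    static int indexOfExtension(String filename) {
        if (filename == null) {
            return -1;
        }
        int dot = filename.lastIndexOf('.');
        // A dot that sits before the last separator belongs to a directory name.
        return indexOfLastSeparator(filename) > dot ? -1 : dot;
    }

    public static void main(String[] args) {
        System.out.println(indexOfExtension("a/b/c.txt"));  // prints: 5
        System.out.println(indexOfExtension("a/b.txt/c"));  // prints: -1
    }
}
```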
    • getPrefix

      public static String getPrefix(String filename)
      Gets the prefix from a full filename.

      The prefix includes the first slash in the full filename where applicable.

       a/b/c.txt           --> ""          --> relative
       /a/b/c.txt          --> "/"         --> absolute
       

      Parameters:
      filename - the filename to query, null returns null
      Returns:
      the prefix of the file, null if invalid
    • getPath

      public static String getPath(String filename)
      Gets the path from a full filename, which excludes the prefix.

      The method is entirely text based, and returns the text before and including the last forward or backslash.

       a.txt        --> ""
       a/b/c        --> a/b/
       a/b/c/       --> a/b/c/
       /a.txt       --> ""
       /a/b/c       --> a/b/
       /a/b/c/      --> a/b/c/
       

      This method drops the prefix from the result. See getFullPath(String) for the method that retains the prefix.

      Parameters:
      filename - the filename to query, null returns null
      Returns:
      the path of the file, an empty string if none exists, null if invalid
    • getPathNoEndSeparator

      public static String getPathNoEndSeparator(String filename)
      Gets the path from a full filename, which excludes the prefix and also excludes the final directory separator.

      The method is entirely text based, and returns the text before the last forward or backslash.

       a.txt        --> ""
       a/b/c        --> a/b
       a/b/c/       --> a/b/c
       /a.txt       --> ""
       /a/b/c       --> a/b
       /a/b/c/      --> a/b/c
       

      This method drops the prefix from the result. See getFullPathNoEndSeparator(String) for the method that retains the prefix.

      Parameters:
      filename - the filename to query, null returns null
      Returns:
      the path of the file, an empty string if none exists, null if invalid
    • getFullPath

      public static String getFullPath(String filename)
      Gets the full path from a full filename, which is the prefix + path.

      The method is entirely text based, and returns the text before and including the last forward or backslash.

       a.txt        --> ""
       a/b/c        --> a/b/
       a/b/c/       --> a/b/c/
       /a.txt       --> /
       /a/b/c       --> /a/b/
       /a/b/c/      --> /a/b/c/
       

      The output will be the same irrespective of the machine that the code is running on.

      Parameters:
      filename - the filename to query, null returns null
      Returns:
      the path of the file, an empty string if none exists, null if invalid
    • getFullPathNoEndSeparator

      public static String getFullPathNoEndSeparator(String filename)
      Gets the full path from a full filename, which is the prefix + path, and also excludes the final directory separator.

      This method will handle a file in either Unix or Windows format. The method is entirely text based, and returns the text before the last forward or backslash.

       a.txt        --> ""
       a/b/c        --> a/b
       a/b/c/       --> a/b/c
       /a.txt       --> /
       /a/b/c       --> /a/b
       /a/b/c/      --> /a/b/c
       

      The output will be the same irrespective of the machine that the code is running on.

      Parameters:
      filename - the filename to query, null returns null
      Returns:
      the path of the file, an empty string if none exists, null if invalid
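
The four path accessors differ only in whether the prefix and the final separator are kept. The sketch below reproduces the tables above in plain JDK code; it is an illustration rather than the class's implementation, and PathAccessorsSketch is a hypothetical name.

```java
final class PathAccessorsSketch {
    private static int lastSeparator(String f) {
        return Math.max(f.lastIndexOf('/'), f.lastIndexOf('\\'));
    }

    /** Prefix + path: the text up to and including the last separator. */
    static String getFullPath(String f) {
        return f == null ? null : f.substring(0, lastSeparator(f) + 1);
    }

    /** As getFullPath, minus the final separator; a bare prefix is kept. */
    static String getFullPathNoEndSeparator(String f) {
        String full = getFullPath(f);
        if (full == null || full.length() <= 1) {
            return full;                       // "" and "/" stay unchanged
        }
        return full.substring(0, full.length() - 1);
    }

    /** As getFullPath, minus the leading "/" prefix. */
    static String getPath(String f) {
        String full = getFullPath(f);
        if (full == null) {
            return null;
        }
        return full.startsWith("/") ? full.substring(1) : full;
    }

    /** As getPath, minus the final separator. */
    static String getPathNoEndSeparator(String f) {
        String path = getPath(f);
        if (path == null || path.isEmpty()) {
            return path;
        }
        return path.substring(0, path.length() - 1);
    }

    public static void main(String[] args) {
        String f = "/a/b/c";
        System.out.println(getFullPath(f));                // prints: /a/b/
        System.out.println(getFullPathNoEndSeparator(f));  // prints: /a/b
        System.out.println(getPath(f));                    // prints: a/b/
        System.out.println(getPathNoEndSeparator(f));      // prints: a/b
    }
}
```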
    • getName

      public static String getName(String filename)
      Gets the name minus the path from a full filename.

      The text after the last forward or backslash is returned.

       a/b/c.txt --> c.txt
       a.txt     --> a.txt
       a/b/c     --> c
       a/b/c/    --> ""
       

      Parameters:
      filename - the filename to query, null returns null
      Returns:
      the name of the file without the path, or an empty string if none exists
    • getBaseName

      public static String getBaseName(String filename)
      Gets the base name, minus the full path and extension, from a full filename.

      The text after the last forward or backslash and before the last dot is returned.

       a/b/c.txt --> c
       a.txt     --> a
       a/b/c     --> c
       a/b/c/    --> ""
       

      Parameters:
      filename - the filename to query, null returns null
      Returns:
      the base name of the file, without the path or extension, or an empty string if none exists
    • getExtension

      public static String getExtension(String filename)
      Gets the extension of a filename.

      This method returns the textual part of the filename after the last dot. There must be no directory separator after the dot.

       foo.txt      --> "txt"
       a/b/c.jpg    --> "jpg"
       a/b.txt/c    --> ""
       a/b/c        --> ""
       

      Parameters:
      filename - the filename to retrieve the extension of.
      Returns:
      the extension of the file or an empty string if none exists.
    • removeExtension

      public static String removeExtension(String filename)
      Removes the extension from a filename.

      This method returns the textual part of the filename before the last dot. There must be no directory separator after the dot.

       foo.txt    --> foo
       a/b/c.jpg  --> a/b/c
       a/b/c      --> a/b/c
       a.b/c      --> a.b/c
       

      Parameters:
      filename - the filename to query, null returns null
      Returns:
      the filename minus the extension
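
The name, base name and extension accessors can all be derived from the last separator and the last dot. The following is a sketch in plain JDK code that reproduces the tables above; NameAccessorsSketch is a hypothetical name and the code is not taken from the class itself.

```java
final class NameAccessorsSketch {
    private static int lastSeparator(String f) {
        return Math.max(f.lastIndexOf('/'), f.lastIndexOf('\\'));
    }

    // Index of the extension dot, or -1 if the last dot is inside a directory name.
    private static int extensionIndex(String f) {
        int dot = f.lastIndexOf('.');
        return lastSeparator(f) > dot ? -1 : dot;
    }

    static String getName(String f) {
        return f == null ? null : f.substring(lastSeparator(f) + 1);
    }

    static String getExtension(String f) {
        if (f == null) {
            return null;
        }
        int i = extensionIndex(f);
        return i < 0 ? "" : f.substring(i + 1);
    }

    static String removeExtension(String f) {
        if (f == null) {
            return null;
        }
        int i = extensionIndex(f);
        return i < 0 ? f : f.substring(0, i);
    }

    static String getBaseName(String f) {
        // The base name is the name with its extension removed.
        return removeExtension(getName(f));
    }

    public static void main(String[] args) {
        System.out.println(getName("a/b/c.txt"));        // prints: c.txt
        System.out.println(getBaseName("a/b/c.txt"));    // prints: c
        System.out.println(getExtension("a/b/c.jpg"));   // prints: jpg
        System.out.println(removeExtension("a.b/c"));    // prints: a.b/c
    }
}
```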
    • equals

      public static boolean equals(String filename1, String filename2)
      Checks whether two filenames are equal exactly.

      No processing is performed on the filenames other than comparison, thus this is merely a null-safe case-sensitive equals.

      Parameters:
      filename1 - the first filename to query, may be null
      filename2 - the second filename to query, may be null
      Returns:
      true if the filenames are equal, null equals null
      See Also:
      • IOCase.SENSITIVE
    • equalsNormalized

      public static boolean equalsNormalized(String filename1, String filename2)
      Checks whether two filenames are equal after both have been normalized.

      Both filenames are first passed to normalize(String). The check is then performed in a case-sensitive manner.

      Parameters:
      filename1 - the first filename to query, may be null
      filename2 - the second filename to query, may be null
      Returns:
      true if the filenames are equal, null equals null
      See Also:
      • IOCase.SENSITIVE
    • isExtension

      public static boolean isExtension(String filename, String extension)
      Checks whether the extension of the filename is that specified.

      This method obtains the extension as the textual part of the filename after the last dot. There must be no directory separator after the dot. The extension check is case-sensitive on all platforms.

      Parameters:
      filename - the filename to query, null returns false
      extension - the extension to check for, null or empty checks for no extension
      Returns:
      true if the filename has the specified extension
    • isExtension

      public static boolean isExtension(String filename, String[] extensions)
      Checks whether the extension of the filename is one of those specified.

      This method obtains the extension as the textual part of the filename after the last dot. There must be no directory separator after the dot. The extension check is case-sensitive on all platforms.

      Parameters:
      filename - the filename to query, null returns false
      extensions - the extensions to check for, null checks for no extension
      Returns:
      true if the extension of the filename is one of those specified
    • isExtension

      public static boolean isExtension(String filename, Collection extensions)
      Checks whether the extension of the filename is one of those specified.

      This method obtains the extension as the textual part of the filename after the last dot. There must be no directory separator after the dot. The extension check is case-sensitive on all platforms.

      Parameters:
      filename - the filename to query, null returns false
      extensions - the extensions to check for, null checks for no extension
      Returns:
      true if the extension of the filename is one of those specified
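
The collection-based check can be sketched as below. This is plain JDK code written for illustration, with the hypothetical class name IsExtensionSketch. It assumes that an empty collection behaves like null (checks for no extension), which the text above states only for null.

```java
import java.util.Arrays;
import java.util.Collection;

final class IsExtensionSketch {
    // Index of the extension dot, or -1 if there is no extension.
    private static int extensionIndex(String f) {
        int dot = f.lastIndexOf('.');
        int sep = Math.max(f.lastIndexOf('/'), f.lastIndexOf('\\'));
        return sep > dot ? -1 : dot;
    }

    static boolean isExtension(String filename, Collection<String> extensions) {
        if (filename == null) {
            return false;
        }
        int i = extensionIndex(filename);
        if (extensions == null || extensions.isEmpty()) {
            return i == -1;                    // asks for "no extension"
        }
        String actual = i < 0 ? "" : filename.substring(i + 1);
        // String.equals (used by contains) makes the check case-sensitive.
        return extensions.contains(actual);
    }

    public static void main(String[] args) {
        Collection<String> allowed = Arrays.asList("xml", "json");
        System.out.println(isExtension("/reports/sales.xml", allowed));  // prints: true
        System.out.println(isExtension("/reports/sales.XML", allowed));  // prints: false
    }
}
```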
    • wildcardMatch

      public static boolean wildcardMatch(String filename, String wildcardMatcher)
      Checks a filename to see if it matches the specified wildcard matcher, always testing case-sensitive.

      The wildcard matcher uses the character '?' to represent a single character and '*' to represent multiple (zero or more) characters. This is the same as often found on DOS/Unix command lines. The check is always case-sensitive.

       wildcardMatch("c.txt", "*.txt")      --> true
       wildcardMatch("c.txt", "*.jpg")      --> false
       wildcardMatch("a/b/c.txt", "a/b/*")  --> true
       wildcardMatch("c.txt", "*.???")      --> true
       wildcardMatch("c.txt", "*.????")     --> false
       
      Parameters:
      filename - the filename to match on
      wildcardMatcher - the wildcard string to match against
      Returns:
      true if the filename matches the wildcard string
      See Also:
      • IOCase.SENSITIVE
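
The matching rule can be illustrated with a small recursive matcher. This is a sketch in plain JDK code, not the algorithm the class actually uses; WildcardSketch is a hypothetical name, and the treatment of null arguments is an assumption.

```java
final class WildcardSketch {
    static boolean wildcardMatch(String filename, String wildcardMatcher) {
        if (filename == null && wildcardMatcher == null) {
            return true;                        // assumption: null matches null
        }
        if (filename == null || wildcardMatcher == null) {
            return false;
        }
        return match(filename, 0, wildcardMatcher, 0);
    }

    private static boolean match(String text, int i, String pattern, int j) {
        if (j == pattern.length()) {
            return i == text.length();          // pattern used up: text must be too
        }
        char p = pattern.charAt(j);
        if (p == '*') {
            // Let '*' absorb zero or more characters, trying each split point.
            for (int k = i; k <= text.length(); k++) {
                if (match(text, k, pattern, j + 1)) {
                    return true;
                }
            }
            return false;
        }
        // '?' matches any single character; anything else must match exactly.
        if (i < text.length() && (p == '?' || p == text.charAt(i))) {
            return match(text, i + 1, pattern, j + 1);
        }
        return false;
    }

    public static void main(String[] args) {
        System.out.println(wildcardMatch("c.txt", "*.???"));   // prints: true
        System.out.println(wildcardMatch("c.txt", "*.????"));  // prints: false
    }
}
```

The recursion favours clarity over speed: patterns with many '*' characters can trigger heavy backtracking, so production code would normally use an iterative matcher.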
    • escape

      public static String escape(String name, List<Character> reservedChars)
      Performs percent-encoding (as specified in IUnifiedRepository) on given name, only encoding the characters given in reservedChars. Assumes only ASCII characters in reservedChars.
      Parameters:
      name - name to escape
      reservedChars - chars within name to escape
      Returns:
      escaped name
    • unescape

      public static String unescape(String name)
      Reverts the modifications made by escape(String, List), such that for all Strings t, t.equals(unescape(escape(t, reservedChars))). Assumes only ASCII characters have been escaped.
      Parameters:
      name - name to unescape
      Returns:
      unescaped name
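
A percent-encoding round trip of this kind might look like the sketch below. It is written in plain JDK code under stated assumptions and is not the class's implementation: PercentEscapeSketch is a hypothetical name, the %XX form with two uppercase hex digits is assumed, and the sketch always encodes '%' itself (even when it is not in reservedChars) so that unescape can reverse the result unambiguously.

```java
import java.util.Arrays;
import java.util.List;

final class PercentEscapeSketch {
    private static final String HEX = "0123456789ABCDEF";

    static String escape(String name, List<Character> reservedChars) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < name.length(); i++) {
            char c = name.charAt(i);
            // Assumption: '%' is always encoded so the round trip is lossless.
            if (c == '%' || reservedChars.contains(c)) {
                sb.append('%')
                  .append(HEX.charAt((c >> 4) & 0xF))
                  .append(HEX.charAt(c & 0xF));   // ASCII only: two hex digits
            } else {
                sb.append(c);
            }
        }
        return sb.toString();
    }

    static String unescape(String name) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < name.length(); i++) {
            char c = name.charAt(i);
            if (c == '%' && i + 2 < name.length()) {
                int hi = Character.digit(name.charAt(i + 1), 16);
                int lo = Character.digit(name.charAt(i + 2), 16);
                if (hi >= 0 && lo >= 0) {
                    sb.append((char) (hi * 16 + lo));
                    i += 2;                       // skip the two hex digits
                    continue;
                }
            }
            sb.append(c);                         // not a valid escape: copy as-is
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        List<Character> reserved = Arrays.asList('/', ':');
        String escaped = escape("a/b:c%", reserved);
        System.out.println(escaped);              // prints: a%2Fb%3Ac%25
        System.out.println(unescape(escaped));    // prints: a/b:c%
    }
}
```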