Package com.ibm.icu.text
Class CanonicalIterator
java.lang.Object
com.ibm.icu.text.CanonicalIterator
This class allows one to iterate through all the strings that are canonically equivalent to a given
string. For example, here are some sample results:
Results for: {A WITH RING ABOVE}{d}{DOT ABOVE}{CEDILLA}
Note: the code is intended for use with small strings, and is not suitable for larger ones, since it has not been optimized for that situation.
1: {A}{RING ABOVE}{d}{DOT ABOVE}{CEDILLA} 2: {A}{RING ABOVE}{d}{CEDILLA}{DOT ABOVE} 3: {A}{RING ABOVE}{d WITH DOT ABOVE}{CEDILLA} 4: {A}{RING ABOVE}{d WITH CEDILLA}{DOT ABOVE} 5: {A WITH RING ABOVE}{d}{DOT ABOVE}{CEDILLA} 6: {A WITH RING ABOVE}{d}{CEDILLA}{DOT ABOVE} 7: {A WITH RING ABOVE}{d WITH DOT ABOVE}{CEDILLA} 8: {A WITH RING ABOVE}{d WITH CEDILLA}{DOT ABOVE} 9: {ANGSTROM SIGN}{d}{DOT ABOVE}{CEDILLA} 10: {ANGSTROM SIGN}{d}{CEDILLA}{DOT ABOVE} 11: {ANGSTROM SIGN}{d WITH DOT ABOVE}{CEDILLA} 12: {ANGSTROM SIGN}{d WITH CEDILLA}{DOT ABOVE}
Note: the code is intended for use with small strings, and is not suitable for larger ones, since it has not been optimized for that situation.
-
Field Summary
FieldsModifier and TypeFieldDescriptionprivate StringBuilder
private int[]
private boolean
private final Normalizer2Impl
private final Normalizer2
private static final int
private String[][]
private static boolean
private static final int
private static boolean
private String
-
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionextract
(int comp, String segment, int segmentPos, StringBuffer buf) See if the decomposition of cp2 is at segment starting at segmentPos (with canonical rearrangment!) If so, take the remainder, and return the equivalentsprivate String[]
getEquivalents
(String segment) getEquivalents2
(String segment) Gets the NFD form of the current source we are iterating over.next()
Get the next canonically equivalent string.static void
Deprecated.This API is ICU internal only.private static void
Deprecated.This API is ICU internal only.void
reset()
Resets the iterator so that one can start again from the beginning.void
Set a new source for this iterator.
-
Field Details
-
PERMUTE_DEPTH_LIMIT
private static final int PERMUTE_DEPTH_LIMIT- See Also:
-
PROGRESS
private static boolean PROGRESS -
SKIP_ZEROS
private static boolean SKIP_ZEROS -
nfd
-
nfcImpl
-
source
-
done
private boolean done -
pieces
-
current
private int[] current -
buffer
-
RESULT_LIMIT
private static final int RESULT_LIMIT- See Also:
-
SET_WITH_NULL_STRING
-
-
Constructor Details
-
CanonicalIterator
Construct a CanonicalIterator object- Parameters:
source
- string to get results for
-
-
Method Details
-
getSource
Gets the NFD form of the current source we are iterating over.- Returns:
- gets the source: NOTE: it is the NFD form of the source originally passed in
-
reset
public void reset()Resets the iterator so that one can start again from the beginning. -
next
Get the next canonically equivalent string.
Warning: The strings are not guaranteed to be in any particular order.- Returns:
- the next string that is canonically equivalent. The value null is returned when the iteration is done.
-
setSource
Set a new source for this iterator. Allows object reuse.- Parameters:
newSource
- the source string to iterate against. This allows the same iterator to be used while changing the source string, saving object creation.
-
permute
Deprecated.This API is ICU internal only.Simple implementation of permutation.
Warning: The strings are not guaranteed to be in any particular order.- Parameters:
source
- the string to find permutations forskipZeros
- set to true to skip characters with canonical combining class zerooutput
- the set to add the results to
-
permute
@Deprecated private static void permute(String source, boolean skipZeros, Set<String> output, int depth) Deprecated.This API is ICU internal only.Simple implementation of permutation.
Warning: The strings are not guaranteed to be in any particular order.- Parameters:
source
- the string to find permutations forskipZeros
- set to true to skip characters with canonical combining class zerooutput
- the set to add the results todepth
- the depth of the recursive call.
-
getEquivalents
-
getEquivalents2
-
extract
See if the decomposition of cp2 is at segment starting at segmentPos (with canonical rearrangment!) If so, take the remainder, and return the equivalents
-