Text standardization components
| tJapaneseNumberNormalize | Normalizes Japanese numbers (kansūji) to regular Arabic numbers. |
| tJapaneseTokenize | Splits Japanese text into tokens. |
| tJapaneseTransliterate | Converts textual data in Japanese to kana and Latin scripts. |
| tStem | Enables to standardize data in columns before matching this data. |
| tTransliterate | Converts strings from many languages of the world to a standard set of characters (Universal Coded Character Set, UCS). |