tJapaneseTokenize Standard properties
These properties are used to configure tJapaneseTokenize running in the Standard Job framework.
The Standard tJapaneseTokenize component belongs to the Data Quality family.
The component in this framework is available in Talend Data Management Platform, Talend Big Data Platform, Talend Real-Time Big Data Platform, Talend Data Services Platform, and Talend Data Fabric.
Basic settings
Properties | Description |
---|---|
Schema and Edit Schema | |
Tokenization | The columns from the output schema are added to the Column column in the Tokenization table. For each of the schema columns containing Japanese text to be tokenized, select the corresponding check box in the Tokenize column. You can select the check box in the header row to select all schema columns. |
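
The per-column selection described in the Tokenization row can be pictured with a minimal Java sketch. It is not the component's implementation: the class `TokenizeSelectedColumns`, the methods `tokenize` and `processRow`, and the column names are hypothetical, the JDK's `BreakIterator` stands in for whatever tokenizer the component actually uses, and joining tokens with a single space is an assumption about the output format.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration only; not part of any Talend API.
class TokenizeSelectedColumns {

    // Stand-in tokenizer (assumption): splits text at the JDK's
    // Japanese word boundaries and drops whitespace-only pieces.
    static List<String> tokenize(String text) {
        BreakIterator it = BreakIterator.getWordInstance(Locale.JAPANESE);
        it.setText(text);
        List<String> tokens = new ArrayList<>();
        int start = it.first();
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            String token = text.substring(start, end);
            if (!token.trim().isEmpty()) {
                tokens.add(token);
            }
        }
        return tokens;
    }

    // Tokenizes only the columns named in 'selected' (the ones whose
    // Tokenize check box would be ticked); other columns pass through unchanged.
    static Map<String, String> processRow(Map<String, String> row, Set<String> selected) {
        Map<String, String> out = new LinkedHashMap<>();
        for (Map.Entry<String, String> e : row.entrySet()) {
            String value = e.getValue();
            if (selected.contains(e.getKey()) && value != null) {
                // Space-separated output is an assumption of this sketch.
                value = String.join(" ", tokenize(value));
            }
            out.put(e.getKey(), value);
        }
        return out;
    }

    public static void main(String[] args) {
        Map<String, String> row = new LinkedHashMap<>();
        row.put("id", "1");
        row.put("comment", "東京都に住んでいます");
        // Only the "comment" column is selected for tokenization.
        System.out.println(processRow(row, Collections.singleton("comment")));
    }
}
```

In this sketch, selecting the header-row check box would correspond to passing every column name in `selected`.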
Advanced settings
Properties | Description |
---|---|
tStatCatcher Statistics | Select this check box to gather the Job processing metadata at the Job level as well as at each component level. |
Usage
Usage guidance | Description |
---|---|
Usage rule | This component is usually used as an intermediate component, and it requires an input component and an output component. |