These properties are used to configure tEmbeddingAI running in the Standard Job framework.
The Standard tEmbeddingAI component belongs to the Embedding family.
Basic settings
Schema and Edit schema |
A schema is a row description, and it defines the fields to be processed and passed on to the next component.
Click Edit schema to make changes to the schema. If you make changes, the schema automatically becomes built-in.
|
Guess schema | Click this button to retrieve the schema according to your settings. |
Platform | Select or enter the platform used to embed your input data.
|
Ali Bailian (DashScope) parameters |
|
Amazon Bedrock parameters |
|
Azure OpenAI parameters |
|
Cohere parameters |
|
Google Vertex AI parameters |
|
Hugging Face parameters |
|
ONNX parameters |
Select the Use local ONNX file check box to use a local tokenizer file, then specify the model and tokenizer paths, as well as the pooling mode to apply. If the check box is cleared, the default embedded tokenizer file is used. See Introduction to ONNX for more information. |
Column for embedding |
Select or enter the schema column to which you want to apply the embedding model. |
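The pooling mode mentioned in the ONNX parameters controls how the per-token vectors produced by a model are combined into a single embedding for the input text. The following is a minimal sketch of two common pooling strategies, mean and CLS, given for illustration only. The class and method names are hypothetical, and this page does not state which modes the component offers or how it implements them.

```java
public class PoolingSketch {

    // Mean pooling: averages all token vectors element-wise
    // into one vector of the same dimension.
    static float[] meanPool(float[][] tokenVectors) {
        if (tokenVectors.length == 0) {
            throw new IllegalArgumentException("no token vectors");
        }
        int dim = tokenVectors[0].length;
        float[] pooled = new float[dim];
        for (float[] token : tokenVectors) {
            for (int i = 0; i < dim; i++) {
                pooled[i] += token[i];
            }
        }
        for (int i = 0; i < dim; i++) {
            pooled[i] /= tokenVectors.length;
        }
        return pooled;
    }

    // CLS pooling: keeps only the vector of the first token.
    static float[] clsPool(float[][] tokenVectors) {
        if (tokenVectors.length == 0) {
            throw new IllegalArgumentException("no token vectors");
        }
        return tokenVectors[0].clone();
    }

    public static void main(String[] args) {
        // Two made-up token vectors of dimension 2.
        float[][] tokens = { {1f, 2f}, {3f, 4f} };
        System.out.println(java.util.Arrays.toString(meanPool(tokens)));
        System.out.println(java.util.Arrays.toString(clsPool(tokens)));
    }
}
```

Mean pooling uses information from every token, whereas CLS pooling relies on the model having been trained to summarize the input in its first token, so the appropriate choice generally depends on the model.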
Advanced settings
tStatCatcher Statistics |
Select this check box to gather the Job processing metadata at the Job level as well as at each component level. |
Global Variables
ERROR_MESSAGE |
The error message generated by the component when an error occurs. This is an After variable and it returns a string. |
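As an illustration of how an After variable such as ERROR_MESSAGE is typically read, the sketch below looks it up in a map under a key built from the component label. It assumes the usual Job convention of a globalMap keyed by <component label>_<variable name>. The label tEmbeddingAI_1, the helper names, and the sample message are hypothetical, and a plain HashMap stands in for the Job's globalMap so that the example runs on its own.

```java
import java.util.HashMap;
import java.util.Map;

public class ErrorMessageSketch {

    // Builds the lookup key, assuming the <component label>_<VARIABLE> pattern.
    static String errorKey(String componentLabel) {
        return componentLabel + "_ERROR_MESSAGE";
    }

    // Returns the stored message, or an empty string if none was recorded.
    static String readError(Map<String, Object> globalMap, String componentLabel) {
        Object value = globalMap.get(errorKey(componentLabel));
        return value == null ? "" : (String) value;
    }

    public static void main(String[] args) {
        // Stand-in for the Job's globalMap; the message is made up.
        Map<String, Object> globalMap = new HashMap<>();
        globalMap.put("tEmbeddingAI_1_ERROR_MESSAGE", "sample error text");

        System.out.println(readError(globalMap, "tEmbeddingAI_1"));
    }
}
```

Because this is an After variable, the value is only meaningful once the component has finished executing, so it is normally read in a component that runs later in the Job.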
Usage
Usage rule |
This component can be used as a standalone component or as a start component of a Job or subJob. |