tEmbeddingAI
Leverages embedding models to efficiently process data with AI.
For a quick introduction on using this component in your Job, see the following video:
tEmbeddingAI Standard properties
These properties are used to configure tEmbeddingAI running in the Standard Job framework.
The Standard tEmbeddingAI component belongs to the AI family.
Basic settings
| Properties | Description |
|---|---|
|
Schema and Edit schema |
A schema is a row description, and it defines the fields to be processed and passed on to the next component.
Click Edit schema to make changes to the schema. If you make changes, the schema automatically becomes built-in.
|
| Guess schema | Click this button to retrieve the schema according to your settings. |
| Platform | Select the platform used to embed your input data.
|
| Ali Bailian (DashScope) parameters |
You can fill the API server URL in the Base URL parameter of the Advanced settings. |
| Amazon Bedrock parameters |
|
| Azure OpenAI parameters |
|
| Cohere parameters |
You can fill the API server URL in the Base URL parameter of the Advanced settings. |
| Google Vertex AI parameters |
|
| Hugging Face parameters |
You can fill the API server URL in the Base URL parameter of the Advanced settings. |
| Mistral AI parameters |
You can fill the API server URL in the Base URL parameter of the Advanced settings. |
| ONNX parameters |
Select the Use local ONNX file checkbox to use a local tokenizer file. Specify the model and tokenizer paths, as well as the pooling mode to apply. If unselected, the default embedded tokenizer file will be used. See Introduction to ONNX for more information. |
| Column for embedding |
Select or enter the schema column you want to apply the embedding model on. |
Advanced settings
| Properties | Description |
|---|---|
|
tStatCatcher Statistics |
Select this checkbox to gather the Job processing metadata at the Job level as well as at each component level. |
| Base URL | Type in the base URL address of the API server you want to access. This option is available for the Ali Bailian, Cohere, Hugging Face, and Mistral AI platforms. |
Global Variables
| Variables | Description |
|---|---|
|
ERROR_MESSAGE |
The error message generated by the component when an error occurs. This is an After variable and it returns a string. |
Usage
| Usage guidance | Description |
|---|---|
| Usage rule |
This component can be used as a standalone component or as a start component of a Job or subJob. |