tAthenaConfiguration properties for Apache Spark Batch
These properties are used to configure tAthenaConfiguration running in the Spark Batch Job framework.
The Spark Batch tAthenaConfiguration component belongs to the Databases family.
The component in this framework is available in all subscription-based Talend products with Big Data and Talend Data Fabric.
The AWS account must have permissions to run Athena queries and write query results to the specified S3 output location.
Basic settings
| Properties | Description |
|---|---|
| Region | Select the AWS Region for the Athena service endpoint. The default region is US East (N. Virginia) (us-east-1). |
| Catalog Name | Enter the AWS Glue Data Catalog name or federated catalog name to query. The value defaults to myCatalog. |
| Database Name | Enter the Athena (Glue) database name in the selected catalog. The value defaults to default. |
| S3 Output Location | Enter the Amazon S3 path for Athena query results, for example, s3://my-bucket/athena-results/. The AWS principal must have s3:PutObject and s3:GetBucketLocation permissions on this path. |
| Access Key ID | Enter the AWS IAM access key ID for authentication. Leave empty to use the default AWS credential provider chain, such as environment variables or instance profile credentials. |
| Secret Access Key | Enter the AWS IAM secret access key paired with the Access Key ID. The value is stored and transmitted in encrypted form. |
| Session Token | Enter the session token for temporary (short-lived) AWS STS credentials, for example, credentials obtained through AssumeRole. Leave empty when using the long-lived access keys of an IAM user. The value is stored and transmitted in encrypted form. |
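
The rules described in the table for S3 Output Location, Access Key ID, Secret Access Key, and Session Token can be sketched as a small pre-flight check. This is an illustrative sketch only, not part of the component: the function names parse_s3_output_location and credential_mode and the returned labels are hypothetical, and the check looks only at the shape of the values without contacting AWS.

```python
from urllib.parse import urlparse


def parse_s3_output_location(location):
    """Split an s3://bucket/prefix/ path into (bucket, prefix).

    Hypothetical helper: validates the format only, not bucket existence
    or the s3:PutObject / s3:GetBucketLocation permissions.
    """
    parsed = urlparse(location)
    if parsed.scheme != "s3" or not parsed.netloc:
        raise ValueError(f"expected s3://bucket/prefix/, got {location!r}")
    return parsed.netloc, parsed.path.lstrip("/")


def credential_mode(access_key_id="", secret_access_key="", session_token=""):
    """Classify which credential source the field values imply.

    Hypothetical helper mirroring the table:
    - empty Access Key ID  -> default AWS credential provider chain
    - key pair + token     -> temporary STS credentials
    - key pair, no token   -> long-lived IAM user credentials
    """
    if not access_key_id:
        return "default-provider-chain"
    if not secret_access_key:
        raise ValueError("Access Key ID requires a Secret Access Key")
    return "temporary-sts" if session_token else "long-lived-iam-user"


if __name__ == "__main__":
    print(parse_s3_output_location("s3://my-bucket/athena-results/"))
    print(credential_mode())
```

Running a check like this before the Job starts surfaces a malformed path or an incomplete key pair early, rather than at query time.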
Usage
| Usage guidance | Description |
|---|---|
| Usage rule | tAthenaConfiguration cannot start or terminate a subJob on its own and has no output data flow. In the tAthenaInput component, select the tAthenaConfiguration component to use from the Connection property. Only OnSubjobOk and OnSubjobError trigger links are supported; data flow (Row) links are not supported. This component requires a tSparkConfiguration component in the same Job. |