Component-specific settings for tPartition

The following table describes the Job script functions and parameters that you can define in the setSettings {} function of the component.

Function/parameter	Description	Mandatory?
NBPARTITIONS	Specify the number of partitions into which the input dataset is split. The default value is 4.	No
PARTITION_KEY {}	Include in this function one or more sets of the following parameters to define the key or keys to be used for the partitioning. `SCHEMA_COLUMN`: Specify a schema column. `KEY_ATTRIBUTE`: Set this parameter to `true` to use the specified schema column as the key of the partitioning.	Yes
CUSTOMPARTITIONER	Set this parameter to `true` to use a Spark partitioner that you import from outside the Studio. You must then provide the following information: `PARTITIONER_FQCN`: Enter the fully qualified class name of the partitioner to be imported. `PARTITIONER_JAR {}`: Include in this function one or more `JAR_NAME` parameters to specify the jar file or files that contain the specified class and its dependencies.	No
SORTINPARTITION	Set this parameter to `true` to sort the records within each partition. Then use the `COMPARATOR` parameter to define the type of sorting. Acceptable values of the `COMPARATOR` parameter: `NATURALORDER`: Sort keys in their natural order, for example, in alphabetical order. `CUSTOMCOMPARATOR`: Use a custom program to sort the keys. You must then use the `COMPARATOR_FQCN` parameter to specify the fully qualified class name of the comparator to be imported, and the `JAR_NAME` parameter to add the jar files to be loaded.	No
LABEL	Use this parameter to specify a text label for the component.	No
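
The parameters in the table above might be combined as in the following minimal sketch. It is illustrative only: the component name `tPartition_1`, the `POSITION` values, the column name `customer_id`, the label text, and the quoting of every value as a string are assumptions not confirmed by this page, and the surrounding `addComponent {}` and `setComponentDefinition {}` functions are assumed to follow the general Job script layout. The sketch splits the input into eight partitions keyed on one column and sorts each partition in natural order.

```
addComponent {
	setComponentDefinition {
		TYPE: "tPartition",
		NAME: "tPartition_1",
		POSITION: 320, 128
	}
	setSettings {
		NBPARTITIONS : "8",
		PARTITION_KEY {
			SCHEMA_COLUMN : "customer_id",
			KEY_ATTRIBUTE : "true"
		},
		SORTINPARTITION : "true",
		COMPARATOR : "NATURALORDER",
		LABEL : "partition_by_customer"
	}
}
```

To partition on several columns, the `SCHEMA_COLUMN` and `KEY_ATTRIBUTE` pair would presumably be repeated inside `PARTITION_KEY {}`, once per key column.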
