Adding a new database type
Talend Data Preparation allows a direct connection to various types of databases. You can use them as source to create new datasets.
You can manually enrich the list of databases from which you can import data.
The list of available database types for dataset creation actually depends on the JDBC drivers that you have stored in the <components_catalog_path>/.m2 folder.
Let's say that you have some customer data stored on an Oracle database, and you want to import it in Talend Data Preparation to perform cleansing operations. You will add a JDBC driver .jar file specific to Oracle databases to the Components Catalog folder structure to add this new source of data in the Talend Data Preparation interface.
In a Big Data context, if you want to run preparations made on data from your Oracle database, on the Hadoop cluster, the same driver must be added to the Spark Job Server folder structure.
You do not need to stop or restart any of the services to complete the following procedure.
Procedure
Results
The Oracle database is now available in the database type drop-down list in the import form.
When exporting a preparation made on data stored on your Oracle database, you can choose to process the data on the Talend Data Preparation server, or a Hadoop Cluster if you are using Big Data.