Skip to main content Skip to complementary content

Migrating to Talend Data Preparation 8.0.x with an embedded MongoDB

Before you begin

If you have installed and used Talend Dictionary Service to create custom semantic types or update the predefined ones, you must migrate Talend Dictionary Service before migrating Talend Data Preparation. See Migrating to Talend Dictionary Service 8.0.x.

Download Talend Data Preparation 8.0.x.

Upgrade Talend Administration Center. For more information, see Upgrading Talend Administration Center.

Procedure

  1. Stop your current Talend Data Preparation instance but keep the MongoDB instance running.
  2. Back up your MongoDB by running the following command.
    For Linux:
    <mongo_path>/bin
    ./mongodump -h <source_mongodb_host>:<source_mongo_port> -d <source_database> -u <source_mongodb_user> -p <source_mongodb_password> -o <dump_output>
    For Windows:
    <mongo_path>\bin>mongodump -h <source_mongodb_host>:<source_mongo_port> -d <source_database> -u <source_mongodb_user> -p <source_mongodb_password> -o <dump_output>
    The default value for the embedded MongoDB properties are the following:
    Property Value
    mongodb.host localhost
    mongodb.port 27017
    mongodb.database dataprep
    mongodb.user dataprep-user
    mongodb.password duser
  3. Stop the MongoDB instance from the previous version of Talend Data Preparation.
    Information noteNote: For Windows users, you must uninstall the current MongoDB before you install Talend Data Preparation.
  4. Install Talend Data Preparation and select the embedded MongoDB option. Do not start the instance.
    If you have installed Talend Data Preparation in service mode, stop the instance and MongoDB.
  5. Delete the database:
    1. Open a mongo shell as a Talend Data Preparation database user and execute the following commands.
      For Linux:
      cd <install_dir>/mongodb/bin
      ./mongo dataprep -u dataprep-user -p duser --port 27017
      For Windows:
      <install_dir>\mongodb\bin>mongo dataprep -u dataprep-user -p duser --port 27017
    2. Switch to the Talend Data Preparation database: use dataprep
    3. Drop the database: db.runCommand( { dropDatabase: 1 } )
    4. Exit mongo shell: exit
  6. Restore the MongoDB backup on the new 8.0 MongoDB instance by using the following command.
    For Linux:
    <mongo_path>/bin
    ./mongorestore -h <mongo_host>:<mongo_port> -u <mongo_tdp_username> -p <mongo_tdp_password> -d <mongo_tdp_database> <backup_location>
    For Windows:
    <mongo_path>\bin>mongorestore -h <mongo_host>:<mongo_port> -u <mongo_tdp_username> -p <mongo_tdp_password> -d <mongo_tdp_database> <backup_location>
  7. If you customized the <Data_Preparation_Path>/config/application.properties file in the previous version, you can copy the values from the old configuration file and add them to the new version.
  8. To retrieve the content of the data-set storage folder used for the previous Talend Data Preparation and migrate it to version 8.0, complete the next step according to the your situation:
    • If you have set a custom value for the dataset.content.store.file.location property, make sure that you set the same value in the application.properties file for the 8.0 version.
    • If you have never modified the value of the dataset.content.store.file.location property, copy the content of the <Data_Preparation_Path>/data folder used in 1.x/2.x, and paste it in the 8.0 installation folder, following the same path.
  9. Start Talend Data Preparation.
  10. Republish semantic types using Talend Dictionary Service API to migrate indexes to Lucene 8.

Results

You are now ready to use Talend Data Preparation.

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – please let us know!