Skip to main content Skip to complementary content

Profiling Hive

Once you create the Hive connection via the connection to the Hadoop distribution as outlined in Profiling an HDFS file, you can analyze the data present in all Hive tables.

Procedure

  1. Under the Metadata node in the DQ Repository tree view browse to the Hive connection.
  2. Right-click the Hive connection and select Overview Analysis.
    Contextual menu of a Hive connection.

    This analysis profiles database content to have an overview of the number of tables and rows per table. For further information, see Creating an analysis.

  3. Right-click a Hive table and select any of the analyses listed in the menu.
    Contextual menu of a table in a Hive connection.

    A wizard guides you through the steps to create the selected analysis. You can then assign indicators to the analyzed columns according to your need.

    For further information, see Column analyses, Table analyses and Analyzing duplicates.

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – please let us know!