Talend Data Catalog concepts
- Catalog
A catalog is an inventory of data assets, such as database tables, Data Integration Jobs or BI reports.
- Metadata
Metadata is structured information that describes a data resource, such as its name, type, location, author, date created, size and relationships with other data objects.
- Metadata repository
The metadata repository stores metadata created or imported from data sources, project configurations and reports.
- Metadata harvesting
Metadata harvesting means collecting metadata from a data source by using Talend Data Catalog bridges. The metadata is imported into a model and stored in the metadata repository.
- Bridge
A bridge is a platform-dedicated connector. It uses a specific driver to connect to a source tool and collect its metadata.
You can import metadata from data stores, Data Integration tools, Business Intelligence tools and business applications.
- Stitching
Stitching links models together in a configuration, once they are created, to define the data flow in the information system.
- Configuration
A configuration is an environment or workspace where you connect models to each other to build a global schema of the enterprise information system.
- Glossary
A glossary captures and defines the enterprise vocabulary to build a common language that everyone can understand.
- Data profiling
Data profiling is the process of examining the data from data sources imported into your catalog and collecting statistics and information about this data.
- Data sampling
Data sampling allows you to preview the content of database tables and data files imported into your catalog.
- Data class
Data classification helps you to detect, understand and classify the nature and purpose of the elements contained in the data sources imported in your catalog.
- Data-detected class
Data-detected classification automatically detects common data patterns based on predefined enumerations, patterns and regular expressions.
- Metadata-detected class
Metadata-detected classification detects classes based on metadata attributes.
- Sensitivity label
A sensitivity label can be applied to repository objects to determine their level of confidentiality.
- Global role
The global role determines the global responsibilities that you have on all catalog assets.
- Object role
The object role determines the responsibilities that you have on specific catalog assets, such as glossaries or models.
- Worksheet
A worksheet allows you to perform and save your searches or customize the tabs in the object pages.
- Dashboard
A dashboard provides insight into the catalog assets and is customizable to meet your specific needs.
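
To make the idea of a data-detected class more concrete, the following Python sketch shows how sampled column values could be matched against predefined enumerations and regular expressions. It is only an illustration of the general technique and does not reflect how Talend Data Catalog implements it. The class names, the patterns, the `detect_classes` function and the `threshold` parameter are all assumptions made for this example.

```python
import re

# Hypothetical data classes defined by regular expressions.
# These names and patterns are illustrative, not taken from Talend Data Catalog.
PATTERNS = {
    "Email": re.compile(r"[^@\s]+@[^@\s]+\.[^@\s]+"),
    "US ZIP code": re.compile(r"\d{5}(-\d{4})?"),
}

# Hypothetical data classes defined by an enumeration of allowed values.
ENUMERATIONS = {
    "Weekday": {
        "monday", "tuesday", "wednesday", "thursday",
        "friday", "saturday", "sunday",
    },
}


def detect_classes(values, threshold=0.8):
    """Return the classes matched by at least `threshold` of the non-empty values."""
    sample = [v.strip() for v in values if v and v.strip()]
    if not sample:
        return []

    detected = []
    # A value counts as a hit only if the whole value matches the pattern.
    for name, pattern in PATTERNS.items():
        hits = sum(1 for v in sample if pattern.fullmatch(v))
        if hits / len(sample) >= threshold:
            detected.append(name)
    # Enumeration lookup is case-insensitive in this sketch.
    for name, allowed in ENUMERATIONS.items():
        hits = sum(1 for v in sample if v.lower() in allowed)
        if hits / len(sample) >= threshold:
            detected.append(name)
    return detected
```

For example, `detect_classes(["a@example.com", "b@example.org"])` would return the hypothetical `Email` class. The threshold is a design choice of this sketch: it tolerates a few malformed values in a sample without discarding the class.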