When you click Retrieve Schema, a wizard opens in which
you can filter and browse the files and folders stored in HDFS.
Note: You can retrieve schemas from CSV, Avro, and Sequence files.
In the Name filter field, enter the name of the file(s)
you are looking for to filter the list.
Otherwise, expand the folders listed in this wizard by selecting the
check box before them. Then select the file(s) whose schema(s) you need
to retrieve.
Each time schema retrieval completes for a selected file, the Creation status of that file changes to Success.
Click Next to open a new view in the wizard
that lists the schema(s) of the selected file(s). Select any of them to display
its details in the Schema area.
Modify the selected schema if needed. You can rename the schema
and customize its structure in the Schema area: the toolbar allows you
to add, remove, or move columns in your schema.
To discard the modifications you made to the selected schema and restore
its default version, click Retrieve schema. Note that
this overwrites all custom edits.
Click Finish to complete the HDFS file schema
creation. All the retrieved schemas are displayed under the related HDFS
connection node in the Repository view.
If you need to edit a schema later, right-click it under the
relevant HDFS connection node in the Repository view and, from
the contextual menu, select Edit Schema to open this wizard
again and make the modifications.
Note: If you modify the schemas, ensure that the data type in the Type column is correctly defined.
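The following Python sketch illustrates why the Type column deserves a check after retrieval. It is not the wizard's actual retrieval logic: the function names (guess_type, infer_schema), the sample data, and the type labels (Integer, Double, Date, String) are all assumptions made for illustration. It guesses a type per column from sample CSV values, which is roughly the kind of heuristic any automatic schema guess has to rely on.

```python
import csv
import io
from datetime import datetime


def guess_type(values):
    """Return a coarse, illustrative type label for a column of raw strings."""
    non_empty = [v for v in values if v.strip() != ""]
    if not non_empty:
        return "String"

    def all_parse(parser):
        # True only if every non-empty value is accepted by the parser.
        try:
            for v in non_empty:
                parser(v)
            return True
        except ValueError:
            return False

    if all_parse(int):
        return "Integer"
    if all_parse(float):
        return "Double"
    if all_parse(lambda v: datetime.strptime(v, "%Y-%m-%d")):
        return "Date"
    return "String"


def infer_schema(csv_text, delimiter=","):
    """Guess (column name, type) pairs from CSV text whose first row is a header."""
    rows = list(csv.reader(io.StringIO(csv_text), delimiter=delimiter))
    header, data = rows[0], rows[1:]
    columns = list(zip(*data)) if data else [()] * len(header)
    return [(name, guess_type(list(col))) for name, col in zip(header, columns)]


# Hypothetical sample file content.
sample = (
    "id,zip,amount,signup\n"
    "1,02134,10.5,2024-01-03\n"
    "2,10001,7,2024-02-14\n"
)
schema = infer_schema(sample)
for name, type_label in schema:
    print(name, type_label)
```

In this sketch the zip column is guessed as Integer because every sample value parses as a number, although treating it as an integer would drop the leading zero of 02134. A column like this is a typical case where you would change the type manually (here, to a string type) before finishing the wizard.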