Before you begin, make sure you have annotated the named entities in the CoNLL
files to be used for training the model.
Procedure
Double-click the tFileInputDelimited component to open
its Basic settings view and define its properties.
Set the Schema to Built-in and click
Edit schema to define the desired
schema.
The first column in the output schema must be
tokens and the last one must be
labels. In between, you can have columns
for features you added manually.
In the Folder/file field, specify the path to
the training data.
Leave the Die on error check box selected.
In the Advanced settings view of the
component, select the Custom encoding check box if you
encounter issues when processing the data.
From the Encoding list, select the encoding
to be used, UTF-8 in this example.
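Before running the Job, it can help to check the training file outside the Studio. The following Python sketch is not part of the product and does not call any Talend API. It shows one way to verify the two constraints described in the steps above: the column layout (tokens first, labels last, optional feature columns in between) and the UTF-8 encoding. The function names, the tab delimiter, and the pos feature column are illustrative assumptions; it also assumes that tokens and labels are the literal column names. Adjust the delimiter to match the Field separator set in the component.

```python
import csv
import io


def check_schema(columns):
    """Return True if the column names match the layout from the steps:
    'tokens' first, 'labels' last, any feature columns in between.
    (Assumes 'tokens' and 'labels' are the literal column names.)"""
    return len(columns) >= 2 and columns[0] == "tokens" and columns[-1] == "labels"


def check_rows(raw_bytes, n_columns, delimiter="\t", encoding="utf-8"):
    """Decode the raw file content and return the 1-based numbers of the
    non-empty lines whose field count differs from n_columns.
    Raises UnicodeDecodeError if the content is not valid in `encoding`."""
    text = raw_bytes.decode(encoding)
    # QUOTE_NONE: treat quote characters as ordinary token text.
    reader = csv.reader(io.StringIO(text), delimiter=delimiter, quoting=csv.QUOTE_NONE)
    bad_lines = []
    for line_no, row in enumerate(reader, start=1):
        # Blank lines (sentence separators in CoNLL-style files) are skipped.
        if row and len(row) != n_columns:
            bad_lines.append(line_no)
    return bad_lines


# Hypothetical schema with one manually added feature column ("pos").
schema = ["tokens", "pos", "labels"]

# Hypothetical sample content; the last line is missing a column.
sample = "Talend\tNNP\tB-ORG\nis\tVBZ\tO\n\nParis\tB-LOC\n".encode("utf-8")

schema_ok = check_schema(schema)
bad_lines = check_rows(sample, len(schema))
```

To check a real file, read it in binary mode (for example, open(path, "rb").read()) and pass the bytes to check_rows. A UnicodeDecodeError at this point suggests that the file is not UTF-8 and that a different value may be needed in the Encoding list.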