This component is used to apply a Spark window on the input RDD so that this Job always
analyzes the Tweets of the last 20 seconds at the end of each 15 seconds. This
creates, between every two window applications, the overlap of one micro batch,
counting 5 seconds as defined in the Batch size
field in the Spark configuration tab.
In the Window duration field, enter 20000, meaning 20
seconds.
Select the Define the slide duration check
box and in the field that is displayed, enter 15000, meaning 15 seconds.
Results
The configuration of the window is then displaed above the icon of tWindow in the Job you are designing.
Did this page help you?
If you find any issues with this page or its content – a typo, a missing step, or a technical error – please let us know!