Quiz: Spatio-Temporal Transformers
Test your understanding of transformer applications in video analysis.
1
What’s the purpose of positional embeddings in the Video Transformer Network (VTN) architecture?
A)
To represent the time frames of each video frame
B)
To provide spatial information to the transformer encoder
C)
To facilitate attention across spatial dimensions
D)
To introduce a time dimension and enable the modeling of temporal relations
Question 1 of 40 attempted
Get hands-on with 1200+ tech skills courses.