Summary: Working with VideoBERT, BART, and More
Discover how VideoBERT combines video and language tokens for pre-training using text and visual data. Learn about the BART transformer that reconstructs original text from corrupted input with techniques like token masking and shuffling. Gain practical insights into using BART for text summarization and applying ktrain for sentiment analysis and document processing tasks.
We'll cover the following...
We'll cover the following...
Key highlights
Summarized below are the main highlights of what we have learned in this chapter.
We started off by learning how VideoBERT works. We learned how VideoBERT is pre-trained by predicting the masked language and ...