详细解决方案
(四十七):Supervised Multimodal Bitransformers for Classifying Images and Text
热度:95 发布时间:2023-11-17 07:40:45.0
(四十七):Supervised Multimodal Bitransformers for Classifying Images and Text
- Abstract
- 1 Introduction
- 2 Multimodal Bitransformers
-
- 2.1 Image Encoder
- 2.2 Multimodal Transformer Input Layer
- 2.3 Classification
- 2.4 Pre-training
- 2.5 Fine-tuning
- 4 Results
- 6 Conclusion
- 出处: ViGIL@NeurIPS 2019
- 代码:https://paperswithcode.com/paper/supervised-multimodal-bitransformers-for
- 题目:用于分类图像和文本的监督多模态Bitransformers
- 主要内容?