我这几天结合之前的阅读梳理了一下关于multi-modal相关的文献,附件包含了所列文献,你可以按照以下内容进行系统调研:
一、调研建议首先从综述入手,再针对综述中的参考文献进行深入调研,以下是比较好的Multi-modal方面的一些综述:
(1)ACL 2020上有个Tutorial:Multi-modal Information Extraction from Text, Semi-structured, and Tabular Data on the Web,地址:Multi-modal Information Extraction from Text, Semi-structured, and Tabular Data on the Web
(2)KDD2020 上有个Tutorial:Multi-modal Network Representation Learning,地址:https://chuxuzhang.github.io/KDD20_Tutorial.html
(3)多模态视觉语言表征学习研究综述 (软件学报2021)
(4)Survey on Deep Multi-modal Data Analytics Collaboration Rivalry and Fusion (ACM Trans. Multimedia Comput. Commun. Appl. 2021)
(5)A Survey on Deep Learning for Multimodal Data Fusion (Neural Computation2020)
(6)Multimodal Machine Learning A Survey and Taxonomy (IEEE Transactions on Pattern Analysi