Multimodal Deep Multipage Document Classification using both Image and Text
Author(s): Qaisar Tanvir Originally published on Towards AI. Document AI using python and Tensorflow, using CNN (for image) and BERT (for text), and combining both in a multimodal model to get the best of both worlds Inspired from : https://link.springer.com/chapter/10.1007/978-3-030-43823-4_35 The conventional …