Vision, Language and Reading

non-profit
https://www.vlr.ai/

AI & ML interests

Multimodal AI, Document Understanding, Reading Systems.

Recent Activity

emanuelevivoli authored a paper about 2 months ago:
CoSMo: A Multimodal Transformer for Page Stream Segmentation in Comic Books
emanuelevivoli authored a paper about 2 months ago:
Multimodal Transformer for Comics Text-Cloze
Llabres updated a dataset 2 months ago:
VLR-CVC/ComicsPAP

Papers

ComicsPAP: understanding comic strips by picking the correct panel

One missing piece in Vision and Language: A Survey on Comics Understanding

Members: Emanuele Vivoli, Tomás Ockier Poblet, Artemis Llabrés, Eric López, Khanh Nguyen, Mohamed Ali Souibgui, Dimosthenis Karatzas, Serra

VLR-CVC's datasets (1)

VLR-CVC/ComicsPAP

Updated Sep 29 • 80.6k • 1.57k • 13