FETA: Towards Specializing Foundational Models for Expert Task ApplicationsAmit AlfassyAssaf Arbelleet al.2022NeurIPS 2022
Delivering Document Conversion as a Cloud Service with High Throughput and ResponsivenessChristoph AuerMichele Dolfiet al.2022CLOUD 2022
pNLP-Mixer: an Efficient all-MLP Architecture for LanguageFrancesco FuscoPeter Staaret al.2023ACL 2023
PatCID: Large-scale chemical-structure database from images in patent documentsIngmar MeijerValery Weberet al.2023ACS Fall 2023
ESG Accountability Made Easy: DocQA at Your ServiceLokesh MishraCesar Berrospi Ramiset al.2024AAAI 2024
Docling: An Efficient Open-Source Toolkit for AI-driven Document ConversionNikos LivathinosChristoph Aueret al.2025AAAI 2025
Statements: Universal Information Extraction from Tables with Large Language Models for ESG KPIsLokesh MishraSohayl Dhibiet al.2024ACL 2024
PatCID: an open-access dataset of chemical structures in patent documentsLucas MorinValéry Weberet al.2024Nature Communications
Skin Tone Analysis for Representation in Educational Materials (STAR-ED) Using Machine LearningGirmaw Abebe TadesseCelia Cintaset al.2023npj Digital Medicine