A relational framework for information extraction

Ronald Fagin; Benny Kimelfeld; Frederick R. Reiss; Stijn Vansummeren

doi:10.1145/2935694.2935696

SIGMOD Record

Paper

01 Dec 2015

A relational framework for information extraction

View publication

Abstract

Information Extraction commonly refers to the task of populating a relational schema, having predefined underlying semantics, from textual content. This task is pervasive in contemporary computational challenges associated with Big Data. In this article we provide an overview of our work on document spanners-a relational framework for Information Extraction that is inspired by rule-based systems such as IBM's SystemT.

Paper