Muestra métricas de impacto externas asociadas a la publicación. Para mayor detalle:
| Indexado |
|
||
| DOI | 10.1145/3196959.3196968 | ||
| Año | 2018 | ||
| Tipo | proceedings paper |
Citas Totales
Autores Afiliación Chile
Instituciones Chile
% Participación
Internacional
Autores
Afiliación Extranjera
Instituciones
Extranjeras
Rule-based information extraction has lately received a fair amount of attention from the database community, with several languages appearing in the last few years. Although information extraction systems are intended to deal with semistructured data, all language proposals introduced so far are designed to output relations, thus making them incapable of handling incomplete information. To remedy the situation, we propose to extend information extraction languages with the ability to use mappings, thus allowing us to work with documents which have missing or optional parts. Using this approach, we simplify the semantics of regex formulas and extraction rules, two previously defined methods for extracting information. We extend them with the ability to handle incomplete data, and study how they compare in terms of expressive power. We also study computational properties of these languages, focusing on the query enumeration problem, as well as satisfiability and containment.
| Revista | ISSN |
|---|---|
| Pods'18: Proceedings Of The 37 Th Acm Sigmod Sigact Sigai Symposium On Principles Of Database Systems | 978-1-4503-4706-8 |
| Ord. | Autor | Género | Institución - País |
|---|---|---|---|
| 1 | Maturana, Francisco | Hombre |
Carnegie Mellon Univ - Estados Unidos
|
| 2 | Riveros, Cristian | Hombre |
Pontificia Universidad Católica de Chile - Chile
|
| 3 | Vrgoc, Domagoj | Hombre |
Pontificia Universidad Católica de Chile - Chile
|
| 4 | Arenas, M | - | |
| 5 | Ugarte, M | - | |
| 6 | VanDenBussche, J | - |
| Fuente |
|---|
| FONDECYT |
| Nucleus Millennium Center for Semantic Web Research |
| Nucleus Millennium Center for Semantic Web Research |
| Agradecimiento |
|---|
| The authors would like to thank Juan L. Reutter for many helpful comments during the preparation of this paper. The authors were supported by the Nucleus Millennium Center for Semantic Web Research grant NC120004. Vrgoc was also supported by FONDECYT project nr. 11160383, and Riveros by FONDECYT project nr. 11150653. |