Service-Oriented Architecture for automatic markup of documents. An use case for legal documents

Tools

CIFUENTES-SILVA, Francisco Adolfo (2014) Service-Oriented Architecture for automatic markup of documents. An use case for legal documents. Paper presented at: IFLA WLIC 2014 - Lyon - Libraries, Citizens, Societies: Confluence for Knowledge in Session 121 - Law Libraries with Parliamentary Libraries, Information Technology and Committee on Freedom of Access to Information and Freedom of Expression (FAIFE). In: IFLA WLIC 2014, 16-22 August 2014, Lyon, France.

Bookmark or cite this item: https://library.ifla.org/id/eprint/1048

Preview

PDF (215kB)

Language: Spanish (Original)

Available under licence Creative Commons Attribution.

Bookmark or cite this item: https://library.ifla.org/id/eprint/1048/1/121-cifuentes-es.pdf

Abstract

English

Service-Oriented Architecture for automatic markup of documents. An use case for legal documents

The problem of information extraction and automatic markup of plain text to XML, has been resolved partially in a specific domain of legal documents. Techniques such as named entity recognition, hierarchy detection of text sections and others has led to partially identify and retrieve different kind of information inside non structured documents. In this paper we introduce different interconnected components, the NLP techniques used on each component and the workflow needed for processing a plain text document and to generate a new full marked XML version of the document. The generated XML complies with the schema legal standard Akoma-Ntoso and is highly enriched with named entities, semantic URIS, structural sections, lists and elements sequences, between others. As an use case we analyze the experience of the Library of Congress of Chile in the context of the 'History of Law project' and Parliamentary Labor, where these architecture had a key role in order to accomplish the final product and results of processing and marking up different types or models of documents used in the legislative process.

Item Type:

Conference or Workshop Item (Paper)

Conference details:

IFLA WLIC 2014 - Lyon - Libraries, Citizens, Societies: Confluence for Knowledge

Session 121 - Access to law at the digital cross roads: Innovative solutions to complex challenges - Law Libraries with Parliamentary Libraries, Information Technology and Committee on Freedom of Access to Information and Freedom of Expression (FAIFE)

Tuesday 19 August 2014 09:30 - 12:45 | Room: Forum 1

Related URLs:

Congress website

Divisions:

Division 1 Library Types > Law Libraries Section
Division 1 Library Types > Library and Research Services for Parliaments Section
Division 3 Library Services > Information Technology Section
Division 4 Support of the Profession > Committee on Freedom of Access to Information and Freedom of Expression (FAIFE)

Authors:

Name	Affiliation	Country
CIFUENTES-SILVA, Francisco Adolfo	Servicios y Sistemas de Información en Red, Biblioteca del Congreso Nacional de Chile, Valparaíso	Chile

Uncontrolled Keywords:

Linked Open Data, Semantic Web, Akoma-Ntoso, Machine Learning, e-parliament

Date Deposited:

17 Aug 2014 09:10

Last Modified:

14 Aug 2017 08:55

URI:

https://library.ifla.org/id/eprint/1048

FOR IFLA HQ (login required)

Edit item

Search form

Service-Oriented Architecture for automatic markup of documents. An use case for legal documents

Abstract

Service-Oriented Architecture for automatic markup of documents. An use case for legal documents

Session 121 - Access to law at the digital cross roads: Innovative solutions to complex challenges - Law Libraries with Parliamentary Libraries, Information Technology and Committee on Freedom of Access to Information and Freedom of Expression (FAIFE)

FOR IFLA HQ (login required)