Archiving and Accessing HTML-Based Newspapers Using XML and CDATA Strings

WEIG, Eric (2016) Archiving and Accessing HTML-Based Newspapers Using XML and CDATA Strings. Paper presented at: IFLA WLIC 2016 – Columbus, OH – Connections. Collaboration. Community in Session S21 - Satellite Meeting: News Media. In: News, new roles & preservation advocacy: moving libraries into action, 10-12 August 2016, Lexington, KY, USA.

Bookmark or cite this item: http://library.ifla.org/id/eprint/2096
Language: English (Original)
Available under licence Creative Commons Attribution.

Abstract

This article outlines one in-house model for archiving and providing access to HTML-based news in the Kentucky Digital Newspaper Program (KDNP) at the University of Kentucky (UK). The KDNP already contains news content digitized from analog sources. To allow HTML-based news to be searched and retrieved alongside that content, the article describes encapsulating HTML content in XML-encoded CDATA strings, which are read by a prototype open-source PHP viewer.
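
The encapsulation idea can be illustrated with a minimal sketch. It is written in Python rather than the PHP used by the prototype viewer, and it is not the KDNP implementation: the element name "article" and the function names wrap_html_as_cdata and extract_html are hypothetical, chosen only for illustration. The sketch wraps an HTML fragment in a CDATA section so that its markup does not need to be escaped, then reads it back with a standard XML parser.

```python
import xml.etree.ElementTree as ET


def wrap_html_as_cdata(html: str, tag: str = "article") -> str:
    """Wrap an HTML fragment in a CDATA section inside one XML element.

    The element name is a hypothetical placeholder, not a KDNP schema name.
    """
    # The sequence "]]>" would close the CDATA section early, so it is
    # split across two adjacent CDATA sections.
    safe = html.replace("]]>", "]]]]><![CDATA[>")
    return f"<{tag}><![CDATA[{safe}]]></{tag}>"


def extract_html(xml_text: str) -> str:
    """Parse the XML record and return the original HTML string."""
    # The parser joins adjacent CDATA sections into the element's text.
    return ET.fromstring(xml_text).text or ""


html = "<p>Council votes <b>5-2</b> & adjourns</p>"
record = wrap_html_as_cdata(html)
assert extract_html(record) == html
```

Because the HTML sits inside CDATA, characters such as "<" and "&" pass through unchanged, and a viewer can retrieve the original markup as a single string for display or indexing.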
