Mining large datasets for the humanities

LEONARD, Peter (2014) Mining large datasets for the humanities. Paper presented at: IFLA WLIC 2014 - Lyon - Libraries, Citizens, Societies: Confluence for Knowledge in Session 119 - Academic and Research Libraries with Serials and Other Continuing Resources and Committee on Copyright and other Legal Matters (CLM). In: IFLA WLIC 2014, 16-22 August 2014, Lyon, France.

Bookmark or cite this item: https://library.ifla.org/id/eprint/930
[img]
Preview
Language: English (Original)
Available under licence Creative Commons Attribution.

Abstract

Mining large datasets for the humanities

This paper considers how libraries can support humanities scholars in working with large digitized collections of cultural material. Although disciplines such as corpus linguistics have already made extensive use of these collections, fields such as literature, history, and cultural studies stand at the threshold of new opportunity. Libraries can play an important role in helping these scholars make sense of big cultural data. In part, this is because many humanities graduate programs neither consider data skills a prerequisite, nor train their students in data analysis methods. As the ‘laboratory for the humanities,’ libraries are uniquely suited to host new forms of collaborative exploration of big data by humanists. But in order to do this successfully, libraries must consider three challenges: 1) How to evolve technical infrastructure to support the analysis, not just the presentation, of digitized artifacts. 2) How to work with data that may fall under both copyright and licensing restrictions. 3) How to serve as trusted partners with disciplines that have evolved thoughtful critiques of quantitative and algorithmic methodologies.

FOR IFLA HQ (login required)

Edit item Edit item
.