Wednesday, October 22, 2014 ..:: Home » Award Recipients - Round 2 (2011) ::..   
Site Navigation

 Round Two Welcome Message Minimize
Welcome to the second round of the Digging into Data Challenge. During the first round, in 2009, nearly 90 international research teams competed in the challenge. Ultimately, eight remarkable projects were awarded grants.
In 2011, the Digging into Data Challenge has returned for a second round, this time much larger, with sponsorship from eight international research funders, representing Canada, the Netherlands, the United Kingdom, and the United States.
What is the "challenge" we speak of?  The idea behind the Digging into Data Challenge is to address how "big data" changes the research landscape for the humanities and social sciences. Now that we have massive databases of materials used by scholars in the humanities and social sciences -- ranging from digitized books, newspapers, and music to transactional data like web searches, sensor data or cell phone records -- what new, computationally-based research methods might we apply? As the world becomes increasingly digital, new techniques will be needed to search, analyze, and understand these everyday materials. Digging into Data challenges the research community to help create the new research infrastructure for 21st century scholarship. 
Applicants will form international teams from at least two of the participating countries.  Winning teams will receive grants from two or more of the funding agencies and, two years later, will be invited to show off their work at a special conference sponsored by the eight funders.

Let's get digging.

 Press for Round 2 Minimize

Press Releases about the Winners of Round Two (January 2012)


Press Releases About the Launch of Round Two (March 2011)


Digging into Data Challenge in the News

Drexel University, January 24,2012, "iSchool Assistant Professor Michael Khoo Receives Digging into Data Challenge Grant"

Virginia Tech, January 17, 2012, "Virginia Tech researchers win Digging into Data Challenge"

University of North Carolina, January 9, 2012, "Digging Into Data challenge grant winners include SILS Professor, Dr. Richard Marciano"

McGill University, January 6, 2012, "Schulich School of Music scholars among winners of Digging into Data Challenge"

University of Oxford, January 6, 2012, "New Digging Into Data Challenge projects announced"

Kansas City Business Journal, January 5, 2012, "Saint Luke’s researchers will study Egyptian mummies"

Saint Luke's Health System, January 4, 2012, "Saint Luke's receives funding to further research into ancient Egyptian mummies"

Indiana University, School of Library and Information Science, January 4, 2012, "Digging into Data Challenge"

The London Free Press, January 3, 2012, "UWO ground zero for mummies"

University of Guelph, January 3, 2012, "U of G Prof Wins Grant to Dig Up Data"

The New York Times, August 17, 2011, "As the Gavels Fell: 240 Years at Old Bailey"

ScienceNews, July 30, 2011, "Crime’s digital past"

Nature, June 23, 2011, "Word Play" [PDF]

The Chronicle of Higher Education, June 12, 2011, "Digging Into Data, Day 2: Making Tools and Using Them"

The Chronicle of Higher Education, June 10, 2011, "Digging Into Data in the Humanities, Day 1"

Times Higher Education, April 28, 2011, "Research intelligence - Let's dig a little deeper"

The New York Times, November 16, 2010, "Digital Keys for Unlocking the Humanities’ Riches"

The Globe and Mail, June 18, 2010, "Supercomputers seek to ‘model humanity’"

The Lincoln Journal Star, February 6, 2010, "UNL team aims to digitize railroad history"

NetRadio, January 9, 2010, "Digging into the data: UNL leads international research in railroad history"

McGill Reporter, December 17, 2009, "Two McGill researchers among winners of new international competition"

The Mason Gazette, December 15, 2009, "Digging through the History of Crime Wins Center a Federal Grant"

The Chronicle of Higher Education, December 13, 2009 "How to Prepare Your College for an Uncertain Digital Future"

HPCWire, December 11, 2009, "Grant Supports Computational Analysis Of Manuscripts, Maps and Quilts"

Inside HPC, December 7, 2009, "What would you do with one million books?", December 4, 2009, "'Digging into Data Challenge' grant awarded"

The Tufts Daily, December 4, 2009, "Classics department researchers earn grant"

The Chronicle of Higher Education, December 4, 2009, "A 'New Digital Class' Digs Into Data"

 List of Round Two (2011) Awardees Minimize

Please click the "more information" link to learn more about each project and to access slide presentations from the 2013 Digging Conference held in Montreal, Canada.

Cascades, Islands, or Streams? Time, Topic, and Scholarly Activities in Humanities and Social Science Research

Principal Investigators: Cassidy R. Sugimoto, Ying Ding, Staša Milojević, Indiana University, Bloomington, NSF; Mike Thelwall, University of Wolverhampton, AHRC/ESRC/JISC; Vincent Larivière, Université de Montréal, SSHRC.

Description: This project will examine topic lifecycles across heterogeneous corpora, including not only scholarly and scientific literature, but also social networks, blogs, and other materials. While the growth of large-scale datasets has enabled examination within scientific datasets, there is little research that looks across datasets. The team will analyze the importance of various scholarly activities for creating, sustaining, and propelling new knowledge; compare and triangulate the results of topic analysis methods; and develop transparent and accessible tools. This work should identify which scholarly activities are indicative of emerging areas and identify datasets that should no longer be marginalized, but built into understandings and measurements of scholarship.

More Information



Principal Investigators: Robert C. Stacey, University of Washington, IMLS; Arno Knobbe, Leiden University, NWO; Sarah Rees Jones, University of York, AHRC/ESRC/JISC; Michael Gervers, University of Toronto, SSHRC. Additional participating institutions: University of Brighton, Columbia University.

Description: This project will develop new ways of exploring the full text content of digital historical records. The project will demonstrate its approach using medieval charters which survive in abundance from the 12th to the 16th centuries and are one of the richest sources for studying the lives of people in the past. The new ChartEx tools will enable users to really dig into the content of these records, to recover their rich descriptions of places and people, and to go far beyond current digital catalogues which restrict searches to a few key facts about each document (the ‘metadata’).

More Information


Digging into Connected Repositories (DiggiCORE)

Principal Investigators: Markus Muhr, The European Library Office, NWO; Zdenek Zdrahal, Petr Knoth The Open University, AHRC/ESRC/JISC

Description: This project will analyze a vast set of Open Access research publications using Natural Language Processing and social network analysis methods to identify patterns in the behavior of research communities, to recognize trends in research disciplines, to learn new insights about the citation behaviors of researchers and to discover features that distinguish papers with high impact. This will enable the development of better methods for exploratory search and browsing in digital collections or new ways of evaluating research or the researcher’s impact.

More Information 


Digging by Debating

Principal Investigators: Colin Allen and Katy Börner, Indiana University, Bloomington, NEH; Andrew Ravenscroft, University of East London, Chris Reed, University of Dundee, and David Bourget, University of London, AHRC/ESRC/JISC.

Description: A project to develop and implement a multi-scale workbench, called "InterDebates", with the goal of digging into data provided by hundreds of thousands, eventually millions, of digitized books, bibliographic databases of journal articles, and comprehensive reference works written by experts. The team’s hypotheses are: that detailed and identifiable arguments drive many aspects of research in the sciences and the humanities; that argumentative structures can be extracted from large datasets using a mixture of automated and social computing techniques; and, that the availability of such analyses will enable innovative interdisciplinary research, and may also play a role in supporting better-informed critical debates among students and the general public.

More Information


Digging into Human Rights Violations:  Anaphora Resolution and Emergent Witnesses

Principal Investigators: Ben Miller, Georgia State University, NSF; Lu Xiao, University of Western Ontario, SSHRC. Additional participating institutions: University of North Florida.

Description: This project will develop an automated reader for large text archives of human rights abuses that will reconstruct stories from fragments scattered across a collection, and an interface for navigating those stories.  By improving on anaphora resolution techniques in Natural Language Processing for the connection of pronouns to specific nouns, this system will help researchers and courts reveal witnesses and patterns contained in their own collections.

More Information


Digging into Metadata: Enhancing Social Science and Humanities Research

Principal Investigators: Mick Khoo, Drexel University, IMLS; Diana Massam, University of Manchester, AHRC/ESRC/JISC. Additional participating institutions: University of Glamorgan.

Description: The project will automatically generate new forms of metadata tags from existing metadata records and associated resources that will support discovery across multiple repositories.  The project will utilize four repositories that vary in size, domain, metadata creation method and workflow, and quality.  PERTAINS, a tool developed by one of the partner schools, will be used to analyze the metadata records in each repository and then to generate Dewey Decimal Classification-based tags.  Clustering algorithms will be used to generate an index of similarity and match between resources in different repositories.  After conducting a search, the user will retrieve a list of resources from the different collections that have been tagged in similar ways. Visualization techniques will be used to display the results in ways that enhance the research process.

More Information


Electronic Locator of Vertical Interval Successions (ELVIS): The First Large Data-Driven Research Project on Musical Style

Principal Investigators: Michael Scott Cuthbert, Massachusetts Institute of Technology, NEH; Frauke Jürgensen, University of Aberdeen, AHRC/ESRC/JISC; Julie E. Cumming, McGill University, SSHRC. Additional participating institutions: Yale University.

Description: A project to study changes in Western musical style from 1300 to 1900, using the digitized collections of several large music repositories. The team notes that in order to understand style change in Western polyphonic music we need to be able to describe acceptable vertical sonorities (chords) and melodic motions in each period, and how they change over time. The project aims to do this for European polyphony from 1300 to 1900, using advanced music information retrieval techniques to study highly contrasting kinds of music that are nevertheless unified by common concepts of tonality, consonance vs. dissonance, and voice leading.

More Information


An Epidemiology of Information: Data Mining the 1918 Influenza Pandemic

Principal Investigators: Edward T. Ewing, Bernice L. Hausman, Bruce Pencek, and Narendran Ramakrishnan, Virginia Polytechnic Institute & State University, NEH; Gunther Eysenbach, University of Toronto, SSHRC.

Description: This project seeks to harness the power of data mining techniques with the interpretive analytics of the humanities and social sciences to understand how newspapers shaped public opinion and represented authoritative knowledge during this deadly pandemic. This project makes use of the more than 100 newspaper titles for 1918 available from Chronicling America at the United States Library of Congress and the Peel’s Prairie Provinces collection at the University of Alberta Library. The application of algorithmic techniques enables the domain expert to systematically explore a broad repository of data and identify qualitative features of the pandemic in the small scale as well as the genealogy of information flow in the large scale. This research can provide methods for understanding the spread of information and the flow of disease in other societies facing the threat of pandemics.

More Information


Imagery Lenses for Visualizing Text Corpora

Principal Investigators: Katharine Coles, University of Utah, NEH; Min Chen, University of Oxford, AHRC/ESRC/JISC.

Description: A project to explore new visualization techniques for use in large scale linguistic and literary corpora using the collections of the British National Corpus and various smaller archives of poetry. The team will investigate whether or not advanced visualization techniques can provide an interface that enables humanities researchers to use their domain knowledge dynamically, while using the computational capability of computers. In particular, can data visualization help users make new observations and generate new hypotheses? The aim of this project is to answer the above methodological research question, and to create a set of new visualization tools for future scholarly research.

More Information


IMPACT Radiological Mummy Database

Principal Investigators: Randall Thompson, Saint Luke’s Mid America Heart Institute, NEH; Andrew Nelson, University of Western Ontario, SSHRC. Additional participating institutions:  Al Azhar Medical School, Cairo, Quinnipiac University, Canadian Museum of Civilization, University of Southern California, University of California, San Diego, Mount Sinai School of Medicine, South Coast Radiological Medical Group, Newport Diagnostic Center, University of California, Irvine, Wisconsin Heart Hospital.

Description: This project is designed to provide mummy and medical researchers with a large-scale comparative database of medical imaging of mummified human remains. This departure from a case-study model for mummy studies will drive the field towards a large-scale comparative and epidemiological paradigm. The Canadian team will be investigating the evisceration and excerebration components of the Egyptian mummification tradition, and the US teams will apply the database to a greatly expanded study of atherosclerosis in ancient Egyptian mummies, as part of the IMPACT Ancient Health Research Group, and to the refinement of a novel system of diagnosis by consensus for mummified remains.

More Information


Integrated Social History Environment for Research (ISHER)-Digging into Social Unrest

Principal Investigators: Dan Roth, University of Illinois, Urbana-Champaign, NSF; Antal van den Bosch, Tilburg University, NWO; Sophia Ananiadou, The University of Manchester, AHRC/ESRC/JISC. Additional participating institutions: International Institute of Social History.

Description: This project will develop an integrated environment using sophisticated text mining tools to facilitate knowledge discovery in social history research. It will provide social historians and social scientists with the means to detect and associate events, trends, people, organizations, and other entities of specific interest to social historians.

More Information


Integrating Data Mining and Data Management Technologies for Scholarly Inquiry

Principal Investigators: Ray R. Larson, University of California, Berkeley and Richard Marciano, University of North Carolina at Chapel Hill, IMLS; Paul B. Watry, University of Liverpool, AHRC/ESRC/JISC.  Additional participating institutions: Internet Archive, JSTOR.

Description: This project will integrate large-scale collections including JSTOR and the books collections of the Internet Archive stored and managed in a distributed preservation environment. It will also incorporate text mining and Natural Language Processing software capable of generating dynamic links to related resources discussing the same persons, places, and events. In this 17-month project we go beyond basic analysis by providing a prototype system developed to provide expert system support to scholars in their work.

More Information


Mining Microdata: Economic Opportunity and Spatial Mobility in Britain, Canada and the United States, 1850-1911

Principal Investigators: Evan Roberts, University of Minnesota, NSF; Kevin Schürer, University of Leicester, AHRC/ESRC/JISC; Kris E. Inwood, University of Guelph, SSHRC. Additional participating institutions: University of Alberta, Université de Montréal, University of Essex.

Description: This project will make use of novel data-mining technology to exploit one of the largest population databases in the world, a vast collection of harmonized 19th and early 20th century census microdata from Britain, Canada, and the United States originally digitized for genealogical research. The goal is to shed light on the impact of economic opportunity and spatial mobility on social structure in Europe and North America.

More Information


Trading Consequences

Principal Investigators: Ewan Klein, University of Edinburgh, AHRC/ESRC/JISC; Colin M. Coates, York University, SSHRC. Additional participating institutions: University of St Andrews.

Description: This project will examine the economic and environmental consequences of commodity trading during the nineteenth century. The project team will be using information extraction techniques to study large corpora of digitized documents from the nineteenth century. This innovative digital resource will allow historians to discover novel patterns and to explore new hypotheses, both through structured query and through a variety of visualization tools.

More Information

 Conference Sponsors Minimize

Sponsor Logos

Plus additional support from:

   CFI Logo

Privacy/Terms of Use