Hostname: page-component-8448b6f56d-c47g7 Total loading time: 0 Render date: 2024-04-23T22:42:32.761Z Has data issue: false hasContentIssue false

Citizen science archaeological finds on the Semantic Web: the FindSampo framework

Published online by Cambridge University Press:  12 July 2021

Eero Hyvönen*
Affiliation:
Department of Computer Science, Aalto University, Finland Helsinki Centre for Digital Humanities, University of Helsinki, Finland
Heikki Rantala
Affiliation:
Department of Computer Science, Aalto University, Finland Helsinki Centre for Digital Humanities, University of Helsinki, Finland
Esko Ikkala
Affiliation:
Department of Computer Science, Aalto University, Finland Helsinki Centre for Digital Humanities, University of Helsinki, Finland
Mikko Koho
Affiliation:
Department of Computer Science, Aalto University, Finland Helsinki Centre for Digital Humanities, University of Helsinki, Finland
Jouni Tuominen
Affiliation:
Department of Computer Science, Aalto University, Finland Helsinki Centre for Digital Humanities, University of Helsinki, Finland
Babatunde Anafi
Affiliation:
Department of Computer Science, Aalto University, Finland Helsinki Centre for Digital Humanities, University of Helsinki, Finland
Suzie Thomas
Affiliation:
Department of Cultures, University of Helsinki, Finland
Anna Wessman
Affiliation:
Department of Cultures, University of Helsinki, Finland
Eljas Oksanen
Affiliation:
Department of Cultures, University of Helsinki, Finland
Ville Rohiola
Affiliation:
Finnish Heritage Agency, Helsinki, Finland
Jutta Kuitunen
Affiliation:
Finnish Heritage Agency, Helsinki, Finland
Minna Ryyppö
Affiliation:
Finnish Heritage Agency, Helsinki, Finland
*
*Author for correspondence: ✉ eero.hyvonen@aalto.fi
Rights & Permissions [Opens in a new window]

Abstract

FindSampo fosters collecting, sharing, publishing and studying archaeological finds discovered by the public. The framework includes the following: a mobile find-reporting system; a semantic portal for researchers, the public and collection managers to use; and a Linked Open Data service for creating custom data analyses and for application developers.

Type
Project Gallery
Copyright
Copyright © The Author(s), 2021. Published by Cambridge University Press on behalf of Antiquity Publications Ltd.

Introduction

FindSampo develops a prototype framework system for supporting mobile finds data reporting in the field, and for studying archaeological artefacts discovered and reported by the public. While it is unique in responding to the archaeological conditions in Finland, and in providing solutions to its users’ needs (Wessman et al. Reference Wessman, Jameson and Musteaţă2019), the framework and its implementation are open source and can be replicated for use elsewhere. FindSampo sits within a broader context of digitising Finnish heritage (Hyvönen Reference Hyvönen, Reinsone, Skadiņa, Baklāne and Daugavietis2020), the European Public Finds Recording Network (2021) and the ARIADNEPlus (n.d.) infrastructure project.

Challenges

Metal-detectorists’ finds can contribute to archaeological knowledge and research. In Finland, however, it has been laborious to access data regarding new metal-detector finds, especially from a researcher perspective, and there is a backlog in the cataloguing process at the Finnish Heritage Agency preventing up-to-date research. Hence, a user-friendly tool for reporting, viewing, browsing and researching metal-detector finds to access high-quality metadata in a timely manner was needed. We adopted a ‘citizen science’ approach, conducting surveys, interviews and focus groups for future users to express their preferences.

FindSampo framework in action

FindSampo's framework consists of three major components:

  • FindSampo Reporter: a mobile application demonstrator to minimise the reporting work in the field, foster collaborative work with the Finnish Heritage Agency and create a Linked Open Data repository to be used in applications (Hassanzadeh et al. Reference Hassanzadeh, Hyvönen, Ikkala, Tuominen, Thomas, Wessman, Rohiola, Adamou, Daga and Meroño-Peñuela2020).

  • FindSampo Portal: a research prototype (FindSampo n.d.) of a portal for studying the finds online.

  • FindSampo Data Service: based on the Linked Data (LD) approach, standards and best practices of World Wide Web Consortium (W3C) (Heath & Bizer Reference Heath and Bizer2011). The linked data service (LD n.d.) with its application programming interface can be used for data analysis in digital humanities research with modest programming skills, and by developers for creating new applications of the find data.

FindSampo is based on the ‘Sampo model’ (Hyvönen Reference Hyvönen, Reinsone, Skadiņa, Baklāne and Daugavietis2020) using the “FAIR guiding principles for scientific data management and stewardship” (GoFair n.d.). This model includes three components. A business model for collating, aggregating and publishing heterogeneous, distributed data from different content providers based on a shared ontology infrastructure. An approach to interface design, where the data can be re-used and accessed independently from multiple application perspectives, while the data reside in a single SPARQL (Protocol and RDF Query Language) endpoint (WC3 2015). A two-step model for accessing and analysing the data, where the focus of interest is first filtered out using a faceted semantic search, and then visualised and analysed by ready-to-use Digital Humanities tools of the portal. Implementing user interfaces based on this model is supported by the open source Sampo-UI framework (SeCo n.d.).

In FindSampo archaeological finds can be searched using the faceted search paradigm (Tunkelang Reference Tunkelang2009), allowing narrowing of the result set by making orthogonal category value selections, such as object type, material, time period and place, based on underlying ontologies (Figure 1). Once a result set of interest has been found, ready-to-use data analytic tools and visualisations can be applied to it with additional contextual information. For example, it is possible to visualise finds on maps at the same time as seeing protected archaeological sites. If the question is about an individual find, its ‘home page’ can be studied further (Figure 2).

Figure 1. Different views in FindSampo Reporter. From top left: Clustered Map view provides an aggregated view of filtered finds on the map; HeatMap view visualises the filtered finds distribution in colours; Table view lists the finds in a traditional way; Statistics view illustrates statistical distributions of the finds along different facet dimensions, here based on the selected finds’ material (graphics by P. Hassanzadeh).

Figure 2. The novel timeline visualisation in the FindSampo portal's user interface (after Anafi et al. Reference Anafi, Koho and Hyvönen2020). The activated filters are shown on top of the facets in the faceted search section, and the result of the filters is displayed on the activated tab in the results area. The timeline visualisation shows the distribution of the filtered finds over time. Finds are grouped by material type, providing the user with a new perspective on the material distribution of the finds chronologically (graphics by B. Anafi).

The FindSampo Data Service with its SPARQL endpoint (WC3 n.d.) and data download facility can be used for custom-made analyses. Different software tools can be employed for this. Figure 3 presents a matrix showing probabilities that two types of items of the same era, here Iron Age, are found in the same area, here a municipality.

Figure 3. Analysis and visualisation of co-occurring Iron Age object types found in the same municipality, made using Python Matplotlib library and a Google Colab notebook. If coins are found then the probability for jewellery is 0.93, but finding jewellery indicates coins with less probability, i.e. 0.41. Probability for co-occurrence of weapon and coin finds seems low (graphics by H. Rantala).

Infrastructure: data and ontologies

The data used in FindSampo contain initially approximately 3000 finds transformed into LD and published on the LDF.fi platform (Hyvönen et al. Reference Hyvönen, Tuominen, Alonen, Mäkelä, Presutti, Blomqvist, Troncy, Sack, Papadakis and Tordai2014). This data model describes the finds in terms of values taken from a set of (hierarchical) ontologies, such as object types, materials and time periods. The ontologies collate heterogeneous data from different data sources and are used to enrich the data by data linking to external data sources and by reasoning based on the Semantic Web logical standards (W3C n.d.). The shared ontology infrastructure includes a new object type ontology of archaeological finds interlinked with the MAO/TAO ontology for Museum Domain and Applied Arts (Finnish Thesaurus and Ontology Service n.d.) and the Art and Architecture Thesaurus of the Getty Research Centre (Getty Research Institute n.d.), and a time-period ontology interlinked with the PeriodO (n.d.) ontology, as recommended for international semantic interoperability in the ARIADNEplus project.

Future visions

The surge of new metal-detected find records in Finland since the 2010s is rewriting our understanding of material culture and associated fields in social, cultural and economic history. To actualise these developments, the FindSampo framework offers novel, ground-breaking qualitative and quantitative research tools to advance digital humanities and citizen science research. Furthermore, a new Marie Skłodowska-Curie project (CORDIS 2021), which began in September 2020, will deploy FindSampo and other Finnish Heritage Agency archaeological data to produce new analysis of large-scale and long-term development of Finnish archaeological landscapes. To test the usability of the FindSampo framework for other find datasets, we plan to apply it to the large Portable Antiquities Scheme (2021) database managed by the British Museum (2021). These initiatives push towards a deeper understanding of the agency of the public as creators of new knowledge about the past.

Acknowledgements

CSC—IT Center for Science, Finland has provided computational resources for the work.

Funding statement

This article is an output of the research project SuALT—The Finnish Archaeological Finds Recording Linked Open Database (2017–2021), funded by the Academy of Finland (decision numbers 310854, 310859 and 310860). Thanks to AriadnePlus and the Marie Skłodowska-Curie project DeepFIN (grant agreement 896044) for additional funding.

References

Anafi, B., Koho, M. & Hyvönen, E.. 2020. Temporal visualization and data analysis of archaeological finds: case FindSampo. Conference on Cultural Heritage and New Technologies (CHNT 25). Museum Stadt Archäologie Wien, November, 2020. Available at: https://www.chnt.at/abstract-2020/ (accessed 4 June 2021).Google Scholar
AriadnePlus. n.d. Available at: https://ariadne-infrastructure.eu (accessed 4 June 2021)Google Scholar
British Museum. 2021. Treasure and the Portable Antiquities Scheme. Available at: https://www.britishmuseum.org/our-work/national/treasure-and-portable-antiquities-scheme (accessed 4 June 2021).Google Scholar
CORDIS. 2021. Assessing archaeological deep time in Finland through spatial exploration 500 BCE–1520 CE. Available at: https://cordis.europa.eu/project/id/896044 (accessed 4 June 2021).Google Scholar
European Public Finds Recording Network. 2021. Available at: https://www.helsinki.fi/en/networks/european-public-finds-recording-network (accessed 4 June 2021).Google Scholar
FindSampo. n.d. FindSampo portal. Available at: https://findsampo.fi (accessed 4 June 2021)Google Scholar
Finnish Thesaurus and Ontology Service. n.d. Available at: https://finto.fi/maotao/fi/ (accessed 4 June 2021).Google Scholar
Getty Research Institute. n.d. Art & architecture thesaurus® online. Available at: https://www.getty.edu/research/tools/vocabularies/aat/ (accessed 4 June 2021).Google Scholar
GoFair. n.d. FAIR principles. Available at: https://www.go-fair.org/fair-principles (accessed 4 June 2021).Google Scholar
Hassanzadeh, P., Hyvönen, E., Ikkala, E., Tuominen, J., Thomas, S., Wessman, A. & Rohiola, V.. 2020. FindSampo platform for reporting and studying archaeological finds using citizen science, in Adamou, A., Daga, E. & Meroño-Peñuela, A. (ed.) Proceedings of the 3rd workshop on humanities in the Semantic Web (WHiSe), Heraklion, Greece, June 2, 2020: 3340. CEUR Workshop Proceedings. Available at: http://ceur-ws.org/Vol-2695/ (accessed 4 June 2021).Google Scholar
Heath, T. & Bizer, C.. 2011. Linked Data: evolving the web into a global data space. Palo Alto: Morgan & Claypool. https://doi.org/10.2200/S00334ED1V01Y201102WBE001Google Scholar
Hyvönen, E. 2020. ‘Sampo’ model and semantic portals for digital humanities on the Semantic Web, in Reinsone, S., Skadiņa, I., Baklāne, A. & Daugavietis, J. (ed.) DHN 2020: digital humanities in the Nordic countries. Proceedings of the digital humanities in the Nordic countries 5th conference, Riga, Latvia, October 21–23, 2020: 373–78. CEUR Workshop Proceedings. Available at: http://ceur-ws.org/Vol-2612/ (accessed 4 June 2021).Google Scholar
Hyvönen, E., Tuominen, J., Alonen, M. & Mäkelä, E.. 2014. Linked Data Finland: a 7-star model and platform for publishing and re-using linked datasets, in Presutti, V., Blomqvist, E., Troncy, R., Sack, H., Papadakis, I. & Tordai, A. (ed.) The Semantic Web: ESWC 2014 satellite events. ESWC 2014: 226–30. Cham: Springer. https://doi.org/10.1007/978-3-319-11955-7_24CrossRefGoogle Scholar
LD. n.d. FindSampo linked data source. Available at: https://ldf.fi/dataset/findsampo (accessed 4 June 2021).Google Scholar
PeriodO. n.d. Available at: https://perio.do/en/ (accessed 4 June 2021).Google Scholar
Portable Antiquities Scheme. 2021. Available at: https://finds.org.uk (accessed 4 June 2021).Google Scholar
Semantic Computing Research Group (SeCo). n.d. Available at: https://seco.cs.aalto.fi/tools/sampo-ui/ (accessed 4 June 2021).Google Scholar
Tunkelang, D. 2009. Faceted search: synthesis lectures on information concepts, retrieval, and services. Palo Alto: Morgan & Claypool. https://doi.org/10.2200/S00190ED1V01Y200904ICR005CrossRefGoogle Scholar
WC3. 2015. Semantic Web. Available at: https://www.w3.org/standards/semanticweb/ (accessed 4 June 2021).Google Scholar
Wessman, A. et al. 2019. Citizen science in archaeology: developing a collaborative web service for archaeological finds in Finland, in Jameson, J. & Musteaţă, S. (ed.) Transforming heritage practice in the 21st century: one world archaeology: 337–52. Cham: Springer. https://doi.org/10.1007/978-3-030-14327-5_23CrossRefGoogle Scholar
Figure 0

Figure 1. Different views in FindSampo Reporter. From top left: Clustered Map view provides an aggregated view of filtered finds on the map; HeatMap view visualises the filtered finds distribution in colours; Table view lists the finds in a traditional way; Statistics view illustrates statistical distributions of the finds along different facet dimensions, here based on the selected finds’ material (graphics by P. Hassanzadeh).

Figure 1

Figure 2. The novel timeline visualisation in the FindSampo portal's user interface (after Anafi et al.2020). The activated filters are shown on top of the facets in the faceted search section, and the result of the filters is displayed on the activated tab in the results area. The timeline visualisation shows the distribution of the filtered finds over time. Finds are grouped by material type, providing the user with a new perspective on the material distribution of the finds chronologically (graphics by B. Anafi).

Figure 2

Figure 3. Analysis and visualisation of co-occurring Iron Age object types found in the same municipality, made using Python Matplotlib library and a Google Colab notebook. If coins are found then the probability for jewellery is 0.93, but finding jewellery indicates coins with less probability, i.e. 0.41. Probability for co-occurrence of weapon and coin finds seems low (graphics by H. Rantala).