Show simple item record

dc.contributor.authorBaker, Andrew
dc.contributor.authorHarvey, Virginia L.
dc.contributor.authorBuckley, Michael
dc.date.accessioned2023-06-15T09:22:27Z
dc.date.available2023-06-15T09:22:27Z
dc.date.issued2023-04-22
dc.identifierhttps://chesterrep.openrepository.com/bitstream/handle/10034/627858/BakerEtAlnedmid_AAM.pdf?sequence=4
dc.identifier.citationBaker, A., Harvey, V. L., & Buckley, M. (2023). Machine learning for collagen peptide biomarker determination in the taxonomic identification of archaeological fish remains. Journal of Archaeological Science: Reports, 49, 104001. https://doi.org/10.1016/j.jasrep.2023.104001en_US
dc.identifier.issn2352-409X
dc.identifier.doi10.1016/j.jasrep.2023.104001
dc.identifier.urihttp://hdl.handle.net/10034/627858
dc.description.abstractSpecies identification of archaeofaunal remains can be informative of changing local and global ecosystems, as well as how we interacted with them in the past. However, with the vast majority of assemblages being dominated by morphologically indeterminate specimens, methods of biomolecular species identification are becoming more popular, such as the protein fingerprint-based identification approach known as ZooMS (Zooarchaeology by Mass Spectrometry). As larger datasets are being produced, Machine Learning techniques such as those using Random Forest algorithms have been considered to expedite the identification process. However, it has proven difficult to extract meaningful biomarkers from these processes alone and so far only tackled for mammals. Here, we introduce a novel approach based on the principles of the ID3 algorithm for biomarker identification from ZooMS spectral databases, focussing on archaeological fish remains from Europe and the Caribbean. We show that the tool is highly effective at generating comprehensible lists of family- and genus-level biomarkers. At family-level identification, the average sensitivity across five families of fish was ∼0.9, where a specificity of 1 would indicate a greatly effective algorithm that outperforms both traditional Random Forest and Naïve Bayes approaches; at the genus level the mean sensitivity reduced to ∼0.8 across the nine genera tested. However, some anomalous matches were produced, with accuracy dropping when distinguishing between genera of the same family, such as Epinephelus and Mycteroperca belonging to the Serranidae. Therefore, while this tool has value in rapidly producing lists of biomarkers and can efficaciously identify new ZooMS fingerprints based on those markers, it indicates that manual intervention remains a requirement at finer taxonomic resolutions.en_US
dc.publisherElsevieren_US
dc.relation.urlhttps://www.sciencedirect.com/science/article/pii/S2352409X23001761?via%3Dihuben_US
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/en_US
dc.subjectZooMSen_US
dc.subjectIchthyoarchaeologyen_US
dc.subjectID3en_US
dc.subjectSpecies Identificationen_US
dc.titleMachine Learning for collagen peptide biomarker determination in the taxonomic identification of archaeological fish remainsen_US
dc.typeArticleen_US
dc.identifier.eissn2352-4103en_US
dc.contributor.departmentUniversity of Chester; University of Manchesteren_US
dc.identifier.journalJournal of Archaeological Science: Reportsen_US
or.grant.openaccessYesen_US
rioxxterms.funderBBSRC scholarship funding to AB; Royal Society Fellowship Funding to MBen_US
rioxxterms.identifier.projectBB/M011208/1; UF120473en_US
rioxxterms.versionAMen_US
rioxxterms.versionofrecord10.1016/j.jasrep.2023.104001en_US
rioxxterms.licenseref.startdate2025-04-22
dcterms.dateAccepted2023-04-03
rioxxterms.publicationdate2023-04-22
dc.date.deposited2023-06-15en_US


Files in this item

Thumbnail
Name:
Publisher version
Thumbnail
Name:
BakerEtAlnedmid_AAM.pdf
Embargo:
2025-04-22
Size:
806.5Kb
Format:
PDF
Request:
Article - AAM

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivatives 4.0 International
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivatives 4.0 International