English
 
Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
 
 
DownloadE-Mail
  Ten simple rules to follow when cleaning occurrence data in palaeobiology

Jones, L. A., Dean, C. D., Allen, B. J., Drage, H. B., Flannery‐Sutherland, J. T., Gearty, W., Chiarenza, A. A., Dillon, E. M., Farina, B. M., Godoy, P. L. (2025): Ten simple rules to follow when cleaning occurrence data in palaeobiology. - Palaeontology, 68, 5, e70028.
https://doi.org/10.1111/pala.70028

Item is

Files

show Files
hide Files
:
5037669.pdf (Publisher version), 2MB
Name:
5037669.pdf
Description:
-
OA-Status:
Hybrid
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
-

Locators

show

Creators

show
hide
 Creators:
Jones, Lewis A.1, Author
Dean, Christopher D.1, Author
Allen, Bethany J.2, Author           
Drage, Harriet B.1, Author
Flannery‐Sutherland, Joseph T.1, Author
Gearty, William1, Author
Chiarenza, Alfio Alessandro1, Author
Dillon, Erin M.1, Author
Farina, Bruna M.1, Author
Godoy, Pedro L.1, Author
Affiliations:
1External Organizations, ou_persistent22              
24.7 Earth Surface Process Modelling, 4.0 Geosystems, Departments, GFZ Publication Database, GFZ Helmholtz Centre for Geosciences, ou_1729888              

Content

show
hide
Free keywords: -
 Abstract: Large datasets of fossil occurrences, often downloaded from online community-maintained databases, are a vital resource for understanding broad-scale evolutionary patterns, such as how biodiversity has changed through time and space. Such datasets, however, are not infallible and must be ‘cleaned’ of inaccurate, incomplete, or duplicate data prior to analysis. Researchers must decide upon the extent, feasibility, and value of data cleaning steps to perform, but while guides are available for working with neontological occurrences, there is currently no clear procedure for palaeobiological data despite its unique attributes. Here, we outline ten rules that aim to aid the process of cleaning fossil occurrence data for downstream analysis. These rules cover the major steps involved in processing data prior to analysis, including project setup, data exploration and cleaning, and finalizing and reporting work. We provide accompanying examples and a vignette covering the entire data cleaning process to demonstrate the application of each rule. We believe that these rules will serve as a useful guideline to support data cleaning and foster new standards for the palaeobiological community.

Details

show
hide
Language(s):
 Dates: 2025-10-242025
 Publication Status: Finally published
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: DOI: 10.1111/pala.70028
GFZPOF: p4 T5 Future Landscapes
OATYPE: Hybrid Open Access
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: Palaeontology
Source Genre: Journal, SCI, Scopus
 Creator(s):
Affiliations:
Publ. Info: -
Pages: - Volume / Issue: 68 (5) Sequence Number: e70028 Start / End Page: - Identifier: Publisher: Wiley
CoNE: https://gfzpublic.gfz.de/cone/journals/resource/20221101