Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Apr 30;9(4):e11415.
doi: 10.1002/aps3.11415. eCollection 2021 Apr.

Estimating herbarium specimen digitization rates: Accounting for human experience

Affiliations

Estimating herbarium specimen digitization rates: Accounting for human experience

Caleb Powell et al. Appl Plant Sci. .

Abstract

Premise: Herbaria are invaluable sources for understanding the natural world, and in recent years there has been a concerted effort to digitize these collections. To organize such efforts, a method for estimating the necessary labor is desired. This work analyzes digitization productivity reports of 105 participants from eight herbaria, deriving generalized labor estimates that account for human experience.

Methods and results: Individuals' rates of digitization were grouped based on cumulative time performing each task and then used to estimate a series of generalized labor projection models. In most cases, productivity was shown to improve with experience, suggesting longer technician retention can reduce labor requirements by 20%.

Conclusions: Using student labor is a common tactic for digitization efforts, and the resulting outreach exposes future professionals to natural history collections. However, overcoming the learning curve should be considered when estimating the labor necessary to digitize a collection.

Keywords: biodiversity data; digitization rates; herbaria; natural history collections.

PubMed Disclaimer

Figures

Figure 1
Figure 1
The average technician skeletal databasing rate (specimen/minute) as a function of cumulative hours performing skeletal databasing. The mean rate at each two‐hour bin is indicated by the blue point, and the number of data points informing the mean is annotated over each point. The range of values at each bin are indicated by vertical bars.
Figure 2
Figure 2
The average technician imaging rate (specimen/minute) as a function of cumulative hours imaging. The mean rate at each two‐hour bin is indicated by the blue point, and the number of data points informing the mean is annotated over each point. The range of values at each bin are indicated by vertical bars.
Figure 3
Figure 3
The average technician barcode application rate (specimen/minute) as a polynomial function of cumulative hours applying barcodes. The mean rate at each two‐hour bin is indicated by the blue point, and the number of data points informing the mean is annotated over each point. The range of values at each bin are indicated by vertical bars.

References

    1. Gries, C. , Gilbert E. E., and Franz N. M.. 2014. Symbiota – A virtual platform for creating voucher‐based biodiversity information communities. Biodiversity Data Journal 2: e1114. - PMC - PubMed
    1. Harris, K. M. , and Marsico T. D.. 2017. Digitizing specimens in a small herbarium: A viable workflow for collections working with limited resources. Applications in Plant Sciences 5: 1600125. - PMC - PubMed
    1. McKinney, W. 2010. Data structures for statistical computing in Python. Proceedings of the 9th Python in Science Conference 445: 51–56.
    1. Motowidlo, S. J. , and Van Scotter J. R.. 1994. Evidence that task performance should be distinguished from contextual performance. Journal of Applied Psychology 79(4): 475. - PubMed
    1. Nelson, G. , Paul D., Riccardi G., and Mast A.. 2012. Five task clusters that enable efficient and effective digitization of biological collections. ZooKeys 209: 19–45. - PMC - PubMed

LinkOut - more resources