A multi-dimensional dataset of Ordovician to Silurian graptolite specimens for virtual examination,global correlation,and shale gas exploration
Author:
Honghe Xu
Date:
2022-06-23
DOI:
10.5281/zenodo.6688671
License:
Category:
C_CC_8_4
Keywords:
graptoliteAIspecimensimagepalaeontologystratigraphy
10.47GB
585
0
Abstract
A unique high-resolution image dataset consists of key graptolite species used for dating rocks, global correlation, and “gold caliper” for locating shale gas favourable exploration beds (FEBs) in China. All images were taken from 1,550 carefully curated graptolite specimens, taxonomically belong to 113 graptolite species or subspecies. These specimens were collected from 154 representative geological sections of the Ordovician to Silurian sediments of China and published in 1958-2020. All specimens are housed at the Nanjing Institute of Geology and Palaeontology (NIGP), Chinese Academy of Sciences (CAS). Detailed scientific information of every piece of fossil specimen is given in the attached spreadsheet file. My working group spent over two years to complete photographing every specimen using a single-lens reflex camera Nikon D800E with Nikkor 60 mm macro-lens and Leica M125 and M205C microscopes equipped with Leica cameras. Every image is well focused and better shows the morphology of graptolite bodies. In total, we took 40,597 images, including 20,644 camera photos (each with a resolution of 4,912 × 7,360) and 19,953 microscope photos (each with a resolution of 2,720 × 2,048). Photos of low contrast or bad focus were removed from the whole collection. We only kept and selected the photos that show the visual morphology of every specimen and the diagnostic character of each graptolite species that the specimens represent. We selected one image for each specimen as the present final dataset, uploaded to and stored in our cloud server. We incorporated revision suggestions from distinguished palaeontologists to generate the ground-truth labels, providing a taxonomical authority of the dataset. The dataset potentially contributes to a range of scientific activities and provides 1) easy access to high-resolution images of 2951 specimens of 113 graptolite species for teaching and training in palaeontology and geologic survey; 2) Global bio-stratigraphic correlation using graptolites, especially with those bio-zone species; 3) A standard fossil specimen image dataset used in shale gas industry to improve exploration efficiency, and 4) The potential aid of developing image-based automated classification model. Every specimen has two photos, one is original, another shows specimen with a scale bar. Occasionally in some large image the scale bar is embedded and beside the fossil specimen. For example, in the file name: ‘9721Cardiograptus_amplus_S.jpg’, genus name and species epithet are connected by the underline symbol, avoiding the space symbol. ‘9721’ is the specimen number, ‘Cardiograptus_amplus’ means species name is Cardiograptus amplus, ‘_S’ means it is a photo with scale bar.. In all scale bar, the minimum unit is millimeter. All specimen images are in 49 folders, every of which is zipped to one file that is about tens of MB to 740 MB in size.
Entity Data
Reference This Dataset
Honghe Xu.A multi-dimensional dataset of Ordovician to Silurian graptolite specimens for virtual examination,global correlation,and shale gas exploration.DDE Repository.2023-12-15.doi:10.5281/zenodo.6688671