Articles | Volume 14, issue 11
https://doi.org/10.5194/gmd-14-6711-2021
© Author(s) 2021. This work is distributed under
the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
https://doi.org/10.5194/gmd-14-6711-2021
© Author(s) 2021. This work is distributed under
the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
dh2loop 1.0: an open-source Python library for automated processing and classification of geological logs
Ranee Joshi
CORRESPONDING AUTHOR
Mineral Exploration Cooperative Research Centre, Centre for
Exploration Targeting, School of Earth Sciences, The University of Western
Australia, Perth, Australia
Kavitha Madaiah
Mineral Exploration Cooperative Research Centre, Centre for
Exploration Targeting, School of Earth Sciences, The University of Western
Australia, Perth, Australia
Mark Jessell
Mineral Exploration Cooperative Research Centre, Centre for
Exploration Targeting, School of Earth Sciences, The University of Western
Australia, Perth, Australia
Mark Lindsay
Mineral Exploration Cooperative Research Centre, Centre for
Exploration Targeting, School of Earth Sciences, The University of Western
Australia, Perth, Australia
CSIRO Mineral Resources, CSIRO - Kensington, Australian Resources
Research Centre (ARRC), Kensington, Australia
Guillaume Pirot
Mineral Exploration Cooperative Research Centre, Centre for
Exploration Targeting, School of Earth Sciences, The University of Western
Australia, Perth, Australia
Related authors
Guillaume Pirot, Ranee Joshi, Jérémie Giraud, Mark Douglas Lindsay, and Mark Walter Jessell
Geosci. Model Dev., 15, 4689–4708, https://doi.org/10.5194/gmd-15-4689-2022, https://doi.org/10.5194/gmd-15-4689-2022, 2022
Short summary
Short summary
Results of a survey launched among practitioners in the mineral industry show that despite recognising the importance of uncertainty quantification it is not very well performed due to lack of data, time requirements, poor tracking of interpretations and relative complexity of uncertainty quantification. To alleviate the latter, we provide an open-source set of local and global indicators to measure geological uncertainty among an ensemble of geological models.
Mark Jessell, Vitaliy Ogarko, Yohan de Rose, Mark Lindsay, Ranee Joshi, Agnieszka Piechocka, Lachlan Grose, Miguel de la Varga, Laurent Ailleres, and Guillaume Pirot
Geosci. Model Dev., 14, 5063–5092, https://doi.org/10.5194/gmd-14-5063-2021, https://doi.org/10.5194/gmd-14-5063-2021, 2021
Short summary
Short summary
We have developed software that allows the user to extract sufficient information from unmodified digital maps and associated datasets that we are able to use to automatically build 3D geological models. By automating the process we are able to remove human bias from the procedure, which makes the workflow reproducible.
Vitaliy Ogarko and Mark Jessell
EGUsphere, https://doi.org/10.5194/egusphere-2025-1294, https://doi.org/10.5194/egusphere-2025-1294, 2025
This preprint is open for discussion and under review for Geoscientific Model Development (GMD).
Short summary
Short summary
We developed a new method to reconstruct underground rock layers from drillhole data, using an advanced algorithm to ensure geologically realistic results. By combining data from multiple drillholes, our approach reduces uncertainty and improves accuracy. Tested on South Australian data, it successfully predicted stratigraphy and highlighted ways to enhance data quality. This innovation makes geological analysis more reliable, aiding exploration and resource management.
Mark Douglas Lindsay, Vitaliy Ogarko, Jeremie Giraud, and Mosayeb Khademi
EGUsphere, https://doi.org/10.5194/egusphere-2024-3754, https://doi.org/10.5194/egusphere-2024-3754, 2025
Preprint archived
Short summary
Short summary
Geophysical data is used to understand the Earth's structure. Geophysical inversion is an optimisation technique that constructs a model of the Earth using geophysical data. Inversion is complex and requires constraints to produce a plausible model. Overfitting is a disadvantageous effect that can affect techniques that rely on optimisation. A new feature in an inversion platform reduces overfitting by informing inversion of spatial uncertainty contained within the geophysical data.
Léonard Moracchini, Guillaume Pirot, Kerry Bardot, Mark W. Jessell, and James L. McCallum
Geosci. Model Dev. Discuss., https://doi.org/10.5194/gmd-2024-154, https://doi.org/10.5194/gmd-2024-154, 2024
Revised manuscript accepted for GMD
Short summary
Short summary
To facilitate the exploration of alternative hydrogeological scenarios, we propose to approximate costly physical simulations of contaminant transport by more affordable shortest distances computations. It enables to accept or reject scenarios within a predefined confidence interval. In particular, it can allow to estimate the probability of a fault acting as a preferential path or a barrier.
Alan Robert Alexander Aitken, Ian Delaney, Guillaume Pirot, and Mauro A. Werder
The Cryosphere, 18, 4111–4136, https://doi.org/10.5194/tc-18-4111-2024, https://doi.org/10.5194/tc-18-4111-2024, 2024
Short summary
Short summary
Understanding how glaciers generate sediment and transport it to the ocean is important for understanding ocean ecosystems and developing knowledge of the past cryosphere from marine sediments. This paper presents a new way to simulate sediment transport in rivers below ice sheets and glaciers and quantify volumes and characteristics of sediment that can be used to reveal the hidden record of the subglacial environment for both past and present glacial conditions.
Vitaliy Ogarko, Kim Frankcombe, Taige Liu, Jeremie Giraud, Roland Martin, and Mark Jessell
Geosci. Model Dev., 17, 2325–2345, https://doi.org/10.5194/gmd-17-2325-2024, https://doi.org/10.5194/gmd-17-2325-2024, 2024
Short summary
Short summary
We present a major release of the Tomofast-x open-source gravity and magnetic inversion code that is enhancing its performance and applicability for both industrial and academic studies. We focus on real-world mineral exploration scenarios, while offering flexibility for applications at regional scale or for crustal studies. The optimisation work described in this paper is fundamental to allowing more complete descriptions of the controls on magnetisation, including remanence.
Jiateng Guo, Xuechuang Xu, Luyuan Wang, Xulei Wang, Lixin Wu, Mark Jessell, Vitaliy Ogarko, Zhibin Liu, and Yufei Zheng
Geosci. Model Dev., 17, 957–973, https://doi.org/10.5194/gmd-17-957-2024, https://doi.org/10.5194/gmd-17-957-2024, 2024
Short summary
Short summary
This study proposes a semi-supervised learning algorithm using pseudo-labels for 3D geological modelling. We establish a 3D geological model using borehole data from a complex real urban local survey area in Shenyang and make an uncertainty analysis of this model. The method effectively expands the sample space, which is suitable for geomodelling and uncertainty analysis from boreholes. The modelling results perform well in terms of spatial morphology and geological semantics.
Jérémie Giraud, Hoël Seillé, Mark D. Lindsay, Gerhard Visser, Vitaliy Ogarko, and Mark W. Jessell
Solid Earth, 14, 43–68, https://doi.org/10.5194/se-14-43-2023, https://doi.org/10.5194/se-14-43-2023, 2023
Short summary
Short summary
We propose and apply a workflow to combine the modelling and interpretation of magnetic anomalies and resistivity anomalies to better image the basement. We test the method on a synthetic case study and apply it to real world data from the Cloncurry area (Queensland, Australia), which is prospective for economic minerals. Results suggest a new interpretation of the composition and structure towards to east of the profile that we modelled.
Guillaume Pirot, Ranee Joshi, Jérémie Giraud, Mark Douglas Lindsay, and Mark Walter Jessell
Geosci. Model Dev., 15, 4689–4708, https://doi.org/10.5194/gmd-15-4689-2022, https://doi.org/10.5194/gmd-15-4689-2022, 2022
Short summary
Short summary
Results of a survey launched among practitioners in the mineral industry show that despite recognising the importance of uncertainty quantification it is not very well performed due to lack of data, time requirements, poor tracking of interpretations and relative complexity of uncertainty quantification. To alleviate the latter, we provide an open-source set of local and global indicators to measure geological uncertainty among an ensemble of geological models.
Richard Scalzo, Mark Lindsay, Mark Jessell, Guillaume Pirot, Jeremie Giraud, Edward Cripps, and Sally Cripps
Geosci. Model Dev., 15, 3641–3662, https://doi.org/10.5194/gmd-15-3641-2022, https://doi.org/10.5194/gmd-15-3641-2022, 2022
Short summary
Short summary
This paper addresses numerical challenges in reasoning about geological models constrained by sensor data, especially models that describe the history of an area in terms of a sequence of events. Our method ensures that small changes in simulated geological features, such as the position of a boundary between two rock layers, do not result in unrealistically large changes to resulting sensor measurements, as occur presently using several popular modeling packages.
Mark Jessell, Jiateng Guo, Yunqiang Li, Mark Lindsay, Richard Scalzo, Jérémie Giraud, Guillaume Pirot, Ed Cripps, and Vitaliy Ogarko
Earth Syst. Sci. Data, 14, 381–392, https://doi.org/10.5194/essd-14-381-2022, https://doi.org/10.5194/essd-14-381-2022, 2022
Short summary
Short summary
To robustly train and test automated methods in the geosciences, we need to have access to large numbers of examples where we know
the answer. We present a suite of synthetic 3D geological models with their gravity and magnetic responses that allow researchers to test their methods on a whole range of geologically plausible models, thus overcoming one of the fundamental limitations of automation studies.
Jérémie Giraud, Vitaliy Ogarko, Roland Martin, Mark Jessell, and Mark Lindsay
Geosci. Model Dev., 14, 6681–6709, https://doi.org/10.5194/gmd-14-6681-2021, https://doi.org/10.5194/gmd-14-6681-2021, 2021
Short summary
Short summary
We review different techniques to model the Earth's subsurface from geophysical data (gravity field anomaly, magnetic field anomaly) using geological models and measurements of the rocks' properties. We show examples of application using idealised examples reproducing realistic features and provide theoretical details of the open-source algorithm we use.
Mahtab Rashidifard, Jérémie Giraud, Mark Lindsay, Mark Jessell, and Vitaliy Ogarko
Solid Earth, 12, 2387–2406, https://doi.org/10.5194/se-12-2387-2021, https://doi.org/10.5194/se-12-2387-2021, 2021
Short summary
Short summary
One motivation for this study is to develop a workflow that enables the integration of geophysical datasets with different coverages that are quite common in exploration geophysics. We have utilized a level set approach to achieve this goal. The utilized technique parameterizes the subsurface in the same fashion as geological models. Our results indicate that the approach is capable of integrating information from seismic data in 2D to guide the 3D inversion results of the gravity data.
Lachlan Grose, Laurent Ailleres, Gautier Laurent, Guillaume Caumon, Mark Jessell, and Robin Armit
Geosci. Model Dev., 14, 6197–6213, https://doi.org/10.5194/gmd-14-6197-2021, https://doi.org/10.5194/gmd-14-6197-2021, 2021
Short summary
Short summary
Fault discontinuities in rock packages represent the plane where two blocks of rock have moved. They are challenging to incorporate into geological models because the geometry of the faulted rock units are defined by not only the location of the discontinuity but also the kinematics of the fault. In this paper, we outline a structural geology framework for incorporating faults into geological models by directly incorporating kinematics into the mathematical framework of the model.
Mark Jessell, Vitaliy Ogarko, Yohan de Rose, Mark Lindsay, Ranee Joshi, Agnieszka Piechocka, Lachlan Grose, Miguel de la Varga, Laurent Ailleres, and Guillaume Pirot
Geosci. Model Dev., 14, 5063–5092, https://doi.org/10.5194/gmd-14-5063-2021, https://doi.org/10.5194/gmd-14-5063-2021, 2021
Short summary
Short summary
We have developed software that allows the user to extract sufficient information from unmodified digital maps and associated datasets that we are able to use to automatically build 3D geological models. By automating the process we are able to remove human bias from the procedure, which makes the workflow reproducible.
Lachlan Grose, Laurent Ailleres, Gautier Laurent, and Mark Jessell
Geosci. Model Dev., 14, 3915–3937, https://doi.org/10.5194/gmd-14-3915-2021, https://doi.org/10.5194/gmd-14-3915-2021, 2021
Short summary
Short summary
LoopStructural is an open-source 3D geological modelling library with a model design allowing for multiple different algorithms to be used for comparison for the same geology. Geological structures are modelled using structural geology concepts and techniques, allowing for complex structures such as overprinted folds and faults to be modelled. In the paper, we demonstrate automatically generating a 3-D model from map2loop-processed geological survey data of the Flinders Ranges, South Australia.
Cited articles
Ailleres, L., Jessell, M., de Kemp, E., Caumon, G., Wellmann, F., Grose,
L., Armit, R., Lindsay, M., Giraud, J., Brodaric, B., Harrison, M., and
Courrioux, G.: Loop – Enabling 3D stochastic geological modelling, ASEG
Extended Abstracts, 2019, 1–3, https://doi.org/10.1080/22020586.2019.12072955, 2019.
Anand, R. R. and Butt, C. R. M.: A guide for mineral exploration through
the regolith in the Yilgarn Craton, Western Australia, Aust. J.
Earth Sci., 57, 1015–1114,
https://doi.org/10.1080/08120099.2010.522823, 2010.
Arabjamaloei, R., Edalatkha, S., Jamshidi, E., Nabaei, M., Beidokhti, M.,
and Azad, M.: Exact Lithologic Boundary Detection Based on Wavelet Transform
Analysis and Real-Time Investigation of Facies Discontinuities Using
Drilling Data, Pet. Sci. Technol., 29, 569–578, https://doi.org/10.1080/10916460903419206, 2011.
Barley, M. E., Brown, S. J. A., Krapez, B., and Kositcin, N.: Physical
volcanology and geochemistry of a Late Archaean volcanic arc: Kurnalpi and
Gindalbie Terranes, Eastern Goldfields Superterrane, Western Australia,
Precambrian Res., 161, 53–76, https://doi.org/10.1016/j.precamres.2007.06.019, 2008.
Chace, F. M.: Abbreviations in field and mine geological mapping, Econ. Geol., 51, 712–723, 1956.
Cockbain, A. E.: Regolith geology of the Yilgarn Craton – Introduction,
Aust. J. Earth Sci., 49, 1–1, https://doi.org/10.1046/j.1440-0952.2002.00913.x, 2002.
Cohen, A.: FuzzyWuzzy: Fuzzy string matching in python, ChairNerd Blog, available at: http://chairnerd.seatgeek.com/fuzzywuzzy-fuzzy-string-matching-in-python/ (last accessed: 29 September 2021), 2011.
Culshaw, M. G.: From concept towards reality: developing the attributed 3D
geological model of the shallow subsurface, Q. J. Eng.
Geol. Hydroge., 38, 231–284, https://doi.org/10.1144/1470-9236/04-072, 2005.
Eggleton, R. A.: The regolith glossary, Cooperative Centre for Landscape Evolution and Mineral Exploration, 8, 144 pp., National Capital Printing, Canberra, Australia, 2001.
Emelyanova, I., Pervukhina, M., Clennell, M., and Dyt, C.: Unsupervised identification of electrofacies employing machine learning, 79th EAGE Conference and Exhibition 2017, Paris, France, 12–16 June 2017, EAGE, 2017.
Erkan, G. and Radev, D. R.: LexRank: Graph-based lexical centrality as
salience in text summarization, J. Artif. Intell. Res.,
22, 457–479, https://doi.org/10.1613/jair.1523, 2004.
Fuentes, I., Padarian, J., Iwanaga, T., and Vervoort, R. W.: 3D lithological
mapping of borehole descriptions using word embeddings, Comput.
Geosci., 141, 104516, https://doi.org/10.1016/j.cageo.2020.104516, 2020.
Gillespie, M. and Styles, M.: BGS rock classification scheme, Volume 1, Classification of igneous rocks, Keyworth, Nottingham, British Geological Survey, 31 pp., RR/99/006, 1999.
Gonzalez, J., Rodrigues, P., and Cohen, A.: Fuzzywuzzy: Fuzzy string matching in python, available at: https://github.com/seatgeek/fuzzywuzzy (last access: 30 September 2021), 2017.
Gregorio, F. D. and Varrazzo, D.: psycopg2, available at: http://initd.org/psycopg/docs/ (Stand: 17 June 2021) (last access: 30 September 2021), 2018.
Griffin, R. E.: When are Old Data New Data?, GeoResJ, 6, 92–97,
https://doi.org/10.1016/j.grj.2015.02.004, 2015.
Groves, D. I., Goldfarb, R. J., Knox-Robinson, C. M., Ojala, J., Gardoll,
S., Yun, G. Y., and Holyland, P.: Late-kinematic timing of orogenic gold
deposits and significance for computer-based exploration techniques with
emphasis on the Yilgarn Block, Western Australia, Ore Geol. Rev., 17,
1–38, https://doi.org/10.1016/S0169-1368(00)00002-0, 2000.
Hall, M. and Keppie, F.: Striplog: new open source software
for handling and analysing discontinuous and qualitative data, oral paper at
the Atlantic Geoscience Society Colloquium, Truro, Nova Scotia, Canada, 5–6 February 2016.
Hallsworth, C. R. and Knox, R.: BGS rock classification scheme, Volume 3, classification of sediments and sedimentary rocks, British Geological Survey, Keyworth, Nottingham, 24 pp., RR/99/003, 1999.
Higgins, R. F. and Mehta, S.: SeatGeek, Harvard Business School Case 819-013, October 2018.
Hill, E. J., Robertson, J., and Uvarova, Y.: Multiscale hierarchical
domaining and compression of drill hole data, Comput. Geosci.,
79, 47–57, https://doi.org/10.1016/j.cageo.2015.03.005, 2015.
Hill, E. J., Pearce, M. A., and Stromberg, J. M.: Improving Automated
Geological Logging of Drill Holes by Incorporating Multiscale Spatial
Methods, Math. Geosci., 53, 21–53, https://doi.org/10.1007/s11004-020-09859-0, 2020.
Jallan, Y., Brogan, E., Ashuri, B., and Clevenger, C. M.: Application of
Natural Language Processing and Text Mining to Identify Patterns in
Construction-Defect Litigation Cases, Journal of Legal Affairs and Dispute
Resolution in Engineering and Construction, 11, 04519024, https://doi.org/10.1061/(Asce)La.1943-4170.0000308, 2019.
Joshi, R., Madaiah, K., and Jessell, M.: Loop3D/dh2loop: dh2loop first release (v1.0.0), Zenodo [data set], https://doi.org/10.5281/zenodo.4043568, 2020.
Kumari, S., Mohan, A., and Saberwal, G.: Hidden duplicates: 10s or 100s of
Indian trials, registered with ClinicalTrials.gov, have not been registered
in India, as required by law, PLoS One, 15, e0234925,
https://doi.org/10.1371/journal.pone.0234925, 2020.
Lark, R. M., Thorpe, S., Kessler, H., and Mathers, S. J.: Interpretative modelling of a geological cross section from boreholes: sources of uncertainty and their quantification, Solid Earth, 5, 1189–1203, https://doi.org/10.5194/se-5-1189-2014, 2014.
Le Vaillant, M., Hill, J., and Barnes, S. J.: Simplifying drill-hole domains
for 3D geochemical modelling: An example from the Kevitsa Ni-Cu-(PGE)
deposit, Ore Geol. Rev., 90, 388–398, https://doi.org/10.1016/j.oregeorev.2017.05.020,
2017.
Levenshtein, V.: Leveinshtein distance, available at: http://en.wikipedia.org/wiki/Levenshtein_distance (last access: 30 September 2021), 1965.
Lin, C.-Y. and Hovy, E.: Automatic evaluation of summaries using N-gram co-occurrence statistics, in: Proceedings of the 2003 Human
Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, Edmonton, Canada, May–June 2003, 71–78, https://doi.org/10.3115/1073445.1073465, 2003.
Lindsay, M.: Geological Interpretation of Geophysics Support from
Machine Learning of Drillcore, in: Geological Survey of Western Australia Open Day 2019, Perth, Australia, available at: https://geodocs.dmirs.wa.gov.au/Web/documentlist/16/DirectLink/GeologicalInterpretationGS (last access: 30 September 2021), 2019.
Liu, K., Hogan, W. R., and Crowley, R. S.: Natural Language Processing
methods and systems for biomedical ontology learning, J. Biomed. Inform., 44,
163–179, https://doi.org/10.1016/j.jbi.2010.07.006, 2011.
Liu, T. and Guo, J.: Text similarity computing based on standard deviation, in: Advances in Intelligent Computing, edited by: Huang, D. S., Zhang, X. P., and Huang, G. B.,
International Conference on Intelligent Computing 2005, Lecture Notes in Computer Science, vol. 3644, 456–464, https://doi.org/10.1007/11538059_48, 2005.
Loper, E. and Bird, S.: NLTK: the natural language toolkit, arXiv [preprint], arXiv:cs/0205028, 17 May 2002.
Martínez-Vargas, A.: PyGSLIB [online], Python 2.7 package,
Opengeostat Consulting, available at:
https://github.com/opengeostat/pygslib (last access: 30 September 2021), 2016.
McKinney, W.: pandas: a foundational Python library for data analysis
and statistics, Python for High Performance and Scientific Computing, 14, 1–9, 2011.
McMillan, A. and Powell, J.: British Geological Survey Rock Classification Scheme,
Volume 4: The Classification of Artificial (man made) Ground and Natural Superficial Deposits: Applications to Geological Maps and Datasets in the UK, British Geolgoical Survey Research Report, RR 99-04, 1999.
Miles, A. and Bechhofer, S.: SKOS simple knowledge organization
system reference, W3C recommendation, 18, World Wide Web Consortium, available at: http://www.w3.org/TR/skos-reference/ (last access: 30 September 2021), 2009.
Müller, T., Cotterell, R., Fraser, A., and Schütze, H.: Joint lemmatization
and morphological tagging with lemming, in: Proceedings of the 2015 Conference on Empirical Methods in Natural Language
Processing, Association for Computational Linguistics, Lisbon, Portugal, 2268–2274, https://doi.org/10.18653/v1/D15-1272, 2015.
Myers, J.: Precambrian Tectonic History of the West Australian Craton and
Adjacent Orogens, Annu. Rev. Earth Pl. Sc., 21, 453–485,
https://doi.org/10.1146/annurev.ea.21.050193.002321, 1993.
Okuda, T., Tanaka, E., and Kasai, T.: A method for the correction of garbled
words based on the Levenshtein metric, IEEE T. Comput., 100,
172–178, 1976.
Otter, D. W., Medina, J. R., and Kalita, J. K.: A Survey of the Usages of
Deep Learning for Natural Language Processing, IEEE T. Neural Net. Lear., 32, 604–624, https://doi.org/10.1109/TNNLS.2020.2979670, 2020.
Padarian, J. and Fuentes, I.: Word embeddings for application in geosciences: development, evaluation, and examples of soil-related concepts, SOIL, 5, 177–187, https://doi.org/10.5194/soil-5-177-2019, 2019.
Park, S. H., Ryu, K. H., and Gilbert, D.: Fast similarity search for protein
3D structures using topological pattern matching based on spatial relations,
Int. J. Neural. Syst., 15, 287–296, https://doi.org/10.1142/S0129065705000244, 2005.
Qiu, Q., Xie, Z., Wu, L., and Tao, L.: Dictionary-Based Automated
Information Extraction From Geological Documents Using a Deep Learning
Algorithm, Earth and Space Science, 7, e2019EA000993, https://doi.org/10.1029/2019ea000993, 2020.
Ralph, J..: Mindat.org, available at: https://www.mindat.org/min-50468.html, last access:
30 September 2021.
Raymond, O., Duclaux, G., Boisvert, E., Cipolloni, C., Cox, S., Laxton, J., Letourneau, F., Richard, S., Ritchie, A., and Sen, M.:
GeoSciML v3.0 – a significant upgrade of the CGI-IUGS geoscience data model, EGU General Assembly 2012, Vienna, Austria, 22–27 April, 2012, 2711, available at: https://ui.adsabs.harvard.edu/abs/2012EGUGA..14.2711R (last access: 30 September 2021), 2012.
Recasens, M., Danescu-Niculescu-Mizil, C., and Jurafsky, D.: Linguistic
models for analyzing and detecting biased language, in: Proceedings
of the 51st Annual Meeting of the Association for Computational
Linguistics, Sofia, Bulgaria, August 2013, Volume 1: Long Papers, 1650–1659, available at: https://aclanthology.org/P13-1162.pdf (last access: 30 September 2021),
2013.
Richard, S. M. and CGI Interoperability Working Group: GeoSciML – a GML application for geoscience information interchange, Digital Mapping Techniques 2007, Workshop Proceedings: U.S. Geological Survey, 1285, 47–59, 2007.
Riganti, A., Farrell, T. R., Ellis, M. J., Irimies, F., Strickland, C. D.,
Martin, S. K., and Wallace, D. J.: 125 years of legacy data at the
Geological Survey of Western Australia: Capture and delivery, GeoResJ, 6,
175–194, https://doi.org/10.1016/j.grj.2015.02.015, 2015.
Rivera-Quiroz, F. A. and Miller, J.: Extracting Data from Legacy
Taxonomic Literature: Applications for planning field work, Biodiversity Information Science and Standards, 3, e37082, https://doi.org/10.3897/biss.3.37082, 2019.
Robertson, S.: BGS rock classification scheme, Volume 2, Classification
of metamorphic rocks, British Geological Survey Research Report, RR 99–02, 24 pp., 1999.
Rosenbaum, M. S., McMillan, A. A., Powell, J. H., Cooper, A. H., Culshaw, M.
G., and Northmore, K. J.: Classification of artificial (man-made) ground,
Eng. Geol., 69, 399–409, https://doi.org/10.1016/S0013-7952(02)00282-X, 2003.
Ross, P. S., Bourke, A., and Fresia, B.: A multi-sensor logger for rock
cores: Methodology and preliminary results from the Matagami mining camp,
Canada, Ore Geol. Rev., 53, 93–111, https://doi.org/10.1016/j.oregeorev.2013.01.002,
2013.
Rothwell, R. G. and Rack, F. R.: New techniques in sediment core analysis:
an introduction, New Techniques in Sediment Core Analysis, The Geological Society of London, 267, 1–29, https://doi.org/10.1144/Gsl.Sp.2006.267.01.01, 2006.
Schetselaar, E. M. and Lemieux, D.: A drill hole query algorithm for
extracting lithostratigraphic contacts in support of 3D geologic modelling
in crystalline basement, Comput. Geosci., 44, 146–155,
https://doi.org/10.1016/j.cageo.2011.10.015, 2012.
Simons, B., Boisvert, E., Brodaric, B., Cox, S., Duffy, T. R., Johnson,B. R., Laxton, J. L., and Richard, S.: GeoSciML: enabling the exchange of geological map data, ASEG Extended Abstracts,
2006, 1–4, https://doi.org/10.1071/ASEG2006ab162, 2006.
Smith, M. J., Keesstra, S., and Rose, J.: Use of legacy data in
geomorphological research, GeoResJ, 6, 74–80, https://doi.org/10.1016/j.grj.2015.02.008,
2015.
Vearncombe, J., Conner, G., and Bright, S.: Value from legacy data, T. I. Min.
Metall. B, 125, 231–246, https://doi.org/10.1080/03717453.2016.1190442, 2016.
Vearncombe, J., Riganti, A., Isles, D., and Bright, S.: Data upcycling, Ore
Geol. Rev., 89, 887–893, https://doi.org/10.1016/j.oregeorev.2017.07.009, 2017.
Wang, C. and Ma, X.: Text Mining to Facilitate Domain Knowledge
Discovery, in: Text Mining-Analysis, Programming and Application,
IntechOpen, https://doi.org/10.5772/intechopen.85362, 2019.
Wilbur, W. J. and Sirotkin, K.: The Automatic Identification of Stop Words,
J. Inf. Sci., 18, 45–55, https://doi.org/10.1177/016555159201800106,
1992.
Zhou, C. Y., Ouyang, J. W., Ming, W. H., Zhang, G. H., Du, Z. C., and Liu,
Z.: A Stratigraphic Prediction Method Based on Machine Learning, Appl.
Sci.-Basel, 9, 3553, https://doi.org/10.3390/app9173553, 2019.
Zhou, Q., Liu, H. H., Bodvarsson, G. S., and Oldenburg, C. M.: Flow and
transport in unsaturated fractured rock: effects of multiscale heterogeneity
of hydrogeologic properties, J. Contam. Hydrol., 60, 1–30, https://doi.org/10.1016/s0169-7722(02)00080-3, 2003.
Zhu, G., Gao, M., Kong, F., and Li, K.: Application of Logging While
Drilling Tool in Formation Boundary Detection and Geo-steering, Sensors
(Basel), 19, 2754, https://doi.org/10.3390/s19122754, 2019.
Short summary
We have developed a software that allows the user to extract and standardize drill hole information from legacy datasets and/or different drilling campaigns. It also provides functionality to upscale the lithological information. These functionalities were possible by developing thesauri to identify and group geological terminologies together.
We have developed a software that allows the user to extract and standardize drill hole...