Submitted as: development and technical paper 04 Feb 2021

Review status: a revised version of this preprint is currently under review for the journal GMD.

dh2loop 1.0: an open-source python library for automated processing and classification of geological logs

Ranee Joshi1,2, Kavitha Madaiah1,2, Mark Jessell1,2, Mark Lindsay1,2, and Guillaume Pirot1,2
  • 1Centre of Exploration Targeting, School of Earth Sciences, University of Western Australia, 35 Stirling Highway, Crawley 6009 Western Australia
  • 2Mineral Exploration Cooperative Research Centre (MinEx CRC), School of Earth Sciences, University of Western Australia, 35 Stirling Highway, Crawley 6009 Western Australia

Abstract. Exploration and mining companies rely on geological drill core logs for targeting and for initial information on the geology of an area, which they use to build models for prospectivity mapping or mine planning. A huge amount of legacy drilling data is available from geological surveys but cannot be used directly, as it is compiled and recorded in an unstructured textual form and in different formats depending on the database structure, company, logging geologist, investigation method, investigated materials and/or drilling campaign. It is subjective and plagued with uncertainty, as the logging is likely to have been conducted by tens to hundreds of geologists, all of whom would have their own personal biases. However, this is valuable information that adds value to geoscientific data for research and exploration, specifically in efficiently targeting sustainable new discoveries and providing better shallow subsurface constraints for 3D geological models.

dh2loop is an open-source Python library that provides the functionality to extract and standardize geological drill hole data and export it into readily importable interval tables (collar, survey, lithology). In this contribution, we extract, process and classify lithological logs from the Geological Survey of Western Australia Mineral Exploration Reports Database in the Yalgoo-Singleton Greenstone Belt (YSGB) region. For this case study, the extraction rates for collar, survey and lithology data are 93 %, 86 % and 34 %, respectively. The library also addresses the subjective nature and variability of nomenclature of lithological descriptions within and across different drilling campaigns by using thesauri and fuzzy string matching. Of the extracted lithology data, 86 % is successfully matched to lithologies in the thesauri. Since this process can be tedious, we also tested the string matching on the comments, which resulted in a matching rate of 16 % (7,870 successfully matched records out of 47,823 records). The standardized lithological data is then classified into multi-level groupings that can be used to systematically upscale and downscale drill hole data inputs for multiscale 3D geological modelling. dh2loop formats legacy data, bridging the gap between utilization and maximization of legacy drill hole data and the drill hole analysis functionalities available in existing Python libraries (lasio, welly, striplog).
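The thesaurus-based fuzzy matching and multi-level grouping described in the abstract can be illustrated with a minimal sketch. It is not the dh2loop implementation: it uses `difflib` from the Python standard library as a stand-in matcher, and the thesaurus entries, the hierarchy, the 0.8 threshold and the function names `match_lithology` and `classify` are all hypothetical values chosen for illustration.

```python
from difflib import SequenceMatcher

# Hypothetical mini-thesaurus: standardized lithology -> known synonyms or spellings.
THESAURUS = {
    "granite": ["granite", "granitic rock", "grnt"],
    "basalt": ["basalt", "basaltic rock", "bslt"],
    "banded iron formation": ["banded iron formation", "bif"],
}

# Hypothetical two-level grouping used to upscale a matched lithology.
HIERARCHY = {
    "granite": ("felsic intrusive", "igneous"),
    "basalt": ("mafic volcanic", "igneous"),
    "banded iron formation": ("chemical sedimentary", "sedimentary"),
}


def match_lithology(description, threshold=0.8):
    """Return (standard_name, score) for the best fuzzy match.

    standard_name is None when the best score is below the threshold.
    """
    text = description.lower().strip()
    best_name, best_score = None, 0.0
    for name, synonyms in THESAURUS.items():
        for synonym in synonyms:
            # Similarity ratio in [0, 1]; 1.0 means identical strings.
            score = SequenceMatcher(None, text, synonym).ratio()
            if score > best_score:
                best_name, best_score = name, score
    if best_score < threshold:
        return None, best_score
    return best_name, best_score


def classify(description, threshold=0.8):
    """Map a free-text description to its standardized name and groupings."""
    name, _ = match_lithology(description, threshold)
    if name is None:
        return None  # unmatched records would count against the matching rate
    group, supergroup = HIERARCHY[name]
    return {"lithology": name, "group": group, "supergroup": supergroup}


if __name__ == "__main__":
    for entry in ["Granit", "BIF", "basallt", "quartz vein"]:
        print(entry, "->", classify(entry))
```

In this sketch a misspelled entry such as "Granit" still resolves to the standardized term, while a description with no close thesaurus entry returns `None`. The share of records that return a match corresponds to the kind of matching rate reported above.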

Status: final response (author comments only)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor
  • RC1: 'Comment on gmd-2020-391', Anonymous Referee #1, 08 Mar 2021
    • AC1: 'Reply on RC1', Ranee Joshi, 18 Mar 2021
  • RC2: 'Comment on gmd-2020-391', Ignacio Fuentes, 10 Mar 2021
    • AC2: 'Reply on RC2', Ranee Joshi, 18 Mar 2021
  • RC3: 'Comment on gmd-2020-391', Matt Hall, 18 Mar 2021
    • AC3: 'Reply on RC3', Ranee Joshi, 31 Mar 2021
  • EC1: 'Invitation of revised submission', Andrew Wickert, 19 Mar 2021
    • AC4: 'Reply on EC1', Ranee Joshi, 31 Mar 2021
      • EC2: 'Reply on AC4', Andrew Wickert, 02 Apr 2021

Model code and software

Loop3D/dh2loop: dh2loop first release (Version v1.0.0) Ranee Joshi, Kavitha Madaiah, and Mark Jessell

Total article views: 761 (including HTML, PDF, and XML)
  • HTML: 550
  • PDF: 190
  • XML: 21
  • Total: 761
  • BibTeX: 4
  • EndNote: 5
Latest update: 15 Jun 2021
Short summary
We have developed software that allows the user to extract and standardize drill hole information from legacy datasets and/or different drilling campaigns. It also provides functionality to upscale the lithological information. These functionalities were made possible by developing thesauri to identify and group geological terms.