Detecting entities such as well names in unstructured text can be useful for many aspects of information discovery.
Lookup lists from corporate databases and regular expression pattern rules can be useful. They do have limitations though, it can be difficult to predict sometimes what may lie within thousands of old reports and documents.
Having a machine learning model tuned and trained on thousands of public domain examples may help and support existing digitalisation activities. This new capability was added to the GeoClassifier® algorithm recently.
The screenshot shows some examples in oil & gas, geothermal, hydrogeology, mining and carbon capture sectors.
Tested on several hundred UK License Reports gave 96% accuracy detecting 718 well names.
For more information: