Topics mentioned with articles containing the word ‘geoscience’, past 10 years from The Guardian newspaper. Words are clustered automatically by their similarity (cosine) shown by the colour groupings. #geosciences #informationscience
Detecting mentions of fossils in text reports and papers without using a list of names.
The Python OpportunityFinder® algorithm from Infoscience Technologies can now automatically detect fossil names and their associated Lithostratigraphic Units and Geological Ages without a prior list of names. This can be useful because it is not always possible to predefine all the names and variations you are likely to come across in text. Furthermore, the way... Continue Reading →
Mining Geoscience Text from the Geological Survey of Queensland for hidden Geo-Resource evidence
Text mining algorithms were used to discover hidden geo-resource (metals, elements, minerals) associations in reports, maps, sketches and logs from the archives of the Geological Survey of Queensland in Australia. The Geological Survey of Queensland have made a number of excellent improvements recently increasing the accessibility of these data. A subset of report packages over... Continue Reading →
Using text analytics to detect new natural hydrogen plays in the subsurface.
In April 2021 I worked on the world’s first large scale text analytics project to detect natural hydrogen. Working with a natural resources exploration company, this was to detect both explicit and implicit evidence for natural hydrogen in legacy oil & gas documentation. This contributed to the development of a new hydrogen play. It was... Continue Reading →
Largest Petroleum Systems Taxonomy and NLP machine learning training sets in the industry
The OpportunityFinder® algorithm has now exceeded 50,000 terms in its lexicon for detecting petroleum systems automatically in text. This is combined with hundreds of thousands of labelled data for machine learning. These can support laser like tasks, improve search & discovery, insights, knowledge mining and also support the tuning of very large language models. http://www.infosciencetechnologies.com
Comparing patterns of potential source rocks in text by geological age to their contribution in actual producing oil and gas fields.
Some research I conducted recently comparing the counts of potential oil and gas source rock "mentions" by geological age in unstructured text, to some (rather old) actual data published in the literature on the age of hydrocarbons generated from source rocks in producing oil & gas fields. Over 48,000 terms from a lexicon were applied to... Continue Reading →
Mineral to Hydrocarbon Occurrence and HC Source Rock Association (From Text Frequency)
Figure 1 - Mineral Associations to HC Source Rock (x-axis) and HC Occurrence (y-axis) Natural Language Processing (NLP) and Machine Learning was applied to 16 million geoscience sentences using over 48,000 lexicon terms. Their frequency of association (in the same sentence) is shown in Fig 1. Some patterns confirm known associations, such as the association... Continue Reading →
Deriving Hydrocarbon to Metal / Mineral associations found in unstructured text for use as potential ore analogues and exploratory data analysis
Figure 1 - Association (Text) Frequency between chemical elements (by group) and hydrocarbon source rock (x-axis) and hydrocarbon occurrence (y-axis). Deriving Hydrocarbon to Metal / Mineral associations found in unstructured text for use as potential ore analogues and exploratory data analysis: Applying Machine Learning and Natural Language Processing (NLP) to 16 Million Geoscience Sentences. Hydrocarbon... Continue Reading →
Merycoidodon skull
Fascinated by this Merycoidodon (Oreodont) fossil skull. An extinct herbivore from the Oligocene (~25 Million Years Ago) with no known close living relatives today. Related to the camel but 'sheep-like', it would have likely resembled a pig in appearance, but with a longer body, at about 1.5 metres with short limbs and probably moved in herds... Continue Reading →
Text Analytics and Renewables Geothermal Projects
We are testing text analytics algorithms within a renewables geothermal project in collaboration with the British Geological Survey. BGS are investigating mine water in underground abandoned coal mines as a low carbon sustainable heat source for housing and manufacturing, and have several other potential use cases for knowledge extraction from their data archives to meet... Continue Reading →