Predicting hydrocarbon plays from text using machine learning and natural language processing
I recently tested the OpportunityFinder Algorithm on a selection of public domain geoscience literature. Only literature published between 1990 and 2010 was used, some time before a major gas discovery was made in the area. The hypothesis was whether the algorithm could surface... Continue Reading →
The Cambrian Explosion (…of search technology)
Happy New Year everyone! A light-hearted look at enterprise search evolution to start the New Year. This is an extended summary; the full article will be published in "Search Insights" in March 2020, including the link to digital transformation. The Cambrian explosion occurred more than 500 million years ago, a rapid burst that diversified... Continue Reading →
Continued interest in search & analytics blog
It's not quite year end yet, but I was surprised to see visitors to my blog surge past last year's total, going past 8,000 this week. I was recently invited by 'The Search Network' to write my thoughts on the 'future of search' in the enterprise. It will be published in 'Search Insights' in early 2020. Search engines... Continue Reading →
Business Analytics Lecture
Enjoyed giving a 3-hour lecture and exercises on text analytics for postgrad students on the Business Analytics MSc at Robert Gordon University in Aberdeen today. Fascinating to see students who have majored in subjects as diverse as Finance, Art, Banking and Psychology developing dual skills, proactively realising how important programming and analytics... Continue Reading →
Google is making the biggest change in search for 5 years
Earlier this year I wrote about text embeddings / language models and some uses in oil & gas and geoscience (article here). Google is deploying its Bidirectional Encoder Representations from Transformers (BERT) algorithm - it has gone mainstream. It is reported this will improve 1 in 10 searches (Google blog post here). Language models and... Continue Reading →
The difference between geoscientists and engineers….using text analytics
It was Firth who first said a word’s meaning can be somewhat defined by the company it keeps – in other words, its word associations. This theory is behind high-dimensional vector spaces and many disambiguation techniques used to determine the ‘sense’ of a word or phrase when it can have many meanings. What is often not addressed... Continue Reading →
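As a rough illustration of the 'company it keeps' idea, here is a minimal Python sketch that builds co-occurrence count vectors and compares words by cosine similarity. It is not the method used in the post; the function names, the window size and the toy sentences are all hypothetical and made up for illustration.

```python
from collections import Counter, defaultdict
import math


def cooccurrence_vectors(sentences, window=2):
    """For each word, count the words seen within `window` tokens of it."""
    vectors = defaultdict(Counter)
    for sentence in sentences:
        tokens = sentence.lower().split()
        for i, word in enumerate(tokens):
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    vectors[word][tokens[j]] += 1
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse count vectors (Counters)."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical toy sentences, invented for this sketch only.
sentences = [
    "the source rock is mature for oil",
    "the source rock is immature for oil",
    "the mature field is in decline",
    "the old field is in decline",
]

vectors = cooccurrence_vectors(sentences)
# 'mature' keeps company with both the source-rock and the field contexts,
# which is exactly why a word like this needs disambiguating.
print(cosine(vectors["mature"], vectors["immature"]))
print(cosine(vectors["mature"], vectors["old"]))
```

Real systems typically use much larger corpora and dense embeddings rather than raw counts, but the underlying intuition is the same.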
Data Science and Business Analytics Lecture
Looking forward to giving the guest lecture & workshop at the end of November on the new MSc Business Analytics course at Robert Gordon University: https://www.rgu.ac.uk/study/courses/1177-pgcert-pgdip-msc-business-analytics
North Sea Cores of Source Rock, Reservoir and Seal
Received core from the North Sea this week. Great samples of source rock, reservoir and seal in a nice old Conoco core box! Text mining geoscience literature using machine learning and natural language processing for clues of new exploration plays, analogues and missed pay is enjoyable ... but you can't beat the real thing 🤓... Continue Reading →
Petroleum Data Management Lecture
Looking forward to giving a lecture on 29th October, 'Text Analytics in the Oil & Gas Industry', for the Petroleum Data Management Graduate Certificate. The online study course is run by Robert Gordon University in Aberdeen, Scotland, UK. More here.
The Hidden Codes in Geoscience Text
In Petroleum Geoscience, traces of hydrocarbons are referred to as a 'show' or 'shows'. Thousands of labelled example sentences were used to build a predictive machine classifier (based on word patterns). However, this did not work as well as detecting and disambiguating other concepts such as 'mature' and 'migration'. This was probably due to the... Continue Reading →