Using text embeddings to display search results spatially.

Building a statistical vector-space model from your corpus of text documents has many advantages. Take a search query such as 'carbonatite'. Using text embeddings (vectors), we can display results not just on a map, but also by how 'similar' those locations are to the query (see the associated screenshot). This allows us to discover locations... Continue Reading →
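As a rough illustration of showing results by similarity as well as by position, here is a minimal sketch that turns scored search hits into GeoJSON points coloured by their similarity to the query. The site names, coordinates and scores are made up, and the helper names `similarity_to_color` and `to_geojson` are hypothetical; this is not the code behind the screenshot.

```python
import json


def similarity_to_color(score):
    """Map a similarity score in [0, 1] to a hex colour, blue (low) to red (high)."""
    clamped = max(0.0, min(1.0, score))
    red = round(255 * clamped)
    blue = 255 - red
    return f"#{red:02x}00{blue:02x}"


def to_geojson(results):
    """Build a GeoJSON FeatureCollection, most similar result first."""
    features = []
    for r in sorted(results, key=lambda r: r["similarity"], reverse=True):
        features.append({
            "type": "Feature",
            # GeoJSON orders coordinates as [longitude, latitude].
            "geometry": {"type": "Point", "coordinates": [r["lon"], r["lat"]]},
            "properties": {
                "name": r["name"],
                "similarity": r["similarity"],
                "color": similarity_to_color(r["similarity"]),
            },
        })
    return {"type": "FeatureCollection", "features": features}


# Hypothetical hits for a query; the similarity scores are assumed to have
# been computed already from the text embeddings.
results = [
    {"name": "Site A", "lat": 10.0, "lon": 20.0, "similarity": 0.35},
    {"name": "Site B", "lat": 11.5, "lon": 21.0, "similarity": 0.82},
    {"name": "Site C", "lat": 9.2, "lon": 19.4, "similarity": 0.10},
]

collection = to_geojson(results)
print(json.dumps(collection, indent=2))
```

Most web mapping libraries can read a FeatureCollection like this and style each marker from its `color` property, so the most query-like locations stand out at a glance.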

Geological sub-discipline query popularity in Google

Global search queries (Google) on geological sub-disciplines over the past 12 months. The graph shows relative popularity over time and the map shows which sub-discipline dominates in each country. The spike for engineering geology (yellow) on 6th February 2023 coincides with the magnitude 7.8 earthquake in Turkey. #geology #engineeringgeology #hydrogeology #volcanology #mininggeology #petroleumgeology

The rise of the vector database

I’ve been writing about the use of word vectors in geoscience since 2015, but recently some exciting developments have emerged. A vector is an array of numbers which can be used to represent words based on complex word co-occurrence. Taking the cosine similarity between vectors enables us to find... Continue Reading →
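To make the idea of cosine similarity concrete, here is a minimal sketch using only the Python standard library. The three-dimensional vectors are toy values invented for illustration (real word vectors typically have hundreds of dimensions), and `most_similar` is a hypothetical helper, not the API of any particular vector database.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # convention chosen here: a zero vector is similar to nothing
    return dot / (norm_a * norm_b)


# Toy word vectors, made up for this example.
vectors = {
    "carbonatite": [0.9, 0.1, 0.3],
    "kimberlite": [0.8, 0.2, 0.4],
    "aquifer": [0.1, 0.9, 0.2],
}


def most_similar(word, vectors):
    """Rank every other word by cosine similarity to `word`, highest first."""
    query = vectors[word]
    scores = [
        (other, cosine_similarity(query, vec))
        for other, vec in vectors.items()
        if other != word
    ]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


for other, score in most_similar("carbonatite", vectors):
    print(f"{other}: {score:.3f}")
```

A vector database does essentially this comparison at scale, usually with approximate nearest-neighbour indexes so that it does not have to score every stored vector.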

The grand challenges of geoscience

I created this blog exactly 8 years ago, in mid-2015. The aim was to share ideas, research, technologies and methods on text analytics, search and data management applied to geoscience. This would hopefully stimulate and accelerate the exploitation of geoscience information by practitioners for the benefit of industry and society. It has gone from... Continue Reading →

Using Natural Language Processing to detect historical flooding events and risk reduction projects from newspapers

I came across this fascinating open access research paper by Lai et al. (2022). It uses NLP to extract street flooding events (green) and risk reduction projects (red) from hundreds of thousands of newspaper articles in the United States. By viewing this data spatially, gaps in governmental strategies could be identified. I found this passage of... Continue Reading →
