Enjoyed presenting at BEOS in London today. Excellent presentations from Joseph Nicholson and Joe Court on digitalisation. Big thanks to Marcella Cilia for facilitating and everyone at BEOS for the excellent event. Busy room and really interesting questions on data and Artificial Intelligence.
Open Access Geological Data: Macrostrat Lithostratigraphy
More incredible open-access geological datasets. Macrostrat is the world's largest homogenised geological map database, over 2.5 Million geological polygons and over 50,000 stratigraphical names all open and accessible via an API to download. There is also a range of interactive tools, including a Digital Elevation Profile and mobile fieldbook for a smartphone.Global Big Data and... Continue Reading →
European Geosciences Union (EGU) – The Great Debate: Geological Large Language Models
Honoured to be invited to take part in this international debate on AI at the European Geosciences Union (EGU) General Assembly in Vienna during May.Large Language Models (LLMs) and other advanced AI tools are reshaping how researchers and practitioners approach data analysis, synthesis, and communication across Geosciences. This Great Debate at the EGU General Assembly... Continue Reading →
Opensource Geological Tools: Large Language Model AI Assistant for Mineral Resource Analysis and Visualisation
Opensource tool released: Run a Large Language Model driven AI solution locally on your machine for insights into mineral production, reserves, trends, prices, substitutes, and recycling resources.Mohanty et al (2025) have released, Opensource, the code and data from their research in Github. Link in the comments. It is designed to run with Llama-3-8B a lightweight... Continue Reading →
Opensource Geological Tools: Plate reconstruction
In terms of outreach and education there are some great open-source visualisations, data and tools for earth sciences. Earthviewer allows people to explore Earth's history in deep time, from 4.5 billion years ago to present day. This includes the location of major cities and where they were at various points of geological time to help... Continue Reading →
Open Access Geological Data: Global Lithology Map
There are incredible open-access datasets out there. For example the GLiM Global Lithological Map containing over 1.2 million polygons of rock types.From Hartmann and Moosdorf (2012) made interactive through a web viewer in 2017. Link in the comments.AbstractLithology describes the geochemical, mineralogical, and physical properties of rocks. It plays a key role in many processes... Continue Reading →
Spatialising word vectors
In areas of sparse data, patterns in text may be a helpful geoscience screening tool. One technique may be to build a text embedding model which allows you to compare the vectors of target geological concepts to location names.Disambiguation here is vitally important. The prototype example shown is for the vector of 'Monzonite' to vectors... Continue Reading →
Combining minerals and lithology text embeddings for data discovery
I've combined text embeddings generated from word co-occurrences within thousands of geological reports for both lithology and minerals in a 3D t-SME plot. Following on from some recent posts I made, it may be interesting to explore similarity (cosine vector similarity) between lithologies-lithologies, minerals-minerals and lithologies-minerals.This is a technique anyone can conduct on large volumes... Continue Reading →
Text Embeddings for Rock Classifications
I tested if we might differentiate rock types and their associations based on the patterns of words that occur around them in large archives of geological reports. Using a text embeddings model generated through the unsupervised machine learning from thousands of geological survey reports, approximately 2,000 rock type names were compared to each other. The... Continue Reading →
Text Embeddings for Mineral Association Discovery
Data driven discovery: It may be interesting to compare the similarities of minerals based on their co-occuring words in large amounts of archive geological reports, to actual known reported mineral occurrences in databases such as Mindat. One could perhaps easily automate this algorithmic comparison, leaving ranked "candidate" mineral associations not present in reference databases. There... Continue Reading →