Finished labelling 25,000 petroleum geoscience sentences for machine learning

It has taken me six months of elapsed time, but I have finally finished manually labelling 25,000 (yes, twenty-five thousand!) petroleum geoscience sentences from global public domain sources.

I’m using these to experiment with training a machine learning classifier that, using deep context, can predict the topics of any passage of geoscience text the algorithm has not seen before.
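To make the idea concrete, here is a minimal sketch of training a topic classifier on labelled sentences and applying it to unseen text. It is only an illustration: it uses a simple bag-of-words Naive Bayes model as a stand-in rather than a deep-context model, and the six training sentences and the topic labels (`source_rock`, `reservoir`, `seal`) are made up for the example, not taken from the 25,000 labelled sentences.

```python
import math
from collections import Counter, defaultdict


def tokenize(sentence):
    """Lowercase and split on whitespace (deliberately simplistic)."""
    return sentence.lower().split()


class NaiveBayesTopicClassifier:
    """Multinomial Naive Bayes with add-one smoothing.

    A bag-of-words stand-in for illustration only; it ignores word order
    and wider context, which a deep-context model would exploit.
    """

    def __init__(self):
        self.word_counts = defaultdict(Counter)  # label -> word frequencies
        self.label_counts = Counter()            # label -> number of sentences
        self.vocab = set()

    def fit(self, sentences, labels):
        for sentence, label in zip(sentences, labels):
            tokens = tokenize(sentence)
            self.label_counts[label] += 1
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        return self

    def predict(self, sentence):
        tokens = tokenize(sentence)
        total = sum(self.label_counts.values())
        best_label, best_score = None, -math.inf
        for label, n in self.label_counts.items():
            # Log prior plus smoothed log likelihood of each token.
            score = math.log(n / total)
            denom = sum(self.word_counts[label].values()) + len(self.vocab)
            for token in tokens:
                score += math.log((self.word_counts[label][token] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical toy data: invented sentences and invented topic labels.
training = [
    ("organic rich shale reached the oil window", "source_rock"),
    ("the shale has high total organic carbon", "source_rock"),
    ("porous sandstone with good permeability", "reservoir"),
    ("the sandstone reservoir shows high porosity", "reservoir"),
    ("thick evaporite forms a regional seal", "seal"),
    ("mudstone seal overlies the trap", "seal"),
]
sentences, labels = zip(*training)
model = NaiveBayesTopicClassifier().fit(sentences, labels)

# Predict the topic of a sentence the model has not seen.
print(model.predict("fractured sandstone with moderate porosity"))
```

With a real labelled corpus the same fit-then-predict shape applies; only the model behind it would be swapped for something context-aware.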

I used a very specific methodology when labelling, which should allow a variety of novel use cases. One use case I’ll be testing is the potential to predict contexts that could lead to new plays and opportunities.
