HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Automatic extraction of materials and properties from superconductors scientific literature

Luca Foppiano Pedro Baptista de Castro Pedro Ortiz Suarez Kensei Terashima Yoshihiko Takano Masashi Ishii

Automatic extraction of materials and properties from superconductors scientific literature

Abstract

The automatic extraction of materials and related properties from the scientific literature is gaining attention in data-driven materials science (Materials Informatics). In this paper, we discuss Grobid-superconductors, our solution for automatically extracting superconductor material names and respective properties from text. Built as a Grobid module, it combines machine learning and heuristic approaches in a multi-step architecture that supports input data as raw text or PDF documents. Using Grobid-superconductors, we built SuperCon2, a database of 40324 materials and properties records from 37700 papers. The material (or sample) information is represented by name, chemical formula, and material class, and is characterized by shape, doping, substitution variables for components, and substrate as adjoined information. The properties include the Tc superconducting critical temperature and, when available, applied pressure with the Tc measurement method.

Benchmarks

BenchmarkMethodologyMetrics
ner-on-supermatsuperconductors-Scibert
F1: 77.03
Precision: 73.69
Recall: 80.69

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Automatic extraction of materials and properties from superconductors scientific literature | Papers | HyperAI