HyperAIHyperAI

Command Palette

Search for a command to run...

MASSW Scientific Workflow Dataset

Date

a year ago

Size

998.33 MB

Organization

Publish URL

github.com

Paper URL

arxiv.org

The MASSW (Multi-Aspect Summarization of Scientific Workflows) dataset is a comprehensive text dataset focusing on summarizing various aspects of scientific workflows. It was jointly released in 2024 by researchers from the University of Michigan, Ann Arbor, Purdue University, and LG AI Research Institute.MASSW: A New Dataset and Benchmark Tasks for AI-Assisted Scientific Workflows".

MASSW contains more than 152k peer-reviewed publications from 17 top computer science conferences, covering a time span of the past 50 years. The core feature of this dataset is that it defines 5 key aspects of the scientific workflow: context, key ideas, methods, results, and expected impact. These aspects are used to extract and structure information from each publication to generate a structured summary. This process not only improves the accessibility of information, but also facilitates various downstream tasks and analyses.

MASSW.torrent
Seeding 2Downloading 0Completed 128Total Downloads 200
  • MASSW/
    • README.md
      1.69 KB
    • README.txt
      3.39 KB
      • data/
          • MASSW/
            • massw_metadata_v1.jsonl
              854.73 MB
            • massw_v1.tsv
              998.33 MB

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
MASSW Scientific Workflow Dataset | Datasets | HyperAI