OneStopEnglish corpus: A new corpus for automatic readability assessment and text simplification

Ivana Lučić, Sowmya Vajjala

Abstract

This paper describes the collection and compilation of the OneStopEnglish corpus of texts written at three reading levels, and demonstrates its usefulness through two applications: automatic readability assessment and automatic text simplification. The corpus consists of 189 texts, each in three versions (567 in total). The corpus is now freely available under a CC BY-SA 4.0 license, and we hope that it will foster further research on readability assessment and text simplification.

Benchmarks

Benchmark: text-classification-on-onestopenglish
Methodology: SMO (Sequential Minimal Optimization)
Metrics: Accuracy (5-fold) = 0.781
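
To make the "Accuracy (5-fold)" metric concrete, here is a minimal sketch of how a 5-fold cross-validated accuracy could be computed over 567 labeled documents (189 texts in three versions). It is not the authors' evaluation code. The level names, the stratified fold assignment, and the majority-class placeholder classifier are all assumptions for illustration; the listing does not say how the folds were drawn or which features the SMO classifier used.

```python
import random
from collections import Counter, defaultdict

# Assumed names for the three reading levels (not stated in the listing above).
LEVELS = ("elementary", "intermediate", "advanced")


def stratified_folds(labels, k=5, seed=0):
    """Assign item indices to k folds so each label is spread evenly across folds."""
    rng = random.Random(seed)
    by_label = defaultdict(list)
    for idx, label in enumerate(labels):
        by_label[label].append(idx)
    folds = [[] for _ in range(k)]
    for indices in by_label.values():
        rng.shuffle(indices)
        for pos, idx in enumerate(indices):
            folds[pos % k].append(idx)
    return folds


def cross_val_accuracy(features, labels, train_fn, k=5, seed=0):
    """Train on k-1 folds, score on the held-out fold, and average the k accuracies."""
    folds = stratified_folds(labels, k, seed)
    scores = []
    for held_out in folds:
        held = set(held_out)
        train_idx = [i for i in range(len(labels)) if i not in held]
        predict = train_fn(
            [features[i] for i in train_idx],
            [labels[i] for i in train_idx],
        )
        correct = sum(predict(features[i]) == labels[i] for i in held_out)
        scores.append(correct / len(held_out))
    return sum(scores) / k


def train_majority(train_features, train_labels):
    """Hypothetical placeholder classifier: always predict the most common training label."""
    majority = Counter(train_labels).most_common(1)[0][0]
    return lambda _features: majority


# 189 texts x 3 versions = 567 documents; features are dummies in this sketch.
labels = [level for _ in range(189) for level in LEVELS]
features = [None] * len(labels)
accuracy = cross_val_accuracy(features, labels, train_majority)
print(f"documents: {len(labels)}, 5-fold accuracy of majority baseline: {accuracy:.3f}")
```

With three equally sized classes, the placeholder baseline lands at chance level (about one third), which gives a rough reference point for the reported 0.781. A real replication would swap `train_majority` for an actual classifier trained on text features.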
