8 months ago

Haoning Wu ♦♡ 1 Zicheng Zhang ♪ 2 Weixia Zhang 2 Chaofeng Chen 1 Liang Liao 1 Chunyi Li 2 Yixuan Gao 1,2 Annan Wang 1 Erli Zhang 1 Wenxiu Sun 3

Abstract

The explosion of visual content available online underscores the requirement for an accurate machine assessor to robustly evaluate scores across diverse types of visual contents. While recent studies have demonstrated the exceptional potentials of large multi-modality models (LMMs) on a wide range of related fields, in this work, we explore how to teach them for visual rating aligned with human opinions. Observing that human raters only learn and judge discrete text-defined levels in subjective studies, we propose to emulate this subjective process and teach LMMs with text-defined rating levels instead of scores. The proposed Q-Align achieves state-of-the-art performance on image quality assessment (IQA), image aesthetic assessment (IAA), as well as video quality assessment (VQA) tasks under the original LMM structure. With the syllabus, we further unify the three tasks into one model, termed the OneAlign. In our experiments, we demonstrate the advantage of the discrete-level-based syllabus over direct-score-based variants for LMMs. Our code and the pre-trained weights are released at https://github.com/Q-Future/Q-Align.
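
To make the idea of training on text-defined levels instead of scores concrete, here is a minimal sketch of how a numeric opinion score might be turned into a text training target. It is not taken from the released code. The five level names, the equal-width binning, the prompt wording, and the function names `score_to_level` and `make_training_pair` are all assumptions for illustration; the abstract does not specify any of them.

```python
# Hypothetical level names, ordered from worst to best.
# The abstract does not list the levels; these are assumed for illustration.
LEVELS = ["bad", "poor", "fair", "good", "excellent"]


def score_to_level(score, lo, hi, levels=LEVELS):
    """Map a numeric score in [lo, hi] to one of len(levels) equal-width bins."""
    if hi <= lo:
        raise ValueError("hi must be greater than lo")
    if not lo <= score <= hi:
        raise ValueError("score is outside the range [lo, hi]")
    width = (hi - lo) / len(levels)
    # Clamp so that score == hi falls in the top bin rather than past it.
    index = min(int((score - lo) / width), len(levels) - 1)
    return levels[index]


def make_training_pair(score, lo, hi, task="quality"):
    """Build an assumed prompt/response pair whose target is a level word."""
    level = score_to_level(score, lo, hi)
    return {
        "prompt": f"How would you rate the {task} of this image?",
        "response": f"The {task} of the image is {level}.",
    }
```

Under these assumptions, `score_to_level(3.0, 1, 5)` returns `"fair"`, so the model would be trained to produce a level word rather than to regress the number 3.0.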

Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels
