HyperAI

Main

GPU

Console
Studio
Docs
Pricing

Pulse

News

Resources

Papers
Notebooks
Datasets
Wiki

Benchmarks

SOTA
LLM Models
GPU Leaderboard

Community

Events

Utility

About Terms of Service Privacy Policy
English

Command Palette

Search for a command to run...

HyperAI
Papers

Papers

Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

Build the Future of Artificial Intelligence

About

About Us Support Dataset Help

Products

News Papers Notebooks Datasets Wiki

Links

© HyperAI

GitHub Discord X (formerly Twitter)

HyperAI

Main

GPU

Console
Studio
Docs
Pricing

Pulse

News

Resources

Papers
Notebooks
Datasets
Wiki

Benchmarks

SOTA
LLM Models
GPU Leaderboard

Community

Events

Utility

About Terms of Service Privacy Policy
English

Command Palette

Search for a command to run...

HyperAI
Papers

Papers

Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

Build the Future of Artificial Intelligence

About

About Us Support Dataset Help

Products

News Papers Notebooks Datasets Wiki

Links

© HyperAI

GitHub Discord X (formerly Twitter)

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid
Vision Tokenizer

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

Yanghao Li, Rui Qian, Bowen Pan, et al.

Oyster-I: Beyond Refusal - Constructive Safety Alignment for Responsible Language Models

Oyster-I: Beyond Refusal - Constructive Safety Alignment for Responsible Language Models

Supervised Fine-Tuning

Ranjie Duan, Jiexi Liu, Xiaojun Jia, et al.

Compute as Teacher: Turning Inference Compute Into Reference-Free Supervision

Dulhan Jayalath, Shashwat Goel, Thomas Foster, et al.

RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation

Code Generation

Jane Luo, Xin Zhang, Steven Liu, et al.

Synthetic bootstrapped pretraining

Zitong Yang, Aonan Zhang, Hong Liu, et al.

Skilful global seasonal predictions from a machine learning weather model trained on reanalysis data

Chris Kent, Adam A. Scaife, Nick J. Dunstone, et al.

FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial
Search and Reasoning

Liang Hu, Jianpeng Jiao, Jiashuo Liu, et al.

Understand Before You Generate: Self-Guided Training for Autoregressive
Image Generation

Image Generation

Image Understanding

Xiaoyu Yue, Zidong Wang, Yuqing Wang, et al.

Evolving Language Models without Labels: Majority Drives Selection,
Novelty Promotes Variation

Reinforcement Learning

Yujun Zhou, Zhenwen Liang, Haolin Liu, et al.

Reasoning over Boundaries: Enhancing Specification Alignment via
Test-time Delibration

Haoran Zhang, Yafu Li, Xuyang Hu, et al.

FlowRL: Matching Reward Distributions for LLM Reasoning

Reinforcement Learning

Xuekai Zhu, Daixuan Cheng, Dinghuai Zhang, et al.

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform
Data

Zhaoyang Liu, JingJing Xie, Zichen Ding, et al.

Are Large Pre-trained Vision Language Models Effective Construction Safety Inspectors?

Visual Question Answering

Image Captioning

Xuezheng Chen, Zhengbo Zou

HTSC-2025: A Benchmark Dataset of Ambient-Pressure High-Temperature Superconductors for AI-Driven Critical Temperature Prediction

Xiao-Qi Han, Ze-Feng Gao, Xin-De Wang, et al.

Discovery of Unstable Singularities

Yongji Wang, Mehdi Bennani, James Martens, et al.

VCBench: Benchmarking LLMs in Venture Capital

Rick Chen, Joseph Ternasky, Afriyie Samuel Kwesi, et al.

MedReseacher-R1: Expert-Level Medical Deep Researcher via A Knowledge-Informed Trajectory Synthesis Framework

Retrieval-Augmented Generation

Intelligent Question Answering

Ailing Yu, Lan Yao, Jingnan Liu, et al.

Scrub It Out! Erasing Sensitive Memorization in Code Language Models via
Machine Unlearning

Code Generation

Zhaoyang Chu, Yao Wan, Zhikun Zhang, et al.

PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era

Computer Vision

Image Understanding

Xu Zheng, Chenfei Liao, Ziqiao Weng, et al.

Hala Technical Report: Building Arabic-Centric Instruction & Translation
Models at Scale

Hasan Abed Al Kader Hammoud, Mohammad Zbeeb, Bernard Ghanem

DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning

Reinforcement Learning

Daya Guo, Dejian Yang, Haowei Zhang, et al.

Teaching LLMs to Plan: Logical Chain-of-Thought Instruction Tuning for Symbolic Planning

Supervised Fine-Tuning

Pulkit Verma, Ngoc La, Anthony Favier, et al.

OpenHA: A Series of Open-Source Hierarchical Agentic Models in Minecraft

Multi-Task Learning

Zihao Wang, Muyao Li, Kaichen He, et al.

BED-LLM: Intelligent Information Gathering with LLMs and Bayesian Experimental Design

Deepro Choudhury, Sinead Williamson, Adam Goliński, et al.

ReSum: Unlocking Long-Horizon Search Intelligence via Context
Summarization

Xixi Wu, Kuan Li, Yida Zhao, et al.

WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon
Agents

Zile Qiao, Guoxin Chen, Xuanzhong Chen, et al.

Towards General Agentic Intelligence via Environment Scaling

Runnan Fang, Shihao Cai, Baixuan Li, et al.

WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic
Data and Scalable Reinforcement Learning

Reinforcement Learning

Kuan Li, Zhongwang Zhang, Huifeng Yin, et al.

Scaling Agents via Continual Pre-training

Liangcai Su, Zhen Zhang, Guangyu Li, et al.

WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for
Open-Ended Deep Research

Retrieval-Augmented Generation

Zijian Li, Xin Guan, Bo Zhang, et al.

Glitch Tokens in Large Language Models: Categorization Taxonomy and Effective Detection

Yuxi Li, Yi Liu, Gelei Deng, et al.

REFRAG: Rethinking RAG based Decoding

Retrieval-Augmented Generation

Xiaoqiang Lin, Aritra Ghosh, Bryan Kian Hsiang Low, et al.

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid
Vision Tokenizer

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

Yanghao Li, Rui Qian, Bowen Pan, et al.

Oyster-I: Beyond Refusal - Constructive Safety Alignment for Responsible Language Models

Oyster-I: Beyond Refusal - Constructive Safety Alignment for Responsible Language Models

Supervised Fine-Tuning

Ranjie Duan, Jiexi Liu, Xiaojun Jia, et al.

Compute as Teacher: Turning Inference Compute Into Reference-Free Supervision

Dulhan Jayalath, Shashwat Goel, Thomas Foster, et al.

RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation

Code Generation

Jane Luo, Xin Zhang, Steven Liu, et al.

Synthetic bootstrapped pretraining

Zitong Yang, Aonan Zhang, Hong Liu, et al.

Skilful global seasonal predictions from a machine learning weather model trained on reanalysis data

Chris Kent, Adam A. Scaife, Nick J. Dunstone, et al.

FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial
Search and Reasoning

Liang Hu, Jianpeng Jiao, Jiashuo Liu, et al.

Understand Before You Generate: Self-Guided Training for Autoregressive
Image Generation

Image Generation

Image Understanding

Xiaoyu Yue, Zidong Wang, Yuqing Wang, et al.

Evolving Language Models without Labels: Majority Drives Selection,
Novelty Promotes Variation

Reinforcement Learning

Yujun Zhou, Zhenwen Liang, Haolin Liu, et al.

Reasoning over Boundaries: Enhancing Specification Alignment via
Test-time Delibration

Haoran Zhang, Yafu Li, Xuyang Hu, et al.

FlowRL: Matching Reward Distributions for LLM Reasoning

Reinforcement Learning

Xuekai Zhu, Daixuan Cheng, Dinghuai Zhang, et al.

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform
Data

Zhaoyang Liu, JingJing Xie, Zichen Ding, et al.

Are Large Pre-trained Vision Language Models Effective Construction Safety Inspectors?

Visual Question Answering

Image Captioning

Xuezheng Chen, Zhengbo Zou

HTSC-2025: A Benchmark Dataset of Ambient-Pressure High-Temperature Superconductors for AI-Driven Critical Temperature Prediction

Xiao-Qi Han, Ze-Feng Gao, Xin-De Wang, et al.

Discovery of Unstable Singularities

Yongji Wang, Mehdi Bennani, James Martens, et al.

VCBench: Benchmarking LLMs in Venture Capital

Rick Chen, Joseph Ternasky, Afriyie Samuel Kwesi, et al.

MedReseacher-R1: Expert-Level Medical Deep Researcher via A Knowledge-Informed Trajectory Synthesis Framework

Retrieval-Augmented Generation

Intelligent Question Answering

Ailing Yu, Lan Yao, Jingnan Liu, et al.

Scrub It Out! Erasing Sensitive Memorization in Code Language Models via
Machine Unlearning

Code Generation

Zhaoyang Chu, Yao Wan, Zhikun Zhang, et al.

PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era

Computer Vision

Image Understanding

Xu Zheng, Chenfei Liao, Ziqiao Weng, et al.

Hala Technical Report: Building Arabic-Centric Instruction & Translation
Models at Scale

Hasan Abed Al Kader Hammoud, Mohammad Zbeeb, Bernard Ghanem

DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning

Reinforcement Learning

Daya Guo, Dejian Yang, Haowei Zhang, et al.

Teaching LLMs to Plan: Logical Chain-of-Thought Instruction Tuning for Symbolic Planning

Supervised Fine-Tuning

Pulkit Verma, Ngoc La, Anthony Favier, et al.

OpenHA: A Series of Open-Source Hierarchical Agentic Models in Minecraft

Multi-Task Learning

Zihao Wang, Muyao Li, Kaichen He, et al.

BED-LLM: Intelligent Information Gathering with LLMs and Bayesian Experimental Design

Deepro Choudhury, Sinead Williamson, Adam Goliński, et al.

ReSum: Unlocking Long-Horizon Search Intelligence via Context
Summarization

Xixi Wu, Kuan Li, Yida Zhao, et al.

WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon
Agents

Zile Qiao, Guoxin Chen, Xuanzhong Chen, et al.

Towards General Agentic Intelligence via Environment Scaling

Runnan Fang, Shihao Cai, Baixuan Li, et al.

WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic
Data and Scalable Reinforcement Learning

Reinforcement Learning

Kuan Li, Zhongwang Zhang, Huifeng Yin, et al.

Scaling Agents via Continual Pre-training

Liangcai Su, Zhen Zhang, Guangyu Li, et al.

WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for
Open-Ended Deep Research

Retrieval-Augmented Generation

Zijian Li, Xin Guan, Bo Zhang, et al.

Glitch Tokens in Large Language Models: Categorization Taxonomy and Effective Detection

Yuxi Li, Yi Liu, Gelei Deng, et al.

REFRAG: Rethinking RAG based Decoding

Retrieval-Augmented Generation

Xiaoqiang Lin, Aritra Ghosh, Bryan Kian Hsiang Low, et al.

Compute as Teacher: Turning Inference Compute Into Reference-Free Supervision

RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation

Synthetic bootstrapped pretraining

Skilful global seasonal predictions from a machine learning weather model trained on reanalysis data

FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial Search and Reasoning

Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation

Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation

Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Delibration

FlowRL: Matching Reward Distributions for LLM Reasoning

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

Are Large Pre-trained Vision Language Models Effective Construction Safety Inspectors?

HTSC-2025: A Benchmark Dataset of Ambient-Pressure High-Temperature Superconductors for AI-Driven Critical Temperature Prediction

Discovery of Unstable Singularities

VCBench: Benchmarking LLMs in Venture Capital

MedReseacher-R1: Expert-Level Medical Deep Researcher via A Knowledge-Informed Trajectory Synthesis Framework

Scrub It Out! Erasing Sensitive Memorization in Code Language Models via Machine Unlearning

PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era

Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale

DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning

Teaching LLMs to Plan: Logical Chain-of-Thought Instruction Tuning for Symbolic Planning

OpenHA: A Series of Open-Source Hierarchical Agentic Models in Minecraft

BED-LLM: Intelligent Information Gathering with LLMs and Bayesian Experimental Design

ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization

WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents

Towards General Agentic Intelligence via Environment Scaling

WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning

Scaling Agents via Continual Pre-training

WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research

Glitch Tokens in Large Language Models: Categorization Taxonomy and Effective Detection

REFRAG: Rethinking RAG based Decoding

Compute as Teacher: Turning Inference Compute Into Reference-Free Supervision

RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation

Synthetic bootstrapped pretraining

Skilful global seasonal predictions from a machine learning weather model trained on reanalysis data

FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial Search and Reasoning

Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation

Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation

Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Delibration

FlowRL: Matching Reward Distributions for LLM Reasoning

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

Are Large Pre-trained Vision Language Models Effective Construction Safety Inspectors?

HTSC-2025: A Benchmark Dataset of Ambient-Pressure High-Temperature Superconductors for AI-Driven Critical Temperature Prediction

Discovery of Unstable Singularities

VCBench: Benchmarking LLMs in Venture Capital

MedReseacher-R1: Expert-Level Medical Deep Researcher via A Knowledge-Informed Trajectory Synthesis Framework

Scrub It Out! Erasing Sensitive Memorization in Code Language Models via Machine Unlearning

PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era

Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale

DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning

Teaching LLMs to Plan: Logical Chain-of-Thought Instruction Tuning for Symbolic Planning

OpenHA: A Series of Open-Source Hierarchical Agentic Models in Minecraft

BED-LLM: Intelligent Information Gathering with LLMs and Bayesian Experimental Design

ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization

WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents

Towards General Agentic Intelligence via Environment Scaling

WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning

Scaling Agents via Continual Pre-training

WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research

Glitch Tokens in Large Language Models: Categorization Taxonomy and Effective Detection

REFRAG: Rethinking RAG based Decoding