HyperAI

Main

GPU

Console
Studio
Docs
Pricing

Pulse

News

Resources

Papers
Notebooks
Datasets
Wiki

Benchmarks

SOTA
LLM Models
GPU Leaderboard

Community

Events

Utility

About Terms of Service Privacy Policy
English

Command Palette

Search for a command to run...

HyperAI
Papers

Papers

Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

Build the Future of Artificial Intelligence

About

About Us Support Dataset Help

Products

News Papers Notebooks Datasets Wiki

Links

© HyperAI

GitHub Discord X (formerly Twitter)

HyperAI

Main

GPU

Console
Studio
Docs
Pricing

Pulse

News

Resources

Papers
Notebooks
Datasets
Wiki

Benchmarks

SOTA
LLM Models
GPU Leaderboard

Community

Events

Utility

About Terms of Service Privacy Policy
English

Command Palette

Search for a command to run...

HyperAI
Papers

Papers

Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

Build the Future of Artificial Intelligence

About

About Us Support Dataset Help

Products

News Papers Notebooks Datasets Wiki

Links

© HyperAI

GitHub Discord X (formerly Twitter)

Code2World: A GUI World Model via Renderable Code Generation

Code2World: A GUI World Model via Renderable Code Generation

Code Generation

Yuhao Zheng, Li'an Zhong, Yi Wang, et al.

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Shaobo Wang, Xuan Ouyang, Tianyi Xu, et al.

BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation

Yucheng Hu, Jianke Zhang, Yuanfei Luo, et al.

THINGS-data: A multimodal collection of large-scale datasets for investigating object representations in human brain and behavior

Object Recognition

Martin N Hebert, Oliver Contier, Lina Teichmann, et al.

Accurate Predictions of Novel Biomolecular Interactions with IsoDDE

Isomorphic Labs Team

SKILLRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Reinforcement Learning

Peng Xia, Jianwen Chen, Hanyang Wang, et al.

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

Diffusion Model

Tiwei Bie, Maosong Cao, Xiang Cao, et al.

Alleviating Sparse Rewards by Modeling Step-Wise and Long-Term Sampling Effects in Flow-Based GRPO

Diffusion Model

Image Generation

Yunze Tong, Mushui Liu, Canyu Zhao, et al.

Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning

Yalcin Tur, Jalal Naghiyev, Haoquan Fang, et al.

QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining

Jun Han, Shuo Zhang, Wei Li, et al.

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

Multimodal Representation

Xiaomin Yu, Yi Xin, Wenjie Zhang, et al.

MOVA: Towards Scalable and Synchronized Video-Audio Generation

Video Generation

SII-OpenMOSS Team, Donghua Yu, Mingshu Chen, et al.

MemoryLLM: Plug-n-Play Interpretable Feed-Forward Memory for Transformers

Ajay Jaiswal, Lauren Hannah, Han-Byul Kim, et al.

DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos

Video Understanding

Shenyuan Gao, William Liang, Kaiyuan Zheng, et al.

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

Reinforcement Learning

Daniil Plyusov, Alexey Gorbatovski, Boris Shaposhnikov, et al.

MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration

Lianhai Ren, Yucheng Ding, Xiao Liu, et al.

AudioSAE: Towards Understanding of Audio-Processing Models with Sparse AutoEncoders

Audio and Speech Processing

Georgii Aparin, Tasnima Sadekova, Alexey Rukhovich, et al.

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

Reinforcement Learning

Shumin Wang, Yuexiang Xie, Wenhao Zhang, et al.

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

Fangzhi Xu, Hang Yan, Qiushi Sun, et al.

Baichuan-M3: Modeling Clinical Inquiry for Reliable Medical Decision-Making

Baichuan-M3 Team, Chengfeng Dou, Fan Yang, et al.

Generative Modeling via Drifting

Diffusion Model

Image Generation

Mingyang Deng, He Li, Tianhong Li, Kaiming He

AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models

Text Generation

Junfeng Fang, Houcheng Jiang, Kun Wang, et al.

Learning to Reason in 13 Parameters

Intelligent Question Answering

John X. Morris, Niloofar Mireshghallah, Mark Ibrahim, et al.

DFlash: Block Diffusion for Flash Speculative Decoding

Diffusion Model

Jian Chen, Yesheng Liang, Zhijian Liu

Context Forcing: Consistent Autoregressive Video Generation with Long Context

Video Generation

Diffusion Model

Shuo Chen, Cong Wei, Sun Sun, et al.

MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents

Haozhen Zhang, Quanyu Long, Jianzhu Bao, et al.

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

Reinforcement Learning

Fanfan Liu, Youyang Yin, Peng Shi, et al.

Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening

Zhenxiong Yu, Zhi Yang, Zhiheng Jin, et al.

CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty

Johannes Kirmayr, Lukas Stappen, Elisabeth André

Streaming Sequence-to-Sequence Learning with Delayed Streams Modeling

Multimodal Representation

Neil Zeghidour, Eugene Kharitonov, Manu Orsini, et al.

Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation

Diffusion Model

Jiantao Lin, Xin Yang, Meixi Chen, et al.

Stateful Conformer with Cache-Based Inference for Streaming Automatic Speech Recognition

Audio Recognition

Vahid Noroozi, Somshubra Majumdar, Ankur Kumar, et al.

Code2World: A GUI World Model via Renderable Code Generation

Code2World: A GUI World Model via Renderable Code Generation

Code Generation

Yuhao Zheng, Li'an Zhong, Yi Wang, et al.

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Shaobo Wang, Xuan Ouyang, Tianyi Xu, et al.

BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation

Yucheng Hu, Jianke Zhang, Yuanfei Luo, et al.

THINGS-data: A multimodal collection of large-scale datasets for investigating object representations in human brain and behavior

Object Recognition

Martin N Hebert, Oliver Contier, Lina Teichmann, et al.

Accurate Predictions of Novel Biomolecular Interactions with IsoDDE

Isomorphic Labs Team

SKILLRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Reinforcement Learning

Peng Xia, Jianwen Chen, Hanyang Wang, et al.

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

Diffusion Model

Tiwei Bie, Maosong Cao, Xiang Cao, et al.

Alleviating Sparse Rewards by Modeling Step-Wise and Long-Term Sampling Effects in Flow-Based GRPO

Diffusion Model

Image Generation

Yunze Tong, Mushui Liu, Canyu Zhao, et al.

Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning

Yalcin Tur, Jalal Naghiyev, Haoquan Fang, et al.

QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining

Jun Han, Shuo Zhang, Wei Li, et al.

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

Multimodal Representation

Xiaomin Yu, Yi Xin, Wenjie Zhang, et al.

MOVA: Towards Scalable and Synchronized Video-Audio Generation

Video Generation

SII-OpenMOSS Team, Donghua Yu, Mingshu Chen, et al.

MemoryLLM: Plug-n-Play Interpretable Feed-Forward Memory for Transformers

Ajay Jaiswal, Lauren Hannah, Han-Byul Kim, et al.

DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos

Video Understanding

Shenyuan Gao, William Liang, Kaiyuan Zheng, et al.

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

Reinforcement Learning

Daniil Plyusov, Alexey Gorbatovski, Boris Shaposhnikov, et al.

MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration

Lianhai Ren, Yucheng Ding, Xiao Liu, et al.

AudioSAE: Towards Understanding of Audio-Processing Models with Sparse AutoEncoders

Audio and Speech Processing

Georgii Aparin, Tasnima Sadekova, Alexey Rukhovich, et al.

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

Reinforcement Learning

Shumin Wang, Yuexiang Xie, Wenhao Zhang, et al.

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

Fangzhi Xu, Hang Yan, Qiushi Sun, et al.

Baichuan-M3: Modeling Clinical Inquiry for Reliable Medical Decision-Making

Baichuan-M3 Team, Chengfeng Dou, Fan Yang, et al.

Generative Modeling via Drifting

Diffusion Model

Image Generation

Mingyang Deng, He Li, Tianhong Li, Kaiming He

AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models

Text Generation

Junfeng Fang, Houcheng Jiang, Kun Wang, et al.

Learning to Reason in 13 Parameters

Intelligent Question Answering

John X. Morris, Niloofar Mireshghallah, Mark Ibrahim, et al.

DFlash: Block Diffusion for Flash Speculative Decoding

Diffusion Model

Jian Chen, Yesheng Liang, Zhijian Liu

Context Forcing: Consistent Autoregressive Video Generation with Long Context

Video Generation

Diffusion Model

Shuo Chen, Cong Wei, Sun Sun, et al.

MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents

Haozhen Zhang, Quanyu Long, Jianzhu Bao, et al.

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

Reinforcement Learning

Fanfan Liu, Youyang Yin, Peng Shi, et al.

Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening

Zhenxiong Yu, Zhi Yang, Zhiheng Jin, et al.

CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty

Johannes Kirmayr, Lukas Stappen, Elisabeth André

Streaming Sequence-to-Sequence Learning with Delayed Streams Modeling

Multimodal Representation

Neil Zeghidour, Eugene Kharitonov, Manu Orsini, et al.

Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation

Diffusion Model

Jiantao Lin, Xin Yang, Meixi Chen, et al.

Stateful Conformer with Cache-Based Inference for Streaming Automatic Speech Recognition

Audio Recognition

Vahid Noroozi, Somshubra Majumdar, Ankur Kumar, et al.

BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation

THINGS-data: A multimodal collection of large-scale datasets for investigating object representations in human brain and behavior

Accurate Predictions of Novel Biomolecular Interactions with IsoDDE

SKILLRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

Alleviating Sparse Rewards by Modeling Step-Wise and Long-Term Sampling Effects in Flow-Based GRPO

Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning

QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

MOVA: Towards Scalable and Synchronized Video-Audio Generation

MemoryLLM: Plug-n-Play Interpretable Feed-Forward Memory for Transformers

DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration

AudioSAE: Towards Understanding of Audio-Processing Models with Sparse AutoEncoders

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

Baichuan-M3: Modeling Clinical Inquiry for Reliable Medical Decision-Making

Generative Modeling via Drifting

AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models

Learning to Reason in 13 Parameters

DFlash: Block Diffusion for Flash Speculative Decoding

Context Forcing: Consistent Autoregressive Video Generation with Long Context

MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening

CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty

Streaming Sequence-to-Sequence Learning with Delayed Streams Modeling

Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation

Stateful Conformer with Cache-Based Inference for Streaming Automatic Speech Recognition

BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation

THINGS-data: A multimodal collection of large-scale datasets for investigating object representations in human brain and behavior

Accurate Predictions of Novel Biomolecular Interactions with IsoDDE

SKILLRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

Alleviating Sparse Rewards by Modeling Step-Wise and Long-Term Sampling Effects in Flow-Based GRPO

Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning

QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

MOVA: Towards Scalable and Synchronized Video-Audio Generation

MemoryLLM: Plug-n-Play Interpretable Feed-Forward Memory for Transformers

DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration

AudioSAE: Towards Understanding of Audio-Processing Models with Sparse AutoEncoders

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

Baichuan-M3: Modeling Clinical Inquiry for Reliable Medical Decision-Making

Generative Modeling via Drifting

AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models

Learning to Reason in 13 Parameters

DFlash: Block Diffusion for Flash Speculative Decoding

Context Forcing: Consistent Autoregressive Video Generation with Long Context

MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening

CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty

Streaming Sequence-to-Sequence Learning with Delayed Streams Modeling

Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation

Stateful Conformer with Cache-Based Inference for Streaming Automatic Speech Recognition