Papers

Nahyun Lee, Dongkeun Yoon, Guijin Son, et al.

A Matter of TASTE: Improving Coverage and Difficulty of Agent Benchmarks

Benchmarks

Tomer Keren, Nitay Calderon, Asaf Yehudai, et al.

On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters

Mind Lab, Song Cao, Vic Cao, et al.

Model Training

Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs

Image Generation

Haozhe Zhao, Shuzheng Si, Zhenhailong Wang, et al.

TACK: A statistical evaluation of degradation activity on a novel TArgeting Chimeras Knowledge dataset

Stefano Ribes, Nils Dunlop, Rocío Mercado

Deep Learning

Narrative Weaver: Towards Controllable Long-Range Visual Consistency with Multi-Modal Conditioning

Zhengjian Yao, Yongzhi Li, Xinyuan Gao, et al.

Harness Updating Is Not Harness Benefit: Disentangling Evolution Capabilities in Self-Evolving LLM Agents

Minhua Lin, Juncheng Wu, Zijun Wang, et al.

LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

Reinforcement Learning

Nianyi Lin, Jiajie Zhang, Lei Hou, et al.

Trust-Region Behavior Blending for On-Policy Distillation

Reinforcement Learning

Daniil Plyusov, Alexey Gorbatovski, Alexey Malakhov, et al.

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

Text-to-Speech

Ruiqi Li, Yu Zhang, Changhao Pan, et al.

Representation Forcing for Bottleneck-Free Unified Multimodal Models

Any-to-Any

Image Generation

Yuqing Wang, Zhijie Lin, Ceyuan Yang, et al.

GrepSeek: Training Search Agents for Direct Corpus Interaction

Alireza Salemi, Chang Zeng, Atharva Nijasure, et al.

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

Tianyi Zhou, Dongrui Liu, Leitao Yuan, et al.

Agentic Systems as Boosting Weak Reasoning Models

Varun Sunkaraneni, Pierfrancesco Beneventano, Riccardo Neumarker, et al.

Reasoning

YoCausal: How Far is Video Generation from World Model? A Causality Perspective

You-Zhe Xie, Yu-Hsuan Li, Jie-Ying Lee, et al.

minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models

Min Zhao, Hongzhou Zhu, Bokai Yan, et al.

CollectionLoRA: Collecting 50 Effects in 1 LoRA via Multi-Teacher On-Policy Distillation

Fangtai Wu, Hailong Guo, Shijie Huang, et al.

Image Generation

OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources

Retrieval-Augmented Generation

Intelligent Question Answering

Jinheon Baek, Soyeong Jeong, Sangwoo Park, et al.

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Qiuyue Wang, Mingsheng Li, Jian Guan, et al.

Qwen

AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security

Dongrui Liu, Yu Li, Zhonghao Yang, et al.

Qwen

World Action Models: The Next Frontier in Embodied AI

Embodied Intelligence

Siyin Wang, Junhao Shi, Zhaoyang Fu, et al.

World Action Models are Zero-shot Policies

Seonghyeon Ye, Yunhao Ge, Kaiyuan Zheng, et al.

ResearchMath-14K: Scaling Research-Level Mathematics via Agents

Mathematics

Guijin Son, Seungyeop Yi, Minju Gwak, et al.

Self-Improving Language Models with Bidirectional Evolutionary Search

Guowei Xu, Zhenting Qi, Huangyuan Su, et al.

Model Training

From Pixels to Words -- Towards Native One-Vision Models at Scale

Haiwen Diao, Jiahao Wang, Penghao Wu, et al.

Video Understanding

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Minki Kang, Shizhe Diao, Ryo Hachiuma, et al.

ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation

Reinforcement Learning

Preference Modeling

Hongru Hou, Tiehua Mei, Denghui Geng, et al.

Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

Fangfu Liu, Kai He, Tianchang Shen, et al.

AutoFigure: Generating and Refining Publication-Ready Scientific Illustrations

Text-to-Image

Minjun Zhu, Zhen Lin, Yixuan Weng, et al.

AutoResearch AI: Towards AI-Powered Research Automation for Scientific Discovery

Guiyao Tie, Jiawen Shi, Dingjie Song, et al.

Embodied Intelligence

Agent Harness Engineering: A Survey

Junjie Li, Xi Xiao, Yunbei Zhang, et al.

D^2-Monitor: Dynamic Safety Monitoring for Diffusion LLMs via Hesitation-Aware Routing