Papers
Cutting-edge AI research papers, updated daily, to help you keep up with the latest AI trends

Rep-MTL: Unleashing the Power of Representation-level Task Saliency for Multi-Task Learning

ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World Shorts
Agentic Reinforced Policy Optimization
SciToolAgent: A Knowledge Graph-Driven Scientific Agent for Multi-Tool Integration
Specification Self-Correction: Mitigating In-Context Reward Hacking Through Test-Time Refinement
PRIX: Learning to Plan from Raw Pixels for End-to-End Autonomous Driving
Chat with AI: The Surprising Turn of Real-time Video Communication from Human to AI
MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents
Deep Researcher with Test-Time Diffusion
The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm
MedIQA: A Scalable Foundation Model for Prompt-Driven Medical Image Quality Assessment
OS-MAP: How Far Can Computer-Using Agents Go in Breadth and Depth?
Hierarchical Budget Policy Optimization for Adaptive Reasoning
Captain Cinema: Towards Short Movie Generation
LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization
MUR: Momentum Uncertainty guided Reasoning for Large Language Models
NABLA: Neighborhood Adaptive Block-Level Attention
Group Sequence Policy Optimization
SafeWork-R1: Coevolving Safety and Intelligence under the AI-45 Law
Decoupling Knowledge and Reasoning in LLMs: An Exploration Using Cognitive Dual-System Theory
Re:Form -- Reducing Human Priors in Scalable Formal Software Verification with RL in LLMs: A Preliminary Study on Dafny
RAVine: Reality-Aligned Evaluation for Agentic Search
Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning
DesignLab: Designing Slides Through Iterative Detection and Correction
Yume: An Interactive World Generation Model
Pixels, Patterns, but No Poetry: To See The World like Humans
MedChatZH: a Better Medical Adviser Learns from Better Instructions
Constructing Ophthalmic MLLM for Positioning-diagnosis Collaboration Through Clinical Cognitive Chain Reasoning
HySafe-AI: Hybrid Safety Architectural Analysis Framework for AI Systems: A Case Study
Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning
Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking Reasoning
Upsample What Matters: Region-Adaptive Latent Sampling for Accelerated Diffusion Transformers