Papers
Cutting-edge AI research papers, updated daily, to help you keep up with the latest AI trends

LLaVA-UHD v4: What Makes Efficient Visual Encoding in MLLMs?

Unmasking On-Policy Distillation: Where It Helps, Where It Hurts, and Why

A Single Neuron Is Sufficient to Bypass Safety Alignment in Large Language Models

SlimQwen: Exploring the Pruning and Distillation in Large MoE Model Pre-training

ELF: Embedded Language Flows

PaperFit: Vision-in-the-Loop Typesetting Optimization for Scientific Documents

Rubric-based On-policy Distillation

CollabVR: Collaborative Video Reasoning with Vision-Language and Video Generation Models

TMAS: Scaling Test-Time Compute via Multi-Agent Synergy

Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs

Qwen-Image-2.0 Technical Report

MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction

Learning while Deploying: Fleet-Scale Reinforcement Learning for Generalist Robot Policies

Fast Byte Latent Transformer

AI Co-Mathematician: Accelerating Mathematicians with Agentic AI

HyperEyes: Dual-Grained Efficiency-Aware Reinforcement Learning for Parallel Multimodal Search Agents

Mean Mode Screaming: Mean-Variance Split Residuals for 1000-Layer Diffusion Transformers

LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling

Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex

Flow-OPD: On-Policy Distillation for Flow Matching Models

MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation

Rethinking Reasoning-Intensive Retrieval: Evaluating and Advancing Retrievers in Agentic Search Systems

When to Trust Imagination: Adaptive Action Execution for World Action Models

RaguTeam at SemEval-2026 Task 8: Meno and Friends in a Judge-Orchestrated LLM Ensemble for Faithful Multi-Turn Response Generation

MiA-Signature: Approximating Global Activation for Long-Context Understanding

Continuous Latent Diffusion Language Model

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction

MathNet: A Global Multimodal Benchmark for Mathematical Reasoning and Retrieval

D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models

ZAYA1-8B Technical Report

PhysForge: Generating Physics-Grounded 3D Assets for Interactive Virtual World
