Papers
Cutting-edge AI research papers, updated daily, to help you keep up with the latest AI trends

LLaVA-UHD v4: What Makes Efficient Visual Encoding in MLLMs?

Unmasking On-Policy Distillation: Where It Helps, Where It Hurts, and Why

A Single Neuron Is Sufficient to Bypass Safety Alignment in Large Language Models
SlimQwen: Exploring the Pruning and Distillation in Large MoE Model Pre-training
ELF: Embedded Language Flows
PaperFit: Vision-in-the-Loop Typesetting Optimization for Scientific Documents
Rubric-based On-policy Distillation
CollabVR: Collaborative Video Reasoning with Vision-Language and Video Generation Models
TMAS: Scaling Test-Time Compute via Multi-Agent Synergy
Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs
Qwen-Image-2.0 Technical Report
MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction
Learning while Deploying: Fleet-Scale Reinforcement Learning for Generalist Robot Policies
Fast Byte Latent Transformer
AI Co-Mathematician: Accelerating Mathematicians with Agentic AI
HyperEyes: Dual-Grained Efficiency-Aware Reinforcement Learning for Parallel Multimodal Search Agents
Mean Mode Screaming: Mean-Variance Split Residuals for 1000-Layer Diffusion Transformers
LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling
Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex
Flow-OPD: On-Policy Distillation for Flow Matching Models
MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation
Rethinking Reasoning-Intensive Retrieval: Evaluating and Advancing Retrievers in Agentic Search Systems
When to Trust Imagination: Adaptive Action Execution for World Action Models
RaguTeam at SemEval-2026 Task 8: Meno and Friends in a Judge-Orchestrated LLM Ensemble for Faithful Multi-Turn Response Generation
MiA-Signature: Approximating Global Activation for Long-Context Understanding
Continuous Latent Diffusion Language Model
Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning
Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction
MathNet: A Global Multimodal Benchmark for Mathematical Reasoning and Retrieval
D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models
ZAYA1-8B Technical Report
PhysForge: Generating Physics-Grounded 3D Assets for Interactive Virtual World