Command Palette
Search for a command to run...
Papers
Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

A Simple Baseline for Streaming Video Understanding

CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery































A Simple Baseline for Streaming Video Understanding

CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery






























Steerable Visual Representations
SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization
Generative World Renderer
The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook
DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models
QuitoBench: A High-Quality Open Time Series Forecasting Benchmark
Vision2Web: A Hierarchical Benchmark for Visual Website Development with Agent Verification
ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?
MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome
Terminal Agents Suffice for Enterprise Automation
ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers
Cheap Bootstrap for Fast Uncertainty Quantification of Stochastic Gradient Descent
Generative AI Enables Structural Brain Network Construction from fMRI via Symmetric Diffusion Learning
Early Exiting Predictive Coding Neural Networks for Edge AI
Quadratic Gradient: A Unified Framework Bridging Gradient Descent and Newton-Type Methods by Synthesizing Hessians and Gradients
The capacity region of classes of product broadcast channels
Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos
TOOLACE: WINNING THE POINTS OF LLM FUNCTION CALLING
LightMover: Generative Light Movement with Color and Intensity Controls
Autonomous overtaking trajectory optimization using reinforcement learning and opponent pose estimation
Make It Up: Fake Images, Real Gains in Generalized Few-shot Semantic Segmentation
Two-Stage Acoustic Adaptation with Gated Cross-Attention Adapters for LLM-Based Multi-Talker Speech Recognition
A Comparative Study in Surgical AI: Datasets, Foundation Models, and Barriers to Med-AGI
Text Data Integration
Unified Number-Free Text-to-Motion Generation Via Flow Matching
SEAR: Schema-Based Evaluation and Routing for LLM Gateways
On-the-fly Repulsion in the Contextual Space for Rich Diversity in Diffusion Transformers
EpochX: Building the Infrastructure for an Emergent Agent Civilization
TAPS: Task Aware Proposal Distributions for Speculative Sampling
LongTail Driving Scenarios with Reasoning Traces: The KITScenes LongTail Dataset
Steerable Visual Representations
SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization
Generative World Renderer
The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook
DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models
QuitoBench: A High-Quality Open Time Series Forecasting Benchmark
Vision2Web: A Hierarchical Benchmark for Visual Website Development with Agent Verification
ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?
MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome
Terminal Agents Suffice for Enterprise Automation
ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers
Cheap Bootstrap for Fast Uncertainty Quantification of Stochastic Gradient Descent
Generative AI Enables Structural Brain Network Construction from fMRI via Symmetric Diffusion Learning
Early Exiting Predictive Coding Neural Networks for Edge AI
Quadratic Gradient: A Unified Framework Bridging Gradient Descent and Newton-Type Methods by Synthesizing Hessians and Gradients
The capacity region of classes of product broadcast channels
Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos
TOOLACE: WINNING THE POINTS OF LLM FUNCTION CALLING
LightMover: Generative Light Movement with Color and Intensity Controls
Autonomous overtaking trajectory optimization using reinforcement learning and opponent pose estimation
Make It Up: Fake Images, Real Gains in Generalized Few-shot Semantic Segmentation
Two-Stage Acoustic Adaptation with Gated Cross-Attention Adapters for LLM-Based Multi-Talker Speech Recognition
A Comparative Study in Surgical AI: Datasets, Foundation Models, and Barriers to Med-AGI
Text Data Integration
Unified Number-Free Text-to-Motion Generation Via Flow Matching
SEAR: Schema-Based Evaluation and Routing for LLM Gateways
On-the-fly Repulsion in the Contextual Space for Rich Diversity in Diffusion Transformers
EpochX: Building the Infrastructure for an Emergent Agent Civilization
TAPS: Task Aware Proposal Distributions for Speculative Sampling
LongTail Driving Scenarios with Reasoning Traces: The KITScenes LongTail Dataset