Command Palette
Search for a command to run...
Papers
Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

From Editor to Dense Geometry Estimator

Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth

Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers

ArcMemo: Abstract Reasoning Composition with Lifelong LLM Memory

CoT-Space: A Theoretical Framework for Internal Slow-Thinking via Reinforcement Learning

Multi-View 3D Point Tracking

MOSAIC: Multi-Subject Personalized Generation via Correspondence-Aware Alignment and Disentanglement

Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation

On the Theoretical Limitations of Embedding-Based Retrieval

LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations

Open Data Synthesis For Deep Research

Robix: A Unified Model for Robot Interaction, Reasoning and Planning

FusionProt: Fusing Sequence and Structural Information for Unified Protein Representation Learning

LimiX: Unleashing Structured-Data Modeling Capability for Generalist Intelligence

epiGPTope: A machine learning-based epitope generator and classifier

GenCompositor: Generative Video Compositing with Diffusion Transformer

DCPO: Dynamic Clipping Policy Optimization

Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic

Baichuan-M2: Scaling Medical Capability with Large Verifier System

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding

AlphaEarth Foundations: An embedding field model for accurate and efficient global mapping from sparse label data

AetherCode: Evaluating LLMs' Ability to Win In Premier Programming Competitions

TileLang: A Composable Tiled Programming Model for AI Systems

DeepSeek-R1 Thoughtology: Let's think about LLM Reasoning

Multi-Ontology Integration with Dual-Axis Propagation for Medical Concept Representation

Automated Clinical Problem Detection from SOAP Notes using a Collaborative Multi-Agent LLM Architecture

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models

How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on τ-bench

UI-Level Evaluation of ALLaM 34B: Measuring an Arabic-Centric LLM via HUMAIN Chat

From reactive to cognitive: brain-inspired spatial intelligence for embodied agents

From Editor to Dense Geometry Estimator

Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth

Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers

ArcMemo: Abstract Reasoning Composition with Lifelong LLM Memory

CoT-Space: A Theoretical Framework for Internal Slow-Thinking via Reinforcement Learning

Multi-View 3D Point Tracking

MOSAIC: Multi-Subject Personalized Generation via Correspondence-Aware Alignment and Disentanglement

Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation

On the Theoretical Limitations of Embedding-Based Retrieval

LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations

Open Data Synthesis For Deep Research

Robix: A Unified Model for Robot Interaction, Reasoning and Planning

FusionProt: Fusing Sequence and Structural Information for Unified Protein Representation Learning

LimiX: Unleashing Structured-Data Modeling Capability for Generalist Intelligence

epiGPTope: A machine learning-based epitope generator and classifier

GenCompositor: Generative Video Compositing with Diffusion Transformer

DCPO: Dynamic Clipping Policy Optimization

Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic

Baichuan-M2: Scaling Medical Capability with Large Verifier System

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding

AlphaEarth Foundations: An embedding field model for accurate and efficient global mapping from sparse label data

AetherCode: Evaluating LLMs' Ability to Win In Premier Programming Competitions

TileLang: A Composable Tiled Programming Model for AI Systems

DeepSeek-R1 Thoughtology: Let's think about LLM Reasoning

Multi-Ontology Integration with Dual-Axis Propagation for Medical Concept Representation

Automated Clinical Problem Detection from SOAP Notes using a Collaborative Multi-Agent LLM Architecture

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models

How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on τ-bench

UI-Level Evaluation of ALLaM 34B: Measuring an Arabic-Centric LLM via HUMAIN Chat

From reactive to cognitive: brain-inspired spatial intelligence for embodied agents