Command Palette
Search for a command to run...
Papers
Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Automated Algorithmic Discovery for Gravitational-Wave Detection Guided by LLM-Informed Evolutionary Monte Carlo Tree Search































Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Automated Algorithmic Discovery for Gravitational-Wave Detection Guided by LLM-Informed Evolutionary Monte Carlo Tree Search






























Beyond the Trade-off: Self-Supervised Reinforcement Learning for Reasoning Models' Instruction Following
Llama-3.1-FoundationAI-SecurityLLM-8B-Instruct Technical Report
CellForge: Agentic Design of Virtual Cell Models
SitEmb-v1.5: Improved Context-Aware Dense Retrieval for Semantic Association and Long Story Comprehension
RL-PLUS: Countering Capability Boundary Collapse of LLMs in Reinforcement Learning with Hybrid-policy Optimization
GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting
SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution
Multimodal Referring Segmentation: A Survey
3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding
SWE-Exp: Experience-Driven Software Issue Resolution
PixNerd: Pixel Neural Field Diffusion
Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models
Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training
Co-Producing AI: Toward an Augmented, Participatory Lifecycle
iLRM: An Iterative Large 3D Reconstruction Model
villa-X: Enhancing Latent Action Modeling in Vision-Language-Action Models
C3: A Bilingual Benchmark for Spoken Dialogue Models Exploring Challenges in Complex Conversations
RecGPT Technical Report
Phi-Ground Tech Report: Advancing Perception in GUI Grounding
Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving
The Outcome of the 2022 Landslide4Sense Competition: Advanced Landslide Detection from Multi-Source Satellite Imagery
Less is More for Synthetic Speech Detection in the Wild
Solution-aware vs global ReLU selection: partial MILP strikes back for DNN verification
CoT-Self-Instruct: Building high-quality synthetic prompts for reasoning and non-reasoning tasks
Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation
Adapting Vehicle Detectors for Aerial Imagery to Unseen Domains with Weak Supervision
VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning
Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance
BANG: Dividing 3D Assets via Generative Exploded Dynamics
ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents
Beyond the Trade-off: Self-Supervised Reinforcement Learning for Reasoning Models' Instruction Following
Llama-3.1-FoundationAI-SecurityLLM-8B-Instruct Technical Report
CellForge: Agentic Design of Virtual Cell Models
SitEmb-v1.5: Improved Context-Aware Dense Retrieval for Semantic Association and Long Story Comprehension
RL-PLUS: Countering Capability Boundary Collapse of LLMs in Reinforcement Learning with Hybrid-policy Optimization
GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting
SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution
Multimodal Referring Segmentation: A Survey
3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding
SWE-Exp: Experience-Driven Software Issue Resolution
PixNerd: Pixel Neural Field Diffusion
Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models
Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training
Co-Producing AI: Toward an Augmented, Participatory Lifecycle
iLRM: An Iterative Large 3D Reconstruction Model
villa-X: Enhancing Latent Action Modeling in Vision-Language-Action Models
C3: A Bilingual Benchmark for Spoken Dialogue Models Exploring Challenges in Complex Conversations
RecGPT Technical Report
Phi-Ground Tech Report: Advancing Perception in GUI Grounding
Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving
The Outcome of the 2022 Landslide4Sense Competition: Advanced Landslide Detection from Multi-Source Satellite Imagery
Less is More for Synthetic Speech Detection in the Wild
Solution-aware vs global ReLU selection: partial MILP strikes back for DNN verification
CoT-Self-Instruct: Building high-quality synthetic prompts for reasoning and non-reasoning tasks
Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation
Adapting Vehicle Detectors for Aerial Imagery to Unseen Domains with Weak Supervision
VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning
Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance
BANG: Dividing 3D Assets via Generative Exploded Dynamics
ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents