Command Palette
Search for a command to run...
Papers
Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

Valet: A Standardized Testbed of Traditional Imperfect-Information Card Games

Speculative Speculative Decoding































Valet: A Standardized Testbed of Traditional Imperfect-Information Card Games

Speculative Speculative Decoding






























Using Learning Progressions to Guide AI Feedback for Science Learning
HoMMI: Learning Whole-Body Mobile Manipulation from Human Demonstrations
Density-Guided Response Optimization: Community-Grounded Alignment via Implicit Acceptance Signals
Gravity Falls: A Comparative Analysis of Domain-Generation Algorithm (DGA) Detection Methods for Mobile Device Spearphishing
From Entropy to Epiplexity: Rethinking Information for Computationally Bounded Intelligence
The Design Space of Tri-Modal Masked Diffusion Models
CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning
RubricBench: Aligning Model-Generated Rubrics with Human Standards
MMR-Life: Piecing Together Real-life Scenes for Multimodal Multi-image Reasoning
OpenAutoNLU: Open Source AutoML Library for NLU
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens
From Scale to Speed: Adaptive Test-Time Scaling for Image Editing
Multi-agent cooperation through in-context co-player inference
ACTIONENGINE: From Reactive to Programmatic GUI Agents via State Machine Memory
CiteAudit: You Cited It, But Did You Read It? A Benchmark for Verifying Scientific References in the LLM Era
Mode Seeking meets Mean Seeking for Fast Long Video Generation
CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets
Enhancing Spatial Understanding in Image Generation via Reward Modeling
dLLM: Simple Diffusion Language Modeling
Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization
Imagination Helps Visual Reasoning, But Not Yet in Latent Space
OmniGAIA: Towards Native Omni-Modal AI Agents
MobilityBench: A Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios
From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models
The Trinity of Consistency as a Defining Principle for General World Models
GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL
SkyReels-V4: Multi-modal Video-Audio Generation, Inpainting and Editing model
ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning
DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation
Using Learning Progressions to Guide AI Feedback for Science Learning
HoMMI: Learning Whole-Body Mobile Manipulation from Human Demonstrations
Density-Guided Response Optimization: Community-Grounded Alignment via Implicit Acceptance Signals
Gravity Falls: A Comparative Analysis of Domain-Generation Algorithm (DGA) Detection Methods for Mobile Device Spearphishing
From Entropy to Epiplexity: Rethinking Information for Computationally Bounded Intelligence
The Design Space of Tri-Modal Masked Diffusion Models
CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning
RubricBench: Aligning Model-Generated Rubrics with Human Standards
MMR-Life: Piecing Together Real-life Scenes for Multimodal Multi-image Reasoning
OpenAutoNLU: Open Source AutoML Library for NLU
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens
From Scale to Speed: Adaptive Test-Time Scaling for Image Editing
Multi-agent cooperation through in-context co-player inference
ACTIONENGINE: From Reactive to Programmatic GUI Agents via State Machine Memory
CiteAudit: You Cited It, But Did You Read It? A Benchmark for Verifying Scientific References in the LLM Era
Mode Seeking meets Mean Seeking for Fast Long Video Generation
CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets
Enhancing Spatial Understanding in Image Generation via Reward Modeling
dLLM: Simple Diffusion Language Modeling
Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization
Imagination Helps Visual Reasoning, But Not Yet in Latent Space
OmniGAIA: Towards Native Omni-Modal AI Agents
MobilityBench: A Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios
From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models
The Trinity of Consistency as a Defining Principle for General World Models
GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL
SkyReels-V4: Multi-modal Video-Audio Generation, Inpainting and Editing model
ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning
DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation