Command Palette
Search for a command to run...
Papers
Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning

From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning































Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning

From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning






























DreamOmni2: Multimodal Instruction-based Editing and Generation
VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning
UniVideo: Unified Understanding, Generation, and Editing for Videos
MemMamba: Rethinking Memory Patterns in State Space Model
MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization
PromptCoT 2.0: Scaling Prompt Synthesis for Large Language Model Reasoning
Extract-0: A Specialized Language Model for Document Information Extraction
OmniRetarget: Interaction-Preserving Data Generation for Humanoid Whole-Body Loco-Manipulation and Scene Interaction
WildSpeech-Bench: Benchmarking End-to-End SpeechLLMs in the Wild
Token-Aware Editing of Internal Activations for Large Language Model Alignment
Looking to Learn: Token-wise Dynamic Gating for Low-Resource Vision-Language Modelling
Agent Learning via Early Experience
MATRIX: Mask Track Alignment for Interaction-aware Video Generation
RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training
SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models
Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding
Cache-to-Cache: Direct Semantic Communication Between Large Language Models
Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Qwen2.5 Technical Report
Scientific Algorithm Discovery by Augmenting AlphaEvolve with Deep Research
ConstraintLLM: A Neuro-Symbolic Framework for Industrial-Level Constraint Programming
Scaling Code-Assisted Chain-of-Thoughts and Instructions for Model Reasoning
CoDA: Coding LM via Diffusion Adaptation
Fast-dLLM v2: Efficient Block-Diffusion LLM
Less is More: Recursive Reasoning with Tiny Networks
Fathom-DeepResearch: Unlocking Long Horizon Information Retrieval and Synthesis for SLMs
TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning
Hybrid Architectures for Language Models: Systematic Analysis and Design Insights
MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information
DreamOmni2: Multimodal Instruction-based Editing and Generation
VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning
UniVideo: Unified Understanding, Generation, and Editing for Videos
MemMamba: Rethinking Memory Patterns in State Space Model
MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization
PromptCoT 2.0: Scaling Prompt Synthesis for Large Language Model Reasoning
Extract-0: A Specialized Language Model for Document Information Extraction
OmniRetarget: Interaction-Preserving Data Generation for Humanoid Whole-Body Loco-Manipulation and Scene Interaction
WildSpeech-Bench: Benchmarking End-to-End SpeechLLMs in the Wild
Token-Aware Editing of Internal Activations for Large Language Model Alignment
Looking to Learn: Token-wise Dynamic Gating for Low-Resource Vision-Language Modelling
Agent Learning via Early Experience
MATRIX: Mask Track Alignment for Interaction-aware Video Generation
RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training
SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models
Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding
Cache-to-Cache: Direct Semantic Communication Between Large Language Models
Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Qwen2.5 Technical Report
Scientific Algorithm Discovery by Augmenting AlphaEvolve with Deep Research
ConstraintLLM: A Neuro-Symbolic Framework for Industrial-Level Constraint Programming
Scaling Code-Assisted Chain-of-Thoughts and Instructions for Model Reasoning
CoDA: Coding LM via Diffusion Adaptation
Fast-dLLM v2: Efficient Block-Diffusion LLM
Less is More: Recursive Reasoning with Tiny Networks
Fathom-DeepResearch: Unlocking Long Horizon Information Retrieval and Synthesis for SLMs
TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning
Hybrid Architectures for Language Models: Systematic Analysis and Design Insights
MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information