Command Palette
Search for a command to run...
Papers
Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

FinReflectKG: Agentic Construction and Evaluation of Financial Knowledge Graphs

A Survey of Reinforcement Learning for Large Reasoning Models































FinReflectKG: Agentic Construction and Evaluation of Financial Knowledge Graphs

A Survey of Reinforcement Learning for Large Reasoning Models






























Measuring and mitigating overreliance is necessary for building human-compatible AI
F1: A Vision-Language-Action Model Bridging Understanding and Generation to Actions
UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward
Reconstruction Alignment Improves Unified Multimodal Models
Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search
Visual Representation Alignment for Multimodal Large Language Models
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
WenetSpeech-Yue: A Large-scale Cantonese Speech Corpus with Multi-dimensional Annotation
SheetDesigner: MLLM-Powered Spreadsheet Layout Generation with Rule-Based and Vision-Based Reflection
Autonomous Code Evolution Meets NP-Completeness
Reinforcement Learning Foundations for Deep Research Systems: A Survey
Reinforced Visual Perception with Tools
Does DINOv3 Set a New Medical Vision Standard?
Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models
WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents
Reverse-Engineered Reasoning for Open-Ended Generation
OSC: Cognitive Orchestration through Dynamic Knowledge Alignment in Multi-Agent LLM Collaboration
CURE: Controlled Unlearning for Robust Embeddings -- Mitigating Conceptual Shortcuts in Pre-Trained Language Models
MedVista3D: Vision-Language Modeling for Reducing Diagnostic Errors in 3D CT Disease Detection, Understanding and Reporting
LuxDiT: Lighting Estimation with Video Diffusion Transformer
WildScore: Benchmarking MLLMs in-the-Wild Symbolic Music Reasoning
Set Block Decoding is a Language Model Inference Accelerator
Symbolic Graphics Programming with Large Language Models
Why Language Models Hallucinate
LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation
Recomposer: Event-roll-guided generative audio editing
Transition Models: Rethinking the Generative Learning Objective
Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?
DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks
Towards a Unified View of Large Language Model Post-Training
Measuring and mitigating overreliance is necessary for building human-compatible AI
F1: A Vision-Language-Action Model Bridging Understanding and Generation to Actions
UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward
Reconstruction Alignment Improves Unified Multimodal Models
Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search
Visual Representation Alignment for Multimodal Large Language Models
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
WenetSpeech-Yue: A Large-scale Cantonese Speech Corpus with Multi-dimensional Annotation
SheetDesigner: MLLM-Powered Spreadsheet Layout Generation with Rule-Based and Vision-Based Reflection
Autonomous Code Evolution Meets NP-Completeness
Reinforcement Learning Foundations for Deep Research Systems: A Survey
Reinforced Visual Perception with Tools
Does DINOv3 Set a New Medical Vision Standard?
Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models
WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents
Reverse-Engineered Reasoning for Open-Ended Generation
OSC: Cognitive Orchestration through Dynamic Knowledge Alignment in Multi-Agent LLM Collaboration
CURE: Controlled Unlearning for Robust Embeddings -- Mitigating Conceptual Shortcuts in Pre-Trained Language Models
MedVista3D: Vision-Language Modeling for Reducing Diagnostic Errors in 3D CT Disease Detection, Understanding and Reporting
LuxDiT: Lighting Estimation with Video Diffusion Transformer
WildScore: Benchmarking MLLMs in-the-Wild Symbolic Music Reasoning
Set Block Decoding is a Language Model Inference Accelerator
Symbolic Graphics Programming with Large Language Models
Why Language Models Hallucinate
LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation
Recomposer: Event-roll-guided generative audio editing
Transition Models: Rethinking the Generative Learning Objective
Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?
DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks
Towards a Unified View of Large Language Model Post-Training