Command Palette
Search for a command to run...
Papers
Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

Multi-Ontology Integration with Dual-Axis Propagation for Medical Concept Representation

Automated Clinical Problem Detection from SOAP Notes using a Collaborative Multi-Agent LLM Architecture































Multi-Ontology Integration with Dual-Axis Propagation for Medical Concept Representation

Automated Clinical Problem Detection from SOAP Notes using a Collaborative Multi-Agent LLM Architecture






























SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models
How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on τ-bench
UI-Level Evaluation of ALLaM 34B: Measuring an Arabic-Centric LLM via HUMAIN Chat
From reactive to cognitive: brain-inspired spatial intelligence for embodied agents
No Label Left Behind: A Unified Surface Defect Detection Model for all Supervision Regimes
T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables
PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
UQ: Assessing Language Models on Unsolved Questions
CARJAN: Agent-Based Generation and Simulation of Traffic Scenarios with AJAN
TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training
TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis
Droplet3D: Commonsense Priors from Videos Facilitate 3D Generation
A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code
EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control
R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
Igniting Creative Writing in Small Language Models: LLM-as-a-Judge versus Multi-Agent Refined Rewards
TMUAD: Enhancing Logical Capabilities in Unified Anomaly Detection Models with a Text Memory Bank
Analysing Chain of Thought Dynamics: Active Guidance or Unfaithful Post-hoc Rationalisation?
AWorld: Orchestrating the Training Recipe for Agentic AI
MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers
rStar2-Agent: Agentic Reasoning Technical Report
Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning
MobileCLIP2: Improving Multi-Modal Reinforced Training
AI-AI Esthetic Collaboration with Explicit Semiotic Awareness and Emergent Grammar Development
Gaze into the Heart: A Multi-View Video Dataset for rPPG and Health Biomarkers Estimation
Predicting the Order of Upcoming Tokens Improves Language Modeling
MIDAS: Multimodal Interactive Digital-human Synthesis via Real-time Autoregressive Video Generation
Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models
How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on τ-bench
UI-Level Evaluation of ALLaM 34B: Measuring an Arabic-Centric LLM via HUMAIN Chat
From reactive to cognitive: brain-inspired spatial intelligence for embodied agents
No Label Left Behind: A Unified Surface Defect Detection Model for all Supervision Regimes
T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables
PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
UQ: Assessing Language Models on Unsolved Questions
CARJAN: Agent-Based Generation and Simulation of Traffic Scenarios with AJAN
TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training
TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis
Droplet3D: Commonsense Priors from Videos Facilitate 3D Generation
A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code
EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control
R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
Igniting Creative Writing in Small Language Models: LLM-as-a-Judge versus Multi-Agent Refined Rewards
TMUAD: Enhancing Logical Capabilities in Unified Anomaly Detection Models with a Text Memory Bank
Analysing Chain of Thought Dynamics: Active Guidance or Unfaithful Post-hoc Rationalisation?
AWorld: Orchestrating the Training Recipe for Agentic AI
MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers
rStar2-Agent: Agentic Reasoning Technical Report
Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning
MobileCLIP2: Improving Multi-Modal Reinforced Training
AI-AI Esthetic Collaboration with Explicit Semiotic Awareness and Emergent Grammar Development
Gaze into the Heart: A Multi-View Video Dataset for rPPG and Health Biomarkers Estimation
Predicting the Order of Upcoming Tokens Improves Language Modeling
MIDAS: Multimodal Interactive Digital-human Synthesis via Real-time Autoregressive Video Generation
Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies