Papers
Cutting-edge AI research papers, updated daily, to help you keep up with the latest AI trends

QFFT, Question-Free Fine-Tuning for Adaptive Reasoning

Can LLMs Generate High-Quality Test Cases for Algorithm Problems? TestCase-Eval: A Systematic Evaluation of Fault Coverage and Exposure

AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy
Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model
Efficient Medical VIE via Reinforcement Learning
Scaling Test-time Compute for LLM Agents
Iterative transcription factor screening enables rapid generation of microglia-like cells from human iPSC
TaskCraft: Automated Generation of Agentic Tasks
Wait, We Don't Need to "Wait"! Removing Thinking Tokens Improves Reasoning Efficiency
Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
Polystyrene nanoplastics disrupt the intestinal microenvironment by altering bacteria-host interactions through extracellular vesicle-delivered microRNAs
Beyond Homogeneous Attention: Memory-Efficient LLMs via Fourier-Approximated KV Cache
A High-Quality Dataset and Reliable Evaluation for Interleaved Image-Text Generation
SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning
LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming?
The Diffusion Duality
Effective Red-Teaming of Policy-Adherent Agents
Aligned Novel View Image and Geometry Synthesis via Cross-modal Attention Instillation
Unified differentiable learning of electric response
VRBench: A Benchmark for Multi-Step Reasoning in Long Narrative Videos
AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation
Text-Aware Image Restoration with Diffusion Models
Magistral
SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks
ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning
Sapiens: Foundation for Human Vision Models
LongVILA: Scaling Long-Context Visual Language Models for Long Videos
SAM 2: Segment Anything in Images and Videos
The Llama 3 Herd of Models