Papers
Cutting-edge AI research papers, updated daily, to help you keep up with the latest AI trends

DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer

NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation

K-EXAONE Technical Report
The Hunger Game Debate: On the Emergence of Over-Competition in Multi-Agent Systems
Training AI Co-Scientists Using Rubric Rewards
AdaGaR: Adaptive Gabor Representation for Dynamic Scene Reconstruction
Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation
SenseNova-MARS: Empowering Multimodal Agentic Reasoning and Search via Reinforcement Learning
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization
IQuest-Coder-V1 Technical Report
Recursive Language Models
FlowBlending: Stage-Aware Multi-Model Sampling for Fast and High-Fidelity Video Generation
Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow
On the Role of Discreteness in Diffusion LLMs
DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models
Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space
Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling
AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents
Scaling Open-Ended Reasoning to Predict the Future
GaMO: Geometry-aware Multi-view Diffusion Outpainting for Sparse-View 3D Reconstruction
mHC: Manifold-Constrained Hyper-Connections
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models
GateBreaker: Gate-Guided Attacks on Mixture-of-Expert LLMs
GraphLocator: Graph-guided Causal Reasoning for Issue Localization
Evaluating Parameter Efficient Methods for RLVR
End-to-End Test-Time Training for Long Context
DreamOmni3: Scribble-based Editing and Generation
UltraShape 1.0: High-Fidelity 3D Shape Generation via Scalable Geometric Refinement
mimic-video: Video-Action Models for Generalizable Robot Control Beyond VLAs