Papers
Cutting-edge AI research papers, updated daily, to help you keep up with the latest AI trends

Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned

MIRepNet: A Pipeline and Foundation Model for EEG-Based Motor Imagery Classification

HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale
ChemDFM-R: An Chemical Reasoner LLM Enhanced with Atomized Chemical Knowledge
X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again
HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels
AlphaEarth Foundations: An embedding field model for accurate and efficient global mapping from sparse label data
Toward long-range ENSO prediction with an explainable deep learning model
OmniArch: Building Foundation Model for Scientific Computing
VA-MoE: Channel-Adapted MoE for Incremental Weather Forecasting
UI-AGILE: Advancing GUI Agents with Effective Reinforcement Learning and Precise Inference-Time Grounding
DualSG: A Dual-Stream Explicit Semantic-Guided Multivariate Time Series Forecasting Framework
When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token Compression across Images, Videos, and Audios
SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment
Reconstructing 4D Spatial Intelligence: A Survey
Rep-MTL: Unleashing the Power of Representation-level Task Saliency for Multi-Task Learning
ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World Shorts
Agentic Reinforced Policy Optimization
SciToolAgent: A Knowledge Graph-Driven Scientific Agent for Multi-Tool Integration
Specification Self-Correction: Mitigating In-Context Reward Hacking Through Test-Time Refinement
PRIX: Learning to Plan from Raw Pixels for End-to-End Autonomous Driving
Chat with AI: The Surprising Turn of Real-time Video Communication from Human to AI
MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents
Deep Researcher with Test-Time Diffusion
The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm
MedIQA: A Scalable Foundation Model for Prompt-Driven Medical Image Quality Assessment
OS-MAP: How Far Can Computer-Using Agents Go in Breadth and Depth?
Hierarchical Budget Policy Optimization for Adaptive Reasoning
Captain Cinema: Towards Short Movie Generation
LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization
MUR: Momentum Uncertainty guided Reasoning for Large Language Models
NABLA: Neighborhood Adaptive Block-Level Attention