HyperAI

Main

GPU

Console
Studio
Docs
Pricing

Pulse

News

Resources

Papers
Notebooks
Datasets
Wiki

Benchmarks

SOTA
LLM Models
GPU Leaderboard

Community

Events

Utility

About Terms of Service Privacy Policy
English

Papers

Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

Build the Future of Artificial Intelligence

About

About Us Support Dataset Help

Products

News Papers Notebooks Datasets Wiki

Links

© HyperAI

GitHub Discord X (formerly Twitter)

Why Language Models Hallucinate

Adam Tauman Kalai, Ofir Nachum, Santosh S. Vempala, et al.

LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation

Yinglin Duan, Zhengxia Zou, Tongwei Gu, et al.

Recomposer: Event-roll-guided generative audio editing

Daniel P. W. Ellis, Eduardo Fonseca, Ron J. Weiss, et al.

Transition Models: Rethinking the Generative Learning Objective

Diffusion Model

Zidong Wang, Yiyuan Zhang, Xiaoyu Yue, et al.

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

Supervised Fine-Tuning

Qinyan Zhang, Xinping Lei, Ruijie Miao, et al.

DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks

Haiyuan Wan, Chen Yang, Junchi Yu, et al.

Towards a Unified View of Large Language Model Post-Training

Supervised Fine-Tuning

Reinforcement Learning

Xingtai Lv, Yuxin Zuo, Youbang Sun, et al.

From Editor to Dense Geometry Estimator

Depth Estimation

Diffusion Model

JiYuan Wang, Chunyu Lin, Lei Sun, et al.

Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth

Yang Wang, Chenghao Xiao, Chia-Yi Hsiao, et al.

Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers

Xingyue Huang, Rishabh, Gregor Franke, et al.

ArcMemo: Abstract Reasoning Composition with Lifelong LLM Memory

Matthew Ho, Chen Si, Zhaoxiang Feng, et al.

CoT-Space: A Theoretical Framework for Internal Slow-Thinking via Reinforcement Learning

Reinforcement Learning

Zeyu Gan, Hao Yi, Yong Liu

Multi-View 3D Point Tracking

3D Machine Vision

Depth Estimation

Frano Rajič, Haofei Xu, Marko Mihajlovic, et al.

MOSAIC: Multi-Subject Personalized Generation via Correspondence-Aware Alignment and Disentanglement

Image Generation

Dong She, Siming Fu, Mushui Liu, et al.

Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation

Diffusion Model

Image Generation

Xuechao Zou, Shun Zhang, Xing Fu, et al.

On the Theoretical Limitations of Embedding-Based Retrieval

Retrieval-Augmented Generation

Orion Weller, Michael Boratko, Iftekhar Naim, et al.

LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations

Daniela Gottesman, Alon Gilae-Dotan, Ido Cohen, et al.

Open Data Synthesis For Deep Research

Ziyi Xia, Kun Luo, Hongjin Qian, et al.

Robix: A Unified Model for Robot Interaction, Reasoning and Planning

Embodied Intelligence

Huang Fang, Mengxi Zhang, Heng Dong, et al.

FusionProt: Fusing Sequence and Structural Information for Unified Protein Representation Learning

Multimodal Representation

Dan Kalifa, Uriel Singer, Kira Radinsky

LimiX: Unleashing Structured-Data Modeling Capability for Generalist Intelligence

Multi-Task Learning

Xingxuan Zhang, Gang Ren, Han Yu, et al.

epiGPTope: A machine learning-based epitope generator and classifier

Natalia Flechas Manrique, Alberto Martínez, Elena López-Martínez, et al.

GenCompositor: Generative Video Compositing with Diffusion Transformer

Video Generation

Video Processing

Shuzhou Yang, Xiaoyu Li, Xiaodong Cun, et al.

DCPO: Dynamic Clipping Policy Optimization

Reinforcement Learning

Shihui Yang, Chengfeng Dou, Peidong Guo, et al.

Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic

Mohammad Zbeeb, Hasan Abed Al Kader Hammoud, Bernard Ghanem

Baichuan-M2: Scaling Medical Capability with Large Verifier System

Baichuan-M2 Team, Chengfeng Dou, Chong Liu, et al.

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Dongfu Jiang, Yi Lu, Zhuofeng Li, et al.

ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding

Hao Lu, Jiahao Wang, Yaolun Zhang, et al.

AlphaEarth Foundations: An embedding field model for accurate and efficient global mapping from sparse label data

Christopher F. Brown, Michal R. Kazmierski, Valerie J. Pasquarella, et al.

AetherCode: Evaluating LLMs' Ability to Win In Premier Programming Competitions

Code Generation

Zihan Wang, Jiaze Chen, Zhicheng Liu, et al.

TileLang: A Composable Tiled Programming Model for AI Systems

Wang Lei, Cheng Yu, Shi Yining, et al.

DeepSeek-R1 Thoughtology: Let's think about LLM Reasoning

Sara Vera Marjanović, Arkil Patel, Vaibhav Adlakha, et al.
