6 months ago

Abstract

The advent of wearable computers enables a new source of context for AI that is embedded in egocentric sensor data. This new egocentric data comes equipped with fine-grained 3D location information and thus presents the opportunity for a novel class of spatial foundation models that are rooted in 3D space. To measure progress on what we term Egocentric Foundation Models (EFMs), we establish EFM3D, a benchmark with two core 3D egocentric perception tasks. EFM3D is the first benchmark for 3D object detection and surface regression on high-quality annotated egocentric data from Project Aria. We propose Egocentric Voxel Lifting (EVL), a baseline for 3D EFMs. EVL leverages all available egocentric modalities and inherits foundational capabilities from 2D foundation models. This model, trained on a large simulated dataset, outperforms existing methods on the EFM3D benchmark.

3D Machine Vision

Multimodal

Multimodal Representation

Multimodality

3D Model

Task/Problem

Julian Straub Daniel DeTone Tianwei Shen Nan Yang Chris Sweeney Richard Newcombe

EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models
