NeurIPS 2025 Workshop on
SPACE in Vision, Language, and Embodied AI (SpaVLE)

6/7 December 2025, San Diego Convention Center, San Diego, USA

About SpaVLE

The SpaVLE workshop aims to bridge the historically siloed efforts of NLP, CV, and robotics communities by fostering cross-disciplinary dialogue to advance research on spatial understanding and representation.

Where

San Diego Convention Center, San Diego, USA

When

Saturday/Sunday
6/7 December 2025

Overview

We never directly "see" space, but we perceive and reconstruct it, describe it through language, and plan our actions in accordance with its constraints.

Background

At the core of embodied cognition lies the challenge of understanding and reasoning about space, its representations, dynamics, how it forms from and constrains visual perception, language communication, action, and interaction with the environment. Spatial representations are inherently multimodal, integrating vision, language, and motor control. They must capture both 2D and 3D spatial structures, from local object relationships to global scene geometry. Although we live in a dynamic 3D world, the images and videos we perceive offer only partial, static glimpses of this environment. Reconstructing an accurate, temporally coherent model of the 3D world from such limited observations remains a central challenge in computer vision. Spatial reasoning is equally critical for language understanding, especially in human-robot interaction, where interpreting grounded references to objects and locations is essential for effective communication. For embodied agents, building robust spatial representations is vital to support real-world navigation, manipulation, and dialogue. Despite advances in data-driven pretraining across 2D/3D vision, language, and action modalities, spatial understanding remains an open challenge. For example, (vision-)language models still fall short of human-level spatial reasoning, often struggling to interpret the semantics of spatial expressions, particularly when perspective, memory, or embodiment constraints are involved.

Scope and Goal

This workshop is particularly interested in how spatial representations can be learned from multimodal data, and applied to core tasks in computer vision (CV), natural language processing (NLP), and robotics. Spatial understanding has historically been pursued separately by these communities, each adopting distinct approaches and problem formulations. The goal of this joint workshop is to provide a platform for much-needed cross-disciplinary dialogue, advancing research on spatial understanding and representation by bringing together diverse perspectives. We aim to foster discussion on how spatial representations, whether symbolic, neural, verbal, or geometric, can be learned, evaluated, and deployed across modalities and tasks. A key focus is aligning these approaches with the demands of real-world applications and addressing practical challenges such as the high cost of real-robot experiments, the scarcity of multimodal 3D and embodied data, and the complexities of human-in-the-loop evaluation and non-verbal communication.

Call for Papers

Topics of Interest

The scope and topics include, but are not limited to:

  • Foundations of Spatial Representation and Reasoning. Models and formalisms for representing space, including symbolic, geometric, neural, or hybrid approaches, and reasoning about objects, spatial relations, and dynamics in 2D and 3D environments.
  • Multimodal Spatial Grounding. Learning and aligning spatial concepts across language, vision (2D / 3D), and action modalities, including spatial representation learning, cross-modal fusion, and spatially informed planning.
  • Applications in NLP, Vision, Robotics, and Generative AI. Applying spatial understanding to tasks such as instruction following, embodied manipulation, situated dialogue, scene understanding, and spatially grounded generation (e.g., image/video synthesis, layout prediction, or simulation).
  • Evaluation and Benchmarking Spatial Intelligence. Developing datasets, tasks, and metrics for assessing spatial reasoning, generative fidelity, grounded communication, and skill learning in embodied or interactive contexts.
  • Spatial Reasoning in Foundation Models. Examining how large language and vision-language models handle spatial understanding, identifying current limitations, and exploring methods to integrate spatial priors or structural biases for improved generalization and robustness.

Submission Format

We welcome (1) Short papers up to 4 pages and (2) Full papers up to 9 pages.

  • Long Research Paper (up to 9 pages)
  • Short Research Paper (up to 4 pages)
  • Dataset/Benchmark Paper (up to 9 pages, authors are strongly encouraged to follow the Datasets & Benchmarks Policies)
  • Survey/Review Paper (up to 9 pages)
  • Position Paper (up to 4 pages)

The page limit excludes references and supplementary materials. Submissions must be prepared in PDF format using the NeurIPS 2025 style template through OpenReview.

  • Submission Protocol: OpenReview
  • Submission Template: Zip
  • Archival Option: All accepted papers will be non-archival.

Important Dates

  • Submission Open: July 7, 2025
  • Submission Deadline: August 22, 2025
  • Notification of Acceptance: September 22, 2025
  • Camera-Ready Deadline: October 25, 2025
  • Workshop Day: December 6 or 7, 2025 (co-located with NeurIPS 2025)

Keynote Speakers (Alphabetical Order)

Amir Zadeh

Lambda Labs

Barbara Landau

Johns Hopkins University

Dieter Fox

University of Washington / Allen Institute for AI

Joshua Tenenbaum

Massachusetts Institute of Technology

Joyce Chai

University of Michigan

Ranjay Krishna

University of Washington / Allen Institute for AI

Saining Xie

New York University

Confirmed Panelists (Alphabetical Order)

Anthony G. Cohn

University of Leeds

Organizing Committee

Martin Ziqiao Ma

University of Michigan

Freda Shi

University of Waterloo / Vector Institute

Jiayuan Mao

Massachusetts Institute of Technology

Jiafei Duan

University of Washington

Manling Li

Northwestern University

David Hsu

National University of Singapore

Parisa Kordjamshidi

Michigan State University

Event Schedule

(Tentative Schedule)

Welcome

Opening Remarks.

Keynote Joshua Tenenbaum

Title: TBD

Keynote Barbara Landau

Title: TBD

Poster Presentations

Poster session 1 and coffee break.

Keynote Joyce Chai

Title: TBD

Oral Presentations

Oral session 1 (2 papers).

Panel Discussion

Panel discussions with lunch break.

Keynote Ranjay Krishna

Title: TBD

Keynote Saining Xie

Title: TBD

Poster Presentations

Poster session 2 and coffee break.

Keynote Dieter Fox

Title: TBD

Keynote Amir Zadeh

Title: TBD

Oral Presentations

Oral session 2 (4 papers).

Closing Remarks

Free discussions and paper award annoucement.

Keynote Joshua Tenenbaum

Title: TBD

Keynote Barbara Landau

Title: TBD

Keynote Joyce Chai

Title: TBD

Panel Discussion

Panel discussions with lunch break.

Keynote Ranjay Krishna

Title: TBD

Keynote Saining Xie

Title: TBD

Keynote Dieter Fox

Title: TBD

Keynote Amir Zadeh

Title: TBD

Poster Presentations

Poster session 1 and coffee break.

Oral Presentations

Oral session 1 (2 papers).

Poster Presentations

Poster session 2 and coffee break.

Oral Presentations

Oral session 2 (4 papers).

Workshop Venue

San Diego Convention Center

Room: TBD

111 Harbor Dr, San Diego, CA 92101

Sponsors

We are seeking additional sponsors to support the workshop and the community for traveling and registration. If you are interested in sponsoring, please contact us.

We are pleased that Lambda, Alquist Robotics, EdenSign will sponsor our workshop. In support of research excellence and community engagement, we will provide $3,000 in cloud credits for the Best Paper Award, and $1,500 in cloud credits for the Runner-Up Paper Award.

Frequently Asked Questions

Can I submit a paper that has been accepted in NeurIPS 2025 or previously published other conferences/journals?

No. Per NeurIPS 2025 workshop guidlines, workshops are not a venue for work that has been previously published in other conferences on machine learning or related fields. Work that is presented at the main NeurIPS conference should not appear in a workshop, including as part of an invited talk.

Can I submit a manuscript that is not accepted yet but is on arxiv and/or is currently under review?

Yes, the SpaVLE workshop is non-archival. Note that vision conferences such as CVPR and ICCV consider peer-reviewed workshop papers as publications if their length exceeds 4 pages (excluding references), even if they do not appear in a proceedings. If you are considering submitting your paper later to vision conferences, please consider submitting an abridged version as a short paper (up to 4 pages).

Can I volunteer as a reviewer for SpaVLE?

Yes! Please fill in this form and we will be in touch!

Contact

Address

111 Harbor Dr, San Diego, CA 92101

Join Slack

TBD

Email Us

spavle.committee@gmail.com