
NeurIPS 2025 Workshop on
SPACE in Vision, Language, and Embodied AI (SpaVLE)
6/7 December 2025, San Diego Convention Center, San Diego, USA
About SpaVLE
The SpaVLE workshop aims to bridge the historically siloed efforts of NLP, CV, and robotics communities by fostering cross-disciplinary dialogue to advance research on spatial understanding and representation.
Where
San Diego Convention Center, San Diego, USA
When
Saturday/Sunday
6/7 December 2025
Overview
We never directly "see" space, but we perceive and reconstruct it, describe it through language, and plan our actions in accordance with its constraints.
Background
At the core of embodied cognition lies the challenge of understanding and reasoning about space, its representations, dynamics, how it forms from and constrains visual perception, language communication, action, and interaction with the environment. Spatial representations are inherently multimodal, integrating vision, language, and motor control. They must capture both 2D and 3D spatial structures, from local object relationships to global scene geometry. Although we live in a dynamic 3D world, the images and videos we perceive offer only partial, static glimpses of this environment. Reconstructing an accurate, temporally coherent model of the 3D world from such limited observations remains a central challenge in computer vision. Spatial reasoning is equally critical for language understanding, especially in human-robot interaction, where interpreting grounded references to objects and locations is essential for effective communication. For embodied agents, building robust spatial representations is vital to support real-world navigation, manipulation, and dialogue. Despite advances in data-driven pretraining across 2D/3D vision, language, and action modalities, spatial understanding remains an open challenge. For example, (vision-)language models still fall short of human-level spatial reasoning, often struggling to interpret the semantics of spatial expressions, particularly when perspective, memory, or embodiment constraints are involved.
Scope and Goal
This workshop is particularly interested in how spatial representations can be learned from multimodal data, and applied to core tasks in computer vision (CV), natural language processing (NLP), and robotics. Spatial understanding has historically been pursued separately by these communities, each adopting distinct approaches and problem formulations. The goal of this joint workshop is to provide a platform for much-needed cross-disciplinary dialogue, advancing research on spatial understanding and representation by bringing together diverse perspectives. We aim to foster discussion on how spatial representations, whether symbolic, neural, verbal, or geometric, can be learned, evaluated, and deployed across modalities and tasks. A key focus is aligning these approaches with the demands of real-world applications and addressing practical challenges such as the high cost of real-robot experiments, the scarcity of multimodal 3D and embodied data, and the complexities of human-in-the-loop evaluation and non-verbal communication.
Call for Papers
Topics of Interest
The scope and topics include, but are not limited to:
- Foundations of Spatial Representation and Reasoning. Models and formalisms for representing space, including symbolic, geometric, neural, or hybrid approaches, and reasoning about objects, spatial relations, and dynamics in 2D and 3D environments.
- Multimodal Spatial Grounding. Learning and aligning spatial concepts across language, vision (2D / 3D), and action modalities, including spatial representation learning, cross-modal fusion, and spatially informed planning.
- Applications in NLP, Vision, Robotics, and Generative AI. Applying spatial understanding to tasks such as instruction following, embodied manipulation, situated dialogue, scene understanding, and spatially grounded generation (e.g., image/video synthesis, layout prediction, or simulation).
- Evaluation and Benchmarking Spatial Intelligence. Developing datasets, tasks, and metrics for assessing spatial reasoning, generative fidelity, grounded communication, and skill learning in embodied or interactive contexts.
- Spatial Reasoning in Foundation Models. Examining how large language and vision-language models handle spatial understanding, identifying current limitations, and exploring methods to integrate spatial priors or structural biases for improved generalization and robustness.
Submission Format
We welcome (1) Short papers up to 4 pages and (2) Full papers up to 9 pages.
- Long Research Paper (up to 9 pages)
- Short Research Paper (up to 4 pages)
- Dataset/Benchmark Paper (up to 9 pages, authors are strongly encouraged to follow the Datasets & Benchmarks Policies)
- Survey/Review Paper (up to 9 pages)
- Position Paper (up to 4 pages)
The page limit excludes references and supplementary materials. Submissions must be prepared in PDF format using the NeurIPS 2025 style template through OpenReview.
- Submission Protocol: OpenReview
- Submission Template: Zip
- Archival Option: All accepted papers will be non-archival.
Important Dates
- Submission Open: July 7, 2025
- Submission Deadline: August 22, 2025
- Notification of Acceptance: September 22, 2025
- Camera-Ready Deadline: October 25, 2025
- Workshop Day: December 6 or 7, 2025 (co-located with NeurIPS 2025)
Keynote Speakers (Alphabetical Order)

Amir Zadeh
Lambda Labs
Barbara Landau
Johns Hopkins University
Dieter Fox
University of Washington / Allen Institute for AI
Joshua Tenenbaum
Massachusetts Institute of Technology
Joyce Chai
University of Michigan
Ranjay Krishna
University of Washington / Allen Institute for AI
Saining Xie
New York UniversityConfirmed Panelists (Alphabetical Order)

Anthony G. Cohn
University of LeedsOrganizing Committee

Martin Ziqiao Ma
University of Michigan
Freda Shi
University of Waterloo / Vector Institute
Jiayuan Mao
Massachusetts Institute of Technology
Jiafei Duan
University of Washington
Manling Li
Northwestern University
David Hsu
National University of Singapore
Parisa Kordjamshidi
Michigan State UniversityEvent Schedule
(Tentative Schedule)
Welcome
Opening Remarks.

Keynote Joshua Tenenbaum
Title: TBD

Keynote Barbara Landau
Title: TBD
Poster Presentations
Poster session 1 and coffee break.

Keynote Joyce Chai
Title: TBD
Oral Presentations
Oral session 1 (2 papers).
Panel Discussion
Panel discussions with lunch break.

Keynote Ranjay Krishna
Title: TBD

Keynote Saining Xie
Title: TBD
Poster Presentations
Poster session 2 and coffee break.

Keynote Dieter Fox
Title: TBD

Keynote Amir Zadeh
Title: TBD
Oral Presentations
Oral session 2 (4 papers).
Closing Remarks
Free discussions and paper award annoucement.

Keynote Joshua Tenenbaum
Title: TBD

Keynote Barbara Landau
Title: TBD

Keynote Joyce Chai
Title: TBD
Panel Discussion
Panel discussions with lunch break.

Keynote Ranjay Krishna
Title: TBD

Keynote Saining Xie
Title: TBD

Keynote Dieter Fox
Title: TBD

Keynote Amir Zadeh
Title: TBD
Poster Presentations
Poster session 1 and coffee break.
Oral Presentations
Oral session 1 (2 papers).
Poster Presentations
Poster session 2 and coffee break.
Oral Presentations
Oral session 2 (4 papers).
Workshop Venue
San Diego Convention Center
Room: TBD
111 Harbor Dr, San Diego, CA 92101
Accepted Papers
To be announced.
Sponsors
We are seeking additional sponsors to support the workshop and the community for traveling and registration. If you are interested in sponsoring, please contact us.
We are pleased that Lambda, Alquist Robotics, EdenSign will sponsor our workshop. In support of research excellence and community engagement, we will provide $3,000 in cloud credits for the Best Paper Award, and $1,500 in cloud credits for the Runner-Up Paper Award.



Frequently Asked Questions
Can I submit a paper that has been accepted in NeurIPS 2025 or previously published other conferences/journals?
No. Per NeurIPS 2025 workshop guidlines, workshops are not a venue for work that has been previously published in other conferences on machine learning or related fields. Work that is presented at the main NeurIPS conference should not appear in a workshop, including as part of an invited talk.
Can I submit a manuscript that is not accepted yet but is on arxiv and/or is currently under review?
Yes, the SpaVLE workshop is non-archival. Note that vision conferences such as CVPR and ICCV consider peer-reviewed workshop papers as publications if their length exceeds 4 pages (excluding references), even if they do not appear in a proceedings. If you are considering submitting your paper later to vision conferences, please consider submitting an abridged version as a short paper (up to 4 pages).
Can I volunteer as a reviewer for SpaVLE?
Yes! Please fill in this form and we will be in touch!
Contact
Address
111 Harbor Dr, San Diego, CA 92101
Join Slack
TBD
Email Us
spavle.committee@gmail.com