Abstracts

Add abstract

Want to add your dissertation abstract to this database? It only takes a minute!

Search abstract

Search for abstracts by subject, author or institution

Share this abstract

Holistic Scene Understanding and Goal-directed Multi-agent Event Parsing

by Yixin Chen

Institution: University of California
Department:
Degree:
Year: 2022
Keywords: Statistics
Posted: 3/25/2025
Record ID: 2321002
Full text PDF: https://escholarship.org/uc/item/3bb429zf


Abstract

Humans, even young infants, are adept at perceiving and understanding complex indoor scenes and events. Holistic scene understanding involves abundant aspects, including 3D human pose, objects, physical relations, functionality, etc. Besides the physical and functional configuration of the scene, interpreting human actions and goal-oriented tasks is a higher-level goal, and requires reasoning about the complex structures in activities along the temporal dimension. When multiple people are in the scene, collaborations and communications inevitably happen, in both verbal and non-verbal forms. Despite the recent remarkable progress in artificial intelligence, building an intelligent machine with human-like perception and reasoning capability for the aforementioned complex tasks remains a significant and challenging problem.In this dissertation, we study the holistic scene understanding and goal-directed multi-agent event parsing by identifying the critical problems from various perspectives. We first propose a framework for holistic 3D scene parsing and human pose estimation, with a particular focus on human-object interaction and physical commonsense reasoning. Contact information is critical in modeling the fine-grained human-object relations from visual cues. We demonstrate how to extract meaningful contact information from 2D images and its usefulness in 3D human pose estimation. Then we introduce our efforts in understanding goal-directed actions, concurrent multi-tasks, and collaborations among multi-agents. Finally, we investigate the two typical types of human communications by proposing a spatial and temporal model for shared attention and examining the power of both language and gesture under the embodied reference setting.

Add abstract

Want to add your dissertation abstract to this database? It only takes a minute!

Search abstract

Search for abstracts by subject, author or institution

Share this abstract

Featured Books

Book cover thumbnail image
Electric Cooperative Managers' Strategies to Enhan...
by White, Michael Edward
   
Book cover thumbnail image
The Filipina-South Floridian International Interne... Agency, Culture, and Paradox
by Haley, Pamela S.
   
Book cover thumbnail image
Bullied! Coping with Workplace Bullying
by Gattis, Vanessa M.
   
Book cover thumbnail image
Commodification of Sexual Labor Contribution of Internet Communities to Prostituti...
by Young, Jeffrey R.
   
Book cover thumbnail image
The Census of Warm Debris Disks in the Solar Neigh...
by Patel, Rahul I.
   
Book cover thumbnail image
Performance, Managerial Skill, and Factor Exposure...
by Avci, S. Burcu
   
Book cover thumbnail image
The Deritualization of Death Toward a Practical Theology of Caregiving for the ...
by Gibson, Charles Lynn
   
Book cover thumbnail image
Emotional Intelligence and Leadership Styles Exploring the Relationship between Emotional Intel...
by Olagundoye, Eniola O.
   
Book cover thumbnail image
Solution or Stalemate? Peace Process in Turkey, 2009-2013
by Yurtbay, Baturay
   
Book cover thumbnail image
Risk Factors and Business Models Understanding the Five Forces of Entrepreneurial R...
by Miles, D. Anthony