Layout-induced Video Representation for Recognizing Agent-in-Place Actions

Yu, Ruichi; Wang, Hongcheng; Li, Ang; Zheng, Jingxiao; Morariu, Vlad I.; Davis, Larry S.

Computer Science > Computer Vision and Pattern Recognition

arXiv:1804.01429 (cs)

[Submitted on 4 Apr 2018 (v1), last revised 1 Apr 2019 (this version, v3)]

Title:Layout-induced Video Representation for Recognizing Agent-in-Place Actions

Authors:Ruichi Yu, Hongcheng Wang, Ang Li, Jingxiao Zheng, Vlad I. Morariu, Larry S. Davis

View PDF

Abstract:We address the recognition of agent-in-place actions, which are associated with agents who perform them and places where they occur, in the context of outdoor home surveillance. We introduce a representation of the geometry and topology of scene layouts so that a network can generalize from the layouts observed in the training set to unseen layouts in the test set. This Layout-Induced Video Representation (LIVR) abstracts away low-level appearance variance and encodes geometric and topological relationships of places in a specific scene layout. LIVR partitions the semantic features of a video clip into different places to force the network to learn place-based feature descriptions; to predict the confidence of each action, LIVR aggregates features from the place associated with an action and its adjacent places on the scene layout. We introduce the Agent-in-Place Action dataset to show that our method allows neural network models to generalize significantly better to unseen scenes.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1804.01429 [cs.CV]
	(or arXiv:1804.01429v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1804.01429

Submission history

From: Ruichi Yu [view email]
[v1] Wed, 4 Apr 2018 14:25:04 UTC (10,230 KB)
[v2] Tue, 20 Nov 2018 16:34:39 UTC (7,772 KB)
[v3] Mon, 1 Apr 2019 04:46:05 UTC (8,115 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2018-04

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Ruichi Yu
Hongcheng Wang
Ang Li
Jingxiao Zheng
Vlad I. Morariu

…

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Layout-induced Video Representation for Recognizing Agent-in-Place Actions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Layout-induced Video Representation for Recognizing Agent-in-Place Actions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators