Disentangling Visual Priors: Unsupervised Learning of Scene Interpretations with Compositional Autoencoder

Krzysztof Krawiec; Antoni Nowinowski

doi:10.1007/978-3-031-71167-1_13

System Informacji Naukowej Politechniki Poznańskiej

PL EN

Strona główna / Publikacje / Disentangling Visual Priors: Unsupervised Learning of Scene Interpretations with Compositional Autoencoder

Zgłoś uwagę

Rozdział

Pobierz BibTeX

Tytuł

Disentangling Visual Priors: Unsupervised Learning of Scene Interpretations with Compositional Autoencoder

Autorzy

Krzysztof Krawiec (WIiT) ^{[ 1 ][ 2.3 ][ P ]}
Antoni Nowinowski (WIiT) ^{[ 2 ][ 2.3 ][ SzD ]}

^{[ 1 ]} Instytut Informatyki, Wydział Informatyki i Telekomunikacji, Politechnika Poznańska | ^{[ 2 ]} Wydział Informatyki i Telekomunikacji, Politechnika Poznańska | ^{[ P ]} pracownik | ^{[ SzD ]} doktorant ze Szkoły Doktorskiej

Dyscyplina naukowa (Ustawa 2.0)

[2.3] Informatyka techniczna i telekomunikacja

Rok publikacji

2024

Typ rozdziału

rozdział w monografii naukowej / referat

Język publikacji

angielski

Słowa kluczowe

EN

scene representation
image understanding
disentanglement

Streszczenie

EN Contemporary deep learning architectures lack principled means for capturing and handling fundamental visual concepts, like objects, shapes, geometric transforms, and other higher-level structures. We propose a neurosymbolic architecture that uses a domain-specific language to capture selected priors of image formation, including object shape, appearance, categorization, and geometric transforms. We express template programs in that language and learn their parameterization with features extracted from the scene by a convolutional neural network. When executed, the parameterized program produces geometric primitives which are rendered and assessed for correspondence with the scene content and trained via auto-association with gradient. We confront our approach with a baseline method on a synthetic benchmark and demonstrate its capacity to disentangle selected aspects of the image formation process, learn from small data, correct inference in the presence of noise, and out-of-sample generalization.

Data udostępnienia online

10.09.2024

Strony (od-do)

240 - 256

DOI

10.1007/978-3-031-71167-1_13

URL

https://link.springer.com/chapter/10.1007/978-3-031-71167-1_13

Książka

Neural-Symbolic Learning and Reasoning : 18th International Conference, NeSy 2024, Barcelona, Spain, September 9–12, 2024, Proceedings, Part I

Zaprezentowany na

18th International Conference on Neural-Symbolic Learning and Reasoning NeSy 2024, 9-12.09.2024, Barcelona, Spain