Compositionality and Disentanglement: Structured Representations in Simple Neural Models
Kompozicionalita a disentanglement: Strukturované reprezentace v jednoduchých neuronových modelech
dissertation thesis (DEFENDED)
View/ Open
Permanent link
http://hdl.handle.net/20.500.11956/207781Identifiers
Study Information System: 192781
Collections
- Kvalifikační práce [12077]
Author
Advisor
Referee
Smith, Kenny
Le, Phong
Faculty / Institute
Faculty of Mathematics and Physics
Discipline
Computational linguistics
Department
Institute of Formal and Applied Linguistics
Date of defense
30. 3. 2026
Publisher
Univerzita Karlova, Matematicko-fyzikální fakultaLanguage
English
Grade
Pass
Keywords (Czech)
kompozicionalita|zobecňování|simulace vývoje jazyka|disentanglementKeywords (English)
compositionality|generalization|emergent languages|disentanglementNázev práce: K : S ý ý Autor: M A Pracoviště: Ú í Vedoucí práce: . RND. P P, P.D., Ú í Abstrakt: Z , ý í č í ý (í ) čí (ý ). V ří ů: ří. K í čí . N ří í ů . P ěří í čí žý ěí ý . D , ž ší ňí, říě, ž čí . P í í í ů (), í, í ý í ě ě ý ů. S čí, ž í í, ž ý č í ý...
Title: Compositionality and Disentanglement: Structured Representations in Simple Neural Models Author: Michal Auersperger Department: Institute of Formal and Applied Linguistics Supervisor: doc. RNDr. Pavel Pecina, Ph.D., Institute of Formal and Applied Linguistics Abstract: We study conditions that encourage the emergence of structured neural represen- tations in communication games with discrete channels and in autoencoders with continuous channels. Across both settings, we examine two pressures: compres- sion, implemented through message length penalties and bounded activations, and training pressure, introduced through agent resetting and dropout. In signaling games, we propose grammar-based compositionality metrics derived from proba- bilistic context-free grammar induction and show that structured protocols support better generalization under constrained training conditions. For continuous repre- sentations, we introduce a vocabulary-based reconstruction measure that evaluates whether latent fragments can be recombined into valid codes. Experiments in con- trolled settings suggest that compression and training pressures promote represen- tations whose parts can be reused more systematically. Keywords: compositionality, generalization, emergent languages, disentan- glement iii
