| dc.contributor.advisor | Sychrovský, David | |
| dc.creator | Černý, Jakub | |
| dc.date.accessioned | 2025-09-26T09:46:19Z | |
| dc.date.available | 2025-09-26T09:46:19Z | |
| dc.date.issued | 2025 | |
| dc.identifier.uri | http://hdl.handle.net/20.500.11956/202629 | |
| dc.description.abstract | Despite deep learning's remarkable empirical success, its theoretical un- derpinnings lag behind. Information theory provides a powerful framework for analyzing internal network representations, particularly through recent advances in information bottleneck (IB) theory and the information plane. This thesis investigates how the structure of the information plane, specifi- cally the clustering behavior of internal representations, influences neural net- work performance. We introduce Purity theory, a novel framework for quan- tifying layer-wise clustering, complementing established IB perspectives. Our analysis reveals a significant correlation between information plane structure and generalization performance in binary classification tasks. Leveraging this correlation, we propose a new information-theoretic metric that effectively predicts model generalization capability. Furthermore, we develop a model selection algorithm based on this metric, which demonstrably outperforms selection based solely on training loss. | en_US |
| dc.description.abstract | Pr ̌es pozoruhodny ́ empiricky ́ u 'spe ̌ch hluboke ́ho uc ̌enı ́(deep learning) jeho teoreticke ́ za ́klady zaosta ́vajı ́. Teorie informace poskytuje u ́c ̌inny ́ ra ́mec pro analy ́zu vnitr ̌nı ́ch reprezentacı ́ v sı ́tı ́ch, zejme ́na dı ́ky neda ́vne ́mu pokroku v teorii informac ̌nı ́ho hrdla (information bottleneck, IB) a konceptu infor- mac ̌nı ́roviny. Tato pra ́ce zkouma ́, jak struktura informac ̌nı ́roviny, konkre ́tne ̌ shlukova ́nı ́vnitr ̌nı ́ch reprezentacı ́, ovlivn ̌uje vy ́kon neuronovy ́ch sı ́tı ́. Pr ̌edstavujeme Purity teorii, novy ́ ra ́mec pro kvantifikaci shlukova ́nı ́reprezentacı ́po vrstva ́ch, ktery ́ dopln ̌uje sta ́vajı ́cı ́ IB perspektivy. Nas ̌e analy ́za odhaluje vy ́znamnou korelaci mezi strukturou informac ̌nı ́roviny a schopnostı ́generalizace v u ́loha ́ch bina ́rnı ́ klasifikace. Na za ́klade ̌ te ́to korelace navrhujeme novou metriku zaloz ̌enou na teorii informace, ktera ́ u ́c ̌inne ̌ pr ̌edpovı ́da ́ schopnost modelu generalizovat. Da ́le vyvı ́jı ́me algoritmus pro vy ́be ̌r modelu vyuz ̌ı ́vajı ́cı ́ tuto metriku, ktery ́ prokazatelne ̌ pr ̌ekona ́va ́ vy ́be ̌r zaloz ̌eny ́ vy ́hradne ̌ na tre ́novacı ́ ztra ́te ̌ (train loss). | cs_CZ |
| dc.language | Čeština | cs_CZ |
| dc.language.iso | cs_CZ | |
| dc.publisher | Univerzita Karlova, Matematicko-fyzikální fakulta | cs_CZ |
| dc.subject | Hluboké Učení|Strojové Učení|Teorie Informačního Hrdla|Neuronové Sítě | cs_CZ |
| dc.subject | Deep Learning|information bottleneck theory|Machine Learning|Neural Networks | en_US |
| dc.title | Aplikace teorie informace na studium učení hlubokých neuronových sítí | cs_CZ |
| dc.type | bakalářská práce | cs_CZ |
| dcterms.created | 2025 | |
| dcterms.dateAccepted | 2025-09-05 | |
| dc.description.department | Katedra aplikované matematiky | cs_CZ |
| dc.description.department | Department of Applied Mathematics | en_US |
| dc.description.faculty | Faculty of Mathematics and Physics | en_US |
| dc.description.faculty | Matematicko-fyzikální fakulta | cs_CZ |
| dc.identifier.repId | 282204 | |
| dc.title.translated | Aplications of information theory to the study of deep learning | en_US |
| dc.contributor.referee | Schmid, Martin | |
| thesis.degree.name | Bc. | |
| thesis.degree.level | bakalářské | cs_CZ |
| thesis.degree.discipline | Mathematics for Information Technologies | en_US |
| thesis.degree.discipline | Matematika pro informační technologie | cs_CZ |
| thesis.degree.program | Matematika pro informační technologie | cs_CZ |
| thesis.degree.program | Mathematics for Information Technologies | en_US |
| uk.thesis.type | bakalářská práce | cs_CZ |
| uk.taxonomy.organization-cs | Matematicko-fyzikální fakulta::Katedra aplikované matematiky | cs_CZ |
| uk.taxonomy.organization-en | Faculty of Mathematics and Physics::Department of Applied Mathematics | en_US |
| uk.faculty-name.cs | Matematicko-fyzikální fakulta | cs_CZ |
| uk.faculty-name.en | Faculty of Mathematics and Physics | en_US |
| uk.faculty-abbr.cs | MFF | cs_CZ |
| uk.degree-discipline.cs | Matematika pro informační technologie | cs_CZ |
| uk.degree-discipline.en | Mathematics for Information Technologies | en_US |
| uk.degree-program.cs | Matematika pro informační technologie | cs_CZ |
| uk.degree-program.en | Mathematics for Information Technologies | en_US |
| thesis.grade.cs | Výborně | cs_CZ |
| thesis.grade.en | Excellent | en_US |
| uk.abstract.cs | Pr ̌es pozoruhodny ́ empiricky ́ u 'spe ̌ch hluboke ́ho uc ̌enı ́(deep learning) jeho teoreticke ́ za ́klady zaosta ́vajı ́. Teorie informace poskytuje u ́c ̌inny ́ ra ́mec pro analy ́zu vnitr ̌nı ́ch reprezentacı ́ v sı ́tı ́ch, zejme ́na dı ́ky neda ́vne ́mu pokroku v teorii informac ̌nı ́ho hrdla (information bottleneck, IB) a konceptu infor- mac ̌nı ́roviny. Tato pra ́ce zkouma ́, jak struktura informac ̌nı ́roviny, konkre ́tne ̌ shlukova ́nı ́vnitr ̌nı ́ch reprezentacı ́, ovlivn ̌uje vy ́kon neuronovy ́ch sı ́tı ́. Pr ̌edstavujeme Purity teorii, novy ́ ra ́mec pro kvantifikaci shlukova ́nı ́reprezentacı ́po vrstva ́ch, ktery ́ dopln ̌uje sta ́vajı ́cı ́ IB perspektivy. Nas ̌e analy ́za odhaluje vy ́znamnou korelaci mezi strukturou informac ̌nı ́roviny a schopnostı ́generalizace v u ́loha ́ch bina ́rnı ́ klasifikace. Na za ́klade ̌ te ́to korelace navrhujeme novou metriku zaloz ̌enou na teorii informace, ktera ́ u ́c ̌inne ̌ pr ̌edpovı ́da ́ schopnost modelu generalizovat. Da ́le vyvı ́jı ́me algoritmus pro vy ́be ̌r modelu vyuz ̌ı ́vajı ́cı ́ tuto metriku, ktery ́ prokazatelne ̌ pr ̌ekona ́va ́ vy ́be ̌r zaloz ̌eny ́ vy ́hradne ̌ na tre ́novacı ́ ztra ́te ̌ (train loss). | cs_CZ |
| uk.abstract.en | Despite deep learning's remarkable empirical success, its theoretical un- derpinnings lag behind. Information theory provides a powerful framework for analyzing internal network representations, particularly through recent advances in information bottleneck (IB) theory and the information plane. This thesis investigates how the structure of the information plane, specifi- cally the clustering behavior of internal representations, influences neural net- work performance. We introduce Purity theory, a novel framework for quan- tifying layer-wise clustering, complementing established IB perspectives. Our analysis reveals a significant correlation between information plane structure and generalization performance in binary classification tasks. Leveraging this correlation, we propose a new information-theoretic metric that effectively predicts model generalization capability. Furthermore, we develop a model selection algorithm based on this metric, which demonstrably outperforms selection based solely on training loss. | en_US |
| uk.file-availability | V | |
| uk.grantor | Univerzita Karlova, Matematicko-fyzikální fakulta, Katedra aplikované matematiky | cs_CZ |
| thesis.grade.code | 1 | |
| uk.publication-place | Praha | cs_CZ |
| uk.thesis.defenceStatus | O | |