Rozpoznávání a klasifikace učebnic pomocí hlubokého učení

Vondrák, David

Recognition and classification of textbooks by deep learning

diploma thesis (DEFENDED)

Permanent link

http://hdl.handle.net/20.500.11956/171774

Identifiers

Study Information System: 236467

Referee

Novák, Michal

Faculty / Institute

Faculty of Mathematics and Physics

Discipline

Software Systems

Department

Institute of Formal and Applied Linguistics

Date of defense

8. 2. 2022

Publisher

Univerzita Karlova, Matematicko-fyzikální fakulta

Language

Czech

Grade

Excellent

Keywords (Czech)

hluboké učení, klasifikace knih, neuronové sítě, učebnice

Keywords (English)

deep learning, book classification, neural networks, textbook

Abstract (Czech)

Cílem práce bylo použití hlubokého učení k rozpoznávání učebnic a jejich klasifikaci do vyučovacích předmětů a úrovní, a to na základě textových údajů, jako je název knihy, autor, nakladatel a stručný popis obsahu. Jako součást práce formulujeme vlastní definici učebnice, vytváříme dataset pomocí extrakce údajů ze zdrojů dostupných na internetu a ručně anotujeme trénovací a testovací množinu dat. Pro klasifikaci používáme jako baseline naivní bayesovský klasifikátor, z neuronových sítí pak konvoluční a rekurentní architekturu i jejich kombinace. Porovnáváme také různý způsob reprezentace dat vektory (tzv. word embedding) a dosažené výsledky podrobně analyzujeme. Výsledný nejlepší model dosahuje na testovacích datech u všech tří úloh vysokých úspěšností a nabízí se tak možnost jeho uplatnění v praxi.

Abstract (English)

The aim of the thesis was to use deep learning to recognize textbooks and to classify them by school subject and level, based on textual metadata such as the book's title, author, publisher, and a brief description of its content. As part of the thesis, we formulate our own definition of a textbook, create a dataset by extracting data from sources available on the internet, and manually annotate the training and test sets. For classification, we use a naive Bayes classifier as a baseline and then neural networks with convolutional, recurrent, and combined architectures. We also compare several ways of representing the data as vectors (word embeddings) and analyze the results in detail. The best resulting model reaches high accuracy on the test data in all three tasks, which suggests that it could be applied in practice.
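As a rough illustration of the kind of baseline the abstract mentions, the following is a minimal sketch of a multinomial naive Bayes classifier over the concatenated text fields. It is not the thesis's implementation: the field names, the whitespace tokenizer, the add-one smoothing, the `textbook`/`other` labels, and the toy records are all assumptions made for this example.

```python
import math
from collections import Counter, defaultdict


def tokenize(record):
    # Join the text fields named in the abstract into one lowercase token list.
    # The field names are hypothetical; missing fields are treated as empty.
    fields = ("title", "author", "publisher", "description")
    return " ".join(record.get(f, "") for f in fields).lower().split()


class NaiveBayes:
    """Multinomial naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, records, labels):
        self.class_counts = Counter(labels)
        self.token_counts = defaultdict(Counter)
        for record, label in zip(records, labels):
            self.token_counts[label].update(tokenize(record))
        self.vocab = {t for c in self.token_counts.values() for t in c}
        return self

    def predict(self, record):
        total = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n in self.class_counts.items():
            counts = self.token_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n / total)  # log prior of the class
            for tok in tokenize(record):
                if tok in self.vocab:  # ignore tokens never seen in training
                    score += math.log((counts[tok] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Invented toy records, not data from the thesis.
train = [
    {"title": "Algebra for secondary schools", "description": "exercises and equations"},
    {"title": "Physics textbook for grade 8", "description": "mechanics exercises"},
    {"title": "A history of Prague", "description": "illustrated guide"},
    {"title": "Collected detective stories", "description": "crime fiction"},
]
labels = ["textbook", "textbook", "other", "other"]

model = NaiveBayes().fit(train, labels)
prediction = model.predict({"title": "Geometry exercises for grade 8"})
```

The same structure would extend to the subject and level tasks by swapping the label set; the neural models described in the abstract would replace the token counts with learned word embeddings.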
