Použití filtrovacích algoritmů ve shlukové analýze

Pacovský, Matěj

Use of filter algorithms in cluster analysis

dc.contributor.advisor	Antoch, Jaromír
dc.creator	Pacovský, Matěj
dc.date.accessioned	2017-05-07T19:31:25Z
dc.date.available	2017-05-07T19:31:25Z
dc.date.issued	2012
dc.identifier.uri	http://hdl.handle.net/20.500.11956/45940
dc.description.abstract	Práce je rozdělena do pěti kapitol. V prvních dvou kapitolách shrnuji sebrané poznatky o shlukové analýze dat, uvádím definice pojmů použitých v~práci a popisuji algoritmus k-průměrů. Ve třetí kapitole se zabývám filtrovacím algoritmem, který využívá filtrovací heuristiku během průchodu MRKD-stromem a tím urychluje algoritmus k-průměrů. Ve čtvrté kapitole popisuji algoritmus x-průměrů, který využívá všechny dosud zmíněné poznatky. V páté kapitole testuji všechny algoritmy na uměle vytvořených datech a na reálných datech z fyziky, přitom se v některých případech odkazuji na program WEKA, v němž je algoritmus x-průměrů naimplementován. Algoritmy o kterých pojednává tato práce jsou určeny pro objekty popsané pouze kvantitativními proměnnými. Jsou také vhodné k použití na velké datové soubory. Na přiloženém CD uvádím implementaci algoritmů v jazyku Matlab.	cs_CZ
dc.description.abstract	The thesis is divided into five chapters. In the first two chapters I give the overview of clustering data analysis, I present definitions of terms used in the work and describe the k-means algorithm. Third chapter focuses on the filtering algorithm that uses heuristics when algorithm pass throught the MRKD-tree. The fourth chapter describes the x-means algorithm that uses all of the above-mentioned findings. In the fifth chapter I test all algorithms both on artificial and real data from physics. In some cases I refer to the WEKA program where the x-means algorithm is implemented. Algoritms that are discussed in this thesis are intended only for objects described by quantitative variables. They are also suitable for large datasets. In the attached CD I present the implementation of algorithms in Matlab language.	en_US
dc.language	Čeština	cs_CZ
dc.language.iso	cs_CZ
dc.publisher	Univerzita Karlova, Matematicko-fyzikální fakulta	cs_CZ
dc.subject	Shluková analýza	cs_CZ
dc.subject	algoritmus k-průměrů	cs_CZ
dc.subject	filtrovací algoritmus	cs_CZ
dc.subject	algoritmus x-průměrů	cs_CZ
dc.subject	Cluster analysis	en_US
dc.subject	k-means algorithm	en_US
dc.subject	x-means algorithm	en_US
dc.subject	filtering algorithm	en_US
dc.title	Použití filtrovacích algoritmů ve shlukové analýze	cs_CZ
dc.type	bakalářská práce	cs_CZ
dcterms.created	2012
dcterms.dateAccepted	2012-06-18
dc.description.department	Department of Probability and Mathematical Statistics	en_US
dc.description.department	Katedra pravděpodobnosti a matematické statistiky	cs_CZ
dc.description.faculty	Faculty of Mathematics and Physics	en_US
dc.description.faculty	Matematicko-fyzikální fakulta	cs_CZ
dc.identifier.repId	107741
dc.title.translated	Use of filter algorithms in cluster analysis	en_US
dc.contributor.referee	Novák, Petr
dc.identifier.aleph	001479810
thesis.degree.name	Bc.
thesis.degree.level	bakalářské	cs_CZ
thesis.degree.discipline	Financial Mathematics	en_US
thesis.degree.discipline	Finanční matematika	cs_CZ
thesis.degree.program	Mathematics	en_US
thesis.degree.program	Matematika	cs_CZ
uk.thesis.type	bakalářská práce	cs_CZ
uk.taxonomy.organization-cs	Matematicko-fyzikální fakulta::Katedra pravděpodobnosti a matematické statistiky	cs_CZ
uk.taxonomy.organization-en	Faculty of Mathematics and Physics::Department of Probability and Mathematical Statistics	en_US
uk.faculty-name.cs	Matematicko-fyzikální fakulta	cs_CZ
uk.faculty-name.en	Faculty of Mathematics and Physics	en_US
uk.faculty-abbr.cs	MFF	cs_CZ
uk.degree-discipline.cs	Finanční matematika	cs_CZ
uk.degree-discipline.en	Financial Mathematics	en_US
uk.degree-program.cs	Matematika	cs_CZ
uk.degree-program.en	Mathematics	en_US
thesis.grade.cs	Velmi dobře	cs_CZ
thesis.grade.en	Very good	en_US
uk.abstract.cs	Práce je rozdělena do pěti kapitol. V prvních dvou kapitolách shrnuji sebrané poznatky o shlukové analýze dat, uvádím definice pojmů použitých v~práci a popisuji algoritmus k-průměrů. Ve třetí kapitole se zabývám filtrovacím algoritmem, který využívá filtrovací heuristiku během průchodu MRKD-stromem a tím urychluje algoritmus k-průměrů. Ve čtvrté kapitole popisuji algoritmus x-průměrů, který využívá všechny dosud zmíněné poznatky. V páté kapitole testuji všechny algoritmy na uměle vytvořených datech a na reálných datech z fyziky, přitom se v některých případech odkazuji na program WEKA, v němž je algoritmus x-průměrů naimplementován. Algoritmy o kterých pojednává tato práce jsou určeny pro objekty popsané pouze kvantitativními proměnnými. Jsou také vhodné k použití na velké datové soubory. Na přiloženém CD uvádím implementaci algoritmů v jazyku Matlab.	cs_CZ
uk.abstract.en	The thesis is divided into five chapters. In the first two chapters I give the overview of clustering data analysis, I present definitions of terms used in the work and describe the k-means algorithm. Third chapter focuses on the filtering algorithm that uses heuristics when algorithm pass throught the MRKD-tree. The fourth chapter describes the x-means algorithm that uses all of the above-mentioned findings. In the fifth chapter I test all algorithms both on artificial and real data from physics. In some cases I refer to the WEKA program where the x-means algorithm is implemented. Algoritms that are discussed in this thesis are intended only for objects described by quantitative variables. They are also suitable for large datasets. In the attached CD I present the implementation of algorithms in Matlab language.	en_US
uk.publication.place	Praha	cs_CZ
uk.grantor	Univerzita Karlova, Matematicko-fyzikální fakulta, Katedra pravděpodobnosti a matematické statistiky	cs_CZ
dc.identifier.lisID	990014798100106986