Verb Valency Frames Disambiquation

Semecký, Jiří

Verb Valency Frames Disambiquation

dc.creator	Semecký, Jiří
dc.date.accessioned	2021-05-19T16:40:49Z
dc.date.available	2021-05-19T16:40:49Z
dc.date.issued	2008
dc.identifier.uri	http://hdl.handle.net/20.500.11956/12165
dc.description.abstract	Semantic analysis has become a bottleneck of many natural language applications. Machine translation, automatic question answering, dialog management, and others rely on high quality semantic analysis. Verbs are central elements of clauses with strong influence on the realization of whole sentences. Therefore the semantic analysis of verbs plays a key role in the analysis of natural language. We believe that solid disambiguation of verb senses can boost the performance of many real-life applications. In this thesis, we investigate the potential of statistical disambiguation of verb senses. Each verb occurrence can be described by diverse types of information. We investigate which information is worth considering when determining the sense of verbs. Different types of classification methods are tested with regard to the topic. In particular, we compared the Na¨ive Bayes classifier, decision trees, rule-based method, maximum entropy, and support vector machines. The proposed methods are thoroughly evaluated on two different Czech corpora, VALEVAL and the Prague Dependency Treebank. Significant improvement over the baseline is observed.	en_US
dc.language	English	cs_CZ
dc.language.iso	en_US
dc.publisher	Univerzita Karlova, Matematicko-fyzikální fakulta	cs_CZ
dc.title	Verb Valency Frames Disambiquation	en_US
dc.type	rigorózní práce	cs_CZ
dcterms.created	2008
dcterms.dateAccepted	2008-01-14
dc.description.department	Institute of Formal and Applied Linguistics	en_US
dc.description.department	Ústav formální a aplikované lingvistiky	cs_CZ
dc.description.faculty	Faculty of Mathematics and Physics	en_US
dc.description.faculty	Matematicko-fyzikální fakulta	cs_CZ
dc.identifier.repId	50522
dc.title.translated	Verb Valency Frames Disambiquation	cs_CZ
dc.identifier.aleph	001138311
thesis.degree.name	RNDr.
thesis.degree.level	rigorózní řízení	cs_CZ
thesis.degree.discipline	Data Engineering	en_US
thesis.degree.discipline	Datové inženýrství	cs_CZ
thesis.degree.program	Informatics	en_US
thesis.degree.program	Informatika	cs_CZ
uk.thesis.type	rigorózní práce	cs_CZ
uk.taxonomy.organization-cs	Matematicko-fyzikální fakulta::Ústav formální a aplikované lingvistiky	cs_CZ
uk.taxonomy.organization-en	Faculty of Mathematics and Physics::Institute of Formal and Applied Linguistics	en_US
uk.faculty-name.cs	Matematicko-fyzikální fakulta	cs_CZ
uk.faculty-name.en	Faculty of Mathematics and Physics	en_US
uk.faculty-abbr.cs	MFF	cs_CZ
uk.degree-discipline.cs	Datové inženýrství	cs_CZ
uk.degree-discipline.en	Data Engineering	en_US
uk.degree-program.cs	Informatika	cs_CZ
uk.degree-program.en	Informatics	en_US
thesis.grade.cs	Uznáno	cs_CZ
thesis.grade.en	Recognized	en_US
uk.abstract.en	Semantic analysis has become a bottleneck of many natural language applications. Machine translation, automatic question answering, dialog management, and others rely on high quality semantic analysis. Verbs are central elements of clauses with strong influence on the realization of whole sentences. Therefore the semantic analysis of verbs plays a key role in the analysis of natural language. We believe that solid disambiguation of verb senses can boost the performance of many real-life applications. In this thesis, we investigate the potential of statistical disambiguation of verb senses. Each verb occurrence can be described by diverse types of information. We investigate which information is worth considering when determining the sense of verbs. Different types of classification methods are tested with regard to the topic. In particular, we compared the Na¨ive Bayes classifier, decision trees, rule-based method, maximum entropy, and support vector machines. The proposed methods are thoroughly evaluated on two different Czech corpora, VALEVAL and the Prague Dependency Treebank. Significant improvement over the baseline is observed.	en_US
uk.file-availability	V
uk.grantor	Univerzita Karlova, Matematicko-fyzikální fakulta, Ústav formální a aplikované lingvistiky	cs_CZ
thesis.grade.code	U
uk.publication-place	Praha	cs_CZ
uk.thesis.defenceStatus	U
dc.identifier.lisID	990011383110106986

Soubory tohoto záznamu

Název:: 150014488.pdf
Velikost:: 1.343Mb
Formát:: application/pdf
Popis:: Text práce

Zobrazit/otevřít

Název:: 150014489.pdf
Velikost:: 79.16Kb
Formát:: application/pdf
Popis:: Abstrakt

Zobrazit/otevřít

Název:: 150014490.pdf
Velikost:: 79.90Kb
Formát:: application/pdf
Popis:: Abstrakt (anglicky)

Zobrazit/otevřít

Název:: 150003341.pdf
Velikost:: 18.42Kb
Formát:: application/pdf
Popis:: Záznam o průběhu obhajoby

Zobrazit/otevřít

Tento záznam se objevuje v následujících sbírkách

Kvalifikační práce [11325]
Theses

Zobrazit minimální záznam