Portál TUL - Browse IS/STAG

Browse IS/STAG (S025)

Main menu for Browse IS/STAG

Search for a Thesis

Print/export:

Data export to PDF format - which you can print easily...

Bookmark this link in your browser so that you may quickly load this IS/STAG page in the future.

Not logged-in user will see only submitted theses.

Only logged-in user will see student personal numbers.

Dates found, count: 1

Search result paging

Found 1 records Print Export to xls List URL

Surname (Maiden name)	Name	Title	Thesis status		Supervisors	Reviewers	Type of thesis	Date of def.	Title
Student	Type of thesis	-	-	-	-	-	-	-	-	-	-
Černý	Jiří	Automatic detection of topics Automatic detection of topics			Drábková Jindra	-	Master thesis	11.06.2008	Automatic detection of topics
Jiří Černý	Master thesis	0XX	0XX	0XX	0XX	0XX	0XX	0XX	0XX	0XX	0XX

Thesis info Automatická detekce témat

Basic data

Annotation
The document you are accessing is protected by copyright law. Unauthorised use may lead to criminal sanctions.
Name	Černý Jiří
Acad. Yr.	2007/2008
Assigning department	ITE
Date of defence	Jun 11, 2008
Type of thesis	Master thesis
Thesis status	Thesis finished and defended successfully (DUO).
Completeness of mandatory entries	- The following mandatory fields are not filled in for this Thesis.: Title in English
Main topic	Automatická detekce témat
Main topic in English	Automatic detection of topics
Title according to student	Automatická detekce témat
English title as given by the student	-
Parallel name	Automatic detection of topics
Subtitle	-
Supervisor	Drábková Jindra, Ing. Ph.D.
Annotation	vyhledání a zhodnocení informací o automatické klasikaci dokumentů, seznámení s jazykem Perl a balíkem LWP pro potřeby práce s textovými dokumenty, nalezení klasikátorů v programu WEKA, porovnání různých metod klasikace a parametrizace textů.
Annotation in English	The aim of diploma thesis is to find sufficient sequence which can sort out unsigned text documents. It means to prepare a lot of training data for classifier learning. The fruitfulness of classifer is tested by the help of testing data. Newspaper articles from server zpravy.atlas.cz are used as a testing data. The first part of diploma thesis is about automatic detection theory. The second part of diploma thesis is about finding the classifier by the help of program WEKA. Data is processed by the help of programming language Perl and package LWP. Simple text isn't suitable for next processing. For this reason a global dictionary is created. Documents are converted into feature vectors. These vectors can be written by the help of different representation. In diploma thesis different sorts of representation are tested. Program WEKA is used for training classifiers, cluster analysis and select attributes. In this program different representation feature vectors and classifiers algorithms are tested.
Keywords	Perl, Weka, automatická klasifikace, klasifikátor, příznakový vektor, třídění dokumentů
Keywords in English	Perl, Weka, automatic classification, classifier, feature vector, sort out documents
Length of the covering note	102
Language	CZ
vyhledání a zhodnocení informací o automatické klasikaci dokumentů, seznámení s jazykem Perl a balíkem LWP pro potřeby práce s textovými dokumenty, nalezení klasikátorů v programu WEKA, porovnání různých metod klasikace a parametrizace textů.
Annotation in English
The aim of diploma thesis is to find sufficient sequence which can sort out unsigned text documents. It means to prepare a lot of training data for classifier learning. The fruitfulness of classifer is tested by the help of testing data. Newspaper articles from server zpravy.atlas.cz are used as a testing data. The first part of diploma thesis is about automatic detection theory. The second part of diploma thesis is about finding the classifier by the help of program WEKA. Data is processed by the help of programming language Perl and package LWP. Simple text isn't suitable for next processing. For this reason a global dictionary is created. Documents are converted into feature vectors. These vectors can be written by the help of different representation. In diploma thesis different sorts of representation are tested. Program WEKA is used for training classifiers, cluster analysis and select attributes. In this program different representation feature vectors and classifiers algorithms are tested.
Keywords
Perl, Weka, automatická klasifikace, klasifikátor, příznakový vektor, třídění dokumentů
Keywords in English
Perl, Weka, automatic classification, classifier, feature vector, sort out documents
Research Plan	-
Research Plan
-
Recommended resources	-
Recommended resources
-
Enclosed appendices	1 DVD
Appendices bound in thesis	-
Taken from the library	Yes
Full text of the thesis
Appendices
Reviewer's report
Supervisor's report
Defence procedure record	-
Defence procedure record file