DocumentCode
2113290
Title
Markov-Based Automatic Term Extraction
Author
Zhou, Zili ; Wang, Yanna ; Gu, Junzhong
Author_Institution
Comput. Sci. & Technol. Dept., East China Normal Univ., Shanghai
fYear
2008
fDate
18-18 Dec. 2008
Firstpage
86
Lastpage
89
Abstract
This paper presents an automatic term extraction method based on a Markov process. The method aims to extract multi-word domain terms from English corpora. The paper first proves that the term extraction process is a Markov chain and then gives the steps of the Markov-based method. To evaluate the method, we use a computer science corpus collected by Web crawlers and extract domain terms using the methods introduced in the paper. The experimental data show that our method outperforms other methods.
Keywords
Markov processes; information retrieval; English corpora; Markov chain; Markov process; Markov-based automatic term extraction; Web crawlers; multi-word domain term; Biomedical engineering; Computer science; Data mining; Educational institutions; Humans; Mutual information; Ontologies; Paper technology; Physics; Statistics; Markov; term extraction; transition probability;
fLanguage
English
Publisher
IEEE
Conference_Titel
Future BioMedical Information Engineering, 2008. FBIE '08. International Seminar on
Conference_Location
Wuhan, Hubei
Print_ISBN
978-0-7695-3561-6
Type
conf
DOI
10.1109/FBIE.2008.84
Filename
5076692