Search. Read. Cite.

Easy to search. Easy to read. Easy to cite with credible sources.

Journal of Applied Sciences

Year: 2009  |  Volume: 9  |  Issue: 4  |  Page No.: 794 - 798

Knowledge Acquisition from Textual Documents for the Construction of Medicinal Herbs Domain Ontology

I. Zaharudin, S.A. Noah and M.M. Noor

Abstract

In this study a semi automatic acquisition of domain relevant terms from digital documents in e-newspaper related to Malaysian medicinal herbs is presented. This study proposes (1) TFIDF-based term classification method for acquiring single word terms, (2) recognition of multi-word using TerMine software to acquire multiword terms and (3) Hearst`s methodology of acquiring semantic relationships of hyponym. The results show the benefits of using these methods in selecting relevant terms from domain specific corpus. From this study it is believed that the combination of these three methods might be helpful to select relevant terms as well as minimize the effort to discard irrelevant terms manually from wide collection of terms from the corpus.

Cited References Fulltext