Subscribe Now Subscribe Today
Science Alert
 
FOLLOW US:     Facebook     Twitter
Blue
   
Curve Top
Research Journal of Information Technology
  Year: 2011 | Volume: 3 | Issue: 1 | Page No.: 44-52
DOI: 10.3923/rjit.2011.44.52
A New Term-ranking Approach that Supports Improved Searching in Literature Digital Libraries
Sulieman Bani-Ahmad and Ghadeer Al-Dweik

Abstract:
In example-based searching, users look for the set of most similar publications to a given one. This requires estimating similarities between publications. A tf.idf formula can be used to compute publication-to-publication text-based similarity, e.g., the Okapi BM25 formula. Studies show that augmenting the importance of search terms in the BM25 formulae improve similarity scores. To this end, we introduce a term-ranking technique and use it for improving publication similarity scores. The proposed term-ranking algorithm is a slight modification of the TextRank algorithm that utilizes the well-known PageRank algorithm to identify the important term/phrases within texts. The proposed approach considers the length of sentences to identify links between terms rather than considering fixed window size. We experimentally found that the proposed approach works well when paired with Okapi BM25.
 [Fulltext PDF]   [Fulltext HTML]   [XML: Abstract + References]   [References]   [View Citation]  [Report Citation]
 RELATED ARTICLES:
  •    Query Suggestion Generation Methods for Mobile Phones
  •    A Novel Document Ranking Algorithm That Supports Mobile Healthcare Information Access Effectiveness
  •    On Using the Research-Pyramid Model to Enhance Literature Digital Libraries
How to cite this article:

Sulieman Bani-Ahmad and Ghadeer Al-Dweik, 2011. A New Term-ranking Approach that Supports Improved Searching in Literature Digital Libraries. Research Journal of Information Technology, 3: 44-52.

DOI: 10.3923/rjit.2011.44.52

URL: https://scialert.net/abstract/?doi=rjit.2011.44.52

 
COMMENT ON THIS PAPER
 
 
 

 

 
 
 
 
 
 
 
 
 

 
 
 
 

Curve Bottom