Lin Li
School of Computer Science and Technology, Wuhan University of Technology, Wuhan, 430070, Hubei
Shi Qiao
School of Computer Science and Technology, Wuhan University of Technology, Wuhan, 430070, Hubei
ShiliXiong
Department of Advertising, UniversityofIllinoisatUrbana-Champaign, Urbana, USA
ABSTRACT
Accompanied by the gradual integration of the microblogging in life, people would like to focus on the subjects they are interested in so that the technique of microblog retrieving becomes more and more popular. Currently, the engine of microblog retrieval uses list-view to show the retrieval results. Although considering the forwarding account and comment time it is still inconvenience for users to know the retrieval content in generally. This study puts forward the method that we organize the results according to the topic threads in order to improve the quality of microblog retrieval. Firstly, we made some earlier stage processing to the microblogs and then we gave out the Manual Sampling based Dynamic Incremental Clustering Algorithm. Finally we utilized the algorithm to extract the topic thread and show the result to the users. The data source is from Sina Weibo and it contains 14 topics and 74662 microblogs in total. The results show that, the algorithm is effective. Compared with the traditional k-means clustering it is 5 times faster and its precision is near.
PDF References Citation
How to cite this article
Lin Li, Shi Qiao and ShiliXiong, 2013. Topic Thread Extraction from Search Results of Microblogs. Information Technology Journal, 12: 4534-4538.
DOI: 10.3923/itj.2013.4534.4538
URL: https://scialert.net/abstract/?doi=itj.2013.4534.4538
DOI: 10.3923/itj.2013.4534.4538
URL: https://scialert.net/abstract/?doi=itj.2013.4534.4538
REFERENCES
- Efron, M., P. Organisciak and K. Fenlon, 2012. Improving retrieval of short texts through document expansion. Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, August 12-16, 2012, Portland, OR., USA., pp: 911-920.
CrossRef - Elsas, J.L. and J.G. Carbonell, 2009. It pays to be picky: An evaluation of thread retrieval in online forums. Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, July 19-23, 2009, Boston, MA., USA., pp: 714-715.
CrossRef - Hu, X., L. Tang, J.L. Tang and H. Liu, 2013. Exploiting social relations for sentiment analysis in microblogging. Proceedings of the 6th ACM International Conference on Web Search and Data, February 4-8, 2013, Rome, Italy, pp: 537-546.
CrossRef - Lin, C., C. Lin, J.X. Li, D.D. Wang, Y. Chen and T. Li, 2012. Generating event storylines from microblogs. Proceedings of the 21st ACM International Conference on Information and Knowledge Management, October 29-November 2, 2012, Maui, HI., USA., pp: 175-184.
CrossRef - Pervin, N., F. Fang, A. Datta, K. Dutta and D.E. Vandermeer, 2013. Fast, scalable and context-sensitive detection of trending topics in microblog post streams. ACM Trans. Manage. Inf. Syst., Vol. 3.
CrossRef - Qamra, A., B. Tseng and E.Y. Chang, 2006. Mining blog stories using community-based and temporal clustering. Proceedings of the 15th ACM International Conference on Information and Knowledge Management, November 5-11, 2006, Arlington, VA., USA., pp: 58-67.
CrossRef - Qureshi, M.A., C. O'Riordan and G. Pasi, 2012. Short-text domain specific key terms/phrases extraction using an n-gram model with wikipedia. Proceedings of the 21st ACM International Conference on Information and Knowledge Management, October 29-November 2, 2012, Maui, HI., USA., pp: 2515-2518.
CrossRef - Seo, J.W., W.B. Croft and D.A. Smith, 2009. Online community search using thread structure. Proceedings of the 18th ACM Conference on Information and Knowledge Management, November 2-6, 2009, Hong Kong, China, pp: 1907-1910.
CrossRef - Shen, D., Q. Yang, J.T. Sun and Z. Chen, 2006. Thread detection in dynamic text message streams. Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, August 6-11, 2006, Seattle, Washington, USA., pp: 35-42.
CrossRef - Sun, A., M. Hu and E.P. Lim, 2008. Searching blogs and news: A study on popular queries. Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 20-24, 2008, Singapore, pp: 729-730.
CrossRef - Vitale, D., P. Ferragina and U. Scaiella, 2012. Classification of Short Texts by Deploying Topical Annotations. In: Advances in Information Retrieval, Baeza-Yates, R., A.P. de Vries, H. Zaragoza, B.B. Cambazoglu, V. Murdock, R. Lempel, F. Silvestri (Eds.). Springer-Verlag, Berlin, Germany, pp: 376-387.
- Xi, W., J. Lind and E. Brill, 2004. Learning effective ranking functions for newsgroup search. Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 25-29, 2004, Sheffield, UK., pp: 394-401.
CrossRef