Journal of Jilin University(Information Science Ed
Previous Articles Next Articles
LI Ling, REN Qing, FU Yuan, CHEN He, MEI Sheng-min
Online:
Published:
Abstract:
In order to process huge amount of data generated in the social network with efficiency and scalability, we designed the distributed TF-IDF (Term Frequency-Inverse Document Frequency) algorithm suitable for MapReduce, and implemented this algorithm on Hadoop. This algorithm extracts key words in user's weibo, in this way user's interest could be found, and the corresponding service could be recommended to the user. In order to verify the validity and scalability of the distributed TFIDF algorithm, the results of the distributed TF-IDF algorithm and TextRank algorithm was compared. The experimental results show that key words extracted by the distributed TF-IDF algorithm could represent characteristics of the user more accurately. By Contrasting the response time, it could be seen that the distributed TF -IDF algorithm has a good scalability.
Key words: hadoop, distributed TF-IDF algorithm, mapreduce, textrank algorithm
CLC Number:
LI Ling, REN Qing, FU Yuan, CHEN He, MEI Sheng-min. Algorithm for Social Network Recommendation Service Based on Hadoop[J].Journal of Jilin University(Information Science Ed, 2013, 31(4): 359-.
0 / / Recommend
Add to citation manager EndNote|Reference Manager|ProCite|BibTeX|RefWorks
URL: http://xuebao.jlu.edu.cn/xxb/EN/
http://xuebao.jlu.edu.cn/xxb/EN/Y2013/V31/I4/359
Cited