Journal of Jilin University (Information Science Edition) ›› 2021, Vol. 39 ›› Issue (3): 339-347.

Research on Measurement and Influencing Factors of Information Narrowing Based on Word2vec

XU Xiang, JIN Qing

  1. School of Art and Media, Tongji University, Shanghai 200000, China
  • Received: 2020-09-15 Online: 2021-05-24 Published: 2021-05-25
  • About the author: XU Xiang (b. 1983), male, born in Shangrao, Jiangxi; professor at Tongji University, Ph.D.; research focuses on social media mining and user data mining. (Tel) 86-18049932369 (E-mail) xuxiang210089@163.com

Abstract: To clarify how social media use relates to users' information narrowing (the "information cocoon") and how strong that relationship is, this study takes Sina Weibo, a representative social media platform, as its case (N = 7,825) and analyzes the information narrowing that accompanies practical indicators of Weibo usage, activity, and influence. Narrowing in user content is examined empirically from two angles across these indicators, using Word2vec, a word-embedding technique from natural language processing (NLP), and k-means clustering to assess the semantic similarity of user-generated content (UGC) and the distribution of content categories. Paired-samples t-tests show that the higher a user tier's level of Weibo use, the higher the semantic self-similarity of its content and the lower the evenness and richness of its content categories. These results invite a rethinking of the relationship between social media usage and the information cocoon: higher user tiers do not enjoy a more flexible discourse space; rather, more deeply engaged, higher-tier users suffer more from similar content.
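The abstract names its measures (semantic self-similarity, and the richness and evenness of content categories, compared with a paired t-test) but does not give their formulas. The sketch below shows one plausible way to compute such quantities. Everything in it is an assumption made for illustration: the function names, the use of mean pairwise cosine similarity, the choice of Pielou evenness, and the premise that each post has already been turned into a vector (for example by averaging Word2vec word vectors) and assigned a cluster label (for example by k-means). It is not the authors' implementation.

```python
import math
from collections import Counter
from itertools import combinations


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def self_similarity(post_vectors):
    """Assumed measure: mean pairwise cosine similarity of one user's post vectors."""
    pairs = list(combinations(post_vectors, 2))
    if not pairs:
        return 0.0
    return sum(cosine(u, v) for u, v in pairs) / len(pairs)


def richness(labels):
    """Assumed measure: number of distinct content categories a user posts in."""
    return len(set(labels))


def evenness(labels):
    """Assumed measure: Pielou evenness, Shannon entropy divided by log(#categories)."""
    counts = Counter(labels)
    k = len(counts)
    if k < 2:
        return 0.0  # a single category (or no posts) has no spread to be even over
    n = len(labels)
    entropy = -sum((c / n) * math.log(c / n) for c in counts.values())
    return entropy / math.log(k)


def paired_t(xs, ys):
    """Paired-samples t statistic for the mean of the differences xs[i] - ys[i]."""
    diffs = [x - y for x, y in zip(xs, ys)]
    n = len(diffs)
    if n < 2:
        raise ValueError("need at least two pairs")
    mean = sum(diffs) / n
    var = sum((d - mean) ** 2 for d in diffs) / (n - 1)
    if var == 0.0:
        raise ValueError("differences are constant; t is undefined")
    return mean / math.sqrt(var / n)


# Toy data (hypothetical): a "narrow" user whose posts point the same way,
# and a "broad" user whose posts point in different directions.
narrow = [[1.0, 0.0], [0.9, 0.1], [1.0, 0.05]]
broad = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
print(self_similarity(narrow) > self_similarity(broad))
print(richness([0, 0, 0, 1]), evenness([0, 0, 0, 1]))
```

Under these assumptions, a user whose content narrows would show a higher self-similarity together with lower richness and evenness, which is the direction of the effect the abstract reports for higher usage tiers. In practice the vectors, cluster labels, and test would more likely come from libraries such as gensim, scikit-learn, and SciPy; plain Python is used here only to keep the sketch self-contained.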

