New multi-stage similarity measure for calculation of pairwise patent similarity in a patent citation network

Rodriguez Andrew; Kim Byunghoon; Turkoz Mehmet; Lee Jae Min; Coh Byoung Youl; Jeong Myong K<sup>*</sup>

doi:10.1007/s11192-015-1531-8

摘要

Being able to effectively measure similarity between patents in a complex patent citation network is a crucial task in understanding patent relatedness. In the past, techniques such as text mining and keyword analysis have been applied for patent similarity calculation. The drawback of these approaches is that they depend on word choice and writing style of authors. Most existing graph-based approaches use common neighbor-based measures, which only consider direct adjacency. In this work we propose new similarity measures for patents in a patent citation network using only the patent citation network structure. The proposed similarity measures leverage direct and indirect co-citation links between patents. A challenge is when some patents receive a large number of citations, thus are considered more similar to many other patents in the patent citation network. To overcome this challenge, we propose a normalization technique to account for the case where some pairs are ranked very similar to each other because they both are cited by many other patents. We validate our proposed similarity measures using US class codes for US patents and the well-known Jaccard similarity index. Experiments show that the proposed methods perform well when compared to the Jaccard similarity index.

出版日期2015-5

全文

访问全文

收藏分享被引(42) 浏览

更新时间：2024-05-09 00:00

New multi-stage similarity measure for calculation of pairwise patent similarity in a patent citation network

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友