中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
An Objective Effect Evaluation Framework for Vectorization Models on Patent Semantic Similarity Measurement

文献类型:期刊论文

作者Li, Jiazheng2,3; Zhou, Jian1; Cao, Mengyun4
刊名ELECTRONICS
出版日期2025-10-15
卷号14期号:20页码:14
关键词comparative study language model patent semantic similarity measurement statistical hypothesis testing text vectorization
ISSN号2079-9292
DOI10.3390/electronics14204056
英文摘要How to objectively evaluate the effect of different vectorization models in measuring similarity between patents is a fundamental issue, which can help to select high-performance vectorization models to support advanced patent services. Based on the rank consistency index and hypothesis testing approach, a framework for evaluating the effect of different vectorization models on patents' similarity is proposed based on whether the model can accurately predict the similarity ranking of patents. Integrating the factors of time and technical field, an empirical study is conducted under the proposed framework to objectively evaluate the effect of six mainstream text vectorization models for assessing the semantic similarity of patents, which is evaluated based on Chinese patents (English Translation) from 2010 to 2024. The results show that the performance of Llama 2 is the best among six compared models in all years and in all technical fields. The proposed framework can objectively evaluate the similarity measurement effect of different vectorization models and provides a basis for the selection of the vectorization model for patent semantic similarity measurement for advanced patent services.
资助项目Natural Science Foundation of Fujian Province, China[2022J05157] ; Natural Science Foundation of Xiamen, China[3502Z20227049]
WOS研究方向Computer Science ; Engineering ; Physics
语种英语
WOS记录号WOS:001601463600001
出版者MDPI
源URL[http://119.78.100.204/handle/2XEOYT63/41616]  
专题中国科学院计算技术研究所期刊论文_英文
通讯作者Cao, Mengyun
作者单位1.Chinese Acad Sci, Natl Sci Lib, Beijing 100190, Peoples R China
2.Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China
3.Univ Chinese Acad Sci, Beijing 101408, Peoples R China
4.Jimei Univ, Coll Comp Engn, Xiamen 361021, Peoples R China
推荐引用方式
GB/T 7714
Li, Jiazheng,Zhou, Jian,Cao, Mengyun. An Objective Effect Evaluation Framework for Vectorization Models on Patent Semantic Similarity Measurement[J]. ELECTRONICS,2025,14(20):14.
APA Li, Jiazheng,Zhou, Jian,&Cao, Mengyun.(2025).An Objective Effect Evaluation Framework for Vectorization Models on Patent Semantic Similarity Measurement.ELECTRONICS,14(20),14.
MLA Li, Jiazheng,et al."An Objective Effect Evaluation Framework for Vectorization Models on Patent Semantic Similarity Measurement".ELECTRONICS 14.20(2025):14.

入库方式: OAI收割

来源:计算技术研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。