Evaluating the effectiveness of keyword search strategy for patent identification
The great usage of patent data in management of technology and innovation highlights the significance of patent identification. For identifying patents related to cross-disciplinary or integrated technologies or products that are emerging and cannot be clearly defined through patent classes or no definite related patent class exists in the patent system, keyword search is an appropriate identification method. However, literature has not explicitly addressed the effectiveness evaluation of keyword search strategy for patent identification, namely which parts of patent information (title, abstract, claims and description) should be used and how to select effective keywords to achieve a high level of effectiveness. This paper tries to fill this gap through evaluating the effectiveness of keyword search strategies of using different parts of patent information with different clusters of keywords, in the case of automotive software related patents issued by the USPTO. Clusters of keywords are selected through evaluating the effectiveness of each keyword by measuring recall and precision. Effectiveness of keyword search strategies of using different parts of patent information is evaluated with the indicators of type I error (missing patents that should be identified) and type II error (retrieving irrelevant patents). The results show that the most effective method of identifying patents in a specific domain through keyword search is using the patent information in the title, abstract and claims. This paper provides a principle of keywords selection for patent identification by using specific criteria of recall and precision.
Graphical abstractFigure optionsDownload full-size imageDownload as PowerPoint slideHighlights► Effectiveness of each selected keyword is tested in terms of recall and precision. ► Effectiveness of keyword search is evaluated by type I error and type II error. ► The most effective keyword search is using a patent's “title, abstract and claims”. ► It sheds new light upon the use of patent claims in patent identification. ► A patent's “description” is a noisy source in patent identification.
Journal: World Patent Information - Volume 35, Issue 1, March 2013, Pages 20–30