Overview of Patent Retrieval Task at NTCIR-4 Atsushi Fujii (Univ. - - PowerPoint PPT Presentation

overview of patent retrieval task at ntcir 4
SMART_READER_LITE
LIVE PREVIEW

Overview of Patent Retrieval Task at NTCIR-4 Atsushi Fujii (Univ. - - PowerPoint PPT Presentation

Overview of Patent Retrieval Task at NTCIR-4 Atsushi Fujii (Univ. of Tsukuba) Makoto Iwayama (Hitaci, Ltd.) Noriko Kando (National Inst. of Informatics) Introduction Large test collections for Human Language Technology (HLT) have been


slide-1
SLIDE 1

Overview of Patent Retrieval Task at NTCIR-4

Atsushi Fujii (Univ. of Tsukuba) Makoto Iwayama (Hitaci, Ltd.) Noriko Kando (National Inst. of Informatics)

slide-2
SLIDE 2

2

Introduction

  • Large test collections for Human Language

Technology (HLT) have been produced in TREC, CLEF, NTCIR

– Targets are newspaper, technical paper, Web

  • Commercial patent retrieval systems have
  • perated for a long time
  • But, less attention in the HLT research

community

slide-3
SLIDE 3

3

NTCIR-3 Workshop (2001-2002)

  • In NTCIR-3, the first effort was made to

produce test collection for patent IR

– technology survey – requested to search for patents related to a specific technology (e.g., gasoline direct- injection engine)

  • But, process of patent IR differs depending
  • n the purpose

– technology survey, invalidity search, etc.

  • We performed a different task in NTCIR-4
slide-4
SLIDE 4

4

NTCIR-4 Workshop (2003-2004)

  • NTCIR workshop is in one and half years

– difficult to explore long-term research topics

  • Two different patent tasks were performed

– invalidity search task: short-term – patent map generation task: long-term

  • feasibility study (FS) task

focus of today’s talk

slide-5
SLIDE 5

5

NTCIR-4 Workshop (2003-2004)

  • NTCIR workshop is in one and half years

– difficult to explore long-term research topics

  • Two different patent tasks were performed

– invalidity search task: short-term – patent map generation task: long-term

  • feasibility study (FS) task
slide-6
SLIDE 6

6

Invalidity search task

  • Find the patents that can invalidate the demand in

a patent application (claim)

– given a patent claim, each group searches a collection for patents similar to the claim

  • This task is usually performed by

– examiners in a government patent office – searchers of IP division in private companies

  • This can be seen as patent-to-patent associative

retrieval

– both queries and documents are patents

slide-7
SLIDE 7

7

Process of producing test collection

system1 system2 pooling relevance judgment evaluation assessors (human experts)

search target (doc. collection)

search results2 search results1 pooled results search topic Test Collection runs relevant docs. relevant docs. preliminary search

slide-8
SLIDE 8

8

Process of producing test collection

system1 system2 pooling relevance judgment evaluation assessors (human experts)

search target (doc. collection)

search results2 search results1 pooled results search topic runs relevant docs. relevant docs. preliminary search

slide-9
SLIDE 9

9

  • Unexamined patent application

– Japanese full text published in 1993-1997 – 1.7M documents (40GB)

  • JAPIO Patent Abstract

– professional abstracts – length is standardized in approx. 400 characters – vocabulary is controlled

  • Patent Abstracts of Japan (PAJ)

– English translations of JAPIO Abstract

Document collection

editing translation provided for NTCIR-4

slide-10
SLIDE 10

10

Process of producing test collection

system1 system2 pooling relevance judgment evaluation assessors (human experts)

search target (doc. collection)

search results2 search results1 pooled results search topic runs relevant docs. relevant docs. preliminary search

slide-11
SLIDE 11

11

Search topics

  • Japanese patent application rejected by Japanese

Patent Office (JPO)

– at least one relevant document exists

  • 34 topics were selected by members of “Japan

Intellectual Property Association” (JIPA)

– patent search experts in IP division – also in charge of relevance judgment

  • English, Korean, and simplified/traditional Chinese

translations for cross-language patent IR

slide-12
SLIDE 12

12

Search topics (cont.)

  • In preliminary study, the number of relevant

documents for a topic was small (< 10)

  • Evaluation results obtained with our

collection can potentially be unreliable

  • QA task overcomes this problem by

increasing the number of questions (> 100)

  • So, we produced additional topics
slide-13
SLIDE 13

13

Additional topics

  • We produced 69 additional topics
  • Additional topics are also Japanese patent

applications rejected by JPO

  • We used only the citations provided by JPO

as relevant documents

– no additional human judgments were needed

slide-14
SLIDE 14

14

Example search topic

<TOPIC> <NUM>008</NUM> <LANG>EN</LANG> <FDATE>19960527</FDATE> <CLAIM>(Claim 1) A sensor device, characterized in that an open recessed part is formed on a box-shaped forming base, a conductive film of a designated pattern is formed on the surface

  • f the forming base including the inner surface of the recessed

part, an element for a sensor is bonded to the recessed part, and the forming base is closed with a cover.</CLAIM> ... </TOPIC> Relevant documents must be prior art, which had been open to the public before the topic patent was filed

Target for invalidation

Date of filing (May 27, 1996)

slide-15
SLIDE 15

15

Process of producing test collection

system1 system2 pooling relevance judgment evaluation assessors (human experts)

search target (doc. collection)

search results2 search results1 pooled results search topic runs relevant docs. relevant docs. preliminary search

slide-16
SLIDE 16

16

Search results

  • For each topic, top 1000 documents are

sorted according to the relevance degree

  • For each document, passages are also sorted

– document retrieval and passage retrieval were performed

  • Passages are paragraphs determined by

applicants

  • 110 results were submitted from 8 groups
slide-17
SLIDE 17

17

Example retrieval result

0001 890 1993-123456-5 1 9999 ntc1 0001 870 1993-123456-3 1 9999 ntc1 0001 860 1993-123456-0 1 9999 ntc1 0001 850 1993-123456-12 1 9999 ntc1 0001 990 1995-384359-23 2 9998 ntc1 0001 980 1995-384359-2 2 9998 ntc1 0001 970 1995-384359-8 2 9998 ntc1 0002 890 1994-000002-3 1 9999 ntc1 0002 850 1994-000002-1 1 9999 ntc1 ... Topic Passage score Document ID Document rank Document score System ID

slide-18
SLIDE 18

18

Process of producing test collection

system1 system2 pooling relevance judgment evaluation assessors (human experts)

search target (doc. collection)

search results2 search results1 pooled results search topic runs relevant docs. relevant docs. preliminary search

slide-19
SLIDE 19

19

Relevance judgment

  • Document-based relevant judgment was

performed based on the following two ranks

– A: patent that can invalidate topic claim – B: patent that can invalidate topic claim, when used with other patents (but should be related to most of components)

  • Submitted search results were evaluated by

mean average precision (MAP)

slide-20
SLIDE 20

20

Details of relevant documents (A)

citation JIPA system

19 17 25 58 40

total number of documents is 159

slide-21
SLIDE 21

21

Details of relevant documents (B)

citation JIPA system

12 42 27 72 32

total number of documents is 185

slide-22
SLIDE 22

22

slide-23
SLIDE 23

23

Formal run results

  • no significant difference b/w the results of

main topics (34) and additional topics (69)

  • please see proceedings for details
slide-24
SLIDE 24

24

Passage-based relevance judgment

  • For each relevant document (either A or B),

passage-based relevant judgment was performed as follows:

– if a passage can be grounds to judge the document as relevant, this passage is relevant – if a group of passages can be grounds to judge the document as relevant, this passage group is relevant

  • assessors searched for relevant passages and

groups exhaustively

slide-25
SLIDE 25

25

Passage-based evaluation

  • Relevant passage group is equally

informative as a single relevant passage

  • New concept of combinational relevance is

proposed

  • In the conventional evaluation for IR,

relevant items (e.g. documents and passages) are independent and therefore combinations are not considered

slide-26
SLIDE 26

26

Example of passage-based evaluation

a relevant document (A or B) ……… relevant passage group

  • evaluation score is determined

by a search length in which a user obtains sufficient grounds

  • final score is averaged over all

relevant (A/B) documents

search length = 5

slide-27
SLIDE 27

27

Baseline IR system

  • Organizers provided participants with a

baseline IR system on the Web

– return document list in response to a query – indented for glass-box comparative evaluation

  • Fundamentally, each group was able to

participate only by developing front/back- end modules

– i.e., query processing and passage retrieval

  • two groups used the baseline system
slide-28
SLIDE 28

28

Example methods used by participants

  • claim structure analysis

– dividing claim into subtopics – dividing preamble and essential parts – different term weights depending on the part

  • different usages of classification (IPC)

– filtering, hierarchy, probabilistic model

slide-29
SLIDE 29

29

NTCIR-4 Workshop (2003-2004)

  • NTCIR workshop is in one and half years

– difficult to explore long-term research topics

  • Two different patent tasks were performed

– invalidity search task: short-term – patent map generation task: long-term

  • feasibility study (FS) task
slide-30
SLIDE 30

30

Scenario of patent map generation

search topic classification documents retrieval visualization topics and documents in NTCIR-3 collection

application JAPIO abst PAJ

multi-dimensional matrix

slide-31
SLIDE 31

31

Task description

  • In principle, given a search topic, relevant

patents are retrieved and organized into a multi-dimensional matrix

  • In practice, given a search topic, relevant

patents and x/y-axes, each participant submits a two-dimensional matrix

– the number of topics was 6

  • human experts evaluated matrix subjectively
slide-32
SLIDE 32

32

1998-012923 1998-247745 1998-256597 1998-135514 1998-256668 1998-135516 1998-242586 1998-247761

structure of light emitting element

1998-242515 1998-270757 1998-173230 1998-209499 1998-256602 1998-242518 1998-215034 1998-223930

electrode arrangement

1998-209495 1998-190063 1998-209498 1998-107318

electrode composition

1998-145000 1998-233554

structure of active layer emission intensity emission stability long

  • perating

life reliability crystalline

problems to be solved solutions

Example map (blue light-emitting diode)

given participants identify lines and columns

slide-33
SLIDE 33

33

Patent map generation task

  • 6 topics were used

– gasoline direct-injection engine – hair care cosmetic products – functional carpet – blue light-emitting diode – solid high-polymer-type fuel cell – ultra hydrophilization of plastic surfaces

  • human experts produced reference maps

and evaluated submitted maps subjectively

slide-34
SLIDE 34

34

Summary

  • NTCIR-4 patent collection can be used for

– retrieval of semi-structured long documents – associative patent retrieval – passage retrieval – classification and text mining (patent map)

  • All data will be open to the public after the

workshop meeting

slide-35
SLIDE 35

35

Outstanding issues in NTCIR-4

  • For invalidity search, the number of

relevant documents was inherently small

– evaluation results can potentially be unreliable – to overcome this problem, the number of topics must be increased (cf. question answering task)

  • Passage-based evaluation was not used as
  • fficial result
  • The number of participants was small

– 8 groups (all Japanese groups)

slide-36
SLIDE 36

36

Plan for NTCIR-5

  • Two main tasks
  • retrieval task

– using more topics (> 1000) – exploring passage-based evaluation

  • classification (categorization) task

– a variation of patent map generation – to evaluate machine learning methods

  • round-table meeting on June 28, 2004