`Hayashiet al.
`
`119;
`
`{54]
`
`[75]
`
`INFORMATION PROCESSING SYSTEM
`AND METHODFOR PROCESSING
`DOCUMENT BY USING STRUCTURED
`KEYWORDS
`
`Inventors: Takehisa Hayashi, Sagamihara;
`Kouki Noguchi, Kokubunji; Tsuneya
`Kurihara, Tokyo; Masahiro Abe,
`Iruma, all of Japan
`
`[73] Assignee: Hitachi, Ltd., Tokyo, Japan
`
`[21] Appl. No.: 741,760
`
`[22] Filed:
`
`Aug. 7, 1991
`
`Foreign Application Priority Data
`[30]
`Aug. 22, 1990 [JP]
`Japan ....cssssssssssssssssessssssessees 2-219039
`
`[SL] Urnt, CUS occccesesccsccsssesesssssssesssssessseseons GO6F 15/40
`
`[52] ULS. C1. eeceeeeeeeere 364/419.07; 364/419.17
`{58] Field of Search.................008 364/419; 395/600
`
`[56]
`
`References Cited
`U.S. PATENT DOCUMENTS
`
`4,868,733 9/1989 Fujisawaet al. ..
`4,958,284 9/1990 Bishopet al.
`..
`
`4,972,349 11/1990 Klienberger.......
`
`2/1991 Burkowski et al... 364/200
`4,991,087
`2/1991 Brooksetal. ......
`364/900
`4,992,972
`5,099,426 3/1992 Carlgrenet al. ose 364/419
`5,123,103 6/1992 Ohtaki et al.
`eee 395/600
`5,168,565 12/1992 Morita .........eeescsssceneeseeseees 395/600
`
`
`
`ACUANAT
`5,307,266
`Apr. 26, 1994
`
`US005307266A
`{11] Patent Number:
`
`(45) Date of Patent:
`
`FOREIGN PATENT DOCUMENTS
`0032194A1
`7/1981 European Pat. Off.
`.
`0280866A2
`9/1988 European Pat. Off.
`.
`0361464A2 4/1990 European Pat. Off.
`.
`
`Primary Examiner—Roy N. Envall, Jr.
`Assistant Examiner~A. Bodendorf
`Attorney, Agent, or Firm—Antonelli, Terry, Stout &
`Kraus
`
`ABSTRACT
`[57]
`A document processing system for processing docu-
`ments by using structured keywords comprises an out-
`put system and a receiver system. The output system
`includesa first storage for storing a structured keyword
`dictionary containing structured keywords among
`which relations are systematically structured, and link-
`age unit providing linkage information for establishing
`correspondences between constituent parts of an input
`document and corresponding ones of the keywords.
`- The receiver system is coupled to the output system and
`includes a second storage for storing structured key-
`words among which relations are systematically struc-
`tured, and retrieving unit having inputs supplied with
`the documentand the linkage information for retrieving
`the document to thereby form data of a predetermined
`edition format by using the structured keyword read
`out from the second storage. Data transfer between the
`output system and the receiver systems can be per-
`formed either on-line or off-line.
`
`15 Claims, 17 Drawing Sheets
`
`
`TRIVAL iW
`10 Fe
`
`BUEFER
`
`
`FORMATION
`
`COMMUNICATION
`Bruen LI
`INTERFACE
`
`=
`
`Gee)ae
`
`
`
`
`
`INFORMATIONSENDINGUSER
`
`
`
`
`
`INFORMATIONRECEIVINGUSER
`
`_! 3COMMUNICATIONNETWORK
`
`=
`FROM OTHER
`INFORMATION
`SENDING USERS
`
`FOO! INFORMATION RECEIVER SYSTEM
`
`
`
`Page | of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 1 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`U.S. Patent
`
`Apr. 26, 1994
`
`Sheet 1 of 17
`
`5,307,266
`
`|
`FIG.
`£2QOINFORMATION SENDER SYSTEM
`
`
`~
`
`31
`DOCUMENT
`
`|jg|
`=
`i
`3
`
`DOCUMENT-
`e
`2
`KEYWORD LINK
`
` [| AFFIXING UNIT
`
`
`o
`wn
`5
`
`710 RETRIVAL INFORMATION
`————
`nore
`BUFFER
`
`,
`
`z
`te
`Ca]:
`5a
`=<
`=
`s
`°
`
`
`
`
`
`
`
`STRUCTURED
`KEYWORD
`DICTIONARY
`
`|
`
`«|
`
`'
`
`i
`
`e o
`
`2
`3
`&
`o
`5
`=
`<<
`z
`&
`=
`
`fo
`2
`2
`3
`&
`&
`3
`&
`<
`z
`=
`
`x
`=
`=-_
`2
`oo E
`=z
`,
`=z
`S
`
`=
`>
`
`>3e
`
`o
`500
`
`—;
`FROM OTHER
`INFORMATION
`SENDING USERS
`_!
`
`s@Ol INFORMATION RECEIVER SYSTEM
`,
`,
`aoe
`DICTIONARY
`
`110 RETRIEVAL
`INFORMATION STORAGE |
`
`(3!
`
`102.
`
`
`
`=
`z
`oa
`ze
`=
`= E
`Qo
`
`
`‘
`
`'
`
`ts
`=
`nes
`ui
`a
`i
`S
`
`
`
`
`
`
`
`
`SYSNSl
`
`50
`
`Page 2 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 2 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`U.S. Patent
`
`Apr. 26, 1994
`
`Sheet 2 of 17
`
`5,307,266
`
`FIG. 2
`
`STRUCTURED KEYWORD
`
`
`
`KEYWORD OF HIGHER
`RANK CONCEPT
`
`KEYWORD
`
`
`
`
`SYNONYM KEYWORD |
`
`
`KEYWORD OF LOWER |
`
`RANK CONCEPT
`
`
`
`INK
`
`
`
`Page 3 of 26
`
`~ MINDGEEKEXHIBIT 1006
`
`Page 3 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`U.S. Patent
`
`Apr. 26, 1994
`
`Sheet 3 of 17
`
`5,307,266
`
`MICROPROCESSOR
`CHIP NAME (ID)
`
`ARCHITECTURE
`
`FIG. 3
`
`STRUCT
`
`K
`
`‘CISC
`
`86 SERIES
`68 SERIES
`
`RISC
`
`keSERIES
`tes SERIES
`Be SERIES
`DATE — SERIES
`
`DATE OF PUBLICATION (MONTH, YEAR)
`,»DATE OF SHIPPING (MONTH , YEAR)
`
`MANUFACTURER NAME
`
`COMPANY A
`COMPANY B
`COMPANY C
`
`\
`
`PERFORMANCE
`
`OPERATION PERFORMANCE (MIPS)
`CLOCK FREQUENCY (MHz)
`
`.»POWER CONSUMPTION (W)
`‘
`
`\
`\
`\
`
`“SEMICONDUCTOR TECHNOLOGY
`
`PROCESS
`
`CMOS
`, ECL
`
`THE GENERATION
`
`2.00pm
`LE Sam
`
`.
`
`Page 4 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 4 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`U.S. Patent
`
`Apr. 26, 1994
`
`Sheet 4 of 17
`
`5,307,266
`
`FIG. 4A
`
`START
`
`
`
`INPUTTING OF KEYWORD
`REPRESENTING TITLE
`OF DOCUMENT
`
`
`
`
`
`
`EXTRACTION OF KEYWORD
`ROUG
`PROCESSING OF STRUCTURED
`KEYWORD DICTIONARY
`AND DOCUMENTDATA
`
`
`
`40}
`
`402
`
`SELECTION OF KEYWORDS
`BY USER
`
`403
`
`
`
`FORMING OF LINK TO
`KEYWORD CORRESPONDING
`DESCRIPTION CANDIDATE
`BTARSING OF DOCUMENT
`
`
`
`
`
`CONFIRMATION OF VALIDITY
`OF FORMED LINK,
`ERROR CORRECTION AND
`ADDITIONAL DOCUMENT
`INPUT BY USER
`
`
`
`KEYWORDS
`ARE SUFFICIENT
`?
`
`Page 5 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 5 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`U.S. Patent
`
`Apr. 26, 1994
`
`Sheet 5 of 17
`
`5,307,266
`
`FIG. 4B
`
`COLLECTION OF DATA
`BY RETRIEVAL AND
`EDITING
`
`FIG. 4C
`
`DATA COLLECTON
`PREPARATORY
`PROCESSING
`
`14000
`:
`
`
`
`DATA COLLECTION
`PREPARATORY
`PROCESSING
`
`
`
`
`4000
`
`
`
`DATA COLLECTION
`PROCESSING
`FIG. 4
`
`
`
`
`DESIGNATION OF
`EDITING FORMAT
`
`
`
`
`
`
`
`
`
`
`4200 DESIGNATION OF
`RET RIEVAL-ORIENTED
`ITEM, CONDITION AND
`STRUCTURED KEYWORD
`(FIG. 4E)
`
`CHECK OF RESULT OF
`DATA COLLECTION
`
`
`
`
`Page 6 of 26
`
`-
`
`MINDGEEK EXHIBIT 1006
`
`Page 6 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`U.S. Patent
`
`Apr. 26, 1994
`
`Sheet 60f17
`
`5,307,266
`
`FIG. 4D erence OF y4loo
`
`EDITING FORMAT
`
`RETRIEVAL
`
`PREPARATION OF
`FIELD FOR
`DESIGNATING
`CONDITION AND
`ITEMS FOR
`
`FIG. 4E
`
`DESIGNATION OF RETRIEVAL —
`ORIENTED ITEMS, CONDITION AND
`STRUCTURED KEYWORD
`
`4200
`
`DESIGNATION.OF CONDITION FOR
`RETRIEVAL IN CONDITION FIELD OF
`EDITING FORMAT (FIG. 4F)
`
`
`
`
`
`
`
`(FIG. 46)
`
`DESIGNATE OF ITEM FOR RETRIEVAL IN
`ITEM FIELD OF EDITING FORMAT
`
`
`Page 7 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 7 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`U.S. Patent
`
`Apr. 26, 1994
`
`Sheet 7 of 17
`
`5,307,266
`
`FIG. 4F
`(oesenarion OF CONDITION FOR
`L~4210
`
`RETRIEVAL IN CONDITION FIELD
`
`
`
`
`INPUTTING OR SELECTION OF
`CANDIDATES FOR ONE OR MORE
`KEYWORDS REPRESENTING TITLE
`
`
` PRESENTATION OF
`OF DOCUMENT FOR RETRIEVAL
`
`
`
`KEYWORD CANDIDATE FOR
`CORRECTING KEYWORD
`
`GIVEN BY USER SO AS TO
`
` MATCHING PROCESSING OF KEYWORD
`BE INCLUDED IN STRUCTURED
`
`
`CANDIDATES WITH KEYWORDSIN
`KEYWORD DICTIONARY
`STRUCTURED KEYWORD DICTIONARY
`
`42l2-|
`
`
`FOR RETRIEVAL
`
`DETERMINATION OF DOMAIN OF
`STRUCTURED KEYWORD DICTIONARY
`
`4213
`
`SELECTION OF KEYWORD PRESENTATION OF
`
`KEYWORD CANDIDATES
`CONFORMING TO
`STRUCTURED KEYWORD
`DICTIONARY
`
`
`
`ESTABLISHMENT OF CONDITION FOR
`
`RETRIEVAL BY INPUTTING OR
`
`
`
`MATCHING PROCESSING OF KEYWORDS
`OF HIGHER AND LOWER RANK
`CONCEPTS
`
`
`KS
`
`421\6-|
`
`Tyes
`_
`DETERMINATION OF CONDITION
`FOR RETRIEVAL
`
`4217
`
`Page 8 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 8 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`U.S. Patent
`
`Apr. 26, 1994
`
`Sheet 8 of 17
`
`5,307,266
`
`FIG. 4G
`
`sae OF ITEMS ory4220
`
`RETRIEVAL IN ITEM FIELD
`
`
`
` DESIGNATION OF ITEMS FOR
`RETRIEVAL BY INPUTTING OR
`
`SELECTION OF KEYWORDS
`
`
`
`
`PRESENTATION OF
`KEYWORD CANDIDATES IN
`
`
`CONFORMANCE WITH .
`
`
`MATCHING PROCESSING WITH
`STRUCTURED KEYWORD
`
`
`KEYWORDS IN STRUCTURED
`4222 | DICTIONARY
`
`
`
`KEYWORD DICTIONARY
`
`
`
`
`
`
`Page 9 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 9 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`U.S. Patent
`
`Apr. 26, 1994
`
`Sheet 9 of 17
`
`5,307,266
`
`FIG. 4H
`(DATA COLLECTION PROCESSING}~GOOO
`
`
`
`RETRIEVAL THROUGH MATCHING
`PROCESSING OF CONDITION FOR RETRIEVAL ©
`WITH STRUCTURED KEYWORDS
`
`
`
`FIG. 4]
`EDITION OF DATA IN ACCORDANCE
`WITH EDITING FORMAT
`
`
`L-G 200
`
`MATCHING PROCESSING OF STRUCTURED
`KEYWORDS OF ITEMS FOR RETRIEVAL AND
`THOSE OF RETRIEVAL INFORMATION
`
`
`
`
`
`
`
`
`
`
`
`ENTRY OF KEYWORD CORRESPONDING
`DOCUMENT CONSTITUENT IN ITEM FIELD
`IN CONFORMANCE WITH EDITION FORMAT
`
`Page 10 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`
`
`EXTRACTION OF DOCUMENT CONSTITUENT
`
`CORRESPONDING TO KEYWORD BY USING
`
`DOCUMENT— KEYWORD LINKAGE
`
`INFORMATION
`
`[~6220
`
`Page 10 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`U.S. Patent
`
`Apr. 26, 1994
`
`Sheet 10 of 17
`
`5,307,266
`
`FIG. SA
`
`THIS TIME, COMPANY A HAS DEVELOPED A HIGH-PERFORMANCE
`MICROPROCESSOR
`OF RISE STRUCTURE ADOPTING 1.3#m_
`
`CMOS.
`13
`| OM | P eee
`
`l
`
`V3 MAY,
`
`"87 L_
`
`FIG. 5C
`FIG. 5B
`]
`PeTeD
`* MICROPROCESSOR
`* DEVELOPMENT
`NEW MODEL
`
`(KEYWORDS DESIGNATED=MICROPROCESSOR | DEVELOPMENT2
`BY SENDER SYSTEM
`AS SUBJECT MATTER
`MANUFACTURER
`OF DOCUMENT)
`NAME II
`SEMICONDUCTOR
`TECHNOLOGY
`
`STRUCTURED KEYWORD
`
`FIG. 5D
`
`THE
`"GENERATION 12
`PROCESS 13,
`ARCHITECTURE 14
`
`PERFORMANCE
`
`OPERATION
`PERFORMANCE (MIPS )I5
`
`CHIP NAME 21
`
`DATE
`
`DATE OF PUBLICATION
`(MONTH, YEAR) 31
`
`Page 11 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 11 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`U.S. Patent
`
`Apr. 26, 1994
`
`Sheet 11 of 17
`
`5,307,266
`
`ITEMS/CONDITION|
`/
`FOR RETRIEVAL /
`
`EDITING
`FORMAT
`
`FIG. 8
`
`
`iPfsee
`
`
`[fenfarmee|ee[
`frome|aoe|
`
`
`
`
`
`
`Page 12 of 26 ©
`
`MINDGEEK EXHIBIT 1006
`
`Page 12 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`U.S. Patent
`
`Apr. 26, 1994
`
`Sheet 12 of 17
`
`5,307,266
`
`FIG. 7
`
`-70!
`GO : DESIGNATION OF CONDITION FOR RETRIEVAL
`-T02
`‘SEIDENSERED?EBER
`
`
`
`YM=MICROPROCESSOR | -703
`
`? DATE
`DATE OF PUBLICATION
`(MONTH, YEAR)
`?
`& (RETRIEVE : JANUARY.°87 CY M)
`
`al
`
`: ENTRY:YM)
`
`;
`
`a2: ENTRY: (MICROPROCESSOR
`
`? MANUFACTURER NAME) ;
`
`a3 : ENTRY:(MICROPROCESSOR
`
`? ARCHITECTURE
`
`-704
`_-
`705
`-706
`
`-707
`
`-708
`
`-709
`
`-710
`
`-7II
`
`?(SELECT:CISC,RISC)); -712
`
`04 : ENTRY: (MICROPROCESSOR
`
`? PERFORMANCE
`
`OPERATION
`(MIPS) )
`
`-713
`
`-714
`
`-
`
`as
`
`Page 13 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 13 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`U.S. Patent
`
`Apr. 26, 1994
`
`Sheet 13 of 17
`
`5,307,266
`
`FIG. 9A
`
`FIG. 9B
`
`
`
`Y3SNONIONSNOLLIYAYOINI
`
`
`
`YISNSNIAZIIYNOLLVAYOINI
`
`=i STRUCT URED
`KEYWORD BUFFER
`
`K
`
`RisEYWO
`|; RUCTIRED
`
`OCUMEN
`DATA BUFFER
`
`COLLECTED
`9|
`DATA STORAGE
`RETRIEVAL INFORMATION
`
`|oe ee =
`
`ioe
`
`re
`
`Page 14 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 14 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`
`
`
`
`
`
`U.S. Patent
`
`Apr. 26, 1994
`
`Sheet 14 of 17
`
`5,307,266
`
`FAG.
`
`0
`
`
`
`3
`
`,eY|:bf||BeSe(r=:
` gS
`OFbauo-z=ofEgatassone
`i)|ESPes
`GiqaetaSus3
`
`
`
`Y3SNONIGNSSNOLLVAYOINIY3SNONIAIZISYNOLLVWYOINI
`
`pezeeHoIBBSS!
`
`WUOMLINNOLLVOINNIWWODtt--+
`
`eeee
`
`Page 15 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 15 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`
`
`U.S. Patent
`
`Apr. 26, 1994
`
`Sheet 15 of 17
`
`5,307,266
`
`FIG.
`
`II
`
`KEYWORD- DOCUMENT )
`MICROPROCESSOR|(1)
`LINKAGE INFORMATION
`MANUEACTURERF— —anwns — — —{COMPANYAl HL
`
`SEMICONDUCTOR
`TECHNOLOGY
`HE GENERATION}- ————T3m} L2
`process}--—-——— {cmos} M13
`
`ARCHITECTURE[-— — — ——— 4{RISC {U4
`
`PERFORMANCE
`OPERAce (MIPS ———IOMIPS — | IIS
`
`recccc4
`
`— {199580}
`
`| En |
`
`DATA OF
`
`ig
`
`PUBLICATION (MONTH YEARY~7|MAY. 871 13t
`(‘eeaion sua
`
`[-__] strUcTURED KEYWORDS
`
`LOCATION DISIGNATING
`LINKAGE INFORMATION
`
`[” ~~ ~ 71 KEYWORD CORRESPONDING
`L.———-J DOCUMENT CONSTITUENTS
`
`Page 16 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 16 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`U.S. Patent
`
`Apr. 26, 1994
`
`Sheet 16 of 17
`
`5,307,266
`
`FIG. 12A
`
`FIG. 12B
`
`clISO COLLECTED EDITED DATA _
`
`IIGO COLLECTEDDOCUMENT DATA
`
`EDITING
`
`300!
`
`STRUCTURED
`
`FORMAT
`KEY WORDS
`
`
`DOCUMENT
`IDENTIFYING
`INFORMATION
`OS
`
`
`
`
`
`Page 17 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 17 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`U.S. Patent
`
`Apr. 26, 1994
`
`Sheet 17 of 17
`
`5,307,266
`
`FIG.
`
`3
`'
`
`r8
`
`
`
`
`
`
`
`yasnONIGNSSNOLLVWYOIN!
`
` e|#]lee25=|SOF]|teQa||ity||Sa8=lax
`
`«BiewlSNN]m4SlagO|StgBig“|%=2/6tT2arafAzuz=sew;eo
`=—“aBlas
`tis1paoOin
`i=2Fu8eos=zeiSe=aroHussbeeostsene5-65©<Omi
`
`NOLIVOINAWAODNOLIVOINNWWOD
`NOLLVIINNWWOD
`
`F4
`
`
`
`SWIYSLNISJOVSYUSINI
`
`MYOMLAN
`
`
`
`U3SNONIAIIO3YNOLLVAUOINI
`
`NOLLWOANNWAO
`
`JWIYUIINI
`
`Page 18 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 18 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`
`
`
`
`1
`
`5,307,266
`
`2
`In view of the above and other objects which will be
`apparent as description proceeds,
`there is provided
`according to an aspect ofthe presentinvention an infor-
`mation processing system comprising a combination of
`.a sender system and a receiver system, wherein the
`sender system includes a structured keyword dictionary
`containing keywords among which relations are sys-
`tematically structured, a unit for adding linkage infor-
`mation to constituent parts of a documentas inputted
`which bear respective relations to the keywords se-
`lected from the structured keyword dictionary and a
`unit for sending out retrieval information containing the
`structured keywords, the linkage information and the
`document data added with the linkage information,
`while the receiver system includes a retrieving unit
`responsive to reception of the retrieval information
`from the sender system for retrieving the document
`data by using the linkage information and the structured
`keywords.
`The structured keyword mentioned above may be
`implemented on a knowledge domain basis so as to have
`at least one ofthelinks including a link to a keyword
`representing a higher rank concept, a link to a keyword
`representing a lower rank keyword and a link to a key-
`word representing a synonym,as is illustrated in FIGS.
`2 and 3 of the accompanying drawings.
`Correspondence between the keyword selected from
`the structured keywords and the corresponding constit-
`uent part of the document should be established in light
`of the structure of the structured keywords such that a
`keyword of concern representing an upper rank con-
`cept of the semantic content of a constituent part oftlie
`documentis linked to that document part which thus
`represents the lower rank concept of the keyword of
`concern,as will be elucidated later on by reference to
`FIG.5.
`Further, the retrieving unit for retrieving the docu-
`ment data with the aid ofretrieval information and the
`structured keywords may be composed of a functional
`part for designating a keyword needed for the retrieval
`by consulting the structured keyword dictionary, a
`storage for storing the structured keywords, a retriev-
`ing unit for retrieving documentdata by using the struc-
`tured keywordsstored in the storage and the retrieval
`information, and a secondstoragefor storing the result
`of the retrieval.
`In this conjunction, the secondstoragefor storing the
`result of the retrieval may preferably be imparted with
`a function for editing the data resulting from the re-
`trieval
`in accordance with a designated or inputted
`format for editing and storing the result of editing so
`that automatic editing of the documentdata as retrieved
`can be performed.
`For retrieving the document data with the aid of
`retrieval information andthe structured keyword desig-
`nated and stored for the retrieval, the constituent part of
`the document representing the lower rank concept of
`that keyword should preferably be extracted by using
`the linkage information.
`information for re-
`The retrieval information (i.e.
`trieval) may preferably includein addition to at least the
`structured keyword, the linkage information for indi-
`cating correspondence between the structure keyword
`and a corresponding constituent part of the document
`and document data added with the linkage information
`as described above,at least one of information resulting
`from copying or extraction of a constituent part of the
`document corresponding to the structured keyword,
`
`INFORMATION PROCESSING SYSTEM AND
`METHOD FOR PROCESSING DOCUMENT BY
`USING STRUCTURED KEYWORDS
`
`BACKGROUND OF THE INVENTION
`. The present invention relates to information process-
`ing method and. system for automatically collecting
`desired information from a large amountofinformation.
`As an information processing system for acquisition
`of information, there is heretofore known an informa-
`tion retrieval system which is so arranged as to make
`access to a database or a knowledgebasein whichinfor-
`mation has previously been stored or accumulated,as is
`described in JP-A-60-140443.
`Further, as the methods for retrieval of information,
`there are known a method in which the user designates
`items forretrieval in accordance with items of a table
`constituting a part of a database onthebasis of informa-
`tion concerninga data storage structure adoptedin the
`database and a method of simplifying designation of the
`items for retrieval by resorting to an associative re-
`trieval and a synonym processing. Besides, there has
`already been proposed a method according to which
`documents added with keywords are stored as they are
`for allowing extraction of document constituent parts
`for which coincidence is found with the keywords and
`a method according to which a stored document is
`retrieved when the keywords available for the retrieval
`coincide with synonyms detected from all the texts of
`that document.
`,
`The first mentioned prior art method is however
`disadvantageous in that information other than the pre-
`determined table items can not be processed because of
`the tabular structure of the database. If the number of
`table items is increased in an effort to cope with the
`above problem, then the structure of the database be-
`comes complicated, involving difficulty in maintenance
`and managementthereof.
`In the case of the document retrieving methods in
`which keywords are used for retrieval, the requisite
`information as wantedbythe user can be obtained only
`whenthe user having read the extracted documentpart
`can understand the content thereof. As a consequence,
`wheninformation is to be collected for a specific item
`or matter from many documents, the burden to be borne
`by the user will increase significantly, giving rise to a
`problem.
`
`SUMMARYOF THE INVENTION
`It is therefore an object of the present invention to
`provide an information processing system whichis ca-
`pable of automatically collecting necessary or de-
`manded information from a large amount of stored
`information.
`It is another object of the present invention to pro-
`vide an information processing system which is substan-
`tially immune to the shortcomings of the prior art
`method,such as difficulty in maintenance and manage-
`ment, and the serious burden imposed on the user and
`others upon automatic collection of information.
`A further object of the invention is to provide a data-
`base retrieving method and system capableofcollecting
`automatically those data which meet the demand of the
`user by allowing extraction of the content of a docu-
`ment having meaning implied by keywords as desig-
`nated.
`.
`
`35
`
`45
`
`50
`
`60
`
`65
`
`Page 19 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 19 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`3
`documentpart location designating linkage information
`indicating the position or location of that constituent
`part in the documentoridentification information for
`identifying an original document to which the docu-
`ment part belongs. In that case, the editing can corre-
`spondingly be simplified while facilitating confirmation
`and correction of the document data to another advan-
`tage.
`information supplied to the
`the retrieval
`Besides,
`receiver system from the sender system maybe trans-
`mitted through a communication network,as illustrated
`in FIG. 1.
`|
`;
`Alternatively, the retrieval information to be trans-
`ferred as the output/input information may include at
`least information capable of being written in an informa-
`tion carrying medium and read therefrom,as in the case
`of an embodimentof the invention shownin FIG.9.
`According to anotheraspect of the invention, there is
`further proposed for achieving the previously men-
`tioned objects an information processing method,
`wherein a process for establishing correspondences
`between the structured keywords and corresponding
`documentdata includes an input procedurefor allowing
`a user to input a keyword representing subject matter of
`a document, a procedure for extracting a keyword
`through matching processing of the keywords con-
`tained in a structured keyword dictionary with the
`document data, a linkage forming procedure for form-
`ing a link to a candidate for constituent parts (descrip-
`tion) in the document which corresponds to the key-
`word through syntax analysis (parsing) of the document
`data and a procedure for allowing the user to confirm
`the validity of the formed link orcorrect the link,as will
`hereinafter be described in detail by reference to FIG.4.
`According to anotheraspect of the invention,thereis
`provided an information processing method in which a
`process of collecting document data throughretrieval
`for editing includes an editing format designating proce-
`dure, a procedure for designating itemsto be retrieved,
`conditions for the retrieval and the structured keywords
`and a data collecting procedure,
`wherein the editing format designating procedure
`includes a procedurefor inputting an editing format and
`items to be retrieved and a procedure for designating
`fields for the conditions and the items for retrieval,
`the procedure for designating the items to be re-
`trieved, the condition for retrieval and the structured
`keyword includes a procedure for designating the re-
`trieval condition to be entered in the retrieval condition
`designating field of the editing format and a procedure
`for designating the retrieval items to be entered in the
`retrieval item designating field of the editing format,
`and
`the data collecting procedure includes a retrieval
`procedure for performing in responseto the input of the
`retrieval information a matching processing between
`the structured keyword ofthe retrieval information and
`that ofthe retrieval condition, and a data editing proce-
`dure for editing the document data in accordance with
`the editing format, wherein the data editing procedure
`includes a matching processing procedure for perform-
`ing a matching processing between the structured key-
`word ofthe item for retrieval and that of the retrieval
`information, a document part extracting procedure for
`extracting a constituent part of document correspond-
`ing to the keyword by using document-keyword link-
`age information (i.e. linkage information for interlinking
`a document part and a keyword) contained in the re-
`
`15
`
`20
`
`25
`
`35
`
`40
`
`45
`
`50
`
`©
`
`55
`
`60
`
`65
`
`5,307,266
`
`4
`trieval information, and a storing procedure for storing
`the documentpart corresponding to the keyword in the
`retrieval item designating field in accordance with the
`editing format.
`The keywords used for the retrieval of information
`accordingto the present invention are conformedto the
`structured keyword dictionary in which the relations
`between or among the keywords are systematically
`structured on a knowledge-domain basis (i.e. in each
`domain of knowledge), wherein those keywordslinked
`together in a standardized semantic relationships, such
`as the relationship among concepts of higher and lower
`ranks, are used in the respective relevant knowledge
`domain. Accordingly, there arises scarce differences of
`individuals in understanding the keywords because the
`semantic relations between the keywords are easy to
`understand distinctively.
`In conjunction with establishment of correspon-
`dences betweenthe the constituent parts of a document
`and the structured keyword, it is noted that those con-
`stituent parts of the document which should semianti-
`cally belong to a same keyword may assume various
`meanings. Under the circumstances,it is taught accord-
`ing to the invention to previously establish correspon-
`dences between the keywordsselected from the struc-
`tured keyword dictionary and the constituent parts of
`the documentby using the linkage information. Conse-
`quently, according to the invention, the user can get rid
`of trouble of handling unnecessarily lots of data for the
`retrieval. Furthermore, difficulty in maintenance and
`managementcan significantly be mitigated by virtue of
`establishment of correspondences between the key-
`wordsandthe constituent parts of the documentas well
`as owing to utilization of the standardized structured
`keyword dictionary as a basis.
`Aswill be seen from the foregoing, because the link-
`age information for establishing correspondences be-
`tween the keywordsand the constituent parts of a docu-
`ment which semantically correspond to the above key-
`words are addedto the constituent parts of a document,
`there can easily be extracted the constituent parts of the
`document, such as words, phrases/clauses and sen-
`tences which semantically correspond to the keywords
`designated by the user for the item for which he or she
`wants to acquire information.
`Additionally, by storing internally of the information
`processing system the keywords designated by the user
`for the item for which information is to be acquired,
`extracting the constituent parts of the documentcorre-
`sponding semantically to the designated keywords from
`those documents supplied by way of an information
`network or other various media and storing the ex-
`tracted document constituent parts, there can be real-
`ized an automatic data collection.
`By designating the editing format for editing the data
`collected, the data desired by the user can be progres-
`sively and increasingly stored and accumulated. By way
`of example, as the editing format, a table framework
`may be provided, whereon the keywords maybe desig-
`nated at locations corresponding to the items of the
`table. When fresh document data are supplied to the
`information processing system according to the inven-
`tion,
`the document constituent parts corresponding
`semantically to the individual keywords mentioned
`above can be extracted by using the linkage information
`and then can be written in the table at corresponding
`columns. By repeating this procedure, the table can be
`autonomously and increasingly expanded.
`
`Page 20 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 20 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`5,307,266
`
`5
`Finally, by attaching to the extracted document con-
`stituent parts the linkage information indicating the
`locations in a document from which the constituent
`parts have been extracted or by attaching the identifica-
`tion number of the document to which the extracted
`~ documentconstituent parts have belonged, the user can
`straightforwardly read out the relevant parts of the
`documentand easily confirm whether or not the corre-
`spondences between the keywords and the document
`parts are correct, whereon an error, if any, can be cor-
`rected.
`
`BRIEF DESCRIPTION OF THE DRAWINGS
`
`6
`semantically same meaning (i.e. synonym) as the given
`keyword on the basis of the field or domain of the
`knowledge to which the given keyword belongs(i.e. on
`a knowledge-domain basis), as is illustrated in FIG.2.
`Thus, the keyword dictionary 2 may be termed as a
`structured keyword dictionary containing keywords
`which are systematically structured by means ofinter-
`keyword linkages or relations established as mentioned
`above. This dictionary 2 will hereinafter be termed the
`structured keyword dictionary, while the keywords
`related to one anotherbythe links will be referred to as
`the structured keywords. Turning back to FIG. 1, a
`reference numeral 3 denotes a unit for selecting from
`the structured keyword dictionary 2 the structured
`keywords in the domain to which the subject matter of
`a given document contained in the document data stor-
`age 1 relates, for the purpose of adding to the selected
`document the structured keywords and linkage infor-
`mation which is required for establishing correspon-
`dence between the structured keywords and relevant
`constituent parts of the selected document. A numeral
`31 denotesa user interface for actually establishing the
`correspondences between the structured keywords and
`the constituent parts of the document in accordance
`with the output information of the unit 3. Further, a
`retrieval information storage unit generally denoted by
`10 serves for storing the retrieval information and in-
`cludes a buffer storage 12 for storing the structured
`keywords(also designated by 12) as selected, a buffer 11
`for storing the link-affixed document data (11) added
`with the information of the linkages between the con-
`Stituent parts of the document and the structured key-
`words (hereinafter simply referred to as the document-
`keyword linkage information), and a buffer 13 for stor-
`ing the document-keyword linkage information (13)
`itself. A reference numeral 21 denotes an interface
`through which the retrieval information 10 is sent out
`onto the communication network 500.
`In the information receiver system 201 which can be
`connectedto the information sender system 200, a refer-
`ence numeral 121 denotes a communication interface
`through which the retrieval
`information described
`aboveis received, and a numeral 110 denotes a storage
`unit for storing the retrieval information as received.
`This storage unit 110 also includes a buffer 112 for stor-
`ing the structured keywords(also designated by 112), a
`buffer 111 for storing the link-affixed document data
`(111) and a buffer 113 for storing the document-
`keyword linkage information, as in the case of thestor-
`age unit 10 incorporated in the information sender sys-
`tem 200. At least one receiver system connected to the
`sender systems can receive the retrieval information 10
`sent out onto the communication network 500 through
`the communication interface 121, whereon theretrieval
`information 10 received is stored in the storage unit 110
`correspondingly.
`The user who wants to receive some information can
`designate with the aid of a retrieval item/condition/-
`structured keyword designating unit 102 the items for
`the information to be acquired, the condition for re-
`trieval, such as a scope or coverageoftheretrieval, and
`relevant structured keywords, whereon these desig-
`nated data are stored in a buffer storage 103. In the
`information receiver system 201 a reference numeral
`101 designates a structured keyword dictionary which
`has same contents as those of the structured keyword
`dictionary 2 incorporated in. the information sender
`system 200. Further, the user demanding information
`
`20
`
`25
`
`FIG.1 is a functional block diagram showing a docu-
`ment processing system according to an embodimentof 15
`the invention;
`FIG.2 is a view for illustrating an exampleof a struc-
`ture of a structured keyword employed according to
`the teaching of the invention;
`FIG.3 is a view for illustrating, by way of example
`only, linkages among structured keywords;
`FIGS.4A to 4I are flow charts for illustrating exem-
`plary procedures involved in operation of the document
`processing carried out by the system shown in FIG.1;
`FIGS. 5A to 5D and FIGS.6 to 8 are viewsforillus-
`trating, by way of example, in what manner a document
`is processed according to an embodimentof the inven-
`tion by using the system shown in FIG.1;
`FIGS. 9A and 9B and FIG. 10 are functional block
`diagrams showing document processing systems ac-
`cording to further embodiments of the invention, re-
`spectively;
`FIG. 11 and FIGS. 12A and 12B are viewsforillus-
`trating, by way of example only, operations of the docu-
`ment processing systems according to the further em-
`bodiments of the invention; and
`FIG. 13 is a functional block diagram showing yet
`another embodiment of the document processing sys-
`tem according to the invention.
`DESCRIPTION OF THE PREFERRED
`EMBODIMENTS
`
`30
`
`35
`
`Now,the present invention will be described in detail
`in conjunction with preferred or exemplary embodi-
`ments thereof by reference to the accompanying draw-
`ings.
`FIG.1 showsin a functional block diagram a general
`arrangement of a document processing system accord-
`ing to a first embodimentofthe invention. In this figure,
`a reference numeral 200 denotes generally an informa-
`tion sender system adapted to generate retrieval infor-
`mation (i.e. information for retrieval) to be transmitted
`through a communication network 500 to an informa-
`tion receiver system which is generally denoted by a
`numeral 201 and arranged to perform processing for
`retrieval and editing by utilizing the retrieval informa-
`tion as supplied. Thus, it can be said that the document
`processing system illustrated in FIG. 1 is implemented
`in the form of a retrieval information transmission/-
`reception system, so to say. In the information sender
`system 200, a reference numeral 1 designates a docu-
`ment data storage, and a numeral 2 denotes a keyword
`dictionary storage. Any given one of the keywords
`contained in the keyword dictionary 2 is so structured
`as to haveatleast onelink, including a link leading to a
`keyword representing an upper rank concept of the
`given keyword, a link to a keyword representing a
`lower rank concept and a link to a keyword having
`
`45
`
`50
`
`55
`
`60
`
`65
`
`Page 21 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 21 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`5,307,266
`
`7
`can designate through the user interface 131 a format
`for editing of information or data (hereinafter also re-
`ferred to as the editing format) with the aid of an editing
`format designating unit 104, the designated editing for-
`mat being then stored in an editing format buffer 105. A
`referenc