`Houser et al.
`
`USOO5774859A
`Patent Number:
`11
`(45) Date of Patent:
`
`5,774,859
`Jun. 30, 1998
`
`54 INFORMATION SYSTEM HAVING A
`SPEECH INTERFACE
`75 Inventors: Peter B. Houser, Poway, Calif.; Mark
`E. Schutte, Sugar Hill, Ga.; Gloria J.
`Majid, San Diego, Calif.
`
`73 Assignee: Scientific-Atlanta, Inc., Norcross, Ga.
`
`21 Appl. No.: 367,997
`1-1.
`22 Filed:
`Jan. 3, 1995
`51
`Int. Cl. ................................. H04N 7/00, G10L 9/00
`52 U.S. Cl. .............................................................. 70427s
`58 Field of Search .................................. 395/2.79, 2.84;
`704/275, 270
`
`56)
`
`FOREIGN PATENT DOCUMENTS
`4351094 5/1991 Japan ............................... G10L 3/OO
`OTHER PUBLICATIONS
`Lefkowitz, “Voice-Recognition Home TV Coming This
`Year; Service Merges Computer, Phone, and Cable Tech
`nologies (BBN Hark Systems Corp, Time Warner Cable,
`Full Service Network).” Computer Shopper, V. 15, n2,
`p.68(1), Feb. 1995.
`“Voice Operated Controls and Switches Are Developed,”
`Veterans
`Administration
`Journal
`Announcement
`GRA18422, Abstract, Jul., 1984.
`Sugaya et al., “Voice Remote Controlled TV Set." COMP
`CON 79 Proceedings using microprocessors extending our
`reach, 1979.
`Dawson, “Time Warner Pursues Voice as New Remote',
`Multichannel News, Jan. 9, 1995, pp. 31 and 34.
`Primary Examiner David R. Hudspeth
`i: Kellv A
`Aitat ityr"RAM M
`ttorney, Agent, Or Firm-Kenneth M. Massaron1; Kelly A.
`Gardner
`
`References Cited
`U.S. PATENT DOCUMENTS
`4,516,207 5/1985 Moriyama et al. ..................... 364/424
`4,641,292 2/1987 Tunnell et al. ......................... 367/198
`4,677.569 6/1987 Nakano et al. .
`... 364/513.5
`4,706,121 11/1987 Young ..................................... 358/142
`ABSTRACT
`57
`4,776,016 10/1988 Hansen ...................................... 381/42
`4,977.45s 12/1990 Young ...
`... 358/142 A System for controlling a device Such as a television and for
`4,989,253
`1/1991 Liang et al. ...
`... 381/110
`controlling access to broadcast information Such as Video
`5,086,385 2/1992 Launey et al. .......................... 364/188
`di
`d
`inf
`is disclosed. Th
`s
`5,151,789 9/1992 Young .................................. issio,
`audio, and/or text information is disclosed. The system
`5,157.760 10/1992 Akagiri ....................................... 3052
`includes a first receiver for receiving utterances of a speaker,
`5,199,080 3/1993 Kimura et al.
`... 381/110
`a Second receiver for receiving vocabulary data defining a
`5,230,023
`7/1993 Nakano .......
`... 381/110
`Vocabulary of utterances, and a processor for executing a
`5,267,323 11/1993 Kimura ................................... 381/110
`Speech recognition algorithm using the received vocabulary
`5,303,148 4/1994 Mattson et al......
`... 364/413.01
`data to recognize the utterances of the Speaker and for
`5,335,276 8/1994 Thompson et al. ....................... 380/21
`controlling the device and the access to the broadcast
`8.
`al.".
`E.2.
`information in accordance with the recognized utterances of
`5.479.266 1919: SN N.
`5. the Speaker.
`5,479.268 12/1995 Young et al. .
`... 358/335
`5,557,319 9/1996 Gurusami et al. ........................ 348/11
`
`10 Claims, 28 Drawing Sheets
`
`
`
`4-1
`
`14-n
`
`DEVICE
`
`NFORMATION
`DISTRIBUTION
`CENTER
`
`NFORMATION
`
`TERMINAL
`UNIT
`
`to
`
`SPEECH DATA
`
`Page 1
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`
`Jun. 30, 1998
`
`Sheet 1 of 28
`
`5,774,859
`
`
`
`B01AB0
`
`Page 2
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`
`Jun. 30, 1998
`
`Sheet 2 of 28
`
`5,774,859
`
`
`
`No.OOONE
`
`| OHNI
`
`U-37 || ~
`
`No.OOONE,
`
`| OHNI
`
`Page 3
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`US. Patent
`
`Jun. 30, 1998
`
`Sheet 3 0f 28
`
`5,774,859
`5,774,859
`
`
`
`
`
`
`
`
`
`7meTmmwTNNFfl$952002mm>momm//mm—mmF\\
`
`N:o3.vmw
`
`3;
`
`a
`
`$2528
`
`E15
`
`E.925-5-50
`E1.
`
`$25200:
`
`
`
`$352QOmm>momm
`
`Hmmzcmm
`
`mommmoomm
`
`mm<m<k<o
`
`
`momeooE29:2lSE28
`DEEmm5:528
`
`225:5;
`
`-SE55
`
`mm?vEO>>EZ
`2952222550
`
`wzjlzm
`
`EEQEQQ
`
`momsom
`
`m3
`
`Page 4
`
`AMAZON 1004
`
`Amazon V. SpeakWare
`IPR2019-00999
`
`mm6E
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`Page 4
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`
`
`
`
`
`
`
`
`
`U.S. Patent
`
`Jun. 30, 1998
`
`Sheet 4 of 28
`
`5,774,859
`
`
`
`Page 5
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`
`Jun. 30, 1998
`
`Sheet 5 of 28
`
`5,774,859
`
`T?
`
`
`
`CJEHO ||WAS
`
`
`
`
`
`
`
`
`
`T08] N00
`
`ZOZ
`
`Page 6
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`
`Jun. 30, 1998
`
`Sheet 6 of 28
`
`5,774,859
`
`NIV/W
`
`HOSSE|00}}d
`
`808Z09
`
`909908
`
`
`
`Page 7
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`US. Patent
`
`Jun. 30, 1998
`
`Sheet 7 0f 28
`
`5,774,859
`
`
`
`FIG.5
`
`200
`
`MAINPROCESSOR
`
`224
`
`330
`
`INTER- FACE
`
`328
`
`KRCR
`
`H
`
`336J
`
`mm
`
`334
`
`332
`
`160
`
`><
`l.—
`
`0L
`
`u
`:1
`
`o0
`
`2E P
`
`age 8
`
`AMAZON 1004
`
`Amazon V. SpeakWare
`IPR2019-00999
`
`cc
`
`0a
`
`:
`
`Page 8
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`US. Patent
`
`Jun. 30, 1998
`
`Sheet 8 0f 28
`
`5,774,859
`5,774,859
`
`I
`2 1
`to('3
`d
`
`E "7
`
`< ..g
`
`(V)
`
`N
`0
`
`m0
`
`O
`or:
`
`95
`2%
`._
`(LL!
`20
`E
`“-
`
`8
`N
`
`<r
`&
`
`I
`O Q:LL|
`a :22ZLL
`—
`
`
`
`a
`m
`
`95
`a:
`
`/\/
`
`‘D
`
`\
`
`U:
`LL
`
`to
`
`o
`7—
`
`Page 9
`
`AMAZON 1004
`
`Amazon V. SpeakWare
`IPR2019-00999
`
`OQ
`
`u
`C)
`
`0L
`
`Page 9
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`US. Patent
`
`Jun. 30, 1998
`
`Sheet 9 0f 28
`
`5,774,859
`
`3:
`3:”
`4%
`”-00
`
`<E):
`0:33
`C")
`
`Em
`
`u“)
`C0
`
`200
`
`MAINPROCESSOR
`
`224
`
`.
`3E5
`
`gu—
`
`FRONTMICmCODEC
`
`B °
`
`°
`
`354
`
`352
`
`.
`25
`LL
`
`Page 10
`
`AMAZON 1004
`
`Amazon V. SpeakWare
`IPR2019-00999
`
`Page 10
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`
`Jun. 30, 1998
`
`Sheet 10 0f 28
`
`5,774,859
`
`NIV/W
`
`HOSSB00}}d
`
`098698/98
`
`Page 11
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`
`Jun. 30, 1998
`
`Sheet 11 of 28
`
`5,774,859
`
`FIG. 9
`
`
`
`768
`
`766
`GUDE
`
`764
`
`772
`VOICE
`
`753
`
`752
`
`769
`
`767
`SELECT
`
`u765
`
`755
`
`MUTE
`
`762 nu-r
`
`LAST
`
`763
`
`756
`
`757
`
`758
`
`Page 12
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`
`Jun. 30, 1998
`
`Sheet 12 of 28
`
`5,774,859
`
`
`
`W
`
`FIG 10
`
`25
`
`FIG 11
`
`64
`7:30 PM
`7:00PM
`03/05
`WXIA, HOLLYWO GILLIGANS
`14
`SQUARES
`ISLAND
`HBO
`TERMINATOR2
`15
`(R) (CC)
`KABC : TERMINA-
`16
`TOR2
`ESPN
`GOLF
`
`-
`
`62 -
`
`---
`
`WRESTLING
`
`H----
`
`WIMBLEDON
`
`60
`8:00PM
`8:30 PM
`BRADY
`TOMAND
`BUNCH
`JERRY
`HOME ALONE
`(PG) (CC)
`
`J-s 66
`
`BOWLING
`
`-----
`
`SHOW BAD NEWSBEARS
`
`TMC
`19
`
`ONE GOOD COOP
`(R)
`
`CITIZEN
`
`NVISIBLE
`MAN
`
`CURRENT TIME: 7:15 PM CURRENT DATE: O3/05
`
`-
`
`|
`
`PRESS (INFO FOR DETAILS, SELECT TO TUNE.
`PRESSMENUTOEXIT.
`
`Page 13
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`
`Jun. 30, 1998
`
`Sheet 13 of 28
`
`5,774,859
`
`
`
`
`
`FIG. 12A
`
`MAINMENU
`
`1.
`2.
`3.
`4.
`E.
`7
`8
`9
`
`PAYPER VIEW
`WHAT'S ON CABLE
`MESSAGES/INFORMATION
`FAVORITE CHANNELS
`VCR PROGRAMTIMER
`SLEEPTIMER
`PARENTAL CONTROL
`CABLE TERMINAL SETUP
`EXIT
`
`PLEASE MAKE SELECTION.
`
`FIG. 12B
`
`WHAT'S ON CABLE
`
`VIEWING CHANNEL
`FAVORITE CHANNEL
`CHANNEL LISTINGS
`CATEGORIES
`EXIT
`
`PLEASE MAKE SELECTION.
`
`Page 14
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`
`Jun. 30, 1998
`
`Sheet 14 of 28
`
`5,774,859
`
`FIG. 12C
`
`
`
`VIEWING GUIDE
`
`1
`2.
`3.
`
`4.
`
`TUNE TWTO PROGRAM
`RECORD PROGRAM
`PROVIDE INFORMATION
`ON PROGRAM
`EXIT
`
`PLEASE MAKE SELECTION.
`
`FIG. 12D
`
`
`
`YOU"WE CHOSEN:
`ESPN CH 17
`SEE THE WORLD'S FINEST GOLFER'S
`NFIRST ROUND ACTION AT THE
`U.S. OPEN.
`STARTS: 7:00PM ENDS: 7:30PM
`
`RECORD:
`1. ONE-TIME
`2. DALY
`3. WEEKLY
`4. EXIT
`
`PLEASE MAKE SELECTION.
`
`Page 15
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`
`Jun. 30, 1998
`Sheet 15 of 28
`FIG. 12E
`
`5,774,859
`
`VCR IS SCHEDULED TO RECORD:
`
`1 3/5/94 CH 17 7:00 PM TO 7:30PM
`
`FIG. 12F
`
`
`
`FAVORITE CHANNEL GUIDE
`7:00 PM 7:30 PM 8:00 PM 8:30 PM
`KABC TERMIN
`16
`ATOR 2
`
`WRESTLING
`
`ESPN
`
`17
`
`DIS
`29
`
`GOLF
`
`WIMBELDON
`
`BOWLNG
`
`JEOPARDY
`
`NATIONAL
`GE6RAEC
`
`MICKEY
`M6SE
`
`CURRENT TIME: 6:51 PM
`
`DATE: 3/5/94
`
`Page 16
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`
`Jun. 30, 1998
`
`Sheet 16 of 28
`
`5,774,859
`
`FIG. 12G
`
`19 TMC
`29 DIS
`
`14 WXIA
`15 HBO
`16 KABC
`17 ESPN
`18 SHOW
`
`1, WEWING GUIDE
`2. MAINMENU
`3. EXIT
`
`PLEASE MAKE SELECTION.
`
`
`
`FIG. 12H
`
`CATEGORIES
`
`CHILDREN/EDUCATION/SCIENCE
`DRAMAVROMANCE
`ENTERTAINMENT/VARIETY
`MOVIES
`NEWS/TALK/MAGAZINE
`SPECIALS/MN-SERIES
`PRE-RECORDED SPORTS
`LIVE SPORTS
`ACTION/MYSTERY/HORROR
`COMEDY
`MSCELLANEOUS
`EXIT
`
`g
`
`PLEASE MAKE SELECTION.
`
`Page 17
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`
`Jun. 30, 1998
`
`Sheet 17 of 28
`
`5,774,859
`
`
`
`
`
`FIG. 12
`
`LIVE SPORTS
`
`BASEBALL
`BASKETBALL
`SOCCER
`MORE LISTINGS
`EXIT
`
`PLEASE MAKE SELECTION.
`
`FIG. 12J
`
`BASEBALL
`
`CH
`
`PROGRAMNAME
`
`1. O2
`2. O7
`3. EXIT
`
`CUBS VSMETS
`BRAVES VS REDS
`
`PLEASE MAKE SELECTION.
`
`Page 18
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`
`Jun. 30, 1998
`
`Sheet 18 of 28
`
`5,774,859
`
`FIG. 12K
`
`VCR PROGRAMTIMER
`
`USE PLUS CODES
`SET PROGRAMTIMER
`CHANGE PROGRAMTIMER
`CLEAR PROGRAMTIMER
`EXIT
`
`PLEASE MAKE SELECTION.
`
`FIG. 12L
`
`
`
`PLUS CODES
`
`STATE THE CODEONE DIGIT
`ATA TIME
`
`STATE "OK"WHEN YOU ARE DONE.
`STATE "EXIT" TOEXIT.
`
`Page 19
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`
`Jun. 30, 1998
`
`Sheet 19 of 28
`
`5,774,859
`
`FIG. 12M
`
`PROGRAMTIMER
`
`1.
`2.
`3.
`
`EPG PROGRAMVCR
`VOICE PROGRAMVCR
`EXIT
`
`PLEASE MAKE SELECTION.
`
`FIG. 12N
`
`
`
`SELECT PROGRAM NUMBER
`
`Page 20
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`
`Jun. 30, 1998
`
`Sheet 20 0f 28
`
`5,774,859
`
`FIG. 120
`
`STATE
`
`"DAY."
`
`EXAMPLE: SATURDAY
`
`FIG. 12P
`
`
`
`PROGRAM
`
`RECORD ONJUNE 11, 1994
`STATE:
`
`"STARTHOUR, START MINUTE,
`AMORPM"
`
`EXAMPLE: "EIGHT ZEROZERO PM"
`
`Page 21
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`
`Jun. 30, 1998
`
`Sheet 21 of 28
`
`5,774,859
`
`FIG. 120
`
`START TIME: 9:05 PM
`
`STATE:
`"STOP HOUR, STOP MINUTE, AMORPM"
`
`FIG. 12R
`
`
`
`STOPTIME: 10:35 PM
`
`STATE: CHANNEL
`
`Page 22
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`
`Jun. 30, 1998
`
`Sheet 22 of 28
`
`5,774,859
`
`FIG. 12S
`
`PROGRAM 1
`
`9:05 PM
`START TIME:
`10:35 PM
`STOPTIME:
`DATE: JUNE 11, 1994
`CHANNEL: O7
`SPEED
`1. SP
`2. EP
`3. LP
`
`PLEASE MAKE SELECTION.
`
`FIG. 12T
`
`
`
`PROGRAM
`
`9:05 PM
`START TIME:
`10:35 PM
`STOPTIME:
`DATE: JUNE 11, 1994
`CHANNEL: O7
`SPEED: SP
`
`STATE "OK" F THIS
`INFORMATION IS CORRECT.
`
`Page 23
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`
`Jun. 30, 1998
`
`Sheet 23 of 28
`
`5,774,859
`
`FIG. 12U
`
`
`
`YOUR VCR WILL TAPE THE
`PROGRAMMING ON:
`
`JUNE 11, 1994
`FROM 9:05 TO 10:35 PM
`ON CHO7
`ATSPSPEED
`
`Page 24
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`
`Jun. 30, 1998
`
`Sheet 24 of 28
`
`5,774,859
`
`FIG. 13A
`
`SEARCH.
`
`FIG. 13B
`
`
`
`SEARCH.
`ONLY FOOTBALL
`
`47 TEMS HAVE BEEN FOUND.
`
`Page 25
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`
`Jun. 30, 1998
`
`Sheet 25 0f 28
`
`5,774,859
`
`FIG. 13C
`
`SEARCH.
`ONLY FOOTBALL
`ONLY SPORTS
`
`42 ITEMS HAVE BEEN FOUND.
`
`FIG. 13D
`
`
`
`SEARCH.
`ONLY FOOTBALL
`ONLY SPORTS
`ONLY TODAY
`
`23 ITEMS HAVE BEEN FOUND.
`
`Page 26
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`
`Jun. 30, 1998
`
`Sheet 26 of 28
`
`5,774,859
`
`FIG. 13E
`
`SEARCH.
`
`ONLY FOOTBALL
`ONLY SPORTS
`ONLY TODAY
`ONLY AFTERELEVEN AM
`
`19 TEMS HAVE BEEN FOUND.
`
`FIG. 13F
`
`
`
`SEARCH.
`
`ONLY FOOTBALL
`ONLY SPORTS
`ONLY TODAY
`ONLY AFTER ELEVEN AM
`ONLY BEFORE SIX PM
`
`12 ITEMS HAVE BEEN FOUND.
`
`Page 27
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`
`Jun. 30, 1998
`
`Sheet 27 of 28
`
`5,774,859
`
`
`
`
`
`FIG. 130
`
`LIST
`
`PENN STATE - NOTRE DAME
`CHANNEL 10
`12:30 PM
`OHIO STATE - MICHIGAN
`CHANNEL 5
`1:00 PM
`
`es
`
`es
`
`MORE
`EXIT
`
`PLEASE MAKE SELECTION.
`
`FIG. 14
`
`PAY-PER-VIEW
`
`TERMINATOR 2
`FORREST GUMP
`THE LIONKING
`PULP FICTION
`SPEED
`MORE
`EXIT
`
`PLEASE MAKE SELECTION.
`
`Page 28
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`U.S. Patent
`
`Jun. 30, 1998
`
`Sheet 28 of 28
`
`5,774,859
`
`B01AB0
`
`
`
`
`
`Page 29
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`1
`INFORMATION SYSTEM HAVING A
`SPEECH INTERFACE
`
`5,774,859
`
`2
`ing an On-Screen User Interface for a Subscription Televi
`Sion Terminal', incorporated herein by reference, can
`greatly enhance the ability of a user to acceSS available
`features. The System described in that application signifi
`cantly reduces the number of key presses and data entries
`required to utilize available features, moreover, a key pad is
`designed in Such a way as to group keys together in a manner
`to facilitate their use. However, even with Such an improved
`interface, certain tasks are simply not well-Suited for execu
`tion using a keypad on a Subscriber terminal unit or an
`asSociated remote control. For example, while a Subscriber
`might be very interested in using an EPG to determine the
`channel number and time for the next showing of a particular
`television show or to find all movies being shown in the next
`week Starring a particular actress, Such tasks are not easily
`implemented using menu Screens and key presses.
`
`SUMMARY OF THE INVENTION
`The present invention adds a Speech recognition interface
`to a Subscriber terminal unit in an information System for
`implementing Spoken control of electronic devices at the
`Subscriber location and of access to information transmitted
`to the Subscriber terminal unit. The Speech recognition
`interface enhances the interface of the Subscriber with the
`information System by allowing control using language
`which is naturally spoken by the subscriber. Such natural
`language control affords ease of use as well as permitting the
`implementation of tasks which are not easily implemented
`using menu Screens and key presses.
`In accordance with one aspect of the instant invention, a
`System for controlling a device includes a first receiver for
`receiving utterances of a Speaker, a Second receiver for
`receiving vocabulary data defining a vocabulary of
`utterances, and a processor for executing a speech recogni
`tion algorithm using the received vocabulary data to recog
`nize the utterances of the Speaker and for controlling the
`device in accordance with the recognized utterances of the
`Speaker.
`In accordance with another aspect of the invention, a
`method for controlling a device includes the Steps of receiv
`ing utterances of a speaker, receiving vocabulary data defin
`ing a Vocabulary of utterances, using the received Vocabu
`lary data to recognize the utterances of the Speaker, and
`controlling the device in accordance with the recognized
`utterances of the Speaker.
`In accordance with another aspect of the invention, a
`System for controlling access to broadcast information
`includes a first receiver for receiving utterances of a Speaker,
`a Second receiver for receiving vocabulary data defining a
`Vocabulary of utterances, and a processor for executing a
`Speech recognition algorithm using the received vocabulary
`data to recognize the utterances of the Speaker and for
`controlling access to the broadcast information in accor
`dance with the recognized utterances of the Speaker.
`In accordance with another aspect of the invention, a
`method for controlling access to broadcast information
`includes the Steps of receiving utterances of a Speaker,
`receiving vocabulary data defining a vocabulary of
`utterances, using the received vocabulary data to recognize
`the utterances of the Speaker, and controlling access to the
`broadcast information in accordance with the recognized
`utterances of the Speaker.
`In accordance with yet another aspect of the invention, a
`System for controlling a device and for controlling access to
`broadcast information includes a first receiver for receiving
`utterances of a speaker, a Second receiver for receiving
`
`15
`
`25
`
`BACKGROUND OF THE INVENTION
`1. Technical Field
`The present invention generally relates to an information
`System having a speech interface and, more particularly, to
`a Subscription information System including a Subscriber
`terminal unit which provides for control of electronic
`devices and access to information Services by Spoken com
`mands.
`2. Description of the Related Art
`The volume of information and number of features being
`made available to Subscribers in information Systems. Such as
`Subscription television Systems are continuously increasing.
`For example, the advent of optical fiber networks and links
`to telecommunication networks enables an almost limitleSS
`Supply of information and features to be provided to Sub
`Scribers. Already Subscription television Systems are being
`used for addressed message transmissions, pay-per-view
`programming, impulse pay-per-View programming, favorite
`channel features, and parental control. Cable television
`System operators may also provide text Services Such as
`news articles, Sports Scores, Stock market quotations, and
`weather reports. Relatively new features Such as Video game
`delivery, wide area network access, home Shopping, travel
`reservations Service, home banking, energy management,
`Video conferencing and other Such Services are also being
`provided.
`Recently, Some companies have begun offering electronic
`television Schedules for purchase by Subscription television
`providers who, in turn, provide the Schedules to their Sub
`Scribers. The Schedules contain a listing of television pro
`grams by time and channel, and are typically displayed as a
`35
`grid on the television screen. Such a system is shown in PCT
`Publication No. WO 92/04801, applied for by Insight
`Telecast, Inc. (hereinafter “the Insight System’) and incor
`porated herein by reference. That publication describes a
`display System for maneuvering a cursor through a televi
`Sion Schedule. The System also describes that a Subscriber
`can choose a program for recording by moving a cursor to
`the Selected program; the System will automatically program
`the Subscriber's video cassette recorder.
`Also recently, So-called near-video-on-demand Services
`have been introduced. These Services provide a simulation
`of the Videotape rental experience. One Such System is
`described in Walter, U.S. Pat. No. 5,168,353, incorporated
`herein by reference. Walker describes in column 4, lines
`18-27 that a converter may be enabled to descramble two
`channels carrying the same program at Staggered intervals.
`This allows returning to the same portion of the video after
`a pause equal to the Staggered time interval. The Walker
`patent gives as an example a Subscriber ordering a program
`that begins at 8:00 p.m. The converter unscrambles the 8:00
`showing as well as the 8:15 showing So that the viewer may
`return to the same point in the program after a fifteen minute
`break.
`This increase in information and features has generally
`resulted in a corresponding increase in the complexity of the
`subscriber's interface with the system. Since the subscrib
`er’s “comfort level” with an interface for accessing infor
`mation and features will often determine whether the Sub
`Scriber will utilize the information and features, there is a
`need for user-friendly interfaces. An interface Such as that
`described in commonly assigned copending application Ser.
`No. 07/800,836 entitled “Method and Apparatus for Provid
`
`40
`
`45
`
`50
`
`55
`
`60
`
`65
`
`Page 30
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`5,774,859
`
`3
`Vocabulary data defining a Vocabulary of utterances, and a
`processor for executing a speech recognition algorithm
`using the received vocabulary data to recognize the utter
`ances of the Speaker and for controlling the device and the
`access to the broadcast information in accordance with the
`recognized utterances of the Speaker.
`In accordance with yet another aspect of the invention, a
`method for controlling a device and for controlling access to
`broadcast information includes the Steps of receiving utter
`ances of a Speaker, receiving vocabulary data defining a
`Vocabulary of utterances, using the received vocabulary data
`to recognize the utterances of the Speaker, and controlling
`the device and the access to the broadcast information in
`accordance with the recognized utterances of the Speaker.
`In accordance with yet another aspect of the invention, a
`System for controlling a device includes a transmitter for
`transmitting vocabulary data for defining a vocabulary of
`utterances, and a terminal unit including a first receiver for
`receiving utterance data of a speaker, a Second receiver for
`receiving the transmitted vocabulary data, and a processor
`for executing a speech recognition algorithm using the
`received vocabulary data to recognize utterances of the
`Speaker and for controlling Said device in accordance with
`the recognized utterances of the Speaker.
`In accordance with yet another aspect of the invention, a
`method for controlling a device includes the Steps of trans
`mitting vocabulary data for defining a vocabulary of
`utterances, receiving utterance data of a Speaker, receiving
`the transmitted vocabulary data, using the received Vocabu
`lary data to recognize utterances of the Speaker, and con
`trolling the device in accordance with the recognized utter
`ances of the Speaker.
`In accordance with Still another aspect of the invention, a
`System for controlling access to broadcast information
`includes a transmitter for transmitting vocabulary data defin
`ing a vocabulary of utterances and a terminal unit including
`a first receiver for receiving utterance data of a speaker, a
`Second receiver for receiving the transmitted vocabulary
`data, and a processor for executing a speech recognition
`algorithm using the received vocabulary data to recognize
`utterances of the Speaker and for controlling access to the
`broadcast information in accordance with the recognized
`utterances of the Speaker.
`In accordance with Still another aspect of the invention, a
`method for controlling access to broadcast information
`includes the Steps of transmitting vocabulary data for defin
`ing a vocabulary of utterances, receiving utterance data of a
`Speaker, receiving the transmitted Vocabulary data, using the
`received vocabulary data to recognize utterances of the
`Speaker, and accessing the information in accordance with
`the recognized utterances of the Speaker.
`In accordance with Still another aspect of the invention, a
`System for controlling a device and for controlling access to
`broadcast information includes a transmitter for transmitting
`Vocabulary data defining a vocabulary of utterances and a
`terminal unit including a first receiver for receiving utter
`ance data of a speaker, a Second receiver for receiving the
`transmitted vocabulary data, and a processor for executing a
`Speech recognition algorithm using the received vocabulary
`data to recognize utterances of the Speaker and for control
`ling the device and the access to the broadcast information
`in accordance with the recognized utterances of the Speaker.
`In accordance with Still another aspect of the invention, a
`method for controlling a device and for controlling access to
`broadcast information includes the Steps of transmitting
`Vocabulary data for defining a vocabulary of utterances,
`
`15
`
`25
`
`35
`
`40
`
`45
`
`50
`
`55
`
`60
`
`65
`
`4
`receiving utterance data of a Speaker, receiving the trans
`mitted vocabulary data, using the received Vocabulary data
`to recognize utterances of the Speaker, and controlling the
`device and the access to the information in accordance with
`the recognized utterances of the Speaker.
`In accordance with Still another aspect of the invention, a
`System for controlling a television receiver to allow user
`Selection of broadcast programs from program Schedule
`information includes a first receiver for receiving the pro
`gram Schedule information, a Second receiver for receiving
`utterances of a user, and a processor for executing a speech
`recognition algorithm to recognize the utterances of the user
`and for Selecting broadcast programs from the program
`Schedule information in accordance with the recognized
`utterances of the user.
`In accordance with another aspect of the invention, a
`remote control for transmitting utterance data to a receiver
`includes a microphone for receiving utterances of a Speaker,
`an analog-to-digital converter for converting utterance Sig
`nals from the microphone to digital utterance Signals, a
`digital signal processor for extracting time-variant spectral
`characteristics of the digital utterance Signals, and a trans
`mitter for transmitting the time-variant spectral characteris
`tics to the receiver.
`In accordance with Still another aspect of the invention, a
`Subscription television System includes a head-end installa
`tion for transmitting vocabulary data for defining a Vocabu
`lary of spoken commands and a Subscriber terminal unit
`connected to a television. The Subscriber terminal unit
`includes a first receiver for receiving spoken command data,
`a second receiver for receiving the vocabulary data trans
`mitted from the head-end installation, and a processor for
`executing a Speech recognition algorithm using the Vocabu
`lary data to recognize spoken commands and for controlling
`the television in accordance with the recognized spoken
`commands.
`In accordance with another aspect of the invention, an
`information broadcasting center for broadcasting informa
`tion to at least one terminal unit includes a first transmitter
`for transmitting text information and a Second transmitter for
`transmitting phonemic data corresponding to at least a
`portion of the text information.
`In accordance with another aspect of the invention, a
`Subscriber terminal unit includes a tuner for tuning channels
`of a television signal, each channel carrying a respective
`program identified by a program title, a memory for Storing
`a speech recognition algorithm for recognizing spoken pro
`gram titles, and a processor for executing the Speech rec
`ognition algorithm to recognize a program title spoken by a
`Subscriber and for generating a tuner control Signal for
`controlling the tuner to tune to the channel carrying the
`program corresponding to the recognized program title.
`These and other features and advantages of the present
`invention will be better understood from a reading of the
`following detailed description in conjunction with the
`accompanying drawings.
`
`BRIEF DESCRIPTION OF THE DRAWINGS
`FIG. 1 is a generalized block diagram of an information
`System in accordance with the instant invention.
`FIGS. 2A, 2B, and 2C are a block diagram of a subscrip
`tion television System in which the instant invention is
`implemented.
`FIG. 3 is a detailed block diagram of Subscriber terminal
`unit 160 shown in FIG. 2C.
`
`Page 31
`
`AMAZON 1004
`Amazon v. SpeakWare
`IPR2019-00999
`
`
`
`5,774,859
`
`S
`FIG. 4 is a block diagram of a first hardware arrangement
`of the Speech recognition interface of the instant invention.
`FIG. 5 is a block diagram of a Second hardware arrange
`ment of the Speech recognition interface of the instant
`invention.
`FIG. 6 is a block diagram of a third hardware arrangement
`of the Speech recognition interface of the instant invention.
`FIG. 7 is a block diagram of a fourth hardware arrange
`ment of the Speech recognition interface of the instant
`invention.
`FIG. 8 is a block diagram of a fifth hardware arrangement
`of the Speech recognition interface of the instant invention.
`FIG. 9 is an illustration of remote control 166 shown in
`FIG. 2C.
`FIG. 10 is a diagram of a volume line bar.
`FIG. 11 is a diagram of an electronic programming guide
`grid.
`FIGS. 12A-12U are diagrams illustrating Screens gener
`ated during the Speech recognition operation.
`FIGS. 13 A-13G are diagrams illustrating additional
`Screens generated during the Speech recognition operation.
`FIG. 14 is a diagram illustrating an additional Screen
`generated during the Speech recognition operation.
`FIG. 15 is a block diagram of another system in which the
`Speech recognition interface of the instant invention may be
`utilized.
`
`DETAILED DESCRIPTION
`The following conventions will be used throughout this
`detailed description. The <>symbols will be used to denote
`keypad commands input via the keypads to be described
`below, e.g., <Enter>. Quotation marks and capital letters will
`be used to designate spoken commands, e.g., “RECORD’ or
`“POWER ON.” Capital letters and italics will be utilized to
`indicate variables Such as channel numbers, etc.
`FIG. 1 is a generalized block diagram of an information
`system 10 in accordance with the instant invention. Infor
`mation system 10 includes an information distribution center
`12 which receives information from one or more remotely
`located information providers 14-1, . . . , 14-n and Supplies
`or broadcasts this information to a terminal unit 16. “Infor
`mation” as used herein includes, but is not limited to, analog
`Video, analog audio, digital video, digital audio, text Services
`Such as news articles, Sports Scores, Stock market quotations,
`and weather reports, electronic messages, electronic pro
`gram guides, database information, Software including game
`programs, and wide area network data. Alternatively or in
`addition, information distribution center 12 may locally
`generate information and Supply this locally generated infor
`mation to terminal unit 16.
`The information transmitted by information distribution
`center 12 to terminal unit 16 includes vocabulary data
`representative of a vocabulary of Spoken Sounds or words
`(“utterances”). This vocabulary provides, for example, for
`spoken control of a device 18 and for spoken control of
`access to the information transmitted by information distri
`bution center 12. Specifically, terminal unit 16 receives
`vocabulary data from information distribution center 12 and
`speech ("utterance”) data from a user. Terminal unit 16
`includes a processor for executing a speech recognition
`algorithm for comparing the Vocabulary data and the Spoken
`command data to recognize, for example, commands for
`controlling device 18 or commands for accessing informa
`tion transmitted by information distribution center 12. Ter
`
`6
`minal unit 16 then appropriately generates a command for
`controlling device 18 or for accessing information transmit
`ted by information distribution center 12. AS used herein, a
`Speech recognition algorithm refers to an algorithm which
`converts spoken audio input into text or corresponding
`commands. A speaker Verification algorithm refers to an
`algorithm which Verifies the claimed identity of a speaker
`based upon a Sample of the claimant's Speech. A speaker
`identification algorithm refers to an algorithm which iden
`tifies a speaker from a list of previously Sampled alternatives
`based upon audio input from a speaker. A Speaker identifi
`cation algorithm may be used, for example, to limit the
`ability to control the device and/or access information to
`particular Speakers.
`The vocabulary data transmitted from information distri
`bution center 12 to terminal unit 16 may, for example, be
`phoneme data. A phoneme is a member of the Set of the
`Smallest units of Speech that Serve to distinguish one utter
`ance from another in a language or dialect. Each Sound or
`spoken word in the Vocabulary may thus be represented by
`a combination of phonemes. Alternatively, the Vocabulary
`data may be template data generated by having a perSon or
`perSons Speak each Sound or word. Each spoken Sound or
`word in the Vocabulary may thus be represented by a
`respective corresponding template. It should be noted that
`although the system of FIG. 1 illustrates a system in which
`information from information providers 14-1,..., 14-n and
`the Vocabulary data are transmitted over the same commu
`nication link, the invention is not limited in this respect.
`Thus, information from information service providers 14-1,
`. . . , 14-n and the Vocabulary data may be transmitted over
`different communications links.
`Many different arrangements may be utilized to provide
`the speech data to terminal unit 16. In a first illustrative, but
`non-limiting, arrangement, a remote control is provided
`which includes a wireleSS microphone or related transducer
`for transmitting Sounds or words Spoken by a user to
`terminal unit 16 via electrical, optical, or radio frequency
`Signals. Terminal unit 16 then includes a receiver, an analog
`front end for conditioning the received signal, a codec for
`perform