`WORLD INTELLECTUAL PROPERTY ORGANIZATION
`International Bureau
`INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT)
`WO 00/21194
`(51) International Patent Classification 6 :
`(11) International Publication Number:
`H03G 3/20, H04B 15/00, H04R 25/00
`
`Al
`
`(43) International Publication Date:
`
`13 April 2000 (13.04.00)
`
`(21) International Application Number:
`
`PCT/US99/23234
`
`(22) International Filing Date:
`
`5 October 1999 (05.10.99)
`
`(81) Designated States: AU, CA, JP, European patent (AT, BE,
`CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC,
`NL, PT, SE).
`
`(30) Priority Data:
`09/168,101
`
`8 October 1998 (08.10.98)
`
`Published
`With international search report.
`
`US
`
`(71) Applicant: RESOUND CORPORATION [US/US]; Seaport
`Centre, 220 Saginaw Drive, Redwood City, CA 94063 (US).
`
`(72) Inventors: PUTHUFF, Steven, H.; 13001 Saratoga-Sunnyvale
`Road, Saratoga, CA 95070 (US). T AENZER, Jon, C.; 888
`Laverne Way, Los Altos, CA 94022-1107 (US). GEDDES,
`Earl, R.; 1388 Medinah Drive, Itasca, IL 60143 (US).
`
`(74) Agent: KREBS, Robert, E.; Bums, Doane, Swecker & Mathis,
`LLP, P.O. Box 1404, Alexandria, VA 22313-1404 (US).
`
`(54) Title: DUAL-SENSOR VOICE TRANSMISSION SYSTEM
`
`PD Mic
`
`PK.DET.&
`THRESH.
`
`,,
`
`~
`
`(0
`
`a,.o
`
`(57) Abstract
`
`A pickup system (10) utilizes both sound pressure disturbances sensed by a microphone (20) and bone conduction sensed by a
`vibration sensor (30) such as an accelerometer, velocity sensor or displacement sensor to pick up and faithfully transmit a talker's voice
`despite acoustically noisy ambient environments. The system senses the microphone signal amplitude and adjusts the gain of associated
`amplification circuitry (23, 26) to effectively and smoothly fade in the microphone signal and fade out the vibration sensor signal as ambient
`noise increases. The pickup system (10) may be part of a stand alone ear level communication transmitter worn behind the talker's ear, or
`it may be implemented in a telephone system or other communication systems.
`
`- i -
`
`Sony v. Jawbone
`
`U.S. Patent No. 8,467,543
`
`Sony Ex. 1007
`
`
`
`FOR THE PURPOSES OF INFORMATION ONLY
`
`Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT.
`
`AL
`AM
`AT
`AU
`AZ
`BA
`BB
`BE
`BF
`BG
`BJ
`BR
`BY
`CA
`CF
`CG
`CH
`CI
`CM
`CN
`cu
`CZ
`DE
`DK
`EE
`
`Albania
`Annenia
`Austria
`Australia
`Azerbaijan
`Bosnia and Herzegovina
`Barbados
`Belgium
`Burkina Faso
`Bulgaria
`Benin
`Brazil
`Belarus
`Canada
`Central African Republic
`Congo
`Switzerland
`COte d'Ivoire
`Cameroon
`China
`Cuba
`Czech Republic
`Gennany
`Denmark
`Estonia
`
`ES
`Fl
`FR
`GA
`GB
`GE
`GH
`GN
`GR
`HU
`IE
`IL
`IS
`IT
`JP
`KE
`KG
`KP
`
`KR
`KZ
`LC
`LI
`LK
`LR
`
`Spain
`Finland
`France
`Gabon
`United Kingdom
`Georgia
`Ghana
`Guinea
`Greece
`Hungary
`Ireland
`Israel
`Iceland
`Italy
`Japan
`Kenya
`Kyrgyzstan
`Democratic People's
`Republic of Korea
`Republic of Korea
`Kazakstan
`Saint Lucia
`Liechtenstein
`Sri Lanka
`Liberia
`
`LS
`LT
`LU
`LV
`MC
`MD
`MG
`MK
`
`ML
`MN
`MR
`MW
`MX
`NE
`NL
`NO
`NZ
`PL
`PT
`RO
`RU
`SD
`SE
`SG
`
`Lesotho
`Lithuania
`Luxembourg
`Latvia
`Monaco
`Republic of Moldova
`Madagascar
`The fonner Yugoslav
`Republic of Macedonia
`Mali
`Mongolia
`Mauritania
`Malawi
`Mexico
`Niger
`Netherlands
`Norway
`New Zealand
`Poland
`Portugal
`Romania
`Russian Federation
`Sudan
`Sweden
`Singapore
`
`SI
`SK
`SN
`sz
`TD
`TG
`TJ
`TM
`TR
`TT
`UA
`UG
`us
`uz
`VN
`YU
`zw
`
`Slovenia
`Slovakia
`Senegal
`Swaziland
`Chad
`Togo
`Tajikistan
`Turkmenistan
`Turkey
`Trinidad and Tobago
`Ukraine
`Uganda
`United States of America
`Uzbekistan
`Viet Nam
`Yugoslavia
`Zimbabwe
`
`- ii -
`
`
`
`WO 00/21194
`
`PCT /US99/23234
`
`DUAL-SENSOR VOICE TRANSMISSION SYSTEM
`
`BACKGROUND OF THE INVENTION
`FIELD OF THE INVENTION
`
`The invention relates to two-way communication devices, and more
`particularly, to ear level voice pickups used in two-way communications.
`
`5
`
`DESCRIPTION OF RELATED ART
`
`Existing ear level pickup technology uses a conventional microphone to
`sense and relay the wearer's voice to a listener at a remote location. The
`microphone responds to sound signal air pressure disturbances and converts these
`to a representative electrical signal which is then provided to a transmitter. The
`transmitter relays the electrical signal, via any appropriate means such as radio
`waves or telephone lines, to a receiver at a remote location, where the electrical
`signal is reconverted, using a loud speaker, to an audible sound signal for hearing
`by the listener at the remote location.
`
`A problem afflicting such systems is their susceptibility to degradation by
`high ambient noise levels present in the environment. When the pickup wearer is
`immersed in a noisy environment, the environmental sounds interfere with the
`talker's voice and, above certain levels, overshadow that voice and frustrate
`intelligibility at the remote location. As these systems are often targeted for
`wearers whose lines of work necessitate exposure to high ambient noise levels--
`e.g., government field agents such as secret service employees assigned to
`
`10
`
`15
`
`20
`
`- 1 -
`
`
`
`WO 00/21194
`
`PCT/US99/23234
`
`-2-
`
`protect officials and to communicate in the midst of large crowds or during
`assassination attempts or other crises where panic-struck crowds are particularly
`noisy or firefighters dispatched to sites of roaring conflagrations, or rescuers
`beneath a hovering helicopter--it is particularly important that the systems be
`designed to adequately reject unwanted noise and maintain the requisite
`functionality permitting clear communication between talker and listener.
`
`One obvious approach to the suppression of ambient noise pickup has
`been to use a near-field microphone, such as a boom microphone, disposed in
`close proximity to the talker's mouth. With an appropriate mechanical or
`electronic microphone configuration, sounds that originate at a distance to the
`microphone can be de-emphasized, such that only sounds originating in the
`vicinity of the microphone--i.e., those from the talker's mouth--are picked up for
`transmission. However, there are disadvantages to such configurations,
`including the need to place the microphone close to the mouth, thereby detracting
`from the discreetness and convenience of the device while increasing its size and
`cost and the number of components used. Additionally, the user of the
`microphone must rotate the microphone into place before speaking and must be
`careful not to knock the headset supporting the microphone out of position or off
`his head during wear. All these problems detract from the utility of the device.
`
`Another known type of sound pickup device relies on bone conduction. It
`is recognized that a talker's voice reaches the talker's own ear not only through
`the normal pathway of air pressure waves, but also through conduction by the
`jaw and other bones of the skull as induced by vibrations from the talker's sound-
`generating apparatus during speech. Bone conduction involves conveyance of
`speech-induced vibrations directly to the ear canal, or, for that matter, to any part
`
`5
`
`10
`
`15
`
`20
`
`25
`
`- 2 -
`
`
`
`WO 00/21194
`
`PCT/US99/23234
`
`-3-
`
`of a person's body supported by bone structure. Since the bones provide a direct
`pathway from the sound source--that is, from the larynx and vocal chords for
`example--a device which is disposed against suitable bone structures of the talker
`so as to detect the bone vibrations is able to circumvent a noisy acoustic
`environment.
`
`Another advantage to bone conduction derives from a talker's natural
`reaction to high ambient noise levels. Under normal circumstances, bone
`conduction is a poor medium for high frequency, low energy sound transmission,
`but is an excellent medium for low frequency, high energy transmission. Raising
`the level of speech, by louder speaking in reaction to a noisy environment,
`imparts greater energy to the bones--especially information-bearing consonant
`energy which normally lies at the higher frequencies and to which, all things
`being equal, air pressure conduction is normally better suited than bone
`conduction. The talker, by raising his or her voice, is disproportionately raising
`the sound level of the consonant component of speech, to which is attributed 80-
`90% of intelligibility, and is thus naturally compensating for the normally
`consonant-unfriendly nature of bone conduction by imparting greater energy
`precisely at the sound frequencies at which bone conduction is poor.
`
`The prior art has used vibration transducers and even accelerometers to
`detect bone vibration. To that end, throat microphones were first introduced in
`WWII and worn by bomber pilots who needed to communicate in extremely
`noisy environments. The throat microphones picked up vibrations at the talker's
`throat and provided a clearer representation of his voice than a sound pressure
`microphone, which would have picked up the overshadowing ambient acoustic
`noise as well. Such devices were crude and the quality of their signal poor.
`
`5
`
`10
`
`15
`
`20
`
`25
`
`- 3 -
`
`
`
`WO 00/21194
`
`PCT /US99/23234
`
`-4-
`
`A device known as the Voiceducer™, manufactured by Tempco™, also
`takes advantage of bone conduction. It is placed in the user's ear canal and
`operates to pick up vibrations induced therein by the user's speech apparatus as
`described above. Similarly, Andrea Electronics™ manufactures a device having
`two conventional air pressure microphones, each oriented differently from the
`other, and one of which is designed to pick up air pressure vibrations in the ear
`canal whose source is in fact due to bone vibration. The dual conversion relied
`upon by this device produces a further degradation of performance. Other prior
`art bone conduction pickup apparatus include bone vibration sensors disposed
`directly atop the user's head, inertial microphones, and ceramic microphones
`operating without diaphragms.
`
`SUMMARY OF THE INVENTION
`
`The present invention exploits the phenomenon of bone conduction to
`suppress the deleterious effects of high acoustic noise environments without
`suffering the degradation of voice pickup quality in low noise environments
`demonstrated by prior art bone conduction pickup systems. The above-described
`human speech characteristics are exploited by providing a device which
`adaptively senses sound through bone conduction when acoustic noise levels are
`high. As discussed above, bone conduction alone is generally a poor expedient
`for transfer of intelligible speech, and in an acoustically quiet environment an air
`pressure microphone is much better suited to picking up a talker's voice. It is
`only when acoustic noise levels are high that the quality of air pressure
`microphone detection is degraded enough to make bone conduction a better
`alternative. The device in accordance with the invention accordingly relies on
`both bone conduction and air pressure vibration to pick up the talker's voice.
`
`5
`
`10
`
`15
`
`20
`
`25
`
`- 4 -
`
`
`
`WO 00/21194
`
`PCT /US99/23234
`
`-5-
`
`The bone conducted speech is detected using a motion transducer comprising a
`vibration sensor, and specifically, in a preferred embodiment, an accelerometer,
`although use of other types of vibration sensors, such as displacement and
`velocity sensors, is also contemplated. When ambient noise levels are low,
`sound detection is effected primarily through a conventional microphone, which
`may comprise, inter alia, a directional microphone, a pressure difference
`microphone or an omni-directional microphone. An adjustable transition circuit
`is provided for fading out the conventional microphone signal and fading in the
`bone vibration sensor signal with an increase in ambient noise levels.
`
`The transition circuit, in one embodiment, combines the signals from the
`microphone and vibration sensor, sensing the average amplitude of the
`microphone signal and inversely adjusting the gain of pre-amplifiers and/or
`amplifiers connected to the outputs of the vibration sensor and microphone. As
`the average amplitude of the microphone signal increases relative to a
`predetermined threshold, due to increased ambient noise levels, the gain of the
`vibration sensor is increased while that of the microphone is decreased.
`Alternatively, the gain adjustment can be controlled by sensing the overall noise
`level detected by the vibration sensor rather than the microphone. Gain
`adjustment is effected such that the amplitude of the overall output signal is
`maintained at a smoothly changing level throughout the transition range.
`
`In a second, preferred embodiment, the signal from the microphone is
`amplified linearly up to a predetermined level only, after which the signal level is
`limited and prevented from further increase. The signal from the vibration
`sensor, initially amplified at a lower gain level than that of the microphone, is not
`limited and eventually, after the predetermined level, surpasses the limited
`
`5
`
`10
`
`15
`
`20
`
`25
`
`- 5 -
`
`
`
`WO 00/21194
`
`PCT/US99/23234
`
`-6-
`
`microphone signal and comes to dominate, thereby improving system
`intelligibility and increasing immunity to ambient noise effects. Such a
`configuration can be implemented using available sound processing circuitry and
`off-the-shelf technology, at reduced production costs and power consumption
`requirements.
`
`An additional advantage of the invention is its particular suitability to
`voice-activated applications, as the bone conduction expedient improves system
`immunity to false triggering due to noisy environments. The invention realizes
`power conservation advantages by for example turning off a radio transmitter in
`the absence of the user's voice.
`
`Yet another advantage of the invention derives from the use of the two
`modalities of sound transfer--namely, air pressure waves and bone conduction.
`The two modalities permit the use of signal correlation techniques to enhance the
`voice signal, thus improving overall system performance. Characteristics of the
`loudness of speech derived from signals from either or both modalities can be
`used to control the system of the invention. For example the amplitude envelope
`of the bone conducted signal can be used to dynamically vary the gain of the
`microphone amplifier, so that speech portions of the microphone signal are
`enhanced while noise portions are suppressed. These characteristics include
`signal RMS value, peak value, average value, and energy level.
`
`5
`
`10
`
`15
`
`20
`
`The invention finds utility in numerous voice pickup applications,
`including but not limited to ear level pickups, telephony and dual microphone
`directional pickup systems.
`
`- 6 -
`
`
`
`WO 00/21194
`
`PCT /US99/23234
`
`-7-
`
`BRIEF DESCRIPTION OF THE DRAWINGS
`
`Many advantages of the present invention will be apparent to those skilled
`in the art with a reading of this specification in conjunction with the attached
`drawings, wherein like reference numerals are applied to like elements and
`wherein:
`
`5
`
`FIG. 1 is a schematic diagram of a dual-sensor device in accordance with
`the invention;
`
`FIG. 2 is a graphical illustration of the signal output of the transducers in
`one embodiment of the device of FIG. 1;
`
`10
`
`FIG. 3 is a schematic diagram of a preferred embodiment of a dual-sensor
`device in accordance with the invention;
`
`FIG. 4 is a graphical illustration of the signal output of the transducers in
`another embodiment of the device of FIG. 3;
`
`15
`
`FIG. 5 is a schematic illustration showing an embodiment in accordance
`with the invention in which selection between the accelerometer and microphone
`outputs is based on the detected envelopes of the signals;
`
`FIG. 6 is a schematic illustration in which phase correlation of the
`microphone and accelerometer signals is effected in accordance with the
`invention;
`
`- 7 -
`
`
`
`WO 00/21194
`
`PCT/US99/23234
`
`-8-
`
`FIGS. 7 and 8 are graphical illustrations of the use of signal compression
`and expansion contemplated by exemplary embodiments in accordance with the
`invention;
`
`5
`
`FIG. 9 is a schematic illustration of an embodiment in accordance with
`the invention in which the detected signals are processed by frequency band
`components;
`
`FIG. 10 is a schematic illustration of an embodiment in accordance with
`the invention in which variable filtering is used; and
`
`FIG. 11 is a graphical representation of the operation of the circuit of
`FIG. 10.
`
`10
`
`DESCRIPTION OF THE PREFERRED EMBODIMENTS
`
`FIG. 1 shows a schematic implementation of a dual-sensor device 10 in
`accordance with the invention. The device 10 uses an acoustic sound transducer,
`such as a microphone 20, and an accelerometer 30 whose outputs are selectively
`manipulated by a processing circuit 16 to compensate for obfuscation of the
`talker's voice due to the presence of ambient acoustic noise. The device is held
`against the skin of the talker's skull and may take the form of an ear level pickup
`worn behind the ear, or a telephone handset held against the ear during
`telephonic communication. Other possibilities include placement in a hat band or
`a face mask worn with, for example, a firefighter's helmet.
`
`15
`
`20
`
`- 8 -
`
`
`
`WO 00/21194
`
`PCT/US99/23234
`
`-9-
`
`Microphone 20 may be any conventional acoustic sound transducer such
`as, e.g., a standard microphone, directional microphone, omni-directional
`microphone, or pressure difference/gradient microphone or other microphone.
`Accelerometer 30 is a transducer device which senses the vibratory displacement
`of the bones in proximity to the ear as imparted primarily by vibrations during
`speech. The output of accelerometer 30 is a second derivative representation of
`bone displacement exhibiting, in the frequency domain, a 12 dB/octave rise
`which matches the approximately 12 dB/octave roll-off of bone conduction
`speech signals at the ear. It is contemplated that vibration sensors other than
`accelerometers may be used, and accordingly, use of the latter is not intended to
`be limiting. For instance, a velocity sensor exhibiting a first derivative rise in
`the frequency domain of 6 dB/octave can be used preferably with an electronic 6
`dB/octave frequency equalization, or similarly, a displacement sensor, may each
`be used in lieu of accelerometer 30 and also fall within the purview of the
`invention.
`
`Signals from the transducers 20 and 30 are fed through corresponding
`equalization circuits 22 and 32, which operate to adjust the frequency response of
`the transducer signals and provide a flatter frequency response. Equalization
`circuits 22, 32 can be designed to provide greater gain at higher frequencies than
`gain at lower frequencies to thereby decrease the "basiness" of the response,
`especially of accelerometer 30. It is also known that the bony structure of the
`head imparts certain resonances to the signal, so that the equalization is also
`intended to correct for those changes in the signal as well.
`
`5
`
`10
`
`15
`
`20
`
`Variable or adjustable gain pre-amplifiers 24 and 34 are connected to the
`equalization circuits 22, 32. These pre-amplifiers, along with faders or
`
`25
`
`- 9 -
`
`
`
`WO 00/21194
`
`PCT/US99/23234
`
`-10-
`
`amplifiers 26 and 36, serve to control the gain of the signals representing the
`transducer responses. An adder circuit 38 combines the signals from microphone
`20 and accelerometer 30 and provides the output signal 40 as the output signal of
`the pickup device through variable gain amplifier 46.
`
`5
`
`10
`
`15
`
`20
`
`25
`
`In operation, the output from the pre-amplifier 24 is applied to a control
`circuit 44 comprising for example an amplitude detector which senses the average
`amplitude of the signal from the microphone 20. When this average amplitude
`reaches or exceeds a predetermined threshold with a rise in ambient noise levels,
`the control circuit 44 commences a fading effect, decreasing the gain of amplifier
`26 while increasing that of amplifier 36. In this manner, the bone conduction
`component of the audio signal, comprising primarily the talker's speech, is
`amplified to a greater extent than the ambient noise signal at high ambient noise
`levels. At low ambient noise levels, the microphone signal is permitted to
`dominate as the output signal. The microphone singal under lower ambient noise
`levels provides a qualitatively better signal despite a measure of equalization
`applied to the accelerometer signal by equalization circuit 32 to improve the
`quality thereof. Typically, the value of the threshold will depend on the
`particular application and may vary anywhere in the range of and beyond 85 -
`100 dB SPL (sound pressure level--at 110 dB SPL ambient noise levels, a person
`is unable to hear his or her own voice), while the transition region is selected to
`span around 20 dB, although this value is just exemplary, with the actual value
`being selected based on the particular application. Beyond the cross-over
`threshold, the signal from the accelerometer 30 will predominate, exceeding the
`microphone signal by about 20-30 dB, while preceding the threshold, the
`microphone signal predominates. A graphical representation of the system signal
`output is depicted in FIG. 2, which shows the microphone signal dominating the
`
`- 10 -
`
`
`
`WO 00/21194
`
`PCT/US99/23234
`
`-11-
`
`accelerometer signal below a threshold level (shown by the dashed line), while
`the accelerometer signal dominates above the threshold. The transition region is
`the region where the transition between these two states occurs. Peak detector 42
`is provided to maintain the level of the overall signal constant, operating to adjust
`the gains of pre-amplifiers 24 and 34 in accordance with the output 40.
`
`In the preferred embodiment in accordance with FIG. 3, a dual-sensor
`device 50 is shown in which gain control is effected only on the output signal of
`one of the transducers--the microphone 70. Processing circuit 56 uses existing
`sound processing circuitry, depicted within the dashed region 60, in conjunction
`with accelerometer 80, equalization circuit 82, pre-amplifier 84, and adder 76 to
`provide the enhanced voice pickup device of the invention. Circuit 60 comprises
`equalization circuit 72, AGC (automatic gain control) amplifier circuit 74 having
`a specific control circuit (not shown) for controlling the gain thereof, and a
`bandsplit filter device 78 per known multi-band full dynamic range compression
`sound processing technology used in, e.g., hearing aid technology and ideally
`suited for operation at low voltage and current requirements.
`
`The dual-sensor 50 operates as follows. The output of microphone 70 is
`provided to equalization circuit 72 which again serves to flatten the frequency
`response for improved quality. The signal is then fed to AGC circuit 74, which
`is designed to have a gain limiting threshold at around 85 dB SPL. Circuit 74
`operates as a linear amplifier, limiting amplification beyond the selected
`threshold, which in this implementation is 85 dB SPL such that the output signal
`simply will not become louder with an increase of the input signal. It is to be
`noted that limiting is different from clipping in that clipping, which eliminates the
`peaks of the signals above a predetermined threshold, consequently introduces
`
`5
`
`10
`
`15
`
`20
`
`25
`
`- 11 -
`
`
`
`WO 00/21194
`
`PCT /US99/23234
`
`-12-
`
`undesirable distortions of the sound signal. Signal limiters are well known in the
`art.
`
`By contrast, the gain of pre-amplifier circuit 84 is not limited, and as
`ambient noise levels increase, the accelerometer signal, unlike the microphone
`signal, continues to increase. Although initially set at a lower level than that of
`the microphone signal, the accelerometer signal eventually surpasses that of the
`microphone when the ambient noise level reaches a sufficient magnitude because
`the talker naturally raises his voice in order to hear himself. Adder 76 operates
`to combine the signals, with output from the microphone predominating at the
`low ambient noise level and the output from the accelerometer predominating at
`the high ambient noise level. Again, the accelerometer is exemplary and can be
`any vibration sensor.
`
`The characteristic signal output is graphically represented in FIG. 4. As
`in the previous embodiment, the microphone signal output dominates at sound
`levels below the cross-over threshold, while the accelerometer signal dominates
`at levels above the transition threshold.
`
`The dual-sensor 50 is designed with readily available circuitry and off-
`the-shelf components. It has minimal voltage and current consumption
`characteristics and is well-suited for miniaturized, in-the-ear applications as
`contemplated for ear level pickup devices. Inherent advantages afforded thereby
`include reduced cost and simplicity of design and assembly, as well as enhanced
`performance provided by the unique application of the two transducers. Also, as
`noise increases, the ratio of accelerometer signal to microphone signal improves
`
`5
`
`10
`
`15
`
`20
`
`- 12 -
`
`
`
`WO 00/21194
`
`PCT /US99/23234
`
`13-
`
`i.e., noise output becomes progressively a smaller and smaller portion of output
`giving a progressively better signal-to-noise ratio.
`
`FIG. 5 shows an embodiment in accordance with the invention in which
`selection between the accelerometer and microphone outputs is based on the
`detected envelopes of the signals. Signals from microphone 92 are provided to
`pre-amplifier 93 and envelope detector 94, while signals from accelerometer 96
`are provided to pre-amplifier 97 and envelope detector 98. A switching circuit
`It should be noted that while the
`95 selects the more variable of these signals.
`noise envelope is nearly constant, the voice envelope is greatly time-varying as
`noise increases. Consequently, the microphone signal will vary less but the
`accelerometer signal will continue to vary. In lower noise, microphone signal
`will vary more than accelerometer signal due to low-pass nature of bone
`conduction. The envelope detectors 94 and 98 can have short (-10 ms) time
`constants for selection of individual syllables, or long (-1 s) so gradual selection
`is made as background noise varies. Additionally, the switching circuit 95 can
`be replaced preferably with a fader circuit to permit gradual rather than abrupt
`transitions.
`
`FIG. 6 alternatively shows phase correlation of the microphone and
`accelerometer signals in accordance with the invention. Signals from the
`microphone 100 and accelerometer 130 are applied to correlator 120 via pre-
`amplifiers 110 and 140, respectively, after the phase difference of the
`accelerometer signal is corrected in phase difference correction circuit 150. By
`thus correlating the phase corrected signals, either on an average energy basis,
`instantaneous signal level basis or in individual multi-frequency bands (via Fast
`Fourier Transformation implemented by digital signal processing, for example),
`
`5
`
`10
`
`15
`
`20
`
`25
`
`- 13 -
`
`
`
`WO 00/21194
`
`PCT/US99/23234
`
`-14-
`
`the correlated components of the signals can be passed to the output, whereas the
`uncorrelated components can be determined to be noise and prevented from
`passing to the output. In other words, the determination can be made either on a
`time-basis only or on both a frequency and time-basis simultaneously.
`
`5
`
`10
`
`15
`
`20
`
`25
`
`In an alternative embodiment, AGC amplifier 74 and pre-amp 84 of FIG.
`3 can be configured to provide non-linear amplification, controlled by an
`appropriate control circuit, such that AGC amplifier 74 provides signal
`compression at a predetermined ratio, while pre-amp 84 provides signal
`expansion at a predetermined ratio. Various combinations of amplification
`characteristics relative to a transition threshold can be used. More particularly,
`as shown in FIG. 7, using the AGC amplifier 74, the microphone signal can be
`amplified at a linear, 1: 1 ratio below the transition threshold and compressed at a
`N:l ratio (N> 1, e.g., 2.5, 10, etc.) above the transition threshold, while the
`accelerometer signal, using pre-amplifier 84, is expanded at a 1 :M ratio (M > 1,
`e.g., 2 or 1.5, etc.) throughout the operational range. Alternatively, as shown in
`FIG. 8, the microphone signal can be amplified at a linear 1:1 ratio below the
`transition threshold and compressed at a N: 1 (N > 1) ratio above the transition
`threshold, while the accelerometer signal is expanded at a 1 :M ratio (M > 1)
`below the transition threshold and linearly amplified at a 1: 1 ratio above the
`threshold. The amplifiers are designed to produce equal output levels at the
`transition SPL level. Those skilled in the art will recognize that many different
`combinations of linear amplification, compression and expansion of the detected
`signals are possible and would depend on the particular application. Moreover,
`the transition decision, made by the control circuit, can be linked to any
`characteristic of one or both detected signals, including RMS values, peak
`values, average values, signal energy, correlation and variability, again
`
`- 14 -
`
`
`
`WO 00/21194
`
`PCT/US99/23234
`
`15
`
`depending on the particular application contemplated and the physical and
`economic constraints imposed.
`
`In another embodiment in accordance with the invention and
`schematically shown in FIG. 9, the signals from the microphone and vibration
`sensor can be split into high and low frequency bands and then faded separately.
`In FIG. 9, the signal from microphone 160 is amplified by amplifier 162 and then
`applied to bandsplit filter 164 serving to split the signal into a high frequency
`band and a low frequency band. Similarly, the signal from vibration sensor 166
`is amplified by amplifier 168 and then applied to bandsplit filter 170 serving to
`split the signal into high and low frequency bands. Bandsplit filters 164 and 170
`preferably have the same critical frequency t The signals are then respectively
`applied to high frequency and low frequency fader circuits 172 and 174, the
`outputs of which are summed in adder 176 and provided as the system output.
`Fader circuits 172 and 174 comprise any circuit discussed above designed to
`implement the fade over functions between the microphone and the vibration
`sensor.
`
`The circuit of FIG. 9 is well-suited to exploit the recognized phenomenon
`that most background acoustic noise--the drone of machinery, the roar of a
`blazing fire, etc.--lies at the low frequency end of the noise spectrum. By
`splitting the input signal into low and high frequency band components, fading is
`effected earlier--i.e., at a lower overall noise intensity level--for the low
`frequency band component than for the high frequency band component simply
`because the high frequency band contains less energy than the low frequency
`band. This maintains more of the high frequency intelligibility which passes
`through the acoustic path to the microphone 160 while still removing much of the
`
`5
`
`10
`
`15
`
`20
`
`25
`
`- 15 -
`
`
`
`WO 00/21194
`
`PCT/US99/23234
`
`16
`
`excess bass noise due to the earlier fade over in the low frequency band to the
`cleaner bone conduction signal of vibration sensor 166, which preferably
`comprises an accelerometer.
`
`The detected sound spectrum can be split into more than two bands, each
`of which is faded at a corresponding intensity level. Digital signal processing
`techniques readily lend themselves to such an implementation, with Fast Fourier
`Transform techniques being used in conjunction with suitable algorithms for
`effecting the fading when the signal intensity at a band exceeds a threshold
`intensity. The threshold intensity would be determined at the level at which the
`microphone 160 signal indicates there is so much noise that the voicing is no
`longer passing through, and the signal in that band would then be faded over to
`bone conduction from vibration sensor 166. The threshold level would be
`predetermined and device-specific, depending on the characteristics of the overall
`system design and the particular application for the device, but would generally
`be set at about 95 -100 dB SPL noise level.
`
`A particular advantage of such a multi-band processing system is realized
`when the interfering noise is at specific regions in the noise sound spectrum.
`Such a case occurs from the whine of an electric motor or the whistle of an air
`conditioning system for example. In such narrow band situations the noise is not
`dominated by low frequencies, and thus only the bands in which the narrow band
`interference occurs are affected. The digital signal processing in accordance with
`the inv