`
`Advancing Technology
`for Humanity
`
`DECLARATION OF GERARD P. GRENIER
`
`I, Gerard P. Grenier, am over twenty-one (21) years of age. I have never been convicted
`of a felony, and I am fully competent to make this declaration. I declare the following to be true
`to the best of my knowledge, information and belief:
`
`1. I am Senior Director of Content Management of The Institute of Electrical and
`Electronics Engineers, Incorporated ("IEEE").
`
`2. IEEE is a neutral third party in this dispute.
`
`3. Neither I nor IEEE itself is being compensated for this declaration.
`
`4. Among my responsibilities as Senior Director of Content Management, I act as a
`custodian of certain records for IEEE.
`
`5. I make this declaration based on my personal knowledge and information contained
`in the business records of IEEE.
`
`6. As part of its ordinary course of business, IEEE publishes and makes available
`technical articles and standards. These publications are made available for public
`download through the IEEE digital library, IEEE Xplore.
`
`7. It is the regular practice of IEEE to publish articles and other writings including
`article abstracts and make them available to the public through IEEE Xplore. IEEE
`maintains copies of the publications in the ordinary course of its regularly conducted
`activities.
`
`8. The article below has been attached as Exhibit A to this declaration:
`
`A. Jacek Dmochowski, et al, "Direction of Arrival Estimation Using the
`Parameterized Spatial Correlation Matrix", IEEE Transactions on Audio,
`Speech, and Language Processing, Vol. 5, Issue 4, April 23, 2007.
`
`9. I obtained a copy of Exhibit A through IEEE Xplore, where it is maintained in the
`ordinary course of IEEE's business. Exhibit A is a true and correct copy of the
`Exhibit, as it existed on or about May 19, 2020.
`
`IEEE
`10. The article and abstract from IEEE Xplore shows the date of publication.
`Xplore populates this information using the metadata associated with the publication.
`
`445 Hoes Lane Piscataway, NJ 08854
`
`IPR PETITION
`US RE48,371
`Sonos Ex. 1028
`
`
`
`11. Jacek Dmochowski, et al, "Direction of Arrival Estimation Using the Parameterized
`Spatial Correlation Matrix" was published in IEEE Transactions on Audio, Speech,
`and Language Processing, Vol. 5, Issue 4. IEEE Transactions on Audio, Speech, and
`Language Processing, Vol. 5, Issue 4 was published on April 23, 2007. The article is
`currently available for public download from the IEEE digital library, IEEE Xplore.
`
`12. I hereby declare that all statements made herein of my own knowledge are true and
`that all statements made on information and belief are believed to be true, and further
`that these statements were made with the knowledge that willful false statements and
`the like are punishable by fine or imprisonment, or both, under 18 U.S.C. § 1001.
`
`I declare under penalty of perjury that the foregoing statements ar
`Executed on: :)D-l}1a7-- i}DJO
`
`
`
`
`
`
`
`
`
`EXHIBIT A
`EXHIBIT A
`
`
`
`IEEE.org
`
`
`
`IEEE Xplore
`
`
`
`IEEE-SA
`
`
`
`IEEE Spectrum
`
`
`
`More Sites
`
`SUBSCRIBE
`
`SUBSCRIBE
`Personal Sign In
`
`Cart
`
`
`
`Create Account
`
`
`
`Browse My Settings Help
`
`Institutional Sign In
`
`Institutional Sign In
`
`All
`
`
`
`
`
`ADVANCED SEARCH
`
`Journals & Magazines > IEEE Transactions on Audio, S... > Volume: 15 Issue: 4
`
`Direction of Arrival Estimation Using the Parameterized Spatial
`Correlation Matrix
`Publisher: IEEE
`
`Cite This
`
`
`
`Cite This
`
`
`Alerts
`
`Manage
`Content Alerts
`
`Add to
`Citation Alerts
`
` << Results | Next >
`
`3 Author(s)
`
`Jacek Dmochowski ; Jacob Benesty ; Sofine Affes All Authors
`
`1825
`Full
`Text Views
`
`1P
`
`atent
`Citation
`
`46
`Paper
`Citations
`
`Abstract
`
`Document Sections
`
`I.
`
`Introduction
`
`II. Signal Model
`
`III. Parameterized
`Spatial
`Correlation Matrix
`
`IV. Broadband
`Spatial Spectral
`Estimators
`
`V. Simulation
`Evaluation
`
`Authors
`
`Figures
`
`References
`
`Abstract: The estimation of the direction-of-arrival (DOA) of one or more acoustic
`sources is an area that has generated much interest in recent years, with applications
`like autom... View more
`
` Metadata
`Abstract:
`The estimation of the direction-of-arrival (DOA) of one or more acoustic sources is an
`area that has generated much interest in recent years, with applications like automatic
`video camera steering and multiparty stereophonic teleconferencing entering the
`market. DOA estimation algorithms are hindered by the effects of background noise and
`reverberation. Methods based on the time-differences-of-arrival (TDOA) are commonly
`used to determine the azimuth angle of arrival of an acoustic source. TDOA-based
`methods compute each relative delay using only two microphones, even though
`additional microphones are usually available. This paper deals with DOA estimation
`based on spatial spectral estimation, and establishes the parameterized spatial
`correlation matrix as the framework for this class of DOA estimators. This matrix jointly
`takes into account all pairs of microphones, and is at the heart of several broadband
`spatial spectral estimators, including steered-response power (SRP) algorithms. This
`paper reviews and evaluates these broadband spatial spectral estimators, comparing
`
` Export to
`
`Collabratec
`
`Downl
`
` Back to Results | Next >
`
`More Like This
`
`Direction of Arrival Estimation Using
`Microphone Array Processing for Moving
`Humanoid Robots
`IEEE/ACM Transactions on Audio, Speech,
`and Language Processing
`Published: 2015
`
`Direction of Arrival Estimation of
`Reflections from Room Impulse Responses
`Using a Spherical Microphone Array
`IEEE/ACM Transactions on Audio, Speech,
`and Language Processing
`Published: 2015
`
`Show More
`
`Top Organizations with Patents
`on Technologies Mentioned in
`This Article
`
`
`
`
`
`Citations
`
`Keywords
`
`Metrics
`
`More Like This
`
`their performance to TDOA-based locators. In addition, an eigenanalysis of the
`parameterized spatial correlation matrix is performed and reveals that such analysis
`allows one to estimate the channel attenuation from factors such as uncalibrated
`microphones. This estimate generalizes the broadband minimum variance spatial
`spectral estimator to more general signal models. A DOA estimator based on the
`multichannel cross correlation coefficient (MCCC) is also proposed. The performance of
`all proposed algorithms is included in the evaluation. It is shown that adding extra
`microphones helps combat the effects of background noise and reverberation.
`Furthermore, the link between accurate spatial spectral estimation and corresponding
`DOA estimation is investigated. The application of the minimum variance and MCCC
`methods to the spatial spectral estimation problem leads to better resolution than that of
`the ...
`
`(View more)
`
`Published in: IEEE Transactions on Audio, Speech, and Language Processing (
`Volume: 15 , Issue: 4 , May 2007 )
`
`Page(s): 1327 - 1339
`
`INSPEC Accession Number: 9413967
`
`Date of Publication: 23 April 2007
`
`DOI: 10.1109/TASL.2006.889795
`
` ISSN Information:
`
`Publisher: IEEE
`
` Contents
`
`I. Introduction
`Propagating signals contain much information about the sources that
`emit them. Indeed, the location of a signal source is of much interest in
`many applications, and there exists a large and increasing need to
`locate and track sound sources. For example, a signal-enhancing
`
`beamformer [1], [2] must continuously monitor the position of the desiredSign in to Continue Reading
`signal source in order to provide the desired directivity and interference
`suppression. This paper is concerned with estimating the direction-of-
`arrival (DOA) of acoustic sources in the presence of significant levels of
`both noise and reverberation.
`
`Authors
`
`Figures
`
`References
`
`Citations
`
`Keywords
`
`Metrics
`
`
`
`
`
`
`
`
`
`
`
`
`
`IEEE Personal Account
`
`Purchase Details
`
`Profile Information
`
`Need Help?
`
`CHANGE USERNAME/PASSWORD
`
`PAYMENT OPTIONS
`
`COMMUNICATIONS PREFERENCES
`
`US & CANADA: +1 800 678 4333
`
`VIEW PURCHASED DOCUMENTS
`
`PROFESSION AND EDUCATION
`
`WORLDWIDE: +1 732 981 0060
`
`TECHNICAL INTERESTS
`
`CONTACT & SUPPORT
`
`Follow
`
`
`
`About IEEE Xplore | Contact Us | Help | Accessibility | Terms of Use | Nondiscrimination Policy | Sitemap | Privacy & Opting Out of Cookies
`A not-for-profit organization, IEEE is the world's largest technical professional organization dedicated to advancing technology for the benefit of humanity.
`
`
`
`© Copyright 2020 IEEE - All rights reserved. Use of this web site signifies your agreement to the terms and conditions.
`
`IEEE Account
`
` Purchase Details
`
` Profile Information
`
` Need Help?
`
`» Change Username/Password
`» Update Address
`
`» Payment Options
`» Order History
`» View Purchased Documents
`
`» Communications Preferences
`» Profession and Education
`» Technical Interests
`
`» US & Canada: +1 800 678 4333
`» Worldwide: +1 732 981 0060
`» Contact & Support
`
`
` About IEEE Xplore Contact Us
`|
`
`
`|
`
`Help
`
`
`|
`
`Accessibility
`
`
`|
`
`Terms of Use
`
`
`|
`
`Nondiscrimination Policy
`
`
`|
`
`Sitemap
`
`
`|
`
`Privacy & Opting Out of Cookies
`
`A not-for-profit organization, IEEE is the world's largest technical professional organization dedicated to advancing technology for the benefit of humanity.
`© Copyright 2020 IEEE - All rights reserved. Use of this web site signifies your agreement to the terms and conditions.
`
`
`
`IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 15, NO. 4, MAY 2007
`
`1327
`
`Direction of Arrival Estimation Using the
`Parameterized Spatial Correlation Matrix
`
`Jacek Dmochowski, Jacob Benesty, Senior Member, IEEE, and Sofiène Affes, Senior Member, IEEE
`
`Abstract—The estimation of the direction-of-arrival (DOA) of
`one or more acoustic sources is an area that has generated much
`interest in recent years, with applications like automatic video
`camera steering and multiparty stereophonic teleconferencing
`entering the market. DOA estimation algorithms are hindered by
`the effects of background noise and reverberation. Methods based
`on the time-differences-of-arrival (TDOA) are commonly used
`to determine the azimuth angle of arrival of an acoustic source.
`TDOA-based methods compute each relative delay using only two
`microphones, even though additional microphones are usually
`available. This paper deals with DOA estimation based on spatial
`spectral estimation, and establishes the parameterized spatial cor-
`relation matrix as the framework for this class of DOA estimators.
`This matrix jointly takes into account all pairs of microphones,
`and is at the heart of several broadband spatial spectral estima-
`tors, including steered-response power (SRP) algorithms. This
`paper reviews and evaluates these broadband spatial spectral esti-
`mators, comparing their performance to TDOA-based locators. In
`addition, an eigenanalysis of the parameterized spatial correlation
`matrix is performed and reveals that such analysis allows one to
`estimate the channel attenuation from factors such as uncalibrated
`microphones. This estimate generalizes the broadband minimum
`variance spatial spectral estimator to more general signal models.
`A DOA estimator based on the multichannel cross correlation
`coefficient (MCCC) is also proposed. The performance of all
`proposed algorithms is included in the evaluation. It is shown that
`adding extra microphones helps combat the effects of background
`noise and reverberation. Furthermore, the link between accurate
`spatial spectral estimation and corresponding DOA estimation
`is investigated. The application of the minimum variance and
`MCCC methods to the spatial spectral estimation problem leads
`to better resolution than that of the commonly used fixed-weighted
`SRP spectrum. However, this increased spatial spectral resolution
`does not always translate to more accurate DOA estimation.
`
`Index Terms—Circular arrays, delay-and-sum beamforming
`(DSB), direction-of-arrival (DOA) estimation, linear spatial predic-
`tion, microphone arrays, multichannel cross correlation coefficient
`(MCCC), spatial correlation matrix, time delay estimation.
`
`I. INTRODUCTION
`
`P ROPAGATING signals contain much information about
`
`the sources that emit them. Indeed, the location of a signal
`source is of much interest in many applications, and there exists
`a large and increasing need to locate and track sound sources.
`
`Manuscript received September 6, 2006; revised November 8, 2006. The as-
`sociate editor coordinating the review of this manuscript and approving it for
`publication was Dr. Hiroshi Sawada.
`The authors are with the Institut National de la Recherche Scientifique-
`Énergie, Matériaux, et Télécommunications (INRS-EMT), Université du
`Québec, Montréal, QC H5A 1K6, Canada (e-mail: dmochow@emt.inrs.ca).
`Digital Object Identifier 10.1109/TASL.2006.889795
`
`For example, a signal-enhancing beamformer [1], [2] must con-
`tinuously monitor the position of the desired signal source in
`order to provide the desired directivity and interference sup-
`pression. This paper is concerned with estimating the direc-
`tion-of-arrival (DOA) of acoustic sources in the presence of sig-
`nificant levels of both noise and reverberation.
`The two major classes of broadband DOA estimation
`techniques are those based on the time-differences-of-arrival
`(TDOA) and spatial spectral estimators. The latter terminology
`arises from the fact that spatial frequency corresponds to the
`wavenumber vector, whose direction is that of the propagating
`signal. Therefore, by looking for peaks in the spatial spectrum,
`one is determining the DOAs of the dominant signal sources.
`The TDOA approach is based on the relationship between
`DOA and relative delays across the array. The problem of es-
`timating these relative delays is termed “time delay estimation”
`[3]. The generalized cross-correlation (GCC) approach of [4],
`[5] is the most popular time delay estimation technique. Alter-
`native methods of estimating the TDOA include phase regres-
`sion [6] and linear prediction preprocessing [7]. The resulting
`relative delays are then mapped to the DOA by an appropriate
`inverse function that takes into account array geometry.
`Even though multiple-microphone arrays are commonplace
`in time delay estimation algorithms, there has not emerged a
`clearly preferred way of combining the various measurements
`from multiple microphones. Notice that in the TDOA approach,
`the time delays are estimated using only two microphones at a
`time, even though one usually has several more sensor outputs at
`one’s disposal. The averaging of measurements from indepen-
`dent pairs of microphones is not an optimal way of combining
`the measurements, as each computed time delay is derived from
`only two microphones, and thus often contains significant levels
`of corrupting noise and interference. It is thus well known that
`current TDOA-based DOA estimation algorithms are plagued
`by the effects of both noise and especially reverberation.
`To that end, Griebel and Brandstein [8] map all “realizable”
`combinations of microphone-pair delays to the corresponding
`source locations, and maximize simultaneously the sum (across
`various microphone pairs) of cross-correlations across all pos-
`sible locations. This approach is notable, as it jointly maximizes
`the results of the cross-correlations between the various micro-
`phone pairs.
`The spatial spectral estimation problem is well defined in the
`narrowband signal community. There are three major methods:
`the steered conventional beamformer approach (also termed
`the “Bartlett” estimate), the minimum variance estimator (also
`termed the “Capon” or maximum-likelihood estimator), and
`the linear spatial predictive spectral estimator. Reference [9]
`
`1558-7916/$25.00 © 2007 IEEE
`
`Authorized licensed use limited to: IEEE Publications Operations Staff. Downloaded on May 19,2020 at 19:37:21 UTC from IEEE Xplore. Restrictions apply.
`
`
`
`1328
`
`IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 15, NO. 4, MAY 2007
`
`provides an excellent overview of these approaches. These
`three approaches are unified in their use of the narrowband
`spatial correlation matrix, as outlined in the next section.
`The situation is more scattered in the broadband signal
`case. Various spectral estimators have been proposed, but there
`does not exist any common framework for organizing these
`approaches. The steered conventional beamformer approach
`applies to broadband signals. The delay-and-sum beamformer
`(DSB) is steered to all possible DOAs to determine the DOA
`which emits the most energy. An alternative formulation of
`this approach is termed the “steered-response power” (SRP)
`method, which exploits the fact that the DSB output power may
`be written as a sum of cross-correlations. The computational
`requirements of the SRP method are a hindrance to practical
`implementation [8]. A detailed treatment of steered-beam-
`former approaches to source localization is given in [10], and
`the statistical optimality of the approach is shown in [11]–[13].
`Krolik and Swingler develop a broadband minimum variance
`estimator based on the steered conventional beamformer [14],
`which may be viewed as an adaptive weighted SRP algorithm.
`There have also been approaches that generalize narrowband
`localization algorithms (i.e., MUSIC [15]) to broadband sig-
`nals through subband processing and subsequent combining
`(see [16], for example). A broadband linear spatial predictive
`approach to time delay estimation is outlined in [17] and [18].
`This approach, which is limited to linear array geometries,
`makes use of all the channels in a joint fashion via the time
`delay parameterized spatial correlation matrix.
`This paper attempts to unify broadband spatial spectral esti-
`mators into a single framework and compares their performance
`from a DOA estimation standpoint to TDOA-based algorithms.
`This unified framework is the azimuth parameterized spatial
`correlation matrix, which is at the heart of all broadband spa-
`tial spectral estimators.
`In addition, several new ideas are presented. First, due to
`the parametrization, well-known narrowband array processing
`notions [19] are applied to the DOA estimation problem, gen-
`eralizing these ideas to the broadband case. A DOA estimator
`based on the eigenanalysis of the parameterized spatial corre-
`lation matrix ensues. More importantly, it is shown that this
`eigenanalysis allows one to estimate the channel attenuation
`from factors such as uncalibrated microphones. The existing
`minimum variance approach to broadband spatial spectral esti-
`mation is reformulated in the context of a more general signal
`model which accounts for such attenuation factors. Further-
`more, the ideas of [17] and [18] are extended to more general
`array geometries (i.e., circular) via the azimuth parameterized
`spatial correlation matrix, resulting in a minimum entropy DOA
`estimator.
`Circular arrays (see [20]–[22], for example) offer some ad-
`vantages over their linear counterparts. A circular array provides
`spatial discrimination over the entire 360 azimuth range, which
`is particularly important for applications that require front-to-
`back signal enhancement, such as teleconferencing. Further-
`more, a circular array geometry allows for more compact de-
`signs. While the contents of this paper apply generally to planar
`array geometries, the circular geometry is used throughout the
`simulation portion.
`
`Fig. 1. Circular array geometry.
`
`Section II presents the signal propagation model in planar
`(i.e., circular) arrays and serves as the foundation for the re-
`mainder of the paper. Section III reviews the role of the tradi-
`tional, nonparameterized spatial correlation matrix in narrow-
`band DOA estimation, and shows how the parameterized ver-
`sion of the spatial correlation matrix allows for generalization
`to broadband signals. Section IV describes the existing and pro-
`posed broadband spatial spectral estimators in terms of the pa-
`rameterized spatial correlation matrix. Section V outlines the
`simulation model employed throughout this paper and evaluates
`the performance of all spatial spectral estimators and TDOA-
`based methods in both reverberation- and noise-limited envi-
`ronments. Concluding statements are given in Section VI.
`The spatial spectral estimation approach to DOA estimation
`has limitations in certain reverberant environments. If an inter-
`fering signal or reflection arrives at the array with a higher en-
`ergy than the direct-path signal, the DOA estimate will be false,
`even though the spatial spectral estimate is accurate. Such situ-
`ations arise when the source is oriented towards a reflective bar-
`rier and away from the array. This problem is beyond the scope
`of this paper and is not addressed herein. Rather, the focus of
`this paper is on the evaluation of spatial spectral estimators in
`noisy and reverberant environments and on their application to
`DOA estimation.
`
`II. SIGNAL MODEL
`
`elements in a 2-D geom-
`Assume a planar array of
`etry, shown in Fig. 1 (i.e., circular geometry), whose outputs
`,
`, where
`is the time index.
`are denoted by
`Denoting the azimuth angle of arrival by , propagation of the
`signal from a far-field source to microphone is modeled as:
`
`(1)
`
`, are the attenuation factors due to
`,
`where
`channel effects,
`is the propagation time, in samples, from the
`to microphone 0,
`is an additive noise
`unknown source
`signal at the th microphone, and
`, is the
`
`,
`
`Authorized licensed use limited to: IEEE Publications Operations Staff. Downloaded on May 19,2020 at 19:37:21 UTC from IEEE Xplore. Restrictions apply.
`
`
`
`DMOCHOWSKI et al.: DIRECTION OF ARRIVAL ESTIMATION USING THE PARAMETERIZED SPATIAL CORRELATION MATRIX
`
`1329
`
`relative delay between microphones 0 and . In matrix form, the
`array signal model becomes:
`
`...
`
`...
`
`...
`
`. . .
`
`...
`
`. . .
`. . .
`
`...
`
`...
`
`although presented in far-field planar context, easily generalize
`to the near-field spherical case by including the range and ele-
`vation in the forthcoming parametrization.
`
`III. PARAMETERIZED SPATIAL CORRELATION MATRIX
`In narrowband signal applications, a common space-time
`statistic is that of the spatial correlation matrix [19], which is
`given by
`
`(2)
`
`where
`
`(5)
`
`(6)
`
`The function
`relates the angle of arrival to the relative delays
`between microphone elements 0 and , and is derived for the case
`of an equispaced circular array in the following manner. When
`operating in the far-field, the time delay between microphone
`and the center of the array is given by [23]
`
`where the azimuth angle (relative to the selected angle refer-
`,
`ence) of the th microphone is denoted by
`,
`denotes the array radius, and is the speed of signal
`propagation. It easily follows that
`
`(3)
`
`(4)
`
`may
`It is also worth mentioning that the additive noise
`. In that
`be temporally correlated with the desired signal
`case, a reverberant environment is modeled. The anechoic en-
`vironment is modeled by making the additive noise temporally
`uncorrelated with the source signal. In either case, the additive
`noise may be spatially correlated across the sensors.
`It should also be stated that the signal model presented above
`makes use of the far-field assumption, in that the incoming wave
`is assumed to be planar, such that all sensors perceive the same
`DOA. An error is incurred if the signal source is actually lo-
`cated in the near-field; in that case, the relative delays are also
`a function of the range. In the most general case (i.e., a source
`takes three
`in the near-field of a 3-D geometry), the function
`parameters: the azimuth, range, and elevation. This paper fo-
`cuses on a specific subset of this general model: a source located
`in the far-field with only a slight elevation, such that a single
`parameter suffices. This is commonly the case in a teleconfer-
`encing environment. Nevertheless, the concepts of this paper,
`
`the superscript
`denotes conjugate transpose, as complex sig-
`de-
`nals are commonly used in narrowband applications, and
`notes the transpose of a matrix or vector. To steer these array
`outputs to a particular DOA, one applies a complex weight to
`each sensor output, whose phase performs the steering, and then
`sums the sensor outputs to form the output beam. Now, if the
`input signal is no longer narrowband, each frequency requires
`its own complex weight to appropriately phase-shift the signal
`at that frequency. In the context of broadband spatial spectral
`estimation, the spatial correlation matrix may be computed at
`each temporal frequency, and the resulting spatial spectrum is
`now a function of the temporal frequency. For broadband appli-
`cations, these narrowband estimates may be assimilated into a
`time-domain statistic, a procedure termed “focusing,” which is
`described in [24]. The resulting structure is termed a “focused
`covariance matrix.”
`In this paper, broadband spatial spectral estimation is
`addressed in another manner. Instead of implementing the
`steering delays in the complex weighting at each sensor, the
`delays are actually implemented as a time-delay in the spatial
`correlation matrix, which is now parameterized. Thus, each
`microphone output is appropriately delayed before computing
`this parameterized spatial correlation matrix:
`
`(7)
`
`and real signals are assumed from this point on. The delays are
`a function of the assumed azimuth DOA, which becomes the
`parameter. The parameterized spatial correlation matrix is for-
`mally written as shown by (8) and (9) at the bottom of the page.
`is not simply the array observation matrix, as is
`The matrix
`commonly used in narrowband beamforming models. Instead, it
`is a parameterized correlation matrix that represents the signal
`
`...
`
`...
`
`. . .
`
`...
`
`(8)
`
`(9)
`
`Authorized licensed use limited to: IEEE Publications Operations Staff. Downloaded on May 19,2020 at 19:37:21 UTC from IEEE Xplore. Restrictions apply.
`
`
`
`1330
`
`IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 15, NO. 4, MAY 2007
`
`powers across the array emanating from azimuth . Each off-di-
`is a single cross-correlation term
`agonal entry in the matrix
`. Notice that the various
`and a function of the azimuth angle
`microphone pairs are combined in a joint fashion, in that altering
`affects all off-diagonal entries of
`. This
`the steering angle
`property allows for the more prudent combining of microphone
`measurements as compared to the ad hoc method of averaging
`independent pairs of cross-correlation results.
`This paper relates broadband spatial spectral estimators in
`terms of the parameterized spatial correlation matrix
`
`], by an amount
`[or advanced, depending on the sign of
`.
`that takes into account the array geometry, via the function
`The estimate of the spatial spectral power at azimuth angle
`is given by the power of the beamformer output when steered to
`azimuth . Therefore, to form the entire spectrum, one needs to
`steer the beam and compute the output power across the entire
`azimuth space.
`The steered-beamformer spectral estimate is given by
`
`Substitution of (12) into (13) leads to
`
`(10)
`
`Expression (14) may be written more neatly in matrix notation
`as
`
`(13)
`
`(14)
`
`(15)
`
`(16)
`
`(17)
`
`where
`is the steered azimuth
`is some estimation function,
`is the estimate of the broadband spatial spec-
`angle, and
`trum at azimuth angle
`.
`The DOA estimate follows directly from the spatial spectrum,
`in that peaks in the spectrum correspond to assumed source
`locations. For the case of a single source, which is the case
`throughout this paper, the estimate of the source’s DOA is given
`by
`
`where
`is the DOA estimate.
`Note that this broadband extension is not without caveats:
`care must be taken when spacing the microphones to ensure that
`spatial aliasing [2] does not result.
`It is also important to point out that the GCC method is quite
`compatible with DOA estimation based on the parameterized
`spatial correlation matrix—the cross-correlation estimates that
`comprise the matrix may be computed in the frequency-domain
`using a GCC variant such as the phase transform (PHAT) [4].
`This paper focuses on how to extract the DOA estimate from the
`parameterized spatial correlation matrix; the ideas presented are
`general in that they do not hinge on any particular method for
`computing the actual cross correlations.
`
`IV. BROADBAND SPATIAL SPECTRAL ESTIMATORS
`The following subsections detail the existing and proposed
`broadband spatial spectral estimation methods, relating each to
`the parameterized spatial correlation matrix.
`
`A. Steered Conventional Beamforming and the SRP Algorithm
`The aim of a DSB is to time-align the received signals in
`the array aperture, such that the desired signal is coherently
`summed, while signals from other directions are incoherently
`summed and thus attenuated. Using the model of Section II, the
`is given as
`output of a DSB steered to an angle of arrival of
`
`(12)
`
`The delays
`steer the beamformer to the desired DOA,
`help shape the beam accord-
`while the beamformer weights
`ingly. The weights here have been made dependent on the de-
`, for a reason that will become apparent
`sired angle of arrival
`in future subsections. In (12), the received signals are delayed
`
`(11)
`
`where
`
`The DOA estimate is thus given by
`
`The maximization of a steered beamformer output power is
`equivalent to maximizing a quadratic of the beamformer weight
`vector with respect to the angle of arrival. Altering the angle
`affects the parameter in the quadratic form, namely, the param-
`eterized spatial correlation matrix.
`The well-known SRP algorithm [10] follows directly from a
`for all
`, and
`is a vector
`special case of (17), where
`of
`ones:
`
`(18)
`
`For this special case of fixed unit weights, this means that the
`maximization of the power of a steered DSB is equivalent to the
`.
`maximization of the sum of the entries of
`The SRP algorithm has garnered significant attention re-
`cently: see [10], [25], and [26]. In all of these implementations,
`is used, which is fixed with respect
`the weighting of
`to both the data and the steering angle. Given the well-known
`classical results on the advantages of adaptive beamforming
`over fixed beamforming, it is therefore surprising that adaptive
`weighting schemes have not been investigated more in the
`context of DOA estimation based on the parameterized spatial
`correlation matrix (A fixed weighting scheme is proposed in
`[27]). Notice that from (15), this is an effectively “narrowband”
`weight selection, in that the pre-aligning of the microphones
`requires only the selection of a single weight per channel. Note,
`however, that this weight selection must be performed for all
`. To that end, the following section presents one such
`angles
`adaptive weighting scheme, proposed by Krolik [14].
`
`Authorized licensed use limited to: IEEE Publications Operations Staff. Downloaded on May 19,2020 at 19:37:21 UTC from IEEE Xplore. Restrictions apply.
`
`
`
`DMOCHOWSKI et al.: DIRECTION OF ARRIVAL ESTIMATION USING THE PARAMETERIZED SPATIAL CORRELATION MATRIX
`
`1331
`
`B. Minimum Variance
`The minimum variance approach to spatial spectral esti-
`mation involves selecting weights that pass a signal [i.e., a
`] propagating from azimuth with
`broadband plane wave
`unity gain, while minimizing the total output power, given by
`. The application of the minimum variance method
`to broadband spatial spectral estimation is given in [14].
`The unity gain constraint proposed by [14] is
`
`is apparent that the vector may be estimated from the eigenanal-
`.
`ysis of
`To that end, consider another adaptive weight selection
`method, which follows from the ideas of narrowband beam-
`forming [19]. This weight selection attempts to nontrivially
`maximize the output energy of the steered-beamformer for a
`given azimuth
`
`(19)
`
`subject to
`
`vector follows from the fact that the signal is already
`and the
`time-aligned across the array before minimum variance pro-
`cessing. It is as if the signal is coming from the broadside of
`a linear array.
`Using the method of Lagrange multipliers in conjunction with
`, the minimum variance weights
`the cost function
`become
`
`It is well known that the solution to the above constrained opti-
`mization is the vector that maximizes the Rayleigh quotient [2]
`, which is in turn given by the eigenvector
`. The resulting
`corresponding to the maximum eigenvalue of
`spatial spectral estimate is given by
`
`(20)
`
`(28)
`
`(26)
`
`(27)
`
`The resulting minimum variance spatial spectral estimate is
`found by substituting the weights of (20) into the cost function:
`
`The broadband minimum variance DOA estimator is thus given
`by
`
`(21)
`
`(22)
`
`The next section presents a new idea: the eigenanalysis of the
`parameterized spatial correlation matrix.
`
`C. Eigenanalysis of the Parameterized Spatial Correlation
`Matrix
`Using the signal model of Section II, notice that when the
`steered azimuth matches the actual azimuth , the parameter-
`ized spatial correlation matrix may be decomposed into signal
`and noise components in the following manner:
`
`where
`
`is the signal power
`
`and
`
`(23)
`
`(24)
`
`(25)
`
`where
`is
`, and
`is the maximum eigenvalue of
`the corre