throbber
Presented at: IEEE Signal Processing Society, Eighth Workshop on Image
`and Multidimensional Signal Processing, Cannes France, September 1993.
`
`Course-to-fine Estimation of Visual Motion
`Eero P. Simoncelli
`Computer and Information Science Department
`University of Pennsylvania
`
`The recovery of motion information from visual input is an important task for both natural and artificial vision
`systems. The standard approach to representing motion information is via the image velocity field:
`that is, the
`projection of the motion of points in the three-dimensional world onto the image plane. As an approximation to
`this, computer vision techniques typically compute an estimate of the motion field from the spatial and temporal
`variations of image brightness. This field of approximate velocities is known as the “optical flow”.
`One common source of difficulty in optical flow estimation systems is temporal aliasing. Motion picture and
`video imagery is typically sampled below the Nyquist rate in time. Filter-based (including differential) algorithms
`will typically give erroneous results in these situations. For optical flow algorithms based on matching, the errors
`appear as “false matches”.
`A number of authors have developed “coarse-to-fine” processing strategies for handling the temporal aliasing
`problem in the context of motion estimation [7, 9, 5, 1]. These algorithms are typically efficiently implemented using
`“image pyramids” [3]. The motivating concept in these approaches is that the aliasing only affects the high spatial
`frequencies of the input sequence. Thus, one can accurately estimate image velocities of a spatially lowpass-filtered
`copy of the input imagery. These estimates may then be used as initial guesses for estimating the motion at finer
`scales. Alternatively, one can “undo” the motion in finer scale images by warping these images according to the
`velocity estimated from a coarser scale. The velocity field of the warped images may then be computed and added
`as a correction term to the current estimate. This process may be repeated recursively until one reaches the finest
`scale of the input imagery. Multi-scale approaches also provide an efficient means of imposing regularization or
`smoothness constraints in the optical flow problem [4, 8].
`One shortcoming of the coarse-to-fine approaches mentioned above is that if the coarse scale estimate is incorrect
`by a substantial amount, the finer scales will be unable to recover from these errors. To correct this, we must have
`knowledge of the uncertainty in the coarse-scale estimates. We have developed a filter-based coarse-to-fine algorithm,
`which includes an explicit model for the errors in the estimator. The details of the algorithm are described in [11].
`We give the basic solution here.
`We define a state evolution equation for the image velocity field, ~v, with respect to scale:
`~n0l  N 0; L 0
`
`~vl + 1 = El~vl + ~n0l;
`
`where l is an index for scale (larger values of l correspond to finer scales). Note that indices for spatial and
`temporal position are implicit. El is a linear interpolation operator used to extend a coarse scale flow field to finer
`resolution. 1 ~n0 is a random variable that represents the uncertainty of the prediction of the fine-scale motion from
`the coarse-scale motion. We assume that the ~n0l are pointwise independent, zero-mean, and normally distributed.
`We also define a measurement equation, based on the standard brightness gradient constraint [6]:
`
`(cid:0)ftl = ~fsl ~vl + n2 + ~fsl ~n1;
`
`where ~fsl is the spatial gradient of the image and ftl is its temporal partial derivative. This constraint holds at
`each point in space and time (again, spatial and temporal indices are implicit). The uncertainty model, described
`in [10], incorporates both an additive and a multiplicative noise term. We again assume that the random variables
`~n1 and n2 are zero-mean, independent and normally distributed.
`Given these two equations, we may derive the optimal estimator for ~vl + 1, the velocity at the fine scale, given
`an estimate for the velocity at the previous coarse scale, ~vl, and a set of fine scale derivative measurements. The
`solution is in the form of a standard Kalman filter, but with the time variable replaced by the scale, l:
`
`~vl + 1 = El~vl +K l + 1l + 1
`L l + 1 = L
`
`l + 1 (cid:0) Kl + 1 ~f Ts l + 1L
`1If the algorithm is implemented using a pyramid data structure, images at coarser scales will be represented at lower resolution.
`Petitioner Lenovo (United States) Inc. - Ex. 1018
`
`l + 1
`
`1 of 2
`
`

`

`s l + 1(cid:0)L
`Kl + 1 = L
`l + 1 ~fsl + 1 h ~f T
`
`l + 1 = (cid:0)ftl + 1 (cid:0) ~f Ts l + 1El~vl
`l + 1 = ElL lElT + L 0;
`
`l + 1 + L 1 ~fsl + 1 + L 2i(cid:0)1
`
`where l corresponds to an innovations process.
`One important modification must be made to the standard form. We cannot compute the derivative measurements
`at scale l without making use of the velocity estimate at scale l (cid:0) 1, due to the temporal aliasing problem. Therefore,
`we must write l in terms of derivatives of the warped sequence. We can view the definition of the innovations
`process  as a temporal partial derivative of the warped sequence:
`
`
`
`l + 1 = (cid:0)ftl + 1 (cid:0) ~f Ts l + 1El~vl
`
`f (cid:0)~x + tEl~v l; t
`W ff l + 1; El~vlg;
`
`@ @
`
`t
`
`@ @
`
`t
`
` (cid:0)
`
`= (cid:0)
`
`where the operator Wfg “warps” the first argument (an image) according to the second argument (a vector field).
`Thus, the innovations process is computed as the temporal derivative of the image at scale l + 1, after it has been
`warped with the interpolated flow field estimate from scale l.
`In
`We have implemented this algorithm using a “Gaussian image pyramid” multi-scale data structure [3].
`order to make the solution computationally feasible, we ignore the off-diagonal elements in L
`l + 1 (i.e., the
`correlations between adjacent interpolated flow vectors). We have applied the algorithm to a number of synthetic
`image sequences, including the realistic “Yosemite” sequence which was generated by texture-mapping an aerial
`photograph onto a range map. The results are excellent, and compare quite favorably to those reported in a recent
`experimental comparison of optical flow techniques [2].
`
`References
`
`[1] P. Anandan. A computational framework and an algorithm for the measurement of visual motion. Intl. J.
`Comp. Vis., 2:283–310, 1989.
`[2] J. L. Barron, D. J. Fleet, and S. S. Beauchemin. Performance of optical flow techniques. TR #RPL-TR-9107,
`Queen’s University, Kingston, Ontario, Robotics and Perception Lab, July 1992.
`[3] P. J. Burt. Fast filter transforms for image processing. Comp. Graphics and Im. Proc., 16:20–51, 1981.
`[4] K. C. Chou, A. S. Willsky, A. Benveniste, and M. Basseville. Recursive and iterative estimation algorithms
`for multi-resolution stochastic processes. TR # LIDS-P-1857, MIT Laboratory for Information and Decision
`Sciences, March 1989.
`[5] W. Enkelmann and H. Nagel. Investigation of multigrid algorithms for estimation of optical flow fields in
`image sequences. Comp. Vis. Graphics Image Proc., 43:150–177, 1988.
`[6] B. K. P. Horn and B. G. Schunck. Determining optical flow. Artificial Intelligence, 17:185–203, 1981.
`[7] B. D. Lucas and T. Kanade. An iterative image registration technique with an application to stereo vision. In
`Proc. 7th Int. Joint Conf. Artificial Intelligence, pages 674–679, Vancouver, 1981.
`[8] M. Luettgen, W. Karl, and A. Willsky. Efficient multiscale regularization with applications to the computation
`of optical flow. TR # LIDS-P-2115, MIT Laboratory for Information and Decision Systems, 1992. Submitted
`to IEEE Trans. Image Proc.
`[9] L. Quam. Hierarchical warp stereo. In Proceedings of the DARPA Image Understanding Workshop, September
`1984.
`[10] E. P. Simoncelli, E. H. Adelson, and D. J. Heeger. Probability distributionsof optical flow. In IEEE Conference
`on Computer Vision and Pattern Recognition, Mauii, Hawaii, June 1991.
`[11] E. P. Simoncelli. Distributed Analysis and Representation of Visual Motion. PhD Thesis, MIT Department
`of Electrical Engineering and Computer Science. January 1993.
`
`2 of 2
`
`L
`

This document is available on Docket Alarm but you must sign up to view it.


Or .

Accessing this document will incur an additional charge of $.

After purchase, you can access this document again without charge.

Accept $ Charge
throbber

Still Working On It

This document is taking longer than usual to download. This can happen if we need to contact the court directly to obtain the document and their servers are running slowly.

Give it another minute or two to complete, and then try the refresh button.

throbber

A few More Minutes ... Still Working

It can take up to 5 minutes for us to download a document if the court servers are running slowly.

Thank you for your continued patience.

This document could not be displayed.

We could not find this document within its docket. Please go back to the docket page and check the link. If that does not work, go back to the docket and refresh it to pull the newest information.

Your account does not support viewing this document.

You need a Paid Account to view this document. Click here to change your account type.

Your account does not support viewing this document.

Set your membership status to view this document.

With a Docket Alarm membership, you'll get a whole lot more, including:

  • Up-to-date information for this case.
  • Email alerts whenever there is an update.
  • Full text search for other cases.
  • Get email alerts whenever a new case matches your search.

Become a Member

One Moment Please

The filing “” is large (MB) and is being downloaded.

Please refresh this page in a few minutes to see if the filing has been downloaded. The filing will also be emailed to you when the download completes.

Your document is on its way!

If you do not receive the document in five minutes, contact support at support@docketalarm.com.

Sealed Document

We are unable to display this document, it may be under a court ordered seal.

If you have proper credentials to access the file, you may proceed directly to the court's system using your government issued username and password.


Access Government Site

We are redirecting you
to a mobile optimized page.





Document Unreadable or Corrupt

Refresh this Document
Go to the Docket

We are unable to display this document.

Refresh this Document
Go to the Docket