Search | arXiv e-print repository

Oriented bounding boxes using multiresolution contours for fast interference detection of arbitrary geometry objects

Authors: L. A. Rivera, Vania V. Estrela, P. C. P. Carvalho

Abstract: Interference detection of arbitrary geometric objects is not a trivial task due to the heavy computational load imposed by implementation issues. The hierarchically structured bounding boxes help us to quickly isolate the contour of segments in interference. In this paper, a new approach is introduced to treat the interference detection problem involving the representation of arbitrarily shaped objects. Our proposed method relies upon searching for the best possible way to represent contours by means of hierarchically structured rectangular oriented bounding boxes. This technique handles 2D object boundaries defined by closed B-spline curves with roughness details. Each oriented box is adapted and fitted to the segments of the contour using second order statistical indicators from some elements of the segments of the object contour in a multiresolution framework. Our method is efficient and robust when it comes to 2D animations in real time. It can deal with smooth curves and polygonal approximations as well. Results are presented to illustrate the performance of the new method.

Submitted 11 November, 2016; originally announced November 2016.

Comments: 8 pages, 10 figures

Journal ref: The 12-th International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision'2004, WSCG 2004, University of West Bohemia, Campus Bory, Plzen-Bory, Czech Republic, February 2-6, 2004
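
The abstract above mentions fitting each oriented box with second order statistical indicators. The listing does not give the paper's actual procedure, so the following is only a minimal sketch of one common reading of that idea: take the covariance of the contour samples, use its principal axis as the box orientation, and size the box from the projected extents. The function name `fit_obb` and the sample points are hypothetical and chosen for illustration.

```python
import math

def fit_obb(points):
    """Fit an oriented bounding box to 2D points using their covariance.

    Illustrative sketch only (hypothetical helper, not the paper's method).
    Returns (center, theta, half_extents); theta is the angle of the
    principal axis in radians.
    """
    n = len(points)
    mx = sum(x for x, _ in points) / n
    my = sum(y for _, y in points) / n
    # Second order statistics: entries of the 2x2 covariance matrix.
    sxx = sum((x - mx) ** 2 for x, _ in points) / n
    syy = sum((y - my) ** 2 for _, y in points) / n
    sxy = sum((x - mx) * (y - my) for x, y in points) / n
    # Closed-form orientation of the major eigenvector of a 2x2 covariance.
    theta = 0.5 * math.atan2(2.0 * sxy, sxx - syy)
    ux, uy = math.cos(theta), math.sin(theta)   # major axis
    vx, vy = -uy, ux                            # minor axis
    # Project the points onto both axes and take the extents.
    a = [(x - mx) * ux + (y - my) * uy for x, y in points]
    b = [(x - mx) * vx + (y - my) * vy for x, y in points]
    ca, cb = 0.5 * (min(a) + max(a)), 0.5 * (min(b) + max(b))
    center = (mx + ca * ux + cb * vx, my + ca * uy + cb * vy)
    half_extents = (0.5 * (max(a) - min(a)), 0.5 * (max(b) - min(b)))
    return center, theta, half_extents

if __name__ == "__main__":
    segment = [(0.0, 0.0), (1.0, 1.0), (2.0, 2.0), (3.0, 3.0)]
    print(fit_obb(segment))
```

In a hierarchical scheme, a box like this would be fitted to each contour segment at every resolution level, with coarse boxes tested first and finer ones only where coarse boxes overlap.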

arXiv:1611.03268 [pdf]

Error concealment by means of motion refinement and regularized Bregman divergence

Authors: Alessandra M. Coelho, Vania V. Estrela, Felipe P. do Carmo, Sandro R. Fernandes

Abstract: This work addresses the problem of error concealment in video transmission systems over noisy channels employing Bregman divergences along with regularization. Error concealment intends to mitigate the effects of disturbances at the reception due to bit-errors or cell loss in packet networks. Bregman regularization gives accurate answers after just some iterations with fast convergence, better accuracy, and stability. This technique has an adaptive nature: the regularization functional is updated according to Bregman functions that change from iteration to iteration according to the nature of the neighborhood under study at iteration n. Numerical experiments show that high-quality regularization parameter estimates can be obtained. The convergence is sped up while making the regularization parameter estimation less empirical and more automatic.

Submitted 10 November, 2016; originally announced November 2016.

Comments: 8 pages, 4 figures

arXiv:1611.02637 [pdf]

doi 10.1109/mmsp.2009.5293264

Estimating motion with principal component regression strategies

Authors: Felipe P. do Carmo, Vania Vieira Estrela, Joaquim Teixeira de Assis

Abstract: In this paper, two simple principal component regression methods for estimating the optical flow between frames of video sequences in a pel-recursive manner are introduced. These are easy alternatives to dealing with mixtures of motion vectors in addition to the lack of prior information on spatial-temporal statistics (although they are supposed to be normal in a local sense). The 2D motion vector estimation approaches take into consideration simple image properties and are used to harmonize regularized least square estimates. Their main advantage is that no knowledge of the noise distribution is necessary, although there is an underlying assumption of localized smoothness. Preliminary experiments indicate that this approach provides robust estimates of the optical flow.

Submitted 8 November, 2016; originally announced November 2016.

Comments: 6 pages, 3 figures. arXiv admin note: substantial text overlap with arXiv:1610.02923

Journal ref: Proceedings of the IEEE International Workshop on Multimedia Signal Processing, 2009, MMSP '09, 2009

arXiv:1611.01298 [pdf]

doi 10.1109/SIBGRA.2003.1241027

Regularized Pel-Recursive Motion Estimation Using Generalized Cross-Validation and Spatial Adaptation

Authors: Vania V. Estrela, Luis A. Rivera, Paulo C. Beggio, Ricardo T. Lopes

Abstract: The computation of 2-D optical flow by means of regularized pel-recursive algorithms raises a host of issues, which include the treatment of outliers, motion discontinuities and occlusion among other problems. We propose a new approach which allows us to deal with these issues within a common framework. Our approach is based on the use of a technique called Generalized Cross-Validation to estimate the best regularization scheme for a given pixel. In our model, the regularization parameter is a matrix whose entries can account for diverse sources of error. The estimation of the motion vectors takes into consideration local properties of the image following a spatially adaptive approach where each moving pixel is supposed to have its own regularization matrix. Preliminary experiments indicate that this approach provides robust estimates of the optical flow.

Submitted 4 November, 2016; originally announced November 2016.

Comments: 8 pages, 6 figures in Proceedings of the XVI Brazilian Symposium on Computer Graphics and Image Processing, 2003. SIBGRAPI 2003. IEEE. arXiv admin note: text overlap with arXiv:1403.7365, arXiv:1611.00960

arXiv:1611.00960 [pdf]

doi 10.1117/12.632674

Adaptive mixed norm optical flow estimation

Authors: Vania V. Estrela, Matthias O. Franz, Ricardo T. Lopes, G. P. De Araujo

Abstract: The pel-recursive computation of 2-D optical flow has been extensively studied in computer vision to estimate motion from image sequences, but it still raises a wealth of issues, such as the treatment of outliers, motion discontinuities and occlusion. It relies on spatio-temporal brightness variations due to motion. Our proposed adaptive regularized approach deals with these issues within a common framework. It relies on the use of a data-driven technique called Mixed Norm (MN) to estimate the best motion vector for a given pixel. In our model, various types of noise can be handled, representing different sources of error. The motion vector estimation takes into consideration local image properties and it results from the minimization of a mixed norm functional with a regularization parameter depending on the kurtosis. This parameter determines the relative importance of the fourth norm and makes the functional convex. The main advantage of the developed procedure is that no knowledge of the noise distribution is necessary. Experiments indicate that this approach provides robust estimates of the optical flow.

Submitted 3 November, 2016; originally announced November 2016.

Comments: 8 pages, 4 figures. arXiv admin note: text overlap with arXiv:1403.7365

Journal ref: Proc. SPIE 5960, Visual Communications and Image Processing 2005, 59603W, July 31, 2006, Beijing, China

arXiv:1610.02923 [pdf]

doi 10.5772/38129

EM-Based Mixture Models Applied to Video Event Detection

Authors: Alessandra Martins Coelho, Vania V. Estrela

Abstract: Surveillance system (SS) development requires hi-tech support to prevail over the shortcomings related to the massive quantity of visual information from SSs. Anything but reduced human monitoring became impossible by means of its physical and economic implications, and an advance towards an automated surveillance becomes the only way out. When it comes to a computer vision system, automatic video event comprehension is a challenging task due to motion clutter, event understanding under complex scenes, multilevel semantic event inference, contextualization of events and views obtained from multiple cameras, unevenness of motion scales, shape changes, occlusions and object interactions among lots of other impairments. In recent years, state-of-the-art models for video event classification and recognition include modeling events to discern context, detecting incidents with only one camera, low-level feature extraction and description, high-level semantic event classification, and recognition. Even so, it is still very burdensome to retrieve or label a specific video part relying solely on its content. Principal component analysis (PCA) has been widely known and used, but when combined with other techniques such as the expectation-maximization (EM) algorithm its computation becomes more efficient. This chapter introduces advances associated with the concept of Probabilistic PCA (PPCA) analysis of video events, and it also aims at looking closely at ways and metrics to evaluate these less intensive EM implementations of PCA and KPCA.

Submitted 10 October, 2016; originally announced October 2016.

Comments: 25 pages, 8 figures, Available from: http://www.intechopen.com/books/principal-component-analysis-engineering-applications/em-based-mixture-models-applied-to-video-event-detection, Chapter from book "Principal Component Analysis - Engineering Applications", Dr. Parinya Sanguansat (Ed.), InTech, 2012. arXiv admin note: text overlap with arXiv:1404.1100 by other authors

arXiv:1610.02902 [pdf]

doi 10.4018/978-1-4666-9978-6.ch039

Content Based Image Retrieval (CBIR) in Remote Clinical Diagnosis and Healthcare

Authors: Albany E. Herrmann, Vania Vieira Estrela

Abstract: Content-Based Image Retrieval (CBIR) locates, retrieves and displays images similar to one given as a query, using a set of features. It demands accessible data in medical archives and from medical equipment, to infer meaning after some processing. A problem similar in some sense to the target image can aid clinicians. CBIR complements text-based retrieval and improves evidence-based diagnosis, administration, teaching, and research in healthcare. It facilitates visual/automatic diagnosis and decision-making in real-time remote consultation/screening, store-and-forward tests, home care assistance and overall patient surveillance. Metrics help compare visual data and improve diagnostics. Specially designed architectures can benefit from the application scenario. CBIR use calls for file storage standardization, querying procedures, efficient image transmission, realistic databases, global availability, access simplicity, and Internet-based structures. This chapter recommends important and complex aspects required to handle visual content in healthcare.

Submitted 10 October, 2016; originally announced October 2016.

Comments: 28 pages, 6 figures, Book Chapter from "Encyclopedia of E-Health and Telemedicine"

Journal ref: Encyclopedia of E-Health and Telemedicine. IGI Global, 2016. 495-520. Web. 10 Oct. 2016

arXiv:1603.09599 [pdf]

doi 10.4018/978-1-4666-8654-0.ch002

Total Variation Applications in Computer Vision

Authors: Vania V. Estrela, Hermes Aguiar Magalhaes, Osamu Saotome

Abstract: The objectives of this chapter are: (i) to introduce a concise overview of regularization; (ii) to define and to explain the role of a particular type of regularization called total variation norm (TV-norm) in computer vision tasks; (iii) to set up a brief discussion on the mathematical background of TV methods; and (iv) to establish a relationship between models and a few existing methods to solve problems cast as TV-norm. For the most part, image-processing algorithms blur the edges of the estimated images; however, TV regularization preserves the edges with no prior information on the observed and the original images. The regularization scalar parameter λ controls the amount of regularization allowed and is essential to obtaining a high-quality regularized output. A wide-ranging review of several ways to put into practice TV regularization as well as its advantages and limitations are discussed.

Submitted 31 March, 2016; originally announced March 2016.

Comments: 24 pages, Book Title: Handbook of Research on Emerging Perspectives in Intelligent Pattern Recognition, Analysis, and Image Processing, Editor Narendra Kumar Kamila, IGI Global, 2016, http://www.igi-global.com/chapter/total-variation-applications-in-computer-vision/141626
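
The abstract above describes TV regularization with a scalar parameter λ. The chapter's own formulation is not reproduced in this listing, so the following is only a minimal sketch, assuming the standard anisotropic discrete TV (sum of absolute differences between neighboring pixels) and a squared-error data term. The function names `total_variation` and `tv_objective` and the tiny test images are hypothetical.

```python
def total_variation(img):
    """Anisotropic discrete TV of a 2D image given as a list of rows.

    Sums absolute differences between vertical and horizontal neighbors.
    Illustrative sketch; other TV discretizations exist.
    """
    rows, cols = len(img), len(img[0])
    tv = 0.0
    for i in range(rows):
        for j in range(cols):
            if i + 1 < rows:
                tv += abs(img[i + 1][j] - img[i][j])
            if j + 1 < cols:
                tv += abs(img[i][j + 1] - img[i][j])
    return tv

def tv_objective(u, f, lam):
    """Cost 0.5 * ||u - f||^2 + lam * TV(u) for estimate u and observation f.

    lam plays the role of the regularization parameter: larger values
    penalize variation in u more strongly.
    """
    fidelity = 0.5 * sum(
        (a - b) ** 2 for ru, rf in zip(u, f) for a, b in zip(ru, rf)
    )
    return fidelity + lam * total_variation(u)

if __name__ == "__main__":
    step = [[0.0, 0.0, 1.0, 1.0], [0.0, 0.0, 1.0, 1.0]]  # one sharp edge
    flat = [[0.5] * 4 for _ in range(2)]                 # edge blurred away
    print(tv_objective(step, step, 0.1), tv_objective(flat, step, 0.1))
```

With a small λ the sharp-edged estimate has the lower cost in this toy case, which illustrates why a TV penalty need not blur a single clean edge: the penalty depends on the size of the jump, not on how abrupt it is.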

arXiv:1603.09558 [pdf]

doi 10.1109/MMSP.2009.5293265

Sub-pixel accuracy edge fitting by means of B-spline

Authors: R. L. B. Breder, Vania V. Estrela, J. T. de Assis

Abstract: Local perturbations around contours strongly disturb the final result of computer vision tasks. It is common to introduce a priori information in the estimation process. Improvement can be achieved via a deformable model such as the snake model. In recent works, the deformable contour is modeled by means of B-spline snakes, which allow local control, concise representation, and the use of fewer parameters. The estimation of the sub-pixel edges using a global B-spline model relies on the contour global determination according to a maximum likelihood framework and using the observed data likelihood. This procedure guarantees that the noisiest data will be filtered out. The data likelihood is computed as a consequence of the observation model which includes both orientation and position information. Comparative experiments of this algorithm and the classical spline interpolation have shown that the proposed algorithm outperforms the classical approach for Gaussian and Salt & Pepper noise.

Submitted 31 March, 2016; originally announced March 2016.

Comments: 5 pages, Proceedings of the MMSP '09. IEEE International Workshop on Multimedia Signal Processing, ISBN 978-1-4244-4463-2

arXiv:1603.08095 [pdf]

doi 10.1109/ACSSC.2009

Blind signal separation and identification of mixtures of images

Authors: Felipe P. do Carmo, Joaquim T. de Assis, Vania V. Estrela, Alessandra M. Coelho

Abstract: In this paper, a fresh procedure to handle image mixtures by means of blind signal separation relying on a combination of second order and higher order statistics techniques is introduced. The problem of blind signal separation is reassigned to the wavelet domain. The key idea behind this method is that the image mixture can be decomposed into the sum of uncorrelated and/or independent sub-bands using wavelet transform. Initially, the observed image is pre-whitened in the space domain. Afterwards, an initial separation matrix is estimated from the second order statistics de-correlation model in the wavelet domain. Later, this matrix will be used as an initial separation matrix for the higher order statistics stage in order to find the best separation matrix. The suggested algorithm was tested using natural images. Experiments have confirmed that the use of the proposed process provides promising outcomes in identifying an image from noisy mixtures of images.

Submitted 26 March, 2016; originally announced March 2016.

Comments: 6 pages

arXiv:1403.7365 [pdf]

Expectation-Maximization Technique and Spatial-Adaptation Applied to Pel-Recursive Motion Estimation

Authors: Vania Vieira Estrela, Marcos Henrique da Silva Bassani

Abstract: Pel-recursive motion estimation is a well-established approach. However, in the presence of noise, it becomes an ill-posed problem that requires regularization. In this paper, motion vectors are estimated in an iterative fashion by means of the Expectation-Maximization (EM) algorithm and a Gaussian data model. Our proposed algorithm also utilizes the local image properties of the scene to improve the motion vector estimates following a spatially adaptive approach. Numerical experiments are presented that demonstrate the merits of our method.

Submitted 28 March, 2014; originally announced March 2014.

Comments: 6 pages, pp. 204-209, Proceedings of the 8th World Multi-Conference on Systemics, Cybernetics and Informatics, Volume XVI, Organized by the International Institute of Informatics and Systemics, International Federation of Systems Research: IFSR, Edited by Nagib Callaos, Maria Sanchez, and Juan M. Pineda, TIB/UB Hannover, ISSN 12615810X, July 18-21, 2004, Orlando, Florida, USA

Journal ref: ISSN 12615810X, 2004

arXiv:1312.6497 [pdf]

doi 10.4018/978-1-4666-2660-7.ch006

State-of-the Art Motion Estimation in the Context of 3D TV

Authors: Vania V. Estrela, Alessandra M. Coelho

Abstract: Progress in image sensors and computation power has fueled studies to improve acquisition, processing, and analysis of 3D streams along with 3D scenes/objects reconstruction. The role of motion compensation/motion estimation (MCME) in 3D TV from end-to-end user is investigated in this chapter. Motion vectors (MVs) are closely related to the concept of disparities, and they can help improve dynamic scene acquisition, content creation, 2D to 3D conversion, compression coding, decompression/decoding, scene rendering, error concealment, virtual/augmented reality handling, intelligent content retrieval, and displaying. Although there are different 3D shape extraction methods, this chapter focuses mostly on shape-from-motion (SfM) techniques due to their relevance to 3D TV. SfM extraction can restore 3D shape information from single-camera data.

Submitted 23 December, 2013; originally announced December 2013.

Journal ref: Multimedia Networking and Coding. IGI Global, 2013. 148-173. Web. 23 Dec. 2013

Showing 1–12 of 12 results for author: Estrela, V V