Zum Hauptinhalt springen

Showing 1–14 of 14 results for author: Ozcinar, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2308.13004  [pdf, other

    cs.CV cs.AI cs.MM

    Spherical Vision Transformer for 360-degree Video Saliency Prediction

    Authors: Mert Cokelek, Nevrez Imamoglu, Cagri Ozcinar, Erkut Erdem, Aykut Erdem

    Abstract: The growing interest in omnidirectional videos (ODVs) that capture the full field-of-view (FOV) has gained 360-degree saliency prediction importance in computer vision. However, predicting where humans look in 360-degree scenes presents unique challenges, including spherical distortion, high resolution, and limited labelled data. We propose a novel vision-transformer-based model for omnidirectiona… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: 12 pages, 4 figures, accepted to BMVC 2023

  2. arXiv:2303.06907  [pdf, other

    cs.CV eess.IV

    ST360IQ: No-Reference Omnidirectional Image Quality Assessment with Spherical Vision Transformers

    Authors: Nafiseh Jabbari Tofighi, Mohamed Hedi Elfkir, Nevrez Imamoglu, Cagri Ozcinar, Erkut Erdem, Aykut Erdem

    Abstract: Omnidirectional images, aka 360 images, can deliver immersive and interactive visual experiences. As their popularity has increased dramatically in recent years, evaluating the quality of 360 images has become a problem of interest since it provides insights for capturing, transmitting, and consuming this new media. However, directly adapting quality assessment methods proposed for standard natura… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: ICASSP 2023

  3. arXiv:2103.08467  [pdf, other

    cs.LG eess.SP

    Ensemble approach for detection of depression using EEG features

    Authors: Egils Avots, Klavs Jermakovs, Maie Bachmann, Laura Paeske, Cagri Ozcinar, Gholamreza Anbarjafari

    Abstract: Depression is a public health issue which severely affects one's well being and cause negative social and economic effect for society. To rise awareness of these problems, this publication aims to determine if long lasting effects of depression can be determined from electoencephalographic (EEG) signals. The article contains accuracy comparison for SVM, LDA, NB, kNN and D3 binary classifiers which… ▽ More

    Submitted 7 March, 2021; originally announced March 2021.

    Comments: 8 pages, 2 figures

  4. arXiv:2101.10396  [pdf, other

    eess.IV cs.CV

    Quality Assessment of Super-Resolved Omnidirectional Image Quality Using Tangential Views

    Authors: Cagri Ozcinar, Aakanksha Rana

    Abstract: Omnidirectional images (ODIs), also known as 360-degree images, enable viewers to explore all directions of a given 360-degree scene from a fixed point. Designing an immersive imaging system with ODI is challenging as such systems require very large resolution coverage of the entire 360 viewing space to provide an enhanced quality of experience (QoE). Despite remarkable progress on single image su… ▽ More

    Submitted 25 January, 2021; originally announced January 2021.

    Comments: Paper Accepted at Electronic Imaging

  5. arXiv:2010.12540  [pdf, other

    cs.IR cs.CY cs.MM

    Comprehensive Empirical Evaluation of Deep Learning Approaches for Session-based Recommendation in E-Commerce

    Authors: Mohamed Maher, Perseverance Munga Ngoy, Aleksandrs Rebriks, Cagri Ozcinar, Josue Cuevas, Rajasekhar Sanagavarapu, Gholamreza Anbarjafari

    Abstract: Boosting sales of e-commerce services is guaranteed once users find more matching items to their interests in a short time. Consequently, recommendation systems have become a crucial part of any successful e-commerce services. Although various recommendation techniques could be used in e-commerce, a considerable amount of attention has been drawn to session-based recommendation systems during the… ▽ More

    Submitted 17 October, 2020; originally announced October 2020.

    Comments: 48 pages, 17 figures, journal

  6. arXiv:2008.03195  [pdf, other

    cs.CV eess.IV

    A Study on Visual Perception of Light Field Content

    Authors: Ailbhe Gill, Emin Zerman, Cagri Ozcinar, Aljosa Smolic

    Abstract: The effective design of visual computing systems depends heavily on the anticipation of visual attention, or saliency. While visual attention is well investigated for conventional 2D images and video, it is nevertheless a very active research area for emerging immersive media. In particular, visual attention of light fields (light rays of a scene captured by a grid of cameras or micro lenses) has… ▽ More

    Submitted 7 August, 2020; originally announced August 2020.

    Comments: To appear in Irish Machine Vision and Image Processing (IMVIP) 2020

    ACM Class: I.2.10; I.4; I.5

  7. arXiv:2008.01116  [pdf, other

    eess.IV cs.CV

    Sub-Pixel Back-Projection Network For Lightweight Single Image Super-Resolution

    Authors: Supratik Banerjee, Cagri Ozcinar, Aakanksha Rana, Aljosa Smolic, Michael Manzke

    Abstract: Convolutional neural network (CNN)-based methods have achieved great success for single-image superresolution (SISR). However, most models attempt to improve reconstruction accuracy while increasing the requirement of number of model parameters. To tackle this problem, in this paper, we study reducing the number of parameters and computational cost of CNN-based SISR methods while maintaining the a… ▽ More

    Submitted 3 August, 2020; originally announced August 2020.

    Comments: To appear in IMVIP 2020

  8. arXiv:1908.06752  [pdf, other

    cs.SD cs.CV cs.LG cs.MM eess.AS

    Towards Generating Ambisonics Using Audio-Visual Cue for Virtual Reality

    Authors: Aakanksha Rana, Cagri Ozcinar, Aljoscha Smolic

    Abstract: Ambisonics i.e., a full-sphere surround sound, is quintessential with 360-degree visual content to provide a realistic virtual reality (VR) experience. While 360-degree visual content capture gained a tremendous boost recently, the estimation of corresponding spatial sound is still challenging due to the required sound-field microphones or information about the sound-source locations. In this pape… ▽ More

    Submitted 16 August, 2019; originally announced August 2019.

    Comments: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  9. arXiv:1908.04297  [pdf, other

    cs.CV cs.LG cs.MM eess.IV

    Super-resolution of Omnidirectional Images Using Adversarial Learning

    Authors: Cagri Ozcinar, Aakanksha Rana, Aljosa Smolic

    Abstract: An omnidirectional image (ODI) enables viewers to look in every direction from a fixed point through a head-mounted display providing an immersive experience compared to that of a standard image. Designing immersive virtual reality systems with ODIs is challenging as they require high resolution content. In this paper, we study super-resolution for ODIs and propose an improved generative adversari… ▽ More

    Submitted 12 August, 2019; originally announced August 2019.

  10. arXiv:1902.07653  [pdf, other

    cs.CV

    On the effect of age perception biases for real age regression

    Authors: Julio C. S. Jacques Junior, Cagri Ozcinar, Marina Marjanovic, Xavier Baró, Gholamreza Anbarjafari, Sergio Escalera

    Abstract: Automatic age estimation from facial images represents an important task in computer vision. This paper analyses the effect of gender, age, ethnic, makeup and expression attributes of faces as sources of bias to improve deep apparent age prediction. Following recent works where it is shown that apparent age labels benefit real age estimation, rather than direct real to real age regression, our mai… ▽ More

    Submitted 20 February, 2019; originally announced February 2019.

    Comments: Accepted in the 14th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2019)

  11. arXiv:1805.03105  [pdf, other

    cs.MM

    Optimization of Occlusion-Inducing Depth Pixels in 3-D Video Coding

    Authors: Pan Gao, Cagri Ozcinar, Aljosa Smolic

    Abstract: The optimization of occlusion-inducing depth pixels in depth map coding has received little attention in the literature, since their associated texture pixels are occluded in the synthesized view and their effect on the synthesized view is considered negligible. However, the occlusion-inducing depth pixels still need to consume the bits to be transmitted, and will induce geometry distortion that i… ▽ More

    Submitted 8 May, 2018; originally announced May 2018.

  12. arXiv:1801.08863  [pdf, other

    cs.CV cs.GR

    3D Scanning: A Comprehensive Survey

    Authors: Morteza Daneshmand, Ahmed Helmi, Egils Avots, Fatemeh Noroozi, Fatih Alisinanoglu, Hasan Sait Arslan, Jelena Gorbova, Rain Eric Haamer, Cagri Ozcinar, Gholamreza Anbarjafari

    Abstract: This paper provides an overview of 3D scanning methodologies and technologies proposed in the existing scientific and industrial literature. Throughout the paper, various types of the related techniques are reviewed, which consist, mainly, of close-range, aerial, structure-from-motion and terrestrial photogrammetry, and mobile, terrestrial and airborne laser scanning, as well as time-of-flight, st… ▽ More

    Submitted 23 January, 2018; originally announced January 2018.

    Comments: 18 pages, 3 figures

  13. arXiv:1711.03362  [pdf, other

    cs.MM

    Estimation of optimal encoding ladders for tiled 360° VR video in adaptive streaming systems

    Authors: Cagri Ozcinar, Ana De Abreu, Sebastian Knorr, Aljosa Smolic

    Abstract: Given the significant industrial growth of demand for virtual reality (VR), 360° video streaming is one of the most important VR applications that require cost-optimal solutions to achieve widespread proliferation of VR technology. Because of its inherent variability of data-intensive content types and its tiled-based encoding and streaming, 360° video requires new encoding ladders in adaptive str… ▽ More

    Submitted 9 November, 2017; originally announced November 2017.

    Comments: The 19th IEEE International Symposium on Multimedia (ISM 2017), Taichung, Taiwan

    Journal ref: The 19th IEEE International Symposium on Multimedia (ISM 2017), Taichung, Taiwan

  14. arXiv:1711.02386  [pdf, other

    cs.MM

    Viewport-aware adaptive 360° video streaming using tiles for virtual reality

    Authors: Cagri Ozcinar, Ana De Abreu, Aljosa Smolic

    Abstract: 360° video is attracting an increasing amount of attention in the context of Virtual Reality (VR). Owing to its very high-resolution requirements, existing professional streaming services for 360° video suffer from severe drawbacks. This paper introduces a novel end-to-end streaming system from encoding to displaying, to transmit 8K resolution 360° video and to provide an enhanced VR experience us… ▽ More

    Submitted 7 November, 2017; originally announced November 2017.

    Comments: IEEE International Conference on Image Processing (ICIP) 2017

    Journal ref: 2017 IEEE International Conference on Image Processing (ICIP), Beijing, China, 2017