Zum Hauptinhalt springen

Showing 1–19 of 19 results for author: Tekalp, A M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.11273  [pdf, other

    eess.IV cs.CV

    Training Transformer Models by Wavelet Losses Improves Quantitative and Visual Performance in Single Image Super-Resolution

    Authors: Cansu Korkmaz, A. Murat Tekalp

    Abstract: Transformer-based models have achieved remarkable results in low-level vision tasks including image super-resolution (SR). However, early Transformer-based approaches that rely on self-attention within non-overlapping windows encounter challenges in acquiring global information. To activate more input pixels globally, hybrid attention models have been proposed. Moreover, training by solely minimiz… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: total of 10 pages including references, 5 tables and 5 figures, accepted for NTIRE 2024 Single Image Super Resolution (x4) challenge

  2. arXiv:2404.09790  [pdf, other

    cs.CV

    NTIRE 2024 Challenge on Image Super-Resolution ($\times$4): Methods and Results

    Authors: Zheng Chen, Zongwei Wu, Eduard Zamfir, Kai Zhang, Yulun Zhang, Radu Timofte, Xiaokang Yang, Hongyuan Yu, Cheng Wan, Yuxin Hong, Zhijuan Huang, Yajun Zou, Yuan Huang, Jiamin Lin, Bingnan Han, Xianyu Guan, Yongsheng Yu, Daoan Zhang, Xuanwu Yin, Kunlong Zuo, Jinhua Hao, Kai Zhao, Kun Yuan, Ming Sun, Chao Zhou , et al. (63 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 challenge on image super-resolution ($\times$4), highlighting the solutions proposed and the outcomes obtained. The challenge involves generating corresponding high-resolution (HR) images, magnified by a factor of four, from low-resolution (LR) inputs using prior information. The LR images originate from bicubic downsampling degradation. The aim of the challenge i… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: NTIRE 2024 webpage: https://cvlai.net/ntire/2024. Code: https://github.com/zhengchen1999/NTIRE2024_ImageSR_x4

  3. arXiv:2403.11791  [pdf, other

    eess.IV cs.CV

    PAON: A New Neuron Model using Padé Approximants

    Authors: Onur Keleş, A. Murat Tekalp

    Abstract: Convolutional neural networks (CNN) are built upon the classical McCulloch-Pitts neuron model, which is essentially a linear model, where the nonlinearity is provided by a separate activation function. Several researchers have proposed enhanced neuron models, including quadratic neurons, generalized operational neurons, generative neurons, and super neurons, with stronger nonlinearity than that pr… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Submitted to IEEE ICIP 2024

  4. arXiv:2402.19215  [pdf, other

    eess.IV cs.CV

    Training Generative Image Super-Resolution Models by Wavelet-Domain Losses Enables Better Control of Artifacts

    Authors: Cansu Korkmaz, A. Murat Tekalp, Zafer Dogan

    Abstract: Super-resolution (SR) is an ill-posed inverse problem, where the size of the set of feasible solutions that are consistent with a given low-resolution image is very large. Many algorithms have been proposed to find a "good" solution among the feasible solutions that strike a balance between fidelity and perceptual quality. Unfortunately, all known methods generate artifacts and hallucinations whil… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: Accepted for IEEE CVPR 2024, total of 11 pages, 3 pages for references, 7 figures and 2 tables

  5. arXiv:2402.07597  [pdf, other

    eess.IV cs.CV

    Trustworthy SR: Resolving Ambiguity in Image Super-resolution via Diffusion Models and Human Feedback

    Authors: Cansu Korkmaz, Ege Cirakman, A. Murat Tekalp, Zafer Dogan

    Abstract: Super-resolution (SR) is an ill-posed inverse problem with a large set of feasible solutions that are consistent with a given low-resolution image. Various deterministic algorithms aim to find a single solution that balances fidelity and perceptual quality; however, this trade-off often causes visual artifacts that bring ambiguity in information-centric applications. On the other hand, diffusion m… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: total of 7 pages with double column, 1 and a half for references, 6 figures and 2 tables, submitted to IEEE ICIP 2024

  6. arXiv:2306.16544  [pdf, other

    eess.IV cs.CV

    Multi-Scale Deformable Alignment and Content-Adaptive Inference for Flexible-Rate Bi-Directional Video Compression

    Authors: M. Akın Yılmaz, O. Ugur Ulas, A. Murat Tekalp

    Abstract: The lack of ability to adapt the motion compensation model to video content is an important limitation of current end-to-end learned video compression models. This paper advances the state-of-the-art by proposing an adaptive motion-compensation model for end-to-end rate-distortion optimized hierarchical bi-directional video compression. In particular, we propose two novelties: i) a multi-scale def… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: Accepted for publication in IEEE International Conference on Image Processing (ICIP) 2023

  7. Multi-Field De-interlacing using Deformable Convolution Residual Blocks and Self-Attention

    Authors: Ronglei Ji, A. Murat Tekalp

    Abstract: Although deep learning has made significant impact on image/video restoration and super-resolution, learned deinterlacing has so far received less attention in academia or industry. This is despite deinterlacing is well-suited for supervised learning from synthetic data since the degradation model is known and fixed. In this paper, we propose a novel multi-field full frame-rate deinterlacing netwo… ▽ More

    Submitted 21 September, 2022; originally announced September 2022.

    Comments: 5 pages, 4 figures, accepted to ICIP 2022

  8. arXiv:2209.08568  [pdf, other

    cs.CV cs.LG eess.IV eess.SP

    MMSR: Multiple-Model Learned Image Super-Resolution Benefiting From Class-Specific Image Priors

    Authors: Cansu Korkmaz, A. Murat Tekalp, Zafer Dogan

    Abstract: Assuming a known degradation model, the performance of a learned image super-resolution (SR) model depends on how well the variety of image characteristics within the training set matches those in the test set. As a result, the performance of an SR model varies noticeably from image to image over a test set depending on whether characteristics of specific images are similar to those in the trainin… ▽ More

    Submitted 18 September, 2022; originally announced September 2022.

    Comments: 5 pages, 4 figures, accepted for publication in IEEE ICIP 2022 Conference

  9. arXiv:2209.08564  [pdf, other

    cs.CV cs.LG eess.IV eess.SP

    Perception-Distortion Trade-off in the SR Space Spanned by Flow Models

    Authors: Cansu Korkmaz, A. Murat Tekalp, Zafer Dogan, Erkut Erdem, Aykut Erdem

    Abstract: Flow-based generative super-resolution (SR) models learn to produce a diverse set of feasible SR solutions, called the SR space. Diversity of SR solutions increases with the temperature ($τ$) of latent variables, which introduces random variations of texture among sample solutions, resulting in visual artifacts and low fidelity. In this paper, we present a simple but effective image ensembling/fus… ▽ More

    Submitted 18 September, 2022; originally announced September 2022.

    Comments: 5 pages, 4 figures, accepted for publication in IEEE ICIP 2022 Conference

  10. arXiv:2206.13613  [pdf, other

    eess.IV cs.CV

    Flexible-Rate Learned Hierarchical Bi-Directional Video Compression With Motion Refinement and Frame-Level Bit Allocation

    Authors: Eren Cetin, M. Akin Yilmaz, A. Murat Tekalp

    Abstract: This paper presents improvements and novel additions to our recent work on end-to-end optimized hierarchical bi-directional video compression to further advance the state-of-the-art in learned video compression. As an improvement, we combine motion estimation and prediction modules and compress refined residual motion vectors for improved rate-distortion performance. As novel addition, we adapted… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: Accepted for publication in IEEE International Conference on Image Processing (ICIP 2022)

    Report number: 1850

  11. arXiv:2112.09529  [pdf, other

    eess.IV cs.CV

    End-to-End Rate-Distortion Optimized Learned Hierarchical Bi-Directional Video Compression

    Authors: M. Akın Yılmaz, A. Murat Tekalp

    Abstract: Conventional video compression (VC) methods are based on motion compensated transform coding, and the steps of motion estimation, mode and quantization parameter selection, and entropy coding are optimized individually due to the combinatorial nature of the end-to-end optimization problem. Learned VC allows end-to-end rate-distortion (R-D) optimized training of nonlinear transform, motion and entr… ▽ More

    Submitted 17 December, 2021; originally announced December 2021.

    Comments: Accepted for publication in IEEE Transactions on Image Processing on 15 Dec. 2021

  12. arXiv:2106.00504  [pdf, other

    eess.IV cs.LG eess.SP

    Two-stage domain adapted training for better generalization in real-world image restoration and super-resolution

    Authors: Cansu Korkmaz, A. Murat Tekalp, Zafer Dogan

    Abstract: It is well-known that in inverse problems, end-to-end trained networks overfit the degradation model seen in the training set, i.e., they do not generalize to other types of degradations well. Recently, an approach to first map images downsampled by unknown filters to bicubicly downsampled look-alike images was proposed to successfully super-resolve such images. In this paper, we show that any inv… ▽ More

    Submitted 1 June, 2021; originally announced June 2021.

    Comments: Accepted for publication in IEEE ICIP 2021 Conference

  13. arXiv:2105.12794  [pdf, other

    cs.CV eess.IV

    DFPN: Deformable Frame Prediction Network

    Authors: M. Akın Yılmaz, A. Murat Tekalp

    Abstract: Learned frame prediction is a current problem of interest in computer vision and video compression. Although several deep network architectures have been proposed for learned frame prediction, to the best of our knowledge, there is no work based on using deformable convolutions for frame prediction. To this effect, we propose a deformable frame prediction network (DFPN) for task oriented implicit… ▽ More

    Submitted 26 May, 2021; originally announced May 2021.

    Comments: Accepted for publication in IEEE International Conference on Image Processing (ICIP) 2021

  14. arXiv:2105.12107  [pdf, other

    eess.IV cs.CV

    Self-Organized Variational Autoencoders (Self-VAE) for Learned Image Compression

    Authors: M. Akın Yılmaz, Onur Keleş, Hilal Güven, A. Murat Tekalp, Junaid Malik, Serkan Kıranyaz

    Abstract: In end-to-end optimized learned image compression, it is standard practice to use a convolutional variational autoencoder with generalized divisive normalization (GDN) to transform images into a latent space. Recently, Operational Neural Networks (ONNs) that learn the best non-linearity from a set of alternatives, and their self-organized variants, Self-ONNs, that approximate any non-linearity via… ▽ More

    Submitted 28 May, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

    Comments: Accepted for publication in IEEE International Conference on Image Processing (ICIP) 2021

  15. arXiv:2104.14868  [pdf, other

    eess.IV cs.MM

    On the Computation of PSNR for a Set of Images or Video

    Authors: Onur Keleş, M. Akın Yılmaz, A. Murat Tekalp, Cansu Korkmaz, Zafer Dogan

    Abstract: When comparing learned image/video restoration and compression methods, it is common to report peak-signal to noise ratio (PSNR) results. However, there does not exist a generally agreed upon practice to compute PSNR for sets of images or video. Some authors report average of individual image/frame PSNR, which is equivalent to computing a single PSNR from the geometric mean of individual image/fra… ▽ More

    Submitted 30 April, 2021; originally announced April 2021.

    Comments: accepted for publication in Picture Coding Symposium (PCS) 2021

  16. arXiv:2102.06531  [pdf, ps, other

    eess.IV cs.LG

    Editorial: Introduction to the Issue on Deep Learning for Image/Video Restoration and Compression

    Authors: A. Murat Tekalp, Michele Covell, Radu Timofte, Chao Dong

    Abstract: Recent works have shown that learned models can achieve significant performance gains, especially in terms of perceptual quality measures, over traditional methods. Hence, the state of the art in image restoration and compression is getting redefined. This special issue covers the state of the art in learned image/video restoration and compression to promote further progress in innovative architec… ▽ More

    Submitted 9 February, 2021; originally announced February 2021.

    Journal ref: IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, vol. 15, no. 2, FEBRUARY 2021

  17. Effect of Architectures and Training Methods on the Performance of Learned Video Frame Prediction

    Authors: M. Akin Yilmaz, A. Murat Tekalp

    Abstract: We analyze the performance of feedforward vs. recurrent neural network (RNN) architectures and associated training methods for learned frame prediction. To this effect, we trained a residual fully convolutional neural network (FCNN), a convolutional RNN (CRNN), and a convolutional long short-term memory (CLSTM) network for next frame prediction using the mean square loss. We performed both statele… ▽ More

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: Accepted for publication at IEEE ICIP 2019

  18. End-to-End Rate-Distortion Optimization for Bi-Directional Learned Video Compression

    Authors: M. Akin Yilmaz, A. Murat Tekalp

    Abstract: Conventional video compression methods employ a linear transform and block motion model, and the steps of motion estimation, mode and quantization parameter selection, and entropy coding are optimized individually due to combinatorial nature of the end-to-end optimization problem. Learned video compression allows end-to-end rate-distortion optimized training of all nonlinear modules, quantization… ▽ More

    Submitted 26 May, 2021; v1 submitted 11 August, 2020; originally announced August 2020.

    Comments: This work is accepted for publication in IEEE ICIP 2020

  19. arXiv:2007.08922  [pdf, other

    eess.IV cs.CV cs.LG

    Can Learned Frame-Prediction Compete with Block-Motion Compensation for Video Coding?

    Authors: Serkan Sulun, A. Murat Tekalp

    Abstract: Given recent advances in learned video prediction, we investigate whether a simple video codec using a pre-trained deep model for next frame prediction based on previously encoded/decoded frames without sending any motion side information can compete with standard video codecs based on block-motion compensation. Frame differences given learned frame predictions are encoded by a standard still-imag… ▽ More

    Submitted 17 July, 2020; originally announced July 2020.

    Comments: Accepted for publication in Springer Journal of Signal, Image and Video Processing