Zum Hauptinhalt springen

Showing 1–7 of 7 results for author: Akman, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2303.02025  [pdf, other

    eess.IV cs.CV

    MAEVI: Motion Aware Event-Based Video Frame Interpolation

    Authors: Ahmet Akman, Onur Selim Kılıç, A. Aydın Alatan

    Abstract: Utilization of event-based cameras is expected to improve the visual quality of video frame interpolation solutions. We introduce a learning-based method to exploit moving region boundaries in a video sequence to increase the overall interpolation quality.Event cameras allow us to determine moving areas precisely; and hence, better video frame interpolation quality can be achieved by emphasizing t… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: Submitted to International Conference on Image Processing (ICIP) 2023

  2. arXiv:2209.09359  [pdf, other

    cs.CV

    E-VFIA : Event-Based Video Frame Interpolation with Attention

    Authors: Onur Selim Kılıç, Ahmet Akman, A. Aydın Alatan

    Abstract: Video frame interpolation (VFI) is a fundamental vision task that aims to synthesize several frames between two consecutive original video images. Most algorithms aim to accomplish VFI by using only keyframes, which is an ill-posed problem since the keyframes usually do not yield any accurate precision about the trajectories of the objects in the scene. On the other hand, event-based cameras provi… ▽ More

    Submitted 1 March, 2023; v1 submitted 19 September, 2022; originally announced September 2022.

    Comments: Accepted to 2023 IEEE International Conference on Robotics and Automation (ICRA 2023)

  3. arXiv:2206.12879  [pdf, ps, other

    cs.CL cs.LG cs.SD eess.AS

    Data Augmentation for Dementia Detection in Spoken Language

    Authors: Anna Hlédiková, Dominika Woszczyk, Alican Akman, Soteris Demetriou, Björn Schuller

    Abstract: Dementia is a growing problem as our society ages, and detection methods are often invasive and expensive. Recent deep-learning techniques can offer a faster diagnosis and have shown promising results. However, they require large amounts of labelled data which is not easily available for the task of dementia detection. One effective solution to sparse data problems is data augmentation, though the… ▽ More

    Submitted 16 July, 2022; v1 submitted 26 June, 2022; originally announced June 2022.

    Comments: Accepted to INTERSPEECH 2022

  4. arXiv:2203.06064  [pdf, other

    cs.SD cs.LG

    Climate Change & Computer Audition: A Call to Action and Overview on Audio Intelligence to Help Save the Planet

    Authors: Björn W. Schuller, Alican Akman, Yi Chang, Harry Coppock, Alexander Gebhard, Alexander Kathan, Esther Rituerto-González, Andreas Triantafyllopoulos, Florian B. Pokorny

    Abstract: Among the seventeen Sustainable Development Goals (SDGs) proposed within the 2030 Agenda and adopted by all the United Nations member states, the 13$^{th}$ SDG is a call for action to combat climate change for a better world. In this work, we provide an overview of areas in which audio intelligence -- a powerful but in this context so far hardly considered technology -- can contribute to overcome… ▽ More

    Submitted 10 March, 2022; originally announced March 2022.

  5. arXiv:2202.08981  [pdf, other

    cs.SD cs.LG eess.AS

    A Summary of the ComParE COVID-19 Challenges

    Authors: Harry Coppock, Alican Akman, Christian Bergler, Maurice Gerczuk, Chloë Brown, Jagmohan Chauhan, Andreas Grammenos, Apinan Hasthanasombat, Dimitris Spathis, Tong Xia, Pietro Cicuta, Jing Han, Shahin Amiriparian, Alice Baird, Lukas Stappen, Sandra Ottl, Panagiotis Tzirakis, Anton Batliner, Cecilia Mascolo, Björn W. Schuller

    Abstract: The COVID-19 pandemic has caused massive humanitarian and economic damage. Teams of scientists from a broad range of disciplines have searched for methods to help governments and communities combat the disease. One avenue from the machine learning field which has been explored is the prospect of a digital mass test which can detect COVID-19 from infected individuals' respiratory sounds. We present… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

    Comments: 18 pages, 13 figures

  6. arXiv:2107.14549  [pdf, other

    cs.SD cs.LG eess.AS

    Evaluating the COVID-19 Identification ResNet (CIdeR) on the INTERSPEECH COVID-19 from Audio Challenges

    Authors: Alican Akman, Harry Coppock, Alexander Gaskell, Panagiotis Tzirakis, Lyn Jones, Björn W. Schuller

    Abstract: We report on cross-running the recent COVID-19 Identification ResNet (CIdeR) on the two Interspeech 2021 COVID-19 diagnosis from cough and speech audio challenges: ComParE and DiCOVA. CIdeR is an end-to-end deep learning neural network originally designed to classify whether an individual is COVID-positive or COVID-negative based on coughing and breathing audio recordings from a published crowdsou… ▽ More

    Submitted 30 July, 2021; originally announced July 2021.

    Comments: 5 pages, 1 figure

  7. arXiv:1203.4597  [pdf, ps, other

    cs.LG stat.ML

    A Novel Training Algorithm for HMMs with Partial and Noisy Access to the States

    Authors: Huseyin Ozkan, Arda Akman, Suleyman S. Kozat

    Abstract: This paper proposes a new estimation algorithm for the parameters of an HMM as to best account for the observed data. In this model, in addition to the observation sequence, we have \emph{partial} and \emph{noisy} access to the hidden state sequence as side information. This access can be seen as "partial labeling" of the hidden states. Furthermore, we model possible mislabeling in the side inform… ▽ More

    Submitted 20 March, 2012; originally announced March 2012.

    Comments: Submitted to Digital Signal Processing, Elsevier