Jump to content

Peak signal-to-noise ratio: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
No edit summary
 
(40 intermediate revisions by 30 users not shown)
Line 1: Line 1:
{{Short description|Metric used to measure signal quality}}
'''Peak signal-to-noise ratio''', often abbreviated '''PSNR''', is an engineering term for the ratio between the maximum possible power of a [[Signal (information theory)|signal]] and the power of corrupting [[noise]] that affects the fidelity of its representation. Because many signals have a very wide [[dynamic range]], PSNR is usually expressed in terms of the [[logarithm]]ic [[decibel]] scale.
{{Broader|Signal-to-noise ratio}}


'''Peak signal-to-noise ratio''' ('''PSNR''') is an engineering term for the ratio between the maximum possible power of a [[Signal (information theory)|signal]] and the power of corrupting [[noise]] that affects the fidelity of its representation. Because many signals have a very wide [[dynamic range]], PSNR is usually expressed as a [[logarithm]]ic quantity using the [[decibel]] scale.
PSNR is most commonly used to measure the quality of reconstruction of lossy compression [[codec]]s (e.g., for [[image compression]]). The signal in this case is the original data, and the noise is the error introduced by compression. When comparing compression codecs, PSNR is an ''approximation'' to human perception of reconstruction quality. Although a higher PSNR generally indicates that the reconstruction is of higher quality, in some cases it may not. One has to be extremely careful with the range of validity of this metric; it is only conclusively valid when it is used to compare results from the same codec (or codec type) and same content.<ref>{{Cite journal | last1 = Huynh-Thu | first1 = Q. | last2 = Ghanbari | first2 = M. | doi = 10.1049/el:20080522 | title = Scope of validity of PSNR in image/video quality assessment | journal = Electronics Letters | volume = 44 | issue = 13 | pages = 800 | year = 2008 | pmid = | pmc = }}</ref><ref>[http://web.mit.edu/xiphmont/Public/theora/demo7.html MIT.edu]</ref>


PSNR is commonly used to quantify reconstruction quality for images and video subject to [[lossy compression]].
PSNR is most easily defined via the [[mean squared error]] (''MSE''). Given a noise-free ''m''×''n'' monochrome image ''I'' and its noisy approximation ''K'', ''MSE'' is defined as:
:<math>\mathit{MSE} = \frac{1}{m\,n}\sum_{i=0}^{m-1}\sum_{j=0}^{n-1} [I(i,j) - K(i,j)]^2</math>


== Definition ==
The PSNR (in dB) is defined as:
PSNR is most easily defined via the [[mean squared error]] (''MSE''). Given a noise-free ''m''×''n'' monochrome image ''I'' and its noisy approximation ''K'', ''MSE'' is defined as
:<math>\begin{align}\mathit{PSNR} &= 10 \cdot \log_{10} \left( \frac{\mathit{MAX}_I^2}{\mathit{MSE}} \right)\\
: <math>\mathit{MSE} = \frac{1}{m\,n}\sum_{i=0}^{m-1}\sum_{j=0}^{n-1} [I(i,j) - K(i,j)]^2.</math>
&= 20 \cdot \log_{10} \left( \frac{\mathit{MAX}_I}{\sqrt{\mathit{MSE}}} \right)\\
&= 20 \cdot \log_{10} \left( {\mathit{MAX}_I} \right) - 10 \cdot \log_{10} \left( {{\mathit{MSE}}} \right)\end{align}</math>


The PSNR (in [[decibel|dB]]) is defined as
Here, ''MAX<sub>I</sub>'' is the maximum possible pixel value of the image. When the pixels are represented using 8 bits per sample, this is 255. More generally, when samples are represented using linear [[Pulse-code modulation|PCM]] with ''B'' bits per sample, ''MAX<sub>I</sub>'' is 2<sup><var>B</var></sup>−1. For [[color image]]s with three [[RGB]] values per pixel, the definition of PSNR is the same except the MSE is the sum over all squared value differences divided by image size and by three. Alternately, for color images the image is converted to a different [[color space]] and PSNR is reported against each channel of that color space, e.g., [[YCbCr]] or [[HSL and HSV|HSL]].<ref>{{cite web|last=Oriani|first=Emanuele|title=qpsnr: A quick PSNR/SSIM analyzer for Linux|url=http://qpsnr.youlink.org/|accessdate=6 April 2011}}</ref><ref>{{cite web|title=pnmpsnr User Manual|url=http://netpbm.sourceforge.net/doc/pnmpsnr.html|accessdate=6 April 2011}}</ref>
: <math>\begin{align}
\mathit{PSNR} &= 10 \cdot \log_{10} \left( \frac{\mathit{MAX}_I^2}{\mathit{MSE}} \right) \\
&= 20 \cdot \log_{10} \left( \frac{\mathit{MAX}_I}{\sqrt{\mathit{MSE}}} \right) \\
&= 20 \cdot \log_{10}(\mathit{MAX}_I) - 10 \cdot \log_{10} (\mathit{MSE}).
\end{align}</math>


Here, ''MAX<sub>I</sub>'' is the maximum pixel value of the original image.
Typical values for the PSNR in [[lossy compression|lossy]] image and video compression are between 30 and 50&nbsp;[[decibel|dB]], provided the bit depth is 8&nbsp;[[bit]]s, where higher is better. For 16-bit data typical values for the PSNR are between 60 and 80&nbsp;[[decibel|dB]].<ref>{{cite book|last=Welstead|first=Stephen T.|title=Fractal and wavelet image compression techniques|year=1999|publisher=SPIE Publication|isbn=978-0-8194-3503-3|pages=155–156|url=https://books.google.com/books?id=evGyv-mogukC&lpg=PA155&dq=image%20compression%20acceptable%20PSNR&pg=PA156#v=onepage&q=image%20compression%20acceptable%20PSNR&f=false}}</ref><ref>{{cite journal|author=Raouf Hamzaoui, Dietmar Saupe|title=Fractal Image Compression|journal=Document and image compression|date=May 2006|volume=968|pages=168–169|url=https://books.google.com/books?id=FmqPOsEYRsEC&lpg=PA229&dq=image%20compression%20acceptable%20PSNR&pg=PA168#v=onepage&q=PSNR&f=false|accessdate=5 April 2011|editor1-first=Mauro|editor1-last=Barni|publisher=CRC Press|isbn=9780849335563}}</ref> Acceptable values for wireless transmission quality loss are considered to be about 20&nbsp;dB to 25&nbsp;dB.<ref>Thomos, N., Boulgouris, N. V., & Strintzis, M. G. (2006, January). Optimized Transmission of JPEG2000 Streams Over Wireless Channels. IEEE Transactions on Image Processing , 15 (1).</ref><ref>Xiangjun, L., & Jianfei, C. Robust transmission of JPEG2000 encoded images over packet loss channels. ICME 2007 (pp. 947-950). School of Computer Engineering, [[Nanyang Technological University]].</ref>


=== Application in color images ===
In the absence of noise, the two images ''I'' and ''K'' are identical, and thus the MSE is zero. In this case the PSNR is infinite (or undefined, see [[Division by zero]]).<ref>{{cite book|last=Salomon|first=David|title=Data Compression: The Complete Reference|year=2007|publisher=Springer|isbn=978-1846286025|url=https://books.google.com/books?id=ujnQogzx_2EC&lpg=PA281&ots=FolwqB8qsN&dq=PSNR%20infinite&pg=PA281#v=onepage&q=PSNR%20infinite&f=false|edition=4|accessdate=26 July 2012|page=281}}</ref>
For [[color image]]s with three [[RGB]] values per pixel, the definition of PSNR is the same except that the MSE is the sum over all squared value differences (now for each color, i.e. three times as many differences as in a monochrome image) divided by image size and by three. Alternately, for color images the image is converted to a different [[color space]] and PSNR is reported against each channel of that color space, e.g., [[YCbCr]] or [[HSL and HSV|HSL]].<ref>{{cite web|last=Oriani|first=Emanuele|title=qpsnr: A quick PSNR/SSIM analyzer for Linux|url=http://qpsnr.youlink.org/|access-date=6 April 2011}}</ref><ref>{{cite web|title=pnmpsnr User Manual|url=http://netpbm.sourceforge.net/doc/pnmpsnr.html|access-date=6 April 2011}}</ref>

== Quality estimation with PSNR ==
PSNR is most commonly used to measure the quality of reconstruction of lossy compression [[codec]]s (e.g., for [[image compression]]). The signal in this case is the original data, and the noise is the error introduced by compression. When comparing compression codecs, PSNR is an ''approximation'' to human perception of reconstruction quality.

Typical values for the PSNR in [[lossy compression|lossy]] image and video compression are between 30 and 50&nbsp;dB, provided the bit depth is 8&nbsp;[[bit]]s, where higher is better. The processing quality of 12-bit images is considered high when the PSNR value is 60&nbsp;dB or higher.<ref>{{Cite journal|last1=Faragallah|first1=Osama S.|last2=El-Hoseny|first2=Heba|last3=El-Shafai|first3=Walid|last4=El-Rahman|first4=Wael Abd|last5=El-Sayed|first5=Hala S.|last6=El-Rabaie|first6=El-Sayed M.|last7=El-Samie|first7=Fathi E. Abd|last8=Geweid|first8=Gamal G. N.|date=2021|title=A Comprehensive Survey Analysis for Present Solutions of Medical Image Fusion and Future Directions|journal=IEEE Access|volume=9|pages=11358–11371|doi=10.1109/ACCESS.2020.3048315|issn=2169-3536|quote=This paper presents a survey study of medical imaging modalities and their characteristics. In addition, different medical image fusion approaches and their appropriate quality metrics are presented.|doi-access=free|bibcode=2021IEEEA...911358F }}</ref><ref>{{Cite journal|last1=Chervyakov|first1=Nikolay|last2=Lyakhov|first2=Pavel|last3=Nagornov|first3=Nikolay|date=2020-02-11|title=Analysis of the Quantization Noise in Discrete Wavelet Transform Filters for 3D Medical Imaging|journal=Applied Sciences|language=en|volume=10|issue=4|pages=1223|doi=10.3390/app10041223|issn=2076-3417|quote=The image processing quality is considered high if PSNR value is greater than 60 dB for images with 12 bits per color.|doi-access=free}}</ref> For 16-bit data typical values for the PSNR are between 60 and 80&nbsp;dB.<ref>{{cite book|last=Welstead|first=Stephen T.|title=Fractal and wavelet image compression techniques|year=1999|publisher=SPIE Publication|isbn=978-0-8194-3503-3|pages=155–156|url=https://books.google.com/books?id=evGyv-mogukC&q=image%20compression%20acceptable%20PSNR&pg=PA156}}</ref><ref>{{cite book|author=Raouf Hamzaoui, Dietmar Saupe|title=Fractal Image Compression|journal=Document and Image Compression|date=May 2006|volume=968|pages=168–169|url=https://books.google.com/books?id=FmqPOsEYRsEC&q=image%20compression%20acceptable%20PSNR&pg=PA168|access-date=5 April 2011|editor1-first=Mauro|editor1-last=Barni|publisher=CRC Press|isbn=9780849335563}}</ref> Acceptable values for wireless transmission quality loss are considered to be about 20&nbsp;dB to 25&nbsp;dB.<ref>Thomos, N., Boulgouris, N. V., & Strintzis, M. G. (2006, January). Optimized Transmission of JPEG2000 Streams Over Wireless Channels. IEEE Transactions on Image Processing, 15 (1).</ref><ref>Xiangjun, L., & Jianfei, C. Robust transmission of JPEG2000 encoded images over packet loss channels. ICME 2007 (pp. 947-950). School of Computer Engineering, [[Nanyang Technological University]].</ref>

In the absence of noise, the two images ''I'' and ''K'' are identical, and thus the MSE is zero. In this case the PSNR is infinite (or undefined, see [[Division by zero]]).<ref>{{cite book|last=Salomon|first=David|title=Data Compression: The Complete Reference|year=2007|publisher=Springer|isbn=978-1846286025|url=https://books.google.com/books?id=ujnQogzx_2EC&q=PSNR%20infinite&pg=PA281|edition=4|access-date=26 July 2012|page=281}}</ref>


{{multiple image
{{multiple image
Line 31: Line 43:
}}
}}


== Performance comparison ==
The metric PSNR-HVS-M that is additionally based on criteria of contrast perception and masking delivered the best approximation of human visual quality judgement as of 2007 according to an evaluation by the developers, beating PSNR, [[UQI]], and [[structural similarity|SSIM]] by large margin and even having a distinct advantage over [[DCTune]] and PSNR-HVS.<ref name="PSNR-HVS-M">{{citation/core|Surname1=Nikolay Ponomarenko|Surname2= Flavia Silvestri|Surname3= Karen Egiazarian|Surname4= Marco Carli|Surname5= Jaakko Astola|Surname6= Vladimir Lukin|Periodical=CD-ROM Proceedings of the Third International Workshop on Video Processing and Quality Metrics for Consumer Electronics VPQM-07, 25.–26. Januar 2007|Title=On between-coefficient contrast masking of DCT basis functions |PublicationPlace=Scottsdale AZ|language=German|URL=http://ponomarenko.info/vpqm07_p.pdf}}</ref>
Although a higher PSNR generally correlates with a higher quality reconstruction, in many cases it may not. One has to be extremely careful with the range of validity of this metric; it is only conclusively valid when it is used to compare results from the same codec (or codec type) and same content.<ref name=":0">{{Cite journal|last1=Huynh-Thu|first1=Q.|last2=Ghanbari|first2=M.|year=2008|title=Scope of validity of PSNR in image/video quality assessment|journal=Electronics Letters|volume=44|issue=13|pages=800|doi=10.1049/el:20080522|bibcode=2008ElL....44..800H}}</ref>
==Matlab code==

[https://in.mathworks.com/matlabcentral/fileexchange/60833-psnr-and-mse-finding Psnr mse matlab code]
Generally, when it comes to estimating the [[Image quality|quality of images]] and [[Video quality|videos]] as perceived by humans, PSNR has been shown to perform very poorly compared to other quality metrics.<ref name=":0" /><ref>{{Cite journal|last1=Huynh-Thu|first1=Quan|last2=Ghanbari|first2=Mohammed|date=2012-01-01|title=The accuracy of PSNR in predicting video quality for different video scenes and frame rates|journal=Telecommunication Systems|language=en|volume=49|issue=1|pages=35–48|doi=10.1007/s11235-010-9351-x|s2cid=43713764|issn=1018-4864}}</ref>

== Variants ==
PSNR-HVS<ref>Egiazarian, Karen, Jaakko Astola, Nikolay Ponomarenko, Vladimir Lukin, Federica Battisti, and Marco Carli (2006). "New full-reference quality metrics based on HVS." In Proceedings of the Second International Workshop on Video Processing and Quality Metrics, vol. 4.</ref> is an extension of PSNR that incorporates properties of the human visual system such as [[Contrast (vision)|contrast perception]].

PSNR-HVS-M improves on PSNR-HVS by additionally taking into account [[visual masking]].<ref>{{Cite journal|last1=Ponomarenko|first1=N.|last2=Ieremeiev|first2=O.|last3=Lukin|first3=V.|last4=Egiazarian|first4=K.|last5=Carli|first5=M.|date=February 2011|title=Modified image visual quality metrics for contrast change and mean shift accounting|url=https://ieeexplore.ieee.org/document/5744476|journal=2011 11th International Conference the Experience of Designing and Application of CAD Systems in Microelectronics (CADSM)|pages=305–311}}</ref> In a 2007 study, it delivered better approximations of human visual quality judgements than PSNR and [[structural similarity index measure|SSIM]] by large margin. It was also shown to have a distinct advantage over [[DCTune]] and PSNR-HVS.<ref name="PSNR-HVS-M">{{citation|surname1=Nikolay Ponomarenko|surname2= Flavia Silvestri|surname3= Karen Egiazarian|surname4= Marco Carli|surname5= Jaakko Astola|surname6= Vladimir Lukin|periodical=CD-ROM Proceedings of the Third International Workshop on Video Processing and Quality Metrics for Consumer Electronics VPQM-07, 25.–26. Januar 2007|title=On between-coefficient contrast masking of DCT basis functions |location=Scottsdale AZ|language=de|url=http://ponomarenko.info/vpqm07_p.pdf}}</ref>

==See also==
==See also==
* [[Czenakowski distance]]
* [[Data compression ratio]]
* [[Data compression ratio]]
* [[PEVQ|Perceptual Evaluation of Video Quality]] (PEVQ)
* [[Perceptual Evaluation of Video Quality]] (PEVQ)
* [[Structural similarity index measure]] (SSIM)
* [[Signal-to-noise ratio]]
* [[Structural similarity]] (SSIM) index
* [[Subjective video quality]]
* [[Subjective video quality]]
* [[Video Multimethod Assessment Fusion]]
* [[Video quality]]
* [[Video quality]]


Line 46: Line 66:


{{Noise}}
{{Noise}}
{{Machine learning evaluation metrics}}


[[Category:Image compression]]
[[Category:Image compression]]
Line 52: Line 73:
[[Category:Digital television]]
[[Category:Digital television]]
[[Category:Engineering ratios]]
[[Category:Engineering ratios]]

[[vi:PSNR]]

Latest revision as of 14:50, 2 July 2024

Peak signal-to-noise ratio (PSNR) is an engineering term for the ratio between the maximum possible power of a signal and the power of corrupting noise that affects the fidelity of its representation. Because many signals have a very wide dynamic range, PSNR is usually expressed as a logarithmic quantity using the decibel scale.

PSNR is commonly used to quantify reconstruction quality for images and video subject to lossy compression.

Definition

[edit]

PSNR is most easily defined via the mean squared error (MSE). Given a noise-free m×n monochrome image I and its noisy approximation K, MSE is defined as

The PSNR (in dB) is defined as

Here, MAXI is the maximum pixel value of the original image.

Application in color images

[edit]

For color images with three RGB values per pixel, the definition of PSNR is the same except that the MSE is the sum over all squared value differences (now for each color, i.e. three times as many differences as in a monochrome image) divided by image size and by three. Alternately, for color images the image is converted to a different color space and PSNR is reported against each channel of that color space, e.g., YCbCr or HSL.[1][2]

Quality estimation with PSNR

[edit]

PSNR is most commonly used to measure the quality of reconstruction of lossy compression codecs (e.g., for image compression). The signal in this case is the original data, and the noise is the error introduced by compression. When comparing compression codecs, PSNR is an approximation to human perception of reconstruction quality.

Typical values for the PSNR in lossy image and video compression are between 30 and 50 dB, provided the bit depth is 8 bits, where higher is better. The processing quality of 12-bit images is considered high when the PSNR value is 60 dB or higher.[3][4] For 16-bit data typical values for the PSNR are between 60 and 80 dB.[5][6] Acceptable values for wireless transmission quality loss are considered to be about 20 dB to 25 dB.[7][8]

In the absence of noise, the two images I and K are identical, and thus the MSE is zero. In this case the PSNR is infinite (or undefined, see Division by zero).[9]

Original uncompressed image
Q=90, PSNR 45.53dB
Q=30, PSNR 36.81dB
Q=10, PSNR 31.45dB
Example luma PSNR values for a cjpeg compressed image at various quality levels.

Performance comparison

[edit]

Although a higher PSNR generally correlates with a higher quality reconstruction, in many cases it may not. One has to be extremely careful with the range of validity of this metric; it is only conclusively valid when it is used to compare results from the same codec (or codec type) and same content.[10]

Generally, when it comes to estimating the quality of images and videos as perceived by humans, PSNR has been shown to perform very poorly compared to other quality metrics.[10][11]

Variants

[edit]

PSNR-HVS[12] is an extension of PSNR that incorporates properties of the human visual system such as contrast perception.

PSNR-HVS-M improves on PSNR-HVS by additionally taking into account visual masking.[13] In a 2007 study, it delivered better approximations of human visual quality judgements than PSNR and SSIM by large margin. It was also shown to have a distinct advantage over DCTune and PSNR-HVS.[14]

See also

[edit]

References

[edit]
  1. ^ Oriani, Emanuele. "qpsnr: A quick PSNR/SSIM analyzer for Linux". Retrieved 6 April 2011.
  2. ^ "pnmpsnr User Manual". Retrieved 6 April 2011.
  3. ^ Faragallah, Osama S.; El-Hoseny, Heba; El-Shafai, Walid; El-Rahman, Wael Abd; El-Sayed, Hala S.; El-Rabaie, El-Sayed M.; El-Samie, Fathi E. Abd; Geweid, Gamal G. N. (2021). "A Comprehensive Survey Analysis for Present Solutions of Medical Image Fusion and Future Directions". IEEE Access. 9: 11358–11371. Bibcode:2021IEEEA...911358F. doi:10.1109/ACCESS.2020.3048315. ISSN 2169-3536. This paper presents a survey study of medical imaging modalities and their characteristics. In addition, different medical image fusion approaches and their appropriate quality metrics are presented.
  4. ^ Chervyakov, Nikolay; Lyakhov, Pavel; Nagornov, Nikolay (2020-02-11). "Analysis of the Quantization Noise in Discrete Wavelet Transform Filters for 3D Medical Imaging". Applied Sciences. 10 (4): 1223. doi:10.3390/app10041223. ISSN 2076-3417. The image processing quality is considered high if PSNR value is greater than 60 dB for images with 12 bits per color.
  5. ^ Welstead, Stephen T. (1999). Fractal and wavelet image compression techniques. SPIE Publication. pp. 155–156. ISBN 978-0-8194-3503-3.
  6. ^ Raouf Hamzaoui, Dietmar Saupe (May 2006). Barni, Mauro (ed.). Fractal Image Compression. Vol. 968. CRC Press. pp. 168–169. ISBN 9780849335563. Retrieved 5 April 2011. {{cite book}}: |journal= ignored (help)
  7. ^ Thomos, N., Boulgouris, N. V., & Strintzis, M. G. (2006, January). Optimized Transmission of JPEG2000 Streams Over Wireless Channels. IEEE Transactions on Image Processing, 15 (1).
  8. ^ Xiangjun, L., & Jianfei, C. Robust transmission of JPEG2000 encoded images over packet loss channels. ICME 2007 (pp. 947-950). School of Computer Engineering, Nanyang Technological University.
  9. ^ Salomon, David (2007). Data Compression: The Complete Reference (4 ed.). Springer. p. 281. ISBN 978-1846286025. Retrieved 26 July 2012.
  10. ^ a b Huynh-Thu, Q.; Ghanbari, M. (2008). "Scope of validity of PSNR in image/video quality assessment". Electronics Letters. 44 (13): 800. Bibcode:2008ElL....44..800H. doi:10.1049/el:20080522.
  11. ^ Huynh-Thu, Quan; Ghanbari, Mohammed (2012-01-01). "The accuracy of PSNR in predicting video quality for different video scenes and frame rates". Telecommunication Systems. 49 (1): 35–48. doi:10.1007/s11235-010-9351-x. ISSN 1018-4864. S2CID 43713764.
  12. ^ Egiazarian, Karen, Jaakko Astola, Nikolay Ponomarenko, Vladimir Lukin, Federica Battisti, and Marco Carli (2006). "New full-reference quality metrics based on HVS." In Proceedings of the Second International Workshop on Video Processing and Quality Metrics, vol. 4.
  13. ^ Ponomarenko, N.; Ieremeiev, O.; Lukin, V.; Egiazarian, K.; Carli, M. (February 2011). "Modified image visual quality metrics for contrast change and mean shift accounting". 2011 11th International Conference the Experience of Designing and Application of CAD Systems in Microelectronics (CADSM): 305–311.
  14. ^ Nikolay Ponomarenko; Flavia Silvestri; Karen Egiazarian; Marco Carli; Jaakko Astola; Vladimir Lukin, "On between-coefficient contrast masking of DCT basis functions" (PDF), CD-ROM Proceedings of the Third International Workshop on Video Processing and Quality Metrics for Consumer Electronics VPQM-07, 25.–26. Januar 2007 (in German), Scottsdale AZ