Jump to content

Imprecise probability: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Citation bot (talk | contribs)
Add: s2cid. | Use this bot. Report bugs. | Suggested by BrownHairedGirl | Linked from User:BrownHairedGirl/Articles_with_bare_links | #UCB_webform_linked 948/2197
Changing short description from "Framework utilizing probability theory to provide less subjective basis for decision-making under uncertainty" to "Probability theory for low quality data"
 
(14 intermediate revisions by 8 users not shown)
Line 1: Line 1:
{{Short description|Probability theory for low quality data}}
'''Imprecise probability''' generalizes [[probability theory]] to allow for partial probability specifications, and is applicable when information is scarce, vague, or conflicting, in which case a unique [[probability distribution]] may be hard to identify. Thereby, the theory aims to represent the available knowledge more accurately. Imprecision is useful for dealing with [[expert elicitation]], because:
'''Imprecise probability''' generalizes [[probability theory]] to allow for partial probability specifications, and is applicable when information is scarce, vague, or conflicting, in which case a unique [[probability distribution]] may be hard to identify. Thereby, the theory aims to represent the available knowledge more accurately. Imprecision is useful for dealing with [[expert elicitation]], because:
* People have a limited ability to determine their own subjective probabilities and might find that they can only provide an interval.
* People have a limited ability to determine their own subjective probabilities and might find that they can only provide an interval.
Line 13: Line 14:
| location = New York
| location = New York
| year = 1950
| year = 1950
}}</ref> [[Pierre-Simon Laplace|Laplace]], [[Bruno de Finetti|de Finetti]],<ref name="DEFINETTI1974">{{cite book | last= de Finetti| first= Bruno| year= 1974| title=Theory of Probability|location=New York|publisher=Wiley}}</ref> [[Frank P. Ramsey|Ramsey]], [[Cox's theorem|Cox]], [[Dennis Lindley|Lindley]], and many others. However, this has not been unanimously accepted by scientists, statisticians, and probabilists: it has been argued that some modification or broadening of probability theory is required, because one may not always be able to provide a probability for every event, particularly when only little information or data is available&mdash;an early example of such criticism is [[George Boole|Boole]]'s critique<ref name="BOOLE1854">{{cite book
}}</ref> [[Pierre-Simon Laplace|Laplace]], [[Bruno de Finetti|de Finetti]],<ref name="DEFINETTI1974">{{cite book | last= de Finetti| first= Bruno| year= 1974| title=Theory of Probability|location=New York|publisher=Wiley}}</ref> [[Frank P. Ramsey|Ramsey]], [[Richard Threlkeld Cox|Cox]], [[Dennis Lindley|Lindley]], and many others. However, this has not been unanimously accepted by scientists, statisticians, and probabilists: it has been argued that some modification or broadening of probability theory is required, because one may not always be able to provide a probability for every event, particularly when only little information or data is available&mdash;an early example of such criticism is [[George Boole|Boole]]'s critique<ref name="BOOLE1854">{{cite book
| last = Boole
| last = Boole
| first = George
| first = George
Line 30: Line 31:
| first = Cedric A. B.
| first = Cedric A. B.
| title = Consistency in statistical inference and decision
| title = Consistency in statistical inference and decision
| journal = Journal of the Royal Statistical Society
| journal = [[Journal of the Royal Statistical Society]]
| year = 1961
| year = 1961
}}</ref><ref name="WILLIAMS1970">{{Cite conference
}}</ref><ref name="WILLIAMS1970">{{Cite conference
Line 45: Line 46:
| first = Peter M.
| first = Peter M.
| title = Notes on conditional previsions
| title = Notes on conditional previsions
| journal = International Journal of Approximate Reasoning
| journal = [[International Journal of Approximate Reasoning]]
| year = 2007
| year = 2007
| doi = 10.1016/j.ijar.2006.07.019
| doi = 10.1016/j.ijar.2006.07.019
Line 92: Line 93:


== History ==
== History ==
The idea to use imprecise probability has a long history. The first formal treatment dates back at least to the middle of the nineteenth century, by [[George Boole]],<ref name="BOOLE1854"/> who aimed to reconcile the theories of logic and probability. In the 1920s, in ''[[A Treatise on Probability]]'', [[John Maynard Keynes|Keynes]]<ref name="KEYNES1921">{{cite book
The idea to use imprecise probability has a long history. The first formal treatment dates back at least to the middle of the nineteenth century, by [[George Boole]],<ref name="BOOLE1854"/> who aimed to reconcile the theories of [[mathematical logic|logic]] and probability. In the 1920s, in ''[[A Treatise on Probability]]'', [[John Maynard Keynes|Keynes]]<ref name="KEYNES1921">{{cite book
| last = Keynes
| last = Keynes
| first = John Maynard
| first = John Maynard
Line 101: Line 102:
| url = https://archive.org/details/treatiseonprobab007528mbp
| url = https://archive.org/details/treatiseonprobab007528mbp
}}</ref> formulated and applied an explicit interval estimate approach to probability.
}}</ref> formulated and applied an explicit interval estimate approach to probability.
Work on imprecise probability models proceeded fitfully throughout the 20th century, with important contributions by [[Bernard Koopman]], [[Cedric Smith (statistician)|C.A.B. Smith]], [[I.J. Good]], [[Arthur P. Dempster|Arthur Dempster]], [[Glenn Shafer]], P.M Williams, [[Henry Kyburg]], [[Isaac Levi]] and [[Teddy Seidenfeld]].<ref>https://plato.stanford.edu/entries/imprecise-probabilities/supplement-historical.html</ref>
Work on imprecise probability models proceeded fitfully throughout the 20th century, with important contributions by [[Bernard Koopman]], [[Cedric Smith (statistician)|C.A.B. Smith]], [[I.J. Good]], [[Arthur P. Dempster|Arthur Dempster]], [[Glenn Shafer]], [[Peter M. Williams]], [[Henry Kyburg]], [[Isaac Levi]], and [[Teddy Seidenfeld]].<ref>{{Cite web|url=https://plato.stanford.edu/entries/imprecise-probabilities/supplement-historical.html|title = Imprecise Probabilities > Historical appendix: Theories of imprecise belief (Stanford Encyclopedia of Philosophy)}}</ref>
At the start of the 90's, the field started to gather some momentum, with the publication of Peter Walley's foundational book "Statistical Reasoning with Imprecise Probabilities"<ref name="WALLEY1991"/>
At the start of the 1990s, the field started to gather some momentum, with the publication of [[Peter Walley]]'s book ''Statistical Reasoning with Imprecise Probabilities''<ref name="WALLEY1991"/>
(which is also where the term "imprecise probability" originates).
(which is also where the term "imprecise probability" originates).
The 1990s also saw important works by Kuznetsov,<ref name="KUZNETSOV1991">{{Cite book
The 1990s also saw important works by Kuznetsov,<ref name="KUZNETSOV1991">{{Cite book
Line 121: Line 122:
| location = New York
| location = New York
| year = 2000
| year = 2000
}}</ref> and [[non-parametric statistics]].<ref>{{Cite journal | doi = 10.1016/j.jspi.2003.07.003 | title = Nonparametric predictive inference and interval probability | year = 2004 | last1 = Augustin | first1 = T. | last2 = Coolen | first2 = F. P. A.| journal = Journal of Statistical Planning and Inference | volume = 124 | issue = 2 | pages = 251–272| url = https://epub.ub.uni-muenchen.de/1637/1/paper_257.pdf }}</ref> Included are also concepts based on [[Choquet integral|Choquet integration]],<ref>{{Cite journal | doi = 10.1016/j.jmaa.2008.05.071 | title = n-Monotone exact functionals | year = 2008 | last1 = de Cooman | first1 = G. | last2 = Troffaes | first2 = M. C. M.| last3 = Miranda | first3 = E. | journal = Journal of Mathematical Analysis and Applications | volume = 347 | issue = 1 | pages = 143–156|bibcode = 2008JMAA..347..143D |arxiv = 0801.1962 | s2cid = 6561656 }}</ref> and so-called two-monotone and totally monotone [[capacity (statistics)|capacities]],<ref name="HUBER1973">{{Cite journal
}}</ref> and [[non-parametric statistics]].<ref>{{Cite journal | doi = 10.1016/j.jspi.2003.07.003 | title = Nonparametric predictive inference and interval probability | year = 2004 | last1 = Augustin | first1 = T. | last2 = Coolen | first2 = F. P. A.| journal = Journal of Statistical Planning and Inference | volume = 124 | issue = 2 | pages = 251–272| url = https://epub.ub.uni-muenchen.de/1637/1/paper_257.pdf }}</ref> Included are also concepts based on [[Choquet integral|Choquet integration]],<ref>{{Cite journal | doi = 10.1016/j.jmaa.2008.05.071 | title = n-Monotone exact functionals | year = 2008 | last1 = de Cooman | first1 = G. | last2 = Troffaes | first2 = M. C. M.| last3 = Miranda | first3 = E. | journal = [[Journal of Mathematical Analysis and Applications]] | volume = 347 | issue = 1 | pages = 143–156|bibcode = 2008JMAA..347..143D |arxiv = 0801.1962 | s2cid = 6561656 }}</ref> and so-called two-monotone and totally monotone [[capacity (statistics)|capacities]],<ref name="HUBER1973">{{Cite journal
| volume = 1
| volume = 1
| pages = 251–263
| pages = 251–263
Line 128: Line 129:
|author2=V. Strassen
|author2=V. Strassen
| title = Minimax tests and the Neyman-Pearson lemma for capacities
| title = Minimax tests and the Neyman-Pearson lemma for capacities
| journal = The Annals of Statistics
| journal = [[The Annals of Statistics]]
| year = 1973
| year = 1973
| doi = 10.1214/aos/1176342363
| doi = 10.1214/aos/1176342363
| issue = 2
| issue = 2
| doi-access = free
| doi-access = free
}}</ref> which have become very popular in [[artificial intelligence]] under the name [[Dempster–Shafer theory#Belief and plausibility|(Dempster-Shafer) belief functions]].<ref name="DEMPSTER1967">{{cite journal
}}</ref> which have become very popular in [[artificial intelligence]] under the name [[Dempster–Shafer theory#Belief and plausibility|(Dempster–Shafer) belief functions]].<ref name="DEMPSTER1967">{{cite journal
|title=Upper and lower probabilities induced by a multivalued mapping
|title=Upper and lower probabilities induced by a multivalued mapping
|journal=The Annals of Mathematical Statistics
|journal=The Annals of Mathematical Statistics
Line 150: Line 151:
| publisher = Princeton University Press
| publisher = Princeton University Press
| year = 1976
| year = 1976
| isbn = 978-0-691-08175-5
| url = https://archive.org/details/mathematicaltheo0000shaf
| url = https://archive.org/details/mathematicaltheo0000shaf
| url-access = registration
| url-access = registration
}}</ref> Moreover, there is a strong connection<ref name="DECOOMAN2008">{{Cite journal | doi = 10.1016/j.artint.2008.03.001 | title = Imprecise probability trees: Bridging two theories of imprecise probability | year = 2008 | last1 = de Cooman | first1 = G. | last2 = Hermans | first2 = F. | journal = Artificial Intelligence | volume = 172 | issue = 11 | pages = 1400–1427| arxiv = 0801.1196 | s2cid = 14060218 }}</ref> to [[Glenn Shafer|Shafer]] and Vovk's notion of [[game-theoretic probability]].<ref name="SHAFER2001">{{Cite book
}}</ref> Moreover, there is a strong connection<ref name="DECOOMAN2008">{{Cite journal | doi = 10.1016/j.artint.2008.03.001 | title = Imprecise probability trees: Bridging two theories of imprecise probability | year = 2008 | last1 = de Cooman | first1 = G. | last2 = Hermans | first2 = F. | journal = [[Artificial Intelligence (journal)|Artificial Intelligence]] | volume = 172 | issue = 11 | pages = 1400–1427| arxiv = 0801.1196 | s2cid = 14060218 }}</ref> to [[Glenn Shafer|Shafer]] and [[Vladimir Vovk|Vovk]]'s notion of [[game-theoretic probability]].<ref name="SHAFER2001">{{Cite book
| publisher = Wiley
| publisher = Wiley
| last = Shafer
| last = Shafer
Line 165: Line 167:
* [[credal set]]s, or sets of probability distributions
* [[credal set]]s, or sets of probability distributions
* [[previsions]]<ref name="DEFINETTI1974" />
* [[previsions]]<ref name="DEFINETTI1974" />
* Random set theory
* [[Random set theory]]
* Dempster-Shafer evidence theory
* [[Dempster–Shafer theory|Dempster–Shafer evidence theory]]
* [[Upper and lower probabilities|lower and upper probabilities, or interval probabilities]]<ref name="BOOLE1854" /><ref name="WEICHSELBERGER2000" /><ref name="KEYNES1921" />
* [[Upper and lower probabilities|lower and upper probabilities, or interval probabilities]]<ref name="BOOLE1854" /><ref name="WEICHSELBERGER2000" /><ref name="KEYNES1921" />
* [[belief functions]]<ref name="DEMPSTER1967" /><ref name="SHAFER1976" />
* [[belief functions]]<ref name="DEMPSTER1967" /><ref name="SHAFER1976" />
* [[Possibility theory|possibility and necessity measures]]<ref>{{cite journal|title=Fuzzy sets as a basis for a theory of possibility|journal=Fuzzy Sets and Systems|year=1978|first=L. A.|last=Zadeh|volume=1|pages=3–28 |doi=10.1016/0165-0114(78)90029-5 |hdl=10338.dmlcz/135193|hdl-access=free}}</ref><ref>{{cite book | last = Dubois | first = Didier |author2=Henri Prade | title = Théorie des possibilité | publisher = Masson | year = 1985 | location = Paris }}</ref><ref>{{cite book | last = Dubois | first = Didier |author2=Henri Prade | title = Possibility Theory - An Approach to Computerized Processing of Uncertainty | publisher = Plenum Press | year = 1988 | location = New York | url =https://archive.org/details/possibilitytheor0000dubo| url-access = registration }}</ref>
* [[Possibility theory|possibility and necessity measures]]<ref>{{cite journal|title=Fuzzy sets as a basis for a theory of possibility|journal=[[Fuzzy Sets and Systems]]|year=1978|first=L. A.|last=Zadeh|volume=1|pages=3–28 |doi=10.1016/0165-0114(78)90029-5 |hdl=10338.dmlcz/135193|hdl-access=free}}</ref><ref>{{cite book | last = Dubois | first = Didier |author2=Henri Prade | title = Théorie des possibilité | publisher = Masson | year = 1985 | location = Paris }}</ref><ref>{{cite book | last = Dubois | first = Didier |author2=Henri Prade | title = Possibility Theory - An Approach to Computerized Processing of Uncertainty | publisher = Plenum Press | year = 1988 | location = New York | isbn = 978-0-306-42520-2 | url =https://archive.org/details/possibilitytheor0000dubo| url-access = registration }}</ref>
* [[lower and upper previsions]]<ref name="WILLIAMS1970" /><ref name="WILLIAMS2007" /><ref name="WALLEY1991"/><ref name="TROFFAES2014">{{cite book | last1=Troffaes | first1=Matthias C. M. | last2=de Cooman | first2=Gert | date=2014 | title=Lower previsions | doi=10.1002/9781118762622 | publisher=Wiley | isbn=9780470723777}}</ref>
* [[lower and upper previsions]]<ref name="WILLIAMS1970" /><ref name="WILLIAMS2007" /><ref name="WALLEY1991"/><ref name="TROFFAES2014">{{cite book | last1=Troffaes | first1=Matthias C. M. | last2=de Cooman | first2=Gert | date=2014 | title=Lower previsions | doi=10.1002/9781118762622 | publisher=Wiley | isbn=978-0-470-72377-7}}</ref>
* comparative probability orderings<ref name="KEYNES1921"/><ref>{{cite journal|title=Sul significato soggettivo della probabilità|journal=Fundamenta Mathematicae|year=1931|first=Bruno|last=de Finetti|volume=17|pages=298–329 |doi=10.4064/fm-17-1-298-329|doi-access=free}}</ref><ref>{{cite book | last = Fine | first = Terrence L. | title = Theories of Probability | publisher = Academic Press | year = 1973 | location = New York | url =https://archive.org/details/theoriesofprobab0000fine| url-access = registration }}</ref><ref>{{cite journal|title=The axioms of subjective probability|journal=Statistical Science|year=1986|first=P. C.|last=Fishburn|volume=1|issue=3|pages=335–358 |doi=10.1214/ss/1177013611 |doi-access=free}}</ref>
* comparative probability orderings<ref name="KEYNES1921"/><ref>{{cite journal|title=Sul significato soggettivo della probabilità|journal=[[Fundamenta Mathematicae]]|year=1931|first=Bruno|last=de Finetti|volume=17|pages=298–329 |doi=10.4064/fm-17-1-298-329|doi-access=free}}</ref><ref>{{cite book | last = Fine | first = Terrence L. | title = Theories of Probability | publisher = Academic Press | year = 1973 | location = New York | isbn = 978-0-12-256450-5 | url =https://archive.org/details/theoriesofprobab0000fine| url-access = registration }}</ref><ref>{{cite journal|title=The axioms of subjective probability|journal=Statistical Science|year=1986|first=P. C.|last=Fishburn|volume=1|issue=3|pages=335–358 |doi=10.1214/ss/1177013611 |doi-access=free}}</ref>
* partial preference orderings
* partial preference orderings
* sets of desirable gambles<ref name="WILLIAMS1970" /><ref name="WILLIAMS2007" /><ref name="WALLEY1991"/>
* sets of desirable gambles<ref name="WILLIAMS1970" /><ref name="WILLIAMS2007" /><ref name="WALLEY1991"/>
* [[probability box|p-boxes]]<ref>{{cite web |url=http://www.ramas.com/unabridged.zip |title=Constructing Probability Boxes and Dempster-Shafer Structures |access-date=2009-09-23 |last=Ferson |first=Scott |author2=Vladik Kreinovich |author3=Lev Ginzburg |author4=David S. Myers |author5=Kari Sentz |year=2003 |work=SAND2002-4015 |publisher=Sandia National Laboratories |author2-link=Vladik Kreinovich |archive-url=https://web.archive.org/web/20110722073459/http://www.ramas.com/unabridged.zip |archive-date=2011-07-22 |url-status=dead }}</ref>
* [[probability box|p-boxes]]<ref>{{cite web |url=http://www.ramas.com/unabridged.zip |title=Constructing Probability Boxes and Dempster-Shafer Structures |access-date=2009-09-23 |last=Ferson |first=Scott |author2=Vladik Kreinovich |author3=Lev Ginzburg |author4=David S. Myers |author5=Kari Sentz |year=2003 |work=SAND2002-4015 |publisher=Sandia National Laboratories |author2-link=Vladik Kreinovich |archive-url=https://web.archive.org/web/20110722073459/http://www.ramas.com/unabridged.zip |archive-date=2011-07-22 }}</ref>
* [[robust Bayes analysis|robust Bayes methods]]<ref>{{Cite book| first=James O. | last=Berger| contribution=The robust Bayesian viewpoint| title=Robustness of Bayesian Analyses| url=https://archive.org/details/robustnessbayesi00kada | url-access=limited | editor-first=J. B.| editor-last=Kadane| publisher=Elsevier Science| pages=[https://archive.org/details/robustnessbayesi00kada/page/n74 63]–144| year=1984 }}</ref>
* [[robust Bayes analysis|robust Bayes methods]]<ref>{{Cite book| first=James O. | last=Berger| contribution=The robust Bayesian viewpoint| title=Robustness of Bayesian Analyses| url=https://archive.org/details/robustnessbayesi00kada | url-access=limited | editor-first=J. B.| editor-last=Kadane| publisher=Elsevier Science| pages=[https://archive.org/details/robustnessbayesi00kada/page/n74 63]–144| year=1984 | isbn=978-0-444-86209-9}}</ref>


==Interpretation of imprecise probabilities==
==Interpretation of imprecise probabilities==
A unification of many of the above-mentioned imprecise probability theories was proposed by Walley,<ref name="WALLEY1991"/> although this is in no way the first attempt to formalize imprecise probabilities. In terms of [[probability interpretations]], Walley's formulation of imprecise probabilities is based on the [[Bayesian probability|subjective variant of the Bayesian interpretation]] of probability. Walley defines upper and lower probabilities as special cases of upper and lower previsions and the gambling framework advanced by [[Bruno de Finetti]]. In simple terms, a decision maker's lower prevision is the highest price at which the decision maker is sure he or she would buy a gamble, and the upper prevision is the lowest price at which the decision maker is sure he or she would buy the opposite of the gamble (which is equivalent to selling the original gamble). If the upper and lower previsions are equal, then they jointly represent the decision maker's [[fair price]] for the gamble, the price at which the decision maker is willing to take either side of the gamble. The existence of a fair price leads to precise probabilities.
A unification of many of the above-mentioned imprecise probability theories was proposed by Walley,<ref name="WALLEY1991"/> although this is in no way the first attempt to formalize imprecise probabilities. In terms of [[probability interpretations]], Walley's formulation of imprecise probabilities is based on the [[Bayesian probability|subjective variant of the Bayesian interpretation]] of probability. Walley defines upper and lower probabilities as special cases of upper and lower previsions and the gambling framework advanced by [[Bruno de Finetti]]. In simple terms, a decision maker's lower prevision is the highest price at which the decision maker is sure he or she would buy a gamble, and the upper prevision is the lowest price at which the decision maker is sure he or she would buy the opposite of the gamble (which is equivalent to selling the original gamble). If the upper and lower previsions are equal, then they jointly represent the decision maker's [[fair price]] for the gamble, the price at which the decision maker is willing to take either side of the gamble. The existence of a fair price leads to precise probabilities.


The allowance for imprecision, or a gap between a decision maker's upper and lower previsions, is the primary difference between precise and imprecise probability theories. Such gaps arise naturally in [[prediction market|betting markets]] which happen to be financially [[illiquid]] due to [[asymmetric information]]. This gap is also given by [[Henry Kyburg]] repeatedly for his interval probabilities, though he and [[Isaac Levi]] also give other reasons for intervals, or sets of distributions, representing states of belief.
The allowance for imprecision, or a gap between a decision maker's upper and lower previsions, is the primary difference between precise and imprecise probability theories. Such gaps arise naturally in [[prediction market|betting markets]] that happen to be financially [[illiquid]] due to [[asymmetric information]]. This gap is also given by [[Henry Kyburg]] repeatedly for his interval probabilities, though he and [[Isaac Levi]] also give other reasons for intervals, or sets of distributions, representing states of belief.


==Issues with imprecise probabilities==
==Issues with imprecise probabilities==
One issue with imprecise probabilities is that there is often an independent degree of caution or boldness inherent in the use of one interval, rather than a wider or narrower one. This may be a degree of confidence, degree of fuzzy membership, or threshold of acceptance. This is not as much of a problem for intervals that are lower and upper bounds derived from a set of probability distributions, e.g., a set of priors followed by conditionalization on each member of the set. However, it can lead to the question why some distributions are included in the set of priors and some are not.
One issue with imprecise probabilities is that there is often an independent degree of caution or boldness inherent in the use of one interval, rather than a wider or narrower one. This may be a degree of confidence, degree of [[fuzzy membership]], or threshold of acceptance. This is not as much of a problem for intervals that are lower and upper bounds derived from a set of probability distributions, e.g., a set of priors followed by conditionalization on each member of the set. However, it can lead to the question why some distributions are included in the set of priors and some are not.


Another issue is why one can be precise about two numbers, a lower bound and an upper bound, rather than a single number, a point probability. This issue may be merely rhetorical, as the robustness of a model with intervals is inherently greater than that of a model with point-valued probabilities. It does raise concerns about inappropriate claims of precision at endpoints, as well as for point values.
Another issue is why one can be precise about two numbers, a lower bound and an upper bound, rather than a single number, a point probability. This issue may be merely rhetorical, as the robustness of a model with intervals is inherently greater than that of a model with point-valued probabilities. It does raise concerns about inappropriate claims of precision at endpoints, as well as for point values.


A more practical issue is what kind of decision theory can make use of imprecise probabilities.<ref>Seidenfeld, Teddy. "Decisions with indeterminate probabilities." Behavioral and Brain Sciences 6, no. 2 (1983): 259-261.</ref> For fuzzy measures, there is the work of Yager.<ref>Yager, R.R., 1978. Fuzzy decision making including unequal objectives. Fuzzy sets and systems, 1(2), pp.87-95.</ref> For convex sets of distributions, Levi's works are instructive.<ref>Levi, I., 1990. Hard choices: Decision making under unresolved conflict. Cambridge University Press.</ref> Another approach asks whether the threshold controlling the boldness of the interval matters more to a decision than simply taking the average or using a [[Hurwicz]] decision rule.<ref>Loui, R.P., 1986. Decisions with indeterminate probabilities. Theory and Decision, 21(3), pp.283-309.</ref> Other approaches appear in the literature.<ref>Guo, P. and Tanaka, H., 2010. Decision making with interval probabilities. European Journal of Operational Research, 203(2), pp.444-454.</ref><ref>Caselton, W.F. and Luo, W., 1992. Decision making with imprecise probabilities: Dempster‐Shafer theory and application. Water Resources Research, 28(12), pp.3071-3083.</ref><ref>Breese, J.S. and Fertig, K.W., 2013. Decision making with interval influence diagrams. arXiv preprint arXiv:1304.1096.</ref><ref>Gärdenfors, P. and Sahlin, N.E., 1982. Unreliable probabilities, risk taking, and decision making. Synthese, 53(3), pp.361-386.</ref>
A more practical issue is what kind of decision theory can make use of imprecise probabilities.<ref>{{cite journal |last=Seidenfeld |first=Teddy |title=Decisions with indeterminate probabilities |journal=Behavioral and Brain Sciences |volume=6 |issue=2 |year=1983 |pages=259–261 |doi=10.1017/S0140525X0001582X |s2cid=145583756 }}</ref> For fuzzy measures, there is the work of [[Ronald R. Yager]].<ref>{{cite journal |last=Yager |first=R. R. |year=1978 |title=Fuzzy decision making including unequal objectives |journal=Fuzzy Sets and Systems |volume=1 |issue=2 |pages=87–95 |doi=10.1016/0165-0114(78)90010-6 }}</ref> For convex sets of distributions, Levi's works are instructive.<ref>{{cite book |last=Levi |first=I. |year=1990 |title=Hard choices: Decision making under unresolved conflict |publisher=Cambridge University Press |isbn=0-521-38630-6 }}</ref> Another approach asks whether the threshold controlling the boldness of the interval matters more to a decision than simply taking the average or using a [[Leonid Hurwicz|Hurwicz]] decision rule.<ref>{{cite journal |last=Loui |first=R. P. |year=1986 |title=Decisions with indeterminate probabilities |journal=Theory and Decision |volume=21 |issue=3 |pages=283–309 |doi=10.1007/BF00134099 |s2cid=121036131 }}</ref> Other approaches appear in the literature.<ref>{{cite journal |last1=Guo |first1=P. |last2=Tanaka |first2=H. |year=2010 |title=Decision making with interval probabilities |journal=[[European Journal of Operational Research]] |volume=203 |issue=2 |pages=444–454 |doi=10.1016/j.ejor.2009.07.020 |s2cid=10582873 }}</ref><ref>{{cite journal |last1=Caselton |first1=W. F. |last2=Luo |first2=W. |year=1992 |title=Decision making with imprecise probabilities: Dempster‐Shafer theory and application |journal=Water Resources Research |volume=28 |issue=12 |pages=3071–3083 |doi=10.1029/92WR01818 }}</ref><ref>{{cite arXiv |last1=Breese |first1=J. S. |last2=Fertig |first2=K. W. |year=2013 |title=Decision making with interval influence diagrams |class=cs.AI |eprint=1304.1096 }}</ref><ref>{{cite journal |last1=Gärdenfors |first1=P. |last2=Sahlin |first2=N. E. |year=1982 |title=Unreliable probabilities, risk taking, and decision making |journal=[[Synthese]] |volume=53 |issue=3 |pages=361–386 |doi=10.1007/BF00486156 |s2cid=36194904 }}</ref>

== Bibliography ==
<references/>


==See also==
==See also==
Line 196: Line 195:
*[[Robust decision making]]
*[[Robust decision making]]
*[[Imprecise Dirichlet process]]
*[[Imprecise Dirichlet process]]

== References ==
{{Reflist}}


== External links ==
== External links ==

Latest revision as of 11:51, 1 July 2024

Imprecise probability generalizes probability theory to allow for partial probability specifications, and is applicable when information is scarce, vague, or conflicting, in which case a unique probability distribution may be hard to identify. Thereby, the theory aims to represent the available knowledge more accurately. Imprecision is useful for dealing with expert elicitation, because:

  • People have a limited ability to determine their own subjective probabilities and might find that they can only provide an interval.
  • As an interval is compatible with a range of opinions, the analysis ought to be more convincing to a range of different people.

Einführung

[edit]

Uncertainty is traditionally modelled by a probability distribution, as developed by Kolmogorov,[1] Laplace, de Finetti,[2] Ramsey, Cox, Lindley, and many others. However, this has not been unanimously accepted by scientists, statisticians, and probabilists: it has been argued that some modification or broadening of probability theory is required, because one may not always be able to provide a probability for every event, particularly when only little information or data is available—an early example of such criticism is Boole's critique[3] of Laplace's work—, or when we wish to model probabilities that a group agrees with, rather than those of a single individual.

Perhaps the most common generalization is to replace a single probability specification with an interval specification. Lower and upper probabilities, denoted by and , or more generally, lower and upper expectations (previsions),[4][5][6][7] aim to fill this gap. A lower probability function is superadditive but not necessarily additive, whereas an upper probability is subadditive. To get a general understanding of the theory, consider:

  • the special case with for all events is equivalent to a precise probability
  • and for all non-trivial events represents no constraint at all on the specification of

We then have a flexible continuum of more or less precise models in between.

Some approaches, summarized under the name nonadditive probabilities,[8] directly use one of these set functions, assuming the other one to be naturally defined such that , with the complement of . Other related concepts understand the corresponding intervals for all events as the basic entity.[9][10]

History

[edit]

The idea to use imprecise probability has a long history. The first formal treatment dates back at least to the middle of the nineteenth century, by George Boole,[3] who aimed to reconcile the theories of logic and probability. In the 1920s, in A Treatise on Probability, Keynes[11] formulated and applied an explicit interval estimate approach to probability. Work on imprecise probability models proceeded fitfully throughout the 20th century, with important contributions by Bernard Koopman, C.A.B. Smith, I.J. Good, Arthur Dempster, Glenn Shafer, Peter M. Williams, Henry Kyburg, Isaac Levi, and Teddy Seidenfeld.[12] At the start of the 1990s, the field started to gather some momentum, with the publication of Peter Walley's book Statistical Reasoning with Imprecise Probabilities[7] (which is also where the term "imprecise probability" originates). The 1990s also saw important works by Kuznetsov,[13] and by Weichselberger,[9][10] who both use the term interval probability. Walley's theory extends the traditional subjective probability theory via buying and selling prices for gambles, whereas Weichselberger's approach generalizes Kolmogorov's axioms without imposing an interpretation.

Standard consistency conditions relate upper and lower probability assignments to non-empty closed convex sets of probability distributions. Therefore, as a welcome by-product, the theory also provides a formal framework for models used in robust statistics[14] and non-parametric statistics.[15] Included are also concepts based on Choquet integration,[16] and so-called two-monotone and totally monotone capacities,[17] which have become very popular in artificial intelligence under the name (Dempster–Shafer) belief functions.[18][19] Moreover, there is a strong connection[20] to Shafer and Vovk's notion of game-theoretic probability.[21]

Mathematical models

[edit]

The term "imprecise probability" is somewhat misleading in that precision is often mistaken for accuracy, whereas an imprecise representation may be more accurate than a spuriously precise representation. In any case, the term appears to have become established in the 1990s, and covers a wide range of extensions of the theory of probability, including:

Interpretation of imprecise probabilities

[edit]

A unification of many of the above-mentioned imprecise probability theories was proposed by Walley,[7] although this is in no way the first attempt to formalize imprecise probabilities. In terms of probability interpretations, Walley's formulation of imprecise probabilities is based on the subjective variant of the Bayesian interpretation of probability. Walley defines upper and lower probabilities as special cases of upper and lower previsions and the gambling framework advanced by Bruno de Finetti. In simple terms, a decision maker's lower prevision is the highest price at which the decision maker is sure he or she would buy a gamble, and the upper prevision is the lowest price at which the decision maker is sure he or she would buy the opposite of the gamble (which is equivalent to selling the original gamble). If the upper and lower previsions are equal, then they jointly represent the decision maker's fair price for the gamble, the price at which the decision maker is willing to take either side of the gamble. The existence of a fair price leads to precise probabilities.

The allowance for imprecision, or a gap between a decision maker's upper and lower previsions, is the primary difference between precise and imprecise probability theories. Such gaps arise naturally in betting markets that happen to be financially illiquid due to asymmetric information. This gap is also given by Henry Kyburg repeatedly for his interval probabilities, though he and Isaac Levi also give other reasons for intervals, or sets of distributions, representing states of belief.

Issues with imprecise probabilities

[edit]

One issue with imprecise probabilities is that there is often an independent degree of caution or boldness inherent in the use of one interval, rather than a wider or narrower one. This may be a degree of confidence, degree of fuzzy membership, or threshold of acceptance. This is not as much of a problem for intervals that are lower and upper bounds derived from a set of probability distributions, e.g., a set of priors followed by conditionalization on each member of the set. However, it can lead to the question why some distributions are included in the set of priors and some are not.

Another issue is why one can be precise about two numbers, a lower bound and an upper bound, rather than a single number, a point probability. This issue may be merely rhetorical, as the robustness of a model with intervals is inherently greater than that of a model with point-valued probabilities. It does raise concerns about inappropriate claims of precision at endpoints, as well as for point values.

A more practical issue is what kind of decision theory can make use of imprecise probabilities.[31] For fuzzy measures, there is the work of Ronald R. Yager.[32] For convex sets of distributions, Levi's works are instructive.[33] Another approach asks whether the threshold controlling the boldness of the interval matters more to a decision than simply taking the average or using a Hurwicz decision rule.[34] Other approaches appear in the literature.[35][36][37][38]

See also

[edit]

References

[edit]
  1. ^ Kolmogorov, A. N. (1950). Foundations of the Theory of Probability. New York: Chelsea Publishing Company.
  2. ^ a b de Finetti, Bruno (1974). Theory of Probability. New York: Wiley.
  3. ^ a b c Boole, George (1854). An investigation of the laws of thought on which are founded the mathematical theories of logic and probabilities. London: Walton and Maberly.
  4. ^ Smith, Cedric A. B. (1961). "Consistency in statistical inference and decision". Journal of the Royal Statistical Society. B (23): 1–37.
  5. ^ a b c Williams, Peter M. (1975). Notes on conditional previsions. School of Math. and Phys. Sci., Univ. of Sussex.
  6. ^ a b c Williams, Peter M. (2007). "Notes on conditional previsions". International Journal of Approximate Reasoning. 44 (3): 366–383. doi:10.1016/j.ijar.2006.07.019.
  7. ^ a b c d e Walley, Peter (1991). Statistical Reasoning with Imprecise Probabilities. London: Chapman and Hall. ISBN 978-0-412-28660-5.
  8. ^ Denneberg, Dieter (1994). Non-additive Measure and Integral. Dordrecht: Kluwer.
  9. ^ a b c Weichselberger, Kurt (2000). "The theory of interval probability as a unifying concept for uncertainty". International Journal of Approximate Reasoning. 24 (2–3): 149–170. doi:10.1016/S0888-613X(00)00032-3.
  10. ^ a b Weichselberger, K. (2001). Elementare Grundbegriffe einer allgemeineren Wahrscheinlichkeitsrechnung I - Intervallwahrscheinlichkeit als umfassendes Konzept. Heidelberg: Physica.
  11. ^ a b c Keynes, John Maynard (1921). A Treatise on Probability. London: Macmillan And Co.
  12. ^ "Imprecise Probabilities > Historical appendix: Theories of imprecise belief (Stanford Encyclopedia of Philosophy)".
  13. ^ Kuznetsov, Vladimir P. (1991). Interval Statistical Models. Moscow: Radio i Svyaz Publ.
  14. ^ Ruggeri, Fabrizio (2000). Robust Bayesian Analysis. D. Ríos Insua. New York: Springer.
  15. ^ Augustin, T.; Coolen, F. P. A. (2004). "Nonparametric predictive inference and interval probability" (PDF). Journal of Statistical Planning and Inference. 124 (2): 251–272. doi:10.1016/j.jspi.2003.07.003.
  16. ^ de Cooman, G.; Troffaes, M. C. M.; Miranda, E. (2008). "n-Monotone exact functionals". Journal of Mathematical Analysis and Applications. 347 (1): 143–156. arXiv:0801.1962. Bibcode:2008JMAA..347..143D. doi:10.1016/j.jmaa.2008.05.071. S2CID 6561656.
  17. ^ Huber, P. J.; V. Strassen (1973). "Minimax tests and the Neyman-Pearson lemma for capacities". The Annals of Statistics. 1 (2): 251–263. doi:10.1214/aos/1176342363.
  18. ^ a b Dempster, A. P. (1967). "Upper and lower probabilities induced by a multivalued mapping". The Annals of Mathematical Statistics. 38 (2): 325–339. doi:10.1214/aoms/1177698950. JSTOR 2239146.
  19. ^ a b Shafer, Glenn (1976). A Mathematical Theory of Evidence. Princeton University Press. ISBN 978-0-691-08175-5.
  20. ^ de Cooman, G.; Hermans, F. (2008). "Imprecise probability trees: Bridging two theories of imprecise probability". Artificial Intelligence. 172 (11): 1400–1427. arXiv:0801.1196. doi:10.1016/j.artint.2008.03.001. S2CID 14060218.
  21. ^ Shafer, Glenn; Vladimir Vovk (2001). Probability and Finance: It's Only a Game!. Wiley.
  22. ^ Zadeh, L. A. (1978). "Fuzzy sets as a basis for a theory of possibility". Fuzzy Sets and Systems. 1: 3–28. doi:10.1016/0165-0114(78)90029-5. hdl:10338.dmlcz/135193.
  23. ^ Dubois, Didier; Henri Prade (1985). Théorie des possibilité. Paris: Masson.
  24. ^ Dubois, Didier; Henri Prade (1988). Possibility Theory - An Approach to Computerized Processing of Uncertainty. New York: Plenum Press. ISBN 978-0-306-42520-2.
  25. ^ Troffaes, Matthias C. M.; de Cooman, Gert (2014). Lower previsions. Wiley. doi:10.1002/9781118762622. ISBN 978-0-470-72377-7.
  26. ^ de Finetti, Bruno (1931). "Sul significato soggettivo della probabilità". Fundamenta Mathematicae. 17: 298–329. doi:10.4064/fm-17-1-298-329.
  27. ^ Fine, Terrence L. (1973). Theories of Probability. New York: Academic Press. ISBN 978-0-12-256450-5.
  28. ^ Fishburn, P. C. (1986). "The axioms of subjective probability". Statistical Science. 1 (3): 335–358. doi:10.1214/ss/1177013611.
  29. ^ Ferson, Scott; Vladik Kreinovich; Lev Ginzburg; David S. Myers; Kari Sentz (2003). "Constructing Probability Boxes and Dempster-Shafer Structures". SAND2002-4015. Sandia National Laboratories. Archived from the original on 2011-07-22. Retrieved 2009-09-23.
  30. ^ Berger, James O. (1984). "The robust Bayesian viewpoint". In Kadane, J. B. (ed.). Robustness of Bayesian Analyses. Elsevier Science. pp. 63–144. ISBN 978-0-444-86209-9.
  31. ^ Seidenfeld, Teddy (1983). "Decisions with indeterminate probabilities". Behavioral and Brain Sciences. 6 (2): 259–261. doi:10.1017/S0140525X0001582X. S2CID 145583756.
  32. ^ Yager, R. R. (1978). "Fuzzy decision making including unequal objectives". Fuzzy Sets and Systems. 1 (2): 87–95. doi:10.1016/0165-0114(78)90010-6.
  33. ^ Levi, I. (1990). Hard choices: Decision making under unresolved conflict. Cambridge University Press. ISBN 0-521-38630-6.
  34. ^ Loui, R. P. (1986). "Decisions with indeterminate probabilities". Theory and Decision. 21 (3): 283–309. doi:10.1007/BF00134099. S2CID 121036131.
  35. ^ Guo, P.; Tanaka, H. (2010). "Decision making with interval probabilities". European Journal of Operational Research. 203 (2): 444–454. doi:10.1016/j.ejor.2009.07.020. S2CID 10582873.
  36. ^ Caselton, W. F.; Luo, W. (1992). "Decision making with imprecise probabilities: Dempster‐Shafer theory and application". Water Resources Research. 28 (12): 3071–3083. doi:10.1029/92WR01818.
  37. ^ Breese, J. S.; Fertig, K. W. (2013). "Decision making with interval influence diagrams". arXiv:1304.1096 [cs.AI].
  38. ^ Gärdenfors, P.; Sahlin, N. E. (1982). "Unreliable probabilities, risk taking, and decision making". Synthese. 53 (3): 361–386. doi:10.1007/BF00486156. S2CID 36194904.
[edit]