Planning and decision theory | The Knowledge Engineering Review | Cambridge Core (original) (raw)

Extract

Research on planning in AI can be separated into the two major areas: plan generation and plan representation. Most AI planners to date have been based on the STRIPS planning representation. This representation has a number of limitations. Much recent work in plan representation has addressed these limitations. It was shown that Decision Theory can be used to remove a number of the limitations. Furthermore, the decision theoretic framework provides a precise definition of rational behaviour. There remain open questions within decision theory regarding belief revision and causality. It should be noted that these problems are not artifacts of the representation. Rather they arise because the rich representation allows their formulation. Some work integrating AI and decision theoretic approaches to planning has been done but this remains a largely untouched research area.

We see two main avenues for fruitful research. First, the straightforward decision theoretic formulation of planning is computationally impractical. Techniques need to be developed to do efficient decision theoretic planning. Work in AI plan generation has exploited information contained the structure of qualitative representations to guide efficient plan construction. These techniques should be applied to decision theoretic representations as well. Second, AI has developed many representations that allow useful structuring of knowledge about the world. Decision Theory has concentrated on representing beliefs and desires. Integration of AI and decision theoretic representations would yield powerful representation languages. Some of the benefits of such work can already be seen in the research combining temporal and decision theoretic representations.

References

Allen, JF, 1984, “Towards a general theory of action and time” Artificial Intelligence 23(2) 123–154.CrossRefGoogle Scholar

Allen, JF and Koomen, JA, 1983, “Planning using a temporal world model” In: Proceedings of the eighth International Joint Conference on Artificial Intelligence pp 741–747, Karlsruhe, W. Germany,August 1983.Google Scholar

Appelt, D, 1985, “Planning English referring expressions” Artificial Intelligence 26(1) 1–33.CrossRefGoogle Scholar

Blackwell, D and Dubins, L, 1962, “Merging opinions with increasing information” Annals of Mathematical Statistics 33 882–887.CrossRefGoogle Scholar

Carnap, R, 1950, Logic Foundations of Probability Univ. of Chicago Press.Google Scholar

Cheeseman, P, 1983, “A method of computing generalized bayesian probability values for expert systems” In: Proceedings of the Eighth International Joint Conference on Artificial Intelligence pp 198–202, Karlsruhe, W. Germany,August 1983.Google Scholar

de Finetti, B, 1937, “La prevision: ses lois logiques, ses sources subjectives” Ann. Inst. Henri Poincare 7 1–68 (English translation in Kyburg and Smolker, 1964).Google Scholar

Dean, T and Kanazawa, K, 1988, “Probabilistic causal reasoning” In: Proceeding of the Fourth Workshop on Uncertainty in AI pp 73–80, Minneapolis, MN, August 1988.Google Scholar

Dean, T and Kanazawa, Keiji, 1988, “Probabilistic temporal reasoning” In: Proceedings of the Seventh National Conference on Artificial Intelligence pp 524–528, St. Paul, MN,1988.Google Scholar

DeGroot, M, 1970, Optimal Statistical Decisions McGraw-Hill.Google Scholar

Feldman, JA and Sproull, RF, 1977, “Decision theory and artificial intelligence ii: the hungry monkey” Artificial Intelligence 1 158–192.Google Scholar

Fikes, RE and Nilsson, NJ, 1971, “Strips: a new approach to the application of theorem proving to problem solving” Artificial Intelligence 2 189–208.CrossRefGoogle Scholar

Fishburn, PC, 1981, “Subjective expected utility: a review of normative theories” Theory and Decision 13 129–199.CrossRefGoogle Scholar

Fisher, RA, 1959, Smoking: The Cancer Controversy Oliver and Boyd.Google Scholar

Gaifman, H, 1986, “A theory of higher order probabilities” In: Proceedings of the Conference on Theoretical Aspects of Reasoning about Knowledge pp 275–292, Monterey,California,1986.Google Scholar

Gibbard, A and Harper, WL, 1978, “Counterfactuals and two kinds of expected utility” In: Hooker, , Leach, , and McClennen, , eds., Foundations and Applications of Decision Theory pp 125–162, D. Reidel, Dordrecht, Holland, 1978.Google Scholar

Goldstein, M, 1983, “The prevision of a prevision” J. American Statistical Association 78(384) 817–819.CrossRefGoogle Scholar

Green, C, 1969, “Application of theorem proving to problem solving” In: Proceedings of the First International Joint Conference on Artificial Intelligence pp 219–239, Washington, DC (Also in Weber, BL and Nilsson, NJ, eds., Readings in Artificial Intelligence Morgan Kaufmann, Los Alto, CA.)Google Scholar

Haas, AR, 1985, “Possible events, actual events, and robots” Computational Intelligence 1 59–70.CrossRefGoogle Scholar

Hass, AR, 1986, “A syntactic theory of belief and action” Artificial Intelligence 28 245–292.CrossRefGoogle Scholar

Haddawy, P and Frisch, AM, 1990, “Modal logics of higher-order probability” In: Shachter, R, Levitt, TS, Lemmer, J and Kanal, LN, eds., Uncertainty in Artificial Intelligence Elsevier.Google Scholar

Halpern, YJ, 1989, “The relationship between knowledge, belief, and certainty” In: Proceedings of the Fifth Workshop on Uncertainty in AI pp 142–151, An expanded version appears as IBM Research Report RJ 6765.Google Scholar

Hanks, S, 1988, “Representing and computing temporally scoped beliefs” In: Proceedings of the Seventh National Conference on Artificial Intelligence pp. 501–505, St. Paul, Minn.Google Scholar

Hanks, S, 1989, “Projecting plans for uncertain worlds” PhD thesis, Yale University.Google Scholar

Hayes, PJ, 1973, “The frame problem and related problems in artificial intelligence” In: Elithorn, A and Jones, D, eds., Artificial and Human Thinking pp 45–59, Jossey-Bass.Google Scholar

Hayes-Roth, B and Hayes-Roth, F, 1979, “A cognitive model of planning” Cognitive Science 3(4) 275–310.CrossRefGoogle Scholar

Hogarth, RM, 1975, “Cognitive processes and the assessment of subjective probability distributions” Journal of the American Association 70 271–294.CrossRefGoogle Scholar

Jaynes, ET, 1968, “Prior probabilities” IEEE Transactions on Systems, Science, and Cybernetics SSC–4 227–241.Google Scholar

Jaynes, ET, 1979, “Where do we stand on maximum entropy” In: Levine, and Tribus, , eds., The Maximum Entropy Formalism MIT Press.Google Scholar

Jeffreys, H, 1961, Theory of Probability 3rd edn., Oxford University Press.Google Scholar

Keeney, RL and Raiffa, H, 1976, Decisions with Multiple Objectives: Preferences and Value Tradeoffs Wiley.Google Scholar

Kemeny, J, 1955, “Fair bets and inductive probabilities” Journal of Symbolic Logic 20 263–273.CrossRefGoogle Scholar

Konolige, K, 1979, “A computer based consultant for mineral exploration, Appendix D: Bayesian methods for updating probabilities” report, SRI.Google Scholar

Kreps, DM, 1988, Notes on the Theory of Choice. Underground Classics in Economics Westview Press.Google Scholar

Kripke, S, 1963, “Semantical considerations on modal logic” Acta Philosophica Fennica 16 83–94 (Proceedings of a Colloquium on Modal and Many-Valued Logics Helsinki, 23–26 August, 1962).Google Scholar

Kyberg, HE Jr, and Smokler, HE, eds., 1969, Studies in Subjective Probability Wiley.Google Scholar

Lehman, RS, 1955, “On confirmation and rational betting” Journal of Symbolic Logic 20 251–262.CrossRefGoogle Scholar

Maher, P, 1990, “Betting on theories (unpublished manuscript).Google Scholar

Maher, P, 1990, “Symptomatic acts and the value of evidence in causal decision theory” Philosophy of Science September.CrossRefGoogle Scholar

McCarthy, J and Hayes, P, 1969, “Some philosophical problems from the standpoint of artificial intelligence” In: Meltzer, B and Michie, D, eds., Machine Intelligence 4, pp 463–502, Edinburgh University Press.Google Scholar

McDermott, DV, 1982, “A temporal logic for reasoning about processes and plans” Cognitive Science 6 101–155.Google Scholar

Moore, RC, 1985, “A formal theory of knowledge and action” In: Hobbs, JR and Moore, RC, eds., Formal Theories of the Commonsense World Ablex.Google Scholar

Morgenstern, L, 1986, “A first order theory of planning, knowledge, and action” In: Vardi, M, ed., Proceedings of the Conference on Theoretical Aspects of Reasoning About Knowledge, pp 99–114, Monterey.CrossRefGoogle Scholar

Pelavin, RN and Allen, JF, 1986, “A formal logic of plans in temporally rich domains” Proceedings of the IEEE 74(10) 1364–1382.CrossRefGoogle Scholar

Raiffa, H and Schlaifer, R, 1961, Applied Statistical Decision Theory MIT Press.Google Scholar

Ramsey, FP, 1926, “Truth and probability” In: Mellor, DH, ed., Foundations, chapter 3, pp 58–100, Humanitities Press.Google Scholar

Sacerdoti, ED, 1975, “A structure for plans and behavior” Technical Note 109, SRI.Google Scholar

Savage, LJ, 1954, The Foundations of Statistics John Wiley & Sons (Second revised edition published 1972).Google Scholar

Savage, LJ, 1971, “Elicitation of personal probabilities and expectations” Journal of the American Statistical Association 66 783–801.CrossRefGoogle Scholar

Schervish, MJ and Seidenfeld, , 1987, “An approach to consensus and certainty with increasing evidence” Technical Report 389, Carnegie-Mellon University, July 1987 (Forthcoming: J. Statistical Inference and Planning).Google Scholar

Shimony, A, 1955, “Coherence and the axioms of confirmation” Journal of Symbolic Logic 20 8–20.CrossRefGoogle Scholar

Sussman, GJ, 1975, A Computer Model of Skill Acquisition American Elsevier.Google Scholar

Tate, A, 1974, “INTERPLAN: A plan generation system which can deal with interactions between goals” memo MIP-R-109, Machine Intelligence Research Unit, Univ. of Edinburgh.Google Scholar

Tate, A, 1977, “Generating project networks” In; Proceedings of the Fifth International Joint Conference for Artificial Intelligence pp 888–893, Cambridge.Google Scholar

Venn, J, 1966, The Logic of Chance MacMillan (new paperback edition, Chelsea, 1962).Google Scholar

Vere, S, 1983, “Planning in time: windows and durations for activities and goals” IEEE Transactions on Pattern Analysis and Machine Intelligence 5(3) 246–267.CrossRefGoogle ScholarPubMed

Villegas, C, 1977, “On the representation of ignorance” J. American Statistical Association 72 651–654.CrossRefGoogle Scholar

von Miesis, R, 1957, Probability, Statistics and Truth Allen and Unwin.Google Scholar

Wellman, MP, 1988, “Formulation of tradeoffs in planning under uncertainty” PhD thesis, MIT.Google Scholar

Wilensky, R, 1983, Planning and Understanding. Addison-Wesley.Google Scholar

Wilkins, DE, 1984, “Domain independent planning: representation and plan generation” Artificial Intelligence 22 269–301.CrossRefGoogle Scholar