| Česky |
Cooperation Methods in Bayesian Decision Making with Multiple ParticipantsJan KracíkInstitute of Information Theory and Automation of the ASCR, Department of Adaptive SystemsIntroductionBayesian Decision Making, Fully Probabilistic DesignThe Bayesian theory represents a normative approach to the decision making under uncertainty. In practical applications a design of an optimal decision strategy is based on the following components:
Alternatively, the decision strategy can be designed using the, so-called, fully probabilistic design (FPD). Within the FPD, the objectives of the decision making are described by an ideal probability density function (pdf). The optimal decision strategy is then designed as a randomized one for which the Kullback-Leibler divergence of the joint pdf (given by the parametric model, decision strategy, and prior pdf) and the ideal pdf is minimal. The main asset of the FPD is that it has an explicit solution. MotivationIn practice the applicability of the Bayesian decision making is restricted by an extent of the given task. It is partially caused by the computational complexity, that grows quickly with increasing dimension of the task. There is also another, rather conceptual, reason: In large tasks it is often difficult to express the probabilistic model and objectives in a proper form. Instead, knowledge about the system and objectives are frequently available piecewise, i.e., separately for different parts of the system. Moreover, these parts can arbitrarily overlap. Then, it can easily happen that for some parts of the system there are several inconsistent objectives and knowledge pieces available. The need for a feasible solution of large tasks led to the idea of a distributed decision making with multiple Bayesian decision makers (participants). This task was solved in the Department of Adaptive Systems in the Institute of Information Theory and Automation of the ASCR within the project BADDYR (Bayesian Adaptive Distributed Decision Making) to the solution of which the presented thesis partially contributed. The staring point of the approach followed in the BADDYR project is that a group of Bayesian decision makers acts in a given system, whereas:
However, the inconsistency of the objectives and states of knowledge of the individual participants causes that if the participants act independently, as if they were the only decision makers in their environments, then the decision making can easily became inefficient. A common reason of this inefficiency is that the participants receive information about the objectives and knowledge of other participants only indirectly through the observed data. Such an information exchange is slow and thus ineffective. It is expected that the group decision making becomes more efficient if the participants are provided with cooperation means which allow them to communicate directly and to exploit the acquired information. As a distributed solution is aimed at, the communication cannot be realized centrally via a common facilitator but must be performed directly between the participants. The communication is meaningful only between participants with, at least partially, overlapping environments. A pair of such participants is called neighbours. The aim of the phd thesis is to design methods which allow the neighbours to share information on knowledge and objectives and use it to enhance the efficiency of the group decision making. The main part of the thesis consists of three chapters:
Multiple Participant Decision MakingIn this chapter the problem of Bayesian decision with multiple participants is formally introduced. Each participant is supposed to have:
Although our primary aim is the multiple participant decision making in a distributed form, it comes out that some kind of a centralized decision making task, called here a global task, must be inevitably considered. The global task serves as a criterion which enables to compare the tuples of decision strategies of the individual participants. However, the participants themselves do not provide enough information on which the global task could be constructed. For this purpose it is necessary to adopt additional assumptions, according to which, e.g., the sources of knowledge inconsistency could be modelled. Due to a large number of possible approaches to the construction of the global task their systematic classification is out of the scope of this work. Instead, we attempt to outline a formulation of a single specific case, on which the possibilities for the cooperation are illustrated. Moreover, this particular case allows to reveal some bottlenecks faced in the multiple participant decision making. The main results acquired in this chapter are as follows:
Global Objective SettingIn this chapter a method is proposed which enables to establish the global objective so that it is in some sense close to the objectives of the individual participants. Namely, it is supposed that the participants employ the fully probabilistic design. In such case, the objectives of the participants (local objectives) are represented by ideal pdfs on their environments. The global ideal pdf is then selected as the one which minimizes a weighted sum of the Kullback-Leibler divergences of the local ideal pdfs and the corresponding marginals of the searched global one. The weights in the sum represent optional parameters - the higher a particular weight is the closer to the objectives of the corresponding participant the global objectives are. The most important results of this chapter are as follows:
Bayesian Knowledge MergingIn the last chapter a method is proposed which allows a Bayesian decision maker to exploit an information in a form of a joint pdf of data and use it for updating its prior pdf. In a special case in which the information has a form of an empirical pdf from a data sample the proposed method leads to the same result as exploitation of the data themselves. In multiple participant decision making this method can be used as an approximate, but practically feasible, means for knowledge sharing between neighbours, even in case that that they use different parametric models. Its practical feasibility stems from the fact that for a parametric model from an exponential family and a conjugate prior pdf the updated prior pdf remains in a conjugate form. It is illustrated that under some assumptions on the parametric models of the communicating participants the proposed method is able to provide accurate knowledge sharing. ConclusionsThe main contributions of the thesis can be summarized as follows:
The thesis possesses also several open problems:
|