libri scuola books Fumetti ebook dvd top ten sconti 0 Carrello


Torna Indietro

altman eitan - constrained markov decision processes

Constrained Markov Decision Processes




Disponibilità: Normalmente disponibile in 20 giorni
A causa di problematiche nell'approvvigionamento legate alla Brexit sono possibili ritardi nelle consegne.


PREZZO
221,98 €
NICEPRICE
210,88 €
SCONTO
5%



Questo prodotto usufruisce delle SPEDIZIONI GRATIS
selezionando l'opzione Corriere Veloce in fase di ordine.


Pagabile anche con Carta della cultura giovani e del merito, 18App Bonus Cultura e Carta del Docente


Facebook Twitter Aggiungi commento


Spese Gratis

Dettagli

Genere:Libro
Lingua: Inglese
Pubblicazione: 03/1999
Edizione: 1° edizione





Note Editore

This book provides a unified approach for the study of constrained Markov decision processes with a finite state space and unbounded costs. Unlike the single controller case considered in many other books, the author considers a single controller with several objectives, such as minimizing delays and loss, probabilities, and maximization of throughputs. It is desirable to design a controller that minimizes one cost objective, subject to inequality constraints on other cost objectives. This framework describes dynamic decision problems arising frequently in many engineering fields. A thorough overview of these applications is presented in the introduction. The book is then divided into three sections that build upon each other. The first part explains the theory for the finite state space. The author characterizes the set of achievable expected occupation measures as well as performance vectors, and identifies simple classes of policies among which optimal policies exist. This allows the reduction of the original dynamic into a linear program. A Lagranian approach is then used to derive the dual linear program using dynamic programming techniques. In the second part, these results are extended to the infinite state space and action spaces. The author provides two frameworks: the case where costs are bounded below and the contracting framework. The third part builds upon the results of the first two parts and examines asymptotical results of the convergence of both the value and the policies in the time horizon and in the discount factor. Finally, several state truncation algorithms that enable the approximation of the solution of the original control problem via finite linear programs are given.




Sommario

INTRODUCTIONExamples of Constrained Dynamic Control ProblemsOn Solution Approaches for CMDPs with Expected CostsOther Types of CMDPsCost Criteria and AssumptionsThe Convex Analytical Approach and Occupation MeasuresLinear Programming and Lagrangian Approach for CMDPsAbout the MethodologyThe Structure of the BookPART ONE: FINITE MDPSMARKOV DECISION PROCESSESThe ModelCost Criteria and the Constrained ProblemSome NotationThe Dominance of Markov PoliciesTHE DISCOUNTED COSTOccupation Measure and the Primal LPDynamic Programming and Dual LP: the Unconstrained CaseConstrained Control: Lagrangian ApproachThe Dual LPNumber of RandomizationsTHE EXPECTED AVERAGE COSTOccupation Measure and the Primal LPEquivalent Linear ProgramThe Dual ProgramNumber of RandomizationsFLOW AND SERVICE CONTROL IN A SINGLE-SERVER QUEUEThe ModelThe LagrangianThe Original Constrained ProblemStructure of Randomization and Implementation IssuesOn Coordination Between ControllersOpen QuestionsPART TWO: INFINITE MDPSMDPS WITH INFINITE STATE AND ACTION SPACESThe ModelCost CriteriaMixed Policies, and Topologic StructuresThe Dominance of Markov PoliciesAggregation of StatesExtra Randomization in the PoliciesEquivalent Quasi-Markov Model and Quasi-Markov PoliciesTHE TOTAL COST: CLASSIFICATION OF MDPSTransient and Absorbing MDPsMDPs With Uniform Lyapunov FunctionsEquivalence of MDP With Unbounded and bounded costsProperties of MDPs With Uniform Lyapunov FunctionsProperties for Fixed Initial DistributionExamples of Uniform Lyapunov FunctionsContracting MDPsTHE TOTAL COST: OCCUPATION MEASURES AND THE PRIMAL LPOccupation MeasureContinuity of Occupation MeasuresMore Properties of MDPsCharacterization of Achievable Sets of Occupation MeasureRelation Between Cost and Occupation MeasureDominating Classes of PoliciesEquivalent Linear ProgramThe Dual ProgramTHE TOTAL COST: DYNAMIC AND LINEAR PROGRAMMINGNon-Constrained Control: Dynamic and Linear ProgrammingSuperharmonic Functions and Linear ProgrammingSet of Achievable CostsConstrained Control: Lagrangian ApproachThe Dual LPState TruncationA Second LP Approach for Optimal Mixed PoliciesMore on Unbound CostsTHE DISCOUNTED COSTThe Equivalent Total Cost ModelOccupation Measure and LPNon-negative Immediate CostWeak Contracting Assumptions and Lyapunov FunctionsExample: Flow and Service ControlTHE EXPECTED AVERAGE COSTOccupation MeasuresCompleteness Properties of Stationary PoliciesRelation Between Cost and Occupation MeasureDominating Classes of PoliciesEquivalent Linear ProgramThe Dual ProgramThe Contracting FrameworkOther Conditions for the Uniform IntegrabilityThe Case of Uniform Lyapunov ConditionsEXPECTED AVERAGE COST: DYNAMIC PROGRAMMING AND LPThe Non-Constrained Case: Optimality InequalityNon-Constrained Control: Cost Bounded BelowDynamic Programming and Uniform Lyapunov FunctionSuper-Harmonic Functions and Linear ProgrammingSet of Achievable Costs Constrained Control: Lagrangian ApproachThe Dual LPA Second LP Approach for Optimal Mixed PoliciesPART THREE: ASYMPTOTIC METHODS AND APPROXIMATIONSSENSITIVITY ANALYSISIntroductionApproximation of the ValuesApproximation and Robustness of the PoliciesCONVERGENCE OF DISCOUNTED CONSTRAINED MDPSConvergence in the Discount FactorConvergence to the Expected Average CostThe Case of Uniform Lyapunov FunctionCONVERGENCE AS THE HORIZON TENDS TO INFINITYThe Discounted CostThe Expected Average Cost: Stationary PoliciesThe Expected Average Cost: General PoliciesSTATE TRUNCATION AND APPROXIMATIONThe Approximating sets of StatesScheme I: the Total CostScheme II: the Total CostScheme III: the Total CostThe Expected Average CostInfinite MDPs: on the Number of RandomizationsAPPENDIX: CONVERGENCE OF PROBABILITY MEASURESREFERENCESLIST OF SYMBOLS AND NOTATIONINDEX




Autore

Altman, Eitan










Altre Informazioni

ISBN:

9780849303821

Condizione: Nuovo
Collana: Stochastic Modeling Series
Dimensioni: 9 x 6 in Ø 1.10 lb
Formato: Copertina rigida
Illustration Notes:277 equations
Pagine Arabe: 256


Dicono di noi