Einzeltreffer — DigiBib

Reinforcement learning is a growing field of research, but little work is being done to verify the correctness of reinforcement learning algorithms. Researchers are exploring the use of reinforcement learning in safety critical systems such as self-driving cars and autonomous aircraft, so mathematical proofs of correctness of the underlying reinforcement learning algorithms would greatly improve our confidence in the systems that utilize reinforcement learning. This project verifies convergence and optimality of two fundamental reinforcement learning algorithms: value iteration and policy iteration. These algorithms converge and are optimal if they eventually produce an optimal policy. It also is designed to be extensible to future research into verified reinforcement learning.

Titel:	Verifying Value Iteration and Policy Iteration in Coq
Autor/in / Beteiligte Person:	Masters, David M.
Link:	http://rave.ohiolink.edu/etdc/view?acc_num=ohiou1618999718015199
Veröffentlichung:	Ohio University / OhioLINK, 2021
Medientyp:	Hochschulschrift
Schlagwort:	Computer Science Reinforcement Learning Software Verification Coq Value Iteration Policy Iteration
Sonstiges:	Nachgewiesen in: Networked Digital Library of Theses & Dissertations Sprachen: English Collection: Ohiolink ETDs Original Material: http://rave.ohiolink.edu/etdc/view?acc_num=ohiou1618999718015199 Document Type: text Language: English Rights: unrestricted ; This thesis or dissertation is protected by copyright: all rights reserved. It may not be copied or redistributed beyond the terms of applicable copyright laws.

Klicken Sie ein Format an und speichern Sie dann die Daten oder geben Sie eine Empfänger-Adresse ein und lassen Sie sich per Email zusenden.

BibTeX Citavi, JabRef, u.a.
(Literaturverwaltung)

PDF kein Volltext!
(Merkzettel, Notizen)

RIS Endnote, Citavi u.a.
(Literaturverwaltung)

MODS
(XML zur Weiterverarbeitung)

oder

Wählen Sie das für Sie passende Zitationsformat und kopieren Sie es dann in die Zwischenablage, lassen es sich per Mail zusenden oder speichern es als PDF-Datei.

Gewünschter Zitations-Stil:

oder

Bitte prüfen Sie, ob die Zitation formal korrekt ist, bevor Sie sie in einer Arbeit verwenden. Benutzen Sie gegebenenfalls den "Exportieren"-Dialog, wenn Sie ein Literaturverwaltungsprogramm verwenden und die Zitat-Angaben selbst formatieren wollen.

Verifying Value Iteration and Policy Iteration in Coq