CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE
PROGRAMAÇÃO DINÂMICA HEURÍSTICA

Exportar este item:

Use este identificador para citar ou linkar para este item: https://tedebc.ufma.br/jspui/handle/tede/494

Tipo do documento:	Dissertação
Título:	CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA
Título(s) alternativo(s):	CONVERGENCE OF ESTIMATOR RLS FOR ALGORITHMS OF HEURISTIC DYNAMIC PROGRAMMING
Autor:	Maciel, Allan James Ferreira
Primeiro orientador:	FONSECA NETO, João Viana da
Primeiro membro da banca:	Serra, Ginalber Luiz de Oliveira
Resumo:	A união das metodologias de controle ótimo e de programação dinâmica tem impulsionado o desenvolvimento de algoritmos para realizações de sistemas de controle discreto do tipo regulador linear quadrático (DLQR). A metodologia utilizada neste trabalho é fundamentada sobre métodos de aprendizagem por reforço baseados em diferenças temporais e programação dinâmica aproximada. O método proposto combina a aproximação da função valor através do método RLS (mínimos quadrados recursivos) e iteração de política aproximada em esquemas de programação dinâmica heurística (HDP). A abordagem é orientada para a avaliação da convergência da solução DLQR e para a sintonia heurística das matrizes de ponderação 􀜳 e 􀜴da função de utilidade associada ao DLQR. É realizada a investigação das propriedades de convergência relacionadas à consistência, excitação persistente e polarização do estimador RLS. A metodologia contempla realizações de projetos de forma online de controladores DLQR e é avaliada em um sistema dinâmico multivariável de quarta ordem.
Abstract:	The union of methodologies for optimal control and dynamics programming has stimulated the development of algorithms for realization of discrete control systems of the type linear quadratic regulator (DLQR). The methodology is based on reinforcement learning methods based on temporal differences and approximate dynamic programming. The proposed method combines the approach of the value function by method RLS (recursive least squares) and approximate policy iteration schemes heuristic dynamic programming (HDP). The approach is directed to the assessment of convergence of the solution DLQR and the heuristic weighting matrices 􀜳 and 􀜴 of the utility function associated with DLQR. The investigation of convergence properties related to consistency, persistent excitation and polarization of the RLS estimator is performed. The methodology involved in a project achievements online DLQR controllers and is evaluated in a fourth order multivariable dynamic system.
Palavras-chave:	Programação Dinâmica Heurística Controle Multivariável Controle Ótimo Regulador Quadrático Linear Discreto Mínimos Quadrados Recursivos Controle Digital Heuristic Dynamic Programming Multivariable Control Optimal Control Discrete Linear Quadratic Regulator Recursive Least Squares Digital Control
Área(s) do CNPq:	CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
Idioma:	por
País:	BR
Instituição:	Universidade Federal do Maranhão
Sigla da instituição:	UFMA
Departamento:	Engenharia
Programa:	PROGRAMA DE PÓS-GRADUAÇÃO EM ENGENHARIA DE ELETRICIDADE/CCET
Citação:	MACIEL, Allan James Ferreira. CONVERGENCE OF ESTIMATOR RLS FOR ALGORITHMS OF HEURISTIC DYNAMIC PROGRAMMING. 2012. 121 f. Dissertação (Mestrado em Engenharia) - Universidade Federal do Maranhão, São Luís, 2012.
Tipo de acesso:	Acesso Aberto
URI:	http://tedebc.ufma.br:8080/jspui/handle/tede/494
Data de defesa:	28-Set-2012
Aparece nas coleções:	DISSERTAÇÃO DE MESTRADO - PROGRAMA DE PÓS GRADUAÇÃO EM ENGENHARIA DE ELETRICIDADE

Arquivos associados a este item:

Arquivo	Tamanho	Formato
Dissertacao Allan James.pdf	3,1 MB	Adobe PDF	Baixar/Abrir Pré-Visualizar ×

Mostrar registro completo do item Recomendar este item Visualizar estatísticas

Universidade Federal do Maranhão

Biblioteca Digital de Teses e Dissertações