CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE
PROGRAMAÇÃO DINÂMICA HEURÍSTICA

Use this identifier to cite or link to this item: https://tedebc.ufma.br/jspui/handle/tede/494

Full metadata record

DC Field	Value	Language
dc.creator	Maciel, Allan James Ferreira	pt_BR
dc.creator.Lattes	http://lattes.cnpq.br/9294927489743146	por
dc.contributor.advisor1	FONSECA NETO, João Viana da	pt_BR
dc.contributor.advisor1Lattes	http://lattes.cnpq.br/0029055473709795	por
dc.contributor.referee1	Serra, Ginalber Luiz de Oliveira	pt_BR
dc.contributor.referee1Lattes	http://lattes.cnpq.br/0831092299374520	por
dc.date.accessioned	2016-08-17T14:53:22Z	-
dc.date.available	2013-04-03	pt_BR
dc.date.issued	2012-09-28	pt_BR
dc.identifier.citation	MACIEL, Allan James Ferreira. CONVERGENCE OF ESTIMATOR RLS FOR ALGORITHMS OF HEURISTIC DYNAMIC PROGRAMMING. 2012. 121 f. Dissertação (Mestrado em Engenharia) - Universidade Federal do Maranhão, São Luís, 2012.	por
dc.identifier.uri	http://tedebc.ufma.br:8080/jspui/handle/tede/494	-
dc.description.resumo	A união das metodologias de controle ótimo e de programação dinâmica tem impulsionado o desenvolvimento de algoritmos para realizações de sistemas de controle discreto do tipo regulador linear quadrático (DLQR). A metodologia utilizada neste trabalho é fundamentada sobre métodos de aprendizagem por reforço baseados em diferenças temporais e programação dinâmica aproximada. O método proposto combina a aproximação da função valor através do método RLS (mínimos quadrados recursivos) e iteração de política aproximada em esquemas de programação dinâmica heurística (HDP). A abordagem é orientada para a avaliação da convergência da solução DLQR e para a sintonia heurística das matrizes de ponderação Q e R da função de utilidade associada ao DLQR. É realizada a investigação das propriedades de convergência relacionadas à consistência, excitação persistente e polarização do estimador RLS. A metodologia contempla realizações de projetos de forma online de controladores DLQR e é avaliada em um sistema dinâmico multivariável de quarta ordem.	por
dc.description.abstract	Combining optimal control and dynamic programming methodologies has driven the development of algorithms for implementing discrete-time control systems of the linear quadratic regulator type (DLQR). The methodology used in this work is founded on reinforcement learning methods based on temporal differences and approximate dynamic programming. The proposed method combines value function approximation via the RLS (recursive least squares) method with approximate policy iteration in heuristic dynamic programming (HDP) schemes. The approach is oriented toward assessing the convergence of the DLQR solution and toward heuristic tuning of the weighting matrices Q and R of the utility function associated with the DLQR. The convergence properties of the RLS estimator related to consistency, persistent excitation, and bias are investigated. The methodology covers the online design of DLQR controllers and is evaluated on a fourth-order multivariable dynamic system.	eng
dc.description.provenance	Made available in DSpace on 2016-08-17T14:53:22Z (GMT). No. of bitstreams: 1 Dissertacao Allan James.pdf: 3170694 bytes, checksum: 054a9e74e81a7c2099800246d0b6c530 (MD5) Previous issue date: 2012-09-28	eng
dc.description.sponsorship	Coordenação de Aperfeiçoamento de Pessoal de Nível Superior	pt_BR
dc.format	application/pdf	por
dc.language	por	por
dc.publisher	Universidade Federal do Maranhão	por
dc.publisher.department	Engenharia	por
dc.publisher.country	BR	por
dc.publisher.initials	UFMA	por
dc.publisher.program	PROGRAMA DE PÓS-GRADUAÇÃO EM ENGENHARIA DE ELETRICIDADE/CCET	por
dc.rights	Acesso Aberto	por
dc.subject	Programação Dinâmica Heurística	por
dc.subject	Controle Multivariável	por
dc.subject	Controle Ótimo	por
dc.subject	Regulador Quadrático Linear Discreto	por
dc.subject	Mínimos Quadrados Recursivos	por
dc.subject	Controle Digital	por
dc.subject	Heuristic Dynamic Programming	eng
dc.subject	Multivariable Control	eng
dc.subject	Optimal Control	eng
dc.subject	Discrete Linear Quadratic Regulator	eng
dc.subject	Recursive Least Squares	eng
dc.subject	Digital Control	eng
dc.subject.cnpq	CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO	por
dc.title	CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA	por
dc.title.alternative	CONVERGENCE OF ESTIMATOR RLS FOR ALGORITHMS OF HEURISTIC DYNAMIC PROGRAMMING	eng
dc.type	Dissertação	por
Appears in collections:	DISSERTAÇÃO DE MESTRADO - PROGRAMA DE PÓS GRADUAÇÃO EM ENGENHARIA DE ELETRICIDADE

Files associated with this item:

File	Size	Format
Dissertacao Allan James.pdf	3.1 MB	Adobe PDF

Universidade Federal do Maranhão

Biblioteca Digital de Teses e Dissertações