EXTRAÇÃO DE SINAIS DE VOZ
EM AMBIENTES RUIDOSOS POR
DECOMPOSIÇÃO EM FUNÇÕES BASES
ESTATISTICAMENTE INDEPENDENTES

Exportar este item:

Use este identificador para citar ou linkar para este item: https://tedebc.ufma.br/jspui/handle/tede/369

Tipo do documento:	Dissertação
Título:	EXTRAÇÃO DE SINAIS DE VOZ EM AMBIENTES RUIDOSOS POR DECOMPOSIÇÃO EM FUNÇÕES BASES ESTATISTICAMENTE INDEPENDENTES
Título(s) alternativo(s):	EXTRATION OF VOICE SIGNALS IN NOISY ENVIRONMENTS FOR DECOMPOSITION IN FUNCTIONS STATISTICAL INDEPENDENT BASES
Autor:	Abreu, Natália Costa Leite
Primeiro orientador:	BARROS FILHO, Allan Kardec Duailibe
Resumo:	A constante busca para aperfeiçoar e estreitar o relacionamento entre homens e máquinas, tornando-o mais natural, não é nenhuma novidade. Conseqüentemente, o reconhecimento da voz possibilitará uma manipulação mais fácil e prática de equipamentos dotados com a capacidade de compreender a fala humana. Neste sentido e utilizando-se dos conhecimentos disponíveis na literatura de como o cérebro humano processa informações, alguns métodos propostos procuram simular computacionalmente essa habilidade, voltados principalmente à extração de um sinal de voz de uma mistura de sons, na tentativa de, por exemplo, aumentar a taxa de reconhecimento e inteligibilidade. A extração da voz pode ser obtida usando medidas de um único ou múltiplos canais. Para extrair uma voz em um único canal, propomos usar as características da voz pelo conceito de codificação eficiente, que procura imitar o modo como o córtex auditivo trata as informações, utilizando-se da técnica de Análise de Componentes Independentes (ICA), obtendo as funções bases dos sinais de entrada e recuperando o sinal estimado, mesmo quando são adicionadas interferências. Através de simulações comprovamos também a eficiência da técnica usada, primeiro, na recuperação de um sinal de voz com a utilização das funções bases de outro sinal e, segundo, frente a efeitos de reverberação. Esta técnica pode ser usada para extrair uma única fala eficazmente, como também prenuncia um modo novo de chegar ao problema de reconhecimento da fala/orador.
Abstract:	The constant search for the improvement and strengthening of the relationship between humans and machines turning it more natural is common place. Consequently, the recognition of speech will turn, easier and practical the handling of equipments supplied with the capacity to understand the human speech. In this sense and with the use of the available knowledge information in the literature as how the human brain processes informations, some suggested methods try to simulate this ability in the computer, especially devoted to the extraction of a speech signal of mixed sounds, attempting, for example to increase the recognition and comprehension rate. The extraction of speech can be obtained by measures of a single-channel or multiple the channels. In order to extract the speech in a single channel, it is proposed here to use the speech characteristics introducing the concept of efficient codification, that tries to imitate the way the auditory cortex gets information using the method of Independent Component Analysis (ICA), getting the basis functions of the input signals and retrieving the estimated signal even when we add interferences to it. Our simulations also prove the efficiency of our method against reverberation effects and the recovery of speech signal by the handling of basis function of other speech signals. This technique can be used efficiently both to extract a single speech, as well as highlighting new ways of approaching the speech/speaker recognition problem.
Palavras-chave:	Simples Canal de Voz Cocktail Party Análise de Componente Independente Reconhecimento da Fala Single Channel Speech Cocktail Party Independent Component Analysis Speech Recognition
Área(s) do CNPq:	CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO::ENGENHARIA DE SOFTWARE
Idioma:	por
País:	BR
Instituição:	Universidade Federal do Maranhão
Sigla da instituição:	UFMA
Departamento:	Engenharia
Programa:	PROGRAMA DE PÓS-GRADUAÇÃO EM ENGENHARIA DE ELETRICIDADE/CCET
Citação:	ABREU, Natália Costa Leite. EXTRATION OF VOICE SIGNALS IN NOISY ENVIRONMENTS FOR DECOMPOSITION IN FUNCTIONS STATISTICAL INDEPENDENT BASES. 2003. 91 f. Dissertação (Mestrado em Engenharia) - Universidade Federal do Maranhão, São Luis, 2003.
Tipo de acesso:	Acesso Aberto
URI:	http://tedebc.ufma.br:8080/jspui/handle/tede/369
Data de defesa:	11-Dez-2003
Aparece nas coleções:	DISSERTAÇÃO DE MESTRADO - PROGRAMA DE PÓS GRADUAÇÃO EM ENGENHARIA DE ELETRICIDADE

Arquivos associados a este item:

Arquivo	Tamanho	Formato
Natalia Costa Leite Abreu.pdf	821,77 kB	Adobe PDF	Baixar/Abrir Pré-Visualizar ×

Mostrar registro completo do item Recomendar este item Visualizar estatísticas

Universidade Federal do Maranhão

Biblioteca Digital de Teses e Dissertações