Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/160974
Title: Auditory source separation and its application to automatic music transcription
Authors: SHEN LING
Keywords: auditory source separation; automatic music transcription; temporal processing; multiple pitch estimation; spectral smoothness; sparse coding
Issue Date: 27-Jun-2005
Citation: SHEN LING (2005-06-27). Auditory source separation and its application to automatic music transcription. ScholarBank@NUS Repository.
Abstract: 

HUMAN BEINGS ARE CAPABLE OF LISTENING TO SIMULTANEOUS SOUND SOURCES WITH ONE EAR. AUDITORY SOURCE SEPARATION IS A COMPUTATIONAL APPROACH TO SIMULATE THIS ABILITY BY EXTRACTING MULTIPLE SOURCES FROM A SINGLE CHANNEL MIXTURE. THIS THESIS PRESENTS NOVEL TECHNIQUES FOR MODELING THE AUDITORY SOURCE SEPARATION IN A STATISTICAL FRAMEWORK. THE SOURCE SIGNALS ARE MODELED AS MARKOV PROCESS, WHICH IS CHARACTERIZED BY MEANS OF LOCAL CONSTRAINTS IN BOTH TIME DOMAIN AND FREQUENCY DOMAIN. TYPICAL AUDIO SIGNALS LIKE SPEECH AND HARMONIC SOUNDS ARE DISCUSSED. SPEECH SIGNALS ARE MODELED BY LINEAR PREDICTIVE CODING WITH SMOOTH CONSTRAINT ON THE COEFFICIENTS. HARMONIC SOUNDS ARE MODELED BY THE NORMALIZED AVERAGE SUMMATION FUNCTION. HIGHER-ORDER STATISTICS ARE MODELED BY INDEPENDENT COMPONENT ANALYSIS. TO REDUCE THE NUMBER OF PARAMETERS TO BE OPTIMIZED, SINUSOIDAL MODELING, AUDITORY FRONT-END AND INDEPENDENT COMPONENT ANALYSIS ARE EMPLOYED FOR DIMENSION REDUCTION.

THE HARMONIC SOURCE SEPARATION AL

URI: https://scholarbank.nus.edu.sg/handle/10635/160974
Appears in Collections:Ph.D Theses (Open)

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
ShenlingThesis.pdf683.24 kBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.