Bookcover of Who spoke when?
Booktitle:

Who spoke when?

Audio-based speaker location estimation for diarization

LAP LAMBERT Academic Publishing (2011-07-01 )

Books loader

Omni badge eligible for voucher
ISBN-13:

978-3-8443-8628-8

ISBN-10:
3844386289
EAN:
9783844386288
Book language:
English
Blurb/Shorttext:
Speaker diarization is the process which detects active speakers and groups those speech signals which has been uttered by the same speaker. Generally we can find two main applications for speaker diarization. Automatic Speech Recognition systems make use of the speaker homogeneous clusters to adapt the acoustic models to be speaker dependent and therefore increase recognition performance. Speaker indexing and rich transcription systems also use the speaker diarization output as one of information extracted from a recording, which allow its automatic indexation and other further processing. In this study a speaker diarization application is developed – using multiparty binaural speech recordings – to track speaker activity based on interaural time difference (ITD) cues. These cues, for a given speech signal frame, are computed using gammatone filtering and cross-correlation technique. Their values are used to determine which speaker in the recording produce the considered speech fragment. This study has been supervised by Dr. Jon Barker, and defended to fulfill the requirements for the degree of Master in Advanced Computer Science, University of Sheffield, United Kingdom, 2007.
Publishing house:
LAP LAMBERT Academic Publishing
Website:
https://www.lap-publishing.com/
By (author) :
Maral Dadvar
Number of pages:
68
Published on:
2011-07-01
Stock:
Available
Category:
Application software
Price:
49.00 €
Keywords:
Speaker localization, Speech detection, artificial intelligence, diarization

Books loader

Newsletter

Adyen::diners Adyen::jcb Adyen::discover Adyen::amex Adyen::mc Adyen::visa Adyen::cup Adyen::unionpay Adyen::paypal Paypal CryptoWallet Wire Transfer

  0 products in the shopping cart
Edit cart
Loading frontend
LOADING