Diarize definition

Diarize means making a note or keeping an event in a di- ary. Speaker diarization, like keeping a record of events in such a diary, addresses the question of “who spoke when” [1, 2, 3] by logging speaker-specific salient events on multiparticipant (or multispeaker) audio data. Throughout the diarization pro- cess, the audio data would be divided and clustered into groups of speech segments with the same speaker identity/label. As a result, salient events, such as non-speech/speech transition or speaker turn changes, are automatically detected. In general, this process does not require any prior knowledge of the speak- ers, such as their real identity or the number of participating speakers in the audio data. Thanks to its feature of separat- ing audio streams by these speaker-specific events, speaker di- arization can be effectively employed for indexing or analyzing various types of audio data, e.g., audio/video broadcasts from media stations, conversations in conferences, personal videos from online social media or hand-held devices, court proceed- ings, business meetings, earnings reports in a financial sector, just to name a few.