OpenAI Whisper Speaker Diarization - Transcription with Speaker Names

1littlecoder
52K views - 2 years ago
High-level overview of how speaker diarization with OpenAI Whisper works:

Using OpenAI's Whisper model to separate audio into segments and generate transcripts.
Then generating a speaker embedding for each segment.
Then using agglomerative clustering on the embeddings to identify the speaker for each segment.
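
The clustering step above can be sketched roughly as follows. This is a minimal, standard-library-only illustration and not the code from the Colab: the segments are hand-written dicts shaped like Whisper's segment output (start, end, text), the three-dimensional embeddings are made-up toy values standing in for real speaker embeddings, and the helper names (cosine_distance, agglomerative_labels, label_segments) are hypothetical. A real pipeline would more likely use a library implementation such as scikit-learn's AgglomerativeClustering.

```python
import math


def cosine_distance(a, b):
    """Cosine distance between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / norm


def agglomerative_labels(embeddings, num_speakers):
    """Average-linkage agglomerative clustering (toy version).

    Starts with one cluster per embedding and repeatedly merges the two
    closest clusters until only num_speakers clusters remain.
    Returns one cluster id per embedding.
    """
    clusters = [[i] for i in range(len(embeddings))]
    while len(clusters) > num_speakers:
        best = None  # (average distance, index i, index j)
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                dists = [
                    cosine_distance(embeddings[a], embeddings[b])
                    for a in clusters[i]
                    for b in clusters[j]
                ]
                avg = sum(dists) / len(dists)
                if best is None or avg < best[0]:
                    best = (avg, i, j)
        _, i, j = best
        clusters[i].extend(clusters.pop(j))  # j > i, so index i stays valid

    # Number the clusters by the order in which they first appear.
    labels = [0] * len(embeddings)
    for cluster_id, members in enumerate(sorted(clusters, key=min)):
        for member in members:
            labels[member] = cluster_id
    return labels


def label_segments(segments, embeddings, num_speakers):
    """Attach a speaker label to each transcript segment."""
    labels = agglomerative_labels(embeddings, num_speakers)
    return [
        dict(seg, speaker=f"SPEAKER {label + 1}")
        for seg, label in zip(segments, labels)
    ]


# Toy data: segments shaped like Whisper output, with made-up embeddings.
segments = [
    {"start": 0.0, "end": 2.5, "text": "Welcome to the show."},
    {"start": 2.5, "end": 5.0, "text": "Thanks for having me."},
    {"start": 5.0, "end": 8.0, "text": "Let's get started."},
    {"start": 8.0, "end": 11.0, "text": "Sounds good."},
]
embeddings = [
    [0.9, 0.1, 0.0],
    [0.1, 0.9, 0.1],
    [0.8, 0.2, 0.1],
    [0.0, 1.0, 0.2],
]

labeled = label_segments(segments, embeddings, num_speakers=2)
for seg in labeled:
    print(f'{seg["speaker"]} [{seg["start"]:.1f}s]: {seg["text"]}')
```

Note that the number of speakers is passed in by hand here, and the labels are anonymous (SPEAKER 1, SPEAKER 2); mapping them to real names is a separate step.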

Speaker identification (speaker labelling) is very important for podcast transcription and for transcribing conversational audio. This code helps you do that.

Dwarkesh Patel's tweet announcement - Twitter: 1579672641887408129

Colab - https://colab.research.google.com/dri...

https://huggingface.co/spaces/dwarkes...
Published 2 years ago, on 1401/09/23 (Solar Hijri calendar).
52,061 views