Skip to content

Implement speaker diarization, voice activity detection, and/or conversation endpointing.Β #97

@kaeladair

Description

@kaeladair

Implement speaker diarization and VAD. This will let the agent understand who is speaking, providing the user with better responses. This should also get rid of audio hallucinations when there is silence, very important as the wearable will be recording during silence often if worn all the time.

Potential implementations:
https://github.com/pyannote/pyannote-audio
Deepgram

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions