Combining Multiple Public Pre-Trained Models for Better Audio Segmentation | Toronto

April 11, 2024 · Toronto

Whisper/VAD Multi-Model Segmentation

The session explains how merging Whisper, WebRTC‑VAD, and other public models creates a multi‑model pipeline that improves audio segmentation without extra preprocessing.
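
The summary does not say how the outputs of the models are combined. As a rough sketch of one possible approach, and not the speaker's actual method, the snippet below assumes that a transcription model such as Whisper has produced coarse (start, end) segments in seconds and that a voice activity detector such as WebRTC-VAD has produced speech regions in the same units. It tightens each coarse segment to the speech it actually contains and drops segments with no detected speech. The function names, the `max_gap` parameter, and the sample timings are all hypothetical.

```python
from typing import List, Tuple

Interval = Tuple[float, float]  # (start_seconds, end_seconds)


def merge_intervals(intervals: List[Interval], max_gap: float = 0.0) -> List[Interval]:
    """Merge intervals that overlap or are separated by at most max_gap seconds."""
    merged: List[Interval] = []
    for start, end in sorted(intervals):
        if merged and start - merged[-1][1] <= max_gap:
            # Close enough to the previous region: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def clip_to_speech(segment: Interval, speech: List[Interval]) -> List[Interval]:
    """Return the parts of `segment` that overlap any speech region."""
    seg_start, seg_end = segment
    pieces: List[Interval] = []
    for sp_start, sp_end in speech:
        start, end = max(seg_start, sp_start), min(seg_end, sp_end)
        if start < end:
            pieces.append((start, end))
    return pieces


def refine_segments(
    asr_segments: List[Interval],
    vad_regions: List[Interval],
    max_gap: float = 0.2,  # hypothetical tolerance for short pauses
) -> List[Interval]:
    """Tighten each coarse segment to the speech it contains.

    Segments that contain no detected speech are dropped.
    """
    speech = merge_intervals(vad_regions, max_gap)
    refined: List[Interval] = []
    for seg in asr_segments:
        pieces = clip_to_speech(seg, speech)
        if pieces:
            # Span from the first to the last speech piece inside the segment.
            refined.append((pieces[0][0], pieces[-1][1]))
    return refined


# Made-up timings for illustration only.
asr_segments = [(0.0, 4.0), (4.0, 9.0), (9.0, 10.0)]
vad_regions = [(0.5, 1.5), (1.6, 3.2), (5.0, 8.0)]
print(refine_segments(asr_segments, vad_regions))
```

In this sketch the detector decides where speech starts and stops, while the transcription model decides how speech is grouped into segments. Other combinations, such as voting across several detectors, would be equally plausible.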
