MICROPHONE INPUT

(Real Time voice)

AUDIO LOADING

(Librosa)

RESAMPLING

(16 kHz)

NORMALIZATION & MONO CONVERSION

WHISPER ASR MODEL

(Speech → English Text)

ASR TEXT CLEANING

(Lowercase, Normalize)

REFERENCE TEXT

(Ground Truth)

WER CALCULATION

(Compare ASR vs Reference)

WER SCORE


(ASR Accuracy Metric)
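
The last three stages of the diagram (ASR text cleaning, comparison with the reference text, WER score) can be sketched as follows. This is a minimal, dependency-free illustration, not the author's implementation: the function names `clean_text` and `wer` are hypothetical, and the exact normalization rules (lowercasing, replacing punctuation other than apostrophes with spaces, collapsing whitespace) are assumptions, since the diagram only says "Lowercase, Normalize". WER is taken here in its usual form, (substitutions + deletions + insertions) / number of reference words, computed with a word-level edit distance.

```python
import re
from typing import List


def clean_text(text: str) -> List[str]:
    """Lowercase, replace punctuation with spaces, and split into words.

    Assumed normalization; the diagram does not spell out the rules.
    """
    text = text.lower()
    # Keep word characters, whitespace, and apostrophes (e.g. "don't").
    text = re.sub(r"[^\w\s']", " ", text)
    return text.split()


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate of `hypothesis` (ASR text) against `reference`."""
    ref = clean_text(reference)
    hyp = clean_text(hypothesis)
    if not ref:
        raise ValueError("reference must contain at least one word")

    # Word-level Levenshtein distance, keeping only one previous row.
    # prev[j] is the distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion (reference word missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)
```

For example, `wer("hello world", "Hello, world!")` returns 0.0 because both strings clean to the same two words, while a hypothesis that drops one word from a six-word reference scores 1/6. Note that WER can exceed 1.0 when the hypothesis contains many inserted words.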

Audio Preprocessing

by Tanuja
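
The preprocessing stages in the diagram (mono conversion, resampling to 16 kHz, normalization) can be illustrated with a small dependency-free sketch. In practice, a single call such as `librosa.load(path, sr=16000, mono=True)` handles loading, mono conversion, and resampling; the code below only shows what those steps amount to. Everything here is an assumption for illustration: the function names are hypothetical, resampling uses plain linear interpolation (cruder than the band-limited resampling Librosa performs), "normalization" is interpreted as peak normalization, and mono conversion is done before resampling for simplicity, although the diagram lists it afterwards.

```python
from typing import List, Sequence

TARGET_SR = 16_000  # sample rate named in the diagram


def to_mono(channels: Sequence[Sequence[float]]) -> List[float]:
    """Average equal-length channels sample by sample."""
    n = len(channels)
    return [sum(frame) / n for frame in zip(*channels)]


def resample_linear(
    samples: Sequence[float], orig_sr: int, target_sr: int = TARGET_SR
) -> List[float]:
    """Resample by linear interpolation (no anti-aliasing filter)."""
    if orig_sr == target_sr or len(samples) < 2:
        return list(samples)
    out_len = int(len(samples) * target_sr / orig_sr)
    step = orig_sr / target_sr  # input samples per output sample
    out = []
    for i in range(out_len):
        pos = i * step
        lo = int(pos)
        hi = min(lo + 1, len(samples) - 1)
        frac = pos - lo
        out.append(samples[lo] * (1.0 - frac) + samples[hi] * frac)
    return out


def peak_normalize(samples: Sequence[float], peak: float = 1.0) -> List[float]:
    """Scale so the largest absolute sample equals `peak`."""
    max_abs = max((abs(s) for s in samples), default=0.0)
    if max_abs == 0.0:
        return list(samples)  # silence: nothing to scale
    scale = peak / max_abs
    return [s * scale for s in samples]


def preprocess(channels: Sequence[Sequence[float]], orig_sr: int) -> List[float]:
    """Mono conversion, then resampling to 16 kHz, then peak normalization."""
    mono = to_mono(channels)
    resampled = resample_linear(mono, orig_sr)
    return peak_normalize(resampled)
```

Calling `preprocess(channels, orig_sr)` with a list of per-channel sample lists returns a single 16 kHz channel whose loudest sample has magnitude 1.0, which is the kind of input a Whisper-style ASR model expects.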
