Livoa

USER INTERFACE

( Web App: Image / Audio / Mic )

IMAGE INPUT

AUDIO FILE INPUT

MICROPHONE INPUT

IMAGE PREPROCESSING

Grayscale, Contrast, Sharpening
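
The three image preprocessing steps named above can be sketched in plain Python. This is only an illustration under stated assumptions: the diagram does not say which library or parameters are used, so the BT.601 luma weights, the min-max contrast stretch, and the 3x3 sharpening kernel are all choices made for this example, and the function names are hypothetical.

```python
from typing import List, Tuple

RGBImage = List[List[Tuple[int, int, int]]]  # rows of (r, g, b) pixels, 0-255
GrayImage = List[List[int]]                  # rows of intensities, 0-255


def to_grayscale(rgb: RGBImage) -> GrayImage:
    """Convert RGB pixels to luma (assumed weights: ITU-R BT.601)."""
    return [[round(0.299 * r + 0.587 * g + 0.114 * b) for (r, g, b) in row]
            for row in rgb]


def stretch_contrast(img: GrayImage) -> GrayImage:
    """Linearly rescale intensities so the darkest pixel is 0 and the brightest 255."""
    lo = min(min(row) for row in img)
    hi = max(max(row) for row in img)
    if hi == lo:  # flat image: nothing to stretch
        return [row[:] for row in img]
    return [[round((p - lo) * 255 / (hi - lo)) for p in row] for row in img]


def sharpen(img: GrayImage) -> GrayImage:
    """Apply the kernel [[0,-1,0],[-1,5,-1],[0,-1,0]]; border pixels are copied as-is."""
    h, w = len(img), len(img[0])
    out = [row[:] for row in img]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            v = (5 * img[y][x] - img[y - 1][x] - img[y + 1][x]
                 - img[y][x - 1] - img[y][x + 1])
            out[y][x] = max(0, min(255, v))  # clamp to the valid range
    return out


def preprocess_image(rgb: RGBImage) -> GrayImage:
    """Run the steps in the order the diagram lists them."""
    return sharpen(stretch_contrast(to_grayscale(rgb)))
```

In practice an image library would do this far faster; the sketch only makes the order and effect of the steps concrete before the result is handed to OCR.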

AUDIO PREPROCESSING

Noise Reduction

Resampling (16kHz)
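
To make the resampling step concrete, here is a minimal sketch that converts a list of samples to 16 kHz by linear interpolation. The diagram does not state the resampling method, so linear interpolation is an assumption chosen for brevity, and `resample_linear` is a hypothetical name. When downsampling real audio, a low-pass filter would normally be applied first to avoid aliasing; that step is omitted here.

```python
from typing import List


def resample_linear(samples: List[float], src_rate: int,
                    dst_rate: int = 16000) -> List[float]:
    """Resample mono audio from src_rate to dst_rate using linear interpolation."""
    if src_rate <= 0 or dst_rate <= 0:
        raise ValueError("sample rates must be positive")
    if not samples:
        return []
    n_out = max(1, round(len(samples) * dst_rate / src_rate))
    step = src_rate / dst_rate          # source samples advanced per output sample
    last = len(samples) - 1
    out = []
    for i in range(n_out):
        pos = i * step                  # fractional position in the source signal
        left = min(int(pos), last)
        right = min(left + 1, last)     # clamp at the end of the signal
        frac = min(pos - left, 1.0)
        out.append(samples[left] * (1.0 - frac) + samples[right] * frac)
    return out
```

For example, one second of 48 kHz audio (48000 samples) comes out as 16000 samples, which matches the input rate that 16 kHz speech models such as Wav2Vec2 are commonly trained on.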

REAL-TIME AUDIO

Noise Filtering

Voice Capture
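
One simple way to picture noise filtering followed by voice capture is an energy gate: the stream is cut into short frames and only frames loud enough to be speech are kept. The diagram does not specify the technique, so this is a stand-in rather than the method actually used; the 20 ms frame length (320 samples at 16 kHz), the threshold value, and the function names are assumptions for illustration.

```python
import math
from typing import List


def frame_rms(frame: List[float]) -> float:
    """Root-mean-square energy of one frame of samples in the range [-1, 1]."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def capture_voice(samples: List[float], frame_len: int = 320,
                  threshold: float = 0.02) -> List[float]:
    """Keep only frames whose RMS energy reaches the threshold.

    frame_len=320 corresponds to 20 ms at 16 kHz (an assumed setting).
    """
    kept: List[float] = []
    for start in range(0, len(samples), frame_len):
        frame = samples[start:start + frame_len]
        if frame_rms(frame) >= threshold:
            kept.extend(frame)
    return kept
```

A fixed threshold is fragile in changing environments; a real-time system would more likely adapt it to the measured background level or use a trained voice activity detector.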

OCR (Image)


Textextract

ASR (Audio)

Wav2Vec2

ASR (Mic)

Whisper

Text Normalization

Text Normalization

Text Normalization
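
Since all three branches pass their recognized text through a normalization step, a small shared function is a natural fit. The sketch below assumes normalization means Unicode NFKC folding, whitespace collapsing, and removing stray spaces before punctuation; the diagram does not list the actual rules, and `normalize_text` is a hypothetical name.

```python
import re
import unicodedata


def normalize_text(text: str) -> str:
    """Clean up OCR or ASR output before classification and translation."""
    # Fold compatibility characters (full-width letters, ligatures) to plain forms.
    text = unicodedata.normalize("NFKC", text)
    # Collapse runs of spaces, tabs, and newlines into a single space.
    text = re.sub(r"\s+", " ", text)
    # Remove spaces that OCR often leaves before punctuation marks.
    text = re.sub(r"\s+([,.;:!?])", r"\1", text)
    return text.strip()
```

Case is deliberately left untouched here, because lowercasing can remove cues (proper nouns, sentence starts) that a translation model may rely on.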

Context Classification

Multilingual Translation

Multilingual Translation

Multilingual Translation

Output

Output

Output
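
Read top to bottom, the diagram describes three input branches that differ in preprocessing and recognition and then share the same later stages. The sketch below records that ordering as data. The stage identifiers are hypothetical labels derived from the node names, and the exact wiring between branches (for example, whether Context Classification is a single shared node) cannot be recovered from the flattened diagram, so treating the later stages as one common sequence is an assumption.

```python
from typing import Dict, List

# Branch-specific stages, keyed by the kind of input the web app receives.
BRANCH_STAGES: Dict[str, List[str]] = {
    "image": ["image_preprocessing", "ocr"],
    "audio_file": ["audio_preprocessing", "asr_wav2vec2"],
    "microphone": ["realtime_audio", "asr_whisper"],
}

# Stages that follow recognition in every branch (assumed common sequence).
SHARED_STAGES: List[str] = [
    "text_normalization",
    "context_classification",
    "multilingual_translation",
    "output",
]


def stages_for(input_kind: str) -> List[str]:
    """Return the ordered stage names for one kind of input."""
    if input_kind not in BRANCH_STAGES:
        raise ValueError(f"unsupported input kind: {input_kind!r}")
    return BRANCH_STAGES[input_kind] + SHARED_STAGES
```

Calling `stages_for("microphone")` yields the real-time audio and Whisper stages followed by the four shared ones, which mirrors the rightmost path in the diagram.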

Methodology

by Tanuja
