Transcribe voice into text
Detect language from audio input
Analyze audio and perform speaker diarization with quality metrics