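from transformers import pipeline
import gradio as gr

# Assumed: `p` was undefined in the original; a default ASR pipeline is the likely intent.
p = pipeline("automatic-speech-recognition")


def transcribe(audio):
    # `audio` is the path to the recorded file (type="filepath").
    text = p(audio)["text"]
    return text


gr.Interface(
    fn=transcribe,
    inputs=gr.Audio(source="microphone", type="filepath"),
    outputs="text").launch()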