Florence-2, the new vision foundation model by Microsoft, can now run 100% locally in your browser on WebGPU, thanks to Transformers.js! 🤗🤯 It supports tasks like image captioning, optical character recognition, object detection, and many more! 😍 WOW!
- Demo: Xenova/florence2-webgpu
- Models: https://huggingface.co/models?library=transformers.js&other=florence2
- Source code: https://github.com/xenova/transformers.js/tree/v3/examples/florence2-webgpu
Introducing Whisper WebGPU: Blazingly-fast ML-powered speech recognition directly in your browser! 🚀 It supports multilingual transcription and translation across 100 languages! 🤯 The model runs locally, meaning no data leaves your device! 😍 Check it out! 👇
- Demo: Xenova/whisper-webgpu
- Source code: https://github.com/xenova/whisper-web/tree/experimental-webgpu
Transformers.js demos
A collection of my favorite WebML demos, built with Transformers.js!
- 🎤 Whisper Web
- 🖼️ Remove Background Web: In-browser background removal
- 🖼️ Depth Anything Web
- 👀 Distil Whisper Web
Xenova/tiny-random-WhisperForConditionalGeneration_timestamped Automatic Speech Recognition • Updated 3 days ago • 65
Xenova/tiny-random-Florence2ForConditionalGeneration Image-Text-to-Text • Updated 4 days ago • 54 • 5
Xenova/tiny-random-WhisperForConditionalGeneration Automatic Speech Recognition • Updated May 24 • 14