MishaalRahman, Gemini Nano with Multimodality just got announced! This new, 3.8B parameter is designed to run on-device and can process not just text input but also audio and images. It’s coming later this year “starting with Pixel” and will be used for:
- Clearer descriptions with TalkBack. TalkBack will soon be able to automatically generate more useful image descriptions. This will help people with visual impairments who can’t see images, especially when those images don’t have alt text already.
(1/2)
Add comment