Post

Prototyping Gemini pro on mobile

Last thursday, Google released early versions of Gemini Pro - their generative AI APIs and I created a quick iOS prototype app to test our their vision models πŸ‘€

I do deeply care about making good, accessible apps so apart from obvious banking/ fintech usecases, I think there is a lot of potential for it in a11y usecases πŸš€

  • I gave it a photo of a receipt for a lunch we had in our team and asked it create a json of item names and prices βœ…

  • I gave it a photo of my kid at Melbourne zoo and asked it to create an alt-text βœ…

  • I gave it a screenshot of an app and asked it to generate accessibility labels. It actually also generate attributes link links/ logo etc βœ…

I really do see there are some great usecases for Australian companies and sometimes I do wonder, why can’t we be at the forefront of disruptive tech? Is there no room for innovating, and moving fast in large enterprises here? Is it a matter of risk - that only venture capital based Silicon Valley companies are well-versed in? I am going to leave this open-ended question out there and seek no answers.

This post is licensed under CC BY 4.0 by the author.