Tag: Gemini

  • Simon Willison’s Weblog: Calling LLMs from client-side JavaScript, converting PDFs to HTML + weeknotes

    Source URL: https://simonwillison.net/2024/Sep/6/weeknotes/
    Source: Simon Willison’s Weblog
    Title: Calling LLMs from client-side JavaScript, converting PDFs to HTML + weeknotes
    Feedly Summary: I’ve been having a bunch of fun taking advantage of CORS-enabled LLM APIs to build client-side JavaScript applications that access LLMs directly. I also spun up a new Datasette plugin for advanced permission management.…

  • Slashdot: Google To Relaunch Tool For Creating AI-Generated Images of People

    Source URL: https://tech.slashdot.org/story/24/08/28/2051216/google-to-relaunch-tool-for-creating-ai-generated-images-of-people
    Source: Slashdot
    Title: Google To Relaunch Tool For Creating AI-Generated Images of People
    AI Summary and Description: Yes
    Summary: Google is set to reintroduce AI image generation capabilities via its Gemini tool, featuring improvements to address previous inaccuracies and concerns from users. The upcoming Imagen 3 generator aims to provide…

  • New York Times – Artificial Intelligence: Google Says It Fixed Its A.I. Image Generator

    Source URL: https://www.nytimes.com/2024/08/28/technology/google-gemini-ai-image-generator.html
    Source: New York Times – Artificial Intelligence
    Title: Google Says It Fixed Its A.I. Image Generator
    Feedly Summary: The company will allow users of its Gemini chatbot to create images of people with artificial intelligence after disabling the feature six months ago.
    AI Summary and Description: Yes
    Summary: The text discusses Google’s…

  • Simon Willison’s Weblog: Gemini Chat App

    Source URL: https://simonwillison.net/2024/Aug/27/gemini-chat-app/#atom-everything
    Source: Simon Willison’s Weblog
    Title: Gemini Chat App
    Feedly Summary: Google released three new Gemini models today: improved versions of Gemini 1.5 Pro and Gemini 1.5 Flash plus a new model, Gemini 1.5 Flash-8B, which is significantly faster (and will presumably be cheaper) than the regular Flash model. They’re…

  • Simon Willison’s Weblog: NousResearch/DisTrO

    Source URL: https://simonwillison.net/2024/Aug/27/distro/#atom-everything
    Source: Simon Willison’s Weblog
    Title: NousResearch/DisTrO
    Feedly Summary: DisTrO stands for Distributed Training Over-The-Internet – it’s “a family of low latency distributed optimizers that reduce inter-GPU communication requirements by three to four orders of magnitude”. This tweet from @NousResearch helps explain why this could be a big deal: DisTrO can increase…

  • Simon Willison’s Weblog: Gemini Bounding Box Visualization

    Source URL: https://simonwillison.net/2024/Aug/26/gemini-bounding-box-visualization/#atom-everything
    Source: Simon Willison’s Weblog
    Title: Gemini Bounding Box Visualization
    Feedly Summary: Here’s another fun tool I built with the help of Claude 3.5 Sonnet. I was browsing through Google’s Gemini documentation while researching how different multi-model LLM APIs work when I stumbled across this note in the vision…

  • Google Online Security Blog: Private AI For All: Our End-To-End Approach to AI Privacy on Android

    Source URL: http://security.googleblog.com/2024/08/android-private-ai-approach.html
    Source: Google Online Security Blog
    Title: Private AI For All: Our End-To-End Approach to AI Privacy on Android
    AI Summary and Description: Yes
    Summary: The text discusses how Google is integrating advanced AI, specifically Gemini, into Android while prioritizing user privacy and security. It highlights Google’s commitment to on-device processing…