Google’s New AI Releases– Gemini API, MedLM, Imagen 2, MusicFX
Plus: Stability AI introduces Stable Zero123 for quality image-to-3D generation.
Hello Engineering Leaders and AI Enthusiasts!
Welcome to the 168th edition of The AI Edge newsletter. This edition brings you Google’s slew of new AI updates and announcements.
And a huge shoutout to our amazing readers. We appreciate you😊
In today’s edition:
🚀 Google’s new AI releases: Gemini API, MedLM, Imagen 2, MusicFX
🤖
Stability AI introduces Stable Zero123 for quality image-to-3D generation
📚 Knowledge Nugget: How to build a generative AI service's backend by
Let’s go!
We need your help!
We are working on a Gen AI survey and would love your input.
It takes just 2 minutes.
The survey insights will help us both.
And hey, you might also win a $100 Amazon gift card!
Every response counts. Thanks in advance!
Google’s new AI releases: Gemini API, MedLM, Imagen 2, MusicFX
Google is introducing a range of generative AI tools and platforms for developers and Google Cloud customers.
Gemini API in AI Studio and Vertex AI: Google is making Gemini Pro available for developers and enterprises to build for their own use cases. Right now, developers have free access to Gemini Pro and Gemini Pro Vision through Google AI Studio, with up to 60 requests per minute. Vertex AI developers can try the same models, with the same rate limits, at no cost until general availability early next year.
Imagen 2 with text and logo generation: Imagen 2 now delivers significantly improved image quality and a host of features, including the ability to generate a wide variety of creative and realistic logos and render text in multiple languages.
MedLM: It is a family of foundation models fine-tuned for the healthcare industry, generally available (via allowlist) to Google Cloud customers in the U.S. through Vertex AI. MedLM builds on Med-PaLM 2.
MusicFX: It is a groundbreaking new experimental tool that enables users to generate their own music using AI. It uses Google’s MusicLM and DeepMind’s SynthID to create a unique digital watermark in the outputs, ensuring the authenticity and origin of the creations.
Google also announced the general availability of Duet AI for Developers and Duet AI in Security Operations.
Why does this matter?
Google isn’t done yet. While its impressive Gemini demo from last week may have been staged, Google is looking to fine-tune and improve Gemini based on developers’ feedback. In addition, it is also racing with rivals to push the boundaries of AI in various fields.
Stability AI introduces Stable Zero123 for quality image-to-3D generation
Stable Zero123 generates novel views of an object, demonstrating 3D understanding of the object’s appearance from various angles– all from a single image input. It’s notably improved quality over Zero1-to-3 or Zero123-XL is due to improved training datasets and elevation conditioning.
The model is now released on Hugging Face to enable researchers and non-commercial users to download and experiment with it.
Why does this matter?
This marks a notable improvement in both quality and understanding of 3D objects compared to previous models, showcasing advancements in AI's capabilities. It also sets the stage for a transformative year ahead in the world of Generative media.
Enjoying the daily updates?
Refer your pals to subscribe to our daily newsletter and get exclusive access to 400+ game-changing AI tools.
When you use the referral link above or the “Share” button on any post, you'll get the credit for any new subscribers. All you need to do is send the link via text or email or share it on social media with friends.
Knowledge Nugget: How to build a generative AI service's backend
co-founded and launched Distillery a few months ago, with a promise to always be transparent and committed to the open source cause. This post aims to make true on this promise by disclosing the unsexy part of Distillery: its back end.It goes through the following topics:
How to build a Discord bot running Stable Diffusion in 15 minutes or less;
The fragilities that a simple bot like that needs to address;
How we did address them in Distillery;
The actual cloud architecture we are using, involving AWS, Runpod and Reyki AI as our cloud partners.
Why does this matter?
This post can help enthusiasts in their journey to learn how to deploy an AI system in a scalable way. It provides practical insights and a real-world architecture example.
What Else Is Happening❗
📰OpenAI partners with Axel Springer to deepen beneficial use of AI in journalism.
Axel Springer is the first publishing house globally to partner with OpenAI on a deeper integration of journalism in AI technologies. The initiative will enrich users’ experience with ChatGPT by adding recent and authoritative content on a wide variety of topics, and explicitly values the publisher’s role in contributing to OpenAI’s products. (Link)
🧠Accenture and Google Cloud launch joint Generative AI Center of Excellence.
It will provide businesses with the industry expertise, technical knowledge, and product resources to build and scale applications using Google Cloud’s generative AI portfolio and accelerate time-to-value. It will also help enterprises determine the optimal LLM– including Google’s latest model, Gemini– to use based on their business objectives. (Link)
🤝Google Cloud partners with Mistral AI on generative language models.
Google Cloud and Mistral AI are partnering to allow the Paris-based generative AI startup to distribute its language models on the tech giant's infrastructure. As part of the agreement, Mistral AI will use Google Cloud’s AI-optimized infrastructure, including TPU Accelerators, to further test, build, and scale up its LLMs. (Link)
🚫Amazon CTO shares how to opt out of 3rd party AI partner access to your Dropbox. Check out the tweet here (Link)
🌍Grok expands access to 40+ countries.
Earlier, it was only available to Premium+ subscribers in the US. Check out the list of countries here. (Link)
That's all for now!
Subscribe now to join the prestigious readership of The AI Edge alongside professionals from Moody’s, Vonage, Voya, WEHI, Cox, INSEAD, and other top organizations.
Thanks for reading, and see you tomorrow. 😊