Google’s 12sec inference latency sets new benchmark
Plus: Mercedes-Benz gets smarter with ChatGPT. Hugging Face's QR code AI art generator.
Hello Engineering Leaders and AI Enthusiasts!
Welcome to the 43rd edition of The AI Edge newsletter. This edition covers Google’s 12-second inference latency, a new benchmark for GPU-equipped mobile devices.
And a big shoutout goes out to all our incredible readers! 😊
In today’s edition:
🌟 Google’s 12sec inference latency sets a new benchmark
🏎️ Mercedes-Benz gets smarter with ChatGPT
🤗 Hugging Face's first QR code AI art generator
Let’s go!
Google’s 12sec inference latency sets a new benchmark
Google researchers have developed a series of implementation optimizations for large diffusion models. These optimizations achieve the fastest reported inference latency to date on GPU-equipped mobile devices, improving the user experience and broadening the applicability of generative AI.
The improvements address the challenges posed by these models' size and resource requirements, allowing on-device deployment with benefits such as lower server costs and improved privacy. On a Samsung S23 Ultra, the team reports an inference latency under 12 seconds for a 512x512 image with 20 iterations of the Stable Diffusion 1.4 model, without int8 quantization.
(Stable Diffusion runs on modern smartphones in under 12 seconds. Note that running the decoder after each iteration for displaying the intermediate output in this animated GIF results in a ~2× slowdown.)
Paper: https://arxiv.org/pdf/2304.11267.pdf
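To put those numbers in perspective, here is a rough back-of-the-envelope sketch in Python. It assumes, for simplicity, that the whole 12-second budget is spent in the 20 denoising iterations (in practice, text encoding and the final image decoding also take part of it), and it treats the ~2× slowdown from the caption as an exact factor. The function and constant names are ours, for illustration only, and are not from the paper.

```python
# Back-of-the-envelope latency budget for the reported benchmark.
# Assumption: the entire ~12 s goes to the 20 denoising iterations
# (text encoding and final decoding are ignored for simplicity).
TOTAL_LATENCY_S = 12.0   # upper bound reported for the Samsung S23 Ultra
ITERATIONS = 20          # denoising iterations used in the benchmark
PREVIEW_SLOWDOWN = 2.0   # approximate slowdown when decoding after every step


def per_iteration_budget(total_s: float, iterations: int) -> float:
    """Average time available per iteration for a given total latency."""
    if iterations <= 0:
        raise ValueError("iterations must be positive")
    return total_s / iterations


def latency_with_preview(total_s: float, slowdown: float) -> float:
    """Estimated total latency when intermediate outputs are decoded and shown."""
    return total_s * slowdown


if __name__ == "__main__":
    step = per_iteration_budget(TOTAL_LATENCY_S, ITERATIONS)
    preview = latency_with_preview(TOTAL_LATENCY_S, PREVIEW_SLOWDOWN)
    print(f"Per-iteration budget: at most ~{step:.2f} s")
    print(f"With per-step preview: roughly {preview:.0f} s")
```

Under these simplifying assumptions, each iteration has to finish in well under a second, and showing intermediate images the way the animated GIF does would roughly double the total wait.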
Why does this matter?
Bringing on-device inference latency under 12 seconds significantly improves the user experience, making generative AI applications more responsive without a round trip to a server. It is a step toward near-real-time image generation on phones, and it shows that large models can run on consumer hardware.
Mercedes-Benz integrates ChatGPT to level up in-car voice assistant
Mercedes-Benz announced that it is integrating ChatGPT via Azure OpenAI Service to transform the in-car experience for drivers. Starting today, drivers in the US can opt into a beta program that makes the MBUX Voice Assistant’s “Hey Mercedes” feature even more intuitive and conversational. The enhanced capabilities will include:
More dynamic and interactive conversations with the voice assistant
Comprehensive responses to questions about the destination, a recipe, or more complex questions
Handling follow-up questions and maintaining contextual understanding
Integration with third-party services, exploring the ChatGPT plugin ecosystem
Imagine booking movie tickets or making restaurant reservations, all while on the road, for added convenience and productivity.
Why does this matter?
This paves the way for more intelligent driving experiences and accelerates AI adoption across the automotive industry. It also shows how other industries can tap into the most advanced AI models to enhance their products, with AI complementing technologies they already use, such as the cloud.
Hugging Face introduces QR code AI art generator
The Hugging Face Hub now has the first QR code AI art generator. All you need is the QR code content and a text-to-image prompt, or you can upload your own image. It then generates QR code-based artwork that is aesthetically pleasing while still preserving the essential QR code shape.
Why does this matter?
This adds a novel dimension to the AI innovation landscape, giving businesses a new way to generate aesthetically pleasing, contextual QR codes. Moreover, combining QR codes with image generation models tests the boundaries of what AI systems can accomplish, especially in the visual arts.
What Else Is Happening
🎥 AI reconstructs a large-scale scene from a single casually captured video
💼 Microsoft introduces next-gen AI and Copilot features across its ERP portfolio
🚀 Qualcomm launches AI-powered Video Collaboration Platform Suite
🤖 Mailchimp will use AI to expand offerings with 150 new features and updates
🌐 Meta is planning to offer its AI models for free commercial use
🎙️ Radio station gets a part-time AI DJ based on its midday host
Trending Tools
Creasquare: AI-powered social media content marketing platform. Create content 10x faster and generate captions in seconds.
Project Atlas Agents: No-code project management interface to build AI agents. Trained agents solve roadblocks in simple language.
One Click Crypto: AI + DeFi: AI-powered app analyzes blockchain history and risk tolerance to build a DeFi portfolio.
AI Image Variations: Generate variations of any image with AI (Stable Diffusion). Create unlimited AI variants of any source image.
CoverDesignAI: AI-powered tool for quick and easy book cover design. Offers tailored design inspirations & Midjourney prompts.
GPT-trainer: Build your own AI chatbot. Connect data for contextual responses. Embed on website or use in Slack.
Wallpapers.ai: Web tool to make unique HD wallpapers for PC, iPhone, or Android. Tell AI what you want and download for free!
CreateBookAI: Design and create beautiful illustrated children’s books in minutes. Perfect for gifts, Amazon KDP, or personalized books.
That's all for now!
If you are new to ‘The AI Edge’ newsletter, subscribe to receive the ‘Ultimate AI tools and ChatGPT Prompt guide’, specifically designed for Engineering Leaders and AI enthusiasts.
Thanks for reading, and see you tomorrow.