Google AI Lets You Step into 3D Indoors
Plus: Rerender A Video: Video-to-video AI, Google's new AI features
Hello Engineering Leaders and AI Enthusiasts!
Welcome to the 42nd edition of The AI Edge newsletter. This edition brings you the AI that enables Google Maps Immersive View to let users view and interact with indoor spaces in 3D.
And a special thank you to our amazing readers. Your ongoing support fuels our passion for delivering quality content. 😊
In today’s edition:
🤖Google AI Lets You Step into 3D Indoors
🎥Rerender A Video: Zero-shot text-guided video-to-video AI
🚀Google’s 3 new AI updates to make your life easier
📚 Knowledge Nugget: 5 years of progress in GPTs
Let’s go!
Google AI Lets You Step into 3D Indoors
Google reveals how it uses neural radiance fields (NeRF) in Immersive View to seamlessly fuse photos to produce realistic, multi-dimensional reconstructions within a neural network. Immersive View in Google Maps provides indoor views of restaurants, cafes, and other venues in 3D to give users a virtual up-close look that can help them confidently decide where to go.
Google describes its complete pipeline, from capturing photos of the space with DSLR cameras to making the interactive 3D 360° videos available on smartphones.
Why does this matter?
It is the first step of many in a journey towards universally accessible, AI-powered, immersive experiences. From a NeRF research perspective, it can help answer questions regarding reconstructions with scene segmentation, adapting NeRF to outdoor collections, and enabling real-time, interactive 3D exploration on-device.
Rerender A Video: Zero-shot text-guided video-to-video AI
New research has proposed a novel zero-shot text-guided video-to-video translation framework to adapt image models to videos. Called Rerender A Video, it takes an input video and re-renders it with your text prompt. It achieves both global and local temporal consistency at a low cost (without re-training or optimization).
Moreover, it is compatible with existing image diffusion techniques, indicating that it might be applied to other text-guided video editing tasks, such as video super-resolution and inpainting.
Why does this matter?
This approach can facilitate the creation of high-quality, temporally coherent videos and inspire further research in this field. It also removes the inconsistent flickering that occurs with current video-generating models. And because it is compatible with existing diffusion-based models, those models can benefit from it.
Google introduced 3 new AI updates to make your life easier
Google is using AI to improve advertising effectiveness, enhance the online shopping experience, and provide comprehensive information for travel and product searches, all with the aim of making your life easier. The three new AI updates are:
1) New AI-powered ad solutions to drive demand
Google and YouTube have unveiled two new AI-powered campaigns: Demand Gen and Video Views. These campaigns aim to assist advertisers in boosting creativity and generating consumer demand. They leverage immersive and relevant creatives to drive action and conversions during crucial moments. Research reveals that YouTube influences 87% of consumers in making purchase decisions faster.
The Demand Gen campaigns cater to the needs of social marketers, integrating top-performing video and image assets across YouTube, YouTube Shorts, Discover, and Gmail, reaching over 3 billion monthly users.
2) Virtually try on clothes with a new AI shopping feature
Google has introduced two new features to enhance the online shopping experience.
1. The first feature is a virtual try-on for apparel, which uses GenAI to show clothes on a diverse range of real models. This allows shoppers to see how the clothes look on different body types and skin tones before purchasing.
2. The second feature is guided refinements, which helps shoppers find the perfect piece by allowing them to refine their search using inputs like color, style, and pattern. This feature provides options from various retailers across the web, giving shoppers a wider range of choices.
3) Google intros new AI-powered travel and product search features
Google has introduced new features to its experimental search experience called Search Generative Experience (SGE). The updates focus on travel and shopping and aim to provide users with comprehensive information and insights. When users ask about a place or destination, they will see a snapshot that combines web information, reviews, photos, and business profile details. For shopping queries, SGE will display product descriptions, reviews, ratings, prices, images, and recommendations.
Why does this matter?
Google’s new AI advancements aim to simplify users' lives by saving time, improving decision-making, and increasing satisfaction when interacting with Google's platforms and services.
Knowledge Nugget: 5 years of progress in GPTs
This informative article discusses the evolution of the generative pre-trained transformer (GPT) line of research, focusing on the state-of-the-art (SOTA) models and the differences between them. It covers in depth models such as GPT, GPT-2, GPT-3, Jurassic-1, Megatron-Turing NLG, Gopher, and Chinchilla. These models represent advancements in natural language processing and have different architectural and computational characteristics.
Why does this matter?
While other articles provide summaries of these papers, this piece explicitly focuses on the differences between them. Moreover, the GPT line of research is currently driving intense development in the field.
What Else Is Happening
👁️🗨️ See this AI-powered enhancement bring creative compositions. (Link)
🤖 Synthesia raised $90M for AI-generated custom avatars. (Link)
🚀 France’s Mistral AI raised $113M in seed funding at a $260M valuation to challenge OpenAI. (Link)
💰 Lenovo to invest $1B to accelerate AI solutions. (Link)
📱 Truecaller introduced AI-powered call recording for Android and iPhone. (Link)
💻 Vercel’s AI Accelerator is a 6-week program with $850k+ in credits from Vercel and top AI platforms. (Link)
Trending Tools
BeforeSunset AI: AI daily planner tool that plans your day based on your schedule and to-do list. Provides analytics for stress-free sunsetting.
Hotjar AI for Surveys: Instantly generates survey questions, analyzes open-text responses, and prepares summary reports in just a few clicks.
AlphaCTR: AI platform trained to generate high-performance thumbnails and ad creatives. Get hundreds of options in just a few clicks.
Lunacy: Design software that keeps your flow with AI tools and built-in graphics. Skip the routine and focus on creative tasks.
Masthead Data: Helps data engineers see anomalies and pipeline errors in real-time. Trace data flows and optimize cloud compute for data pipelines.
Spell AI: Delegate tasks to autonomous AI agents who surf the web and use plugins to get your work done.
Cohesive AI Voices: Collection of 20+ human-sounding voices for generating voiceovers in 10+ languages for TikTok, YouTube videos, reels, podcasts, and more.
Juri Flow: Your personal AI lawyer—designed to assist lawyers, law students, and individuals seeking help.
That's all for now!
Join The AI Edge alongside readers from Moody’s, Vonage, Voya, WEHI, Cox, INSEAD, and other respected organizations to stay at the forefront of AI trends and advancements.
Thanks for reading, and see you tomorrow.