Stable Diffusion 3 Creates Jaw-Dropping Text-to-images!
Plus: LongRoPE: Extending LLM context window beyond 2 million tokens, and Google Chrome introduces "Help me write" AI feature!
Hello, Engineering Leaders and AI Enthusiasts!
Welcome to the 217th edition of The AI Edge newsletter. This edition brings you Stability AI's early preview of Stable Diffusion 3.
And a huge shoutout to our incredible readers. We appreciate you😊
In today’s edition:
📱 Stable Diffusion 3 creates jaw-dropping images from text
✨ LongRoPE: Extending LLM context window beyond 2 million tokens
🤖 Google Chrome introduces "Help me write" AI feature
📚 Knowledge Nugget: Impact of AI on workplaces and will it replace humans
Let’s go!
Stable Diffusion 3 creates jaw-dropping text-to-images!
Stability AI announced Stable Diffusion 3 in an early preview. It is a text-to-image model with improved performance in multi-subject prompts, image quality, and spelling abilities. Stability AI has opened a waitlist for the model and is using the preview to gather insights before the open release.
Stability AI's Stable Diffusion 3 preview has generated significant excitement in the AI community due to its superior image and text generation capabilities. This next-generation image tool promises better text generation, strong prompt adherence, and resistance to prompt leaking, ensuring the generated images match the requested prompts.
Why does it matter?
The announcement of Stable Diffusion 3 is a significant development in AI image generation because it introduces a new architecture that combines a diffusion transformer with flow matching. Early demos of Stable Diffusion 3 show remarkable improvements in overall generation quality and appear to surpass competitors such as Midjourney, DALL-E 3, and Google ImageFX.
LongRoPE: Extending LLM context window beyond 2 million tokens
Researchers at Microsoft have introduced LongRoPE, a groundbreaking method that extends the context window of pre-trained large language models (LLMs) to an impressive 2048k tokens.
Current extended context windows are limited to around 128k tokens due to high fine-tuning costs, scarcity of long texts, and catastrophic values introduced by new token positions. LongRoPE overcomes these challenges in three ways: it exploits two forms of non-uniformity in positional interpolation, it extends the context window progressively rather than in one step, and it readjusts the model on shorter context windows to recover short-context performance.
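For readers who want a feel for what non-uniform positional interpolation means, here is a minimal Python sketch. It is not the researchers' implementation. It assumes the two non-uniformities are a separate rescale factor for each pair of RoPE dimensions and a number of initial token positions that are left uninterpolated. The function name rope_angles, the parameters scales and keep_first, and the example scale values are all hypothetical and chosen for illustration only.

```python
import math


def rope_angles(position, head_dim, base=10000.0, scales=None, keep_first=0):
    """Return one RoPE rotation angle per pair of dimensions for a token position.

    scales     -- hypothetical per-pair rescale factors (1.0 means no interpolation)
    keep_first -- hypothetical count of initial positions left uninterpolated
    """
    num_pairs = head_dim // 2
    if scales is None:
        scales = [1.0] * num_pairs
    if len(scales) != num_pairs:
        raise ValueError("scales needs one factor per pair of dimensions")

    angles = []
    for i in range(num_pairs):
        # Standard RoPE frequency for the i-th pair of dimensions.
        theta = base ** (-2.0 * i / head_dim)
        # Non-uniformity across positions: early tokens keep their original angle.
        # Non-uniformity across dimensions: each pair has its own factor.
        factor = 1.0 if position < keep_first else scales[i]
        angles.append(position * theta / factor)
    return angles


if __name__ == "__main__":
    # Illustrative factors only; they are made up, not taken from any paper.
    example_scales = [1.0, 4.0]
    for pos in (2, 8):
        print(pos, rope_angles(pos, head_dim=4, scales=example_scales, keep_first=4))
```

With uniform interpolation, every entry of scales would be the same number and keep_first would be 0. Letting them vary is what makes the interpolation non-uniform in this sketch.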
Experiments on LLaMA2 and Mistral across various tasks demonstrate the effectiveness of LongRoPE. The extended models retain the original architecture with minor positional embedding modifications and optimizations.
Why does it matter?
LongRoPE extends the context window of LLMs to more than 2 million tokens and opens up possibilities for very long-context tasks. That is the largest context window reported so far. By comparison, models like Google Gemini Pro support up to 1 million tokens. It also brings extended context windows to open-source models, a capability that has so far been limited to top proprietary models.
Google Chrome introduces "Help me write" AI feature
Google has recently rolled out an experimental AI feature called "Help me write" for its Chrome browser. This feature, powered by Gemini, aims to assist users in writing or refining text based on webpage content. It focuses on providing writing suggestions for short-form content, such as filling in digital surveys and reviews and drafting descriptions for items being sold online.
The tool can understand the webpage's context and pull relevant information into its suggestions, such as highlighting critical features mentioned on a product page for item reviews. Users can right-click on an open text field on any website to access the feature on Google Chrome.
This feature is currently only available for English-speaking Chrome users in the US on Mac and Windows PCs. To access this tool, users in the US can enable Chrome's Experimental AI under the "Try out experimental AI features" setting.
Why does it matter?
Google Chrome's "Help me write" AI feature can aid users in completing surveys, writing reviews, and drafting product descriptions. However, it is still in its early stages and may not inspire the same user confidence as Microsoft's Copilot in the Edge browser. Adjusting the prompts and the resulting text can negate any time-saving benefits, which leaves the effectiveness of this feature open for debate.
Enjoying the daily updates?
Refer your pals to subscribe to our daily newsletter and get exclusive access to 400+ game-changing AI tools.
When you use the referral link above or the “Share” button on any post, you'll get the credit for any new subscribers. All you need to do is send the link via text or email or share it on social media with friends.
Knowledge Nugget: Impact of AI on workplaces and will it replace humans.
In this article, the author discusses the impact of AI on the workplace and addresses the fear of AI replacing human workers. The article describes a scenario in which AI amplifies and augments people's abilities instead of replacing them, making them even better at their jobs. It highlights various examples of how AI is already enhancing productivity, such as:
In marketing, GPT-4, Bard, and Claude are helping create cold copy for outreach activities.
In sales, AI helps prioritize efforts and transcribe and summarize sales calls.
Customer service managers can train custom AI models for support calls.
AI can help in product development by analyzing human behavior and creating thousands of “product testers.”
AI tools like GitHub Copilot are already augmenting engineering teams in different ways for enhanced software development.
It further highlights the rapid adoption of AI and why learning about these tools is essential for each individual.
Why does it matter?
The article challenges the idea that AI will replace human workers. It also raises concerns about management exploiting AI to squeeze more work out of employees, and it emphasizes the need for ethical considerations and fair treatment of workers as AI is adopted. At the same time, it argues for the importance of AI in different fields.
What Else Is Happening❗
📢Google cut a deal with Reddit for AI training data.
Google and Reddit have formed a partnership that will benefit both companies. Google will pay $60 million per year for real-time access to Reddit's data, while Reddit will gain access to Google's Vertex AI platform. This will help Google train its AI and ML models at scale while also giving Reddit expanded access to Google's services. (Link)
🤖GPT Store introduces linking profiles, ratings, and enhanced about pages.
OpenAI's GPT Store platform has new features. Builders can link their profiles to GitHub and LinkedIn, and users can leave ratings and feedback. The About pages for GPTs have also been enhanced. (Link)
✏️Microsoft introduces a generative erase feature for AI-editing photos in Windows 11.
Microsoft's Photos app now has a Generative Erase feature powered by AI. It enables users to remove unwanted elements from their photos, including backgrounds. The AI edit features are currently available to Windows Insiders, and Microsoft plans to roll out the tools to Windows 10 users. However, there is no clarity on whether AI-edited photos will have watermarks or metadata to differentiate them from unedited photos. (Link)
🎧Suno AI V3 Alpha is redefining music generation.
The V3 Alpha version of Suno AI's music generation platform offers significant improvements, including better audio quality, longer clip length, and expanded language coverage. The update aims to redefine the state-of-the-art for generative music and invites user feedback with 300 free credits given to paying subscribers as a token of appreciation. (Link)
💸Jasper acquires image platform Clipdrop from Stability AI.
Jasper has acquired the AI image creation and editing platform Clipdrop from Stability AI, expanding its conversational AI toolkit with visual capabilities for a comprehensive multimodal marketing copilot. The Clipdrop team will work in Paris to contribute to research and innovation on multimodality, furthering Jasper's vision of being the most all-encompassing end-to-end AI assistant for powering personalized marketing and automation. (Link)
That's all for now!
Subscribe to The AI Edge and join the impressive list of readers that includes professionals from Moody’s, Vonage, Voya, WEHI, Cox, INSEAD, and other reputable organizations.
Thanks for reading, and see you tomorrow. 😊