Meta’s New AI Milestone for Image + Video Gen 🌟🖼️🎥

Plus: Google SGE's 3 new AI capabilities. Deepmind & YouTube’s AI music-gen model.

Nov 17, 2023

Hello Engineering Leaders and AI Enthusiasts!

Welcome to the 149th edition of The AI Edge newsletter. This edition brings you Meta’s new AI milestone: Launches AI tools for Image and Video.

And a huge shoutout to our incredible readers. We appreciate you😊

In today’s edition:

🌟 Meta’s new AI milestone for Image + Video gen
🆕 Google giving its SGE 3 new AI capabilities
🎧 Deepmind + YouTube’s advanced AI music-gen model
💡 Knowledge Nugget: Is Big Tech monopolizing the AI boom? by
Charlie Guo
and
AI Supremacy
.

Let’s go!

Meta’s new AI milestone for Image + Video gen

Meta’s AI research team has made significant advancements in video generation and image editing.

They are announcing new research into controlled image editing based solely on text instructions and a method for text-to-video generation based on diffusion models.

They have developed Emu Video, a method of creating HQ videos from text prompts. This unified architecture for video generation tasks can respond to various inputs: text only, image only, and both text and image.

Emu Edit is a new approach to image editing that aims to offer precise control and enhanced capabilities. Try a prompt, and continue tweaking the prompt until you get to a more desired outcome.

Why does this matter?

Meta's Emu Video offers easy video creation from text or images, fostering creativity on platforms like Facebook and Instagram. Not everyone has the expertise or tools for intricate image or video editing, but nearly everyone can communicate through text. While it augments human artistry, it doesn't aim to replace it.

The competition among Meta's Emu Video is Runway ML and OpenAI's DALL·E, which are in this AI creative race.

Source

Google giving its SGE 3 new AI capabilities

Google is giving its Search Generative Experience (SGE) three new capabilities.

1) Make finding holiday gifts easier. Users will be able to generate gift ideas by searching for specific categories, such as "great gifts for athletes," and explore options from various brands.

2) Users can virtually try on men's tops to see how they fit, and a new AI image generation feature will help users find similar products based on their preferences.

3) The final new addition uses AI image generation to create a product and help you find something that’s similar.

Additionally, Google Photos has a new AI feature to help organize and categorize photos. One feature called Photo Stacks will identify the best photo from a group and select it as the top pick. Another feature will categorize photos of things like screenshots and documents, allowing users to set reminders for them.

Why does this matter?

New SGE features enhance user convenience and promote exploration of diverse brands and products, fostering a more tailored shopping experience.

Source

Deepmind + YouTube’s advanced AI music-gen model

DeepMind and YouTube have released a new music generation model called Lyria and two toolsets called Dream Track and Music AI. Lyria works in conjunction with YouTube and aims to help with the creative process of music creation.

Dream Track allows creators to generate AI-generated soundtracks for YouTube Shorts, while Music AI provides tools for creating music with different instruments, building ensembles, and creating backing tracks for vocals. The goal is to make AI-generated music sound credible and maintain musical continuity. The tools are being released amidst controversy surrounding AI in the creative arts industry.

Why does this matter?

Lyria, with YouTube, helps make music-making simpler but raises questions about AI's impact on creativity and sparks debates about whether AI affects creativity in art.

Source

Enjoying the daily updates?

Refer your pals to subscribe to our daily newsletter and get exclusive access to 400+ game-changing AI tools.

Refer a friend

When you use the referral link above or the “Share” button on any post, you'll get the credit for any new subscribers. All you need to do is send the link via text or email or share it on social media with friends.

Knowledge Nugget: Is Big Tech monopolizing the AI boom?

A few big tech companies dominate the AI boom, raising concerns about monopolization. In this article

AI Supremacy

Charlie Guo

discussing about big tech companies, such as Google, Amazon, and Microsoft, are investing billions of dollars into AI startups, which have unique capital needs due to the high costs of building and training AI models.

While this concentration of funding may benefit the incumbents in the short term, it could limit the potential of AI and hinder innovation. On the other hand, Meta is taking a different approach by openly releasing its AI models, challenging the dominance of other tech giants.

The future impact of AI remains uncertain, but efforts should be made to make it more accessible and prevent monopolization.

Why does this matter?

The dominance of a few major tech companies in the AI sector, investing heavily in AI startups, raises concerns about monopolization. But the impact lies in the balance between accessibility and preventing a monopoly on AI, the importance of boosting innovation, and ensuring a more diverse AI landscape.

Source

What Else Is Happening❗

🔍 Google embeds Inaudible watermarks in its AI music

To identify if its AI tech has been used in creating a track, The watermarking tool, called SynthID, will be used to watermark audio from DeepMind's Lyria model. It is designed to be undetectable by the human ear and can still be detected even if the audio is compressed, sped up or down, or adds extra noise. (Link)

✏️ OpenAI exploring ways to bring ChatGPT into classrooms

According to the company's COO, Brad Lightcap: OpenAI plans to establish a team next year to explore educational applications of the technology. Initially, teachers were concerned about the potential for cheating and plagiarism, but they have since recognized the benefits of using ChatGPT as a learning tool. (Link)

👦 Google making Bard access available to teens

Teens who meet the minimum age requirement for managing their own Google account can access Bard in English, with more languages to be added later. Bard can be used to find inspiration, learn new skills, and solve everyday problems. (Link)

👀 Microsoft partnered with Be My Eyes to help blind people

With AI-powered visual assistance and using GPT-4. The digital visual assistant ‘Be My AI’ resolves issues in just 4 minutes without human agents. Team Be My Eyes has already integrated its software within Microsoft disability answer desk to help people. (Link)

🤔 ChatGPT rumors: It might be gaining long-term memory

In a viral tweet, ChatGPT’s new setting feature ‘Manage what it remembers’ shows upgrades like the ability for GPT to learn between chats, improve over time, and manage what it remembers. (Link)

That's all for now!

If you are new to The AI Edge newsletter, subscribe to get daily AI updates and news directly sent to your inbox for free!

Thanks for reading, and see you tomorrow. 😊

The AI Edge

Meta’s New AI Milestone for Image + Video Gen 🌟🖼️🎥

Plus: Google SGE's 3 new AI capabilities. Deepmind & YouTube’s AI music-gen model.

Meta’s new AI milestone for Image + Video gen

Google giving its SGE 3 new AI capabilities

Deepmind + YouTube’s advanced AI music-gen model

Enjoying the daily updates?

Knowledge Nugget: Is Big Tech monopolizing the AI boom?

What Else Is Happening❗

Discussion about this post