Meta’s New AI Makes Communication Seamless in 100 Languages 💬🗣️💬
Plus: NVIDIA integrated human-like intelligence to ADS. Mastercard introduces Muse AI for tailored shopping.
Hello Engineering Leaders and AI Enthusiasts!
Welcome to the 159th edition of The AI Edge newsletter. This edition brings you “Meta’s new 4 AI research models called Seamless Communication”.
And a huge shoutout to our incredible readers. We appreciate you😊
In today’s edition:
💬 Meta’s new AI makes communication seamless in 100 languages
.🚗
NVIDIA researchers have integrated human-like intelligence into ADS
🛍️ Mastercard introduces Muse AI for tailored shopping
💡 Knowledge Nugget: Let's make some AI mini games by
Let’s go!
Meta’s new AI makes communication seamless in 100 languages
Meta has developed a family of 4 AI research models called Seamless Communication, which aims to remove language barriers and enable more natural and authentic communication across languages. Here are they:
It is the first publicly available system that unlocks expressive cross-lingual communication in real-time and allows researchers to build on this work.
Try the SeamlessExpressive demo to listen how you sound in different languages.
Today, alongside their models, they are releasing metadata, data, and data alignment tools to assist the research community, including:
Metadata of an extension of SeamlessAlign corresponding to an additional 115,000 hours of speech and text alignments on top of the existing 470k hours.
Metadata of SeamlessAlignExpressive, an expressivity-focused version of the dataset above.
Tools to assist the research community in collecting more datasets for translation.
Why does this matter?
These models represent a significant step towards achieving seamless communication, high-quality AI translation, and an attempt to destroy language barriers. As they keep improving these models, the idea of a "universal translator" that helps people from different cultures talk easily gets more real.
NVIDIA researchers have integrated human-like intelligence into ADS
In this paper, the team of NVIDIA, Stanford, and USC researchers have released 'Agent-driver,' which integrates human-like intelligence into the driving system. It utilizes LLMs as a cognitive agent to enhance decision-making, reasoning, and planning.
Agent-Driver system includes a versatile tool library, a cognitive memory, and a reasoning engine. The system is evaluated on the nuScenes benchmark and outperforms existing driving methods significantly. It also demonstrates superior interpretability and the ability to learn with few examples. The code for this approach will be made available.
Why does this matter?
The approach demonstrates superior interpretability and few-shot learning ability. This can potentially revolutionize decision-making and planning in self-driving vehicles.
Mastercard introduces Muse AI for tailored shopping
Mastercard has launched Shopping Muse, an AI-powered tool that helps consumers find the perfect gift. AI will provide personalized recommendations on a retailer's website based on the individual consumer's profile, intent, and affinity.
Shopping Muse translates consumer requests made via a chatbot into tailored product recommendations, including suggestions for coordinating products and accessories. It considers the shopper's browsing history and past purchases to estimate future buying intent better.
Why does this matter?
The benefits of using Shopping Muse include saving time and effort, enhancing customer satisfaction and loyalty, and boosting sales and revenue for retailers. This is part of Mastercard's strategy to provide more value beyond transactions.
Enjoying the daily updates?
Refer your pals to subscribe to our daily newsletter and get exclusive access to 400+ game-changing AI tools.
When you use the referral link above or the “Share” button on any post, you'll get the credit for any new subscribers. All you need to do is send the link via text or email or share it on social media with friends.
Knowledge Nugget: Let's make some AI mini-games
The author
created a simple toy game engine called 100 Boxes to explore whether FRVR Forge works with other game engines. They then used this engine to create a series of mini-games, with the only commonality being that the game is over if you fail.The author learned some weaknesses in their integration and API design, but overall it was a fun vacation project. They also wrote a small wrapper in XS to allow the games to be played via web publishing. The games are tied together in a single game where the goal is to survive as many rounds as possible.
Why does this matter?
It showcases an experimental endeavor exploring the compatibility of FRVR Forge with different game engines. It highlights the potential for cross-engine integration and the adaptability of game development tools, offering insights into both its possibilities and limitations.
What Else Is Happening❗
🤑 Microsoft plans to invest $3.2B in UK to drive AI progress
It will be its largest investment in the country over the next three years. The funding will support the growth of AI and Microsoft's data center footprint in Britain. The investment comes as the UK government seeks private investment to boost infrastructure development, particularly in industries like AI. (Link)
🤝 HPE and NVIDIA extended their collaboration to enhance AI offerings
The partnership aims to enable customers to become "AI-powered businesses" by providing them with products that leverage Nvidia's AI capabilities. The deal is expected to enhance generative AI capabilities and help users maximize the potential of AI technology. (Link)
🔊 Voicemod now allows users to create and share their own AI voices
This AI voice-changing platform has new features including AI Voice Changer, which lets users create and customize synthetic voices with different genders, ages, and tones. (Link)
📱 Samsung introduces a new type of DRAM called Low Latency Wide IO (LLW)
The company claims it is perfect for mobile AI processing and gaming. It’s more efficient in processing real-time data than the LPDDR modules currently used in mobile devices. It sits next to the CPU inside the SoC and is suitable for gaming and AI applications. (Link)
🖼️ Ideogram just launched image prompting
Toronto-based AI startup Ideogram has launched its own text-to-image generator platform, competing with existing platforms like DALL-E, Midjourney, and Adobe Firefly. So now you can upload an image and control the output using visual input in addition to text. This is available to all of their Plus subscribers. (Link)
That's all for now!
If you are new to The AI Edge newsletter, subscribe to get daily AI updates and news directly sent to your inbox for free!
Thanks for reading, and see you tomorrow. 😊
These AI advancements are fascinating. Meta's Seamless Communication models are a major breakthrough in breaking down language barriers. The future looks promising for AI in communication, transportation, and shopping.