Biggest Boom in AI: ChatGPT Talks & Beyond💥
Plus: Getty Images’s new AI art tool powered by NVIDIA, Colossal-AI’s huge release.
Hello Engineering Leaders and AI Enthusiasts!
Welcome to the 112th edition of The AI Edge newsletter. This edition brings you OpenAI’s big feature push: ChatGPT can now see, hear, and speak.
And a huge shoutout to our incredible readers. You all rock! 😊
In today’s edition:
💥 Biggest Boom in AI: ChatGPT Talks and Beyond
👏🖌️
Getty Images’s new AI art tool powered by NVIDIA
💰 Colossal-AI’s commercial-free LLM saving thousands
🧠 Knowledge Nugget: Market Map - Generative AI for Virtual Worlds by
Let’s go!
Biggest Boom in AI: ChatGPT Talks and Beyond
OpenAI is introducing voice and image capabilities in ChatGPT, allowing users to have voice conversations and show images to ChatGPT. This new feature offers a more intuitive interface and expands the ways in which ChatGPT can be used.
Users can have live conversations about landmarks, get recipe suggestions by showing pictures of their fridge, and even receive math problem hints by sharing photos. The voice and image capabilities will be rolled out to Plus and Enterprise users over the next two weeks, with voice available on iOS and Android and images available on all platforms.
ChatGPT can now comprehend images, including photos, screenshots, and text-containing documents, using its language reasoning abilities. You can also discuss multiple images and utilize their new drawing tool to guide you.
Why does this matter?
OpenAI’s this big feature push comes with ever-rising stakes in the AI race among chatbot leaders such as OpenAI, Microsoft, Google, and Anthropic. These new capabilities to ChatGPT make it a truly multimodal AI and 10x more convenient to use.
It enhances user experiences, expands educational potential, and opens up new horizons in problem-solving. However, they also come with important responsibilities and considerations regarding data privacy and ethical use.
Getty Images’s new AI art tool powered by NVIDIA
Getty Images has launched a generative AI art tool called Generative AI, which uses an AI model provided by Nvidia to render images from text descriptions. The tool is designed to be "commercially safer" than rival solutions, with safeguards to prevent disinformation and copyright infringement.
Getty Images will compensate contributors whose work is used to train the AI generator and share revenues generated from the tool. The tool can be accessed on Getty's website or integrated into apps and websites through an API, with pricing based on prompt volume. Other companies, including Bria and Shutterstock, are also exploring ethical approaches to generative AI.
Why does this matter?
Getty's plan to compensate artists and contributors whose work is used to train the AI model highlights the importance of fair compensation and setting a positive example for the industry.
This update enriches user experiences in art, design, and media consumption. They can expect more diverse, high-quality AI-generated content. Using its extensive library responsibly, it aims to create AI content that respects intellectual property rights.
Colossal-AI’s commercial-free LLM saving thousands
Colossal-AI has released Colossal-LLaMA-2, an open-source and commercial-free domain-specific language model solution. It uses a relatively small amount of data and training time, resulting in lower costs.
The Chinese version of LLaMA-2 has outperformed competitors in various evaluation benchmarks. The release includes improvements such as vocabulary expansion, a data cleaning system, and a multi-stage pre-training scheme to enhance Chinese and English abilities.
Why does this matter?
This release allows cost-effective training of lightweight domain-specific LLMs, enabling fine-tuning for specific business applications.
The progress made by the open-source community in this field is remarkable, and it raises the question of whether closed models like GPT-4 stand a chance if these open models continue to improve and become more accessible.
Enjoying the daily updates?
Refer your pals to subscribe to our daily newsletter and get exclusive access to 400+ game-changing AI tools.
When you use the referral link above or the “Share” button on any post, you'll get the credit for any new subscribers. All you need to do is send the link via text or email or share it on social media with friends.
Knowledge Nugget: Market Map - Generative AI for Virtual Worlds
This gem content piece by
provides a market map of companies developing generative artificial intelligence technology for virtual worlds, such as games, simulations, and metaverse applications.The map aims to organize the landscape of generative AI companies in a more convincing way than existing maps, which often seem random. It includes large companies that have significant investments or operations in this field, as well as smaller companies focused on specific categories. It explains the different layers of the value chain used to organize the chart and highlights the complexity of generative AI for virtual worlds.
Why does this matter?
This market map is more than just an informational graphic; it serves as a tool for industry stakeholders to gain clarity, identify opportunities, and navigate the rapidly evolving landscape of generative AI.
What Else Is Happening❗
✅ Tesla’s humanoid robot Optimus can now sort objects autonomously
Using its end-to-end trained neural network. The robot can calibrate itself using joint position encoders and vision to locate its limbs precisely. It can then sort colored blocks into their respective trays, even adapting to dynamic changes in the environment. (Link)
✅ Snapchat partners with Microsoft to insert ads into its AI chatbot feature, My AI
It offers link suggestions related to user conversations. The partnership is a win for Microsoft's ads business and could position Snapchat as a platform for Gen Z users to search for products and services through AI chats. (Link)
✅ Spotify is testing a voice translation feature for podcasts, using AI to translate content into different languages
By offering translated podcasts from popular hosts like Dax Shepard and Lex Fridman, Spotify hopes to expand its global reach and cater to a wider audience. (Link)
✅ Google's Bard now has new capabilities to help travelers plan their vacations
Connecting with various Google applications like Gmail, Google Flights, and Google Maps, It can provide personalized assistance throughout the trip. Users can ask Bard to find flight and hotel information, get directions, watch YouTube videos, and even check dates that work for everyone involved. (Link)
✅ Correcto has raised $7M in seed funding to expand its language writing tool for Spanish speakers
While AI tools like ChatGPT can generate text in Spanish, Correcto believes its tool offers better quality and provides opportunities for individual learning. The company plans to target enterprise customers while offering a freemium version for individual users. (Link)
That's all for now!
Subscribe to The AI Edge and gain exclusive access to content enjoyed by professionals from Moody’s, Vonage, Voya, WEHI, Cox, INSEAD, and other esteemed organizations.
Thanks for reading, and see you tomorrow. 😊