AI Weekly Rundown (October 21 to October 27)

Major AI announcements from Meta, NVIDIA, OpenAI, Google and more this week.

Oct 27, 2023

Hello, Engineering Leaders and AI Enthusiasts!

Another eventful week in the AI realm. Lots of big news from huge enterprises.

In today’s edition:

🤖 Meta’s Habitat 3.0 can train AI agents to assist humans in daily tasks
🔥 NVIDIA's AI teaches robots complex skills on par with humans
🧪 OpenAI’s secret sauce behind DALL-E 3’s accuracy
💪 Qualcomm's new PC chip for AI to challenge Apple, Intel
🔝 Microsoft is outdoing its biggest rival, Google, in AI
🚀 Samsung Galaxy S24 is your upcoming pocket AI machine
😱 OpenAI’s new rival Jina AI has open-source 8k context
🔍 LLM hallucination problem will be over with “Woodpecker”
🌟 NVIDIA Research has announced new AI advancements
🛡 OpenAI forms 'Preparedness' team to study advanced AI risks
🔒 Google’s new ventures for safer, more secure AI
🐕 Robot dog turns into a talking tour guide with ChatGPT

Let’s go!

Meta’s Habitat 3.0 can train AI agents to assist humans in daily tasks

Meta has announced three major advancements toward the development of socially intelligent AI agents that can cooperate with and assist humans in their daily lives:

Habitat 3.0: The highest-quality simulator that supports robots and humanoid avatars and allows for human-robot collaboration in home-like environments. AI agents trained with Habitat 3.0 learn to find and collaborate with human partners in everyday tasks like cleaning up a house. These AI agents are evaluated with real human partners using a simulated human-in-the-loop evaluation framework (also provided with Habitat 3.0).

Habitat Synthetic Scenes Dataset (HSSD-200): An artist-authored 3D scene dataset that more closely mirrors physical scenes. It comprises 211 high-quality 3D scenes and a diverse set of 18,656 models of physical-world objects from 466 semantic categories.

HomeRobot: An affordable home robot assistant hardware and software platform in which the robot can perform open vocabulary tasks in simulated and physical-world environments.

Source

NVIDIA's AI teaches robots complex skills on par with humans

A new AI agent developed by NVIDIA Research that can teach robots complex skills has trained a robotic hand to perform rapid pen-spinning tricks as well as a human can, a first for a robot.

Pen spinning is one of nearly 30 tasks that robots have learned to expertly accomplish thanks to Eureka, which uses LLMs to automatically generate reward algorithms to train robots. Eureka is powered by GPT-4, and its reward programs outperform expert human-written ones on more than 80% of tasks.
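To make the idea of automatically generated reward programs concrete, here is a minimal Python sketch of a "propose candidates, train, keep the best" loop. It is not NVIDIA's implementation: the LLM call and the robot training are replaced by hypothetical stand-ins (`propose_reward_functions`, `train_policy`, `task_fitness`), and the "robot" is reduced to a single number, a toy pen spin speed.

```python
from typing import Callable, List, Tuple

# A reward program maps a toy "pen spin speed" to a reward value.
RewardFn = Callable[[float], float]


def propose_reward_functions() -> List[Tuple[str, RewardFn]]:
    """Hypothetical stand-in for the LLM call: returns candidate reward programs."""
    return [
        ("constant", lambda speed: 1.0),
        ("penalize_speed", lambda speed: -speed),
        ("reward_speed", lambda speed: speed),
    ]


def train_policy(reward_fn: RewardFn, actions: List[float]) -> float:
    """Hypothetical stand-in for RL training: pick the action this reward rates highest."""
    return max(actions, key=reward_fn)


def task_fitness(speed: float) -> float:
    """Task score used only to compare candidates (assumed here: faster spin is better)."""
    return speed


def select_best_reward(actions: List[float]) -> Tuple[float, str]:
    """Train one toy policy per candidate reward and keep the best-scoring candidate."""
    scored = []
    for name, reward_fn in propose_reward_functions():
        learned_speed = train_policy(reward_fn, actions)
        scored.append((task_fitness(learned_speed), name))
    return max(scored)


best_score, best_name = select_best_reward([0.0, 1.0, 2.0, 3.0])
print(best_name, best_score)
```

In this toy setup only the `reward_speed` candidate leads the policy to the fastest spin, so it is the one that gets selected. A real system would swap the stand-ins for an actual LLM and a physics simulator.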

Source

OpenAI’s secret sauce behind DALL-E 3’s accuracy

OpenAI published a paper on DALL-E 3, explaining why the new AI image generator follows prompts much more accurately than comparable systems.

Before training DALL-E 3, OpenAI trained its own AI image labeler, which it then used to relabel the image dataset that the DALL-E 3 image system was trained on. During the relabeling process, OpenAI paid particular attention to detailed descriptions.
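The relabeling step can be pictured with a short Python sketch. Everything here is illustrative: `Example`, `toy_captioner`, and the image IDs are hypothetical, and the captioner is a lookup table standing in for a trained model. The only point is the data flow: short original captions go in, more detailed ones come out, and the result becomes the training set.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class Example:
    image_id: str
    caption: str


def toy_captioner(image_id: str) -> str:
    """Hypothetical stand-in for a trained image labeler (a lookup table here)."""
    detailed = {
        "img_001": "A tabby cat sleeping on a red sofa beside a sunlit window.",
    }
    return detailed.get(image_id, "")


def relabel(dataset: List[Example], captioner: Callable[[str], str]) -> List[Example]:
    """Replace each caption with the captioner's output, keeping the original as a fallback."""
    relabeled = []
    for example in dataset:
        new_caption = captioner(example.image_id)
        relabeled.append(Example(example.image_id, new_caption or example.caption))
    return relabeled


original = [Example("img_001", "cat"), Example("img_002", "dog photo")]
relabeled = relabel(original, toy_captioner)
for example in relabeled:
    print(example.image_id, "->", example.caption)
```

The fallback to the original caption is a design choice of this sketch, not something the paper summary above describes.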

Source

Qualcomm's new PC chip for AI to challenge Apple, Intel

Qualcomm has unveiled a new laptop processor designed to outperform rival products from Intel Corp. and Apple Inc. Snapdragon X features 12 high-performance cores capable of crunching data at 3.8 gigahertz.

Qualcomm says the chip is as much as twice as fast as a similar 12-core processor from Intel while using 68% less power. It also claims the chip can operate at peak speeds 50% higher than Apple's M2 SoC.

In addition to overall improved performance, the new processor boasts features explicitly designed for AI. The chipmaker contends that AI’s full potential will be realized when it extends beyond data centers and into end-user devices such as smartphones and PCs.

Source

Samsung Galaxy S24 is your upcoming pocket AI machine

Samsung is going all in with AI on its next flagship. It wants to make the Galaxy S24, Galaxy S24+, and Galaxy S24 Ultra the smartest AI phones ever. The series will have features lifted straight from ChatGPT and Google Bard, such as the ability to create content and stories based on a few keywords provided by the user.

There will also be features Samsung has designed on its own, such as text-to-image Generative AI, and a lot of them will be available both online and offline. Speech-to-text functionality is one area that will see improvements.

Source

SAP brings new AI features to its spend management solutions

SAP announced new business AI and user experience innovations in its comprehensive spend management and business network solutions to help customers control costs, mitigate risk, and increase productivity.

SAP will also embed Joule, its new generative AI copilot, throughout its cloud solutions, with availability in its spend management software planned for 2024. It has also unveiled SAP Spend Control Tower, which offers advanced AI features and the ability to see across all SAP spend solutions.

All these new AI innovations are being developed with security, privacy, compliance, ethics, and accuracy in mind.

Source

OpenAI’s new rival Jina AI has open-source 8k context

Berlin-based AI company Jina AI has launched Jina-embeddings-v2, the world's first open-source 8K text embedding model. This model supports an impressive 8K context length, putting it on par with OpenAI's proprietary model. Jina-embeddings-v2 offers extended context potential, allowing for applications such as legal document analysis, medical research, literary analysis, financial forecasting, and conversational AI.
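A quick back-of-the-envelope calculation shows why a longer context matters for the document-heavy use cases above. The Python sketch below counts how many chunks a long document must be split into before embedding. The 20,000-token document and the 512-token comparison window are assumptions chosen for illustration; 8,192 tokens corresponds to the 8K context length.

```python
import math


def chunks_needed(num_tokens: int, context_length: int) -> int:
    """Number of chunks required to cover a document, ignoring any overlap."""
    if context_length <= 0:
        raise ValueError("context_length must be positive")
    return math.ceil(num_tokens / context_length)


doc_tokens = 20_000  # hypothetical long contract or research paper

short_context_chunks = chunks_needed(doc_tokens, 512)   # assumed short-context model
long_context_chunks = chunks_needed(doc_tokens, 8192)   # 8K context

print(short_context_chunks, long_context_chunks)  # prints: 40 3
```

Fewer chunks means each embedding covers more of the surrounding text, which is the practical appeal for long legal or medical documents.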

Benchmarking shows that it outperforms other leading base embedding models. The model is available in two sizes, a base model for heavy-duty tasks and a small model for lightweight applications. Jina AI plans to publish an academic paper, develop an embedding API platform, and expand into multilingual embeddings.

Source

LLM hallucination problem will be over with “Woodpecker”

Researchers from the University of Science and Technology of China and Tencent YouTu Lab have developed a framework called "Woodpecker" to correct hallucinations in multimodal large language models (MLLMs).

Woodpecker uses a training-free method to identify and correct hallucinations in the generated text. The framework works in five stages: key concept extraction, question formulation, visual knowledge validation, visual claim generation, and hallucination correction.
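The five stages can be sketched as a small Python pipeline. This is a toy illustration, not the researchers' code: all function names are hypothetical, the "image" is reduced to a set of detected object names, and the final stage simply drops unsupported sentences rather than rewriting them as a real system would.

```python
from typing import Dict, List, Set, Tuple


def extract_key_concepts(text: str, vocabulary: List[str]) -> List[str]:
    """Stage 1: find which known objects the generated text mentions."""
    lowered = text.lower()
    return [obj for obj in vocabulary if obj in lowered]


def formulate_questions(concepts: List[str]) -> Dict[str, str]:
    """Stage 2: turn each concept into a question about the image."""
    return {c: f"Is there a {c} in the image?" for c in concepts}


def validate_visual_knowledge(questions: Dict[str, str], detected: Set[str]) -> Dict[str, bool]:
    """Stage 3: answer each question (a set lookup stands in for vision models)."""
    return {c: c in detected for c in questions}


def generate_visual_claims(answers: Dict[str, bool]) -> List[str]:
    """Stage 4: restate the answers as claims about the image."""
    return [f"There is {'a' if ok else 'no'} {c} in the image." for c, ok in answers.items()]


def correct_hallucinations(text: str, answers: Dict[str, bool]) -> str:
    """Stage 5 (simplified): keep only sentences that mention no unverified object."""
    kept = []
    for sentence in text.split(". "):
        sentence = sentence.strip().rstrip(".")
        if not sentence:
            continue
        unsupported = any(c in sentence.lower() and not ok for c, ok in answers.items())
        if not unsupported:
            kept.append(sentence + ".")
    return " ".join(kept)


def run_pipeline(text: str, vocabulary: List[str], detected: Set[str]) -> Tuple[str, List[str]]:
    concepts = extract_key_concepts(text, vocabulary)
    questions = formulate_questions(concepts)
    answers = validate_visual_knowledge(questions, detected)
    claims = generate_visual_claims(answers)
    return correct_hallucinations(text, answers), claims


corrected, claims = run_pipeline(
    "A dog runs on the grass. A frisbee flies overhead.",
    vocabulary=["dog", "frisbee"],
    detected={"dog"},
)
print(corrected)
print(claims)
```

Because no step involves training, the stages can be swapped or improved independently, which is the appeal of a training-free design.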

The researchers have released the source code and an interactive demo of Woodpecker for further exploration and application. The framework has shown promising results in boosting accuracy and addressing the problem of hallucinations in AI-generated text.

Source

NVIDIA Research has announced new AI advancements

NVIDIA Research has announced new AI advancements that will be presented at the NeurIPS conference. The projects include new techniques for transforming text into images, photos into 3D avatars, and specialized robots into multi-talented machines.

The research focuses on generative AI models, reinforcement learning, robotics, and applications in the natural sciences. Highlights include improving text-to-image diffusion models, advancements in AI avatars, breakthroughs in reinforcement learning and robotics, and AI-accelerated physics, climate, and healthcare research. These advancements aim to accelerate the development of virtual worlds, simulations, and autonomous machines.

Source

OpenAI forms 'Preparedness' team to study advanced AI risks

To minimize risks from frontier AI as models continue to improve, OpenAI is building a new team called Preparedness. The team will tightly connect capability assessment, evaluations, and internal red teaming for frontier models, from the models OpenAI develops in the near future to those with AGI-level capabilities.

The team will help track, evaluate, forecast, and protect against catastrophic risks spanning multiple categories including:

Individualized persuasion
Cybersecurity
Chemical, biological, radiological, and nuclear (CBRN) threats
Autonomous replication and adaptation (ARA)

The Preparedness team's mission also includes developing and maintaining a Risk-Informed Development Policy (RDP). In addition, OpenAI is soliciting ideas for risk studies from the community, with $25,000 in API credits and a possible job at Preparedness on the line for up to ten top submissions.

Source

Google’s new ventures for safer, more secure AI

Google has announced a bug bounty program for attack scenarios specific to generative AI by expanding its Vulnerability Rewards Program (VRP) to cover AI. It shared guidelines so security researchers can see what’s “in scope”.

To further protect against machine learning supply chain attacks, Google is expanding its open source security work and building upon prior collaboration with the Open Source Security Foundation. It previously released the Secure AI Framework (SAIF), which emphasizes that AI ecosystems must have strong security foundations.

Google will also support a new effort by the non-profit MLCommons Association to develop standard AI safety benchmarks. The effort aims to bring together expert researchers across academia and industry to build benchmarks that distill the safety of AI systems into scores everyone can understand.

Source

Robot dog turns into a talking tour guide with ChatGPT

Named Spot, the four-legged robot can run, jump, and even dance. To make Spot “talk,” Boston Dynamics used OpenAI’s ChatGPT API, along with some open-source LLMs, to carefully train its responses. With ChatGPT, Spot can answer questions and generate responses about the company’s facilities while giving a tour.

Boston Dynamics also outfitted the bot with a speaker, added text-to-speech capabilities, and made its mouth mimic speech “like the mouth of a puppet”.

Source

That's all for now!

Subscribe to The AI Edge and gain exclusive access to content enjoyed by professionals from Moody’s, Vonage, Voya, WEHI, Cox, INSEAD, and other esteemed organizations.

Thanks for reading, and see you on Monday. 😊

The AI Edge
