AI Weekly Rundown (June 24 to June 30)

News from Microsoft, Google, Baidu, Unity, Salesforce, Databricks, Meta, and more.

Jul 01, 2023

Hello, Engineering Leaders and AI Enthusiasts,

Another eventful week in the AI realm. Lots of big news from huge enterprises.

In today’s edition:

⚡️ Microsoft ZeRO++: Unmatched efficiency for LLM training
📦 RepoFusion: Training code models to understand your repository
🏃‍♂️ MotionGPT: A versatile text-to-motion AI
🎨 DragDiffusion: Giving Diffusion models interactive point-based image editing
🧠 Google’s new pgvector power AI applications
📊 Verge polled 2k people about using AI
🏆 Baidu’s Ernie 3.5 beat ChatGPT on multiple metrics
💡 Unity's game-changing AI products for game development
🤖 Google DeepMind's upcoming chatbot to rival ChatGPT
🚀 Salesforce’s XGen can replace Meta’s LLaMA
🔥 Databricks launches LakehouseIQ and Lakehouse AI tools
🎯 Gen AI is now a bankable skill backed by industry titans
🖼️ AI converts brain EEG signals to HQ images
🔍 Meta discloses AI behind Facebook and Instagram recommendations
🌟 OpenFlamingo V2 launched 5 newly trained multimodal models

Let’s go!

Microsoft ZeRO++: Unmatched efficiency for LLM training

Training large models requires considerable memory and computing resources across hundreds or thousands of GPU devices. Efficiently leveraging these resources requires a complex system of optimizations to:

1)Partition the models into pieces that fit into the memory of individual devices

2)Efficiently parallelize computing across these devices

But training on many GPUs results in small per-GPU batch size, requiring frequent communication and training on low-end clusters where cross-node network bandwidth is limited results in high communication latency.

To address these issues, Microsoft Research has introduced three communication volume reduction techniques, collectively called ZeRO++.

It reduces total communication volume by 4x compared with ZeRO without impacting model quality, enabling better throughput even at scale.

The AI Edge

AI Weekly Rundown (June 24 to June 30)

News from Microsoft, Google, Baidu, Unity, Salesforce, Databricks, Meta, and more.

Microsoft ZeRO++: Unmatched efficiency for LLM training

RepoFusion: Training code models to understand your repository

MotionGPT: A versatile text-to-motion AI

DragDiffusion: Giving Diffusion models interactive point-based image editing

Google’s new pgvector power AI-enabled applications

Verge polled 2k people about using AI

Baidu’s Ernie 3.5 beats ChatGPT on multiple metrics

Unity's game-changing AI products for game development

Google DeepMind's upcoming chatbot to rival ChatGPT

Salesforce’s XGen can replace Meta’s LLaMA

Databricks launches LakehouseIQ and Lakehouse AI tools

Gen AI is now a bankable skill backed by industry titans

AI converts brain EEG signals to HQ images

Meta disclosed AI behind Facebook and Instagram recommendations

OpenFlamingo V2 launched 5 newly trained multimodal models

Discussion about this post