NVIDIA Brings 4x AI Boost with TensorRT-LLM😲⚡
Plus: ChatGPT outperforms doctors, BlackBerry's new cybersecurity AI assistant.
Hello Engineering Leaders and AI Enthusiasts!
Welcome to the 128th edition of The AI Edge newsletter. This edition brings you NVIDIA’s TensorRT-LLM, which is coming to Windows and can run LLMs up to 4x faster.
And a huge shoutout to our incredible readers. We appreciate you😊
In today’s edition:
⚡ NVIDIA brings 4x AI boost with TensorRT-LLM
🏥 ChatGPT outperforms doctors in depression treatment
🔐 BlackBerry announces Gen AI cybersecurity assistant
🧠 Knowledge Nugget: Fixing search with AI?
Let’s go!
NVIDIA brings 4x AI boost with TensorRT-LLM
NVIDIA is bringing TensorRT-LLM, its open-source library for accelerating large language model inference, to Windows, providing up to a 4x boost on consumer PCs running GeForce RTX and RTX Pro GPUs. The update includes a new scheduler called In-Flight batching, which lets smaller queries be processed dynamically alongside larger, compute-intensive tasks.
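To make the scheduling idea concrete, here is a minimal toy simulation of in-flight batching compared with fixed batches. It is only a sketch under stated assumptions, not NVIDIA's scheduler: the names `Request`, `static_batching`, and `in_flight_batching` are hypothetical, and it assumes every active request produces exactly one token per step.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class Request:
    name: str
    tokens_left: int  # decoding steps still needed (assumed to be >= 1)


def static_batching(requests, batch_size):
    """Fixed batches: each batch runs until its longest request finishes."""
    steps = 0
    for i in range(0, len(requests), batch_size):
        batch = requests[i:i + batch_size]
        steps += max(r.tokens_left for r in batch)
    return steps


def in_flight_batching(requests, batch_size):
    """Finished requests leave the batch at once; waiting ones take the free slot."""
    # Copy the requests so the caller's objects are not modified.
    waiting = deque(Request(r.name, r.tokens_left) for r in requests)
    active = []
    finished_at = {}
    steps = 0
    while waiting or active:
        # Fill any free slots before the next decoding step.
        while waiting and len(active) < batch_size:
            active.append(waiting.popleft())
        steps += 1
        for r in active:
            r.tokens_left -= 1
            if r.tokens_left == 0:
                finished_at[r.name] = steps
        # Drop completed requests so their slots free up immediately.
        active = [r for r in active if r.tokens_left > 0]
    return steps, finished_at


# One long task plus three short queries, two slots on the (toy) GPU.
jobs = [Request("long", 8), Request("a", 2), Request("b", 2), Request("c", 2)]
print(static_batching(jobs, 2))        # 10
print(in_flight_batching(jobs, 2)[0])  # 8
```

In this toy setup the short queries finish while the long task is still running instead of waiting for it, which is the behavior the scheduler description points to.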
Optimized open-source models are now available for download, with larger speedups at larger batch sizes. TensorRT-LLM can enhance daily productivity tasks such as chat engagement, document summarization, email drafting, data analysis, and content generation. Paired with a localized library of specific datasets, it can also address the problem of outdated or incomplete information in model answers. TensorRT acceleration is now available for Stable Diffusion as well, speeding up generative AI diffusion models by up to 2x.
The company has also released RTX Video Super Resolution version 1.5, which improves the quality of streamed video.
Why does this matter?
LLM-powered applications that run up to 4x faster should feel noticeably smoother to users. Faster local inference also makes it more practical to cut or automate routine productivity tasks on a PC. TensorRT acceleration for Stable Diffusion and the RTX Video update are likely to benefit gaming, media, and content creation.
ChatGPT outperforms doctors in depression treatment
According to a new study, ChatGPT makes unbiased, evidence-based treatment recommendations for depression that are consistent with clinical guidelines and outperform those of primary care physicians. The study compared the evaluations and treatment recommendations for depression generated by ChatGPT-3.5 and ChatGPT-4 with those of primary care physicians.
Vignettes describing patients with different attributes and depression severity were input into the chatbot interfaces.
However, further research is needed to refine the chatbot recommendations for severe cases and to address potential risks and ethical issues associated with using artificial intelligence in clinical decision-making.
Why does this matter?
Unlike primary care physicians, ChatGPT showed no bias in its recommendations based on patient gender or socioeconomic status. Its recommendations also aligned well with accepted guidelines for managing mild and severe depression.
BlackBerry announces Gen AI cybersecurity assistant
BlackBerry has announced a new generative AI-powered cybersecurity assistant for its Cylance AI customers. The solution predicts customer needs and proactively provides information, eliminating the need for manual questions. It compresses research hours into seconds and offers a natural workflow instead of an inefficient chatbot experience.
BlackBerry, known for its innovation in the technology industry, holds more than five times as many AI/ML patents as its competitors. The company was also one of the first signatories of Canada's voluntary Code of Conduct on the responsible development and management of advanced Generative AI systems. The cybersecurity assistant will initially be available to a select group of customers.
Why does this matter?
In an era of constantly evolving cyber threats, end users benefit from rapid and proactive cybersecurity assistance. The assistant appears to offer better protection against those threats, making digital activities safer.
Knowledge Nugget: Fixing search with AI?
This intriguing article discusses the limitations of traditional search methods and explores how machine learning can improve search accuracy. It highlights the challenges of using large language models for search, such as outdated information and limited personalization options. The article then presents an alternative approach that uses machine learning to assist search, focusing specifically on finding papers that mention new datasets. It introduces the SetFit library, which achieves high accuracy with minimal labeled data.
Why does this matter?
The article provides a step-by-step process for labeling data and training a model, resulting in a more effective search system for finding relevant papers and information.
What Else Is Happening❗
🤝 NVIDIA collabs with Foxconn to accelerate the development of electric vehicles
The collaboration will use NVIDIA's automotive solutions, including the Drive Hyperion 9 platform and the upcoming Drive Thor superchip, which is expected to deliver high-performance computing power for safe and intelligent driving.
🎓 DeepLearning.AI launches new GenAI course
It will help you learn how generative AI works and how to use it in your life and at work. It doesn’t require any coding skills or prior knowledge of AI.
💰 Oracle’s NetSuite is adding genAI capabilities to its finance software
The additions let companies automate tasks such as writing collections letters or tracking delayed purchases. The new "Text Enhance" feature uses Oracle's cloud-based systems to develop AI that can read and write human-like text.
👮 Dubai Police has revealed an AI-powered driverless security patrol car
The car is designed for residential areas. It has advanced cameras, AI, and smart technology to detect criminal behavior, recognize faces, and read license plates.
🎤 Stardog has launched Voicebox
Voicebox aims to simplify access to business insights by allowing users to ask questions using ordinary language and receive answers based on enterprise data without requiring any technical skills.
That's all for now!
If you are new to The AI Edge newsletter, subscribe to get daily AI updates and news directly sent to your inbox for free!
Thanks for reading, and see you tomorrow. 😊