Chinese researchers have unveiled a significant security loophole in widely used commercial multimodal large language models (MLLMs) such as ChatGPT, Bard, and Bing Chat. These models, deployed by tech giants like Google, Microsoft, and Baidu, are fundamental components of various applications, from virtual assistants to content moderation systems.
The researchers discovered that the vulnerabilities in these MLLMs could be exploited using manipulated images that closely resembled the originals. By making minute alterations almost invisible to the human eye, the researchers effectively bypassed the models’ built-in filters designed to weed out toxic or inappropriate content.
For instance, researchers in Beijing have identified a significant vulnerability in AI models like ChatGPT. Under attack, these models could mistake giant pandas for humans or fail to detect harmful content, highlighting a critical security flaw in commercial AI systems.
Among the affected models, Bard, equipped with face and toxicity detection mechanisms, could generate inappropriate descriptions of harmful content when compromised. The Chinese research team even provided code demonstrating how these adversarial examples could mislead AI models. Their experiments yielded a success rate of 22% against Bard, 26% against Bing Chat, and a staggering 86% against Ernie Bot.

Wu Zhaohui, China’s vice minister of science and technology, addressed these alarming findings at the Global AI Security Summit in the UK. He emphasized the urgent need for stronger technical risk controls in AI governance, urging the global community to address the vulnerabilities discovered in these widely used language models.
One of the key challenges highlighted by the research is the existing imbalance between efforts focused on attacking and defending AI models. While adversarial attacks have garnered significant attention, there remains a lack of robust defense strategies. Traditional defense methods might come at the cost of accuracy and computational resources, making it imperative to explore innovative solutions.
To address these vulnerabilities, the researchers suggested preprocessing-based defenses as a potential solution, especially for large-scale foundation models. These defenses aim to ensure the robustness of MLLMs against adversarial attacks, paving the way for future research and development in AI security.
This discovery underscores the critical importance of enhancing AI technologies’ security infrastructure. As these models become increasingly integrated into everyday applications, it is essential to fortify their defenses against malicious exploitation, ensuring a safer and more secure digital landscape for users worldwide.
Related:
- Major Outage at OpenAI, ChatGPT Down for Users Globally
- OpenAI Unveils New Features for ChatGPT at Inagural Developer Conference
- YouTube generative AI will soon let you have a conversation about video you’ve just watched
(via)




























YouTube generative AI will soon let you have a conversation about video you’ve just watched
In a move to enhance user engagement and improve content comprehension, YouTube is testing two new AI-powered features: comment topics and a conversational AI tool. These features are currently in an experimental phase and are available to a limited number of YouTube Premium members.
The comment topics tool aims to make navigating lengthy comment sections easier for both viewers and creators. By utilizing AI, YouTube automatically identifies and organizes comments into relevant themes or topics. This allows viewers to quickly locate specific discussions, while creators can effortlessly engage with their audience on specific aspects of their content.
Creators have the option to remove comment topics by deleting individual comments associated with that topic. Additionally, comment topics are only generated for published comments, excluding blocked or under review comments. Currently, comment topics are limited to a select few English-language videos with extensive comment sections.
YouTube’s conversational AI tool takes viewer engagement a step further by providing real-time interactions during video playback. Viewers can ask questions about the video, receive recommendations for related content, and even take quizzes to enhance their understanding of educational videos.
This feature seamlessly integrates into the viewing experience without disrupting the video playback. The conversational AI tool utilizes natural language processing to understand user queries and provide accurate, relevant responses.
As these features continue to undergo testing and development, it will be interesting to observe how they impact user engagement, content consumption patterns, and the overall YouTube ecosystem.
Related:
(Via)