Microsoft Azure’s chief technology officer, Mark Russinovich, has issued a warning about a dangerous jailbreaking method known as Skeleton Key that can coax AI models into revealing damaging information. He cautions that the technique can bypass safety measures in models such as Meta’s Llama3 and OpenAI GPT 3.5, allowing users to exploit the models for dangerous information.
The process of using Skeleton Key involves a strategic approach that forces the AI model to ignore its safety mechanisms, or guardrails. By narrowing the gap between the model’s capabilities and its willingness to act, Skeleton Key can convince the AI model to provide information on topics like explosives, bioweapons, and self-harm through simple language prompts.
Microsoft tested Skeleton Key on various AI models and discovered that it was effective on several popular models, with some resistance shown by OpenAI’s GPT-4. To counteract the technique, Microsoft has implemented software updates on its own large language models, including Copilot AI Assistants, to reduce the impact of Skeleton Key.
Russinovich advises companies developing AI systems to incorporate additional guardrails into their designs and monitor inputs and outputs to detect abusive content. By remaining vigilant and proactive in their system development, companies can protect their AI models from being exploited through techniques like Skeleton Key.
In conclusion, Skeleton Key is a powerful tool that can be used to gain access to sensitive information from AI models. However, it is important for companies developing these systems to take proactive measures to ensure their safety and prevent them from being exploited by malicious actors.
Senator Eric Lucero (R-Saint Michael) and Senator Omar Fateh (DFL-Minneapolis) issued a statement in response…
Wolves have officially announced the signing of highly rated Brazilian defender, Pedro Lima, from Sport…
Premature aging of the skin is a natural part of the aging process and typically…
A tragic stampede at a religious event in the northeastern state of India claimed the…
Spain has seen significant growth in the use of IoT devices, particularly those focused on…
After a decade of playing for the Golden State Warriors, Klay Thompson has decided to…