あなたが思うより、ジェイルブレイクは簡単
Monday, March 17, 2025
本ブログは Jailbreaking is (mostly) simple than you think の抄訳版です。最新の情報は原文を参照してください。 コンテンツに関する警告: こ
Monday, March 17, 2025
本ブログは Jailbreaking is (mostly) simple than you think の抄訳版です。最新の情報は原文を参照してください。 コンテンツに関する警告: こ
Friday, March 14, 2025
We are excited to announce the winners of LLMail-Inject, our first Adaptive Prompt Injection Challenge! The challenge ran from December 2024 until February 2025 and was featured as one of the four official competitions of the 3rd IEEE Conference on Secure and Trustworthy Machine Learning (IEEE SaTML). The overall aims of this challenge were to advance the state-of-the-art defenses against indirect prompt injection attacks and to broaden awareness of these new techniques.
Thursday, March 13, 2025
Content warning: This blog post contains discussions of sensitive topics. These subjects may be distressing or triggering for some readers. Reader discretion is advised. Today, we are sharing insights on a simple, optimization-free jailbreak method called Context Compliance Attack (CCA), that has proven effective against most leading AI systems. We are disseminating this research to promote awareness and encourage system designers to implement appropriate safeguards.
Tuesday, March 11, 2025
2025 年 3 月 11 日 (米国時間) 、マイクロソフトは、マイクロソフト製品に影響する脆弱性を修正するために、セキ