Authors: Ye Yu, Haibo Jin, Yaoning Yu et al.
This paper introduces "Now You Hear Me," a text-to-audio jailbreak attack that exploits large audio-language models (ALMs) by embedding disallowed directives within narrative-style audio streams. Using advanced text-to-speech (TTS) models to manipulate acoustic and structural properties, the attack bypasses ALM safety mechanisms with a 98.26% success rate against models like Gemini 2.0 Flash. The findings expose a critical vulnerability in speech-based AI interfaces and underscore the need for multimodal safety frameworks that analyze both linguistic and paralinguistic features.
clinical triage
telemedicine
digital health
medical voice assistants
patient education platforms
Authors: Hamza Kalisch, Constantin Seibold, Jens Kleesiek et al.
This paper introduces Region-Normalized DPO (RN-DPO) to improve medical image segmentation models using inexpensive but noisy automatic quality-control signals, eliminating the need for additional pixel-wise annotations. RN-DPO enhances optimization stability by normalizing preference updates based on the size of the disagreement region between masks, leading to improved and more sustained segmentation performance, especially when judges are unreliable.
medical imaging
radiology
computational medicine
diagnostic imaging
AI in healthcare
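The region normalization described in the RN-DPO summary above can be illustrated with a rough sketch. This is not the authors' implementation: it assumes independent per-pixel Bernoulli foreground probabilities, a standard DPO log-sigmoid objective, and division of the preference margin by the number of pixels where the two masks disagree. The names `mask_log_prob_diff`, `region_normalized_dpo_loss`, and `beta` are hypothetical.

```python
import numpy as np

def mask_log_prob_diff(prob_fg, mask_w, mask_l, eps=1e-7):
    """Per-pixel log p(mask_w) - log p(mask_l), assuming independent Bernoulli pixels."""
    p = np.clip(prob_fg, eps, 1.0 - eps)
    lp_w = np.where(mask_w == 1, np.log(p), np.log(1.0 - p))
    lp_l = np.where(mask_l == 1, np.log(p), np.log(1.0 - p))
    return lp_w - lp_l  # zero wherever the two masks agree

def region_normalized_dpo_loss(policy_prob, ref_prob, mask_w, mask_l, beta=1.0):
    """DPO-style loss whose margin is divided by the disagreement-region size (assumed form)."""
    n_disagree = int((mask_w != mask_l).sum())
    if n_disagree == 0:
        return 0.0  # assumption: identical masks carry no preference signal
    margin = (mask_log_prob_diff(policy_prob, mask_w, mask_l)
              - mask_log_prob_diff(ref_prob, mask_w, mask_l)).sum()
    z = beta * margin / n_disagree
    return float(np.logaddexp(0.0, -z))  # equals -log(sigmoid(z))

# Preferred (w) and rejected (l) masks that differ on 4 pixels vs. on 1 pixel.
policy = np.full((2, 2), 0.8)
ref = np.full((2, 2), 0.5)
big_w, big_l = np.ones((2, 2), dtype=int), np.zeros((2, 2), dtype=int)
small_w = np.array([[1, 0], [0, 0]])
small_l = np.zeros((2, 2), dtype=int)

loss_big = region_normalized_dpo_loss(policy, ref, big_w, big_l)
loss_small = region_normalized_dpo_loss(policy, ref, small_w, small_l)
```

Under these assumptions the two mask pairs produce the same loss even though one disagreement region is four times larger, which is the kind of scale invariance the summary attributes to normalizing by region size. Without the division, the larger region would produce a proportionally larger margin and dominate the update.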
Authors: Anglin Liu, Ruichao Chen, Yi Lu et al.
Despite recent Multimodal Large Language Models (MLLMs)' linguistic prowess in medical diagnosis, we find even state-of-the-art MLLMs suffer from a critical perceptual deficit: geometric blindness. This failure to ground outputs in objective geometric constraints leads to plausible yet factually inc...
cs.CV
Authors: Wenxuan Li, Pedro R. A. S. Bassi, Lizhou Wu et al.
This paper introduces ePAI, an AI system for early and prediagnostic detection of pancreatic ductal adenocarcinoma (PDAC) from computed tomography (CT) scans. ePAI demonstrated high performance in both internal (AUC 0.939-0.999, sensitivity 95.3%, specificity 98.7%) and external validation cohorts, accurately localizing PDACs as small as 2 mm. Crucially, it detected previously overlooked PDACs 3-36 months before clinical diagnosis, significantly outperforming board-certified radiologists in sensitivity.
Oncology
Radiology
Gastroenterology
Surgical Oncology