Collaborative AI Achieves Top Scores in U.S. Medical Licensing Exams

A new study shows that a council of AI models working together has achieved record-breaking scores on U.S. medical licensing exams, demonstrating the power of collaborative AI for healthcare accuracy.
Researchers have developed an innovative approach using a council of AI models that collaboratively discuss and refine their answers, leading to unprecedented performance on U.S. medical licensing exams (USMLE). This collaborative AI system, composed of multiple instances of OpenAI's GPT-4, engaged in coordinated and iterative exchanges to reach consensus responses. In a recent study published in PLOS Medicine, this method achieved accuracy rates of 97%, 93%, and 94% on Step 1, Step 2 CK, and Step 3 exams respectively, surpassing individual GPT-4 performance.
The study highlights how collective decision-making among AI agents can enhance diagnostic accuracy, which is crucial in healthcare. When faced with 325 publicly available USMLE questions covering biomedical sciences and clinical diagnosis, the council demonstrated the ability to self-correct, correcting over half of the responses that were initially incorrect based on majority voting.
This approach addresses common issues in language models, such as inconsistent responses and hallucinations, by utilizing group deliberation. The authors suggest that such collaborative AI could serve as more reliable tools for medical education and, potentially, clinical decision-making in the future. However, they underscore that this paradigm has yet to be tested in real-world clinical settings.
The study emphasizes that embracing variability and teamwork in AI responses can unlock new possibilities for medical applications, promoting higher accuracy and trustworthiness. As noted by researchers Shaikh, Siddiqui, and Asiyah, collaborative dialogue among AI systems can lead to improved self-correction and performance, offering a promising avenue for advancing AI in healthcare.
Source: https://medicalxpress.com/news/2025-10-collaborative-ai-medical-exams.html
Stay Updated with Mia's Feed
Get the latest health & wellness insights delivered straight to your inbox.
Related Articles
Innovative Use of AI and Microbiome Analysis Enhances Accuracy in Detecting Colorectal Cancer
Researchers utilize AI and gut microbiome profiling to develop a highly accurate, noninvasive method for early colorectal cancer detection, potentially transforming screening practices.
Innovative Non-Hormonal Hydrogel Offers New Hope for Menopausal Vaginal Health
A groundbreaking non-hormonal hydrogel developed at UC San Diego shows potential to treat menopausal vaginal changes, especially for women seeking hormone-free solutions. Preclinical trials indicate promising tissue regeneration and safety, paving the way for new therapies to improve quality of life during menopause.
FDA Grants Breakthrough Status to AI-Driven Breast Cancer Risk Assessment Tool
A novel AI-powered mammogram analysis tool from Washington University has received FDA breakthrough status, promising more accurate five-year breast cancer risk predictions to improve early detection and prevention.
UK COVID Response Failure Led to Preventable Deaths, Experts Say
Experts warn that the UK’s failure to prioritize COVID suppression early in the pandemic led to thousands of preventable deaths, highlighting systemic issues in pandemic decision-making.



