Researchers at Brown University in Providence, R.I. found artificial inelligence models were able to pass written and oral neurosurgery exams, The Brown Daily Herald reported April 25.
The Brown University neurosurgery department compared ChatGPT, GPT-4 and Google Bard in the written board exams and oral board prepatory question bank. The AI was able to pass written exams with "flying colors" and the complex oral exam questions "superbly."
The AI wasn't perfect, however. GPT-4 correctly said it couldn't answer a question about an image as a text-based model, while Google Bard made up an answer. AT models also asserted false facts in some instances.
"It’s important to address the viral social media attention that these (models) have gained, which suggest that (they) could be a brain surgeon, but also important to clarify that these models are not yet ready for primetime and should not be considered a replacement for human activities currently," Rohaid Ali, MD, a fifth-year neurosurgery resident and co-first author of the study said in the report. "As neurosurgeons, it’s crucial that we safely integrate AI models for patient usage and actively investigate their blind spots to ensure the best possible care for the patients."