머숨 미러

1. 구글의 Med-PALM 2

USMLE에서 정답률 85% (기사에 따르면 대략적으로 60%가 합격선)

https://www.medpagetoday.com/special-reports/exclusives/103522

Google AI Performs at 'Expert' Level on U.S. Medical Licensing Exam

Latest version of medically tuned AI model achieved 85% accuracy, beating previous record

www.medpagetoday.com

인간과 답변 퀄리티 비교

The generation capabilities of large language models also enable them to produce long-form answers to consumer medical questions. However, ensuring model responses are accurate, safe, and helpful has been a crucial research challenge, especially in this safety-critical domain.

In a pairwise study, Med-PaLM 2 answers were preferred to physician answers across eight of nine axes considered.

2. 마이크로소프트 GPT-4의 성능:

https://arxiv.org/pdf/2303.13375.pdf

Prompt crafting (GPT-4 가 잘 이해하게 전처리하는 작업) 없이도 USMLE 합격점 20점 이상

Our results show that GPT-4, without any specialized prompt crafting, exceeds the passing score on USMLE by over 20 points and outperforms earlier general-purpose models (GPT-3.5) as well as models specifically fine-tuned on medical knowledge (Med-PaLM, a prompt-tuned version of Flan-PaLM 540B).

생성형 AI가 usmle는 뚫은듯

댓글 0

생성형 AI가 usmle는 뚫은듯

댓글 0

다른 게시글

의대랑 ai협력연구 많이 진행했었는데

의새… 애완용으로 키우고 싶다..

이새끼들 자꾸ai로 대체된다면서

국가를 막론하고 가장 힘 있는 사람들은

파업에 대한 의사 선생님들의 일침..jpg

진짜 궁금한거있음 설명좀

한무당들 의사인척 하지마라

이젠 오르비에서 쪽팔려서

아 당직서기 싫다

우리나라 2023년 출산율이 0.72 2024에는 0.6대로 떨어진대