The framework, designed with input from clinical subject matter experts across multiple specialties, evaluates AI-generated responses along five critical dimensions: query comprehension, response helpfulness, correctness, completeness, and potential for clinical harm. It serves as a comprehensive assessment to ensure that AI-powered tools not only provide accurate and relevant information but also align with the practical and current needs of healthcare professionals at the point of care. . . .
In a recent evaluation study of ClinicalKey AI, Elsevier worked with a panel of 41 board-certified physicians and clinical pharmacists to rigorously test responses generated by the tool for a diverse set of clinical queries. The panel evaluated 426 query-response pairs, and the results demonstrated impressive performance: 94.4% of responses were rated as helpful, 95.5% were assessed as completely correct, and just 0.47% were flagged for potential improvements.