Name
Exploring AI-driven innovation: benchmarking large language models for rare disease diagnostics
Date & Time
Thursday, April 3, 2025, 11:15 AM - 12:45 PM
Description

The session aims to facilitate engagement across GA4GH groups, specifically the Clinical & Phenotypic Data Capture (Clin/Pheno) Work Stream and the Rare Disease Community of Interest, through active participation in a benchmarking exercise focused on evaluating the use of large language models (LLMs) for rare disease diagnostics. The primary goal is to assess how effectively LLMs can organise complex knowledge and support diagnostic processes, ultimately contributing to the development of advanced AI tools for this field.  

A suggested proposal is to design benchmarking data for systems that query a knowledge base (or graph) with an AI component, helping us assess how well LLMs use structured data to answer specific questions. This could include tasks such as mapping phenotypes to potential rare disease names. Exploration of this topic began at the joint Clin/Pheno and Rare Disease Community meeting in January 2025 (see here).

Location Name
Monadnock
Full Address
Broad Institute of MIT and Harvard
Merkin Building
Cambridge, MA 02142
United States
Agenda