
Evaluating the performance of five large language models in answering Delphi consensus questions relating to patellar instability and medial patellofemoral ligament reconstruction.
Among the AI models evaluated on patellar instability questions, ChatGPT4o and Claude2 excel, while Google Gemini underperforms. 📊🤖







