โก Quick Summary
This study evaluated the effectiveness of ChatGPT in enhancing search strategies for systematic reviews on drug harms by identifying errors and generating omitted keywords. The results demonstrated that ChatGPT successfully detected errors in all search strategies and recreated a significant proportion of omitted keywords, indicating its potential as a valuable tool for evidence retrieval.
๐ Key Details
- ๐ Dataset: Systematic reviews from 10 high-impact journals (2013-2023)
- ๐ Focus: Search strategies for drug harms
- โ๏ธ Technology: ChatGPT (GPT-4)
- ๐ Performance: Error detection in all strategies, mean Jaccard’s similarity of 0.17 (strict) and 0.23 (semantic)
- ๐ Keyword recreation: 49% (strict) and 71% (semantic)
๐ Key Takeaways
- ๐ค ChatGPT effectively identified errors in search strategies for drug harms.
- ๐ Jaccard’s similarity increased from 0.17 to 0.23 with semantic matching.
- ๐ Keyword generation was successful, with 49% recreated under strict matching.
- ๐ก Semantic matching significantly improved keyword recreation to 71%.
- ๐ Potential tool for enhancing systematic reviews and evidence retrieval.
- ๐ Study published in the journal Computational Biology and Medicine.
- ๐๏ธ Study period: Data collected from November 2013 to November 2023.
๐ Background
Systematic reviews are essential for synthesizing evidence on drug harms, yet developing effective search strategies requires specialized expertise. Traditional methods often overlook critical keywords, leading to incomplete evidence synthesis. The integration of advanced technologies like ChatGPT could streamline this process, enhancing the accuracy and comprehensiveness of systematic reviews.
๐๏ธ Study
The study involved a comprehensive literature search in PubMed, focusing on systematic reviews of drug harms published in high-impact journals. Sixteen search strategies were selected, each intentionally containing a single error of omission. The performance of ChatGPT was evaluated based on its ability to detect these errors and generate relevant keywords, using both strict and semantic matching criteria.
๐ Results
ChatGPT demonstrated remarkable effectiveness, identifying errors in all search strategies. The mean Jaccard’s similarity measure was recorded at 0.17 under strict matching, which improved to 0.23 with semantic matching. Furthermore, the proportion of omitted keywords recreated by ChatGPT was 49% using strict matching, which increased to 71% with semantic matching, showcasing its potential in enhancing search strategies.
๐ Impact and Implications
The findings of this study highlight the transformative potential of ChatGPT in the realm of systematic reviews. By effectively detecting errors and generating relevant keywords, ChatGPT can significantly improve the quality of evidence retrieval on drug harms. This advancement could lead to more comprehensive systematic reviews, ultimately benefiting healthcare professionals and researchers in making informed decisions based on robust evidence.
๐ฎ Conclusion
This study underscores the promising role of ChatGPT as a tool for enhancing search strategies in systematic reviews on drug harms. Its ability to detect errors and generate relevant keywords paves the way for more accurate and thorough evidence synthesis. As we continue to explore the integration of AI technologies in research, the future looks bright for improving systematic review methodologies.
๐ฌ Your comments
What are your thoughts on the use of AI in enhancing systematic reviews? We would love to hear your insights! ๐ฌ Leave your comments below or connect with us on social media:
Enhancing search strategies for systematic reviews on drug Harms: An evaluation of the utility of ChatGPT in error detection and keyword generation.
Abstract
OBJECTIVE: Developing search strategies for synthesizing evidence on drug harms requires specialized expertise and knowledge. The aim of this study was to evaluate ChatGPT’s ability to enhance search strategies for systematic reviews of drug harms by identifying missing and generating omitted keywords.
MATERIALS AND METHODS: A literature search in PubMed identified systematic reviews of drug harms from 10 high-impact journals between 1-Nov-2013 to 27-Nov-2023. Sixteen search strategies used in these reviews were selected each with a single error of omission introduced. ChatGPT’s (GPT-4) performance was evaluated based on error detection, similarity between the extracted and generated search strategies via strict and semantic keyword matching, and proportion of omitted keywords generated.
RESULTS: ChatGPT identified the introduced errors in all search strategies. Under strict matching, the mean Jaccard’s similarity measure was 0.17 (range: 0.00-0.52) and with semantic matching this increased to 0.23 (range: 0.00-0.53). Similarly, the mean proportion of keywords recreated by ChatGPT was 49ย % using strict matching increasing to 71ย % with semantic matching.
DISCUSSION AND CONCLUSION: ChatGPT effectively detected errors and generated relevant keywords, showing potential as a tool for evidence retrieval on drug harms.
Author: [‘Gitman V’, ‘Maxwell C’, ‘Gamble JM’]
Journal: Comput Biol Med
Citation: Gitman V, et al. Enhancing search strategies for systematic reviews on drug Harms: An evaluation of the utility of ChatGPT in error detection and keyword generation. Enhancing search strategies for systematic reviews on drug Harms: An evaluation of the utility of ChatGPT in error detection and keyword generation. 2025; 193:110464. doi: 10.1016/j.compbiomed.2025.110464