A systematic review of multi-modal large language models on domain-specific applications
Sirui Li, Kok Wai Wong, Guanjin Wang, Thach-Thao Duong
- Year: 2025
- Citations: 5
- Access: Open access
Abstract
While Large Language Models (LLMs) have shown remarkable proficiency in text-based tasks, they struggle to interact effectively with the real world without perception through other modalities, such as vision and audio. Multi-modal LLMs, which integrate these additional modalities, have become increasingly important across various domains. Despite the significant advancements and potential of multi-modal LLMs, no comprehensive PRISMA-based systematic review has examined their applications across different domains. This work aims to fill that gap by systematically reviewing and synthesising the quantitative research literature on domain-specific applications of multi-modal LLMs. The review follows the PRISMA guidelines to analyse research literature published after 2022, the year of the release of OpenAI’s ChatGPT-3.5. The literature search was conducted across several online databases, including Nature, Scopus, and Google Scholar. A total of 22 studies were identified: 11 focus on the medical domain, 3 on autonomous driving, and 2 on geometric analysis. The remaining 6 studies cover one topic each: climate, music, e-commerce, sentiment analysis, human-robot interaction, and construction. This review provides a comprehensive overview of the current state of multi-modal LLMs, highlights their domain-specific applications, and identifies gaps and future research directions.