Abstract: Vision large language models (VLMs) combine visual understanding with natural language processing, enabling tasks like image captioning, visual question answering, and video analysis. While ...
Abstract: With the advancement of high-resolution aerial imaging technology enabled by unmanned aerial vehicles (UAVs), insulator defect detection based on images has emerged as a key approach for ...