MIL-OSI Banking: Panasonic HD develops “SparseVLM” technology that doubles the processing speed of Vision-Language Models

Source: Panasonic

Figure 1: Comparison of “SparseVLM” and existing sparsification methods (quoted from the accepted paper)

Osaka, Japan, July 4, 2025 – Panasonic R&D Company of America (PRDCA) and Panasonic Holdings Co., Ltd. (Panasonic HD), in collaboration with researchers from Peking University, Fudan University, University of California, Berkeley, and Shanghai Jiao Tong University, have developed “SparseVLM,” a technology that speeds up Vision-Language Models (VLMs), AI models that can understand and process both visual data such as images and videos, and text data.

In recent years, VLMs have seen rapid development. These models can process visual and textual information simultaneously and can answer questions about visual content. However, handling a large amount of data, especially high-resolution images and long videos, leads to longer inference times and higher computational complexity for the AI model. “SparseVLM” adopts a novel approach by focusing solely on the visual information relevant to the input prompt (Figure 1), significantly reducing inference time and computational complexity while maintaining high accuracy in answering questions about images.

This research has been accepted for presentation at the 42nd International Conference on Machine Learning (ICML2025), one of the premier conferences for AI and machine learning research. The conference will take place in Vancouver, Canada from July 13 to July 19, 2025.
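The general idea of keeping only the visual information relevant to the prompt can be illustrated with a small sketch. This is not the SparseVLM algorithm from the accepted paper, whose details are not given in this announcement. It is a simplified illustration that assumes visual tokens and prompt tokens are already available as embedding vectors, scores each visual token by its best cosine similarity to any prompt token, and keeps the top-scoring fraction. The function names `cosine` and `select_relevant_tokens` and the `keep_ratio` parameter are hypothetical and chosen only for this example.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def select_relevant_tokens(
    visual_tokens: Sequence[Vector],
    prompt_tokens: Sequence[Vector],
    keep_ratio: float = 0.5,
) -> List[int]:
    """Return indices of the visual tokens most similar to the prompt.

    Hypothetical helper for illustration only: relevance is the highest
    cosine similarity between a visual token and any prompt token.
    Indices are returned in their original order.
    """
    if not 0.0 < keep_ratio <= 1.0:
        raise ValueError("keep_ratio must be in (0, 1]")
    if not visual_tokens:
        return []

    # Score every visual token against the prompt.
    scores = [
        max((cosine(v, p) for p in prompt_tokens), default=0.0)
        for v in visual_tokens
    ]

    # Keep at least one token, and at most all of them.
    keep = max(1, math.ceil(keep_ratio * len(visual_tokens)))

    # Rank by score (highest first), then restore positional order.
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:keep])


# Toy example: four 2-D "visual tokens" and a single "prompt token".
visual = [[1.0, 0.0], [0.0, 1.0], [0.9, 0.1], [-1.0, 0.0]]
prompt = [[1.0, 0.0]]
print(select_relevant_tokens(visual, prompt, keep_ratio=0.5))  # prints [0, 2]
```

In this toy case half of the visual tokens are dropped before any further processing. Because the cost of running a model grows with the number of tokens it has to process, passing fewer visual tokens forward is the kind of saving that prompt-guided sparsification aims for.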
