QaVA: Query-Aware Video Analysis Framework Based on Data Access Pattern

Tianxiong Zhong; Zhiwei Zhang; Yihang Fu; Guo Lu; Ye Yuan; Guoren Wang

doi:10.1109/ICDE65448.2025.00071

QaVA: Query-Aware Video Analysis Framework Based on Data Access Pattern

Tianxiong Zhong, Zhiwei Zhang^*, Yihang Fu, Guo Lu, Ye Yuan, Guoren Wang

^*此作品的通讯作者

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

摘要

With the explosive growth of video data, efficient video analysis technology has garnered widespread attention. Existing online methods train proxy neural networks upon query arrival and use these networks to scan the entire dataset, guiding the invocation of the expensive deep neural network. While index-based methods advance this process to the index-building stage, significantly reducing the time overhead of video queries. However, the data to query often presents a long-tail distribution, and different types of queries are sensitive to different parts of the distribution. Since the index-based methods cannot predict the queries, they can only provide ad-hoc proxy score generating strategies. This paper proposes a query-aware video analysis framework, QaVA, to improve query performance further. QaVA retains the time-consuming, query-independent semantic extraction process during the index-building stage and employs a tunable lightweight adapter network to accurately and quickly focus on the data parts most relevant to the query after it arrives. Meanwhile, QaVA can automatically tune the training strategy of the adapter network by analyzing the data access pattern of historical queries, thus meeting the needs of general users. Experimental results demonstrate that QaVA can significantly reduce the cost of various queries across multiple datasets, and can speed up query processing by up to 9.2× compared to the most advanced index-based method. Our code is available: http://github.com/InkosiZhong/QaVA.

源语言	英语
主期刊名	Proceedings - 2025 IEEE 41st International Conference on Data Engineering, ICDE 2025
出版商	IEEE Computer Society
页	877-890
页数	14
ISBN（电子版）	9798331536039
DOI	http://doi.org/10.1109/ICDE65448.2025.00071
出版状态	已出版 - 2025
已对外发布	是
活动	41st IEEE International Conference on Data Engineering, ICDE 2025 - Hong Kong, 中国期限: 19 5月 2025 → 23 5月 2025

出版系列

姓名	Proceedings - International Conference on Data Engineering
ISSN（印刷版）	1084-4627
ISSN（电子版）	2375-0286

会议

会议	41st IEEE International Conference on Data Engineering, ICDE 2025
国家/地区	中国
市	Hong Kong
时期	19/05/25 → 23/05/25

访问文件

10.1109/ICDE65448.2025.00071

其它文件与链接

链接到 Scopus 的出版物

引用此

Zhong, T., Zhang, Z., Fu, Y., Lu, G., Yuan, Y., & Wang, G. (2025). QaVA: Query-Aware Video Analysis Framework Based on Data Access Pattern. 在 Proceedings - 2025 IEEE 41st International Conference on Data Engineering, ICDE 2025 (页码 877-890). (Proceedings - International Conference on Data Engineering). IEEE Computer Society. http://doi.org/10.1109/ICDE65448.2025.00071

@inproceedings{aaf1ca3e149d47c3a58e70e169e7f649,

title = "QaVA: Query-Aware Video Analysis Framework Based on Data Access Pattern",

abstract = "With the explosive growth of video data, efficient video analysis technology has garnered widespread attention. Existing online methods train proxy neural networks upon query arrival and use these networks to scan the entire dataset, guiding the invocation of the expensive deep neural network. While index-based methods advance this process to the index-building stage, significantly reducing the time overhead of video queries. However, the data to query often presents a long-tail distribution, and different types of queries are sensitive to different parts of the distribution. Since the index-based methods cannot predict the queries, they can only provide ad-hoc proxy score generating strategies. This paper proposes a query-aware video analysis framework, QaVA, to improve query performance further. QaVA retains the time-consuming, query-independent semantic extraction process during the index-building stage and employs a tunable lightweight adapter network to accurately and quickly focus on the data parts most relevant to the query after it arrives. Meanwhile, QaVA can automatically tune the training strategy of the adapter network by analyzing the data access pattern of historical queries, thus meeting the needs of general users. Experimental results demonstrate that QaVA can significantly reduce the cost of various queries across multiple datasets, and can speed up query processing by up to 9.2× compared to the most advanced index-based method. Our code is available: http://github.com/InkosiZhong/QaVA.",

keywords = "deep learning, object detection, video analytics",

author = "Tianxiong Zhong and Zhiwei Zhang and Yihang Fu and Guo Lu and Ye Yuan and Guoren Wang",

note = "Publisher Copyright: {\textcopyright} 2025 IEEE.; 41st IEEE International Conference on Data Engineering, ICDE 2025 ; Conference date: 19-05-2025 Through 23-05-2025",

year = "2025",

doi = "10.1109/ICDE65448.2025.00071",

language = "English",

series = "Proceedings - International Conference on Data Engineering",

publisher = "IEEE Computer Society",

pages = "877--890",

booktitle = "Proceedings - 2025 IEEE 41st International Conference on Data Engineering, ICDE 2025",

address = "United States",

}

Zhong, T, Zhang, Z, Fu, Y, Lu, G, Yuan, Y & Wang, G 2025, QaVA: Query-Aware Video Analysis Framework Based on Data Access Pattern. 在 Proceedings - 2025 IEEE 41st International Conference on Data Engineering, ICDE 2025. Proceedings - International Conference on Data Engineering, IEEE Computer Society, 页码 877-890, 41st IEEE International Conference on Data Engineering, ICDE 2025, Hong Kong, 中国, 19/05/25. http://doi.org/10.1109/ICDE65448.2025.00071

QaVA: Query-Aware Video Analysis Framework Based on Data Access Pattern. / Zhong, Tianxiong; Zhang, Zhiwei; Fu, Yihang 等.
Proceedings - 2025 IEEE 41st International Conference on Data Engineering, ICDE 2025. IEEE Computer Society, 2025. 页码 877-890 (Proceedings - International Conference on Data Engineering).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - QaVA

T2 - 41st IEEE International Conference on Data Engineering, ICDE 2025

AU - Zhong, Tianxiong

AU - Zhang, Zhiwei

AU - Fu, Yihang

AU - Lu, Guo

AU - Yuan, Ye

AU - Wang, Guoren

PY - 2025

Y1 - 2025

N2 - With the explosive growth of video data, efficient video analysis technology has garnered widespread attention. Existing online methods train proxy neural networks upon query arrival and use these networks to scan the entire dataset, guiding the invocation of the expensive deep neural network. While index-based methods advance this process to the index-building stage, significantly reducing the time overhead of video queries. However, the data to query often presents a long-tail distribution, and different types of queries are sensitive to different parts of the distribution. Since the index-based methods cannot predict the queries, they can only provide ad-hoc proxy score generating strategies. This paper proposes a query-aware video analysis framework, QaVA, to improve query performance further. QaVA retains the time-consuming, query-independent semantic extraction process during the index-building stage and employs a tunable lightweight adapter network to accurately and quickly focus on the data parts most relevant to the query after it arrives. Meanwhile, QaVA can automatically tune the training strategy of the adapter network by analyzing the data access pattern of historical queries, thus meeting the needs of general users. Experimental results demonstrate that QaVA can significantly reduce the cost of various queries across multiple datasets, and can speed up query processing by up to 9.2× compared to the most advanced index-based method. Our code is available: http://github.com/InkosiZhong/QaVA.

AB - With the explosive growth of video data, efficient video analysis technology has garnered widespread attention. Existing online methods train proxy neural networks upon query arrival and use these networks to scan the entire dataset, guiding the invocation of the expensive deep neural network. While index-based methods advance this process to the index-building stage, significantly reducing the time overhead of video queries. However, the data to query often presents a long-tail distribution, and different types of queries are sensitive to different parts of the distribution. Since the index-based methods cannot predict the queries, they can only provide ad-hoc proxy score generating strategies. This paper proposes a query-aware video analysis framework, QaVA, to improve query performance further. QaVA retains the time-consuming, query-independent semantic extraction process during the index-building stage and employs a tunable lightweight adapter network to accurately and quickly focus on the data parts most relevant to the query after it arrives. Meanwhile, QaVA can automatically tune the training strategy of the adapter network by analyzing the data access pattern of historical queries, thus meeting the needs of general users. Experimental results demonstrate that QaVA can significantly reduce the cost of various queries across multiple datasets, and can speed up query processing by up to 9.2× compared to the most advanced index-based method. Our code is available: http://github.com/InkosiZhong/QaVA.

KW - deep learning

KW - object detection

KW - video analytics

UR - http://www.scopus.com/pages/publications/105015522472

U2 - 10.1109/ICDE65448.2025.00071

DO - 10.1109/ICDE65448.2025.00071

M3 - Conference contribution

AN - SCOPUS:105015522472

T3 - Proceedings - International Conference on Data Engineering

SP - 877

EP - 890

BT - Proceedings - 2025 IEEE 41st International Conference on Data Engineering, ICDE 2025

PB - IEEE Computer Society

Y2 - 19 May 2025 through 23 May 2025

ER -

QaVA: Query-Aware Video Analysis Framework Based on Data Access Pattern

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此