Tabular-based self-supervised learning approach for encrypted traffic classification
21 August 2023
Xuan Zheng, Xiuli Ma, Yanliang Jin, Dongsheng Gu, Rui Wang
Abstract

Encrypted traffic classification (ETC) plays an important role in network management. Most existing work classifies traffic using statistical features, transformed traffic images, or text. However, designing statistical features is time-consuming and labor-intensive, and traffic data transformed into images or text lack spatial or semantic features. Because packet headers have a uniform structure and are independent of each other, traffic data most closely resemble tabular data. We therefore propose a data processing approach that converts packet headers into traffic tables in which each field is viewed as a column (feature). In addition, traffic data are hard to label in real network environments, and each field contributes differently to the classification. Therefore, a self-supervised learning algorithm, SubTab, is used as the baseline network to reduce the reliance on labeled data and to assign different weights to different fields. To the best of our knowledge, this is the first work to address the ETC problem from the tabular domain. Experimental results on two real-world datasets, ISCX VPN-nonVPN and the self-collected dataset SHU-ET, demonstrate that our method surpasses state-of-the-art methods based on traffic images or text and show that traffic tables are better suited to ETC problems. Moreover, our method achieves strong performance with only 10% of the data labeled, which reduces the reliance on labeled data.
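
The idea of viewing each header field as a column can be illustrated with a minimal sketch. It is not the authors' pipeline: the abstract does not say which protocol layers or fields are used, so the choice of the fixed 20-byte IPv4 header, the column names in `COLUMNS`, and the helper names `ipv4_header_to_row` and `packets_to_table` are all assumptions made for illustration. Addresses and the checksum are left out here only to keep the example short.

```python
import struct
from typing import Dict, List

# Hypothetical column set: a few IPv4 header fields picked for illustration.
# The paper's actual field selection is not stated in the abstract.
COLUMNS = ["version", "ihl", "tos", "total_length", "identification",
           "flags", "fragment_offset", "ttl", "protocol"]


def ipv4_header_to_row(packet: bytes) -> Dict[str, int]:
    """Parse the fixed 20-byte IPv4 header into one table row (field -> value)."""
    if len(packet) < 20:
        raise ValueError("need at least 20 bytes for an IPv4 header")
    # Network byte order; layout follows the IPv4 header without options.
    (ver_ihl, tos, total_length, identification, flags_frag,
     ttl, protocol, _checksum, _src, _dst) = struct.unpack(
        "!BBHHHBBH4s4s", packet[:20])
    return {
        "version": ver_ihl >> 4,            # upper 4 bits of the first byte
        "ihl": ver_ihl & 0x0F,              # lower 4 bits of the first byte
        "tos": tos,
        "total_length": total_length,
        "identification": identification,
        "flags": flags_frag >> 13,          # top 3 bits of the 16-bit word
        "fragment_offset": flags_frag & 0x1FFF,
        "ttl": ttl,
        "protocol": protocol,
    }


def packets_to_table(packets: List[bytes]) -> List[List[int]]:
    """Stack one row per packet; columns follow the order in COLUMNS."""
    rows = [ipv4_header_to_row(p) for p in packets]
    return [[row[name] for name in COLUMNS] for row in rows]
```

Each packet becomes one row and each field one feature, so the resulting table could be passed to any tabular learner. How a method such as SubTab then splits or weights those columns is outside the scope of this sketch.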

© 2023 SPIE and IS&T
Xuan Zheng, Xiuli Ma, Yanliang Jin, Dongsheng Gu, and Rui Wang "Tabular-based self-supervised learning approach for encrypted traffic classification," Journal of Electronic Imaging 32(4), 043032 (21 August 2023). https://doi.org/10.1117/1.JEI.32.4.043032
Received: 12 December 2022; Accepted: 3 August 2023; Published: 21 August 2023
KEYWORDS
Machine learning
Education and training
Data modeling
Performance modeling
Classification systems
Image classification
Feature extraction
