Title: Network Traffic Classification Using Ensemble Learning with Time Related Features

Issue Number: Vol. 10, No. 2
Year of Publication: 2020
Page Numbers: 23-31
Authors: Muhammad Anwarul Azim, Tanvir and Mohammad Khairul Islam
Journal Name: International Journal of New Computer Architectures and their Applications (IJNCAA)
- Hong Kong
DOI:  http://dx.doi.org/10.17781/P002674


Network traffic classification has become important with the rapid growth of the Internet and online applications. Though, there were researches that applied different machine learning algorithms for traffic classification purposes, the continuous expansion of technologies and applications in stationary and mobile are creating a dynamic environment. Because of encryption in today’s Internet, traffic classification still poses a great deal of concern for researchers and network communities. This work proposes ensemble learning including Voting, Bagging, and Boosting for traffic classification, and then compares them with their own base classifiers when used individually. Time-related features are focused which are independent of data encryption on the UNB ISCX dataset, containing flow duration, inter-arrival time, byte rate, packet rate, etc. Among different techniques, Random Forest outperforms nearly all others with respect to various evaluation matrices such as accuracy, precision, recall, and f1-score. In the case of VPN traffic and Non-VPN traffic, it gives almost 90.65% and 95.42% accuracy respectively. In the case of combined VPN and Non-VPN traffic, we achieve 90.18% accuracy for classifying traffic categories which is a significant improvement from previous works