12 October 2020 Cross-domain text classification algorithm based on instance-transfer learning
Proceedings Volume 11574, International Symposium on Artificial Intelligence and Robotics 2020; 1157406 (2020) https://doi.org/10.1117/12.2576021
Event: International Symposium on Artificial Intelligence and Robotics (ISAIR), 2020, Kitakyushu, Japan
Abstract
Cross-domain text classification has broad application prospects in the field of data mining. Because transfer learning lets target-domain data share and reuse semantic information from existing knowledge domains, it is generally used for cross-domain text processing. On this basis, we propose a cross-domain text classification algorithm, MTrA. The algorithm is based on TrAdaBoost and takes into account the distribution differences between the source domain and the target domain. It uses the Maximum Mean Discrepancy (MMD) as the initial weight parameter of the two domains. MTrA also adds a weight backfill factor that considers the classification accuracy on the source domain and balances the weight update for the source-domain data. In experiments on the 20 Newsgroups dataset, MTrA improves classification accuracy by 9.4% on average compared with the traditional TrAdaBoost algorithm, which demonstrates the effectiveness and advantages of the algorithm.
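To make the two building blocks named in the abstract more concrete, the following is a minimal Python sketch, not the authors' implementation. It shows (1) an empirical MMD estimate under the simplifying assumption of a linear kernel, and (2) one round of the standard TrAdaBoost weight update that MTrA is described as building on. The function names `linear_mmd` and `tradaboost_update`, the choice of kernel, and the clipping of the error rate are all assumptions made for illustration. The abstract does not specify how MTrA maps the MMD value to initial weights or how the weight backfill factor is defined, so neither is reproduced here.

```python
import numpy as np


def linear_mmd(source, target):
    """Empirical MMD with a linear kernel (an assumed kernel choice).

    With a linear kernel, MMD reduces to the Euclidean distance
    between the mean feature vectors of the two domains.
    """
    source = np.asarray(source, dtype=float)
    target = np.asarray(target, dtype=float)
    diff = source.mean(axis=0) - target.mean(axis=0)
    return float(np.sqrt(diff @ diff))


def tradaboost_update(w_src, w_tgt, err_src, err_tgt, n_rounds):
    """One round of the standard TrAdaBoost weight update.

    w_src, w_tgt     : current instance weights for source / target data
    err_src, err_tgt : 0/1 arrays, 1 where the weak learner was wrong
    n_rounds         : total number of boosting rounds
    """
    w_src = np.asarray(w_src, dtype=float)
    w_tgt = np.asarray(w_tgt, dtype=float)
    err_src = np.asarray(err_src, dtype=float)
    err_tgt = np.asarray(err_tgt, dtype=float)

    # Weighted error is measured on the target domain only.
    eps = np.sum(w_tgt * err_tgt) / np.sum(w_tgt)
    # Clip so that beta_t stays finite and below 1 (illustrative safeguard).
    eps = min(max(eps, 1e-10), 0.499)

    beta_t = eps / (1.0 - eps)
    beta = 1.0 / (1.0 + np.sqrt(2.0 * np.log(len(w_src)) / n_rounds))

    # Misclassified source instances are down-weighted;
    # misclassified target instances are up-weighted.
    new_src = w_src * beta ** err_src
    new_tgt = w_tgt * beta_t ** (-err_tgt)
    return new_src, new_tgt
```

In this sketch, source instances that the weak learner gets wrong lose weight every round, which is the behaviour the abstract's backfill factor is said to balance.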
© (2020) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Ruijun Liu, Jun Wang, Zhuo Yu, Yuqian Shi, Lun Zhang, Changjiang Ji, and Xin Jin "Cross-domain text classification algorithm based on instance-transfer learning", Proc. SPIE 11574, International Symposium on Artificial Intelligence and Robotics 2020, 1157406 (12 October 2020); https://doi.org/10.1117/12.2576021