• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2022, Vol. 44 ›› Issue (02): 244-250.

Previous Articles     Next Articles

GPS trajectory de-anonymization based on deep learning

BU Guan-hua1,2 ,ZHOU Li-liang3,LI Hao1,ZHANG Min1   

  1. (1.Trusted Computing and Information Assurance Laboratory,
    Institute of Software,Chinese Academy of Sciences,Beijing 100089;

    2.University of Chinese Academy of Sciences,Beijing 100089;

    3.CETC Key Laboratory of Avionic Information System Technology,Chengdu 610036,China)

  • Received:2020-07-20 Revised:2020-10-30 Accepted:2022-02-25 Online:2022-02-25 Published:2022-02-17

Abstract: The rapid development of mobile Internet and LBS technology allows location service providers to easily collect an ocean of user location trajectory data. Recent studies have shown that deep learning methods can extract user privacy such as user identity from trajectory datasets. However, the existing work mainly focuses on the check-in trajectories collected by social networks, and the de- anonymization research of GPS trajectories is relatively lacking. Therefore, the research on the de- anonymization technology of GPS trajectory based on deep learning is carried out. Firstly, a pre-training method of GPS trajectory data is proposed. After sub-trajectory division, location conversion and location embedding, the spatial distance and context information of GPS coordinates in original trajectories are embedded into fixed-length vectors, so that the GPS trajectory data can be used as the input of neural network. Secondly, a GPS trajectory de-anonymization method based on deep neural network training is proposed. Based on the pre-trained vector sequences obtained in data pre-training, neural networks such as LSTM and GRU are used as encoders to train and fit user identification to achieve trajectory-user link from anonymous trajectories. Finally, the above methods are verified on Geolife trajectory dataset. In the experiment, the accuracy and Top5-accuracy of trajectory de-anonymization reach 56.73% and 7348%. The experimental results demonstrate that the GPS trajectory de-anonymization method based on deep learning can obtain more accurate user identification from the anonymous trajectory data.

Key words: deep learning, recurrent neural network, trajectory de-anonymization, GPS trajectory, data pre-training