Learning travel recommendations from user-generated GPS traces
- Yu Zheng ,
- Xing Xie
ACM Transaction on Intelligent Systems and Technology |
The advance of GPS-enabled devices facilitates people to record their location histories with GPS traces, which imply human behaviors and preferences related to travel. In this paper, we perform two types of travel recommendations by mining multiple users’ GPS traces. The first is a generic one that recommends a user with top interesting locations and travel sequences in a given geospatial region. Here, interesting locations mean the culturally important places, such as Tiananmen Square in Beijing, and frequented public areas, like shopping malls and restaurants. The second is a personalized recommendation that provides an individual with locations matching her travel preferences. To achieve the first recommendation, we model multiple users’ location histories with a tree-based hierarchical graph (TBHG). Based on the TBHG, we propose a HITS (Hypertext Induced Topic Search)-based inference model, which regards an individual’s access on a location as a directed link from the user to that location. This model infers two values, the interest level of a location and a user’s travel experience, by taking into account 1) the mutual-reinforcement relation between the two values and 2) the geo-region conditions. Considering the inferred values, we mine the classical travel sequences among locations. In the personalized recommendation, we first understand the correlation among locations in terms of 1) the sequences that the locations have been visited and 2) the travel experiences of the persons accessing these locations. Beyond the geo-distance relation, this correlation represents the relation between locations in the spaces of human behavior. Later, we incorporate the location correlation into a collaborative filtering (CF)-based model that infers a user’s interests in an unvisited location based on her locations histories and that of others. We evaluated our system based on a real-world GPS trace dataset collected by 107 users over a period of one year. As a result, our HITS-based inference model outperformed baseline approaches like rank-by-count and rank-by-frequency. Meanwhile, when considering the users’ travel experiences and location interests, we achieved a better performance in recommending travel sequences beyond baselines including rank-by-count and rank-by-interest. Regarding the personalized recommendation, our approach is more effective than the weighted Slope One algorithm with a slightly additional computation. In addition, in contrast to the Pearson correlation-based CF model, our method is much more efficient while keeping the similar effectiveness.
Copyright © 2007 by the Association for Computing Machinery, Inc. Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, to republish, to post on servers, or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Publications Dept, ACM Inc., fax +1 (212) 869-0481, or [email protected]. The definitive version of this paper can be found at ACM's Digital Library --http://www.acm.org/dl/.
Publication Downloads
GeoLife GPS Trajectories
August 9, 2012
This is a GPS trajectory dataset collected in (Microsoft Research Asia) GeoLife project by 182 users in a period of over three years (from April 2007 to August 2012). A GPS trajectory of this dataset is represented by a sequence of time-stamped points, each of which contains the information of latitude, longitude and altitude. This dataset contains 17,621 trajectories with a total distance of about 1.2 million kilometers and a total duration of 48,000+ hours. These trajectories were recorded by different GPS loggers and GPS-phones, and have a variety of sampling rates. 91 percent of the trajectories are logged in a dense representation, e.g. every 1~5 seconds or every 5~10 meters per point. This dataset recoded a broad range of users' outdoor movements, including not only life routines like go home and go to work but also some entertainments and sports activities, such as shopping, sightseeing, dining, hiking, and cycling. This trajectory dataset can be used in many research fields, such as mobility pattern mining, user activity recognition, location-based social networks, location privacy, and location recommendation.