Location-based recommendation

Location-based recommendation is a type of recommender system that incorporates location information, such as that from a mobile device, into its algorithms in order to provide more relevant recommendations to users. These could include recommendations for restaurants, museums, or other points of interest or events near the user's location. Such services take advantage of the increasing use of smartphones, which store and provide the location information of their users, alongside location-based social networks (LBSN) like Foursquare, Gowalla, Swarm, and Yelp. In addition to geosocial networking services, traditional online social networks such as Facebook and Twitter use the location information of their users to show and recommend upcoming events, posts, and local trends.

In addition to its value for users, this information helps third-party companies advertise products, hotels, and places, and forecast service demand, such as the number of taxis needed in a given part of a city.

Background

Recommender systems are information filtering systems which attempt to predict the rating or preference that a user would give, based on ratings that similar users gave and ratings that the user gave on previous occasions. These systems have become increasingly popular and are used for movies, music, news, books, research articles, search queries, social tags, and products in general.

Recommending new places

The main objective of recommending new places is to suggest places the user has not yet visited, such as restaurants, museums, national parks, or other points of interest. This type of recommendation is especially valuable for people who are traveling to a new city and want the best experience during their trip. Location-based social networks and third-party advertising companies aim to base such recommendations not only on previous check-ins and preferences but also on social links, in order to suggest an unvisited point of interest. The implicit goal of this type of recommendation is to relieve the user of the burden of searching for an interesting place.

One of the first studies in this area was conducted in 2011. The idea behind this work was to leverage social influence and location influence and provide recommendations. The authors provide three types of scores:[1]

  • Similar users: this score is proportional to how similarly two users behave in visiting places. The similarity between two users is computed as the cosine similarity of their visit values over the set of places [math]\displaystyle{ I }[/math]:[math]\displaystyle{ sim_{usage}(u,v) = {\sum_{j\in I} s(u,j) \times s(v,j) \over \sqrt{\sum_{j \in I} (s(u,j))^2}\sqrt{\sum_{j \in I} (s(v,j))^2} } }[/math]where [math]\displaystyle{ s(u,i) }[/math] denotes the probability that user [math]\displaystyle{ u }[/math] visits place [math]\displaystyle{ i }[/math]. For a place the user has not yet visited, this value can be estimated by user-based collaborative filtering, as a similarity-weighted average over the set of users [math]\displaystyle{ U }[/math]:[math]\displaystyle{ s(u,i) = {\sum_{v\in U} sim(u,v) \times s(v,i) \over \sum_{v\in U} sim(u,v)} }[/math]
  • Similar friends: this score measures the similarity of two users based on their mutual connections (i.e., friendships) in social media and on the places they have both visited. It increases with the number of friends that the two users have in common and is calculated as a weighted sum of two Jaccard coefficients:[math]\displaystyle{ sim_{social}(u,v) = {\eta \cdot |F_u \cap F_v | \over |F_u \cup F_v|} + {(1-\eta) \cdot |I_u \cap I_v | \over |I_u \cup I_v|} }[/math]where [math]\displaystyle{ F_u }[/math] represents the set of friends and [math]\displaystyle{ I_u }[/math] the set of places visited by user [math]\displaystyle{ u }[/math]. The tuning parameter [math]\displaystyle{ \eta }[/math], which lies between 0 and 1, controls the relative importance of the social similarity and the visiting similarity of the two users.
  • Geographical distance: this score is inversely related to the distance between the target place and the places that a user frequently visits. Other studies have shown that the overall distribution of distances resembles a power-law distribution.[2][3] The formula below gives the probability that user [math]\displaystyle{ u }[/math] checks in at place [math]\displaystyle{ i }[/math], based on the distance from [math]\displaystyle{ i }[/math] to each of the previous check-ins of user [math]\displaystyle{ u }[/math]:[math]\displaystyle{ s_{geo}(u,i) = Pr(u,i) = \prod_{k \in I_u} f(distance(i,k)) }[/math]where [math]\displaystyle{ f }[/math] gives the check-in probability as a function of distance.
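The three scores above can be illustrated with a minimal Python sketch. It is not the cited authors' implementation: the function names, the toy data, the use of planar coordinates with Euclidean distance, and the power-law constants a and b are all assumptions made for illustration.

```python
import math


def cosine_similarity(visits_u, visits_v):
    """sim_usage: cosine similarity of two visit vectors (dict: place -> value)."""
    dot = sum(val * visits_v[p] for p, val in visits_u.items() if p in visits_v)
    norm_u = math.sqrt(sum(val * val for val in visits_u.values()))
    norm_v = math.sqrt(sum(val * val for val in visits_v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def visit_score(user, place, visits):
    """s(u, i): similarity-weighted average of the other users' visits to `place`."""
    num = den = 0.0
    for other, other_visits in visits.items():
        if other == user:
            continue
        weight = cosine_similarity(visits[user], other_visits)
        num += weight * other_visits.get(place, 0.0)
        den += weight
    return num / den if den else 0.0


def jaccard(a, b):
    """Size of the intersection divided by the size of the union of two sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def social_similarity(friends_u, friends_v, places_u, places_v, eta=0.5):
    """sim_social: eta-weighted mix of friend overlap and visited-place overlap."""
    return eta * jaccard(friends_u, friends_v) + (1 - eta) * jaccard(places_u, places_v)


def power_law(distance, a=0.5, b=-1.0):
    """Hypothetical distance function f(d) = a * d**b; a and b are made-up values."""
    return a * distance ** b


def geo_score(candidate, visited, coords, f=power_law):
    """s_geo: product of f(distance) over the user's previously visited places."""
    prob = 1.0
    for place in visited:
        prob *= f(math.dist(coords[candidate], coords[place]))
    return prob


# Toy data (invented): 1 means the user has checked in at the place.
visits = {
    "ann": {"cafe": 1, "museum": 1},
    "bob": {"cafe": 1, "park": 1},
    "cat": {"museum": 1, "park": 1, "zoo": 1},
}
coords = {"cafe": (0, 0), "museum": (3, 4), "zoo": (0, 2)}

print(visit_score("ann", "zoo", visits))
print(social_similarity({"bob", "cat"}, {"ann", "cat"}, {"cafe", "museum"}, {"cafe", "park"}))
print(geo_score("zoo", ["cafe", "museum"], coords))
```

In practice the constants of the distance function would be fitted to observed check-in data rather than chosen by hand, and geographic rather than planar distance would be used.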

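The three scores are combined into a single score:[math]\displaystyle{ S(u,i) = (1 - \alpha - \beta) \cdot s_{usage}(u,i) + \alpha \cdot s_{social}(u,i) + \beta \cdot s_{geo}(u,i) }[/math]where [math]\displaystyle{ s_{usage}(u,i) }[/math] and [math]\displaystyle{ s_{social}(u,i) }[/math] are the collaborative-filtering scores of place [math]\displaystyle{ i }[/math] for user [math]\displaystyle{ u }[/math], computed with [math]\displaystyle{ sim_{usage} }[/math] and [math]\displaystyle{ sim_{social} }[/math] as the user-similarity weights, respectively. The three terms correspond to recommender systems based on user preference, social influence, and geographical influence, respectively. The two weighting parameters [math]\displaystyle{ \alpha }[/math] and [math]\displaystyle{ \beta }[/math], with [math]\displaystyle{ 0 \leq \alpha + \beta \leq 1 }[/math], denote the relative importance of social influence and geographical influence compared to user preference.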
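As a rough illustration of this linear combination, the sketch below fuses three precomputed scores per candidate place and ranks the candidates. The function names, the candidate places, and all numeric values are hypothetical, and the sketch assumes the three scores have already been scaled to a comparable range.

```python
def fused_score(usage, social, geo, alpha=0.2, beta=0.3):
    """Combine preference, social, and geographical scores linearly."""
    if alpha < 0 or beta < 0 or alpha + beta > 1:
        raise ValueError("alpha and beta must be non-negative with alpha + beta <= 1")
    return (1 - alpha - beta) * usage + alpha * social + beta * geo


def rank_places(candidates, alpha=0.2, beta=0.3, top_n=3):
    """Return the top_n place names, ordered by descending fused score.

    `candidates` maps a place name to a (usage, social, geo) tuple.
    """
    ranked = sorted(
        candidates,
        key=lambda place: fused_score(*candidates[place], alpha=alpha, beta=beta),
        reverse=True,
    )
    return ranked[:top_n]


# Invented scores for three unvisited places.
candidates = {
    "zoo": (0.45, 0.30, 0.10),
    "park": (0.80, 0.10, 0.05),
    "pier": (0.20, 0.90, 0.60),
}

print(rank_places(candidates, alpha=0.2, beta=0.3, top_n=2))
print(rank_places(candidates, alpha=0.0, beta=0.0, top_n=2))
```

Setting both weights to zero reduces the ranking to pure user preference, which makes it easy to see how much the social and geographical terms change the result.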

Recommending the next place

Recommending a sequence of places is more complex: before making any suggestion, the system must take into account each location, the time, the weather, reachability, and so on. Such recommendations are generally known as context-aware recommendations, and they tend to suggest places that other people (possibly the user's friends) visited after the location of the user's first recommendation.
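One simple way to capture "places that other people visited next" is to count transitions between consecutive check-ins. The sketch below is only an illustrative baseline, not a method described in the cited sources: it ignores time, weather, and reachability, and the function names and check-in trails are invented.

```python
from collections import Counter, defaultdict


def build_transitions(trails):
    """Count how often each place is followed directly by another place."""
    transitions = defaultdict(Counter)
    for trail in trails:
        for here, following in zip(trail, trail[1:]):
            transitions[here][following] += 1
    return transitions


def recommend_next(transitions, current, visited=(), top_n=3):
    """Suggest the places most often visited right after `current`."""
    counts = transitions.get(current, Counter())
    ranked = [
        place
        for place, _ in counts.most_common()
        if place != current and place not in visited
    ]
    return ranked[:top_n]


# Invented check-in trails of other users, in visiting order.
trails = [
    ["museum", "cafe", "park"],
    ["museum", "cafe", "pier"],
    ["museum", "park"],
    ["cafe", "park"],
]

transitions = build_transitions(trails)
print(recommend_next(transitions, "museum"))
print(recommend_next(transitions, "cafe", visited={"park"}))
```

A context-aware system would extend the transition counts with further conditions, for example by keeping separate counts per time of day or by restricting the trails to those of the user's friends.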

Recommending events and neighborhoods

There are a vast number of temporary events being held in different locations. Detecting and recommending events that would be interesting to a user is a task which requires considerable profiling, both of the user's history of event preferences and those of the user's social circle.

Social events

Researchers at a 2010 Institute of Electrical and Electronics Engineers (IEEE) conference discussed the need for a reliable, fine-grained dataset of previous user attendance in order to provide social-event detection.[4] Users' residence areas and attended events were estimated from mobile phone data. Six strategies were designed and tested for event recommendation:

  • Popular events: the most-attended events.
  • Geographically close events: events close to the user's residence area. The raw score of an event could be adjusted by a factor inversely proportional to the distance to the event.
  • Popular events in area: the most-attended events within certain neighborhoods.
  • TF-IDF (term frequency–inverse document frequency): inspired by the popular approach in information retrieval, recommends events which may not be widely popular but are very popular within a local area.
  • The K-nearest locations: popular events in neighborhoods similar to the user's residence area. The similarity between two neighborhoods [math]\displaystyle{ i }[/math] and [math]\displaystyle{ k }[/math] could be defined as:[math]\displaystyle{ sim(i,k) = {\sum_e (n_{i,e} \cdot n_{k,e}) \over \sqrt{\sum_e (n^2_{i,e})}\sqrt{\sum_e (n^2_{k,e})} } \cdot {2N_{i\cup k} \over N_i + N_k} }[/math]where [math]\displaystyle{ n_{i,e} }[/math] represents the number of individuals living in neighborhood [math]\displaystyle{ i }[/math] who attended event [math]\displaystyle{ e }[/math]. The similarity measure is weighted by [math]\displaystyle{ N_i }[/math] and [math]\displaystyle{ N_k }[/math], which represent the number of events that people living in neighborhoods [math]\displaystyle{ i }[/math] and [math]\displaystyle{ k }[/math] have attended. Similarly, [math]\displaystyle{ N_{i\cup k} }[/math] represents the number of users living in [math]\displaystyle{ i }[/math] or users living in [math]\displaystyle{ k }[/math]. Given the similarity of neighborhoods, one can predict the score of an event [math]\displaystyle{ j }[/math] for users living in neighborhood [math]\displaystyle{ i }[/math] as a similarity-weighted average of the values of the similar locations:[math]\displaystyle{ score_{i,j} = {\sum_k n_{k,j} \times sim(i,k) \over \sum_k sim(i,k)} }[/math]Scores can be predicted for every neighborhood-event pair, and the events with the highest scores are recommended to users.
  • The K-nearest events: the similarity between events is computed, and the top K events most similar to those that a user previously enjoyed are recommended:[math]\displaystyle{ score_{i,j} = {\sum_q n_{i,q} \times sim(j,q) \over \sum_q sim(j,q)} }[/math]
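The K-nearest-locations strategy in the list above can be sketched in Python as follows. This is an illustrative reading of the two formulas, not the cited authors' code: the function names and attendance counts are invented, the counts N_i, N_k, and N_{i∪k} are simply passed in by the caller, and the prediction uses the plain cosine similarity unless another similarity function is supplied.

```python
import math


def cosine(att_i, att_k):
    """Cosine similarity of two attendance vectors (dict: event -> attendees)."""
    dot = sum(n * att_k[e] for e, n in att_i.items() if e in att_k)
    norm_i = math.sqrt(sum(n * n for n in att_i.values()))
    norm_k = math.sqrt(sum(n * n for n in att_k.values()))
    if norm_i == 0 or norm_k == 0:
        return 0.0
    return dot / (norm_i * norm_k)


def weighted_similarity(att_i, att_k, n_i, n_k, n_union):
    """sim(i, k): cosine similarity scaled by 2 * N_union / (N_i + N_k)."""
    return cosine(att_i, att_k) * 2 * n_union / (n_i + n_k)


def predict_event_score(target, event, attendance, similarity=cosine, k=None):
    """score_{i,j}: similarity-weighted average of attendance at `event`
    over the k neighborhoods most similar to `target` (all others if k is None)."""
    sims = [
        (similarity(attendance[target], counts), other)
        for other, counts in attendance.items()
        if other != target
    ]
    sims.sort(reverse=True)
    if k is not None:
        sims = sims[:k]
    den = sum(weight for weight, _ in sims)
    if den == 0:
        return 0.0
    num = sum(weight * attendance[other].get(event, 0) for weight, other in sims)
    return num / den


# Invented attendance counts: neighborhood -> {event: number of residents attending}.
attendance = {
    "north": {"jazz": 3, "fair": 1},
    "south": {"jazz": 2, "fair": 2, "expo": 4},
    "east": {"fair": 5, "expo": 1},
}

print(predict_event_score("north", "expo", attendance))
print(predict_event_score("north", "expo", attendance, k=1))
```

The K-nearest-events strategy follows the same pattern with the roles swapped: similarities are computed between events rather than neighborhoods, and the weighted average runs over the events a user previously attended.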

Neighborhoods

In their paper, the authors of the interactive tool Hoodsquare explain how they use five months of Foursquare check-in data to group similar areas together and redraw the neighborhood boundaries of a city.[5] Their analysis reveals different characteristics of a city that could be exploited for tasks such as recommending attractive neighborhoods to tourists and city residents. Similarly, the Livehoods project[6] was conducted in order to define new local regions, named Livehoods, that partition a city in a way that reflects the character of life in those areas.

References

  1. Ye, Mao; Yin, Peifeng; Lee, Wang-Chien; Lee, Dik-Lun (2011-01-01). Exploiting Geographical Influence for Collaborative Point-of-interest Recommendation. SIGIR '11. New York, NY, USA: ACM. 325–334. doi:10.1145/2009916.2009962. ISBN 9781450307574. 
  2. Noulas, Anastasios; Scellato, Salvatore; Mascolo, Cecilia; Pontil, Massimiliano (2011-01-01). An empirical study of geographic user activity patterns in foursquare. pp. 570–573. http://www.citeulike.org/user/amirreza/article/9770331. 
  3. Brockmann, D.; Hufnagel, L.; Geisel, T. (2006-01-26). "The scaling laws of human travel". Nature 439 (7075): 462–465. doi:10.1038/nature04292. ISSN 1476-4687. PMID 16437114. Bibcode: 2006Natur.439..462B. 
  4. Quercia, D.; Lathia, N.; Calabrese, F.; Lorenzo, G. Di; Crowcroft, J. (2010-12-01). Recommending Social Events from Mobile Phone Location Data. 971–976. doi:10.1109/ICDM.2010.152. ISBN 978-1-4244-9131-5. 
  5. Zhang, A. X.; Noulas, A.; Scellato, S.; Mascolo, C. (2013-09-01). Hoodsquare: Modeling and Recommending Neighborhoods in Location-Based Social Networks. 69–74. doi:10.1109/SocialCom.2013.17. ISBN 978-0-7695-5137-1. 
  6. Cranshaw, Justin; Schwartz, Raz; Hong, Jason I.; Sadeh, Norman. The Livehoods Project: Utilizing Social Media to Understand the Dynamics of a City. ICWSM'12. 
