CitySearcher: A City Search Engine For Interests




Gaurav Pandey; Shuaiqiang Wang; Mohamed Abdel Maksoud

subject

Feature engineering; Word embedding; Towns and cities; Computer science; Information needs; Engineering and technology; Semantic web; Semantics; Search engines; Information systems; Electrical engineering, electronic engineering, information engineering; Word2vec; Information retrieval; Rank (computer programming); Semantic search; Vertical search; Artificial intelligence & image processing; Learning to rank; Artificial intelligence; Recommender systems; Natural language processing

description

We introduce CitySearcher, a vertical search engine that returns cities in response to a query naming an interest. In general-purpose search engines, exploiting semantic relationships between words tends to improve performance. Even when ambiguous query words carry multiple meanings, such engines can return diversified results to satisfy different users' information needs. For CitySearcher, however, mismatched semantic relationships can lead to extremely unsatisfactory results. For example, the city Sale would incorrectly rank high for the interest "shopping" because of the semantic interpretation of its name. The main challenge in our system is therefore to eliminate the mismatched semantic relationships that arise as a side effect of the semantic models. In the example above, we aim to ignore the semantics of a city's name, which is not indicative of the city's characteristics. In CitySearcher, we use word2vec, a very popular word embedding technique, to estimate the semantics of words and create the initial ranking of the cities. To reduce the effect of mismatched semantic relationships, we generate a set of features for learning using a novel clustering-based method. With the generated features, we then apply learning-to-rank algorithms to rerank the cities to be returned. We evaluate our system on the English version of the Wikivoyage dataset, from which we sample a very small dataset for training. Experimental results demonstrate the performance gain of our system over various standard retrieval techniques.

Peer reviewed
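The name-mismatch problem described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the three-dimensional vectors are made-up stand-ins for word2vec embeddings, representing a city by the mean of its word vectors is an assumed simplification, and the city descriptions, `EMBEDDINGS`, `CITIES`, and `rank_cities` are hypothetical names introduced only for this example. The clustering-based feature generation and the learning-to-rank stage are not shown.

```python
import math

# Hypothetical toy vectors standing in for word2vec embeddings.
# Dimensions loosely mean (commerce, nature, nightlife); all values are made up.
EMBEDDINGS = {
    "shopping": [0.9, 0.1, 0.2],
    "sale":     [0.95, 0.05, 0.1],
    "mall":     [0.85, 0.1, 0.3],
    "boutique": [0.8, 0.2, 0.3],
    "park":     [0.1, 0.9, 0.1],
    "river":    [0.05, 0.95, 0.1],
    "suburb":   [0.3, 0.5, 0.2],
}

# Hypothetical city descriptions, reduced to a few content words each.
CITIES = {
    "Sale":   ["park", "river", "suburb"],
    "Milan":  ["mall", "boutique"],
    "Bergen": ["park", "river", "boutique"],
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def mean_vector(words):
    """Average the vectors of all in-vocabulary words (assumed city representation)."""
    vectors = [EMBEDDINGS[w] for w in words if w in EMBEDDINGS]
    if not vectors:
        return [0.0, 0.0, 0.0]
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def rank_cities(interest, cities, include_name):
    """Rank cities by similarity to the interest, best first.

    include_name=True lets the city's own name contribute to its vector,
    which is the source of the mismatch for a city such as Sale.
    """
    query = EMBEDDINGS[interest]
    scored = []
    for name, description_words in cities.items():
        words = list(description_words)
        if include_name:
            words.append(name.lower())
        scored.append((cosine(query, mean_vector(words)), name))
    return [name for _, name in sorted(scored, reverse=True)]


if __name__ == "__main__":
    print(rank_cities("shopping", CITIES, include_name=True))
    print(rank_cities("shopping", CITIES, include_name=False))
```

With these toy values, letting the name contribute lifts Sale above Bergen for the interest "shopping", while ignoring the name drops Sale to last place, even though neither city's description changed.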

year: 2017-08-07

DOI: 10.1145/3077136.3080742
http://juuli.fi/Record/0285046817