6533b85ffe1ef96bd12c1dea

RESEARCH PRODUCT

Coupling News Sentiment with Web Browsing Data Improves Prediction of Intra-Day Price Dynamics

Guido CaldarelliGabriele RancoMichele TreccaniGiacomo BormettiIlaria BordinoFabrizio Lillo

subject

0301 basic medicineINFORMATIONEconomicsComputer scienceBig datalcsh:MedicineSocial SciencesQuantitative Finance - Computational Financesocial and economic systemsMathematical and Statistical TechniquesSociologybig dataEconometrics050207 economicsComputer NetworksCapital Marketslcsh:ScienceFinancial Marketsmedia_common050208 financeMultidisciplinary05 social sciencesCommerceSocial CommunicationSettore FIS/02 - Fisica Teorica Modelli e Metodi MatematiciSurpriseModels EconomicSocial NetworksPhysical SciencesSocial SystemsEngineering and TechnologyComputational sociologyBEHAVIORStatistics (Mathematics)Network AnalysisResearch ArticleComputer and Information SciencesExploitmedia_common.quotation_subjectTwitterComputational Finance (q-fin.CP)Research and Analysis MethodsFOS: Economics and business03 medical and health sciencesSEARCH0502 economics and businessHumansRelevance (information retrieval)Web navigationInvestmentsStatistical MethodsInternetStatistical Finance (q-fin.ST)STOCK-MARKETbusiness.industrylcsh:RSentiment analysisFinancial marketATTENTIONQuantitative Finance - Statistical FinanceCommunicationsNoise ReductionFinancial Firms030104 developmental biologySignal ProcessingPredictive powerlcsh:QStock marketbusinessSocial MediaFinanceMathematicsForecasting

description

The new digital revolution of big data is deeply changing our capability of understanding society and forecasting the outcome of many social and economic systems. Unfortunately, information can be very heterogeneous in the importance, relevance, and surprise it conveys, affecting severely the predictive power of semantic and statistical methods. Here we show that the aggregation of web users' behavior can be elicited to overcome this problem in a hard to predict complex system, namely the financial market. Specifically, our in-sample analysis shows that the combined use of sentiment analysis of news and browsing activity of users of Yahoo! Finance greatly helps forecasting intra-day and daily price changes of a set of 100 highly capitalized US stocks traded in the period 2012-2013. Sentiment analysis or browsing activity when taken alone have very small or no predictive power. Conversely, when considering a "news signal" where in a given time interval we compute the average sentiment of the clicked news, weighted by the number of clicks, we show that for nearly 50% of the companies such signal Granger-causes hourly price returns. Our result indicates a "wisdom-of-the-crowd" effect that allows to exploit users' activity to identify and weigh properly the relevant and surprising news, enhancing considerably the forecasting power of the news sentiment.

https://doi.org/10.1371/journal.pone.0146576