6533b839fe1ef96bd12a5a28

RESEARCH PRODUCT

Text mining and expert curation to develop a database on psychiatric diseases and their genes

Sandra Montagud-romeroAntonio ArmarioLaura I. FurlongFerran SanzLierni Fernández-ibarrondoÀLex BravoJesús GiraldoAdriana FarréMarta TorrensOlga ValverdeEzequiel PerezVincent WarnaultMiguel Angel MayerFrancina FonsecaMaría Del Carmen Blanco-gandíaFrancisco Javier PavónRoser NadalAntonia SerranoMarta Portero-tresserraAlba Gutiérrez-sacristánJordi OrtizMarta Rodríguez-ariasAnna ManéAngela Leis

subject

0301 basic medicinemedia_common.quotation_subjectLibrary scienceMental disordersHealth informaticsGeneral Biochemistry Genetics and Molecular Biology03 medical and health sciences0302 clinical medicinePlatformExcellencePolitical scienceDatabases GeneticGeneticsData MiningHumansData miningData Curationmedia_commonGlobal burdenDisordersData curationbusiness.industryMental DisordersData science3. Good health030104 developmental biologyOriginal ArticleChristian ministryGeneral Agricultural and Biological Sciencesbusiness030217 neurology & neurosurgerySoftwareInformation Systems

description

Psychiatric disorders constitute one of the main causes of disability worldwide. During the past years, considerable research has been conducted on the genetic architecture of such diseases, although little understanding of their etiology has been achieved. The difficulty to access up-to-date, relevant genotype-phenotype information has hampered the application of this wealth of knowledge to translational research and clinical practice in order to improve diagnosis and treatment of psychiatric patients. PsyGeNET (http://www.psygenet.org/) has been developed with the aim of supporting research on the genetic architecture of psychiatric diseases, by providing integrated and structured accessibility to their genotype-phenotype association data, together with analysis and visualization tools. In this article, we describe the protocol developed for the sustainable update of this knowledge resource. It includes the recruitment of a team of domain experts in order to perform the curation of the data extracted by text mining. Annotation guidelines and a web-based annotation tool were developed to support the curators' tasks. A curation workflow was designed including a pilot phase and two rounds of curation and analysis phases. Negative evidence from the literature on gene-disease associ- ations (GDAs) was taken into account in the curation process. We report the results of the application of this workflow to the curation of GDAs for PsyGeNET, including the analysis of the inter-annotator agreement and suggest this model as a suitable approach for the sustainable development and update of knowledge resources. Database URL: http://www.psygenet.org. PsyGeNET corpus: http://www.psygenet.org/ds/PsyGeNET/results/psygenetCorpus.tar We received support from ISCIII-FEDER (PI13/00082, CP10/00524, CPII16/00026), IMI-JU under grants agreements no. 115191 (Open PHACTS)] and no. 115372 (EMIF), resources of which are composed of financial contribution from the EU-FP7 (FP7/2007-2013) and EFPIA companies in kind contribution, and the EU H2020 Programme 2014-2020 under grant agreements no. 634143 (MedBioinformatics) and no. 676559 (Elixir-Excelerate). The Research Programme on Biomedical Informatics (GRIB) is a member of the Spanish National Bioinformatics Institute (INB), PRB2-ISCIII and is supported by grant PT13/0001/0023, of the PE I + D+i 2013-2016, funded by ISCIII and FEDER. MRA, SMR and MCBG are supported RD16/0017/0007; OV, FF and MT are supported by RD16/0017/0010; and AS and FJP are supported by RD16/0017/0001, by Instituto de Salud Carlos III, Red de Trastornos Adictivos (RTA-Retics-ISCIII). AGS acknowledges financial support from the Spanish Ministry of Economy and Competitiveness, through the 'María de Maeztu' Programme for Units of Excellence in R&D (MDM-2014-0370). Funding for open access: EU H2020 Programme 2014-2020 under grant agreements no. 634143 (MedBioinformatics).

http://www.scopus.com/inward/record.url?eid=2-s2.0-84985995034&partnerID=MN8TOARS