{Reference Type}: Journal Article {Title}: Dataset of the frequency patterns of publications annotated to human protein-coding genes, their protein products and genetic relevance. {Author}: Zwick M;Kraemer O;Carter AJ; {Journal}: Data Brief {Volume}: 25 {Issue}: 0 {Year}: Aug 2019 暂无{DOI}: 10.1016/j.dib.2019.104284 {Abstract}: We present data concerning the distribution of scientific publications for human protein-coding genes together with their protein products and genetic relevance. We annotated the gene2pubmed dataset Maglott et al., 2007 provided by the NCBI (National Center for Biotechnology Information) with publication years, genetic metadata corresponding to Online Mendelian Inheritance in Man (OMIM) Hamosh et al., 2005 entries and the frequency of their appearance in Genome-Wide Association Studies (GWAS) Buniello et al., 2019 provided by the European Bioinformatics Institute (EBI) using the KNIME® Analytics Platform Berthold et al., 2008. The results of this data integration process comprise two datasets: 1) A dataset containing information on all human protein-coding genes that can be used to analyse the number of scientific publications in context of the potential disease relevance of the individual genes. 2) A table with the annual and cumulated number of PubMed entries. For further interpretation of the data presented in this article, please see the research article 'Target 2035 - probing the human proteome' by Carter et al. https://doi.org/10.1016/j.drudis.2019.06.020 Carter et al., 2019.