PGB: A PubMed Graph Benchmark for Heterogeneous Network Representation Learning

Eric W. Lee, Joyce C. Ho

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

There has been rapid growth in biomedical literature, yet capturing the heterogeneity of the bibliographic information of these articles remains relatively understudied. Graph neural networks have gained popularity, however, they may not fully capture the information available in the PubMed database, a biomedical literature repository containing over 33 million articles. We introduce PubMed Graph Benchmark (PGB), a new benchmark dataset for evaluating heterogeneous graph representations. PGB is one of the largest heterogeneous networks to date and aggregates the rich metadata into a unified source including abstract, authors, citations, keywords, and the associated keyword hierarchy. The benchmark contains an evaluation task of 21 systematic review topics, an essential knowledge translation tool.

Original languageEnglish
Title of host publicationCIKM 2023 - Proceedings of the 32nd ACM International Conference on Information and Knowledge Management
PublisherAssociation for Computing Machinery
Pages5331-5335
Number of pages5
ISBN (Electronic)9798400701245
DOIs
StatePublished - Oct 21 2023
Externally publishedYes
Event32nd ACM International Conference on Information and Knowledge Management, CIKM 2023 - Birmingham, United Kingdom
Duration: Oct 21 2023Oct 25 2023

Publication series

NameInternational Conference on Information and Knowledge Management, Proceedings

Conference

Conference32nd ACM International Conference on Information and Knowledge Management, CIKM 2023
Country/TerritoryUnited Kingdom
CityBirmingham
Period10/21/2310/25/23

Funding

Acknowledgements. We thank the reviewers for their insightful suggestions and comments. This work was supported by the National Science Foundation award IIS-2145411.

FundersFunder number
National Science FoundationIIS-2145411

    Keywords

    • Heterogeneous Information Network
    • Network Embedding
    • PubMed Benchmark
    • Systematic Review

    Fingerprint

    Dive into the research topics of 'PGB: A PubMed Graph Benchmark for Heterogeneous Network Representation Learning'. Together they form a unique fingerprint.

    Cite this