SARS-CoV2 Docking Dataset for MLMol Language Model (50M)

  • Aristeidis (aris) Tsaris (Creator)
  • John Gounley (Creator)
  • Andrew E. Blanchard (Creator)

Dataset

Description

This is a processed molecular dataset from this https://doi.ccs.ornl.gov/ui/doi/348 adding up to 50M molecules for the training and 486K molecules for the validation. Instructions on how to use/run/train this dataset can be found here: https://code.ornl.gov/candle/mlmol

Cite this