Where is the metadata?

Contributed by DJ Darwin Bandoy, PhD Candidate

One distinguishing feature of this pandemic is the rapid release of whole genome sequencing data. These sequences are usually uploaded to public databases with minimal accompanying metadata. While collection dates and geographic origin are useful for phylogenetic analysis, more sophisticated analyses require richer metadata, particularly on pathogen virulence and patient risk factors. Without that metadata, we miss valuable insights that whole genome sequences could otherwise provide.

