The following files were retrieved from NCBI on 2021-07-22:

https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/013/339/725/GCF_013339725.1_ASM1333972v1/GCF_013339725.1_ASM1333972v1_genomic.gff.gz
https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/013/339/725/GCF_013339725.1_ASM1333972v1/GCF_013339725.1_ASM1333972v1_cds_from_genomic.fna.gz
https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/013/339/725/GCF_013339725.1_ASM1333972v1/GCF_013339725.1_ASM1333972v1_rna_from_genomic.fna.gz
https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/013/339/725/GCF_013339725.1_ASM1333972v1/GCF_013339725.1_ASM1333972v1_translated_cds.faa.gz

Information about this dataset can be found here: https://i5k.nal.usda.gov/bio_data/1141200

Information about the publication of this dataset can be found here: https://doi.org/10.1016/j.cell.2020.07.023
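
For anyone who wants to fetch the same four files again, the following is a minimal sketch, not the procedure originally used. It rebuilds the URLs listed above from the assembly accession (GCF_013339725.1) and assembly name (ASM1333972v1), relying on the directory layout visible in those URLs. The helper names `assembly_urls` and `download_all` and the `data` output directory are illustrative assumptions, and the files on the server may have changed since they were first retrieved.

```python
import urllib.request
from pathlib import Path

# Base path and file suffixes are taken from the URLs listed in this note.
BASE = "https://ftp.ncbi.nlm.nih.gov/genomes/all"
SUFFIXES = [
    "genomic.gff.gz",
    "cds_from_genomic.fna.gz",
    "rna_from_genomic.fna.gz",
    "translated_cds.faa.gz",
]


def assembly_urls(accession, assembly_name):
    """Build the file URLs for one assembly (hypothetical helper)."""
    prefix, rest = accession.split("_")      # "GCF", "013339725.1"
    digits = rest.split(".")[0]              # "013339725"
    # The nine digits are split into three-character directory levels.
    parts = [digits[i:i + 3] for i in range(0, len(digits), 3)]
    stem = f"{accession}_{assembly_name}"
    directory = "/".join([BASE, prefix, *parts, stem])
    return [f"{directory}/{stem}_{suffix}" for suffix in SUFFIXES]


def download_all(urls, dest="data"):
    """Download each URL into dest, keeping the remote file name."""
    dest_dir = Path(dest)
    dest_dir.mkdir(parents=True, exist_ok=True)
    for url in urls:
        target = dest_dir / url.rsplit("/", 1)[-1]
        urllib.request.urlretrieve(url, target)


URLS = assembly_urls("GCF_013339725.1", "ASM1333972v1")
```

Calling `download_all(URLS)` would then save the four compressed files under `data/`; nothing is downloaded until that call is made.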