The following files were retrieved from NCBI https://www.ncbi.nlm.nih.gov/assembly/GCF_001937115.1 Information about this dataset can be found here: https://i5k.nal.usda.gov/node/739196 Information about the functional annotation pipeline can be found here: https://www.mdpi.com/2075-4450/12/8/748 Instruction for running the functional annotation pipeline can be found here: https://agbase-docs.readthedocs.io/en/latest/agbase/workflow.html The file GCF_001937115.1_Atum_1.0_protein.faa was post-processed with agbase/goanna:2.2 (https://hub.docker.com/repository/docker/agbase/goanna) docker container to generate GO functional annotation. GOanna was run with these options: -a,invertebrates; -f,3; -g,70; -k,9; -q,70; -r,1.2. The resulting files are: GCF_001937115.1.asn GCF_001937115.1.gaf.tsv GCF_001937115.1.html GCF_001937115.1.tsv The file GCF_001937115.1_Atum_1.0_protein.faa was post-processed with agbase/interproscan:5.45-80.0_1 (https://hub.docker.com/repository/docker/agbase/interproscan) docker container to generate GO and pathway functional annotation. The resulting files are: GCF_001937115_acc_go_counts.txt GCF_001937115_acc_interpro_counts.txt GCF_001937115_acc_pathway_counts.txt GCF_001937115.err GCF_001937115_gaf.txt GCF_001937115.gff3 GCF_001937115_go_counts.txt GCF_001937115_interpro_counts.txt GCF_001937115.json GCF_001937115_pathway_counts.txt GCF_001937115.tsv GCF_001937115.xml The file GCF_001937115.1_Atum_1.0_protein.faa was post-processed with agbase/kobas:3.0.3_0 (https://hub.docker.com/repository/docker/agbase/kobas) docker container to generate pathway functional annotation. The resulting files are: GCF_001937115.1_KOBAS_acc_pathways.tsv GCF_001937115.1_KOBAS_pathways_acc.tsv GCF_001937115.1_KOBAS.txt The GAF output files from InterProScan and GOanna were combined into a single GAF file using agbase/combine_gafs:1.0 (https://hub.docker.com/repository/docker/agbase/combine_gafs) GCF_001937115.1_complete.gaf.tsv