The following files were retrieved from NCBI:
ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/040/414/725/GCF_040414725.1_ASM4041472v1/GCF_040414725.1_ASM4041472v1_protein.faa.gz

Information about this dataset can be found here:
https://i5k.nal.usda.gov/bio_data/1394619

Information about the functional annotation pipeline can be found here:
https://www.mdpi.com/2075-4450/12/8/748

Instructions for running the functional annotation pipeline can be found here:
https://agbase-docs.readthedocs.io/en/latest/agbase/workflow.html

The file GCF_040414725.1_ASM4041472v1_protein.faa was post-processed with agbase/goanna:2.3 (https://hub.docker.com/repository/docker/agbase/goanna) docker container to generate GO functional annotation. GOanna was run with these options: -a,invertebrates; -f,3; -g,70; -k,9; -q,70; -r,1.2.
The resulting files are:
GCF_040414725.1.asn
GCF_040414725.1.gaf.tsv
GCF_040414725.1.html
GCF_040414725.1.tsv

The file GCF_040414725.1_ASM4041472v1_protein.faa was post-processed with agbase/interproscan:5.63-95_2 (https://hub.docker.com/repository/docker/agbase/interproscan) docker container to generate GO and pathway functional annotation.
The resulting files are:
GCF_040414725_acc_go_counts.txt
GCF_040414725_acc_interpro_counts.txt
GCF_040414725_acc_pathways_counts.txt
GCF_040414725.err
GCF_040414725_gaf.txt
GCF_040414725.gff3
GCF_040414725_go_counts.txt
GCF_040414725_interpro_counts.txt
GCF_040414725_pathways_counts.txt
GCF_040414725.tsv

The file GCF_040414725.1_ASM4041472v1_protein.faa was post-processed with agbase/kobas:3.0.3_3 (https://hub.docker.com/repository/docker/agbase/kobas) docker container to generate pathway functional annotation.
The resulting files are:
GCF_040414725.1_KOBAS_acc_pathways.tsv
GCF_040414725.1_KOBAS_pathways_acc.tsv
GCF_040414725.1_KOBAS.txt

The GAF output files from InterProScan and GOanna were combined into a single GAF file using agbase/combine_gafs:1.1 (https://hub.docker.com/repository/docker/agbase/combine_gafs):
GCF_040414725.1_complete.gaf.tsv
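
As a starting point for working with the combined GAF file, the sketch below collects the GO terms assigned to each protein. It assumes that GCF_040414725.1_complete.gaf.tsv follows the standard tab-separated GAF 2.x layout (header and comment lines starting with "!", object ID in column 2, GO ID in column 5), which this README does not state explicitly. The function names read_gaf and load_gaf are illustrative and are not part of the AgBase containers.

```python
from collections import defaultdict


def read_gaf(lines):
    """Map each object ID (GAF column 2) to the set of GO IDs (GAF column 5).

    Assumes the standard tab-separated GAF 2.x column layout.
    """
    go_by_protein = defaultdict(set)
    for line in lines:
        # Header and comment lines in GAF files start with "!".
        if line.startswith("!"):
            continue
        fields = line.rstrip("\n").split("\t")
        # Skip blank or truncated lines that lack a GO ID column.
        if len(fields) < 5:
            continue
        go_by_protein[fields[1]].add(fields[4])
    return dict(go_by_protein)


def load_gaf(path):
    """Read a GAF file from disk and return the protein-to-GO-ID mapping."""
    with open(path, encoding="utf-8") as handle:
        return read_gaf(handle)
```

Calling load_gaf("GCF_040414725.1_complete.gaf.tsv") would return a dictionary keyed by protein accession. Sets are used so that a GO term reported by both InterProScan and GOanna is counted once per protein.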