The following files were retrieved from NCBI https://www.ncbi.nlm.nih.gov/assembly/GCF_000188095.3 Information about this dataset can be found here: https://i5k.nal.usda.gov/bio_data/836556 Information about the functional annotation pipeline can be found here: https://www.mdpi.com/2075-4450/12/8/748 Instruction for running the functional annotation pipeline can be found here: https://agbase-docs.readthedocs.io/en/latest/agbase/workflow.html The file GCF_000188095.3_BIMP_2.2_protein.faa.gz was post-processed with agbase/goanna:2.2 (https://hub.docker.com/repository/docker/agbase/goanna) docker container to generate GO functional annotation. GOanna was run with these options: -a,invertebrates; -f,3; -g,70; -k,9; -q,70; -r,1.2. The resulting files are: GCF_000188095.3.asn GCF_000188095.3.gaf.tsv GCF_000188095.3.html GCF_000188095.3.tsv The file GCF_000188095.3_BIMP_2.2_protein.faa.gz was post-processed with agbase/interproscan:5.45-80.1_0 (https://hub.docker.com/repository/docker/agbase/interproscan) docker container to generate GO and pathway functional annotation. The resulting files are: GCF_000188095_acc_go_counts.txt GCF_000188095_acc_interpro_counts.txt GCF_000188095_acc_pathway_counts.txt GCF_000188095.err GCF_000188095_gaf.txt GCF_000188095.gff3 GCF_000188095_go_counts.txt GCF_000188095_interpro_counts.txt GCF_000188095.json GCF_000188095_pathway_counts.txt GCF_000188095.tsv GCF_000188095.xml The file GCF_000188095.3_BIMP_2.2_protein.faa.gz was post-processed with agbase/kobas:3.0.3_0 (https://hub.docker.com/repository/docker/agbase/kobas) docker container to generate pathway functional annotation. The resulting files are: GCF_000188095.3_KOBAS_acc_pathways.tsv GCF_000188095.3_KOBAS_pathways_acc.tsv GCF_000188095.3_KOBAS.txt The GAF output files from InterProScan and GOanna were combined into a single GAF file using agbase/combine_gafs:1.0 (https://hub.docker.com/repository/docker/agbase/combine_gafs) GCF_000188095.3_complete.gaf.tsv