The following files were retrieved from NCBI GCF_000671375.1_Cexi_2.0_protein.faa Information about this dataset can be found here: https://i5k.nal.usda.gov/bio_data/836497 Information about the functional annotation pipeline can be found here: https://www.mdpi.com/2075-4450/12/8/748 Instruction for running the functional annotation pipeline can be found here: https://agbase-docs.readthedocs.io/en/latest/agbase/workflow.html The file GCF_000671375.1_Cexi_2.0_protein.faa was post-processed with agbase/goanna:2.3 (https://hub.docker.com/repository/docker/agbase/goanna) docker container to generate GO functional annotation. GOanna was run with these options: -a,invertebrates; -f,3; -g,70; -k,9; -q,70; -r,1.2. The resulting files are: GCF_000671375.1.asn GCF_000671375.1.gaf.tsv GCF_000671375.1.html GCF_000671375.1.tsv The file GCF_000671375.1_Cexi_2.0_protein.faa was post-processed with agbase/interproscan:5.63-95 (https://hub.docker.com/repository/docker/agbase/interproscan) docker container to generate GO and pathway functional annotation. The resulting files are: GCF_000671375_acc_go_counts.txt GCF_000671375_acc_interpro_counts.txt GCF_000671375.err GCF_000671375_gaf.txt GCF_000671375.gff3 GCF_000671375_go_counts.txt GCF_000671375_interpro_counts.txt GCF_000671375.tsv The file GCF_000671375.1_Cexi_2.0_protein.faa was post-processed with agbase/kobas:3.0.3_3 (https://hub.docker.com/repository/docker/agbase/kobas) docker container to generate pathway functional annotation. The resulting files are: GCF_000671375.1_KOBAS_acc_pathways.tsv GCF_000671375.1_KOBAS_pathways_acc.tsv GCF_000671375.1_KOBAS.txt The GAF output files from InterProScan and GOanna were combined into a single GAF file using agbase/combine_gafs:1.1 (https://hub.docker.com/repository/docker/agbase/combine_gafs) GCF_000671375.1_complete.gaf.tsv