The following files were retrieved from NCBI https://www.ncbi.nlm.nih.gov/assembly/GCA_002382865.1 Information about this dataset can be found here: https://i5k.nal.usda.gov/bio_data/836519 Information about the functional annotation pipeline can be found here: https://www.mdpi.com/2075-4450/12/8/748 Instruction for running the functional annotation pipeline can be found here: https://agbase-docs.readthedocs.io/en/latest/agbase/workflow.html The file GCA_002382865.1_K63_refined_pacbio_protein.faa was post-processed with agbase/goanna:2.2 (https://hub.docker.com/repository/docker/agbase/goanna) docker container to generate GO functional annotation. GOanna was run with these options: -a,invertebrates; -f,3; -g,70; -k,9; -q,70; -r,1.2. The resulting files are: GCA_002382865.1.asn GCA_002382865.1.gaf.tsv GCA_002382865.1.html GCA_002382865.1.tsv The file GCA_002382865.1_K63_refined_pacbio_protein.faa was post-processed with agbase/interproscan:5.45-80_2 (https://hub.docker.com/repository/docker/agbase/interproscan) docker container to generate GO and pathway functional annotation. The resulting files are: GCA_002382865_acc_go_counts.txt GCA_002382865_acc_interpro_counts.txt GCA_002382865_acc_pathway_counts.txt GCA_002382865.err GCA_002382865_gaf.txt GCA_002382865.gff3 GCA_002382865_go_counts.txt GCA_002382865_interpro_counts.txt GCA_002382865.json GCA_002382865_pathway_counts.txt GCA_002382865.tsv GCA_002382865.xml GCA_002382865.html.tar.gz GCA_002382865.svg.tar.gz The file GCA_002382865.1_K63_refined_pacbio_protein.faa was post-processed with agbase/kobas:3.0.3_0 (https://hub.docker.com/repository/docker/agbase/kobas) docker container to generate pathway functional annotation. The resulting files are: GCA_002382865.1_KOBAS_acc_pathways.tsv GCA_002382865.1_KOBAS_pathways_acc.tsv GCA_002382865.1_KOBAS.txt The GAF output files from InterProScan and GOanna were combined into a single GAF file using agbase/combine_gafs:1.0 (https://hub.docker.com/repository/docker/agbase/combine_gafs) GCA_002382865.1_complete.gaf.tsv