An official website of the United States government.

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

The USDA-ARS Ag100Pest Initiative: High-Quality Genome Assemblies for Agricultural Pest Arthropod Research.

    Summary
    Publication Type
    Journal Article
    Abstract

    The phylum Arthropoda includes species crucial for ecosystem stability, soil health, crop production, and others that present obstacles to crop and animal agriculture. The United States Department of Agriculture's Agricultural Research Service initiated the Ag100Pest Initiative to generate reference genome assemblies of arthropods that are (or may become) pests to agricultural production and global food security. We describe the project goals, process, status, and future. The first three years of the project were focused on species selection, specimen collection, and the construction of lab and bioinformatics pipelines for the efficient production of assemblies at scale. Contig-level assemblies of 47 species are presented, all of which were generated from single specimens. Lessons learned and optimizations leading to the current pipeline are discussed. The project name implies a target of 100 species, but the efficiencies gained during the project have supported an expansion of the original goal and a total of 158 species are currently in the pipeline. We anticipate that the processes described in the paper will help other arthropod research groups or other consortia considering genome assembly at scale.

    Citation
    Childers AK, Geib SM, Sim SB, Poelchau MF, Coates BS, Simmonds TJ, Scully ED, Smith TPL, Childers CP, Corpuz RL, Hackett K, Scheffler B. The USDA-ARS Ag100Pest Initiative: High-Quality Genome Assemblies for Agricultural Pest Arthropod Research.. Insects. 2021 Jul 09; 12(7).
    Publication Date
    2021 Jul 09
    DOI
    10.3390/insects12070626
    Authors
    Childers AK, Geib SM, Sim SB, Poelchau MF, Coates BS, Simmonds TJ, Scully ED, Smith TPL, Childers CP, Corpuz RL, Hackett K, Scheffler B
    Cross Reference
    Database Accession
    PMID 34357286
    Analyses
    Name Program
    Ceratitis capitata genome assembly Ccap_2.1 (GCF_000347755.3) AllPaths v. 35218; ATLAS-link v. 1.0; ATLAS-gapfill v. 2.2; redundans v. 0.12c
    Tribolium castaneum genome assembly icTriCast1.1 (GCF_031307605.1) HiFiASM
    Schistocerca americana genome assembly iqSchAmer2.1 (GCF_021461395.2) HiFiASM v. 0.15.4; 3D-DNA v. 210817; Juicebox Assembly Tools v. 1.11
    Anabrus simplex genome assembly ASM4041472v1 (GCF_040414725.1) HiFiASM
    Ornithodoros turicata genome assembly ASM3712646v1 (GCF_037126465.1) HiFiASM
    Neodiprion fabricii genome assembly iyNeoFabr1.1 (GCF_021155785.1) HiFiASM v. 0.16.1-r375; Juicebox Assembly Toolkit v. 1.11
    Neodiprion pinetum genome assembly iyNeoPine1.1 (GCF_021155775.1) HiFiASM v. 0.16.1-r375; Juicebox Assembly Toolkit v. 1.11
    Helicoverpa zea genome assembly ilHelZeax1 (GCF_022581195.2) FALCON v. 1.8.1; FALCON-Unzip v. 1.3.7; purge_dups v. 1.2.5; bwa-mem v. 2.2.1; Juicebox v. 1.11.08; Arrow gcpp v. 2.0.2; FreeBayes v. 1.0.2; Merqury v. 1.1
    NCBI Anabrus simplex Annotation Release GCF_040414725.1-RS_2024_09 NCBI Eukaryotic Genome Annotation Pipeline
    Plodia interpunctella genome assembly ilPloInte3.2 (GCF_027563975.2) HiFiAdapterFilt v. 2.0.0; HiFiASM v. 0.16.1; YaHS v. 1.1; Juicebox v. 1.11.08
    Zeugodacus cucurbitae genome assembly idZeuCucr1.2 (GCF_028554725.1) HiFiASM
    Bactrocera dorsalis genome assembly ASM2337382v1 (GCF_023373825.1) NextDenovo
    Schistocerca gregaria genome assembly iqSchGreg1.2 (GCF_023897955.1) HiFiASM
    Neodiprion lecontei genome assembly iyNeoLeco1.1 (GCF_021901455.1) HiFiASM v. 0.16.1-r375; Juicebox Assembly Toolkit v. 1.11
    Schistocerca piceifrons genome assembly iqSchPice1.1 (GCF_021461385.2) HiFiASM v. 0.15.4; 3D-DNA v. 210817; Juicebox Assembly Tools v. 1.11
    Schistocerca nitens genome assembly iqSchNite1.1 GCF_023898315.1 HiFiASM v. 0.15.4; 3D-DNA v. 210817; Juicebox Assembly Tools v. 1.11
    Schistocerca serialis cubense genome assembly iqSchSeri2.2 (GCF_023864345.2) HiFiASM v. 0.15.4; 3D-DNA v. 210817; Juicebox Assembly Tools v. 1.11
    Anthonomus grandis grandis genome assembly icAntGran1.3 (GCF_022605725.1) HiFiASM
    Diprion similis genome assembly iyDipSimi1.1 (GCF_021155765.1) HiFiASM v. 0.16.1-r375; Juicebox Assembly Toolkit v. 1.11
    Bombus huntii genome assembly iyBomHunt1.1 (GCF_024542735.1) HiFiASM
    Schistocerca cancellata genome assembly iqSchCanc2.1 GCF_023864275.1 HiFiASM v. 0.15.4; 3D-DNA v. 210817; Juicebox Assembly Tools v. 1.11
    Dermacentor andersoni genome assembly qqDerAnde1.2 (GCF_023375885.1) HiFiASM
    Aethina tumida genome assembly icAetTumi1.1 (GCF_024364675.1) HiFiASM
    Plodia interpunctella genome assembly ilPloInte3.1 (GCF_027563975.1) HiFiAdapterFilt v. 2.0.0; HiFiASM v. 0.16.1; YaHS v. 1.1; Juicebox v. 1.11.08
    Pectinophora gossypiella genome assembly ilPecGoss1.1 (GCF_024362695.1) FALCON v. 1.8.1; FALCON-Unzip v. 1.3.7; purge_dups v. 1.2.5; bwa-mem v. 2.2.1; YaHS v. 1.0; Juicebox v. 1.11.08; Arrow gcpp v. 2.0.2; FreeBayes v. 1.0.2; Merqury v. 1.1
    Anastrepha ludens genome assembly idAnaLude1.1 (GCF_028408465.1) HiFiASM
    Anastrepha obliqua genome assembly idAnaObli1_1.0 (GCF_027943255.1) HiFiASM
    Vespa mandarinia genome assembly V.mandarinia_Nanaimo_p1.0 (GCF_014083535.2) IPA
    Diorhabda carinulata genome assembly icDioCari1.1 (GCF_026250575.1) HiFiAdapterFilt v. 2.0.0; HiFiASM v. 0.16.1; YaHS v. 1.1; Juicebox v. 1.11.08 Additional genomes Browse all Diorhabda carinulata genomes (3) BioProject PRJNA788877 Diorhabda carinulata genome sequencing,
    Microplitis mediator genome assembly iyMicMedi2.1 (GCF_029852145.1) HiFiASM
    Microplitis demolitor genome assembly iyMicDemo2.1a (GCF_026212275.2) HiFiASM
    Diorhabda sublineata genome assembly icDioSubl1.1 (GCF_026230105.1) HiFiAdapterFilt v. 2.0.0; HiFiASM v. 0.16.1; YaHS v. 1.1; Juicebox v. 1.11.08
    Cylas formicarius genome assembly icCylForm1.1 (GCF_029955315.1) HiFiASM
    Cydia pomonella genome assembly ilCydPomo1 (GCF_033807575.1) FALCON
    Amyelois transitella genome assembly ilAmyTran1.1 (GCF_032362555.1) HiFiASM
    Diachasmimorpha longicaudata genome assembly iyDiaLong2 (GCF_034640455.1) hifiasm
    Vanessa tameamea genome assembly ilVanTame1 primary haplotype (GCF_037043105.1) HiFiASM