An official website of the United States government.

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

Ephemera danica annotations ephdan_OGSv1.0

    Summary
    Type
    Genome annotation
    Name
    Ephemera danica annotations ephdan_OGSv1.0
    Description

    This dataset presents the Ephemera danica Official Gene Set (OGS) v1.0. The OGS is an integration of automatic gene predictions from Ephemera danica genome annotations v0.5.3 (https://10.15482/USDA.ADC/1503792(link is external)), with manual annotations by the research community (https://data.nal.usda.gov/dataset/ephemera-danica-manual-annotations-genome-assembly-edan10, performed via the Apollo manual curation software, http://genomearchitect.org/(link is external)). Manual and automated annotations were lifted over from genome assembly Ephemera danica genome assembly v1.0 (https://10.15482/USDA.ADC/1503791(link is external)) to genome assembly Edan_2.0 (https://www.ncbi.nlm.nih.gov/assembly/GCA_000507165.2(link is external)) using the coordinates_conversion and remap-gff3 programs (https://github.com/NAL-i5K/coordinates_conversion/(link is external); https://github.com/NAL-i5K/remap-gff3(link is external)). 1,035 annotations were removed from the original datasets during this process, due to changes in the new genome assembly, or due to problems with the original gene models.

    Protein pages for the manual annotations can be accessed at NCBI: https://www.ncbi.nlm.nih.gov/protein?LinkName=nuccore_protein_wgs&from_uid=1305557219

    The full dataset is accessible at the Ag Data Commons: https://doi.org/10.15482/USDA.ADC/1518589

    Program, Pipeline, Workflow or Method Name
    MAKER2, manual annotations, GFF3toolkit, remap-gff3
    Program Version
    NA
    Source Name
    Ephemera danica genome assembly Edan_2.0 (GCA_000507165.2)
    Data Source URI
    Organism
    Publication