NHGRI Division of Intramural Research

Research Home Page

Hydractinia Genome Project Portal

Home
About the Genome Project
BLAST
Genome Browser
Single-Cell Browser
- Single-Cell Browser (v1)
- Single-Cell Browser (v2)
View a Gene Page
Predicted Protein Structures
- View Protein Structures
- Download Protein Structures
Fetch a Scaffold
Pfam Domains
Developmental Expression
Download Sequences
Database Statistics
Publications
FAQ
Public Domain Notice and Reference

Frequently Asked Questions

BLAST

What version of BLAST is implemented in the Hydractinia BLAST tool?
What BLAST programs are available?
What BLAST databases are available?
What do the boxes in the 'Links' column of the BLAST output represent?

Genome Browser

What tracks are available for viewing in the Hydractinia genome browser?
How can I navigate the Hydractinia genome using JBrowse?
Is the Hydractinia genome browser searchable?
Is there a preferred Web browser I should use to view the Genome Browser?

View a Gene Page

How do I search for a gene using the Hydractinia gene Wiki pages?
What type of information is available in the Hydractinia gene Wiki pages?
How are the phylogenetically informed ortholog clusters determined?

Fetch a Scaffold

How do I download a single genomic scaffold sequence?
Is there a way to download a partial scaffold sequence?
What are the other options for fetching a scaffold sequence?

Pfam Domains

What options are available for searching the Hydractinia genomes for Pfam-A domains?
How were Pfam domains identified in the Hydractinia genomes?
What is the structure of the definition line when sequences are downloaded from the Pfam-A domains page?

Developmental Expression

What is the source of the developmental expression data?

Download Sequences

Where can I download the full Hydractinia genome assemblies?
What is the convention used for naming scaffolds?
How are the gene identifiers generated?
What functional annotation is available?

Database Statistics

Where can I find information regarding Hydractinia data type summaries?

Miscellaneous

How should data derived from the Hydractinia Genome Project Portal be cited?
How can I contact the Web site administrator regarding technical issues?
Where can I get additional information about the Hydractinia Genome Project?
How frequently are the data and portal tools updated?

What version of BLAST is implemented in the Hydractinia BLAST tool?

We use SequenceServer, which implements the NCBI BLAST+ 2.10.1+ command line applications (referred to as the BLAST+ applications). SequenceServer (version 1.0.14) is provided by the Wurm Lab and subject to all Terms & Conditions set forth by the developers.

What BLAST programs are available?

blastn: Compares a nucleotide query sequence against a nucleotide sequence database.
blastp: Compares an amino acid query sequence against a protein sequence database.
blastx: Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.
tblastn: Compares a protein query sequence against a nucleotide sequence database dynamically translated in all reading frames.
tblastx: Compares the six-frame translations of a nucleotide query sequence against the six-frame translations of a nucleotide sequence database.

What BLAST databases are available?

Descriptions of nucleotide and protein databases for both Hydractinia species are listed here: https://research.nhgri.nih.gov/hydractinia/databases.shtml

What do the boxes in the 'Links' column of the BLAST output represent?

The 'Links' column provides clickable hyperlinks for each significant BLAST alignment to their entries in the Hydractinia JBrowse genome browser (B), the scaffold Fetch tool (S), the WikiGene pages (G).

What tracks are available for viewing in the Hydractinia genome browser?

Detailed descriptions of genome browser tracks for both Hydractinia species are listed here: https://research.nhgri.nih.gov/hydractinia/jbrowse/

How can I navigate the Hydractinia genomes using JBrowse?

A user can zoom in or out and left or right on a region of interest by clicking on the appropriate icons centered on the blue toolbar. In addition, available tracks can be viewed (or hidden) by clicking on the appropriate track label on the left sidebar of the browser window.

Is the Hydractinia genome browser searchable?

The genome browser is currently searchable using Hydractinia scaffold (e.g., HyS0100, HyE0002) or gene (e.g., HyS0459.4, HyE0001.10) identifiers.

Is there a preferred Web browser I should use to view the Genome Browser?

The Hydractinia Genome Project Web site was developed and tested using Google Chrome. The rendering of desired output while using other Web browsers, including Firefox, Safari and Internet Explorer, is assumed but not guaranteed.

How do I search for a gene using the Hydractinia gene Wiki pages?

The Hydractinia gene Wiki pages are accessible from the home page left sidebar and are searchable by entering a Hydractinia gene identifier (e.g., HyS0001.4, HyE0208.10) in the search box and clicking ‘Go’.

What type of information is available in the Hydractinia gene Wiki pages?

Each record in the gene Wiki pages represents a single Hydractinia gene and provides the following annotation: nucleotide and protein sequences, coding exonic genomic coordinates, pre-computed BLAST hits from numerous organisms displaying the top hits for each protein, InterProScan data, PFAM domains, functional annotation (gene ontology derived from Argot2.5 and PANNZER2), developmental expression data, related human disease genes from the Online Mendelian Inheritance in Man (OMIM) database, and ortholog clusters inferred using the Orthofinder2 pipeline.

How are the phylogenetically informed ortholog clusters determined?

Orthologous clusters of proteins were inferred using the Orthofinder2 pipeline (Emms and Kelly 2015, 2018).

How do I download a single genomic scaffold sequence?

A user can enter a ScaffoldID (e.g., HyS0325, HyE0030) in the "Fetch Scaffold" textbox to return a single FASTA-formatted Hydractinia scaffold sequence.

Is there a way to download a partial scaffold sequence?

A partial scaffold sequence can be retrieved by entering a ScaffoldID (e.g., HyS0325, HyE0030) while also specifying the relative beginning and ending coordinates in the "Fetch Scaffold" textbox.

What are the other options for fetching a scaffold sequence?

A user can optionally retrieve either a reverse complement or the six-frame translation of a scaffold or partial scaffold by selecting the appropriate "Fetch Scaffold" search option.

What options are available for searching the Hydractinia genomes for Pfam-A domains?

You can elect to search the H. symbiolongicarpus or H. echinata Protein Models, 6-frame translations of scaffolds, or both. You may either select a domain name or domain accession number from the drop-down menus, or enter a domain name or five-digit Pfam accession number in the search box.

How were Pfam domains identified in the Hydractinia genome?

We used hmmscan from the HMMER suite to search the Hydractinia Augustus Protein Models and the six-frame translations of the Hydractinia genomes for domains from the Pfam-A database (version 31). We filtered the hmmscan output using an e-value cutoff of 1e-6. For more information on Pfam, visit http://pfam.xfam.org/.

What is the structure of the definition line when sequences are downloaded from the Pfam-A domains page?

If the "Full-length protein sequences, in FASTA format" option is chosen, the definition line will contain the gene identifier. If the "Pfam-A domains of selected proteins, in FASTA format" option is chosen, the definition line will contain the gene identifier, the coordinate range of the domain in that protein model, and the name of the query domain.

What is the source of the developmental expression data?

Three replicate RNA-seq samples were collected at four different time points during Hydractinia symbiolongicarpus embryonic development: at the 4-cell, 16-cell, and 64-cell stages and from 24-hour larvae. Each replicate consisted of pools of embryos or larvae from a single spawning event resulting from a cross of the 291-10 male strain with the 295-10 female strain. RNA-seq libraries were constructed for each sample for a total of 12 libraries. These libraries were then sequenced on the Illumina HiSeq 4000 platform using indexed 75-base paired end reads. Sequence reads were processed and aligned to the H. symbiolongicarpus gene models, counted, and normalized via MedianNorm (Anders and Huber, 2010).

Where can I download the full Hydractinia genome assemblies?

The full genome assemblies, consisting of 4,840 (HyS) and 7,767 (HyE) scaffolds respectively, are available for download from the sidebar "Download sequences-->Genome Assembly" search option.

What is the convention used for naming scaffolds?

Scaffolds (e.g., HySXXXX) are named as follows:

HyS = Hydractinia symbiolongicarpus and HyE = Hydractinia echinata.
XXXX = padded integer between 0001 and 4840 for HyS and 0001 and 7767 for HyE.

How are the gene identifiers generated?

Genes (e.g., HySXXXX.NNNa) are named as follows:

HySXXXX = corresponds with the scaffold where the gene is located.
NNN = non-padded integer. It is unique in combination with the scaffold ID. It is usually in order of its most 5' position on the scaffold, but is not a requirement. Newly added genes get the next highest unused integer regardless of its position.

What functional annotation is available?

GO term functional annotation is available from both Argot2.5 and PANNZER2 with the best BLAST hit listed in the 'GO-terms' file.

Where can I find information regarding Hydractinia data type summaries?

Clicking the ‘Database Statistics’ link on the Portal side bar displays the Hydractinia database statistics table page detailing the data types (e.g. Gene Models) and total number of database entries for each species.

How should data derived from the Hydractinia Genome Project Portal be cited?

Please cite this Web site:

https://research.nhgri.nih.gov/hydractinia/

How can I contact the Web site administrator regarding technical issues?

Please send any Web site usability or technical correspondence to bioinformatics@nhgri.nih.gov.

Where can I get additional information about the Hydractinia Genome Project?

For additional information, comments, or questions regarding the Hydractinia Genome Project, please contact Dr. Baxevanis directly, at andy@nhgri.nih.gov.

How frequently are the data and portal tools updated?

Changes to the Web site and underlying data are documented in the Release History.

TOP