Doryteuthis pealeii gene pages Help

Annotation: Dpe Pita v0.3, release December 22, 2023
Go back to gene search page


Gene pages v0.2

Find genes based on gene symbols, JGI IDs (Dpe v2.1), or matches to human or fly (Drosophila melanogaster) proteins (gene symbols). Pita has combined JGI gene symbols if multiple transcripts were merged into a single gene model.

Search is case-sensitive. Human gene symbols are in upper case, Doryteuthis JGI and Pita gene symbols are based on human gene symbols. Fly gene symbols can be capitalized, but often are not (cf. Wikipedia).

Partial search strings may result in more matches; regex special characters (regular expressions) can be used for more elaborate searches (but slashes and white space are not allowed). For example, ^CDH will match CDH genes but not the many PCDH genes, while ^HOX[A,B] and ^HOX.1 will find matches to HOXA and HOXB genes, and HOXB1, HOXA10 and HOXC11 respectively. Note that other gene symbols may still show up in the results if the search string matches the gene symbols in best matches or orthologs.

See release notes (below) for more information on Pita gene annotation.

Search results include the following fields:

  1. Pita v0.3 "combined_symbols" (see release notes)
  2. JGI v2.1 Dopeav ID
  3. JGI v2.1 gene symbol
  4. Best hit in the human proteome (of all open reading frames tested)
  5. Best hit in the fly (D. melanogaster) proteome
  6. Best hit in the bobtails squid (E. scolopes) proteome
  7. Genes in the Doryteuthis paralogs group of human ortholigs
  8. Orthologous human genes

The search may produce results based on matches in any of these fields. In some cases, more than one reading frame produces good hits in the human or fly proteome. This can be due to problems with the gene annotation (for example wrong splice sites, or read-through transcripts) or the genome assembly (sequencing errors, missing genomic fragments), resulting in apparent frame shifts.

For the BLASTP results on the gene pages, only reading frames where the best match has an E-value smaller than 1e-04 are included. The frame is a number [1-3]. If followed by "M", the sequence with the best hit starts with a start codon (usually a substring of the sequence). The best match is not always the "M"-frame, potentially due to imperfect genome assembly or gene models (missing upstream exons or fragmented gene annotations).


Release notes Dpe Pita v0.3

Usage

More information

Known issues

RU
Radboud University Faculty of Science Radboud Institute for Molecular Life Sciences