ORF ID ENST00000001146.7:29:1568
Gene ID ENSG00000003137
Gene Type protein_coding
Gene Symbol CYP26B1
Genomic Location (Chrom: start-end; Strand) chr2:72132226 72147834 ; -
Orf Length (nt) 1539
Transcript Length (nt) 4556
Upstream Length (nt) 29
Orf Type canonical
Start Codon ATG
Phylocsf ENSG00000003137
Kozak Sequence and Score 9
Orf Length FDR 0.001
Nucleotide Sequence ATGCTCTTTGAGGGCTTGGATCTGGTGTCGGCGCTGGCCACCCTCGCCGCGTGCCTGGTGTCCGTGACGCTGCTGCTGGCCGTGTCGCAGCAGCTGTGGCAGCTGCGCTGGGCCGCCACTCGCGACAAGAGCTGCAAGCTGCCCATCCCCAAGGGATCCATGGGCTTCCCGCTCATCGGAGAGACCGGCCACTGGCTGCTGCAGGGTTCTGGCTTCCAGTCGTCGCGGAGGGAGAAGTATGGCAACGTGTTCAAGACGCATTTGTTGGGGCGGCCGCTGATACGCGTGACCGGCGCGGAGAACGTGCGCAAGATCCTCATGGGCGAGCACCACCTCGTGAGCACCGAGTGGCCTCGCAGCACCCGCATGTTGCTGGGCCCCAACACGGTGTCCAATTCCATTGGCGACATCCACCGCAACAAGCGCAAGGTCTTCTCCAAGATCTTCAGCCACGAGGCCCTGGAGAGTTACCTGCCCAAGATCCAGCTGGTGATCCAGGACACACTGCGCGCCTGGAGCAGCCACCCCGAGGCCATCAACGTGTACCAGGAGGCGCAGAAGCTGACCTTCCGCATGGCCATCCGGGTGCTGCTGGGCTTCAGCATCCCTGAGGAGGACCTTGGGCACCTCTTTGAGGTCTACCAGCAGTTTGTGGACAATGTCTTCTCCCTGCCTGTCGACCTGCCCTTCAGTGGCTACCGGCGGGGCATTCAGGCTCGGCAGATCCTGCAGAAGGGGCTGGAGAAGGCCATCCGGGAGAAGCTGCAGTGCACACAGGGCAAGGACTACTTGGACGCCCTGGACCTCCTCATTGAGAGCAGCAAGGAGCACGGGAAGGAGATGACCATGCAGGAGCTGAAGGACGGGACCCTGGAGCTGATCTTTGCGGCCTATGCCACCACGGCCAGCGCCAGCACCTCACTCATCATGCAGCTGCTGAAGCACCCCACTGTGCTGGAGAAGCTGCGGGATGAGCTGCGGGCTCATGGCATCCTGCACAGTGGCGGCTGCCCCTGCGAGGGCACACTGCGCCTGGACACGCTCAGTGGGCTGCGCTACCTGGACTGCGTCATCAAGGAGGTCATGCGCCTGTTCACGCCCATTTCCGGCGGCTACCGCACTGTGCTGCAGACCTTCGAGCTTGATGGTTTCCAGATCCCCAAAGGCTGGAGTGTCATGTATAGCATCCGGGACACCCATGACACAGCGCCCGTGTTCAAAGACGTGAACGTGTTCGACCCCGATCGCTTCAGCCAGGCGCGGAGCGAGGACAAGGATGGCCGCTTCCATTACCTCCCGTTCGGTGGCGGTGTCCGGACCTGCCTGGGCAAGCACCTGGCCAAGCTGTTCCTGAAGGTGCTGGCGGTGGAGCTGGCTAGCACCAGCCGCTTTGAGCTGGCCACACGGACCTTCCCCCGCATCACCTTGGTCCCCGTCCTGCACCCCGTGGATGGCCTCAGCGTCAAGTTCTTTGGCCTGGACTCCAACCAGAACGAGATCCTGCCGGAGACGGAGGCCATGCTGAGCGCCACAGTCTAA
Peptide Sequence MLFEGLDLVSALATLAACLVSVTLLLAVSQQLWQLRWAATRDKSCKLPIPKGSMGFPLIGETGHWLLQGSGFQSSRREKYGNVFKTHLLGRPLIRVTGAENVRKILMGEHHLVSTEWPRSTRMLLGPNTVSNSIGDIHRNKRKVFSKIFSHEALESYLPKIQLVIQDTLRAWSSHPEAINVYQEAQKLTFRMAIRVLLGFSIPEEDLGHLFEVYQQFVDNVFSLPVDLPFSGYRRGIQARQILQKGLEKAIREKLQCTQGKDYLDALDLLIESSKEHGKEMTMQELKDGTLELIFAAYATTASASTSLIMQLLKHPTVLEKLRDELRAHGILHSGGCPCEGTLRLDTLSGLRYLDCVIKEVMRLFTPISGGYRTVLQTFELDGFQIPKGWSVMYSIRDTHDTAPVFKDVNVFDPDRFSQARSEDKDGRFHYLPFGGGVRTCLGKHLAKLFLKVLAVELASTSRFELATRTFPRITLVPVLHPVDGLSVKFFGLDSNQNEILPETEAMLSATV
Pfam Domain N
TMHMM.2 Domain Y
Alphafold Predicted Structure N

Peptide cellular Location Predicted by Deeploc

Nucleus Cytoplasm Mitochondrion Golgi Apparatus Endoplasmic Reticulum Membrane Extracellular Cell Membrane LysosomeVacuole Peroxisome Plastid
0.0006 0.0015 0.0637 0.0266 0.7776 0.952 0.009 0.0822 0.0053 0.0044 0.029

TMHMM transmembrane helices Prediction

Domain Start End
outside 1 512

Description

  • Upstream Length: the distance between the transcript start site and the ORF start codon.
  • Start Codon: the start codon type.
  • Phylocsf: the averaged PhyloCSF score of the ORF sequence.
  • Kozak Score: the Kozak Score of sequence around the start codon.
  • Orf Length FDR: XXX
  • PepScore: the calculated PepScore.
  • Expression Level: the expression levels of translated ORFs across different samples (a link to show the Bar plot).
  • Peptide Location Prediction: the DeepLoc.1 predicted peptide cellular localization (a link to the prediction).
  • Pfam Domain: whether the peptide contains a Pfam domain, (Y: a link to the prediction; N: unavailable).
  • TMHMM.2 Domain: whether the peptide contains a TMHMM domain, (Y: a link to the prediction; N: unavailable).
  • Structure Prediction: whether a structure predicted by Alphafold, (Y: a link to the prediction; N: unavailable).
footer logo

Zhe Ji’s Lab

Feinberg School of Medicine

McCormick School of Engineering