ORF ID ENST00000002596.6:305:1229
Gene ID ENSG00000002587
Gene Type protein_coding
Gene Symbol HS3ST1
Genomic Location (Chrom: start-end; Strand) chr4:11399081-11400005; -
Orf Length (nt) 924
Transcript Length (nt) 7160
Upstream Length (nt) 305
Orf Type canonical
Start Codon ATG
Phylocsf ENSG00000002587
Kozak Sequence and Score 9
Orf Length FDR 0.001
Nucleotide Sequence ATGGCCGCGCTGCTCCTGGGCGCGGTGCTGCTGGTGGCCCAGCCCCAGCTAGTGCCTTCCCGCCCCGCCGAGCTAGGCCAGCAGGAGCTTCTGCGGAAAGCGGGGACCCTCCAGGATGACGTCCGCGATGGCGTGGCCCCAAACGGCTCTGCCCAGCAGTTGCCGCAGACCATCATCATCGGCGTGCGCAAGGGCGGCACGCGCGCACTGCTGGAGATGCTCAGCCTGCACCCCGACGTGGCGGCCGCGGAGAACGAGGTCCACTTCTTCGACTGGGAGGAGCATTACAGCCACGGCTTGGGCTGGTACCTCAGCCAGATGCCCTTCTCCTGGCCACACCAGCTCACAGTGGAGAAGACCCCCGCGTATTTCACGTCGCCCAAAGTGCCTGAGCGAGTCTACAGCATGAACCCGTCCATCCGGCTGCTGCTCATCCTGCGAGACCCGTCGGAGCGCGTGCTATCTGACTACACCCAAGTGTTCTACAACCACATGCAGAAGCACAAGCCCTACCCGTCCATCGAGGAGTTCCTGGTGCGCGATGGCAGGCTCAATGTGGACTACAAGGCCCTCAACCGCAGCCTCTACCACGTGCACATGCAGAACTGGCTGCGCTTTTTCCCGCTGCGCCACATCCACATTGTGGACGGCGACCGCCTCATCAGGGACCCCTTCCCTGAGATCCAAAAGGTCGAGAGGTTCCTAAAGCTGTCGCCGCAGATCAATGCTTCGAACTTCTACTTTAACAAAACCAAGGGCTTTTACTGCCTGCGGGACAGCGGCCGGGACCGCTGCTTACATGAGTCCAAAGGCCGGGCGCACCCCCAAGTCGATCCCAAACTACTCAATAAACTGCACGAATATTTTCATGAGCCAAATAAGAAGTTCTTCGAGCTTGTTGGCAGAACATTTGACTGGCACTGA
Peptide Sequence MAALLLGAVLLVAQPQLVPSRPAELGQQELLRKAGTLQDDVRDGVAPNGSAQQLPQTIIIGVRKGGTRALLEMLSLHPDVAAAENEVHFFDWEEHYSHGLGWYLSQMPFSWPHQLTVEKTPAYFTSPKVPERVYSMNPSIRLLLILRDPSERVLSDYTQVFYNHMQKHKPYPSIEEFLVRDGRLNVDYKALNRSLYHVHMQNWLRFFPLRHIHIVDGDRLIRDPFPEIQKVERFLKLSPQINASNFYFNKTKGFYCLRDSGRDRCLHESKGRAHPQVDPKLLNKLHEYFHEPNKKFFELVGRTFDWH
Pfam Domain N
TMHMM.2 Domain Y
Alphafold Predicted Structure N
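
The length and coordinate fields of this record can be cross-checked with simple arithmetic. The sketch below is only illustrative: it assumes the ORF ID has the form `transcript:start:end` with a 0-based start offset and an exclusive end (an inference from this record, not a documented format), and that the genomic interval spans the ORF without introns. The helper `parse_orf_id` is a hypothetical name introduced here.

```python
# Field values copied from the record above.
ORF_ID = "ENST00000002596.6:305:1229"
GENOMIC_START, GENOMIC_END = 11399081, 11400005
ORF_LENGTH_NT = 924
UPSTREAM_LENGTH_NT = 305
PEPTIDE_LENGTH_AA = 307  # taken from the TMHMM row of this record (span 1 to 307)


def parse_orf_id(orf_id):
    """Split an ORF ID of the assumed form 'transcript:start:end'."""
    transcript, start, end = orf_id.rsplit(":", 2)
    return transcript, int(start), int(end)


transcript, orf_start, orf_end = parse_orf_id(ORF_ID)

# Each check compares two fields that should agree under the stated assumptions.
checks = {
    "upstream length equals ORF start offset": orf_start == UPSTREAM_LENGTH_NT,
    "ORF ID span equals ORF length": orf_end - orf_start == ORF_LENGTH_NT,
    "genomic span equals ORF length": GENOMIC_END - GENOMIC_START == ORF_LENGTH_NT,
    "ORF length is a whole number of codons": ORF_LENGTH_NT % 3 == 0,
    "codons minus stop equals peptide length": ORF_LENGTH_NT // 3 - 1 == PEPTIDE_LENGTH_AA,
}

for name, ok in checks.items():
    print(f"{'OK  ' if ok else 'FAIL'} {name}")
```

For this record all five checks pass: 1229 - 305 = 924 nt, which is 308 codons, or 307 residues plus the stop codon.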

Peptide Cellular Location Predicted by DeepLoc

Location Probability
Nucleus 0.0032
Cytoplasm 0.0139
Mitochondrion 0.0821
Golgi Apparatus 0.0001
Endoplasmic Reticulum 0.0591
Membrane 0.0243
Extracellular 0.8196
Cell Membrane 0.0014
Lysosome/Vacuole 0.018
Peroxisome 0.0004
Plastid 0.0023
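
The scores above can be reduced to a single top prediction. The sketch below is a minimal illustration, assuming the eleven values pair with the eleven column labels in order and that "Membrane" is a separate membrane-association score and not a compartment (an assumption, supported only by the fact that the other ten values sum to roughly 1). The variable names are illustrative.

```python
# Scores copied from the record above, paired with the labels in order.
scores = {
    "Nucleus": 0.0032,
    "Cytoplasm": 0.0139,
    "Mitochondrion": 0.0821,
    "Golgi Apparatus": 0.0001,
    "Endoplasmic Reticulum": 0.0591,
    "Membrane": 0.0243,
    "Extracellular": 0.8196,
    "Cell Membrane": 0.0014,
    "Lysosome/Vacuole": 0.018,
    "Peroxisome": 0.0004,
    "Plastid": 0.0023,
}

# Assumption: "Membrane" is reported alongside the compartments but is not one.
membrane_score = scores["Membrane"]
compartments = {k: v for k, v in scores.items() if k != "Membrane"}

# The compartment with the highest score is the top prediction.
top_location = max(compartments, key=compartments.get)
compartment_total = sum(compartments.values())

print(f"Top location: {top_location} ({compartments[top_location]:.4f})")
print(f"Sum over compartments: {compartment_total:.4f}")
print(f"Membrane score: {membrane_score:.4f}")
```

Under these assumptions the top prediction is Extracellular, which agrees with the TMHMM result below placing all 307 residues outside the membrane.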

TMHMM Transmembrane Helices Prediction

Domain Start End
outside 1 307

Description

  • Upstream Length: the distance between the transcript start site and the ORF start codon.
  • Start Codon: the start codon type.
  • Phylocsf: the averaged PhyloCSF score of the ORF sequence.
  • Kozak Score: the Kozak score of the sequence around the start codon.
  • Orf Length FDR: XXX
  • PepScore: the calculated PepScore.
  • Expression Level: the expression levels of translated ORFs across different samples (a link to the bar plot).
  • Peptide Location Prediction: the cellular localization of the peptide predicted by DeepLoc.1 (a link to the prediction).
  • Pfam Domain: whether the peptide contains a Pfam domain (Y: a link to the prediction; N: unavailable).
  • TMHMM.2 Domain: whether the peptide contains a TMHMM domain (Y: a link to the prediction; N: unavailable).
  • Structure Prediction: whether a structure has been predicted by AlphaFold (Y: a link to the prediction; N: unavailable).

Zhe Ji’s Lab

Feinberg School of Medicine

McCormick School of Engineering