ORF ID ENST00000002596.6:219:231
Gene ID ENSG00000002587
Gene Type protein_coding
Gene Symbol HS3ST1
Genomic Location (Chrom: start-end; Strand) chr4:11400079 11400091 ; -
Orf Length (nt) 12
Transcript Length (nt) 7160
Upstream Length (nt) 219
Orf Type uORF
Start Codon ATG
Phylocsf ENSG00000002587
Kozak Sequence and Score 3
Orf Length FDR 0.892
Nucleotide Sequence ATGTGCCACTGA
Peptide Sequence MCH
Pfam Domain N
TMHMM.2 Domain Y
Alphafold Predicted Structure N

Peptide cellular Location Predicted by Deeploc

Nucleus Cytoplasm Mitochondrion Golgi Apparatus Endoplasmic Reticulum Membrane Extracellular Cell Membrane LysosomeVacuole Peroxisome Plastid

TMHMM transmembrane helices Prediction

Domain Start End
inside 1 3


  • Upstream Length: the distance between the transcript start site and the ORF start codon.
  • Start Codon: the start codon type.
  • Phylocsf: the averaged PhyloCSF score of the ORF sequence.
  • Kozak Score: the Kozak Score of sequence around the start codon.
  • Orf Length FDR: XXX
  • PepScore: the calculated PepScore.
  • Expression Level: the expression levels of translated ORFs across different samples (a link to show the Bar plot).
  • Peptide Location Prediction: the DeepLoc.1 predicted peptide cellular localization (a link to the prediction).
  • Pfam Domain: whether the peptide contains a Pfam domain, (Y: a link to the prediction; N: unavailable).
  • TMHMM.2 Domain: whether the peptide contains a TMHMM domain, (Y: a link to the prediction; N: unavailable).
  • Structure Prediction: whether a structure predicted by Alphafold, (Y: a link to the prediction; N: unavailable).
footer logo

Zhe Ji’s Lab

Feinberg School of Medicine

McCormick School of Engineering