O89090: Transcription factor Sp1
Protein names | Transcription factor Sp1; Specificity protein 1 {ECO:0000303\|PubMed:27918959} |
---|---|
Gene names | Sp1 |
Organism | Mus musculus |
Protease Family | |
Protease ID | |
Chromosome location | |
UniProt ID | O89090 |
4 N-termini - 2 C-termini - 1 Cleavages - 0 Substrates
Sequence
10 20 30 40 50 60
MSDQDHSMDE VTAVVKIEKD VGGNNGGSGN GGGAAFSQTR SSSTGSSSSS GGGGGQESQP
70 80 90 100 110 120
SPLALLAATC SRIESPNENS NNSQGPSQSG GTGELDLTAT QLSQGANGWQ IISSSSGATP
130 140 150 160 170 180
TSKEQSGNST NGSNGSESSK NRTVSGGQYV VAATPNLQNQ QVLTGLPGVM PNIQYQVIPQ
190 200 210 220 230 240
FQTVDGQQLQ FAATGAQVQQ DGSGQIQIIP GANQQIIPNR GSGGNIIAAM PNLLQQAVPL
250 260 270 280 290 300
QGLANNVLSG QTQYVTNVPV ALNGNITLLP VNSVSAATLT PSSQAGTISS SGSQESSSQP
310 320 330 340 350 360
VTSGTAISSA SLVSSQASSS SFFTNANSYS TTTTTSNMGI MNFTSSGSSG TSSQGQTPQR
370 380 390 400 410 420
VGGLQGSDSL NIQQNQTSGG SLQGSQQKEG EQSQQTQQQQ ILIQPQLVQG GQALQALQAA
430 440 450 460 470 480
PLSGQTFTTQ AISQETLQNL QLQAVQNSGP IIIRTPTVGP NGQVSWQTLQ LQNLQVQNPQ
490 500 510 520 530 540
AQTITLAPMQ GVSLGQTSSS NTTLTPIASA ASIPAGTVTV NAAQLSSMPG LQTINLSALG
550 560 570 580 590 600
TSGIQVHQLP GLPLAIANTP GDHGTQLGLH GSGGDGIHDE TAGGEGENSS DLQPQAGRRT
610 620 630 640 650 660
RREACTCPYC KDSEGRASGD PGKKKQHICH IQGCGKVYGK TSHLRAHLRW HTGERPFMCN
670 680 690 700 710 720
WSYCGKRFTR SDELQRHKRT HTGEKKFACP ECPKRFMRSD HLSKHIKTHQ NKKGGPGVAL
730 740 750 760 770 780
SVGTLPLDSG AGSEGTATPS ALITTNMVAM EAICPEGIAR LANSGINVMQ VTELQSINIS
GNGF
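The sequence above is laid out in blocks of ten residues under position rulers. As a minimal sketch (the helper name `parse_blocked_sequence` is hypothetical and not part of TopFIND or UniProt), the layout can be flattened into one string so that 1-based positions from the tables map directly onto string indices. Only the first two rows of the sequence are used here to keep the example short.

```python
def parse_blocked_sequence(lines):
    """Join ten-residue blocks into one string, skipping the numeric ruler lines."""
    residues = []
    for line in lines:
        tokens = line.split()
        # Ruler lines consist only of numbers (10 20 30 ...); skip them and blank lines.
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.extend(tokens)
    return "".join(residues)


# Excerpt: residues 1-120 of the record, copied from the sequence block above.
excerpt = [
    "10 20 30 40 50 60",
    "MSDQDHSMDE VTAVVKIEKD VGGNNGGSGN GGGAAFSQTR SSSTGSSSSS GGGGGQESQP",
    "70 80 90 100 110 120",
    "SPLALLAATC SRIESPNENS NNSQGPSQSG GTGELDLTAT QLSQGANGWQ IISSSSGATP",
]

sequence = parse_blocked_sequence(excerpt)

# Position p (1-based) corresponds to sequence[p - 1].
first_six = sequence[0:6]      # residues 1-6
from_position_2 = sequence[1:7]  # residues 2-7
```

With this indexing, the six residues starting at position 2 are `SDQDHS`, which matches the sequence shown for the acetylated N-terminus at position 2.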
Isoforms
- Isoform 2 of Transcription factor Sp1
10 20 30 40 50 60
MSDQDHSMDE VTAVVKIEKD VGGNNGGSGN GGGAAFSQTR SSSTGSSSSS GGGGGQESQP
70 80 90 100 110 120
SPLALLAATC SRIESPNENS NNSQGPSQSG GTGELDLTAT QLSQGANGWQ IISSSSGATP
130 140 150 160 170 180
TSKEQSGNST NGSNGSESSK NRTVSGGQYV VAATPNLQNQ QVLTGLPGVM PNIQYQVIPQ
190 200 210 220 230 240
FQTVDGQQLQ FAATGAQVQQ DGSGQIQIIP GANQQIIPNR GSGGNIIAAM PNLLQQAVPL
250 260 270 280 290 300
QGLANNVLSG QTQYVTNVPV ALNGNITLLP VNSVSAATLT PSSQAGTISS SGSQESSSQP
310 320 330 340 350 360
VTSGTAISSA SLVSSQASSS SFFTNANSYS TTTTTSNMGI MNFTSSGSSG TSSQGQTPQR
370 380 390 400 410 420
VGGLQGSDSL NIQQNQTSGG SLQGSQQKEG EQSQQTQQQQ ILIQPQLVQG GQALQALQAA
430 440 450 460 470 480
PLSGQTFTTQ AISQETLQNL QLQAVQNSGP IIIRTPTVGP NGQVSWQTLQ LQNLQVQNPQ
490 500 510 520 530 540
AQTITLAPMQ GVSLGQTSSS NTTLTPIASA ASIPAGTVTV NAAQLSSMPG LQTINLSALG
550 560 570 580 590 600
TSGIQVHQLP GLPLAIANTP GDHGTQLGLH GSGGDGIHDE TAGGEGENSS DLQPQAGRRT
610 620 630 640 650 660
RREACTCPYC KDSEGRASGD PGKKKQHICH IQGCGKVYGK TSHLRAHLRW HTGERPFMCN
670 680 690 700 710 720
WSYCGKRFTR SDELQRHKRT HTGEKKFACP ECPKRFMRSD HLSKHIKTHQ NKKGGPGVAL
730 740 750 760 770 780
SVGTLPLDSG AGSEGTATPS ALITTNMVAM EAICPEGIAR LANSGINVMQ VTELQSINIS
GNGF
N-termini
Name | Sequence | Position | Modification | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMID) |
---|---|---|---|---|---|---|---|---|---|
O89090-1-unknown | MSDQDH... | 1 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt90315 | |
O89090-2-unknown | MSDQDH... | 1 | | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot | |
O89090-2-Acetylation | SDQDHS... | 2 | acetylation- | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot | |
O89090-2-Acetylation | SDQDHS... | 2 | acetylation- | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt111072 | |
O89090-344-unknown | TSSGSS... | 344 | | inferred from cleavage | unknown | TopFIND | | Inferred from cleavage TC28694 | |
C-termini
| Name | Sequence | Position | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
|---|---|---|---|---|---|---|---|---|
| | ...GIMNFT | 343 | inferred from cleavage | unknown | TopFIND | | Inferred from cleavage TC28694 | |
| | ...SGNGF | 784 | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot | |
| | ...SGNGF | 784 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt85933 | |
Cleavages
Protease | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|
CATD_MOUSE | 343 | MNFT.\|.TSSG | inferred from experiment | unknown | MEROPS | | Merops CATD_MOUSE -> SP1_MOUSE @343 | |
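
The cleavage at position 343 lines up with the C-terminus listed at 343 and the N-terminus listed at 344. The sketch below shows one way to derive those two positions from a cleavage record. It assumes that the listed cleavage position is the last residue before the cleaved bond; that reading is consistent with the tables here but is an assumption, and the helper names `residue_slice` and `neo_termini` are hypothetical. Only residues 301-360 of the sequence are included, to keep the example short.

```python
# Residues 301-360 of the record, copied from the sequence block (spaces removed).
WINDOW_START = 301
WINDOW = (
    "VTSGTAISSA SLVSSQASSS SFFTNANSYS TTTTTSNMGI MNFTSSGSSG TSSQGQTPQR"
).replace(" ", "")


def residue_slice(start, end):
    """Return residues start..end (1-based, inclusive) from the window."""
    return WINDOW[start - WINDOW_START : end - WINDOW_START + 1]


def neo_termini(cleavage_position):
    """Return (new C-terminus position, new N-terminus position).

    Assumption: cleavage_position is the residue immediately before the
    cleaved bond, so the new N-terminus starts one residue later.
    """
    return cleavage_position, cleavage_position + 1


c_term_pos, n_term_pos = neo_termini(343)

# Label the new N-terminus by its first six residues, as the N-termini table does.
n_term_label = residue_slice(n_term_pos, n_term_pos + 5) + "..."
```

Under that assumption the new N-terminus falls at position 344 and its label is `TSSGSS...`, matching the `O89090-344-unknown` entry.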
Substrates
Substrate | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|