Q9ESZ8: General transcription factor II-I
Protein names | General transcription factor II-I; GTFII-I; TFII-I; Bruton tyrosine kinase-associated protein 135; BAP-135; BTK-associated protein 135 |
---|---|
Gene names | Gtf2i |
Organism | Mus musculus |
Protease Family | |
Protease ID | |
Chromosome location | |
UniProt ID | Q9ESZ8 |
4 N-termini - 2 C-termini - 1 Cleavages - 0 Substrates
Sequence
10 20 30 40 50 60
MAQVVMSALP AEDEESSESR MVVTFLMSAL ESMCKELAKS KAEVACIAVY ETDVFVVGTE
70 80 90 100 110 120
RGRAFVNTRK DFQKDFVKYC VEEEEKAAEM HKMKSTTQAN RMSVDAVEIE TLRKTVEDYF
130 140 150 160 170 180
CFCYGKALGK STVVPVPYEK MLRDQSAVVV QGLPEGVAFK HPEHYDLATL KWILENKAGI
190 200 210 220 230 240
SFIIKRPFLE PKKHLGGRVL AAEAERSMLS PSGSCGPIKV KTEPTEDSGI SLEMAAVTVK
250 260 270 280 290 300
EESEDPDYYQ YNIQGPSETD GVDEKLPLSK ALQGSHHSSE GNEGTEVEVP AEDSTQHVPS
310 320 330 340 350 360
ETSEDPEVEV TIEDDDYSPP TKRLKSTEPP PPPPVPEPAN AGKRKVREFN FEKWNARITD
370 380 390 400 410 420
LRKQVEELFE RKYAQAIKAK GPVTIPYPLF QSHVEDLYVE GLPEGIPFRR PSTYGIPRLE
430 440 450 460 470 480
RILLAKERIR FVIKKHELLN STREDLQLDK PASGVKEEWY ARITKLRKMV DQLFCKKFAE
490 500 510 520 530 540
ALGSTEAKAV PYQKFEAHPN DLYVEGLPEN IPFRSPSWYG IPRLEKIIQV GNRIKFVIKR
550 560 570 580 590 600
PELLTHSTTE VTQPRTNTPV KEDWNVRITK LRKQVEEIFN LKFAQALGLT EAVKVPYPVF
610 620 630 640 650 660
ESNPEFLYVE GLPEGIPFRS PTWFGIPRLE RIVRGSNKIK FVVKKPELVV SYLPPGMASK
670 680 690 700 710 720
INTKALQSPK RPRSPGSNSK VPEIEVTVEG PNNSSPQTSA VRTPTQTNGS NVPFKPRGRE
730 740 750 760 770 780
FSFEAWNAKI TDLKQKVENL FNEKCGEALG LKQAVKVPFA LFESFPEDFY VEGLPEGVPF
790 800 810 820 830 840
RRPSTFGIPR LEKILRNKAK IKFIIKKPEM FETAIKESTS SKSPPRKINS SPNVNTTASG
850 860 870 880 890 900
VEDLNIIQVT IPDDDNERLS KVEKARQLRE QVNDLFSRKF GEAIGMGFPV KVPYRKITIN
910 920 930 940 950 960
PGCVVVDGMP PGVSFKAPSY LEISSMRRIL DSAEFIKFTV IRPFPGLVIN NQLVDQNESE
970 980 990
GPVIQESAEA SQLEVPVTEE IKETDGSSQI KQEPDPTW
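The sequence above is laid out as ruler lines (10, 20, 30, ...) alternating with rows of 10-residue groups. A minimal sketch of how such a block could be flattened into one residue string is shown here; the function name `parse_sequence_block` and the variable names are illustrative assumptions, not part of TopFIND or UniProt, and only the last two rows of the record are used as input.

```python
def parse_sequence_block(text: str) -> str:
    """Join a ruler-formatted sequence block into one residue string."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Ruler lines contain only position numbers (10 20 30 ...); skip them.
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue
        residues.extend(tokens)
    return "".join(residues)


# Last two rows of the record above (positions 901 onward).
tail_block = """\
910 920 930 940 950 960
PGCVVVDGMP PGVSFKAPSY LEISSMRRIL DSAEFIKFTV IRPFPGLVIN NQLVDQNESE
970 980 990
GPVIQESAEA SQLEVPVTEE IKETDGSSQI KQEPDPTW
"""

tail = parse_sequence_block(tail_block)
start = 901                    # 1-based position of the first residue in tail_block
end = start + len(tail) - 1    # 1-based position of the last residue
print(end, tail[-5:])          # 998 PDPTW
```

The computed end position and final residues agree with the `...PDPTW` C-terminus listed at position 998 in this record.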
Isoforms
- Isoform 2 of General transcription factor II-I
- Isoform 3 of General transcription factor II-I
- Isoform 4 of General transcription factor II-I
- Isoform 5 of General transcription factor II-I
- Isoform 6 of General transcription factor II-I
N-termini
Name | Sequence | Position | Modification | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|---|
Q9ESZ8-1-unknown | MAQVVM... | 1 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt76633 | |
Q9ESZ8-1-unknown | MAQVVM... | 1 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt76634 | |
Q9ESZ8-1-unknown | MAQVVM... | 1 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt76635 | |
Q9ESZ8-1-unknown | MAQVVM... | 1 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt76636 | |
Q9ESZ8-1-unknown | MAQVVM... | 1 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt76637 | |
Q9ESZ8-2-unknown | MAQVVM... | 1 | | inferred from similarity | unknown | UniProtKB | | inferred from uniprot (by similarity) | |
Q9ESZ8-2-unknown | MAQVVM... | 1 | | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot | |
Q9ESZ8-2-Acetylation | AQVVMS... | 2 | acetylation- | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot | |
Q9ESZ8-2-Acetylation | AQVVMS... | 2 | acetylation- | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt111347 | |
Q9ESZ8-2-Acetylation | AQVVMS... | 2 | acetylation- | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt111348 | |
Q9ESZ8-2-Acetylation | AQVVMS... | 2 | acetylation- | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt111349 | |
Q9ESZ8-2-Acetylation | AQVVMS... | 2 | acetylation- | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt111350 | |
Q9ESZ8-2-Acetylation | AQVVMS... | 2 | acetylation- | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt111351 | |
Q9ESZ8-2-Acetylation | AQVVMS... | 2 | acetylation- | inferred from similarity | unknown | UniProtKB | | inferred from uniprot (by similarity) | |
Q9ESZ8-686-unknown | VTVEGP... | 686 | | inferred from cleavage | unknown | TopFIND | | Inferred from cleavage TC31161 | |
Q9ESZ8-686-unknown | VTVEGP... | 686 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt154988 | |
Q9ESZ8-686-unknown | VTVEGP... | 686 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt155252 | |
Q9ESZ8-686-unknown | VTVEGP... | 686 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt155286 | |
Q9ESZ8-686-unknown | VTVEGP... | 686 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt155475 | |
Q9ESZ8-686-unknown | VTVEGP... | 686 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt155492 | |
C-termini
| Name | Sequence | Position | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
|---|---|---|---|---|---|---|---|---|
| | ...VPEIEV | 685 | inferred from cleavage | unknown | TopFIND | | Inferred from cleavage TC31161 | |
| | ...VPEIEV | 685 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt138415 | |
| | ...VPEIEV | 685 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt138684 | |
| | ...VPEIEV | 685 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt138719 | |
| | ...VPEIEV | 685 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt138911 | |
| | ...VPEIEV | 685 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt138930 | |
| | ...PDPTW | 998 | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot | |
| | ...PDPTW | 998 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt72254 | |
| | ...PDPTW | 998 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt72253 | |
| | ...PDPTW | 998 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt72252 | |
| | ...PDPTW | 998 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt72255 | |
| | ...PDPTW | 998 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt72251 | |
Cleavages
Protease | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|
CATE_MOUSE | 685 | EIEV.\|.VTVE | inferred from experiment | unknown | MEROPS | | Merops CATE_MOUSE -> GTF2I_MOUSE @685 | |
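
The Sequence column of the cleavage table writes the site as two residue windows joined by `.|.`. Assuming that marker denotes the scissile bond (an interpretation of the notation, not something this record states), the following sketch splits the string into its non-prime and prime sides. The function name `split_cleavage_site` is hypothetical.

```python
def split_cleavage_site(site: str) -> tuple[str, str]:
    """Split a site string such as 'EIEV.|.VTVE' into (non-prime side, prime side)."""
    non_prime, marker, prime = site.partition(".|.")
    if not marker:
        # Assumption: every site string carries exactly one '.|.' marker.
        raise ValueError(f"no '.|.' marker found in {site!r}")
    return non_prime, prime


non_prime, prime = split_cleavage_site("EIEV.|.VTVE")
p1 = non_prime[-1]        # residue immediately N-terminal to the marker (P1)
p1_prime = prime[0]       # residue immediately C-terminal to the marker (P1')
print(non_prime, prime, p1, p1_prime)
```

Mapping these windows back onto sequence positions is left out here, because the record does not state how the position 685 is anchored relative to the displayed windows.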
Substrates
Substrate | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|