P78347: General transcription factor II-I
Protein names | - General transcription factor II-I - GTFII-I - TFII-I - Bruton tyrosine kinase-associated protein 135 - BAP-135 - BTK-associated protein 135 - SRF-Phox1-interacting protein - SPIN - Williams-Beuren syndrome chromosomal region 6 protein |
---|---|
Gene names | GTF2I |
Organism | Homo sapiens |
Protease Family | |
Protease ID | |
Chromosome location | |
UniProt ID | P78347 |
11 N-termini - 4 C-termini - 4 Cleavages - 0 Substrates
Sequence
10 20 30 40 50 60
MAQVAMSTLP VEDEESSESR MVVTFLMSAL ESMCKELAKS KAEVACIAVY ETDVFVVGTE
70 80 90 100 110 120
RGRAFVNTRK DFQKDFVKYC VEEEEKAAEM HKMKSTTQAN RMSVDAVEIE TLRKTVEDYF
130 140 150 160 170 180
CFCYGKALGK STVVPVPYEK MLRDQSAVVV QGLPEGVAFK HPENYDLATL KWILENKAGI
190 200 210 220 230 240
SFIIKRPFLE PKKHVGGRVM VTDADRSILS PGGSCGPIKV KTEPTEDSGI SLEMAAVTVK
250 260 270 280 290 300
EESEDPDYYQ YNIQAGPSET DDVDEKQPLS KPLQGSHHSS EGNEGTEMEV PAEDSTQHVP
310 320 330 340 350 360
SETSEDPEVE VTIEDDDYSP PSKRPKANEL PQPPVPEPAN AGKRKVREFN FEKWNARITD
370 380 390 400 410 420
LRKQVEELFE RKYAQAIKAK GPVTIPYPLF QSHVEDLYVE GLPEGIPFRR PSTYGIPRLE
430 440 450 460 470 480
RILLAKERIR FVIKKHELLN STREDLQLDK PASGVKEEWY ARITKLRKMV DQLFCKKFAE
490 500 510 520 530 540
ALGSTEAKAV PYQKFEAHPN DLYVEGLPEN IPFRSPSWYG IPRLEKIIQV GNRIKFVIKR
550 560 570 580 590 600
PELLTHSTTE VTQPRTNTPV KEDWNVRITK LRKQVEEIFN LKFAQALGLT EAVKVPYPVF
610 620 630 640 650 660
ESNPEFLYVE GLPEGIPFRS PTWFGIPRLE RIVRGSNKIK FVVKKPELVI SYLPPGMASK
670 680 690 700 710 720
INTKALQSPK RPRSPGSNSK VPEIEVTVEG PNNNNPQTSA VRTPTQTNGS NVPFKPRGRE
730 740 750 760 770 780
FSFEAWNAKI TDLKQKVENL FNEKCGEALG LKQAVKVPFA LFESFPEDFY VEGLPEGVPF
790 800 810 820 830 840
RRPSTFGIPR LEKILRNKAK IKFIIKKPEM FETAIKESTS SKSPPRKINS SPNVNTTASG
850 860 870 880 890 900
VEDLNIIQVT IPDDDNERLS KVEKARQLRE QVNDLFSRKF GEAIGMGFPV KVPYRKITIN
910 920 930 940 950 960
PGCVVVDGMP PGVSFKAPSY LEISSMRRIL DSAEFIKFTV IRPFPGLVIN NQLVDQSESE
970 980 990
GPVIQESAEP SQLEVPATEE IKETDGSSQI KQEPDPTW
Isoforms
- Isoform 2 of General transcription factor II-I
- Isoform 3 of General transcription factor II-I
- Isoform 4 of General transcription factor II-I
- Isoform 5 of General transcription factor II-I
N-termini
Name | Sequence | Position | Modification | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMID) |
---|---|---|---|---|---|---|---|---|---|
P78347-2-unknown | MAQVAM... | 1 | | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot | |
P78347-1-unknown | MAQVAM... | 1 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt76630 | |
P78347-1-unknown | MAQVAM... | 1 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt76631 | |
P78347-1-unknown | MAQVAM... | 1 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt76632 | |
P78347-2-Acetylation | AQVAMS... | 2 | acetylation- | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot | |
P78347-2-Acetylation | AQVAMS... | 2 | acetylation- | | COFRADIC | | Gevaert K. | Van Damme P et al.: Complementary positional proteomics for screening substrates... | 20526345 |
P78347-2-Acetylation | AQVAMS... | 2 | acetylation- | | other | | CNRS/ISV | Large_scale_NTA_Human | |
P78347-2-Acetylation | AQVAMS... | 2 | acetylation- | | COFRADIC | | Gevaert K. | Van Damme P et al.: PC3-cells, Complementary positional proteomics for screening substrates... | 20526345 |
P78347-2-Acetylation | AQVAMS... | 2 | acetylation- | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt110992 | |
P78347-2-Acetylation | AQVAMS... | 2 | acetylation- | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt110993 | |
P78347-2-Acetylation | AQVAMS... | 2 | acetylation- | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt110994 | |
P78347-7- | STLPVE... | 7 | | | Subtiligase Based Positive Selection | | Wells | apoptotic_MM1SDBJurkat_Mix | 23264352 |
P78347-7-unknown | STLPVE... | 7 | | inferred from cleavage | unknown | TopFIND | | Inferred from cleavage TC14312 | |
P78347-7-unknown | STLPVE... | 7 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt176984 | |
P78347-7-unknown | STLPVE... | 7 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt176985 | |
P78347-7-unknown | STLPVE... | 7 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt176986 | |
P78347-546-unknown | HSTTEV... | 546 | | inferred from cleavage | unknown | TopFIND | | Inferred from cleavage TC6170 | |
P78347-546-unknown | HSTTEV... | 546 | | inferred from cleavage | unknown | TopFIND | | Inferred from cleavage TC3915 | |
P78347-546-unknown | HSTTEV... | 546 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt153468 | |
P78347-546-unknown | HSTTEV... | 546 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt153679 | |
P78347-546-unknown | HSTTEV... | 546 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt153689 | |
P78347-586-unknown | ALGLTE... | 586 | | inferred from cleavage | unknown | TopFIND | | Inferred from cleavage TC3919 | |
P78347-586-unknown | ALGLTE... | 586 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt153936 | |
P78347-586-unknown | ALGLTE... | 586 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt154171 | |
P78347-586-unknown | ALGLTE... | 586 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt154179 | |
P78347-908- | GMPPGV... | 908 | | | Subtiligase Based Positive Selection | | Wells | apoptotic_Jurkat_bortezomib | 23264352 |
P78347-908- | GMPPGV... | 908 | | | Subtiligase Based Positive Selection | | Wells | apoptotic_MM1s_bort | 23264352 |
P78347-908- | GMPPGV... | 908 | | | Subtiligase Based Positive Selection | | Wells | apoptosis_U266_bortezomib_induced | 23264352 |
P78347-908-unknown | GMPPGV... | 908 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt167453 | |
P78347-908-unknown | GMPPGV... | 908 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt167451 | |
P78347-908-unknown | GMPPGV... | 908 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt167452 | |
P78347-967- | SAEPSQ... | 967 | | | Subtiligase Based Positive Selection | | Wells | apoptotic_MM1s_bort | 23264352 |
P78347-967-unknown | SAEPSQ... | 967 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt175849 | |
P78347-967-unknown | SAEPSQ... | 967 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt175847 | |
P78347-967-unknown | SAEPSQ... | 967 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt175848 | |
C-termini
Name | Sequence | Position | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|
| | ...AQVAMS | 6 | inferred from cleavage | unknown | TopFIND | | Inferred from cleavage TC14312 | |
| | ...PELLTH | 545 | inferred from cleavage | unknown | TopFIND | | Inferred from cleavage TC6170 | |
| | ...PELLTH | 545 | inferred from cleavage | unknown | TopFIND | | Inferred from cleavage TC3915 | |
| | ...PELLTH | 545 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt136891 | |
| | ...PELLTH | 545 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt137102 | |
| | ...PELLTH | 545 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt137112 | |
| | ...LKFAQA | 585 | inferred from cleavage | unknown | TopFIND | | Inferred from cleavage TC3919 | |
| | ...LKFAQA | 585 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt137358 | |
| | ...LKFAQA | 585 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt137591 | |
| | ...LKFAQA | 585 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt137599 | |
| | ...PDPTW | 998 | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot | |
| | ...PDPTW | 998 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt72248 | |
| | ...PDPTW | 998 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt72250 | |
| | ...PDPTW | 998 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt72249 | |
Cleavages
Protease | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|
GRAM_HUMAN | 6 | VAMS.\|.STLP | inferred from experiment | unknown | MEROPS | Bovenschen N | de Poot SA et al.: Human and mouse granzyme M disp... (S01.139) | 21564021 |
CATL1_HUMAN | 545 | LLTH.\|.HSTT | inferred from experiment | unknown | MEROPS | Schilling O | Biniossek ML et al.: Proteomic identification of pro... (C01.060) | 21967108 |
CATB_HUMAN | 545 | LLTH.\|.HSTT | inferred from experiment | unknown | MEROPS | Schilling O | Biniossek ML et al.: Proteomic identification of pro... (C01.060) | 21967108 |
CATB_HUMAN | 585 | FAQA.\|.ALGL | inferred from experiment | unknown | MEROPS | Schilling O | Biniossek ML et al.: Proteomic identification of pro... (C01.060) | 21967108 |
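
In the tables on this page, each cleavage position p appears to be paired with a C-terminus at p and an N-terminus at p + 1 (6 and 7, 545 and 546, 585 and 586). The following is a minimal Python sketch of that position arithmetic, not the database's own method. The function names `neo_termini` and `n_terminal_window` are hypothetical, and `SEQ_PREFIX` holds only the first 60 residues copied from the Sequence section.

```python
# First 60 residues of P78347, copied from the Sequence section of this entry.
SEQ_PREFIX = "MAQVAMSTLPVEDEESSESRMVVTFLMSALESMCKELAKSKAEVACIAVYETDVFVVGTE"


def neo_termini(cleavage_pos):
    """Return (C-terminus position, N-terminus position), both 1-based.

    Assumption: a cleavage listed at position p leaves a C-terminus at p
    and creates an N-terminus at p + 1, as the paired table rows suggest.
    """
    return cleavage_pos, cleavage_pos + 1


def n_terminal_window(seq, n_term_pos, width=6):
    """Return up to `width` residues starting at the 1-based N-terminus position."""
    start = n_term_pos - 1  # convert to 0-based index
    return seq[start:start + width]


if __name__ == "__main__":
    c_pos, n_pos = neo_termini(6)  # the GRAM_HUMAN cleavage listed at position 6
    print(c_pos, n_pos, n_terminal_window(SEQ_PREFIX, n_pos))
```

For the cleavage at position 6 this yields N-terminus position 7 with the residues `STLPVE`, which agrees with the `P78347-7-unknown` rows of the N-termini table.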
Substrates
Substrate | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|