Q86UP8: General transcription factor II-I repeat domain-containing protein 2A
Protein names | - General transcription factor II-I repeat domain-containing protein 2A - GTF2I repeat domain-containing protein 2A - Transcription factor GTF2IRD2-alpha |
---|---|
Gene names | GTF2IRD2 |
Organism | Homo sapiens |
Protease Family | |
Protease ID | |
Chromosome location | |
UniProt ID | Q86UP8 |
2
N-termini
2
C-termini
0
Cleavages
0
Substrates
Sequence
10 20 30 40 50 60
MAQVAVSTLP VEEESSSETR MVVTFLVSAL ESMCKELAKS KAEVACIAVY ETDVFVVGTE
70 80 90 100 110 120
RGCAFVNART DFQKDFAKYC VAEGLCEVKP PCPVNGMQVH SGETEILRKA VEDYFCFCYG
130 140 150 160 170 180
KALGTTVMVP VPYEKMLRDQ SAVVVQGLPE GVAFQHPENY DLATLKWILE NKAGISFIIN
190 200 210 220 230 240
RPFLGPESQL GGPGMVTDAE RSIVSPSESC GPINVKTEPM EDSGISLKAE AVSVKKESED
250 260 270 280 290 300
PNYYQYNMQG SHPSSTSNEV IEMELPMEDS TPLVPSEEPN EDPEAEVKIE GNTNSSSVTN
310 320 330 340 350 360
SAAGVEDLNI VQVTVPDNEK ERLSSIEKIK QLREQVNDLF SRKFGEAIGV DFPVKVPYRK
370 380 390 400 410 420
ITFNPGCVVI DGMPPGVVFK APGYLEISSM RRILEAAEFI KFTVIRPLPG LELSNVGKRK
430 440 450 460 470 480
IDQEGRVFQE KWERAYFFVE VQNIPTCLIC KQSMSVSKEY NLRRHYQTNH SKHYDQYMER
490 500 510 520 530 540
MRDEKLHELK KGLRKYLLGS SDTECPEQKQ VFANPSPTQK SPVQPVEDLA GNLWEKLREK
550 560 570 580 590 600
IRSFVAYSIA IDEITDINNT TQLAIFIRGV DENFDVSEEL LDTVPMTGTK SGNEIFSRVE
610 620 630 640 650 660
KSLKNFCIDW SKLVSVASTG TPAMVDANNG LVTKLKSRVA TFCKGAELKS ICCIIHPESL
670 680 690 700 710 720
CAQKLKMDHV MDVVVKSVNW ICSRGLNHSE FTTLLYELDS QYGSLLYYTE IKWLSRGLVL
730 740 750 760 770 780
KRFFESLEEI DSFMSSRGKP LPQLSSIDWI RDLAFLVDMT MHLNALNISL QGHSQIVTQM
790 800 810 820 830 840
YDLIRAFLAK LCLWETHLTR NNLAHFPTLK LASRNESDGL NYIPKIAELQ TEFQKRLSDF
850 860 870 880 890 900
KLYESELTLF SSPFSTKIDS VHEELQMEVI DLQCNTVLKT KYDKVGIPEF YKYLWGSYPK
910 920 930 940
YKHHCAKILS MFGSTYICEQ LFSIMKLSKT KYCSQLKDSQ WDSVLHIAT
Isoforms
- Isoform 2 of General transcription factor II-I repeat domain-containing protein 2A - Isoform 3 of General transcription factor II-I repeat domain-containing protein 2A - Isoform 4 of General transcription factor II-I repeat domain-containing protein 2A - Isoform 5 of General transcription factor II-I repeat domain-containing protein 2A - Isoform 6 of General transcription factor II-I repeat domain-containing protein 2A - Isoform 6 of General transcription factor II-I repeat domain-containing protein 2ASequence View
10 20 30 40 50 60
MAQVAVSTLP VEEESSSETR MVVTFLVSAL ESMCKELAKS KAEVACIAVY ETDVFVVGTE
70 80 90 100 110 120
RGCAFVNART DFQKDFAKYC VAEGLCEVKP PCPVNGMQVH SGETEILRKA VEDYFCFCYG
130 140 150 160 170 180
KALGTTVMVP VPYEKMLRDQ SAVVVQGLPE GVAFQHPENY DLATLKWILE NKAGISFIIN
190 200 210 220 230 240
RPFLGPESQL GGPGMVTDAE RSIVSPSESC GPINVKTEPM EDSGISLKAE AVSVKKESED
250 260 270 280 290 300
PNYYQYNMQG SHPSSTSNEV IEMELPMEDS TPLVPSEEPN EDPEAEVKIE GNTNSSSVTN
310 320 330 340 350 360
SAAGVEDLNI VQVTVPDNEK ERLSSIEKIK QLREQVNDLF SRKFGEAIGV DFPVKVPYRK
370 380 390 400 410 420
ITFNPGCVVI DGMPPGVVFK APGYLEISSM RRILEAAEFI KFTVIRPLPG LELSNVGKRK
430 440 450 460 470 480
IDQEGRVFQE KWERAYFFVE VQNIPTCLIC KQSMSVSKEY NLRRHYQTNH SKHYDQYMER
490 500 510 520 530 540
MRDEKLHELK KGLRKYLLGS SDTECPEQKQ VFANPSPTQK SPVQPVEDLA GNLWEKLREK
550 560 570 580 590 600
IRSFVAYSIA IDEITDINNT TQLAIFIRGV DENFDVSEEL LDTVPMTGTK SGNEIFSRVE
610 620 630 640 650 660
KSLKNFCIDW SKLVSVASTG TPAMVDANNG LVTKLKSRVA TFCKGAELKS ICCIIHPESL
670 680 690 700 710 720
CAQKLKMDHV MDVVVKSVNW ICSRGLNHSE FTTLLYELDS QYGSLLYYTE IKWLSRGLVL
730 740 750 760 770 780
KRFFESLEEI DSFMSSRGKP LPQLSSIDWI RDLAFLVDMT MHLNALNISL QGHSQIVTQM
790 800 810 820 830 840
YDLIRAFLAK LCLWETHLTR NNLAHFPTLK LASRNESDGL NYIPKIAELQ TEFQKRLSDF
850 860 870 880 890 900
KLYESELTLF SSPFSTKIDS VHEELQMEVI DLQCNTVLKT KYDKVGIPEF YKYLWGSYPK
910 920 930 940
YKHHCAKILS MFGSTYICEQ LFSIMKLSKT KYCSQLKDSQ WDSVLHIAT
10 20 30 40 50 60
MAQVAVSTLP VEEESSSETR MVVTFLVSAL ESMCKELAKS KAEVACIAVY ETDVFVVGTE
70 80 90 100 110 120
RGCAFVNART DFQKDFAKYC VAEGLCEVKP PCPVNGMQVH SGETEILRKA VEDYFCFCYG
130 140 150 160 170 180
KALGTTVMVP VPYEKMLRDQ SAVVVQGLPE GVAFQHPENY DLATLKWILE NKAGISFIIN
190 200 210 220 230 240
RPFLGPESQL GGPGMVTDAE RSIVSPSESC GPINVKTEPM EDSGISLKAE AVSVKKESED
250 260 270 280 290 300
PNYYQYNMQG SHPSSTSNEV IEMELPMEDS TPLVPSEEPN EDPEAEVKIE GNTNSSSVTN
310 320 330 340 350 360
SAAGVEDLNI VQVTVPDNEK ERLSSIEKIK QLREQVNDLF SRKFGEAIGV DFPVKVPYRK
370 380 390 400 410 420
ITFNPGCVVI DGMPPGVVFK APGYLEISSM RRILEAAEFI KFTVIRPLPG LELSNVGKRK
430 440 450 460 470 480
IDQEGRVFQE KWERAYFFVE VQNIPTCLIC KQSMSVSKEY NLRRHYQTNH SKHYDQYMER
490 500 510 520 530 540
MRDEKLHELK KGLRKYLLGS SDTECPEQKQ VFANPSPTQK SPVQPVEDLA GNLWEKLREK
550 560 570 580 590 600
IRSFVAYSIA IDEITDINNT TQLAIFIRGV DENFDVSEEL LDTVPMTGTK SGNEIFSRVE
610 620 630 640 650 660
KSLKNFCIDW SKLVSVASTG TPAMVDANNG LVTKLKSRVA TFCKGAELKS ICCIIHPESL
670 680 690 700 710 720
CAQKLKMDHV MDVVVKSVNW ICSRGLNHSE FTTLLYELDS QYGSLLYYTE IKWLSRGLVL
730 740 750 760 770 780
KRFFESLEEI DSFMSSRGKP LPQLSSIDWI RDLAFLVDMT MHLNALNISL QGHSQIVTQM
790 800 810 820 830 840
YDLIRAFLAK LCLWETHLTR NNLAHFPTLK LASRNESDGL NYIPKIAELQ TEFQKRLSDF
850 860 870 880 890 900
KLYESELTLF SSPFSTKIDS VHEELQMEVI DLQCNTVLKT KYDKVGIPEF YKYLWGSYPK
910 920 930 940
YKHHCAKILS MFGSTYICEQ LFSIMKLSKT KYCSQLKDSQ WDSVLHIAT
10 20 30 40 50 60
MAQVAVSTLP VEEESSSETR MVVTFLVSAL ESMCKELAKS KAEVACIAVY ETDVFVVGTE
70 80 90 100 110 120
RGCAFVNART DFQKDFAKYC VAEGLCEVKP PCPVNGMQVH SGETEILRKA VEDYFCFCYG
130 140 150 160 170 180
KALGTTVMVP VPYEKMLRDQ SAVVVQGLPE GVAFQHPENY DLATLKWILE NKAGISFIIN
190 200 210 220 230 240
RPFLGPESQL GGPGMVTDAE RSIVSPSESC GPINVKTEPM EDSGISLKAE AVSVKKESED
250 260 270 280 290 300
PNYYQYNMQG SHPSSTSNEV IEMELPMEDS TPLVPSEEPN EDPEAEVKIE GNTNSSSVTN
310 320 330 340 350 360
SAAGVEDLNI VQVTVPDNEK ERLSSIEKIK QLREQVNDLF SRKFGEAIGV DFPVKVPYRK
370 380 390 400 410 420
ITFNPGCVVI DGMPPGVVFK APGYLEISSM RRILEAAEFI KFTVIRPLPG LELSNVGKRK
430 440 450 460 470 480
IDQEGRVFQE KWERAYFFVE VQNIPTCLIC KQSMSVSKEY NLRRHYQTNH SKHYDQYMER
490 500 510 520 530 540
MRDEKLHELK KGLRKYLLGS SDTECPEQKQ VFANPSPTQK SPVQPVEDLA GNLWEKLREK
550 560 570 580 590 600
IRSFVAYSIA IDEITDINNT TQLAIFIRGV DENFDVSEEL LDTVPMTGTK SGNEIFSRVE
610 620 630 640 650 660
KSLKNFCIDW SKLVSVASTG TPAMVDANNG LVTKLKSRVA TFCKGAELKS ICCIIHPESL
670 680 690 700 710 720
CAQKLKMDHV MDVVVKSVNW ICSRGLNHSE FTTLLYELDS QYGSLLYYTE IKWLSRGLVL
730 740 750 760 770 780
KRFFESLEEI DSFMSSRGKP LPQLSSIDWI RDLAFLVDMT MHLNALNISL QGHSQIVTQM
790 800 810 820 830 840
YDLIRAFLAK LCLWETHLTR NNLAHFPTLK LASRNESDGL NYIPKIAELQ TEFQKRLSDF
850 860 870 880 890 900
KLYESELTLF SSPFSTKIDS VHEELQMEVI DLQCNTVLKT KYDKVGIPEF YKYLWGSYPK
910 920 930 940
YKHHCAKILS MFGSTYICEQ LFSIMKLSKT KYCSQLKDSQ WDSVLHIAT
10 20 30 40 50 60
MAQVAVSTLP VEEESSSETR MVVTFLVSAL ESMCKELAKS KAEVACIAVY ETDVFVVGTE
70 80 90 100 110 120
RGCAFVNART DFQKDFAKYC VAEGLCEVKP PCPVNGMQVH SGETEILRKA VEDYFCFCYG
130 140 150 160 170 180
KALGTTVMVP VPYEKMLRDQ SAVVVQGLPE GVAFQHPENY DLATLKWILE NKAGISFIIN
190 200 210 220 230 240
RPFLGPESQL GGPGMVTDAE RSIVSPSESC GPINVKTEPM EDSGISLKAE AVSVKKESED
250 260 270 280 290 300
PNYYQYNMQG SHPSSTSNEV IEMELPMEDS TPLVPSEEPN EDPEAEVKIE GNTNSSSVTN
310 320 330 340 350 360
SAAGVEDLNI VQVTVPDNEK ERLSSIEKIK QLREQVNDLF SRKFGEAIGV DFPVKVPYRK
370 380 390 400 410 420
ITFNPGCVVI DGMPPGVVFK APGYLEISSM RRILEAAEFI KFTVIRPLPG LELSNVGKRK
430 440 450 460 470 480
IDQEGRVFQE KWERAYFFVE VQNIPTCLIC KQSMSVSKEY NLRRHYQTNH SKHYDQYMER
490 500 510 520 530 540
MRDEKLHELK KGLRKYLLGS SDTECPEQKQ VFANPSPTQK SPVQPVEDLA GNLWEKLREK
550 560 570 580 590 600
IRSFVAYSIA IDEITDINNT TQLAIFIRGV DENFDVSEEL LDTVPMTGTK SGNEIFSRVE
610 620 630 640 650 660
KSLKNFCIDW SKLVSVASTG TPAMVDANNG LVTKLKSRVA TFCKGAELKS ICCIIHPESL
670 680 690 700 710 720
CAQKLKMDHV MDVVVKSVNW ICSRGLNHSE FTTLLYELDS QYGSLLYYTE IKWLSRGLVL
730 740 750 760 770 780
KRFFESLEEI DSFMSSRGKP LPQLSSIDWI RDLAFLVDMT MHLNALNISL QGHSQIVTQM
790 800 810 820 830 840
YDLIRAFLAK LCLWETHLTR NNLAHFPTLK LASRNESDGL NYIPKIAELQ TEFQKRLSDF
850 860 870 880 890 900
KLYESELTLF SSPFSTKIDS VHEELQMEVI DLQCNTVLKT KYDKVGIPEF YKYLWGSYPK
910 920 930 940
YKHHCAKILS MFGSTYICEQ LFSIMKLSKT KYCSQLKDSQ WDSVLHIAT
10 20 30 40 50 60
MAQVAVSTLP VEEESSSETR MVVTFLVSAL ESMCKELAKS KAEVACIAVY ETDVFVVGTE
70 80 90 100 110 120
RGCAFVNART DFQKDFAKYC VAEGLCEVKP PCPVNGMQVH SGETEILRKA VEDYFCFCYG
130 140 150 160 170 180
KALGTTVMVP VPYEKMLRDQ SAVVVQGLPE GVAFQHPENY DLATLKWILE NKAGISFIIN
190 200 210 220 230 240
RPFLGPESQL GGPGMVTDAE RSIVSPSESC GPINVKTEPM EDSGISLKAE AVSVKKESED
250 260 270 280 290 300
PNYYQYNMQG SHPSSTSNEV IEMELPMEDS TPLVPSEEPN EDPEAEVKIE GNTNSSSVTN
310 320 330 340 350 360
SAAGVEDLNI VQVTVPDNEK ERLSSIEKIK QLREQVNDLF SRKFGEAIGV DFPVKVPYRK
370 380 390 400 410 420
ITFNPGCVVI DGMPPGVVFK APGYLEISSM RRILEAAEFI KFTVIRPLPG LELSNVGKRK
430 440 450 460 470 480
IDQEGRVFQE KWERAYFFVE VQNIPTCLIC KQSMSVSKEY NLRRHYQTNH SKHYDQYMER
490 500 510 520 530 540
MRDEKLHELK KGLRKYLLGS SDTECPEQKQ VFANPSPTQK SPVQPVEDLA GNLWEKLREK
550 560 570 580 590 600
IRSFVAYSIA IDEITDINNT TQLAIFIRGV DENFDVSEEL LDTVPMTGTK SGNEIFSRVE
610 620 630 640 650 660
KSLKNFCIDW SKLVSVASTG TPAMVDANNG LVTKLKSRVA TFCKGAELKS ICCIIHPESL
670 680 690 700 710 720
CAQKLKMDHV MDVVVKSVNW ICSRGLNHSE FTTLLYELDS QYGSLLYYTE IKWLSRGLVL
730 740 750 760 770 780
KRFFESLEEI DSFMSSRGKP LPQLSSIDWI RDLAFLVDMT MHLNALNISL QGHSQIVTQM
790 800 810 820 830 840
YDLIRAFLAK LCLWETHLTR NNLAHFPTLK LASRNESDGL NYIPKIAELQ TEFQKRLSDF
850 860 870 880 890 900
KLYESELTLF SSPFSTKIDS VHEELQMEVI DLQCNTVLKT KYDKVGIPEF YKYLWGSYPK
910 920 930 940
YKHHCAKILS MFGSTYICEQ LFSIMKLSKT KYCSQLKDSQ WDSVLHIAT
10 20 30 40 50 60
MAQVAVSTLP VEEESSSETR MVVTFLVSAL ESMCKELAKS KAEVACIAVY ETDVFVVGTE
70 80 90 100 110 120
RGCAFVNART DFQKDFAKYC VAEGLCEVKP PCPVNGMQVH SGETEILRKA VEDYFCFCYG
130 140 150 160 170 180
KALGTTVMVP VPYEKMLRDQ SAVVVQGLPE GVAFQHPENY DLATLKWILE NKAGISFIIN
190 200 210 220 230 240
RPFLGPESQL GGPGMVTDAE RSIVSPSESC GPINVKTEPM EDSGISLKAE AVSVKKESED
250 260 270 280 290 300
PNYYQYNMQG SHPSSTSNEV IEMELPMEDS TPLVPSEEPN EDPEAEVKIE GNTNSSSVTN
310 320 330 340 350 360
SAAGVEDLNI VQVTVPDNEK ERLSSIEKIK QLREQVNDLF SRKFGEAIGV DFPVKVPYRK
370 380 390 400 410 420
ITFNPGCVVI DGMPPGVVFK APGYLEISSM RRILEAAEFI KFTVIRPLPG LELSNVGKRK
430 440 450 460 470 480
IDQEGRVFQE KWERAYFFVE VQNIPTCLIC KQSMSVSKEY NLRRHYQTNH SKHYDQYMER
490 500 510 520 530 540
MRDEKLHELK KGLRKYLLGS SDTECPEQKQ VFANPSPTQK SPVQPVEDLA GNLWEKLREK
550 560 570 580 590 600
IRSFVAYSIA IDEITDINNT TQLAIFIRGV DENFDVSEEL LDTVPMTGTK SGNEIFSRVE
610 620 630 640 650 660
KSLKNFCIDW SKLVSVASTG TPAMVDANNG LVTKLKSRVA TFCKGAELKS ICCIIHPESL
670 680 690 700 710 720
CAQKLKMDHV MDVVVKSVNW ICSRGLNHSE FTTLLYELDS QYGSLLYYTE IKWLSRGLVL
730 740 750 760 770 780
KRFFESLEEI DSFMSSRGKP LPQLSSIDWI RDLAFLVDMT MHLNALNISL QGHSQIVTQM
790 800 810 820 830 840
YDLIRAFLAK LCLWETHLTR NNLAHFPTLK LASRNESDGL NYIPKIAELQ TEFQKRLSDF
850 860 870 880 890 900
KLYESELTLF SSPFSTKIDS VHEELQMEVI DLQCNTVLKT KYDKVGIPEF YKYLWGSYPK
910 920 930 940
YKHHCAKILS MFGSTYICEQ LFSIMKLSKT KYCSQLKDSQ WDSVLHIAT
Protein Neighborhood
Domains & Features
2 N-termini - 2 C-termini - 0 Cleavages - 0 Substrates
N-termini
Name | Sequence | Position | Modification | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMID) |
---|---|---|---|---|---|---|---|---|---|
Q86UP8-1-unknown | MAQVAV... | 1 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot | |||
Q86UP8-1-unknown | MAQVAV... | 1 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt76618 | |||
Q86UP8-1-unknown | MAQVAV... | 1 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt76619 | |||
Q86UP8-1-unknown | MAQVAV... | 1 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt76620 | |||
Q86UP8-1-unknown | MAQVAV... | 1 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt76621 | |||
Q86UP8-454-unknown | MSVSKE... | 454 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt76622 |
C-termini
Name | Sequence | Position | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|
...FIINRP | 181 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt72236 | |||
...FIINRP | 181 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt91840 | |||
...FIINRP | 181 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt91841 | |||
...LHIAT | 949 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot | |||
...LHIAT | 949 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt72240 |
Cleavages
Protease | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|
Substrates
Substrate | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|