Q80UV9: Transcription initiation factor TFIID subunit 1
Protein names | - Transcription initiation factor TFIID subunit 1 - 2.3.1.48 {ECO:0000250|UniProtKB:P21675} - 2.7.11.1 - Cell cycle gene 1 protein - TBP-associated factor 250 kDa - p250 - Transcription initiation factor TFIID 250 kDa subunit - TAF(II)250 - TAFII-250 - TAFII250 |
---|---|
Gene names | Taf1 |
Organism | Mus musculus |
Protease Family | |
Protease ID | |
Chromosome location | |
UniProt ID | Q80UV9 |
2
N-termini
1
C-termini
0
Cleavages
0
Substrates
Sequence
10 20 30 40 50 60
MGPGWAGLLQ DKGGGSPSVV MSDTDSDEES AGGGPFSLTG FLFGNINGAG QLEGESVLDD
70 80 90 100 110 120
ECKKHLAGLG ALGLGSLITE LTANEELSGS DGALVNDEGW IRSREDAVDY SDINEVAEDE
130 140 150 160 170 180
SRRYQQTMGS LQPLCHTDYD EDDYDADCED IDCKLMPPPP PPPGPLKKEK DQDDITGVSE
190 200 210 220 230 240
DGEGIILPSI IAPSSLASEK VDFSSSSDSE SEMGPQDAAQ SESKDGQLTL PLAGIMQHDA
250 260 270 280 290 300
TKLLPSVTEL FPEFRPGKVL RFLRLFGPGK NVPSVWRSAR RKRKKKHREL IQEGQVQEEE
310 320 330 340 350 360
CSVELEVNQK SLWNYDYAPP PLPDQCLSDD EITMMAPVES KFSQSTGDTD KVMDTKPRVA
370 380 390 400 410 420
EWRYGPARLW YDMLGVPEDG SGFDYGFKMK KTEHESTIKC NIMKKLRKLE ENSGVDLLAD
430 440 450 460 470 480
ENFLMVTQLH WEDDIIWDGE DVKHKGTKPQ RASLAGWLPS SMTRNAMAYN VQQGFTATLD
490 500 510 520 530 540
DDKPWYSIFP IDNEDLVYGR WEDNIIWDAQ NMPRILEPPV LTLDPNDENL ILEIPDEKEE
550 560 570 580 590 600
ATSNSPSKEN KKESSLKKSR ILLGKTGVIK EEPQQNMSQP EVKDPWNLSN DEYYYPKQQG
610 620 630 640 650 660
LRGTFGGNII QHSIPAVELR QPFFPTHMGP IKLRQFHRPP LKKYSFGALS QPGPHSVQPL
670 680 690 700 710 720
LKHIKKKAKM REQERQASGG GEMFFMRTPQ DLTGKDGDLI LAEYSEENGP LMMQVGMATK
730 740 750 760 770 780
IKNYYKRKPG KDPGAPDCKY GETVYCHTSP FLGSLHPGQL LQAFENNLFR APIYLHKMPE
790 800 810 820 830 840
SDFLIIRTRQ GYFIRELVDI FVVGQQCPLF EVPGPNSKRA NTHIRDFLQV FIYRLFWKSK
850 860 870 880 890 900
DRPRRIRMED IKKAFPSHSE SSIRKRLKLC ADFKRTGMDS NWWVLKSDFR LPTEEEIRAM
910 920 930 940 950 960
VSPEQCCAYY SMIAAEQRLK DAGYGEKSFF APEEENEEDF QMKIDDEVRT APWNTTRAFI
970 980 990 1000 1010 1020
AAMKGKCLLE VTGVADPTGC GEGFSYVKIP NKPTQQKDDK EPQPVKKTVT GTDADLRRLS
1030 1040 1050 1060 1070 1080
LKNAKQLLRK FGVPEEEIKK LSRWEVIDVV RTMSTEQARS GEGPMSKFAR GSRFSVAEHQ
1090 1100 1110 1120 1130 1140
ERYKEECQRI FDLQNKVLSS TEVLSTDTDS SSAEDSDFEE MGKNIENMLQ NKKTSSQLSR
1150 1160 1170 1180 1190 1200
EREEQERKEL QRMLLAAGSA AAGNNHRDDD TASVTSLNSS ATGRCLKIYR TFRDEEGKEY
1210 1220 1230 1240 1250 1260
VRCETVRKAT VIDAYVRIRT TKDEEFIRKF ALFDEQHREE MRKERRRIQE QLRRLKRNQE
1270 1280 1290 1300 1310 1320
KEKLKGPPEK KPKKMKERPD LKLKCGACGA IGHMRTNKFC PLYYQTNAPP SNPVAMTEEQ
1330 1340 1350 1360 1370 1380
EEELEKTVIH NDNEELIKVE GTKIVLGKQL IESADEVRRK SLVLKFPKQQ LPPKKKRRVG
1390 1400 1410 1420 1430 1440
TTVHCDYLNR PHKSIHRRRT DPMVTLSSIL ESIINDMRDL PNTYPFHTPV NAKVVKDYYK
1450 1460 1470 1480 1490 1500
IITRPMDLQT LRENVRKRLY PSREEFREHL ELIVKNSATY NGPKHSLTQI SQSMLDLCDE
1510 1520 1530 1540 1550 1560
KLKEKEDKLA RLEKAINPLL DDDDQVAFSF ILDNIVTQKM MAVPDSWPFH HPVNKKFVPD
1570 1580 1590 1600 1610 1620
YYKVIVSPMD LETIRKNISK HKYQSRESFL DDVNLILANS VKYNGPESQY TKTAQEIVNV
1630 1640 1650 1660 1670 1680
CHQTLTEYDE HLTQLEKDIC TAKEAALEEA ELESLDPMTP GPYTPQPPDL YDNNTSLSVS
1690 1700 1710 1720 1730 1740
RDASVYQDES NLSVLDIPSA TSEKQLTQEG GDGDGDLADE EEGTVQQPQA SVLYEDLLMS
1750 1760 1770 1780 1790 1800
EGEDDEEDAG SDEEGDNPFF AIQLSESGSD SDVESGSLRP KQPRVLQENT RMGMENEESM
1810 1820 1830 1840 1850 1860
MSYEGDGGDA SRGLEDSNIS YGSYEEPDPK SNTQDTSFSS IGGYEVSEEE EDEEEQRSGP
1870 1880 1890
SVLSQVHLSE DEEDSEDFHS IAGDTDLDSD E
Isoforms
- Isoform 2 of Transcription initiation factor TFIID subunit 1 - Isoform 3 of Transcription initiation factor TFIID subunit 1Sequence View
10 20 30 40 50 60
MGPGWAGLLQ DKGGGSPSVV MSDTDSDEES AGGGPFSLTG FLFGNINGAG QLEGESVLDD
70 80 90 100 110 120
ECKKHLAGLG ALGLGSLITE LTANEELSGS DGALVNDEGW IRSREDAVDY SDINEVAEDE
130 140 150 160 170 180
SRRYQQTMGS LQPLCHTDYD EDDYDADCED IDCKLMPPPP PPPGPLKKEK DQDDITGVSE
190 200 210 220 230 240
DGEGIILPSI IAPSSLASEK VDFSSSSDSE SEMGPQDAAQ SESKDGQLTL PLAGIMQHDA
250 260 270 280 290 300
TKLLPSVTEL FPEFRPGKVL RFLRLFGPGK NVPSVWRSAR RKRKKKHREL IQEGQVQEEE
310 320 330 340 350 360
CSVELEVNQK SLWNYDYAPP PLPDQCLSDD EITMMAPVES KFSQSTGDTD KVMDTKPRVA
370 380 390 400 410 420
EWRYGPARLW YDMLGVPEDG SGFDYGFKMK KTEHESTIKC NIMKKLRKLE ENSGVDLLAD
430 440 450 460 470 480
ENFLMVTQLH WEDDIIWDGE DVKHKGTKPQ RASLAGWLPS SMTRNAMAYN VQQGFTATLD
490 500 510 520 530 540
DDKPWYSIFP IDNEDLVYGR WEDNIIWDAQ NMPRILEPPV LTLDPNDENL ILEIPDEKEE
550 560 570 580 590 600
ATSNSPSKEN KKESSLKKSR ILLGKTGVIK EEPQQNMSQP EVKDPWNLSN DEYYYPKQQG
610 620 630 640 650 660
LRGTFGGNII QHSIPAVELR QPFFPTHMGP IKLRQFHRPP LKKYSFGALS QPGPHSVQPL
670 680 690 700 710 720
LKHIKKKAKM REQERQASGG GEMFFMRTPQ DLTGKDGDLI LAEYSEENGP LMMQVGMATK
730 740 750 760 770 780
IKNYYKRKPG KDPGAPDCKY GETVYCHTSP FLGSLHPGQL LQAFENNLFR APIYLHKMPE
790 800 810 820 830 840
SDFLIIRTRQ GYFIRELVDI FVVGQQCPLF EVPGPNSKRA NTHIRDFLQV FIYRLFWKSK
850 860 870 880 890 900
DRPRRIRMED IKKAFPSHSE SSIRKRLKLC ADFKRTGMDS NWWVLKSDFR LPTEEEIRAM
910 920 930 940 950 960
VSPEQCCAYY SMIAAEQRLK DAGYGEKSFF APEEENEEDF QMKIDDEVRT APWNTTRAFI
970 980 990 1000 1010 1020
AAMKGKCLLE VTGVADPTGC GEGFSYVKIP NKPTQQKDDK EPQPVKKTVT GTDADLRRLS
1030 1040 1050 1060 1070 1080
LKNAKQLLRK FGVPEEEIKK LSRWEVIDVV RTMSTEQARS GEGPMSKFAR GSRFSVAEHQ
1090 1100 1110 1120 1130 1140
ERYKEECQRI FDLQNKVLSS TEVLSTDTDS SSAEDSDFEE MGKNIENMLQ NKKTSSQLSR
1150 1160 1170 1180 1190 1200
EREEQERKEL QRMLLAAGSA AAGNNHRDDD TASVTSLNSS ATGRCLKIYR TFRDEEGKEY
1210 1220 1230 1240 1250 1260
VRCETVRKAT VIDAYVRIRT TKDEEFIRKF ALFDEQHREE MRKERRRIQE QLRRLKRNQE
1270 1280 1290 1300 1310 1320
KEKLKGPPEK KPKKMKERPD LKLKCGACGA IGHMRTNKFC PLYYQTNAPP SNPVAMTEEQ
1330 1340 1350 1360 1370 1380
EEELEKTVIH NDNEELIKVE GTKIVLGKQL IESADEVRRK SLVLKFPKQQ LPPKKKRRVG
1390 1400 1410 1420 1430 1440
TTVHCDYLNR PHKSIHRRRT DPMVTLSSIL ESIINDMRDL PNTYPFHTPV NAKVVKDYYK
1450 1460 1470 1480 1490 1500
IITRPMDLQT LRENVRKRLY PSREEFREHL ELIVKNSATY NGPKHSLTQI SQSMLDLCDE
1510 1520 1530 1540 1550 1560
KLKEKEDKLA RLEKAINPLL DDDDQVAFSF ILDNIVTQKM MAVPDSWPFH HPVNKKFVPD
1570 1580 1590 1600 1610 1620
YYKVIVSPMD LETIRKNISK HKYQSRESFL DDVNLILANS VKYNGPESQY TKTAQEIVNV
1630 1640 1650 1660 1670 1680
CHQTLTEYDE HLTQLEKDIC TAKEAALEEA ELESLDPMTP GPYTPQPPDL YDNNTSLSVS
1690 1700 1710 1720 1730 1740
RDASVYQDES NLSVLDIPSA TSEKQLTQEG GDGDGDLADE EEGTVQQPQA SVLYEDLLMS
1750 1760 1770 1780 1790 1800
EGEDDEEDAG SDEEGDNPFF AIQLSESGSD SDVESGSLRP KQPRVLQENT RMGMENEESM
1810 1820 1830 1840 1850 1860
MSYEGDGGDA SRGLEDSNIS YGSYEEPDPK SNTQDTSFSS IGGYEVSEEE EDEEEQRSGP
1870 1880 1890
SVLSQVHLSE DEEDSEDFHS IAGDTDLDSD E
10 20 30 40 50 60
MGPGWAGLLQ DKGGGSPSVV MSDTDSDEES AGGGPFSLTG FLFGNINGAG QLEGESVLDD
70 80 90 100 110 120
ECKKHLAGLG ALGLGSLITE LTANEELSGS DGALVNDEGW IRSREDAVDY SDINEVAEDE
130 140 150 160 170 180
SRRYQQTMGS LQPLCHTDYD EDDYDADCED IDCKLMPPPP PPPGPLKKEK DQDDITGVSE
190 200 210 220 230 240
DGEGIILPSI IAPSSLASEK VDFSSSSDSE SEMGPQDAAQ SESKDGQLTL PLAGIMQHDA
250 260 270 280 290 300
TKLLPSVTEL FPEFRPGKVL RFLRLFGPGK NVPSVWRSAR RKRKKKHREL IQEGQVQEEE
310 320 330 340 350 360
CSVELEVNQK SLWNYDYAPP PLPDQCLSDD EITMMAPVES KFSQSTGDTD KVMDTKPRVA
370 380 390 400 410 420
EWRYGPARLW YDMLGVPEDG SGFDYGFKMK KTEHESTIKC NIMKKLRKLE ENSGVDLLAD
430 440 450 460 470 480
ENFLMVTQLH WEDDIIWDGE DVKHKGTKPQ RASLAGWLPS SMTRNAMAYN VQQGFTATLD
490 500 510 520 530 540
DDKPWYSIFP IDNEDLVYGR WEDNIIWDAQ NMPRILEPPV LTLDPNDENL ILEIPDEKEE
550 560 570 580 590 600
ATSNSPSKEN KKESSLKKSR ILLGKTGVIK EEPQQNMSQP EVKDPWNLSN DEYYYPKQQG
610 620 630 640 650 660
LRGTFGGNII QHSIPAVELR QPFFPTHMGP IKLRQFHRPP LKKYSFGALS QPGPHSVQPL
670 680 690 700 710 720
LKHIKKKAKM REQERQASGG GEMFFMRTPQ DLTGKDGDLI LAEYSEENGP LMMQVGMATK
730 740 750 760 770 780
IKNYYKRKPG KDPGAPDCKY GETVYCHTSP FLGSLHPGQL LQAFENNLFR APIYLHKMPE
790 800 810 820 830 840
SDFLIIRTRQ GYFIRELVDI FVVGQQCPLF EVPGPNSKRA NTHIRDFLQV FIYRLFWKSK
850 860 870 880 890 900
DRPRRIRMED IKKAFPSHSE SSIRKRLKLC ADFKRTGMDS NWWVLKSDFR LPTEEEIRAM
910 920 930 940 950 960
VSPEQCCAYY SMIAAEQRLK DAGYGEKSFF APEEENEEDF QMKIDDEVRT APWNTTRAFI
970 980 990 1000 1010 1020
AAMKGKCLLE VTGVADPTGC GEGFSYVKIP NKPTQQKDDK EPQPVKKTVT GTDADLRRLS
1030 1040 1050 1060 1070 1080
LKNAKQLLRK FGVPEEEIKK LSRWEVIDVV RTMSTEQARS GEGPMSKFAR GSRFSVAEHQ
1090 1100 1110 1120 1130 1140
ERYKEECQRI FDLQNKVLSS TEVLSTDTDS SSAEDSDFEE MGKNIENMLQ NKKTSSQLSR
1150 1160 1170 1180 1190 1200
EREEQERKEL QRMLLAAGSA AAGNNHRDDD TASVTSLNSS ATGRCLKIYR TFRDEEGKEY
1210 1220 1230 1240 1250 1260
VRCETVRKAT VIDAYVRIRT TKDEEFIRKF ALFDEQHREE MRKERRRIQE QLRRLKRNQE
1270 1280 1290 1300 1310 1320
KEKLKGPPEK KPKKMKERPD LKLKCGACGA IGHMRTNKFC PLYYQTNAPP SNPVAMTEEQ
1330 1340 1350 1360 1370 1380
EEELEKTVIH NDNEELIKVE GTKIVLGKQL IESADEVRRK SLVLKFPKQQ LPPKKKRRVG
1390 1400 1410 1420 1430 1440
TTVHCDYLNR PHKSIHRRRT DPMVTLSSIL ESIINDMRDL PNTYPFHTPV NAKVVKDYYK
1450 1460 1470 1480 1490 1500
IITRPMDLQT LRENVRKRLY PSREEFREHL ELIVKNSATY NGPKHSLTQI SQSMLDLCDE
1510 1520 1530 1540 1550 1560
KLKEKEDKLA RLEKAINPLL DDDDQVAFSF ILDNIVTQKM MAVPDSWPFH HPVNKKFVPD
1570 1580 1590 1600 1610 1620
YYKVIVSPMD LETIRKNISK HKYQSRESFL DDVNLILANS VKYNGPESQY TKTAQEIVNV
1630 1640 1650 1660 1670 1680
CHQTLTEYDE HLTQLEKDIC TAKEAALEEA ELESLDPMTP GPYTPQPPDL YDNNTSLSVS
1690 1700 1710 1720 1730 1740
RDASVYQDES NLSVLDIPSA TSEKQLTQEG GDGDGDLADE EEGTVQQPQA SVLYEDLLMS
1750 1760 1770 1780 1790 1800
EGEDDEEDAG SDEEGDNPFF AIQLSESGSD SDVESGSLRP KQPRVLQENT RMGMENEESM
1810 1820 1830 1840 1850 1860
MSYEGDGGDA SRGLEDSNIS YGSYEEPDPK SNTQDTSFSS IGGYEVSEEE EDEEEQRSGP
1870 1880 1890
SVLSQVHLSE DEEDSEDFHS IAGDTDLDSD E
Protein Neighborhood
Domains & Features
2 N-termini - 1 C-termini - 0 Cleavages - 0 Substrates
N-termini
Name | Sequence | Position | Modification | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMID) |
---|---|---|---|---|---|---|---|---|---|
Q80UV9-1-unknown | MGPGWA... | 1 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot | |||
Q80UV9-1-unknown | MGPGWA... | 1 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt91464 | |||
Q80UV9-963-unknown | MKGKCL... | 963 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt91463 | |||
Q80UV9-963-unknown | MKGKCL... | 963 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt99895 |
C-termini
Name | Sequence | Position | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|
...LDSDE | 1891 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot | |||
...LDSDE | 1891 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt87081 | |||
...LDSDE | 1891 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt87082 |
Cleavages
Protease | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|
Substrates
Substrate | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|