O75417: DNA polymerase theta {ECO:0000303|PubMed:10395804, ECO:0000303|PubMed:14576298, ECO:0000312|HGNC:HGNC:9186}
Protein names | DNA polymerase theta {ECO:0000303|PubMed:10395804, ECO:0000303|PubMed:14576298, ECO:0000312|HGNC:HGNC:9186}; EC 2.7.7.7 {ECO:0000269|PubMed:14576298, ECO:0000269|PubMed:22135286}; DNA polymerase eta {ECO:0000303|Ref.4} |
---|---|
Gene names | POLQ |
Organism | Homo sapiens |
Protease Family | |
Protease ID | |
Chromosome location | |
UniProt ID | O75417 |
1 N-termini - 1 C-termini - 0 Cleavages - 0 Substrates
Sequence
10 20 30 40 50 60
MNLLRRSGKR RRSESGSDSF SGSGGDSSAS PQFLSGSVLS PPPGLGRCLK AAAAGECKPT
70 80 90 100 110 120
VPDYERDKLL LANWGLPKAV LEKYHSFGVK KMFEWQAECL LLGQVLEGKN LVYSAPTSAG
130 140 150 160 170 180
KTLVAELLIL KRVLEMRKKA LFILPFVSVA KEKKYYLQSL FQEVGIKVDG YMGSTSPSRH
190 200 210 220 230 240
FSSLDIAVCT IERANGLINR LIEENKMDLL GMVVVDELHM LGDSHRGYLL ELLLTKICYI
250 260 270 280 290 300
TRKSASCQAD LASSLSNAVQ IVGMSATLPN LELVASWLNA ELYHTDFRPV PLLESVKVGN
310 320 330 340 350 360
SIYDSSMKLV REFEPMLQVK GDEDHVVSLC YETICDNHSV LLFCPSKKWC EKLADIIARE
370 380 390 400 410 420
FYNLHHQAEG LVKPSECPPV ILEQKELLEV MDQLRRLPSG LDSVLQKTVP WGVAFHHAGL
430 440 450 460 470 480
TFEERDIIEG AFRQGLIRVL AATSTLSSGV NLPARRVIIR TPIFGGRPLD ILTYKQMVGR
490 500 510 520 530 540
AGRKGVDTVG ESILICKNSE KSKGIALLQG SLKPVRSCLQ RREGEEVTGS MIRAILEIIV
550 560 570 580 590 600
GGVASTSQDM HTYAACTFLA ASMKEGKQGI QRNQESVQLG AIEACVMWLL ENEFIQSTEA
610 620 630 640 650 660
SDGTEGKVYH PTHLGSATLS SSLSPADTLD IFADLQRAMK GFVLENDLHI LYLVTPMFED
670 680 690 700 710 720
WTTIDWYRFF CLWEKLPTSM KRVAELVGVE EGFLARCVKG KVVARTERQH RQMAIHKRFF
730 740 750 760 770 780
TSLVLLDLIS EVPLREINQK YGCNRGQIQS LQQSAAVYAG MITVFSNRLG WHNMELLLSQ
790 800 810 820 830 840
FQKRLTFGIQ RELCDLVRVS LLNAQRARVL YASGFHTVAD LARANIVEVE VILKNAVPFK
850 860 870 880 890 900
SARKAVDEEE EAVEERRNMR TIWVTGRKGL TEREAAALIV EEARMILQQD LVEMGVQWNP
910 920 930 940 950 960
CALLHSSTCS LTHSESEVKE HTFISQTKSS YKKLTSKNKS NTIFSDSYIK HSPNIVQDLN
970 980 990 1000 1010 1020
KSREHTSSFN CNFQNGNQEH QTCSIFRARK RASLDINKEK PGASQNEGKT SDKKVVQTFS
1030 1040 1050 1060 1070 1080
QKTKKAPLNF NSEKMSRSFR SWKRRKHLKR SRDSSPLKDS GACRIHLQGQ TLSNPSLCED
1090 1100 1110 1120 1130 1140
PFTLDEKKTE FRNSGPFAKN VSLSGKEKDN KTSFPLQIKQ NCSWNITLTN DNFVEHIVTG
1150 1160 1170 1180 1190 1200
SQSKNVTCQA TSVVSEKGRG VAVEAEKINE VLIQNGSKNQ NVYMKHHDIH PINQYLRKQS
1210 1220 1230 1240 1250 1260
HEQTSTITKQ KNIIERQMPC EAVSSYINRD SNVTINCERI KLNTEENKPS HFQALGDDIS
1270 1280 1290 1300 1310 1320
RTVIPSEVLP SAGAFSKSEG QHENFLNISR LQEKTGTYTT NKTKNNHVSD LGLVLCDFED
1330 1340 1350 1360 1370 1380
SFYLDTQSEK IIQQMATENA KLGAKDTNLA AGIMQKSLVQ QNSMNSFQKE CHIPFPAEQH
1390 1400 1410 1420 1430 1440
PLGATKIDHL DLKTVGTMKQ SSDSHGVDIL TPESPIFHSP ILLEENGLFL KKNEVSVTDS
1450 1460 1470 1480 1490 1500
QLNSFLQGYQ TQETVKPVIL LIPQKRTPTG VEGECLPVPE TSLNMSDSLL FDSFSDDYLV
1510 1520 1530 1540 1550 1560
KEQLPDMQMK EPLPSEVTSN HFSDSLCLQE DLIKKSNVNE NQDTHQQLTC SNDESIIFSE
1570 1580 1590 1600 1610 1620
MDSVQMVEAL DNVDIFPVQE KNHTVVSPRA LELSDPVLDE HHQGDQDGGD QDERAEKSKL
1630 1640 1650 1660 1670 1680
TGTRQNHSFI WSGASFDLSP GLQRILDKVS SPLENEKLKS MTINFSSLNR KNTELNEEQE
1690 1700 1710 1720 1730 1740
VISNLETKQV QGISFSSNNE VKSKIEMLEN NANHDETSSL LPRKESNIVD DNGLIPPTPI
1750 1760 1770 1780 1790 1800
PTSASKLTFP GILETPVNPW KTNNVLQPGE SYLFGSPSDI KNHDLSPGSR NGFKDNSPIS
1810 1820 1830 1840 1850 1860
DTSFSLQLSQ DGLQLTPASS SSESLSIIDV ASDQNLFQTF IKEWRCKKRF SISLACEKIR
1870 1880 1890 1900 1910 1920
SLTSSKTATI GSRFKQASSP QEIPIRDDGF PIKGCDDTLV VGLAVCWGGR DAYYFSLQKE
1930 1940 1950 1960 1970 1980
QKHSEISASL VPPSLDPSLT LKDRMWYLQS CLRKESDKEC SVVIYDFIQS YKILLLSCGI
1990 2000 2010 2020 2030 2040
SLEQSYEDPK VACWLLDPDS QEPTLHSIVT SFLPHELPLL EGMETSQGIQ SLGLNAGSEH
2050 2060 2070 2080 2090 2100
SGRYRASVES ILIFNSMNQL NSLLQKENLQ DVFRKVEMPS QYCLALLELN GIGFSTAECE
2110 2120 2130 2140 2150 2160
SQKHIMQAKL DAIETQAYQL AGHSFSFTSS DDIAEVLFLE LKLPPNREMK NQGSKKTLGS
2170 2180 2190 2200 2210 2220
TRRGIDNGRK LRLGRQFSTS KDVLNKLKAL HPLPGLILEW RRITNAITKV VFPLQREKCL
2230 2240 2250 2260 2270 2280
NPFLGMERIY PVSQSHTATG RITFTEPNIQ NVPRDFEIKM PTLVGESPPS QAVGKGLLPM
2290 2300 2310 2320 2330 2340
GRGKYKKGFS VNPRCQAQME ERAADRGMPF SISMRHAFVP FPGGSILAAD YSQLELRILA
2350 2360 2370 2380 2390 2400
HLSHDRRLIQ VLNTGADVFR SIAAEWKMIE PESVGDDLRQ QAKQICYGII YGMGAKSLGE
2410 2420 2430 2440 2450 2460
QMGIKENDAA CYIDSFKSRY TGINQFMTET VKNCKRDGFV QTILGRRRYL PGIKDNNPYR
2470 2480 2490 2500 2510 2520
KAHAERQAIN TIVQGSAADI VKIATVNIQK QLETFHSTFK SHGHREGMLQ SDQTGLSRKR
2530 2540 2550 2560 2570 2580
KLQGMFCPIR GGFFILQLHD ELLYEVAEED VVQVAQIVKN EMESAVKLSV KLKVKVKIGA
2590
SWGELKDFDV
Isoforms
- Isoform 2 of DNA polymerase theta
10 20 30 40 50 60
MNLLRRSGKR RRSESGSDSF SGSGGDSSAS PQFLSGSVLS PPPGLGRCLK AAAAGECKPT
70 80 90 100 110 120
VPDYERDKLL LANWGLPKAV LEKYHSFGVK KMFEWQAECL LLGQVLEGKN LVYSAPTSAG
130 140 150 160 170 180
KTLVAELLIL KRVLEMRKKA LFILPFVSVA KEKKYYLQSL FQEVGIKVDG YMGSTSPSRH
190 200 210 220 230 240
FSSLDIAVCT IERANGLINR LIEENKMDLL GMVVVDELHM LGDSHRGYLL ELLLTKICYI
250 260 270 280 290 300
TRKSASCQAD LASSLSNAVQ IVGMSATLPN LELVASWLNA ELYHTDFRPV PLLESVKVGN
310 320 330 340 350 360
SIYDSSMKLV REFEPMLQVK GDEDHVVSLC YETICDNHSV LLFCPSKKWC EKLADIIARE
370 380 390 400 410 420
FYNLHHQAEG LVKPSECPPV ILEQKELLEV MDQLRRLPSG LDSVLQKTVP WGVAFHHAGL
430 440 450 460 470 480
TFEERDIIEG AFRQGLIRVL AATSTLSSGV NLPARRVIIR TPIFGGRPLD ILTYKQMVGR
490 500 510 520 530 540
AGRKGVDTVG ESILICKNSE KSKGIALLQG SLKPVRSCLQ RREGEEVTGS MIRAILEIIV
550 560 570 580 590 600
GGVASTSQDM HTYAACTFLA ASMKEGKQGI QRNQESVQLG AIEACVMWLL ENEFIQSTEA
610 620 630 640 650 660
SDGTEGKVYH PTHLGSATLS SSLSPADTLD IFADLQRAMK GFVLENDLHI LYLVTPMFED
670 680 690 700 710 720
WTTIDWYRFF CLWEKLPTSM KRVAELVGVE EGFLARCVKG KVVARTERQH RQMAIHKRFF
730 740 750 760 770 780
TSLVLLDLIS EVPLREINQK YGCNRGQIQS LQQSAAVYAG MITVFSNRLG WHNMELLLSQ
790 800 810 820 830 840
FQKRLTFGIQ RELCDLVRVS LLNAQRARVL YASGFHTVAD LARANIVEVE VILKNAVPFK
850 860 870 880 890 900
SARKAVDEEE EAVEERRNMR TIWVTGRKGL TEREAAALIV EEARMILQQD LVEMGVQWNP
910 920 930 940 950 960
CALLHSSTCS LTHSESEVKE HTFISQTKSS YKKLTSKNKS NTIFSDSYIK HSPNIVQDLN
970 980 990 1000 1010 1020
KSREHTSSFN CNFQNGNQEH QTCSIFRARK RASLDINKEK PGASQNEGKT SDKKVVQTFS
1030 1040 1050 1060 1070 1080
QKTKKAPLNF NSEKMSRSFR SWKRRKHLKR SRDSSPLKDS GACRIHLQGQ TLSNPSLCED
1090 1100 1110 1120 1130 1140
PFTLDEKKTE FRNSGPFAKN VSLSGKEKDN KTSFPLQIKQ NCSWNITLTN DNFVEHIVTG
1150 1160 1170 1180 1190 1200
SQSKNVTCQA TSVVSEKGRG VAVEAEKINE VLIQNGSKNQ NVYMKHHDIH PINQYLRKQS
1210 1220 1230 1240 1250 1260
HEQTSTITKQ KNIIERQMPC EAVSSYINRD SNVTINCERI KLNTEENKPS HFQALGDDIS
1270 1280 1290 1300 1310 1320
RTVIPSEVLP SAGAFSKSEG QHENFLNISR LQEKTGTYTT NKTKNNHVSD LGLVLCDFED
1330 1340 1350 1360 1370 1380
SFYLDTQSEK IIQQMATENA KLGAKDTNLA AGIMQKSLVQ QNSMNSFQKE CHIPFPAEQH
1390 1400 1410 1420 1430 1440
PLGATKIDHL DLKTVGTMKQ SSDSHGVDIL TPESPIFHSP ILLEENGLFL KKNEVSVTDS
1450 1460 1470 1480 1490 1500
QLNSFLQGYQ TQETVKPVIL LIPQKRTPTG VEGECLPVPE TSLNMSDSLL FDSFSDDYLV
1510 1520 1530 1540 1550 1560
KEQLPDMQMK EPLPSEVTSN HFSDSLCLQE DLIKKSNVNE NQDTHQQLTC SNDESIIFSE
1570 1580 1590 1600 1610 1620
MDSVQMVEAL DNVDIFPVQE KNHTVVSPRA LELSDPVLDE HHQGDQDGGD QDERAEKSKL
1630 1640 1650 1660 1670 1680
TGTRQNHSFI WSGASFDLSP GLQRILDKVS SPLENEKLKS MTINFSSLNR KNTELNEEQE
1690 1700 1710 1720 1730 1740
VISNLETKQV QGISFSSNNE VKSKIEMLEN NANHDETSSL LPRKESNIVD DNGLIPPTPI
1750 1760 1770 1780 1790 1800
PTSASKLTFP GILETPVNPW KTNNVLQPGE SYLFGSPSDI KNHDLSPGSR NGFKDNSPIS
1810 1820 1830 1840 1850 1860
DTSFSLQLSQ DGLQLTPASS SSESLSIIDV ASDQNLFQTF IKEWRCKKRF SISLACEKIR
1870 1880 1890 1900 1910 1920
SLTSSKTATI GSRFKQASSP QEIPIRDDGF PIKGCDDTLV VGLAVCWGGR DAYYFSLQKE
1930 1940 1950 1960 1970 1980
QKHSEISASL VPPSLDPSLT LKDRMWYLQS CLRKESDKEC SVVIYDFIQS YKILLLSCGI
1990 2000 2010 2020 2030 2040
SLEQSYEDPK VACWLLDPDS QEPTLHSIVT SFLPHELPLL EGMETSQGIQ SLGLNAGSEH
2050 2060 2070 2080 2090 2100
SGRYRASVES ILIFNSMNQL NSLLQKENLQ DVFRKVEMPS QYCLALLELN GIGFSTAECE
2110 2120 2130 2140 2150 2160
SQKHIMQAKL DAIETQAYQL AGHSFSFTSS DDIAEVLFLE LKLPPNREMK NQGSKKTLGS
2170 2180 2190 2200 2210 2220
TRRGIDNGRK LRLGRQFSTS KDVLNKLKAL HPLPGLILEW RRITNAITKV VFPLQREKCL
2230 2240 2250 2260 2270 2280
NPFLGMERIY PVSQSHTATG RITFTEPNIQ NVPRDFEIKM PTLVGESPPS QAVGKGLLPM
2290 2300 2310 2320 2330 2340
GRGKYKKGFS VNPRCQAQME ERAADRGMPF SISMRHAFVP FPGGSILAAD YSQLELRILA
2350 2360 2370 2380 2390 2400
HLSHDRRLIQ VLNTGADVFR SIAAEWKMIE PESVGDDLRQ QAKQICYGII YGMGAKSLGE
2410 2420 2430 2440 2450 2460
QMGIKENDAA CYIDSFKSRY TGINQFMTET VKNCKRDGFV QTILGRRRYL PGIKDNNPYR
2470 2480 2490 2500 2510 2520
KAHAERQAIN TIVQGSAADI VKIATVNIQK QLETFHSTFK SHGHREGMLQ SDQTGLSRKR
2530 2540 2550 2560 2570 2580
KLQGMFCPIR GGFFILQLHD ELLYEVAEED VVQVAQIVKN EMESAVKLSV KLKVKVKIGA
2590
SWGELKDFDV
N-termini
Name | Sequence | Position | Modification | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|---|
O75417-1-unknown | MNLLRR... | 1 | | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot | |
C-termini
| Name | Sequence | Position | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
|---|---|---|---|---|---|---|---|---|
| | ...KDFDV | 2590 | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot | |
| | ...KDFDV | 2590 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt68820 | |
Cleavages
Protease | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|
Substrates
Substrate | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|