P28715: DNA repair protein complementing XP-G cells
Protein names | - DNA repair protein complementing XP-G cells - 3.1.-.- - DNA excision repair protein ERCC-5 - Xeroderma pigmentosum group G-complementing protein |
---|---|
Gene names | ERCC5 |
Organism | Homo sapiens |
Protease Family | |
Protease ID | |
Chromosome location | |
UniProt ID | P28715 |
5
N-termini
2
C-termini
1
Cleavages
0
Substrates
Sequence
10 20 30 40 50 60
MGVQGLWKLL ECSGRQVSPE ALEGKILAVD ISIWLNQALK GVRDRHGNSI ENPHLLTLFH
70 80 90 100 110 120
RLCKLLFFRI RPIFVFDGDA PLLKKQTLVK RRQRKDLASS DSRKTTEKLL KTFLKRQAIK
130 140 150 160 170 180
TAFRSKRDEA LPSLTQVRRE NDLYVLPPLQ EEEKHSSEEE DEKEWQERMN QKQALQEEFF
190 200 210 220 230 240
HNPQAIDIES EDFSSLPPEV KHEILTDMKE FTKRRRTLFE AMPEESDDFS QYQLKGLLKK
250 260 270 280 290 300
NYLNQHIEHV QKEMNQQHSG HIRRQYEDEG GFLKEVESRR VVSEDTSHYI LIKGIQAKTV
310 320 330 340 350 360
AEVDSESLPS SSKMHGMSFD VKSSPCEKLK TEKEPDATPP SPRTLLAMQA ALLGSSSEEE
370 380 390 400 410 420
LESENRRQAR GRNAPAAVDE GSISPRTLSA IKRALDDDED VKVCAGDDVQ TGGPGAEEMR
430 440 450 460 470 480
INSSTENSDE GLKVRDGKGI PFTATLASSS VNSAEEHVAS TNEGREPTDS VPKEQMSLVH
490 500 510 520 530 540
VGTEAFPISD ESMIKDRKDR LPLESAVVRH SDAPGLPNGR ELTPASPTCT NSVSKNETHA
550 560 570 580 590 600
EVLEQQNELC PYESKFDSSL LSSDDETKCK PNSASEVIGP VSLQETSSIV SVPSEAVDNV
610 620 630 640 650 660
ENVVSFNAKE HENFLETIQE QQTTESAGQD LISIPKAVEP MEIDSEESES DGSFIEVQSV
670 680 690 700 710 720
ISDEELQAEF PETSKPPSEQ GEEELVGTRE GEAPAESESL LRDNSERDDV DGEPQEAEKD
730 740 750 760 770 780
AEDSLHEWQD INLEELETLE SNLLAQQNSL KAQKQQQERI AATVTGQMFL ESQELLRLFG
790 800 810 820 830 840
IPYIQAPMEA EAQCAILDLT DQTSGTITDD SDIWLFGARH VYRNFFNKNK FVEYYQYVDF
850 860 870 880 890 900
HNQLGLDRNK LINLAYLLGS DYTEGIPTVG CVTAMEILNE FPGHGLEPLL KFSEWWHEAQ
910 920 930 940 950 960
KNPKIRPNPH DTKVKKKLRT LQLTPGFPNP AVAEAYLKPV VDDSKGSFLW GKPDLDKIRE
970 980 990 1000 1010 1020
FCQRYFGWNR TKTDESLFPV LKQLDAQQTQ LRIDSFFRLA QQEKEDAKRI KSQRLNRAVT
1030 1040 1050 1060 1070 1080
CMLRKEKEAA ASEIEAVSVA MEKEFELLDK AKGKTQKRGI TNTLEESSSL KRKRLSDSKG
1090 1100 1110 1120 1130 1140
KNTCGGFLGE TCLSESSDGS SSEDAESSSL MNVQRRTAAK EPKTSASDSQ NSVKEAPVKN
1150 1160 1170 1180
GGATTSSSSD SDDDGGKEKM VLVTARSVFG KKRRKLRRAR GRKRKT
Isoforms
- Isoform 2 of DNA repair protein complementing XP-G cells - Isoform 3 of DNA repair protein complementing XP-G cellsSequence View
10 20 30 40 50 60
MGVQGLWKLL ECSGRQVSPE ALEGKILAVD ISIWLNQALK GVRDRHGNSI ENPHLLTLFH
70 80 90 100 110 120
RLCKLLFFRI RPIFVFDGDA PLLKKQTLVK RRQRKDLASS DSRKTTEKLL KTFLKRQAIK
130 140 150 160 170 180
TAFRSKRDEA LPSLTQVRRE NDLYVLPPLQ EEEKHSSEEE DEKEWQERMN QKQALQEEFF
190 200 210 220 230 240
HNPQAIDIES EDFSSLPPEV KHEILTDMKE FTKRRRTLFE AMPEESDDFS QYQLKGLLKK
250 260 270 280 290 300
NYLNQHIEHV QKEMNQQHSG HIRRQYEDEG GFLKEVESRR VVSEDTSHYI LIKGIQAKTV
310 320 330 340 350 360
AEVDSESLPS SSKMHGMSFD VKSSPCEKLK TEKEPDATPP SPRTLLAMQA ALLGSSSEEE
370 380 390 400 410 420
LESENRRQAR GRNAPAAVDE GSISPRTLSA IKRALDDDED VKVCAGDDVQ TGGPGAEEMR
430 440 450 460 470 480
INSSTENSDE GLKVRDGKGI PFTATLASSS VNSAEEHVAS TNEGREPTDS VPKEQMSLVH
490 500 510 520 530 540
VGTEAFPISD ESMIKDRKDR LPLESAVVRH SDAPGLPNGR ELTPASPTCT NSVSKNETHA
550 560 570 580 590 600
EVLEQQNELC PYESKFDSSL LSSDDETKCK PNSASEVIGP VSLQETSSIV SVPSEAVDNV
610 620 630 640 650 660
ENVVSFNAKE HENFLETIQE QQTTESAGQD LISIPKAVEP MEIDSEESES DGSFIEVQSV
670 680 690 700 710 720
ISDEELQAEF PETSKPPSEQ GEEELVGTRE GEAPAESESL LRDNSERDDV DGEPQEAEKD
730 740 750 760 770 780
AEDSLHEWQD INLEELETLE SNLLAQQNSL KAQKQQQERI AATVTGQMFL ESQELLRLFG
790 800 810 820 830 840
IPYIQAPMEA EAQCAILDLT DQTSGTITDD SDIWLFGARH VYRNFFNKNK FVEYYQYVDF
850 860 870 880 890 900
HNQLGLDRNK LINLAYLLGS DYTEGIPTVG CVTAMEILNE FPGHGLEPLL KFSEWWHEAQ
910 920 930 940 950 960
KNPKIRPNPH DTKVKKKLRT LQLTPGFPNP AVAEAYLKPV VDDSKGSFLW GKPDLDKIRE
970 980 990 1000 1010 1020
FCQRYFGWNR TKTDESLFPV LKQLDAQQTQ LRIDSFFRLA QQEKEDAKRI KSQRLNRAVT
1030 1040 1050 1060 1070 1080
CMLRKEKEAA ASEIEAVSVA MEKEFELLDK AKGKTQKRGI TNTLEESSSL KRKRLSDSKG
1090 1100 1110 1120 1130 1140
KNTCGGFLGE TCLSESSDGS SSEDAESSSL MNVQRRTAAK EPKTSASDSQ NSVKEAPVKN
1150 1160 1170 1180
GGATTSSSSD SDDDGGKEKM VLVTARSVFG KKRRKLRRAR GRKRKT
10 20 30 40 50 60
MGVQGLWKLL ECSGRQVSPE ALEGKILAVD ISIWLNQALK GVRDRHGNSI ENPHLLTLFH
70 80 90 100 110 120
RLCKLLFFRI RPIFVFDGDA PLLKKQTLVK RRQRKDLASS DSRKTTEKLL KTFLKRQAIK
130 140 150 160 170 180
TAFRSKRDEA LPSLTQVRRE NDLYVLPPLQ EEEKHSSEEE DEKEWQERMN QKQALQEEFF
190 200 210 220 230 240
HNPQAIDIES EDFSSLPPEV KHEILTDMKE FTKRRRTLFE AMPEESDDFS QYQLKGLLKK
250 260 270 280 290 300
NYLNQHIEHV QKEMNQQHSG HIRRQYEDEG GFLKEVESRR VVSEDTSHYI LIKGIQAKTV
310 320 330 340 350 360
AEVDSESLPS SSKMHGMSFD VKSSPCEKLK TEKEPDATPP SPRTLLAMQA ALLGSSSEEE
370 380 390 400 410 420
LESENRRQAR GRNAPAAVDE GSISPRTLSA IKRALDDDED VKVCAGDDVQ TGGPGAEEMR
430 440 450 460 470 480
INSSTENSDE GLKVRDGKGI PFTATLASSS VNSAEEHVAS TNEGREPTDS VPKEQMSLVH
490 500 510 520 530 540
VGTEAFPISD ESMIKDRKDR LPLESAVVRH SDAPGLPNGR ELTPASPTCT NSVSKNETHA
550 560 570 580 590 600
EVLEQQNELC PYESKFDSSL LSSDDETKCK PNSASEVIGP VSLQETSSIV SVPSEAVDNV
610 620 630 640 650 660
ENVVSFNAKE HENFLETIQE QQTTESAGQD LISIPKAVEP MEIDSEESES DGSFIEVQSV
670 680 690 700 710 720
ISDEELQAEF PETSKPPSEQ GEEELVGTRE GEAPAESESL LRDNSERDDV DGEPQEAEKD
730 740 750 760 770 780
AEDSLHEWQD INLEELETLE SNLLAQQNSL KAQKQQQERI AATVTGQMFL ESQELLRLFG
790 800 810 820 830 840
IPYIQAPMEA EAQCAILDLT DQTSGTITDD SDIWLFGARH VYRNFFNKNK FVEYYQYVDF
850 860 870 880 890 900
HNQLGLDRNK LINLAYLLGS DYTEGIPTVG CVTAMEILNE FPGHGLEPLL KFSEWWHEAQ
910 920 930 940 950 960
KNPKIRPNPH DTKVKKKLRT LQLTPGFPNP AVAEAYLKPV VDDSKGSFLW GKPDLDKIRE
970 980 990 1000 1010 1020
FCQRYFGWNR TKTDESLFPV LKQLDAQQTQ LRIDSFFRLA QQEKEDAKRI KSQRLNRAVT
1030 1040 1050 1060 1070 1080
CMLRKEKEAA ASEIEAVSVA MEKEFELLDK AKGKTQKRGI TNTLEESSSL KRKRLSDSKG
1090 1100 1110 1120 1130 1140
KNTCGGFLGE TCLSESSDGS SSEDAESSSL MNVQRRTAAK EPKTSASDSQ NSVKEAPVKN
1150 1160 1170 1180
GGATTSSSSD SDDDGGKEKM VLVTARSVFG KKRRKLRRAR GRKRKT
Protein Neighborhood
Domains & Features
5 N-termini - 2 C-termini - 1 Cleavages - 0 Substrates
N-termini
Name | Sequence | Position | Modification | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMID) |
---|---|---|---|---|---|---|---|---|---|
P28715-1-unknown | MGVQGL... | 1 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot | |||
P28715-1-unknown | MGVQGL... | 1 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt74111 | |||
P28715-690-unknown | EGEAPA... | 690 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC10059 | |||
P28715-768-unknown | MFLESQ... | 768 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt74110 | |||
P28715-768-unknown | MFLESQ... | 768 | inferred from electronic annotation | unknown | Ensembl | inferred from ensembl protein ENSP00000365121 | |||
P28715-1059- | GITNTL... | 1059 | Subtiligase Based Positive Selection | Wells | Jurkat_untreated | 23264352 | |||
P28715-1059-unknown | GITNTL... | 1059 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt169847 |
C-termini
Name | Sequence | Position | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|
...LVGTRE | 689 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC10059 | |||
...RKRKT | 1186 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot | |||
...RKRKT | 1186 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt69728 |
Cleavages
Protease | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|
GRAA_HUMAN | 689 | GTRE.|.EGEA | inferred from experiment | unknown | MEROPS | Gevaert K | Van Damme P et al.:The substrate specificity profi... (S01.135) | 20536382, |
Substrates
Substrate | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|