Q01831: DNA repair protein complementing XP-C cells
Protein names | DNA repair protein complementing XP-C cells; Xeroderma pigmentosum group C-complementing protein; p125 |
---|---|
Gene names | XPC |
Organism | Homo sapiens |
Protease Family | |
Protease ID | |
Chromosome location | |
UniProt ID | Q01831 |
Sequence
10 20 30 40 50 60
MARKRAAGGE PRGRELRSQK SKAKSKARRE EEEEDAFEDE KPPKKSLLSK VSQGKRKRGC
70 80 90 100 110 120
SHPGGSADGP AKKKVAKVTV KSENLKVIKD EALSDGDDLR DFPSDLKKAH HLKRGATMNE
130 140 150 160 170 180
DSNEEEEESE NDWEEVEELS EPVLGDVRES TAFSRSLLPV KPVEIEIETP EQAKTRERSE
190 200 210 220 230 240
KIKLEFETYL RRAMKRFNKG VHEDTHKVHL LCLLANGFYR NNICSQPDLH AIGLSIIPAR
250 260 270 280 290 300
FTRVLPRDVD TYYLSNLVKW FIGTFTVNAE LSASEQDNLQ TTLERRFAIY SARDDEELVH
310 320 330 340 350 360
IFLLILRALQ LLTRLVLSLQ PIPLKSATAK GKKPSKERLT ADPGGSSETS SQVLENHTKP
370 380 390 400 410 420
KTSKGTKQEE TFAKGTCRPS AKGKRNKGGR KKRSKPSSSE EDEGPGDKQE KATQRRPHGR
430 440 450 460 470 480
ERRVASRVSY KEESGSDEAG SGSDFELSSG EASDPSDEDS EPGPPKQRKA PAPQRTKAGS
490 500 510 520 530 540
KSASRTHRGS HRKDPSLPAA SSSSSSSKRG KKMCSDGEKA EKRSIAGIDQ WLEVFCEQEE
550 560 570 580 590 600
KWVCVDCVHG VVGQPLTCYK YATKPMTYVV GIDSDGWVRD VTQRYDPVWM TVTRKCRVDA
610 620 630 640 650 660
EWWAETLRPY QSPFMDREKK EDLEFQAKHM DQPLPTAIGL YKNHPLYALK RHLLKYEAIY
670 680 690 700 710 720
PETAAILGYC RGEAVYSRDC VHTLHSRDTW LKKARVVRLG EVPYKMVKGF SNRARKARLA
730 740 750 760 770 780
EPQLREENDL GLFGYWQTEE YQPPVAVDGK VPRNEFGNVY LFLPSMMPIG CVQLNLPNLH
790 800 810 820 830 840
RVARKLDIDC VQAITGFDFH GGYSHPVTDG YIVCEEFKDV LLTAWENEQA VIERKEKEKK
850 860 870 880 890 900
EKRALGNWKL LAKGLLIRER LKRRYGPKSE AAAPHTDAGG GLSSDEEEGT SSQAEAARIL
910 920 930 940
AASWPQNRED EEKQKLKGGP KKTKREKKAA ASHLFPFEQL
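The sequence block interleaves numeric ruler lines with groups of ten residues, so positions quoted elsewhere in this record (for example 2 or 22) are 1-based indices into the joined string. The following is a minimal sketch, not part of the database itself, of how such a block might be parsed. The function names `parse_ruled_sequence`, `residue_at`, and `window` are hypothetical, and only the first two rows of the sequence are included to keep the example short.

```python
# First two rows of the sequence block, copied from this record.
BLOCK = """\
10 20 30 40 50 60
MARKRAAGGE PRGRELRSQK SKAKSKARRE EEEEDAFEDE KPPKKSLLSK VSQGKRKRGC
70 80 90 100 110 120
SHPGGSADGP AKKKVAKVTV KSENLKVIKD EALSDGDDLR DFPSDLKKAH HLKRGATMNE
"""


def parse_ruled_sequence(text):
    """Drop the numeric ruler lines and join the residue groups into one string."""
    groups = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens or all(token.isdigit() for token in tokens):
            continue  # blank line or position ruler
        groups.extend(tokens)
    return "".join(groups)


def residue_at(seq, position):
    """Return the residue at a 1-based position."""
    if not 1 <= position <= len(seq):
        raise IndexError(f"position {position} is outside 1..{len(seq)}")
    return seq[position - 1]


def window(seq, start, length=6):
    """Return `length` residues beginning at the 1-based position `start`."""
    if not 1 <= start <= len(seq):
        raise IndexError(f"position {start} is outside 1..{len(seq)}")
    return seq[start - 1:start - 1 + length]


if __name__ == "__main__":
    seq = parse_ruled_sequence(BLOCK)
    print(len(seq), residue_at(seq, 22), window(seq, 22))
```

Under these assumptions, `window(seq, 2)` and `window(seq, 22)` reproduce the six-residue prefixes listed for the N-termini at positions 2 and 22.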
Isoforms
- Isoform 2 of DNA repair protein complementing XP-C cells
- Isoform 3 of DNA repair protein complementing XP-C cells
3 N-termini - 2 C-termini - 1 Cleavages - 0 Substrates
N-termini
Name | Sequence | Position | Modification | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|---|
Q01831-1-unknown | MARKRA... | 1 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt94723 | |
Q01831-2-unknown | ARKRAA... | 2 | | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot | |
Q01831-2-unknown | ARKRAA... | 2 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt111398 | |
Q01831-22-unknown | KAKSKA... | 22 | | inferred from cleavage | unknown | TopFIND | | Inferred from cleavage TC14359 | |
Q01831-22-unknown | KAKSKA... | 22 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt136968 | |
C-termini
Name | Sequence | Position | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|
 | ...RSQKSK | 21 | inferred from cleavage | unknown | TopFIND | | Inferred from cleavage TC14359 | |
 | ...RSQKSK | 21 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt119797 | |
 | ...PFEQL | 940 | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot | |
 | ...PFEQL | 940 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt90341 | |
Cleavages
Protease | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|
GRAM_HUMAN | 21 | QKSK.\|.KAKS | inferred from experiment | unknown | MEROPS | Bovenschen N | de Poot SA et al.: Human and mouse granzyme M disp... (S01.139) | 21564021 |
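The termini tables attribute the C-terminus at position 21 and the N-terminus at position 22 to the same cleavage (TC14359). The sketch below illustrates that relationship under one assumption, suggested by those tables but not stated in the record: the cleavage "Position" names the last residue retained on the N-terminal fragment, so the cut falls between residues 21 and 22. The function name `termini_from_cleavage` is hypothetical.

```python
# First 30 residues of Q01831, copied from the sequence block in this record.
N_TERMINAL_REGION = "MARKRAAGGEPRGRELRSQKSKAKSKARRE"


def termini_from_cleavage(cleavage_position, sequence_length):
    """Map a cleavage position to the 1-based positions of the termini it creates.

    Assumes the cleavage position is the residue immediately before the cut.
    """
    if not 1 <= cleavage_position < sequence_length:
        raise ValueError("cleavage must fall between two residues of the chain")
    return {
        "c_terminus": cleavage_position,      # last residue of the upstream fragment
        "n_terminus": cleavage_position + 1,  # first residue of the downstream fragment
    }


if __name__ == "__main__":
    termini = termini_from_cleavage(21, 940)
    start = termini["n_terminus"] - 1  # convert to a 0-based index
    print(termini, N_TERMINAL_REGION[start:start + 6])
```

With the granzyme M cleavage at position 21, this yields the N-terminus at position 22, whose first six residues match the `KAKSKA...` entry in the N-termini table.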
Substrates
Substrate | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|