Q8VC34: Putative RNA polymerase II subunit B1 CTD phosphatase Rpap2
Protein names | Putative RNA polymerase II subunit B1 CTD phosphatase Rpap2; EC 3.1.3.16; RNA polymerase II-associated protein 2 |
---|---|
Gene names | Rpap2 |
Organism | Mus musculus |
Protease Family | |
Protease ID | |
Chromosome location | |
UniProt ID | Q8VC34 |
4 N-termini - 3 C-termini - 0 Cleavages - 0 Substrates
Sequence
10 20 30 40 50 60
MADSAVPCSL GPSTRASSTH RDATGTKQTR ALKRGDASKR QAELEAAIQR KVEFERKAVR
70 80 90 100 110 120
IVEQLLEENI TEEFLKECGM FITPAHYSDV VDERSIIKLC GYPLCQKKLG VIPKQKYRIS
130 140 150 160 170 180
TKTNKVYDIT ERKSFCSNFC YRASKFFETQ IPKTPVWVRE EERPPDFQLL KKGQSGSSGE
190 200 210 220 230 240
VVQFFRDAVT AADVDGSGAL EAQCDPASSS SWSERASDEE EQGFVSSLLP GNRPKAVDTR
250 260 270 280 290 300
PQPHTKSSIM RKKAAQNVDS KEGEQTVSEV TEQLDNCRLD SQEKVATCKR PLKKESTQIS
310 320 330 340 350 360
SPGPLCDRFN TSAISEHKHG VSQVTLVGIS KKSAEHFRSK FAKSNPGSGS ASGLVHVRPE
370 380 390 400 410 420
VAKANLLRVL SDTLTEWKTE ETLKFLYGQN HDSVCLKPSS ASEPDEELDE DDISCDPGSC
430 440 450 460 470 480
GPALSQAQNT LDATLPFRGS DTAIKPLPSY ESLKKETEML NLRVREFYRG RCVLNEDTTK
490 500 510 520 530 540
SQDSKESVLQ RDPSFPLIDS SSQNQIRRRI VLEKLSKVLP GLLGPLQITM GDIYTELKNL
550 560 570 580 590 600
IQTFRLSNRN IIHKPVEWTL IAVVLLLLLT PILGIQKHSP KNVVFTQFIA TLLTELHLKF
610
EDLEKLTMIF RTSC
Isoforms
- Isoform 2 of Putative RNA polymerase II subunit B1 CTD phosphatase Rpap2
- Isoform 3 of Putative RNA polymerase II subunit B1 CTD phosphatase Rpap2
- Isoform 4 of Putative RNA polymerase II subunit B1 CTD phosphatase Rpap2
- Isoform 5 of Putative RNA polymerase II subunit B1 CTD phosphatase Rpap2
N-termini
Name | Sequence | Position | Modification | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|---|
Q8VC34-1-unknown | MADSAV... | 1 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt88113 | |
Q8VC34-1-unknown | MADSAV... | 1 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt88114 | |
Q8VC34-2-unknown | MADSAV... | 1 | | inferred from similarity | unknown | UniProtKB | | inferred from uniprot (by similarity) | |
Q8VC34-2-unknown | MADSAV... | 1 | | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot | |
Q8VC34-2-Acetylation | ADSAVP... | 2 | acetylation- | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot | |
Q8VC34-2-Acetylation | ADSAVP... | 2 | acetylation- | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt110452 | |
Q8VC34-2-Acetylation | ADSAVP... | 2 | acetylation- | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt110453 | |
Q8VC34-2-Acetylation | ADSAVP... | 2 | acetylation- | inferred from similarity | unknown | UniProtKB | | inferred from uniprot (by similarity) | |
Q8VC34-80-unknown | MFITPA... | 80 | | inferred from electronic annotation | unknown | Ensembl | | inferred from ensembl protein ENSMUSP00000108269 | |
Q8VC34-80-unknown | MFITPA... | 80 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt196656 | |
Q8VC34-80-unknown | MFITPA... | 80 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt196655 | |
Q8VC34-80-unknown | MFITPA... | 80 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt196657 | |
Q8VC34-80-unknown | MFITPA... | 80 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt196658 | |
C-termini
| Name | Sequence | Position | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
|---|---|---|---|---|---|---|---|---|
| | ...AVRIVE | 62 | inferred from electronic annotation | unknown | Ensembl | | inferred from ensembl protein ENSMUSP00000121306 | |
| | ...AVRIVE | 62 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt153939 | |
| | ...AVRIVE | 62 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt153940 | |
| | ...VLLLLL | 568 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt83733 | |
| | ...VLLLLL | 568 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt83732 | |
| | ...VLLLLL | 568 | inferred from electronic annotation | unknown | Ensembl | | inferred from ensembl protein ENSMUSP00000108269 | |
| | ...VLLLLL | 568 | inferred from electronic annotation | unknown | Ensembl | | inferred from ensembl protein ENSMUSP00000108273 | |
| | ...VLLLLL | 568 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt92923 | |
| | ...VLLLLL | 568 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt92924 | |
| | ...FRTSC | 614 | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot | |
| | ...FRTSC | 614 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt83734 | |
Cleavages
Protease | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|
Substrates
Substrate | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|