P40692: DNA mismatch repair protein Mlh1
Protein names | DNA mismatch repair protein Mlh1; MutL protein homolog 1 |
---|---|
Gene names | MLH1 |
Organism | Homo sapiens |
Protease Family | |
Protease ID | |
Chromosome location | |
UniProt ID | P40692 |
5 N-termini - 3 C-termini - 2 Cleavages - 0 Substrates
Sequence
10 20 30 40 50 60
MSFVAGVIRR LDETVVNRIA AGEVIQRPAN AIKEMIENCL DAKSTSIQVI VKEGGLKLIQ
70 80 90 100 110 120
IQDNGTGIRK EDLDIVCERF TTSKLQSFED LASISTYGFR GEALASISHV AHVTITTKTA
130 140 150 160 170 180
DGKCAYRASY SDGKLKAPPK PCAGNQGTQI TVEDLFYNIA TRRKALKNPS EEYGKILEVV
190 200 210 220 230 240
GRYSVHNAGI SFSVKKQGET VADVRTLPNA STVDNIRSIF GNAVSRELIE IGCEDKTLAF
250 260 270 280 290 300
KMNGYISNAN YSVKKCIFLL FINHRLVEST SLRKAIETVY AAYLPKNTHP FLYLSLEISP
310 320 330 340 350 360
QNVDVNVHPT KHEVHFLHEE SILERVQQHI ESKLLGSNSS RMYFTQTLLP GLAGPSGEMV
370 380 390 400 410 420
KSTTSLTSSS TSGSSDKVYA HQMVRTDSRE QKLDAFLQPL SKPLSSQPQA IVTEDKTDIS
430 440 450 460 470 480
SGRARQQDEE MLELPAPAEV AAKNQSLEGD TTKGTSEMSE KRGPTSSNPR KRHREDSDVE
490 500 510 520 530 540
MVEDDSRKEM TAACTPRRRI INLTSVLSLQ EEINEQGHEV LREMLHNHSF VGCVNPQWAL
550 560 570 580 590 600
AQHQTKLYLL NTTKLSEELF YQILIYDFAN FGVLRLSEPA PLFDLAMLAL DSPESGWTEE
610 620 630 640 650 660
DGPKEGLAEY IVEFLKKKAE MLADYFSLEI DEEGNLIGLP LLIDNYVPPL EGLPIFILRL
670 680 690 700 710 720
ATEVNWDEEK ECFESLSKEC AMFYSIRKQY ISEESTLSGQ QSEVPGSIPN SWKWTVEHIV
730 740 750
YKALRSHILP PKHFTEDGNI LQLANLPDLY KVFERC
Isoforms
- Isoform 2 of DNA mismatch repair protein Mlh1
- Isoform 3 of DNA mismatch repair protein Mlh1
N-termini
Name | Sequence | Position | Modification | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|---|
P40692-2-unknown | MSFVAG... | 1 | | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot | |
P40692-2-Acetylation | SFVAGV... | 2 | acetylation- | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot | |
P40692-2-Acetylation | SFVAGV... | 2 | acetylation- | | COFRADIC | | Gevaert K. | Van Damme P et al.: Complementary positional proteomics for screening substrates... | 20526345 |
P40692-2-Acetylation | SFVAGV... | 2 | acetylation- | | other | | CNRS/ISV | Large_scale_NTA_Human | |
P40692-2-Acetylation | SFVAGV... | 2 | acetylation- | | COFRADIC | | Gevaert K. | Van Damme P et al.: PC3-cells, Complementary positional proteomics for screening substrates... | 20526345 |
P40692-242-unknown | MNGYIS... | 242 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt81248 | |
P40692-242-unknown | MNGYIS... | 242 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt101337 | |
P40692-419-unknown | ISSGRA... | 419 | | inferred from cleavage | unknown | TopFIND | | Inferred from cleavage TC1819 | |
P40692-419-unknown | ISSGRA... | 419 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt145816 | |
P40692-419-unknown | ISSGRA... | 419 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt150350 | |
P40692-451-unknown | TTKGTS... | 451 | | inferred from cleavage | unknown | TopFIND | | Inferred from cleavage TC11494 | |
P40692-451-unknown | TTKGTS... | 451 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt146975 | |
P40692-451-unknown | TTKGTS... | 451 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt151022 | |
C-termini
| Name | Sequence | Position | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
|---|---|---|---|---|---|---|---|---|
| | ...EDKTDI | 418 | inferred from cleavage | unknown | TopFIND | | Inferred from cleavage TC1819 | |
| | ...EDKTDI | 418 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt129113 | |
| | ...EDKTDI | 418 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt133747 | |
| | ...SLEGDT | 450 | inferred from cleavage | unknown | TopFIND | | Inferred from cleavage TC11494 | |
| | ...SLEGDT | 450 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt130307 | |
| | ...SLEGDT | 450 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt134425 | |
| | ...VFERC | 756 | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot | |
| | ...VFERC | 756 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt76866 | |
| | ...VFERC | 756 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt76867 | |
Cleavages
Protease | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|
CASP3_HUMAN | 418 | KTDI.\|.ISSG | inferred from experiment | unknown | MEROPS | Cryns VL | Chen F et al.: Proteolysis of the mismatch rep... (C14.003) | 15087450 |
GRAB_HUMAN | 450 | EGDT.\|.TTKG | inferred from experiment | unknown | MEROPS | Gevaert K | Van Damme P et al.: Complementary positional proteo... (M14.017) | 20526345 |
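The positions in the Cleavages table line up with the termini tables: each cut at position P pairs a C-terminus at P with an N-terminus at P + 1 (418/419 and 450/451). A minimal Python sketch of that bookkeeping follows, assuming Position names the last residue before the cleaved bond. The names `neo_termini`, `flanking_window`, `FRAGMENT` and `FRAGMENT_START` are illustrative and not part of TopFIND or MEROPS.

```python
# Residues 411-460 of the sequence above, taken from the rows ending at 420 and 480.
FRAGMENT = "IVTEDKTDISSGRARQQDEEMLELPAPAEVAAKNQSLEGDTTKGTSEMSE"
FRAGMENT_START = 411  # 1-based position of the first residue in FRAGMENT


def neo_termini(p1):
    """Return (new C-terminus, new N-terminus) positions for a cut after residue p1.

    Assumption: p1 is the last residue before the cleaved bond.
    """
    return p1, p1 + 1


def flanking_window(p1, flank=4, seq=FRAGMENT, start=FRAGMENT_START):
    """Return the `flank` residues on each side of the bond between p1 and p1 + 1."""
    i = p1 - start  # 0-based index of residue p1 within seq
    if i - flank + 1 < 0 or i + 1 + flank > len(seq):
        raise ValueError("window falls outside the fragment")
    return seq[i - flank + 1 : i + 1], seq[i + 1 : i + 1 + flank]


for protease, p1 in [("CASP3_HUMAN", 418), ("GRAB_HUMAN", 450)]:
    c_term, n_term = neo_termini(p1)
    left, right = flanking_window(p1)
    print(f"{protease}: {left}|{right} -> C-terminus {c_term}, N-terminus {n_term}")
```

Under that assumption both cuts fall after an aspartate (D418 and D450), and the residues that follow match the ISSGRA... and TTKGTS... N-termini listed above. The windows are read directly from the sequence, so they differ by one residue from the strings recorded in the table's Sequence column.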
Substrates
Substrate | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|