Q96L73: Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20 specific
Protein names | - Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20 specific - 2.1.1.43 - Androgen receptor coactivator 267 kDa protein - Androgen receptor-associated protein of 267 kDa - H3-K36-HMTase - H4-K20-HMTase - Lysine N-methyltransferase 3B - Nuclear receptor-binding SET domain-containing protein 1 - NR-binding SET domain-containing protein |
---|---|
Gene names | NSD1 |
Organism | Homo sapiens |
Protease Family | |
Protease ID | |
Chromosome location | |
UniProt ID | Q96L73 |
1
N-termini
1
C-termini
0
Cleavages
0
Substrates
Sequence
10 20 30 40 50 60
MDQTCELPRR NCLLPFSNPV NLDAPEDKDS PFGNGQSNFS EPLNGCTMQL STVSGTSQNA
70 80 90 100 110 120
YGQDSPSCYI PLRRLQDLAS MINVEYLNGS ADGSESFQDP EKSDSRAQTP IVCTSLSPGG
130 140 150 160 170 180
PTALAMKQEP SCNNSPELQV KVTKTIKNGF LHFENFTCVD DADVDSEMDP EQPVTEDESI
190 200 210 220 230 240
EEIFEETQTN ATCNYETKSE NGVKVAMGSE QDSTPESRHG AVKSPFLPLA PQTETQKNKQ
250 260 270 280 290 300
RNEVDGSNEK AALLPAPFSL GDTNITIEEQ LNSINLSFQD DPDSSTSTLG NMLELPGTSS
310 320 330 340 350 360
SSTSQELPFC QPKKKSTPLK YEVGDLIWAK FKRRPWWPCR ICSDPLINTH SKMKVSNRRP
370 380 390 400 410 420
YRQYYVEAFG DPSERAWVAG KAIVMFEGRH QFEELPVLRR RGKQKEKGYR HKVPQKILSK
430 440 450 460 470 480
WEASVGLAEQ YDVPKGSKNR KCIPGSIKLD SEEDMPFEDC TNDPESEHDL LLNGCLKSLA
490 500 510 520 530 540
FDSEHSADEK EKPCAKSRAR KSSDNPKRTS VKKGHIQFEA HKDERRGKIP ENLGLNFISG
550 560 570 580 590 600
DISDTQASNE LSRIANSLTG SNTAPGSFLF SSCGKNTAKK EFETSNGDSL LGLPEGALIS
610 620 630 640 650 660
KCSREKNKPQ RSLVCGSKVK LCYIGAGDEE KRSDSISICT TSDDGSSDLD PIEHSSESDN
670 680 690 700 710 720
SVLEIPDAFD RTENMLSMQK NEKIKYSRFA ATNTRVKAKQ KPLISNSHTD HLMGCTKSAE
730 740 750 760 770 780
PGTETSQVNL SDLKASTLVH KPQSDFTNDA LSPKFNLSSS ISSENSLIKG GAANQALLHS
790 800 810 820 830 840
KSKQPKFRSI KCKHKENPVM AEPPVINEEC SLKCCSSDTK GSPLASISKS GKVDGLKLLN
850 860 870 880 890 900
NMHEKTRDSS DIETAVVKHV LSELKELSYR SLGEDVSDSG TSKPSKPLLF SSASSQNHIP
910 920 930 940 950 960
IEPDYKFSTL LMMLKDMHDS KTKEQRLMTA QNLVSYRSPG RGDCSTNSPV GVSKVLVSGG
970 980 990 1000 1010 1020
STHNSEKKGD GTQNSANPSP SGGDSALSGE LSASLPGLLS DKRDLPASGK SRSDCVTRRN
1030 1040 1050 1060 1070 1080
CGRSKPSSKL RDAFSAQMVK NTVNRKALKT ERKRKLNQLP SVTLDAVLQG DRERGGSLRG
1090 1100 1110 1120 1130 1140
GAEDPSKEDP LQIMGHLTSE DGDHFSDVHF DSKVKQSDPG KISEKGLSFE NGKGPELDSV
1150 1160 1170 1180 1190 1200
MNSENDELNG VNQVVPKKRW QRLNQRRTKP RKRMNRFKEK ENSECAFRVL LPSDPVQEGR
1210 1220 1230 1240 1250 1260
DEFPEHRTPS ASILEEPLTE QNHADCLDSA GPRLNVCDKS SASIGDMEKE PGIPSLTPQA
1270 1280 1290 1300 1310 1320
ELPEPAVRSE KKRLRKPSKW LLEYTEEYDQ IFAPKKKQKK VQEQVHKVSS RCEEESLLAR
1330 1340 1350 1360 1370 1380
GRSSAQNKQV DENSLISTKE EPPVLEREAP FLEGPLAQSE LGGGHAELPQ LTLSVPVAPE
1390 1400 1410 1420 1430 1440
VSPRPALESE ELLVKTPGNY ESKRQRKPTK KLLESNDLDP GFMPKKGDLG LSKKCYEAGH
1450 1460 1470 1480 1490 1500
LENGITESCA TSYSKDFGGG TTKIFDKPRK RKRQRHAAAK MQCKKVKNDD SSKEIPGSEG
1510 1520 1530 1540 1550 1560
ELMPHRTATS PKETVEEGVE HDPGMPASKK MQGERGGGAA LKENVCQNCE KLGELLLCEA
1570 1580 1590 1600 1610 1620
QCCGAFHLEC LGLTEMPRGK FICNECRTGI HTCFVCKQSG EDVKRCLLPL CGKFYHEECV
1630 1640 1650 1660 1670 1680
QKYPPTVMQN KGFRCSLHIC ITCHAANPAN VSASKGRLMR CVRCPVAYHA NDFCLAAGSK
1690 1700 1710 1720 1730 1740
ILASNSIICP NHFTPRRGCR NHEHVNVSWC FVCSEGGSLL CCDSCPAAFH RECLNIDIPE
1750 1760 1770 1780 1790 1800
GNWYCNDCKA GKKPHYREIV WVKVGRYRWW PAEICHPRAV PSNIDKMRHD VGEFPVLFFG
1810 1820 1830 1840 1850 1860
SNDYLWTHQA RVFPYMEGDV SSKDKMGKGV DGTYKKALQE AAARFEELKA QKELRQLQED
1870 1880 1890 1900 1910 1920
RKNDKKPPPY KHIKVNRPIG RVQIFTADLS EIPRCNCKAT DENPCGIDSE CINRMLLYEC
1930 1940 1950 1960 1970 1980
HPTVCPAGGR CQNQCFSKRQ YPEVEIFRTL QRGWGLRTKT DIKKGEFVNE YVGELIDEEE
1990 2000 2010 2020 2030 2040
CRARIRYAQE HDITNFYMLT LDKDRIIDAG PKGNYARFMN HCCQPNCETQ KWSVNGDTRV
2050 2060 2070 2080 2090 2100
GLFALSDIKA GTELTFNYNL ECLGNGKTVC KCGAPNCSGF LGVRPKNQPI ATEEKSKKFK
2110 2120 2130 2140 2150 2160
KKQQGKRRTQ GEITKEREDE CFSCGDAGQL VSCKKPGCPK VYHADCLNLT KRPAGKWECP
2170 2180 2190 2200 2210 2220
WHQCDICGKE AASFCEMCPS SFCKQHREGM LFISKLDGRL SCTEHDPCGP NPLEPGEIRE
2230 2240 2250 2260 2270 2280
YVPPPVPLPP GPSTHLAEQS TGMAAQAPKM SDKPPADTNQ MLSLSKKALA GTCQRPLLPE
2290 2300 2310 2320 2330 2340
RPLERTDSRP QPLDKVRDLA GSGTKSQSLV SSQRPLDRPP AVAGPRPQLS DKPSPVTSPS
2350 2360 2370 2380 2390 2400
SSPSVRSQPL ERPLGTADPR LDKSIGAASP RPQSLEKTSV PTGLRLPPPD RLLITSSPKP
2410 2420 2430 2440 2450 2460
QTSDRPTDKP HASLSQRLPP PEKVLSAVVQ TLVAKEKALR PVDQNTQSKN RAALVMDLID
2470 2480 2490 2500 2510 2520
LTPRQKERAA SPHQVTPQAD EKMPVLESSS WPASKGLGHM PRAVEKGCVS DPLQTSGKAA
2530 2540 2550 2560 2570 2580
APSEDPWQAV KSLTQARLLS QPPAKAFLYE PTTQASGRAS AGAEQTPGPL SQSPGLVKQA
2590 2600 2610 2620 2630 2640
KQMVGGQQLP ALAAKSGQSF RSLGKAPASL PTEEKKLVTT EQSPWALGKA SSRAGLWPIV
2650 2660 2670 2680 2690
AGQTLAQSCW SAGSTQTLAQ TCWSLGRGQD PKPEQNTLPA LNQAPSSHKC AESEQK
Isoforms
- Isoform 2 of Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20 specific - Isoform 3 of Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20 specificSequence View
10 20 30 40 50 60
MDQTCELPRR NCLLPFSNPV NLDAPEDKDS PFGNGQSNFS EPLNGCTMQL STVSGTSQNA
70 80 90 100 110 120
YGQDSPSCYI PLRRLQDLAS MINVEYLNGS ADGSESFQDP EKSDSRAQTP IVCTSLSPGG
130 140 150 160 170 180
PTALAMKQEP SCNNSPELQV KVTKTIKNGF LHFENFTCVD DADVDSEMDP EQPVTEDESI
190 200 210 220 230 240
EEIFEETQTN ATCNYETKSE NGVKVAMGSE QDSTPESRHG AVKSPFLPLA PQTETQKNKQ
250 260 270 280 290 300
RNEVDGSNEK AALLPAPFSL GDTNITIEEQ LNSINLSFQD DPDSSTSTLG NMLELPGTSS
310 320 330 340 350 360
SSTSQELPFC QPKKKSTPLK YEVGDLIWAK FKRRPWWPCR ICSDPLINTH SKMKVSNRRP
370 380 390 400 410 420
YRQYYVEAFG DPSERAWVAG KAIVMFEGRH QFEELPVLRR RGKQKEKGYR HKVPQKILSK
430 440 450 460 470 480
WEASVGLAEQ YDVPKGSKNR KCIPGSIKLD SEEDMPFEDC TNDPESEHDL LLNGCLKSLA
490 500 510 520 530 540
FDSEHSADEK EKPCAKSRAR KSSDNPKRTS VKKGHIQFEA HKDERRGKIP ENLGLNFISG
550 560 570 580 590 600
DISDTQASNE LSRIANSLTG SNTAPGSFLF SSCGKNTAKK EFETSNGDSL LGLPEGALIS
610 620 630 640 650 660
KCSREKNKPQ RSLVCGSKVK LCYIGAGDEE KRSDSISICT TSDDGSSDLD PIEHSSESDN
670 680 690 700 710 720
SVLEIPDAFD RTENMLSMQK NEKIKYSRFA ATNTRVKAKQ KPLISNSHTD HLMGCTKSAE
730 740 750 760 770 780
PGTETSQVNL SDLKASTLVH KPQSDFTNDA LSPKFNLSSS ISSENSLIKG GAANQALLHS
790 800 810 820 830 840
KSKQPKFRSI KCKHKENPVM AEPPVINEEC SLKCCSSDTK GSPLASISKS GKVDGLKLLN
850 860 870 880 890 900
NMHEKTRDSS DIETAVVKHV LSELKELSYR SLGEDVSDSG TSKPSKPLLF SSASSQNHIP
910 920 930 940 950 960
IEPDYKFSTL LMMLKDMHDS KTKEQRLMTA QNLVSYRSPG RGDCSTNSPV GVSKVLVSGG
970 980 990 1000 1010 1020
STHNSEKKGD GTQNSANPSP SGGDSALSGE LSASLPGLLS DKRDLPASGK SRSDCVTRRN
1030 1040 1050 1060 1070 1080
CGRSKPSSKL RDAFSAQMVK NTVNRKALKT ERKRKLNQLP SVTLDAVLQG DRERGGSLRG
1090 1100 1110 1120 1130 1140
GAEDPSKEDP LQIMGHLTSE DGDHFSDVHF DSKVKQSDPG KISEKGLSFE NGKGPELDSV
1150 1160 1170 1180 1190 1200
MNSENDELNG VNQVVPKKRW QRLNQRRTKP RKRMNRFKEK ENSECAFRVL LPSDPVQEGR
1210 1220 1230 1240 1250 1260
DEFPEHRTPS ASILEEPLTE QNHADCLDSA GPRLNVCDKS SASIGDMEKE PGIPSLTPQA
1270 1280 1290 1300 1310 1320
ELPEPAVRSE KKRLRKPSKW LLEYTEEYDQ IFAPKKKQKK VQEQVHKVSS RCEEESLLAR
1330 1340 1350 1360 1370 1380
GRSSAQNKQV DENSLISTKE EPPVLEREAP FLEGPLAQSE LGGGHAELPQ LTLSVPVAPE
1390 1400 1410 1420 1430 1440
VSPRPALESE ELLVKTPGNY ESKRQRKPTK KLLESNDLDP GFMPKKGDLG LSKKCYEAGH
1450 1460 1470 1480 1490 1500
LENGITESCA TSYSKDFGGG TTKIFDKPRK RKRQRHAAAK MQCKKVKNDD SSKEIPGSEG
1510 1520 1530 1540 1550 1560
ELMPHRTATS PKETVEEGVE HDPGMPASKK MQGERGGGAA LKENVCQNCE KLGELLLCEA
1570 1580 1590 1600 1610 1620
QCCGAFHLEC LGLTEMPRGK FICNECRTGI HTCFVCKQSG EDVKRCLLPL CGKFYHEECV
1630 1640 1650 1660 1670 1680
QKYPPTVMQN KGFRCSLHIC ITCHAANPAN VSASKGRLMR CVRCPVAYHA NDFCLAAGSK
1690 1700 1710 1720 1730 1740
ILASNSIICP NHFTPRRGCR NHEHVNVSWC FVCSEGGSLL CCDSCPAAFH RECLNIDIPE
1750 1760 1770 1780 1790 1800
GNWYCNDCKA GKKPHYREIV WVKVGRYRWW PAEICHPRAV PSNIDKMRHD VGEFPVLFFG
1810 1820 1830 1840 1850 1860
SNDYLWTHQA RVFPYMEGDV SSKDKMGKGV DGTYKKALQE AAARFEELKA QKELRQLQED
1870 1880 1890 1900 1910 1920
RKNDKKPPPY KHIKVNRPIG RVQIFTADLS EIPRCNCKAT DENPCGIDSE CINRMLLYEC
1930 1940 1950 1960 1970 1980
HPTVCPAGGR CQNQCFSKRQ YPEVEIFRTL QRGWGLRTKT DIKKGEFVNE YVGELIDEEE
1990 2000 2010 2020 2030 2040
CRARIRYAQE HDITNFYMLT LDKDRIIDAG PKGNYARFMN HCCQPNCETQ KWSVNGDTRV
2050 2060 2070 2080 2090 2100
GLFALSDIKA GTELTFNYNL ECLGNGKTVC KCGAPNCSGF LGVRPKNQPI ATEEKSKKFK
2110 2120 2130 2140 2150 2160
KKQQGKRRTQ GEITKEREDE CFSCGDAGQL VSCKKPGCPK VYHADCLNLT KRPAGKWECP
2170 2180 2190 2200 2210 2220
WHQCDICGKE AASFCEMCPS SFCKQHREGM LFISKLDGRL SCTEHDPCGP NPLEPGEIRE
2230 2240 2250 2260 2270 2280
YVPPPVPLPP GPSTHLAEQS TGMAAQAPKM SDKPPADTNQ MLSLSKKALA GTCQRPLLPE
2290 2300 2310 2320 2330 2340
RPLERTDSRP QPLDKVRDLA GSGTKSQSLV SSQRPLDRPP AVAGPRPQLS DKPSPVTSPS
2350 2360 2370 2380 2390 2400
SSPSVRSQPL ERPLGTADPR LDKSIGAASP RPQSLEKTSV PTGLRLPPPD RLLITSSPKP
2410 2420 2430 2440 2450 2460
QTSDRPTDKP HASLSQRLPP PEKVLSAVVQ TLVAKEKALR PVDQNTQSKN RAALVMDLID
2470 2480 2490 2500 2510 2520
LTPRQKERAA SPHQVTPQAD EKMPVLESSS WPASKGLGHM PRAVEKGCVS DPLQTSGKAA
2530 2540 2550 2560 2570 2580
APSEDPWQAV KSLTQARLLS QPPAKAFLYE PTTQASGRAS AGAEQTPGPL SQSPGLVKQA
2590 2600 2610 2620 2630 2640
KQMVGGQQLP ALAAKSGQSF RSLGKAPASL PTEEKKLVTT EQSPWALGKA SSRAGLWPIV
2650 2660 2670 2680 2690
AGQTLAQSCW SAGSTQTLAQ TCWSLGRGQD PKPEQNTLPA LNQAPSSHKC AESEQK
10 20 30 40 50 60
MDQTCELPRR NCLLPFSNPV NLDAPEDKDS PFGNGQSNFS EPLNGCTMQL STVSGTSQNA
70 80 90 100 110 120
YGQDSPSCYI PLRRLQDLAS MINVEYLNGS ADGSESFQDP EKSDSRAQTP IVCTSLSPGG
130 140 150 160 170 180
PTALAMKQEP SCNNSPELQV KVTKTIKNGF LHFENFTCVD DADVDSEMDP EQPVTEDESI
190 200 210 220 230 240
EEIFEETQTN ATCNYETKSE NGVKVAMGSE QDSTPESRHG AVKSPFLPLA PQTETQKNKQ
250 260 270 280 290 300
RNEVDGSNEK AALLPAPFSL GDTNITIEEQ LNSINLSFQD DPDSSTSTLG NMLELPGTSS
310 320 330 340 350 360
SSTSQELPFC QPKKKSTPLK YEVGDLIWAK FKRRPWWPCR ICSDPLINTH SKMKVSNRRP
370 380 390 400 410 420
YRQYYVEAFG DPSERAWVAG KAIVMFEGRH QFEELPVLRR RGKQKEKGYR HKVPQKILSK
430 440 450 460 470 480
WEASVGLAEQ YDVPKGSKNR KCIPGSIKLD SEEDMPFEDC TNDPESEHDL LLNGCLKSLA
490 500 510 520 530 540
FDSEHSADEK EKPCAKSRAR KSSDNPKRTS VKKGHIQFEA HKDERRGKIP ENLGLNFISG
550 560 570 580 590 600
DISDTQASNE LSRIANSLTG SNTAPGSFLF SSCGKNTAKK EFETSNGDSL LGLPEGALIS
610 620 630 640 650 660
KCSREKNKPQ RSLVCGSKVK LCYIGAGDEE KRSDSISICT TSDDGSSDLD PIEHSSESDN
670 680 690 700 710 720
SVLEIPDAFD RTENMLSMQK NEKIKYSRFA ATNTRVKAKQ KPLISNSHTD HLMGCTKSAE
730 740 750 760 770 780
PGTETSQVNL SDLKASTLVH KPQSDFTNDA LSPKFNLSSS ISSENSLIKG GAANQALLHS
790 800 810 820 830 840
KSKQPKFRSI KCKHKENPVM AEPPVINEEC SLKCCSSDTK GSPLASISKS GKVDGLKLLN
850 860 870 880 890 900
NMHEKTRDSS DIETAVVKHV LSELKELSYR SLGEDVSDSG TSKPSKPLLF SSASSQNHIP
910 920 930 940 950 960
IEPDYKFSTL LMMLKDMHDS KTKEQRLMTA QNLVSYRSPG RGDCSTNSPV GVSKVLVSGG
970 980 990 1000 1010 1020
STHNSEKKGD GTQNSANPSP SGGDSALSGE LSASLPGLLS DKRDLPASGK SRSDCVTRRN
1030 1040 1050 1060 1070 1080
CGRSKPSSKL RDAFSAQMVK NTVNRKALKT ERKRKLNQLP SVTLDAVLQG DRERGGSLRG
1090 1100 1110 1120 1130 1140
GAEDPSKEDP LQIMGHLTSE DGDHFSDVHF DSKVKQSDPG KISEKGLSFE NGKGPELDSV
1150 1160 1170 1180 1190 1200
MNSENDELNG VNQVVPKKRW QRLNQRRTKP RKRMNRFKEK ENSECAFRVL LPSDPVQEGR
1210 1220 1230 1240 1250 1260
DEFPEHRTPS ASILEEPLTE QNHADCLDSA GPRLNVCDKS SASIGDMEKE PGIPSLTPQA
1270 1280 1290 1300 1310 1320
ELPEPAVRSE KKRLRKPSKW LLEYTEEYDQ IFAPKKKQKK VQEQVHKVSS RCEEESLLAR
1330 1340 1350 1360 1370 1380
GRSSAQNKQV DENSLISTKE EPPVLEREAP FLEGPLAQSE LGGGHAELPQ LTLSVPVAPE
1390 1400 1410 1420 1430 1440
VSPRPALESE ELLVKTPGNY ESKRQRKPTK KLLESNDLDP GFMPKKGDLG LSKKCYEAGH
1450 1460 1470 1480 1490 1500
LENGITESCA TSYSKDFGGG TTKIFDKPRK RKRQRHAAAK MQCKKVKNDD SSKEIPGSEG
1510 1520 1530 1540 1550 1560
ELMPHRTATS PKETVEEGVE HDPGMPASKK MQGERGGGAA LKENVCQNCE KLGELLLCEA
1570 1580 1590 1600 1610 1620
QCCGAFHLEC LGLTEMPRGK FICNECRTGI HTCFVCKQSG EDVKRCLLPL CGKFYHEECV
1630 1640 1650 1660 1670 1680
QKYPPTVMQN KGFRCSLHIC ITCHAANPAN VSASKGRLMR CVRCPVAYHA NDFCLAAGSK
1690 1700 1710 1720 1730 1740
ILASNSIICP NHFTPRRGCR NHEHVNVSWC FVCSEGGSLL CCDSCPAAFH RECLNIDIPE
1750 1760 1770 1780 1790 1800
GNWYCNDCKA GKKPHYREIV WVKVGRYRWW PAEICHPRAV PSNIDKMRHD VGEFPVLFFG
1810 1820 1830 1840 1850 1860
SNDYLWTHQA RVFPYMEGDV SSKDKMGKGV DGTYKKALQE AAARFEELKA QKELRQLQED
1870 1880 1890 1900 1910 1920
RKNDKKPPPY KHIKVNRPIG RVQIFTADLS EIPRCNCKAT DENPCGIDSE CINRMLLYEC
1930 1940 1950 1960 1970 1980
HPTVCPAGGR CQNQCFSKRQ YPEVEIFRTL QRGWGLRTKT DIKKGEFVNE YVGELIDEEE
1990 2000 2010 2020 2030 2040
CRARIRYAQE HDITNFYMLT LDKDRIIDAG PKGNYARFMN HCCQPNCETQ KWSVNGDTRV
2050 2060 2070 2080 2090 2100
GLFALSDIKA GTELTFNYNL ECLGNGKTVC KCGAPNCSGF LGVRPKNQPI ATEEKSKKFK
2110 2120 2130 2140 2150 2160
KKQQGKRRTQ GEITKEREDE CFSCGDAGQL VSCKKPGCPK VYHADCLNLT KRPAGKWECP
2170 2180 2190 2200 2210 2220
WHQCDICGKE AASFCEMCPS SFCKQHREGM LFISKLDGRL SCTEHDPCGP NPLEPGEIRE
2230 2240 2250 2260 2270 2280
YVPPPVPLPP GPSTHLAEQS TGMAAQAPKM SDKPPADTNQ MLSLSKKALA GTCQRPLLPE
2290 2300 2310 2320 2330 2340
RPLERTDSRP QPLDKVRDLA GSGTKSQSLV SSQRPLDRPP AVAGPRPQLS DKPSPVTSPS
2350 2360 2370 2380 2390 2400
SSPSVRSQPL ERPLGTADPR LDKSIGAASP RPQSLEKTSV PTGLRLPPPD RLLITSSPKP
2410 2420 2430 2440 2450 2460
QTSDRPTDKP HASLSQRLPP PEKVLSAVVQ TLVAKEKALR PVDQNTQSKN RAALVMDLID
2470 2480 2490 2500 2510 2520
LTPRQKERAA SPHQVTPQAD EKMPVLESSS WPASKGLGHM PRAVEKGCVS DPLQTSGKAA
2530 2540 2550 2560 2570 2580
APSEDPWQAV KSLTQARLLS QPPAKAFLYE PTTQASGRAS AGAEQTPGPL SQSPGLVKQA
2590 2600 2610 2620 2630 2640
KQMVGGQQLP ALAAKSGQSF RSLGKAPASL PTEEKKLVTT EQSPWALGKA SSRAGLWPIV
2650 2660 2670 2680 2690
AGQTLAQSCW SAGSTQTLAQ TCWSLGRGQD PKPEQNTLPA LNQAPSSHKC AESEQK
Protein Neighborhood
Domains & Features
1 N-termini - 1 C-termini - 0 Cleavages - 0 Substrates
N-termini
Name | Sequence | Position | Modification | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMID) |
---|---|---|---|---|---|---|---|---|---|
Q96L73-1-unknown | MDQTCE... | 1 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot | |||
Q96L73-1-unknown | MDQTCE... | 1 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt83329 |
C-termini
Name | Sequence | Position | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|
...ESEQK | 2696 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot | |||
...ESEQK | 2696 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt78946 | |||
...ESEQK | 2696 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt78947 |
Cleavages
Protease | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|
Substrates
Substrate | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|