P42858: Huntingtin
Protein names | - Huntingtin - Huntington disease protein - HD protein - Huntingtin, myristoylated N-terminal fragment |
---|---|
Gene names | HTT |
Organism | Homo sapiens |
Protease Family | |
Protease ID | |
Chromosome location | |
UniProt ID | P42858 |
8
N-termini
9
C-termini
11
Cleavages
0
Substrates
Sequence
10 20 30 40 50 60
MATLEKLMKA FESLKSFQQQ QQQQQQQQQQ QQQQQQQQPP PPPPPPPPPQ LPQPPPQAQP
70 80 90 100 110 120
LLPQPQPPPP PPPPPPGPAV AEEPLHRPKK ELSATKKDRV NHCLTICENI VAQSVRNSPE
130 140 150 160 170 180
FQKLLGIAME LFLLCSDDAE SDVRMVADEC LNKVIKALMD SNLPRLQLEL YKEIKKNGAP
190 200 210 220 230 240
RSLRAALWRF AELAHLVRPQ KCRPYLVNLL PCLTRTSKRP EESVQETLAA AVPKIMASFG
250 260 270 280 290 300
NFANDNEIKV LLKAFIANLK SSSPTIRRTA AGSAVSICQH SRRTQYFYSW LLNVLLGLLV
310 320 330 340 350 360
PVEDEHSTLL ILGVLLTLRY LVPLLQQQVK DTSLKGSFGV TRKEMEVSPS AEQLVQVYEL
370 380 390 400 410 420
TLHHTQHQDH NVVTGALELL QQLFRTPPPE LLQTLTAVGG IGQLTAAKEE SGGRSRSGSI
430 440 450 460 470 480
VELIAGGGSS CSPVLSRKQK GKVLLGEEEA LEDDSESRSD VSSSALTASV KDEISGELAA
490 500 510 520 530 540
SSGVSTPGSA GHDIITEQPR SQHTLQADSV DLASCDLTSS ATDGDEEDIL SHSSSQVSAV
550 560 570 580 590 600
PSDPAMDLND GTQASSPISD SSQTTTEGPD SAVTPSDSSE IVLDGTDNQY LGLQIGQPQD
610 620 630 640 650 660
EDEEATGILP DEASEAFRNS SMALQQAHLL KNMSHCRQPS DSSVDKFVLR DEATEPGDQE
670 680 690 700 710 720
NKPCRIKGDI GQSTDDDSAP LVHCVRLLSA SFLLTGGKNV LVPDRDVRVS VKALALSCVG
730 740 750 760 770 780
AAVALHPESF FSKLYKVPLD TTEYPEEQYV SDILNYIDHG DPQVRGATAI LCGTLICSIL
790 800 810 820 830 840
SRSRFHVGDW MGTIRTLTGN TFSLADCIPL LRKTLKDESS VTCKLACTAV RNCVMSLCSS
850 860 870 880 890 900
SYSELGLQLI IDVLTLRNSS YWLVRTELLE TLAEIDFRLV SFLEAKAENL HRGAHHYTGL
910 920 930 940 950 960
LKLQERVLNN VVIHLLGDED PRVRHVAAAS LIRLVPKLFY KCDQGQADPV VAVARDQSSV
970 980 990 1000 1010 1020
YLKLLMHETQ PPSHFSVSTI TRIYRGYNLL PSITDVTMEN NLSRVIAAVS HELITSTTRA
1030 1040 1050 1060 1070 1080
LTFGCCEALC LLSTAFPVCI WSLGWHCGVP PLSASDESRK SCTVGMATMI LTLLSSAWFP
1090 1100 1110 1120 1130 1140
LDLSAHQDAL ILAGNLLAAS APKSLRSSWA SEEEANPAAT KQEEVWPALG DRALVPMVEQ
1150 1160 1170 1180 1190 1200
LFSHLLKVIN ICAHVLDDVA PGPAIKAALP SLTNPPSLSP IRRKGKEKEP GEQASVPLSP
1210 1220 1230 1240 1250 1260
KKGSEASAAS RQSDTSGPVT TSKSSSLGSF YHLPSYLKLH DVLKATHANY KVTLDLQNST
1270 1280 1290 1300 1310 1320
EKFGGFLRSA LDVLSQILEL ATLQDIGKCV EEILGYLKSC FSREPMMATV CVQQLLKTLF
1330 1340 1350 1360 1370 1380
GTNLASQFDG LSSNPSKSQG RAQRLGSSSV RPGLYHYCFM APYTHFTQAL ADASLRNMVQ
1390 1400 1410 1420 1430 1440
AEQENDTSGW FDVLQKVSTQ LKTNLTSVTK NRADKNAIHN HIRLFEPLVI KALKQYTTTT
1450 1460 1470 1480 1490 1500
CVQLQKQVLD LLAQLVQLRV NYCLLDSDQV FIGFVLKQFE YIEVGQFRES EAIIPNIFFF
1510 1520 1530 1540 1550 1560
LVLLSYERYH SKQIIGIPKI IQLCDGIMAS GRKAVTHAIP ALQPIVHDLF VLRGTNKADA
1570 1580 1590 1600 1610 1620
GKELETQKEV VVSMLLRLIQ YHQVLEMFIL VLQQCHKENE DKWKRLSRQI ADIILPMLAK
1630 1640 1650 1660 1670 1680
QQMHIDSHEA LGVLNTLFEI LAPSSLRPVD MLLRSMFVTP NTMASVSTVQ LWISGILAIL
1690 1700 1710 1720 1730 1740
RVLISQSTED IVLSRIQELS FSPYLISCTV INRLRDGDST STLEEHSEGK QIKNLPEETF
1750 1760 1770 1780 1790 1800
SRFLLQLVGI LLEDIVTKQL KVEMSEQQHT FYCQELGTLL MCLIHIFKSG MFRRITAAAT
1810 1820 1830 1840 1850 1860
RLFRSDGCGG SFYTLDSLNL RARSMITTHP ALVLLWCQIL LLVNHTDYRW WAEVQQTPKR
1870 1880 1890 1900 1910 1920
HSLSSTKLLS PQMSGEEEDS DLAAKLGMCN REIVRRGALI LFCDYVCQNL HDSEHLTWLI
1930 1940 1950 1960 1970 1980
VNHIQDLISL SHEPPVQDFI SAVHRNSAAS GLFIQAIQSR CENLSTPTML KKTLQCLEGI
1990 2000 2010 2020 2030 2040
HLSQSGAVLT LYVDRLLCTP FRVLARMVDI LACRRVEMLL AANLQSSMAQ LPMEELNRIQ
2050 2060 2070 2080 2090 2100
EYLQSSGLAQ RHQRLYSLLD RFRLSTMQDS LSPSPPVSSH PLDGDGHVSL ETVSPDKDWY
2110 2120 2130 2140 2150 2160
VHLVKSQCWT RSDSALLEGA ELVNRIPAED MNAFMMNSEF NLSLLAPCLS LGMSEISGGQ
2170 2180 2190 2200 2210 2220
KSALFEAARE VTLARVSGTV QQLPAVHHVF QPELPAEPAA YWSKLNDLFG DAALYQSLPT
2230 2240 2250 2260 2270 2280
LARALAQYLV VVSKLPSHLH LPPEKEKDIV KFVVATLEAL SWHLIHEQIP LSLDLQAGLD
2290 2300 2310 2320 2330 2340
CCCLALQLPG LWSVVSSTEF VTHACSLIYC VHFILEAVAV QPGEQLLSPE RRTNTPKAIS
2350 2360 2370 2380 2390 2400
EEEEEVDPNT QNPKYITAAC EMVAEMVESL QSVLALGHKR NSGVPAFLTP LLRNIIISLA
2410 2420 2430 2440 2450 2460
RLPLVNSYTR VPPLVWKLGW SPKPGGDFGT AFPEIPVEFL QEKEVFKEFI YRINTLGWTS
2470 2480 2490 2500 2510 2520
RTQFEETWAT LLGVLVTQPL VMEQEESPPE EDTERTQINV LAVQAITSLV LSAMTVPVAG
2530 2540 2550 2560 2570 2580
NPAVSCLEQQ PRNKPLKALD TRFGRKLSII RGIVEQEIQA MVSKRENIAT HHLYQAWDPV
2590 2600 2610 2620 2630 2640
PSLSPATTGA LISHEKLLLQ INPERELGSM SYKLGQVSIH SVWLGNSITP LREEEWDEEE
2650 2660 2670 2680 2690 2700
EEEADAPAPS SPPTSPVNSR KHRAGVDIHS CSQFLLELYS RWILPSSSAR RTPAILISEV
2710 2720 2730 2740 2750 2760
VRSLLVVSDL FTERNQFELM YVTLTELRRV HPSEDEILAQ YLVPATCKAA AVLGMDKAVA
2770 2780 2790 2800 2810 2820
EPVSRLLEST LRSSHLPSRV GALHGVLYVL ECDLLDDTAK QLIPVISDYL LSNLKGIAHC
2830 2840 2850 2860 2870 2880
VNIHSQQHVL VMCATAFYLI ENYPLDVGPE FSASIIQMCG VMLSGSEEST PSIIYHCALR
2890 2900 2910 2920 2930 2940
GLERLLLSEQ LSRLDAESLV KLSVDRVNVH SPHRAMAALG LMLTCMYTGK EKVSPGRTSD
2950 2960 2970 2980 2990 3000
PNPAAPDSES VIVAMERVSV LFDRIRKGFP CEARVVARIL PQFLDDFFPP QDIMNKVIGE
3010 3020 3030 3040 3050 3060
FLSNQQPYPQ FMATVVYKVF QTLHSTGQSS MVRDWVMLSL SNFTQRAPVA MATWSLSCFF
3070 3080 3090 3100 3110 3120
VSASTSPWVA AILPHVISRM GKLEQVDVNL FCLVATDFYR HQIEEELDRR AFQSVLEVVA
3130 3140
APGSPYHRLL TCLRNVHKVT TC
Isoforms
Sequence View
Protein Neighborhood
Domains & Features
8 N-termini - 9 C-termini - 11 Cleavages - 0 Substrates
N-termini
Name | Sequence | Position | Modification | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMID) |
---|---|---|---|---|---|---|---|---|---|
P42858-1-unknown | MATLEK... | 1 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot | |||
P42858-403-unknown | QLTAAK... | 403 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC19946 | |||
P42858-468-unknown | ASVKDE... | 468 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC771 | |||
P42858-468-unknown | ASVKDE... | 468 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC1030 | |||
P42858-512-unknown | LASCDL... | 512 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC1829 | |||
P42858-529-unknown | ILSHSS... | 529 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC1830 | |||
P42858-535-unknown | SQVSAV... | 535 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC772 | |||
P42858-535-unknown | SQVSAV... | 535 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC1031 | |||
P42858-551-unknown | GTQASS... | 551 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC1831 | |||
P42858-551-unknown | GTQASS... | 551 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC1363 | |||
P42858-551-unknown | GTQASS... | 551 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot | |||
P42858-585-unknown | GTDNQY... | 585 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC1832 | |||
P42858-585-unknown | GTDNQY... | 585 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC2556 |
C-termini
Name | Sequence | Position | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|
...VGGIGQ | 402 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC19946 | |||
...SSALTA | 467 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC771 | |||
...SSALTA | 467 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC1030 | |||
...ADSVDL | 511 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC1829 | |||
...GDEEDI | 528 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC1830 | |||
...LSHSSS | 534 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC772 | |||
...LSHSSS | 534 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC1031 | |||
...MDLNDG | 550 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC1831 | |||
...MDLNDG | 550 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC1363 | |||
...EIVLDG | 584 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC1832 | |||
...EIVLDG | 584 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC2556 | |||
...EIVLDG | 584 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot | |||
...NDTSGW | 1389 | inferred from electronic annotation | unknown | Ensembl | inferred from ensembl protein ENSP00000425743 | |||
...KVTTC | 3142 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot |
Cleavages
Substrates
Substrate | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|