P39060: Collagen alpha-1(XVIII) chain
Protein names | - Collagen alpha-1(XVIII) chain - Endostatin - Non-collagenous domain 1 - NC1 |
---|---|
Gene names | COL18A1 |
Organism | Homo sapiens |
Protease Family | |
Protease ID | |
Chromosome location | |
UniProt ID | P39060 |
13
N-termini
9
C-termini
13
Cleavages
0
Substrates
Sequence
10 20 30 40 50 60
MAPYPCGCHI LLLLFCCLAA ARANLLNLNW LWFNNEDTSH AATTIPEPQG PLPVQPTADT
70 80 90 100 110 120
TTHVTPRNGS TEPATAPGSP EPPSELLEDG QDTPTSAESP DAPEENIAGV GAEILNVAKG
130 140 150 160 170 180
IRSFVQLWND TVPTESLARA ETLVLETPVG PLALAGPSST PQENGTTLWP SRGIPSSPGA
190 200 210 220 230 240
HTTEAGTLPA PTPSPPSLGR PWAPLTGPSV PPPSSGRASL SSLLGGAPPW GSLQDPDSQG
250 260 270 280 290 300
LSPAAAAPSQ QLQRPDVRLR TPLLHPLVMG SLGKHAAPSA FSSGLPGALS QVAVTTLTRD
310 320 330 340 350 360
SGAWVSHVAN SVGPGLANNS ALLGADPEAP AGRCLPLPPS LPVCGHLGIS RFWLPNHLHH
370 380 390 400 410 420
ESGEQVRAGA RAWGGLLQTH CHPFLAWFFC LLLVPPCGSV PPPAPPPCCQ FCEALQDACW
430 440 450 460 470 480
SRLGGGRLPV ACASLPTQED GYCVLIGPAA ERISEEVGLL QLLGDPPPQQ VTQTDDPDVG
490 500 510 520 530 540
LAYVFGPDAN SGQVARYHFP SLFFRDFSLL FHIRPATEGP GVLFAITDSA QAMVLLGVKL
550 560 570 580 590 600
SGVQDGHQDI SLLYTEPGAG QTHTAASFRL PAFVGQWTHL ALSVAGGFVA LYVDCEEFQR
610 620 630 640 650 660
MPLARSSRGL ELEPGAGLFV AQAGGADPDK FQGVIAELKV RRDPQVSPMH CLDEEGDDSD
670 680 690 700 710 720
GASGDSGSGL GDARELLREE TGAALKPRLP APPPVTTPPL AGGSSTEDSR SEEVEEQTTV
730 740 750 760 770 780
ASLGAQTLPG SDSVSTWDGS VRTPGGRVKE GGLKGQKGEP GVPGPPGRAG PPGSPCLPGP
790 800 810 820 830 840
PGLPCPVSPL GPAGPALQTV PGPQGPPGPP GRDGTPGRDG EPGDPGEDGK PGDTGPQGFP
850 860 870 880 890 900
GTPGDVGPKG DKGDPGVGER GPPGPQGPPG PPGPSFRHDK LTFIDMEGSG FGGDLEALRG
910 920 930 940 950 960
PRGFPGPPGP PGVPGLPGEP GRFGVNSSDV PGPAGLPGVP GREGPPGFPG LPGPPGPPGR
970 980 990 1000 1010 1020
EGPPGRTGQK GSLGEAGAPG HKGSKGAPGP AGARGESGLA GAPGPAGPPG PPGPPGPPGP
1030 1040 1050 1060 1070 1080
GLPAGFDDME GSGGPFWSTA RSADGPQGPP GLPGLKGDPG VPGLPGAKGE VGADGVPGFP
1090 1100 1110 1120 1130 1140
GLPGREGIAG PQGPKGDRGS RGEKGDPGKD GVGQPGLPGP PGPPGPVVYV SEQDGSVLSV
1150 1160 1170 1180 1190 1200
PGPEGRPGFA GFPGPAGPKG NLGSKGERGS PGPKGEKGEP GSIFSPDGGA LGPAQKGAKG
1210 1220 1230 1240 1250 1260
EPGFRGPPGP YGRPGYKGEI GFPGRPGRPG MNGLKGEKGE PGDASLGFGM RGMPGPPGPP
1270 1280 1290 1300 1310 1320
GPPGPPGTPV YDSNVFAESS RPGPPGLPGN QGPPGPKGAK GEVGPPGPPG QFPFDFLQLE
1330 1340 1350 1360 1370 1380
AEMKGEKGDR GDAGQKGERG EPGGGGFFGS SLPGPPGPPG PPGPRGYPGI PGPKGESIRG
1390 1400 1410 1420 1430 1440
QPGPPGPQGP PGIGYEGRQG PPGPPGPPGP PSFPGPHRQT ISVPGPPGPP GPPGPPGTMG
1450 1460 1470 1480 1490 1500
ASSGVRLWAT RQAMLGQVHE VPEGWLIFVA EQEELYVRVQ NGFRKVQLEA RTPLPRGTDN
1510 1520 1530 1540 1550 1560
EVAALQPPVV QLHDSNPYPR REHPHPTARP WRADDILASP PRLPEPQPYP GAPHHSSYVH
1570 1580 1590 1600 1610 1620
LRPARPTSPP AHSHRDFQPV LHLVALNSPL SGGMRGIRGA DFQCFQQARA VGLAGTFRAF
1630 1640 1650 1660 1670 1680
LSSRLQDLYS IVRRADRAAV PIVNLKDELL FPSWEALFSG SEGPLKPGAR IFSFDGKDVL
1690 1700 1710 1720 1730 1740
RHPTWPQKSV WHGSDPNGRR LTESYCETWR TEAPSATGQA SSLLGGRLLG QSAASCHHAY
1750
IVLCIENSFM TASK
Isoforms
- Isoform 2 of Collagen alpha-1(XVIII) chain - Isoform 3 of Collagen alpha-1(XVIII) chainSequence View
10 20 30 40 50 60
MAPYPCGCHI LLLLFCCLAA ARANLLNLNW LWFNNEDTSH AATTIPEPQG PLPVQPTADT
70 80 90 100 110 120
TTHVTPRNGS TEPATAPGSP EPPSELLEDG QDTPTSAESP DAPEENIAGV GAEILNVAKG
130 140 150 160 170 180
IRSFVQLWND TVPTESLARA ETLVLETPVG PLALAGPSST PQENGTTLWP SRGIPSSPGA
190 200 210 220 230 240
HTTEAGTLPA PTPSPPSLGR PWAPLTGPSV PPPSSGRASL SSLLGGAPPW GSLQDPDSQG
250 260 270 280 290 300
LSPAAAAPSQ QLQRPDVRLR TPLLHPLVMG SLGKHAAPSA FSSGLPGALS QVAVTTLTRD
310 320 330 340 350 360
SGAWVSHVAN SVGPGLANNS ALLGADPEAP AGRCLPLPPS LPVCGHLGIS RFWLPNHLHH
370 380 390 400 410 420
ESGEQVRAGA RAWGGLLQTH CHPFLAWFFC LLLVPPCGSV PPPAPPPCCQ FCEALQDACW
430 440 450 460 470 480
SRLGGGRLPV ACASLPTQED GYCVLIGPAA ERISEEVGLL QLLGDPPPQQ VTQTDDPDVG
490 500 510 520 530 540
LAYVFGPDAN SGQVARYHFP SLFFRDFSLL FHIRPATEGP GVLFAITDSA QAMVLLGVKL
550 560 570 580 590 600
SGVQDGHQDI SLLYTEPGAG QTHTAASFRL PAFVGQWTHL ALSVAGGFVA LYVDCEEFQR
610 620 630 640 650 660
MPLARSSRGL ELEPGAGLFV AQAGGADPDK FQGVIAELKV RRDPQVSPMH CLDEEGDDSD
670 680 690 700 710 720
GASGDSGSGL GDARELLREE TGAALKPRLP APPPVTTPPL AGGSSTEDSR SEEVEEQTTV
730 740 750 760 770 780
ASLGAQTLPG SDSVSTWDGS VRTPGGRVKE GGLKGQKGEP GVPGPPGRAG PPGSPCLPGP
790 800 810 820 830 840
PGLPCPVSPL GPAGPALQTV PGPQGPPGPP GRDGTPGRDG EPGDPGEDGK PGDTGPQGFP
850 860 870 880 890 900
GTPGDVGPKG DKGDPGVGER GPPGPQGPPG PPGPSFRHDK LTFIDMEGSG FGGDLEALRG
910 920 930 940 950 960
PRGFPGPPGP PGVPGLPGEP GRFGVNSSDV PGPAGLPGVP GREGPPGFPG LPGPPGPPGR
970 980 990 1000 1010 1020
EGPPGRTGQK GSLGEAGAPG HKGSKGAPGP AGARGESGLA GAPGPAGPPG PPGPPGPPGP
1030 1040 1050 1060 1070 1080
GLPAGFDDME GSGGPFWSTA RSADGPQGPP GLPGLKGDPG VPGLPGAKGE VGADGVPGFP
1090 1100 1110 1120 1130 1140
GLPGREGIAG PQGPKGDRGS RGEKGDPGKD GVGQPGLPGP PGPPGPVVYV SEQDGSVLSV
1150 1160 1170 1180 1190 1200
PGPEGRPGFA GFPGPAGPKG NLGSKGERGS PGPKGEKGEP GSIFSPDGGA LGPAQKGAKG
1210 1220 1230 1240 1250 1260
EPGFRGPPGP YGRPGYKGEI GFPGRPGRPG MNGLKGEKGE PGDASLGFGM RGMPGPPGPP
1270 1280 1290 1300 1310 1320
GPPGPPGTPV YDSNVFAESS RPGPPGLPGN QGPPGPKGAK GEVGPPGPPG QFPFDFLQLE
1330 1340 1350 1360 1370 1380
AEMKGEKGDR GDAGQKGERG EPGGGGFFGS SLPGPPGPPG PPGPRGYPGI PGPKGESIRG
1390 1400 1410 1420 1430 1440
QPGPPGPQGP PGIGYEGRQG PPGPPGPPGP PSFPGPHRQT ISVPGPPGPP GPPGPPGTMG
1450 1460 1470 1480 1490 1500
ASSGVRLWAT RQAMLGQVHE VPEGWLIFVA EQEELYVRVQ NGFRKVQLEA RTPLPRGTDN
1510 1520 1530 1540 1550 1560
EVAALQPPVV QLHDSNPYPR REHPHPTARP WRADDILASP PRLPEPQPYP GAPHHSSYVH
1570 1580 1590 1600 1610 1620
LRPARPTSPP AHSHRDFQPV LHLVALNSPL SGGMRGIRGA DFQCFQQARA VGLAGTFRAF
1630 1640 1650 1660 1670 1680
LSSRLQDLYS IVRRADRAAV PIVNLKDELL FPSWEALFSG SEGPLKPGAR IFSFDGKDVL
1690 1700 1710 1720 1730 1740
RHPTWPQKSV WHGSDPNGRR LTESYCETWR TEAPSATGQA SSLLGGRLLG QSAASCHHAY
1750
IVLCIENSFM TASK
10 20 30 40 50 60
MAPYPCGCHI LLLLFCCLAA ARANLLNLNW LWFNNEDTSH AATTIPEPQG PLPVQPTADT
70 80 90 100 110 120
TTHVTPRNGS TEPATAPGSP EPPSELLEDG QDTPTSAESP DAPEENIAGV GAEILNVAKG
130 140 150 160 170 180
IRSFVQLWND TVPTESLARA ETLVLETPVG PLALAGPSST PQENGTTLWP SRGIPSSPGA
190 200 210 220 230 240
HTTEAGTLPA PTPSPPSLGR PWAPLTGPSV PPPSSGRASL SSLLGGAPPW GSLQDPDSQG
250 260 270 280 290 300
LSPAAAAPSQ QLQRPDVRLR TPLLHPLVMG SLGKHAAPSA FSSGLPGALS QVAVTTLTRD
310 320 330 340 350 360
SGAWVSHVAN SVGPGLANNS ALLGADPEAP AGRCLPLPPS LPVCGHLGIS RFWLPNHLHH
370 380 390 400 410 420
ESGEQVRAGA RAWGGLLQTH CHPFLAWFFC LLLVPPCGSV PPPAPPPCCQ FCEALQDACW
430 440 450 460 470 480
SRLGGGRLPV ACASLPTQED GYCVLIGPAA ERISEEVGLL QLLGDPPPQQ VTQTDDPDVG
490 500 510 520 530 540
LAYVFGPDAN SGQVARYHFP SLFFRDFSLL FHIRPATEGP GVLFAITDSA QAMVLLGVKL
550 560 570 580 590 600
SGVQDGHQDI SLLYTEPGAG QTHTAASFRL PAFVGQWTHL ALSVAGGFVA LYVDCEEFQR
610 620 630 640 650 660
MPLARSSRGL ELEPGAGLFV AQAGGADPDK FQGVIAELKV RRDPQVSPMH CLDEEGDDSD
670 680 690 700 710 720
GASGDSGSGL GDARELLREE TGAALKPRLP APPPVTTPPL AGGSSTEDSR SEEVEEQTTV
730 740 750 760 770 780
ASLGAQTLPG SDSVSTWDGS VRTPGGRVKE GGLKGQKGEP GVPGPPGRAG PPGSPCLPGP
790 800 810 820 830 840
PGLPCPVSPL GPAGPALQTV PGPQGPPGPP GRDGTPGRDG EPGDPGEDGK PGDTGPQGFP
850 860 870 880 890 900
GTPGDVGPKG DKGDPGVGER GPPGPQGPPG PPGPSFRHDK LTFIDMEGSG FGGDLEALRG
910 920 930 940 950 960
PRGFPGPPGP PGVPGLPGEP GRFGVNSSDV PGPAGLPGVP GREGPPGFPG LPGPPGPPGR
970 980 990 1000 1010 1020
EGPPGRTGQK GSLGEAGAPG HKGSKGAPGP AGARGESGLA GAPGPAGPPG PPGPPGPPGP
1030 1040 1050 1060 1070 1080
GLPAGFDDME GSGGPFWSTA RSADGPQGPP GLPGLKGDPG VPGLPGAKGE VGADGVPGFP
1090 1100 1110 1120 1130 1140
GLPGREGIAG PQGPKGDRGS RGEKGDPGKD GVGQPGLPGP PGPPGPVVYV SEQDGSVLSV
1150 1160 1170 1180 1190 1200
PGPEGRPGFA GFPGPAGPKG NLGSKGERGS PGPKGEKGEP GSIFSPDGGA LGPAQKGAKG
1210 1220 1230 1240 1250 1260
EPGFRGPPGP YGRPGYKGEI GFPGRPGRPG MNGLKGEKGE PGDASLGFGM RGMPGPPGPP
1270 1280 1290 1300 1310 1320
GPPGPPGTPV YDSNVFAESS RPGPPGLPGN QGPPGPKGAK GEVGPPGPPG QFPFDFLQLE
1330 1340 1350 1360 1370 1380
AEMKGEKGDR GDAGQKGERG EPGGGGFFGS SLPGPPGPPG PPGPRGYPGI PGPKGESIRG
1390 1400 1410 1420 1430 1440
QPGPPGPQGP PGIGYEGRQG PPGPPGPPGP PSFPGPHRQT ISVPGPPGPP GPPGPPGTMG
1450 1460 1470 1480 1490 1500
ASSGVRLWAT RQAMLGQVHE VPEGWLIFVA EQEELYVRVQ NGFRKVQLEA RTPLPRGTDN
1510 1520 1530 1540 1550 1560
EVAALQPPVV QLHDSNPYPR REHPHPTARP WRADDILASP PRLPEPQPYP GAPHHSSYVH
1570 1580 1590 1600 1610 1620
LRPARPTSPP AHSHRDFQPV LHLVALNSPL SGGMRGIRGA DFQCFQQARA VGLAGTFRAF
1630 1640 1650 1660 1670 1680
LSSRLQDLYS IVRRADRAAV PIVNLKDELL FPSWEALFSG SEGPLKPGAR IFSFDGKDVL
1690 1700 1710 1720 1730 1740
RHPTWPQKSV WHGSDPNGRR LTESYCETWR TEAPSATGQA SSLLGGRLLG QSAASCHHAY
1750
IVLCIENSFM TASK
Protein Neighborhood
Domains & Features
13 N-termini - 9 C-termini - 13 Cleavages - 0 Substrates
N-termini
Name | Sequence | Position | Modification | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMID) |
---|---|---|---|---|---|---|---|---|---|
P39060-24-unknown | NLLNLN... | 24 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot | |||
P39060-679-unknown | EETGAA... | 679 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC19091 | |||
P39060-679-unknown | EETGAA... | 679 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC17632 | |||
P39060-679-unknown | EETGAA... | 679 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt148956 | |||
P39060-725-unknown | AQTLPG... | 725 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt178413 | |||
P39060-725- | AQTLPG... | 725 | Subtiligase Based Positive Selection | Wells | apoptosis_U266_bortezomib_induced | 23264352 | |||
P39060-1512-unknown | LHDSNP... | 1512 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC24764 | |||
P39060-1512-unknown | LHDSNP... | 1512 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt157363 | |||
P39060-1531-unknown | WRADDI... | 1531 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC26435 | |||
P39060-1531-unknown | WRADDI... | 1531 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC21406 | |||
P39060-1531-unknown | WRADDI... | 1531 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt157431 | |||
P39060-1533-unknown | ADDILA... | 1533 | unknown | Wildes D. et al: Sampling the N-terminal proteome of human blood | 20173099 | ||||
P39060-1533-unknown | ADDILA... | 1533 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt180245 | |||
P39060-1536-unknown | ILASPP... | 1536 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC24767 | |||
P39060-1536-unknown | ILASPP... | 1536 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt157448 | |||
P39060-1549-unknown | YPGAPH... | 1549 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC26438 | |||
P39060-1549-unknown | YPGAPH... | 1549 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt157479 | |||
P39060-1556-unknown | SSYVHL... | 1556 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC21409 | |||
P39060-1556-unknown | SSYVHL... | 1556 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt157489 | |||
P39060-1558-unknown | YVHLRP... | 1558 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC21802 | |||
P39060-1558-unknown | YVHLRP... | 1558 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC23929 | |||
P39060-1558-unknown | YVHLRP... | 1558 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC21412 | |||
P39060-1558-unknown | YVHLRP... | 1558 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC24770 | |||
P39060-1558-unknown | YVHLRP... | 1558 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt157492 | |||
P39060-1572-unknown | HSHRDF... | 1572 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot | |||
P39060-1572-unknown | HSHRDF... | 1572 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt116500 | |||
P39060-1573-unknown | SHRDFQ... | 1573 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC23932 | |||
P39060-1573-unknown | SHRDFQ... | 1573 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt157512 |
C-termini
Name | Sequence | Position | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|
...RELLRE | 678 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC19091 | |||
...RELLRE | 678 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC17632 | |||
...RELLRE | 678 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt132337 | |||
...PPVVQL | 1511 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC24764 | |||
...PPVVQL | 1511 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt140862 | |||
...PTARPW | 1530 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC26435 | |||
...PTARPW | 1530 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC21406 | |||
...PTARPW | 1530 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt140929 | |||
...WRADDI | 1535 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC24767 | |||
...WRADDI | 1535 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt140946 | |||
...PEPQPY | 1548 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC26438 | |||
...PEPQPY | 1548 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt140977 | |||
...GAPHHS | 1555 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC21409 | |||
...GAPHHS | 1555 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt140987 | |||
...PHHSSY | 1557 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC21802 | |||
...PHHSSY | 1557 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC23929 | |||
...PHHSSY | 1557 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC21412 | |||
...PHHSSY | 1557 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC24770 | |||
...PHHSSY | 1557 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt140990 | |||
...SPPAHS | 1572 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC23932 | |||
...SPPAHS | 1572 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt141011 | |||
...MTASK | 1754 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot | |||
...MTASK | 1754 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt66997 |
Cleavages
Substrates
Substrate | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|