TopFIND 4.0

P08572: Collagen alpha-2(IV) chain

General Information

Protein names
- Collagen alpha-2(IV) chain
- Canstatin

Gene names COL4A2
Organism Homo sapiens
Protease Family
Protease ID
Chromosome location
UniProt ID P08572

4

N-termini

4

C-termini

2

Cleavages

0

Substrates

Sequence

        10         20         30         40         50         60 
MGRDQRAVAG PALRRWLLLG TVTVGFLAQS VLAGVKKFDV PCGGRDCSGG CQCYPEKGGR 
        70         80         90        100        110        120 
GQPGPVGPQG YNGPPGLQGF PGLQGRKGDK GERGAPGVTG PKGDVGARGV SGFPGADGIP 
       130        140        150        160        170        180 
GHPGQGGPRG RPGYDGCNGT QGDSGPQGPP GSEGFTGPPG PQGPKGQKGE PYALPKEERD 
       190        200        210        220        230        240 
RYRGEPGEPG LVGFQGPPGR PGHVGQMGPV GAPGRPGPPG PPGPKGQQGN RGLGFYGVKG 
       250        260        270        280        290        300 
EKGDVGQPGP NGIPSDTLHP IIAPTGVTFH PDQYKGEKGS EGEPGIRGIS LKGEEGIMGF 
       310        320        330        340        350        360 
PGLRGYPGLS GEKGSPGQKG SRGLDGYQGP DGPRGPKGEA GDPGPPGLPA YSPHPSLAKG 
       370        380        390        400        410        420 
ARGDPGFPGA QGEPGSQGEP GDPGLPGPPG LSIGDGDQRR GLPGEMGPKG FIGDPGIPAL 
       430        440        450        460        470        480 
YGGPPGPDGK RGPPGPPGLP GPPGPDGFLF GLKGAKGRAG FPGLPGSPGA RGPKGWKGDA 
       490        500        510        520        530        540 
GECRCTEGDE AIKGLPGLPG PKGFAGINGE PGRKGDRGDP GQHGLPGFPG LKGVPGNIGA 
       550        560        570        580        590        600 
PGPKGAKGDS RTITTKGERG QPGVPGVPGM KGDDGSPGRD GLDGFPGLPG PPGDGIKGPP 
       610        620        630        640        650        660 
GDPGYPGIPG TKGTPGEMGP PGLGLPGLKG QRGFPGDAGL PGPPGFLGPP GPAGTPGQID 
       670        680        690        700        710        720 
CDTDVKRAVG GDRQEAIQPG CIGGPKGLPG LPGPPGPTGA KGLRGIPGFA GADGGPGPRG 
       730        740        750        760        770        780 
LPGDAGREGF PGPPGFIGPR GSKGAVGLPG PDGSPGPIGL PGPDGPPGER GLPGEVLGAQ 
       790        800        810        820        830        840 
PGPRGDAGVP GQPGLKGLPG DRGPPGFRGS QGMPGMPGLK GQPGLPGPSG QPGLYGPPGL 
       850        860        870        880        890        900 
HGFPGAPGQE GPLGLPGIPG REGLPGDRGD PGDTGAPGPV GMKGLSGDRG DAGFTGEQGH 
       910        920        930        940        950        960 
PGSPGFKGID GMPGTPGLKG DRGSPGMDGF QGMPGLKGRP GFPGSKGEAG FFGIPGLKGL 
       970        980        990       1000       1010       1020 
AGEPGFKGSR GDPGPPGPPP VILPGMKDIK GEKGDEGPMG LKGYLGAKGI QGMPGIPGLS 
      1030       1040       1050       1060       1070       1080 
GIPGLPGRPG HIKGVKGDIG VPGIPGLPGF PGVAGPPGIT GFPGFIGSRG DKGAPGRAGL 
      1090       1100       1110       1120       1130       1140 
YGEIGATGDF GDIGDTINLP GRPGLKGERG TTGIPGLKGF FGEKGTEGDI GFPGITGVTG 
      1150       1160       1170       1180       1190       1200 
VQGPPGLKGQ TGFPGLTGPP GSQGELGRIG LPGGKGDDGW PGAPGLPGFP GLRGIRGLHG 
      1210       1220       1230       1240       1250       1260 
LPGTKGFPGS PGSDIHGDPG FPGPPGERGD PGEANTLPGP VGVPGQKGDQ GAPGERGPPG 
      1270       1280       1290       1300       1310       1320 
SPGLQGFPGI TPPSNISGAP GDKGAPGIFG LKGYRGPPGP PGSAALPGSK GDTGNPGAPG 
      1330       1340       1350       1360       1370       1380 
TPGTKGWAGD SGPQGRPGVF GLPGEKGPRG EQGFMGNTGP TGAVGDRGPK GPKGDPGFPG 
      1390       1400       1410       1420       1430       1440 
APGTVGAPGI AGIPQKIAVQ PGTVGPQGRR GPPGAPGEMG PQGPPGEPGF RGAPGKAGPQ 
      1450       1460       1470       1480       1490       1500 
GRGGVSAVPG FRGDEGPIGH QGPIGQEGAP GRPGSPGLPG MPGRSVSIGY LLVKHSQTDQ 
      1510       1520       1530       1540       1550       1560 
EPMCPVGMNK LWSGYSLLYF EGQEKAHNQD LGLAGSCLAR FSTMPFLYCN PGDVCYYASR 
      1570       1580       1590       1600       1610       1620 
NDKSYWLSTT APLPMMPVAE DEIKPYISRC SVCEAPAIAI AVHSQDVSIP HCPAGWRSLW 
      1630       1640       1650       1660       1670       1680 
IGYSFLMHTA AGDEGGGQSL VSPGSCLEDF RATPFIECNG GRGTCHYYAN KYSFWLTTIP 
      1690       1700       1710    
EQSFQGSPSA DTLKAGLIRT HISRCQVCMK NL

Isoforms



Sequence View



Filter Information:


(REFRESH)

Directness:


Physiological Relevance:


Evidence Codes:


Methodology:


Perturbation of System:


Biological System:


Protease Assignment Confidence:


Evidence Names:


Database:


Lab:



Protein Neighborhood

Domains & Features

4 N-termini - 4 C-termini - 2 Cleavages - 0 Substrates

N-termini

C-termini

Cleavages

Substrates

    Substrate Position Sequence Evidence type Method Source (database) Source (Lab) Evidence name Publications (PMIDs)