TopFIND 4.0

Q8BLX7: Collagen alpha-1(XVI) chain

General Information

Protein names
- Collagen alpha-1(XVI) chain

Gene names Col16a1
Organism Mus musculus
Protease Family
Protease ID
Chromosome location
UniProt ID Q8BLX7

2

N-termini

1

C-termini

0

Cleavages

0

Substrates

Sequence

        10         20         30         40         50         60 
MLTSWAPGLW VLGLWATFSH GTNIGERCPT SQQEGLKLEH SSDPSTNVTG FNLIRRLNLM 
        70         80         90        100        110        120 
KTSAIKKIRN PKGPLILRLG AAPVTQPTRR VFPRGLPEEF ALVLTVLLKK HTFRNTWYLF 
       130        140        150        160        170        180 
QVTDANGYPQ ISLEVNSQER SLELRAQGQD GDFVSCIFPV PQLFDLRWHK LMLSVAGRVA 
       190        200        210        220        230        240 
SVHVDCVSAS SQPLGPRQSI RPGGHVFLGL DAEQGKPVSF DLQQAHIYCD PELVLEEGCC 
       250        260        270        280        290        300 
EILPGGCPPE TSKSRRDTQS NELIEINPQT EGKVYTRCFC LEEPQNSKVD AQLMGRNIQK 
       310        320        330        340        350        360 
AERGTKVHQG TGVNECPPCA HSARESNVTL GPSGLKGGKG ERGLTGPSGP KGEKGARGND 
       370        380        390        400        410        420 
CVRVSPDAPL QCVEGPKGEK GESGDLGPPG LPGPTGQKGQ KGEKGDGGLK GLPGKPGRDG 
       430        440        450        460        470        480 
RPGEICVIGP KGQKGDPGFV GPEGLAGEPG PPGLPGPPGI GLPGTPGDPG GPPGPKGEKG 
       490        500        510        520        530        540 
SSGIPGKEGP GGKPGKPGVP GTKGEKGDPC EVCPTLPEGS QNFVGLPGKP GPKGEPGDPA 
       550        560        570        580        590        600 
PAWEGLGTVG LKGDRGDPGI QGMKGEKGEP CSSCSSGVGA QHLGPSPGHG LPGLPGTSGI 
       610        620        630        640        650        660 
PGPRGLKGEK GSFGDTGPAG VPGSPGPVGP AGIKGAKGEP CEPCTALSEL QDGDMRVVHL 
       670        680        690        700        710        720 
PGPAGEKGEP GSPGFGLPGK QGKAGERGLK GQKGDAGNPG DPGTPGITGQ PGISGEPGIR 
       730        740        750        760        770        780 
GPAGPKGEKG DGCTACPSLQ GALTDVSGLP GKPGPKGEPG PEGVGHPGKP GQPGLPGVQG 
       790        800        810        820        830        840 
PPGPKGTQGE PGPPGTGAEG PQGEPGTQGL PGTQGLPGPR GPPGSAGEKG AQGSPGPKGA 
       850        860        870        880        890        900 
IGPMGPPGAG VSGPPGQKGS RGEKGEPGEC SCPSRGEPIF SGMPGAPGLW MGSSSQPGPQ 
       910        920        930        940        950        960 
GPPGVPGPPG PPGMPGLQGV PGHNGLPGQP GLTAELGSLP IEKHLLKSIC GDCAQGQTAH 
       970        980        990       1000       1010       1020 
PAFLLEKGEK GDQGIPGVPG FDNCARCFIE RERPRAEEAR GDNSEGEPGC SGSPGLPGPP 
      1030       1040       1050       1060       1070       1080 
GMPGQRGEEG PPGMRGSPGP PGPIGLQGER GLTGLTGDKG EPGPPGQPGY PGAMGPPGLP 
      1090       1100       1110       1120       1130       1140 
GIKGERGYTG PSGEKGESGP PGSEGLPGPQ GPAGPRGERG PQGSSGEKGD QGFQGQPGFP 
      1150       1160       1170       1180       1190       1200 
GPPGPPGFPG KAGAPGPPGP QAEKGSEGIR GPSGLPGSPG PPGPPGIQGP AGLDGLDGKD 
      1210       1220       1230       1240       1250       1260 
GKPGLRGDPG PAGPPGLMGP PGFKGKTGHP GLPGPKGDCG KPGPPGSSGR PGAEGEPGAM 
      1270       1280       1290       1300       1310       1320 
GPQGRPGPPG HLGPPGQPGP PGLSTVGLKG DRGVPGERGL AGLPGQPGTP GHPGPPGEPG 
      1330       1340       1350       1360       1370       1380 
SDGAAGKEGP PGKQGLYGPP GPKGDPGPAG QKGQAGEKGR SGMPGGPGKS GSMGPIGPPG 
      1390       1400       1410       1420       1430       1440 
PAGERGHPGS PGPAGNPGLP GLPGSMGDMV NYDDIKRFIR QEIIKLFDER MAYYTSRMQF 
      1450       1460       1470       1480       1490       1500 
PMEVAAAPGR PGPPGKDGAP GRPGAPGSPG LPGQIGREGR QGLPGMRGLP GTKGEKGDIG 
      1510       1520       1530       1540       1550       1560 
VGIAGENGLP GPPGPQGPPG YGKMGATGPM GQQGIPGIPG PPGPMGQPGK AGHCNPSDCF 
      1570       1580    
GAMPMEQQYP PMKSMKGPFG 

Isoforms

- Isoform 2 of Collagen alpha-1(XVI) chain

Sequence View

        10         20         30         40         50         60 
MLTSWAPGLW VLGLWATFSH GTNIGERCPT SQQEGLKLEH SSDPSTNVTG FNLIRRLNLM 
        70         80         90        100        110        120 
KTSAIKKIRN PKGPLILRLG AAPVTQPTRR VFPRGLPEEF ALVLTVLLKK HTFRNTWYLF 
       130        140        150        160        170        180 
QVTDANGYPQ ISLEVNSQER SLELRAQGQD GDFVSCIFPV PQLFDLRWHK LMLSVAGRVA 
       190        200        210        220        230        240 
SVHVDCVSAS SQPLGPRQSI RPGGHVFLGL DAEQGKPVSF DLQQAHIYCD PELVLEEGCC 
       250        260        270        280        290        300 
EILPGGCPPE TSKSRRDTQS NELIEINPQT EGKVYTRCFC LEEPQNSKVD AQLMGRNIQK 
       310        320        330        340        350        360 
AERGTKVHQG TGVNECPPCA HSARESNVTL GPSGLKGGKG ERGLTGPSGP KGEKGARGND 
       370        380        390        400        410        420 
CVRVSPDAPL QCVEGPKGEK GESGDLGPPG LPGPTGQKGQ KGEKGDGGLK GLPGKPGRDG 
       430        440        450        460        470        480 
RPGEICVIGP KGQKGDPGFV GPEGLAGEPG PPGLPGPPGI GLPGTPGDPG GPPGPKGEKG 
       490        500        510        520        530        540 
SSGIPGKEGP GGKPGKPGVP GTKGEKGDPC EVCPTLPEGS QNFVGLPGKP GPKGEPGDPA 
       550        560        570        580        590        600 
PAWEGLGTVG LKGDRGDPGI QGMKGEKGEP CSSCSSGVGA QHLGPSPGHG LPGLPGTSGI 
       610        620        630        640        650        660 
PGPRGLKGEK GSFGDTGPAG VPGSPGPVGP AGIKGAKGEP CEPCTALSEL QDGDMRVVHL 
       670        680        690        700        710        720 
PGPAGEKGEP GSPGFGLPGK QGKAGERGLK GQKGDAGNPG DPGTPGITGQ PGISGEPGIR 
       730        740        750        760        770        780 
GPAGPKGEKG DGCTACPSLQ GALTDVSGLP GKPGPKGEPG PEGVGHPGKP GQPGLPGVQG 
       790        800        810        820        830        840 
PPGPKGTQGE PGPPGTGAEG PQGEPGTQGL PGTQGLPGPR GPPGSAGEKG AQGSPGPKGA 
       850        860        870        880        890        900 
IGPMGPPGAG VSGPPGQKGS RGEKGEPGEC SCPSRGEPIF SGMPGAPGLW MGSSSQPGPQ 
       910        920        930        940        950        960 
GPPGVPGPPG PPGMPGLQGV PGHNGLPGQP GLTAELGSLP IEKHLLKSIC GDCAQGQTAH 
       970        980        990       1000       1010       1020 
PAFLLEKGEK GDQGIPGVPG FDNCARCFIE RERPRAEEAR GDNSEGEPGC SGSPGLPGPP 
      1030       1040       1050       1060       1070       1080 
GMPGQRGEEG PPGMRGSPGP PGPIGLQGER GLTGLTGDKG EPGPPGQPGY PGAMGPPGLP 
      1090       1100       1110       1120       1130       1140 
GIKGERGYTG PSGEKGESGP PGSEGLPGPQ GPAGPRGERG PQGSSGEKGD QGFQGQPGFP 
      1150       1160       1170       1180       1190       1200 
GPPGPPGFPG KAGAPGPPGP QAEKGSEGIR GPSGLPGSPG PPGPPGIQGP AGLDGLDGKD 
      1210       1220       1230       1240       1250       1260 
GKPGLRGDPG PAGPPGLMGP PGFKGKTGHP GLPGPKGDCG KPGPPGSSGR PGAEGEPGAM 
      1270       1280       1290       1300       1310       1320 
GPQGRPGPPG HLGPPGQPGP PGLSTVGLKG DRGVPGERGL AGLPGQPGTP GHPGPPGEPG 
      1330       1340       1350       1360       1370       1380 
SDGAAGKEGP PGKQGLYGPP GPKGDPGPAG QKGQAGEKGR SGMPGGPGKS GSMGPIGPPG 
      1390       1400       1410       1420       1430       1440 
PAGERGHPGS PGPAGNPGLP GLPGSMGDMV NYDDIKRFIR QEIIKLFDER MAYYTSRMQF 
      1450       1460       1470       1480       1490       1500 
PMEVAAAPGR PGPPGKDGAP GRPGAPGSPG LPGQIGREGR QGLPGMRGLP GTKGEKGDIG 
      1510       1520       1530       1540       1550       1560 
VGIAGENGLP GPPGPQGPPG YGKMGATGPM GQQGIPGIPG PPGPMGQPGK AGHCNPSDCF 
      1570       1580    
GAMPMEQQYP PMKSMKGPFG 



Filter Information:


(REFRESH)

Directness:


Physiological Relevance:


Evidence Codes:


Methodology:


Perturbation of System:


Biological System:


Protease Assignment Confidence:


Evidence Names:


Database:


Lab:



Protein Neighborhood

Domains & Features

2 N-termini - 1 C-termini - 0 Cleavages - 0 Substrates

N-termini

    Name Sequence Position Modification Evidence type Method Source (database) Source (Lab) Evidence name Publications (PMID)
    Q8BLX7-22-unknown TNIGER... 22 inferred from electronic annotation electronic annotation UniProtKB inferred from uniprot
    Q8BLX7-1431-unknown MAYYTS... 1431 inferred from isoform by sequence similarity unknown TopFIND inferred from TNt71375

C-termini

    Name Sequence Position Evidence type Method Source (database) Source (Lab) Evidence name Publications (PMIDs)
    ...KGPFG 1580 inferred from electronic annotation electronic annotation UniProtKB inferred from uniprot
    ...KGPFG 1580 inferred from isoform by sequence similarity unknown TopFIND inferred from TCt66993

Cleavages

    Protease Position Sequence Evidence type Method Source (database) Source (Lab) Evidence name Publications (PMIDs)

Substrates

    Substrate Position Sequence Evidence type Method Source (database) Source (Lab) Evidence name Publications (PMIDs)