TopFIND 4.0

Q91VF6: Collagen alpha-1(XXVI) chain

General Information

Protein names
- Collagen alpha-1(XXVI) chain
- Alpha-1 type XXVI collagen
- EMI domain-containing protein 2
- Emilin and multimerin domain-containing protein 2
- Emu2

Gene names: Col26a1
Organism: Mus musculus
Protease Family:
Protease ID:
Chromosome location:
UniProt ID: Q91VF6

2 N-termini - 1 C-termini - 0 Cleavages - 0 Substrates

Sequence

        10         20         30         40         50         60 
MKLVLLLPWA CCCLCGSALA TGFLYPFPAA ALQQHGYPEQ GAGSPGNGYS SRRHWCHHTV 
        70         80         90        100        110        120 
TRTVSCQVQN GSETVVQRVY QSCRWPGPCA NLVSYRTLIR PTYRVSYRTV TALEWRCCPG 
       130        140        150        160        170        180 
FTGSNCEEEC MNCTRLSDMS ERLTTLEAKV LLLEAAEQPS GPDNDLPPPQ STPPTWNEDF 
       190        200        210        220        230        240 
LPDAIPIAHP GPRRRRPTGP AGPPGQMGPP GPAGPPGSKG EQGQTGEKGP VGPPGLLGPP 
       250        260        270        280        290        300 
GPRGLPGEMG RPGPPGPPGP AGSPGLLPNT PQGVLYSLQT PTDKENGDSQ LNPAVVDTVL 
       310        320        330        340        350        360 
TGIPGPRGPP GPPGPPGPHG PPGPPGAPGS QGLVDERVVA RPSGEPSVKE EEDKASAAEG 
       370        380        390        400        410        420 
EGVQQLREAL KILAERVLIL EHMIGVHDPL ASPEGGSGQD AALRANLKMK RGGPRPDGIL 
       430        440    
AALLGPDPAQ KSADQAGDRK 
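The listing above alternates ruler lines (position numbers) with residue lines grouped in blocks of ten. As a minimal sketch, assuming that layout holds throughout, the following Python turns such a listing into one contiguous string so that 1-based positions can be looked up directly. The function name `parse_numbered_sequence` is hypothetical and is not part of TopFIND or UniProt.

```python
import re


def parse_numbered_sequence(block: str) -> str:
    """Join the residue groups of a ruler-annotated sequence listing."""
    residues = []
    for line in block.splitlines():
        stripped = line.strip()
        # Ruler lines contain only digits and spaces; skip them and blank lines.
        if not stripped or re.fullmatch(r"[\d\s]+", stripped):
            continue
        # Residue lines: drop the spaces between the ten-residue groups.
        residues.append(stripped.replace(" ", ""))
    return "".join(residues)


# First two rows of the listing above (residues 1-120).
excerpt = """
        10         20         30         40         50         60
MKLVLLLPWA CCCLCGSALA TGFLYPFPAA ALQQHGYPEQ GAGSPGNGYS SRRHWCHHTV
        70         80         90        100        110        120
TRTVSCQVQN GSETVVQRVY QSCRWPGPCA NLVSYRTLIR PTYRVSYRTV TALEWRCCPG
"""

sequence = parse_numbered_sequence(excerpt)
print(len(sequence))     # 120
print(sequence[20:26])   # residues 21-26 (1-based): TGFLYP
```

Python slices are 0-based, so the residue at 1-based position `p` is `sequence[p - 1]`.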

Isoforms

- Isoform 2 of Collagen alpha-1(XXVI) chain





Protein Neighborhood

Domains & Features

2 N-termini - 1 C-termini - 0 Cleavages - 0 Substrates

N-termini

    Name | Sequence | Position | Modification | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs)
    Q91VF6-1-unknown | MKLVLL... | 1 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt71438 |
    Q91VF6-21-unknown | TGFLYP... | 21 | | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot |
    Q91VF6-21-unknown | TGFLYP... | 21 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt112601 |

C-termini

    Name | Sequence | Position | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs)
     | ...AGDRK | 440 | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot |
     | ...AGDRK | 440 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt67056 |
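The terminus positions listed above can be cross-checked against the sequence. The sketch below assumes positions are 1-based, that an N-terminus position marks the first residue of the shown fragment, and that a C-terminus position marks the last residue. The helper names `n_terminus_matches` and `c_terminus_matches` are hypothetical; the sequence rows are copied from the Sequence section of this entry.

```python
# Sequence rows copied from the entry; whitespace is removed by split/join.
SEQUENCE = "".join("""
MKLVLLLPWA CCCLCGSALA TGFLYPFPAA ALQQHGYPEQ GAGSPGNGYS SRRHWCHHTV
TRTVSCQVQN GSETVVQRVY QSCRWPGPCA NLVSYRTLIR PTYRVSYRTV TALEWRCCPG
FTGSNCEEEC MNCTRLSDMS ERLTTLEAKV LLLEAAEQPS GPDNDLPPPQ STPPTWNEDF
LPDAIPIAHP GPRRRRPTGP AGPPGQMGPP GPAGPPGSKG EQGQTGEKGP VGPPGLLGPP
GPRGLPGEMG RPGPPGPPGP AGSPGLLPNT PQGVLYSLQT PTDKENGDSQ LNPAVVDTVL
TGIPGPRGPP GPPGPPGPHG PPGPPGAPGS QGLVDERVVA RPSGEPSVKE EEDKASAAEG
EGVQQLREAL KILAERVLIL EHMIGVHDPL ASPEGGSGQD AALRANLKMK RGGPRPDGIL
AALLGPDPAQ KSADQAGDRK
""".split())


def n_terminus_matches(seq: str, position: int, prefix: str) -> bool:
    """True if the residues starting at 1-based `position` begin with `prefix`."""
    return seq.startswith(prefix, position - 1)


def c_terminus_matches(seq: str, position: int, suffix: str) -> bool:
    """True if the residues ending at 1-based `position` end with `suffix`."""
    return seq[:position].endswith(suffix)


# (position, fragment) pairs taken from the N-termini and C-termini tables.
n_termini = [(1, "MKLVLL"), (21, "TGFLYP")]
c_termini = [(440, "AGDRK")]

for pos, frag in n_termini:
    print("N-terminus", pos, frag, n_terminus_matches(SEQUENCE, pos, frag))
for pos, frag in c_termini:
    print("C-terminus", pos, frag, c_terminus_matches(SEQUENCE, pos, frag))
```

With the sequence as listed, all three checks print `True`, and position 440 coincides with the last residue of the chain.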

Cleavages

    Protease | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs)

Substrates

    Substrate | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs)