TopFIND 4.0

Q96A83: Collagen alpha-1(XXVI) chain

General Information

Protein names
- Collagen alpha-1(XXVI) chain
- Alpha-1 type XXVI collagen
- EMI domain-containing protein 2
- Emilin and multimerin domain-containing protein 2
- Emu2

Gene names COL26A1
Organism Homo sapiens
Protease Family
Protease ID
Chromosome location
UniProt ID Q96A83

2 N-termini - 1 C-termini - 0 Cleavages - 0 Substrates

Sequence

        10         20         30         40         50         60 
MKLALLLPWA CCCLCGSALA TGFLYPFSAA ALQQHGYPEP GAGSPGSGYA SRRHWCHHTV 
        70         80         90        100        110        120 
TRTVSCQVQN GSETVVQRVY QSCRWPGPCA NLVSYRTLIR PTYRVSYRTV TVLEWRCCPG 
       130        140        150        160        170        180 
FTGSNCDEEC MNCTRLSDMS ERLTTLEAKV LLLEAAERPS SPDNDLPAPE STPPTWNEDF 
       190        200        210        220        230        240 
LPDAIPLAHP VPRQRRPTGP AGPPGQTGPP GPAGPPGSKG DRGQTGEKGP AGPPGLLGPP 
       250        260        270        280        290        300 
GPRGLPGEMG RPGPPGPPGP AGNPGPSPNS PQGALYSLQP PTDKDNGDSR LASAIVDTVL 
       310        320        330        340        350        360 
AGVPGPRGPP GPPGPPGPRG PPGPPGTPGS QGLAGERGTV GPSGEPGVKG EEGEKAATAE 
       370        380        390        400        410        420 
GEGVQQLREA LKILAERVLI LEHMIGIHDP LASPEGGSGQ DAALRANLKM KRGGAQPDGV 
       430        440    
LAALLGPDPG QKSVDQASSR K

Isoforms

- Isoform 2 of Collagen alpha-1(XXVI) chain





Domains & Features

2 N-termini - 1 C-termini - 0 Cleavages - 0 Substrates

N-termini

    Name | Sequence | Position | Modification | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMID)
    Q96A83-1-unknown | MKLALL... | 1 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt71437 |
    Q96A83-21-unknown | TGFLYP... | 21 | | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot |
    Q96A83-21-unknown | TGFLYP... | 21 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt112788 |

C-termini

    Name | Sequence | Position | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs)
    | ...ASSRK | 441 | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot |
    | ...ASSRK | 441 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt67055 |
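
The terminus positions listed above can be cross-checked against the sequence in the Sequence section. The following is a minimal Python sketch, not part of TopFIND itself: the helper names (`parse_sequence_block`, `n_terminus`, `c_terminus`) are hypothetical, and it assumes positions are 1-based and that ruler lines contain only digits and spaces.

```python
import re

# Sequence block copied from the Sequence section of this record,
# including the position ruler lines.
RAW_BLOCK = """
        10         20         30         40         50         60
MKLALLLPWA CCCLCGSALA TGFLYPFSAA ALQQHGYPEP GAGSPGSGYA SRRHWCHHTV
        70         80         90        100        110        120
TRTVSCQVQN GSETVVQRVY QSCRWPGPCA NLVSYRTLIR PTYRVSYRTV TVLEWRCCPG
       130        140        150        160        170        180
FTGSNCDEEC MNCTRLSDMS ERLTTLEAKV LLLEAAERPS SPDNDLPAPE STPPTWNEDF
       190        200        210        220        230        240
LPDAIPLAHP VPRQRRPTGP AGPPGQTGPP GPAGPPGSKG DRGQTGEKGP AGPPGLLGPP
       250        260        270        280        290        300
GPRGLPGEMG RPGPPGPPGP AGNPGPSPNS PQGALYSLQP PTDKDNGDSR LASAIVDTVL
       310        320        330        340        350        360
AGVPGPRGPP GPPGPPGPRG PPGPPGTPGS QGLAGERGTV GPSGEPGVKG EEGEKAATAE
       370        380        390        400        410        420
GEGVQQLREA LKILAERVLI LEHMIGIHDP LASPEGGSGQ DAALRANLKM KRGGAQPDGV
       430        440
LAALLGPDPG QKSVDQASSR K
"""


def parse_sequence_block(block: str) -> str:
    """Join a ruler-annotated sequence block into one contiguous string."""
    chunks = []
    for line in block.splitlines():
        # Ruler lines hold only digits and spaces; sequence lines hold letters.
        if re.search(r"[A-Za-z]", line):
            chunks.append(re.sub(r"\s+", "", line))
    return "".join(chunks)


def n_terminus(seq: str, position: int, width: int = 6) -> str:
    """Return `width` residues starting at the 1-based `position`."""
    return seq[position - 1 : position - 1 + width]


def c_terminus(seq: str, position: int, width: int = 5) -> str:
    """Return `width` residues ending at the 1-based `position`."""
    return seq[max(0, position - width) : position]


sequence = parse_sequence_block(RAW_BLOCK)
print(len(sequence))              # 441
print(n_terminus(sequence, 1))    # MKLALL
print(n_terminus(sequence, 21))   # TGFLYP
print(c_terminus(sequence, 441))  # ASSRK
```

The printed fragments match the truncated Sequence entries in the N-termini and C-termini tables (`MKLALL...`, `TGFLYP...`, `...ASSRK`).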

Cleavages

    Protease | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs)

Substrates

    Substrate | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs)