P0C0L4: Complement C4-A
Protein names | - Complement C4-A - Acidic complement C4 - C3 and PZP-like alpha-2-macroglobulin domain-containing protein 2 - Complement C4 beta chain - Complement C4-A alpha chain - C4a anaphylatoxin - C4b-A - C4d-A - Complement C4 gamma chain |
---|---|
Gene names | C4A |
Organism | Homo sapiens |
Protease Family | |
Protease ID | |
Chromosome location | |
UniProt ID | P0C0L4 |
22
N-termini
7
C-termini
5
Cleavages
0
Substrates
Sequence
10 20 30 40 50 60
MRLLWGLIWA SSFFTLSLQK PRLLLFSPSV VHLGVPLSVG VQLQDVPRGQ VVKGSVFLRN
70 80 90 100 110 120
PSRNNVPCSP KVDFTLSSER DFALLSLQVP LKDAKSCGLH QLLRGPEVQL VAHSPWLKDS
130 140 150 160 170 180
LSRTTNIQGI NLLFSSRRGH LFLQTDQPIY NPGQRVRYRV FALDQKMRPS TDTITVMVEN
190 200 210 220 230 240
SHGLRVRKKE VYMPSSIFQD DFVIPDISEP GTWKISARFS DGLESNSSTQ FEVKKYVLPN
250 260 270 280 290 300
FEVKITPGKP YILTVPGHLD EMQLDIQARY IYGKPVQGVA YVRFGLLDED GKKTFFRGLE
310 320 330 340 350 360
SQTKLVNGQS HISLSKAEFQ DALEKLNMGI TDLQGLRLYV AAAIIESPGG EMEEAELTSW
370 380 390 400 410 420
YFVSSPFSLD LSKTKRHLVP GAPFLLQALV REMSGSPASG IPVKVSATVS SPGSVPEVQD
430 440 450 460 470 480
IQQNTDGSGQ VSIPIIIPQT ISELQLSVSA GSPHPAIARL TVAAPPSGGP GFLSIERPDS
490 500 510 520 530 540
RPPRVGDTLN LNLRAVGSGA TFSHYYYMIL SRGQIVFMNR EPKRTLTSVS VFVDHHLAPS
550 560 570 580 590 600
FYFVAFYYHG DHPVANSLRV DVQAGACEGK LELSVDGAKQ YRNGESVKLH LETDSLALVA
610 620 630 640 650 660
LGALDTALYA AGSKSHKPLN MGKVFEAMNS YDLGCGPGGG DSALQVFQAA GLAFSDGDQW
670 680 690 700 710 720
TLSRKRLSCP KEKTTRKKRN VNFQKAINEK LGQYASPTAK RCCQDGVTRL PMMRSCEQRA
730 740 750 760 770 780
ARVQQPDCRE PFLSCCQFAE SLRKKSRDKG QAGLQRALEI LQEEDLIDED DIPVRSFFPE
790 800 810 820 830 840
NWLWRVETVD RFQILTLWLP DSLTTWEIHG LSLSKTKGLC VATPVQLRVF REFHLHLRLP
850 860 870 880 890 900
MSVRRFEQLE LRPVLYNYLD KNLTVSVHVS PVEGLCLAGG GGLAQQVLVP AGSARPVAFS
910 920 930 940 950 960
VVPTAAAAVS LKVVARGSFE FPVGDAVSKV LQIEKEGAIH REELVYELNP LDHRGRTLEI
970 980 990 1000 1010 1020
PGNSDPNMIP DGDFNSYVRV TASDPLDTLG SEGALSPGGV ASLLRLPRGC GEQTMIYLAP
1030 1040 1050 1060 1070 1080
TLAASRYLDK TEQWSTLPPE TKDHAVDLIQ KGYMRIQQFR KADGSYAAWL SRDSSTWLTA
1090 1100 1110 1120 1130 1140
FVLKVLSLAQ EQVGGSPEKL QETSNWLLSQ QQADGSFQDP CPVLDRSMQG GLVGNDETVA
1150 1160 1170 1180 1190 1200
LTAFVTIALH HGLAVFQDEG AEPLKQRVEA SISKANSFLG EKASAGLLGA HAAAITAYAL
1210 1220 1230 1240 1250 1260
TLTKAPVDLL GVAHNNLMAM AQETGDNLYW GSVTGSQSNA VSPTPAPRNP SDPMPQAPAL
1270 1280 1290 1300 1310 1320
WIETTAYALL HLLLHEGKAE MADQASAWLT RQGSFQGGFR STQDTVIALD ALSAYWIASH
1330 1340 1350 1360 1370 1380
TTEERGLNVT LSSTGRNGFK SHALQLNNRQ IRGLEEELQF SLGSKINVKV GGNSKGTLKV
1390 1400 1410 1420 1430 1440
LRTYNVLDMK NTTCQDLQIE VTVKGHVEYT MEANEDYEDY EYDELPAKDD PDAPLQPVTP
1450 1460 1470 1480 1490 1500
LQLFEGRRNR RRREAPKVVE EQESRVHYTV CIWRNGKVGL SGMAIADVTL LSGFHALRAD
1510 1520 1530 1540 1550 1560
LEKLTSLSDR YVSHFETEGP HVLLYFDSVP TSRECVGFEA VQEVPVGLVQ PASATLYDYY
1570 1580 1590 1600 1610 1620
NPERRCSVFY GAPSKSRLLA TLCSAEVCQC AEGKCPRQRR ALERGLQDED GYRMKFACYY
1630 1640 1650 1660 1670 1680
PRVEYGFQVK VLREDSRAAF RLFETKITQV LHFTKDVKAA ANQMRNFLVR ASCRLRLEPG
1690 1700 1710 1720 1730 1740
KEYLIMGLDG ATYDLEGHPQ YLLDSNSWIE EMPSERLCRS TRQRAACAQL NDFLQEYGTQ
GCQV
Isoforms
- Isoform 2 of Complement C4-ASequence View
10 20 30 40 50 60
MRLLWGLIWA SSFFTLSLQK PRLLLFSPSV VHLGVPLSVG VQLQDVPRGQ VVKGSVFLRN
70 80 90 100 110 120
PSRNNVPCSP KVDFTLSSER DFALLSLQVP LKDAKSCGLH QLLRGPEVQL VAHSPWLKDS
130 140 150 160 170 180
LSRTTNIQGI NLLFSSRRGH LFLQTDQPIY NPGQRVRYRV FALDQKMRPS TDTITVMVEN
190 200 210 220 230 240
SHGLRVRKKE VYMPSSIFQD DFVIPDISEP GTWKISARFS DGLESNSSTQ FEVKKYVLPN
250 260 270 280 290 300
FEVKITPGKP YILTVPGHLD EMQLDIQARY IYGKPVQGVA YVRFGLLDED GKKTFFRGLE
310 320 330 340 350 360
SQTKLVNGQS HISLSKAEFQ DALEKLNMGI TDLQGLRLYV AAAIIESPGG EMEEAELTSW
370 380 390 400 410 420
YFVSSPFSLD LSKTKRHLVP GAPFLLQALV REMSGSPASG IPVKVSATVS SPGSVPEVQD
430 440 450 460 470 480
IQQNTDGSGQ VSIPIIIPQT ISELQLSVSA GSPHPAIARL TVAAPPSGGP GFLSIERPDS
490 500 510 520 530 540
RPPRVGDTLN LNLRAVGSGA TFSHYYYMIL SRGQIVFMNR EPKRTLTSVS VFVDHHLAPS
550 560 570 580 590 600
FYFVAFYYHG DHPVANSLRV DVQAGACEGK LELSVDGAKQ YRNGESVKLH LETDSLALVA
610 620 630 640 650 660
LGALDTALYA AGSKSHKPLN MGKVFEAMNS YDLGCGPGGG DSALQVFQAA GLAFSDGDQW
670 680 690 700 710 720
TLSRKRLSCP KEKTTRKKRN VNFQKAINEK LGQYASPTAK RCCQDGVTRL PMMRSCEQRA
730 740 750 760 770 780
ARVQQPDCRE PFLSCCQFAE SLRKKSRDKG QAGLQRALEI LQEEDLIDED DIPVRSFFPE
790 800 810 820 830 840
NWLWRVETVD RFQILTLWLP DSLTTWEIHG LSLSKTKGLC VATPVQLRVF REFHLHLRLP
850 860 870 880 890 900
MSVRRFEQLE LRPVLYNYLD KNLTVSVHVS PVEGLCLAGG GGLAQQVLVP AGSARPVAFS
910 920 930 940 950 960
VVPTAAAAVS LKVVARGSFE FPVGDAVSKV LQIEKEGAIH REELVYELNP LDHRGRTLEI
970 980 990 1000 1010 1020
PGNSDPNMIP DGDFNSYVRV TASDPLDTLG SEGALSPGGV ASLLRLPRGC GEQTMIYLAP
1030 1040 1050 1060 1070 1080
TLAASRYLDK TEQWSTLPPE TKDHAVDLIQ KGYMRIQQFR KADGSYAAWL SRDSSTWLTA
1090 1100 1110 1120 1130 1140
FVLKVLSLAQ EQVGGSPEKL QETSNWLLSQ QQADGSFQDP CPVLDRSMQG GLVGNDETVA
1150 1160 1170 1180 1190 1200
LTAFVTIALH HGLAVFQDEG AEPLKQRVEA SISKANSFLG EKASAGLLGA HAAAITAYAL
1210 1220 1230 1240 1250 1260
TLTKAPVDLL GVAHNNLMAM AQETGDNLYW GSVTGSQSNA VSPTPAPRNP SDPMPQAPAL
1270 1280 1290 1300 1310 1320
WIETTAYALL HLLLHEGKAE MADQASAWLT RQGSFQGGFR STQDTVIALD ALSAYWIASH
1330 1340 1350 1360 1370 1380
TTEERGLNVT LSSTGRNGFK SHALQLNNRQ IRGLEEELQF SLGSKINVKV GGNSKGTLKV
1390 1400 1410 1420 1430 1440
LRTYNVLDMK NTTCQDLQIE VTVKGHVEYT MEANEDYEDY EYDELPAKDD PDAPLQPVTP
1450 1460 1470 1480 1490 1500
LQLFEGRRNR RRREAPKVVE EQESRVHYTV CIWRNGKVGL SGMAIADVTL LSGFHALRAD
1510 1520 1530 1540 1550 1560
LEKLTSLSDR YVSHFETEGP HVLLYFDSVP TSRECVGFEA VQEVPVGLVQ PASATLYDYY
1570 1580 1590 1600 1610 1620
NPERRCSVFY GAPSKSRLLA TLCSAEVCQC AEGKCPRQRR ALERGLQDED GYRMKFACYY
1630 1640 1650 1660 1670 1680
PRVEYGFQVK VLREDSRAAF RLFETKITQV LHFTKDVKAA ANQMRNFLVR ASCRLRLEPG
1690 1700 1710 1720 1730 1740
KEYLIMGLDG ATYDLEGHPQ YLLDSNSWIE EMPSERLCRS TRQRAACAQL NDFLQEYGTQ
GCQV
Protein Neighborhood
Domains & Features
22 N-termini - 7 C-termini - 5 Cleavages - 0 Substrates
N-termini
Name | Sequence | Position | Modification | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMID) |
---|---|---|---|---|---|---|---|---|---|
P0C0L4-1-unknown | MRLLWG... | 1 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt71279 | |||
P0C0L4-20-unknown | KPRLLL... | 20 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot | |||
P0C0L4-20-unknown | KPRLLL... | 20 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt112364 | |||
P0C0L4-680-unknown | NVNFQK... | 680 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot | |||
P0C0L4-680-unknown | NVNFQK... | 680 | unknown | Wildes D. et al: Sampling the N-terminal proteome of human blood | 20173099 | ||||
P0C0L4-680-unknown | NVNFQK... | 680 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt116287 | |||
P0C0L4-702-unknown | CCQDGV... | 702 | unknown | Wildes D. et al: Sampling the N-terminal proteome of human blood | 20173099 | ||||
P0C0L4-702-unknown | CCQDGV... | 702 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt180273 | |||
P0C0L4-756-unknown | RALEIL... | 756 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC8327 | |||
P0C0L4-756-unknown | RALEIL... | 756 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt156293 | |||
P0C0L4-757-unknown | ALEILQ... | 757 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot | |||
P0C0L4-757-unknown | ALEILQ... | 757 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC16816 | |||
P0C0L4-757-unknown | ALEILQ... | 757 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC712 | |||
P0C0L4-757-unknown | ALEILQ... | 757 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt116376 | |||
P0C0L4-917-unknown | GSFEFP... | 917 | unknown | Wildes D. et al: Sampling the N-terminal proteome of human blood | 20173099 | ||||
P0C0L4-917-unknown | GSFEFP... | 917 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt180275 | |||
P0C0L4-930-unknown | VLQIEK... | 930 | unknown | Wildes D. et al: Sampling the N-terminal proteome of human blood | 20173099 | ||||
P0C0L4-930-unknown | VLQIEK... | 930 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt180277 | |||
P0C0L4-957-unknown | TLEIPG... | 957 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot | |||
P0C0L4-957-unknown | TLEIPG... | 957 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC8359 | |||
P0C0L4-957-unknown | TLEIPG... | 957 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt116408 | |||
P0C0L4-964-unknown | SDPNMI... | 964 | unknown | Wildes D. et al: Sampling the N-terminal proteome of human blood | 20173099 | ||||
P0C0L4-964-unknown | SDPNMI... | 964 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt180279 | |||
P0C0L4-968-unknown | MIPDGD... | 968 | unknown | Wildes D. et al: Sampling the N-terminal proteome of human blood | 20173099 | ||||
P0C0L4-968-unknown | MIPDGD... | 968 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt180281 | |||
P0C0L4-1337-unknown | NGFKSH... | 1337 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC8361 | |||
P0C0L4-1337-unknown | NGFKSH... | 1337 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt157951 | |||
P0C0L4-1341-unknown | SHALQL... | 1341 | unknown | Wildes D. et al: Sampling the N-terminal proteome of human blood | 20173099 | ||||
P0C0L4-1341-unknown | SHALQL... | 1341 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt180283 | |||
P0C0L4-1353-unknown | GLEEEL... | 1353 | unknown | Wildes D. et al: Sampling the N-terminal proteome of human blood | 20173099 | ||||
P0C0L4-1353-unknown | GLEEEL... | 1353 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt180285 | |||
P0C0L4-1361-unknown | SLGSKI... | 1361 | unknown | Wildes D. et al: Sampling the N-terminal proteome of human blood | 20173099 | ||||
P0C0L4-1361-unknown | SLGSKI... | 1361 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt180287 | |||
P0C0L4-1370-unknown | VGGNSK... | 1370 | unknown | Wildes D. et al: Sampling the N-terminal proteome of human blood | 20173099 | ||||
P0C0L4-1370-unknown | VGGNSK... | 1370 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt180289 | |||
P0C0L4-1454-unknown | EAPKVV... | 1454 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot | |||
P0C0L4-1454-unknown | EAPKVV... | 1454 | unknown | Wildes D. et al: Sampling the N-terminal proteome of human blood | 20173099 | ||||
P0C0L4-1458-unknown | VVEEQE... | 1458 | unknown | Wildes D. et al: Sampling the N-terminal proteome of human blood | 20173099 | ||||
P0C0L4-1584-unknown | SAEVCQ... | 1584 | unknown | Wildes D. et al: Sampling the N-terminal proteome of human blood | 20173099 | ||||
P0C0L4-1584-unknown | SAEVCQ... | 1584 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt180292 | |||
P0C0L4-1605-unknown | GLQDED... | 1605 | unknown | Wildes D. et al: Sampling the N-terminal proteome of human blood | 20173099 | ||||
P0C0L4-1605-unknown | GLQDED... | 1605 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt180294 | |||
P0C0L4-1707-unknown | SWIEEM... | 1707 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TNt176325 | |||
P0C0L4-1707- | SWIEEM... | 1707 | Subtiligase Based Positive Selection | Wells | apoptotic_MM1s_bort | 23264352 |
C-termini
Name | Sequence | Position | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|
...KEKTTR | 675 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot | |||
...KEKTTR | 675 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt93230 | |||
...QAGLQR | 755 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC8327 | |||
...QAGLQR | 755 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt139765 | |||
...AGLQRA | 756 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot | |||
...AGLQRA | 756 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC16816 | |||
...AGLQRA | 756 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC712 | |||
...AGLQRA | 756 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt93360 | |||
...DHRGRT | 956 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC8359 | |||
...DHRGRT | 956 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt140563 | |||
...SSTGRN | 1336 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot | |||
...SSTGRN | 1336 | inferred from cleavage | unknown | TopFIND | Inferred from cleavage TC8361 | |||
...SSTGRN | 1336 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt93735 | |||
...QLFEGR | 1446 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot | |||
...QLFEGR | 1446 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt93757 | |||
...QGCQV | 1744 | inferred from electronic annotation | electronic annotation | UniProtKB | inferred from uniprot | |||
...QGCQV | 1744 | inferred from isoform by sequence similarity | unknown | TopFIND | inferred from TCt66897 |
Cleavages
Protease | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|
CBPN_HUMAN | 755 | GLQR.|.RALE | inferred from experiment | unknown | MEROPS | Merops CBPN_HUMAN -> CO4A_HUMAN @755 | ||
MASP2_HUMAN | 756 | LQRA.|.ALEI | inferred from experiment | unknown | MEROPS | Fujita T | Matsushita M et al.:Proteolytic activities of two t... (S01.229) | 10946292, |
C1S_HUMAN | 756 | LQRA.|.ALEI | inferred from experiment | unknown | MEROPS | Tsiftsoglou SA | Sim RB et al.:Proteases of the complement sys... (S01.193) | 14748705, |
CFAI_HUMAN | 956 | RGRT.|.TLEI | inferred from experiment | unknown | MEROPS | Merops CFAI_HUMAN -> CO4A_HUMAN @956 | ||
CFAI_HUMAN | 1336 | TGRN.|.NGFK | inferred from experiment | unknown | MEROPS | Merops CFAI_HUMAN -> CO4A_HUMAN @1336 |
Substrates
Substrate | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|