TopFIND 4.0

Q5SNV9: Uncharacterized protein C1orf167

General Information

Protein names
- Uncharacterized protein C1orf167

Gene names C1orf167
Organism Homo sapiens
Protease Family
Protease ID
Chromosome location
UniProt ID Q5SNV9

1 N-termini - 1 C-termini - 0 Cleavages - 0 Substrates

Sequence

        10         20         30         40         50         60 
MTRMSCPWGS YQWEPGACPA APRGIGGGDM AGGIPDVRGL QEAALGAGRS QEEARLVEEA 
        70         80         90        100        110        120 
QTPVMLPQDS GQRVEEVPGD LMAKRMSLIL HVQKLPWDHV PCLRRTRQNL YQDVGGHAHG 
       130        140        150        160        170        180 
SGLGGAKRGA ARSALRRPLP PATCRPAGIV SGPSPRLDSN PTGAHLIKQT RPLTVEWTKD 
       190        200        210        220        230        240 
TPVPEPMELR SDASHKENVS PKPAALPKPG KRLKQRRFRR SLGIGLSGRH DQWVPGCQVE 
       250        260        270        280        290        300 
RGGPAATPSP GAVLDQEPCR VQTNLASPGP RLGLALKDTT GQLVNSSFWQ QSNLQSLARR 
       310        320        330        340        350        360 
RQGKAREFAI QQSNLSINET SSPHLCPEPG GSSGPHKLPW GPLLSQEPLA RPSSCLRQSG 
       370        380        390        400        410        420 
LPAPGTPSGD FRPTEAFAPL DGHTQPGLRS WGGLGSWRSR LVGEPLTLED LAVPSQNQTQ 
       430        440        450        460        470        480 
APSRAAVHQL LASVHCLAQE AARLRCQAPQ EPPAWGVSPK QKGEEGAPRE RVHREEERTA 
       490        500        510        520        530        540 
FHLSDTVPAS SASKNKAQNI TAPESEAICW QLLSRCFRSW RHLVKRQREP AAAAVALGRW 
       550        560        570        580        590        600 
QLLRKCLQAL WLREAQLEAA WGQYTKVLLV RSFREVSGLQ VGPGGRVKQC PGSLREEEIA 
       610        620        630        640        650        660 
QRLLSHPRQR TDSRHERVQI LQALQLAVFF LWCQQKKRAR QERETLRKAT RATQRTGSFP 
       670        680        690        700        710        720 
QAWHSTAAGV AWVAPLSPQH QRAWLCRCFG AWQQFVQRGS RYRDHLADRR TGTLRKCLEQ 
       730        740        750        760        770        780 
WVRMKQLRES DGAKVTQLSL CRQKAGREAV YTAGPGACGL GAVGQAQGQQ EQGRGSLQDA 
       790        800        810        820        830        840 
CWTLALCWAL LLWKMRLFQR QWANSFFQGL QQRMLQRSLR WWHLRALGPD ATSSCTKTPS 
       850        860        870        880        890        900 
ALEPLSSSTL QDSLEKVPRA PTLPDTLQGS LLWAAGQRQQ GQCLLLWQAR AQQFQGTARW 
       910        920        930        940        950        960 
YQHTRQRRIF LSWSRWATAQ WAWRELASHR AWDRTCRAVL GLWRQRLLQS RLVEWWAQER 
       970        980        990       1000       1010       1020 
GWRLARDALC HWHSCWQGQQ FLHEKCQTWV QVHLQGLQKV VFRSWQQAAA HQRCTVTRPE 
      1030       1040       1050       1060       1070       1080 
QLLLQSYFQA WCEVVRDTGV LRAQHQAFQD GLRRRALGAV FATWREAQEV AAGAQEQRVA 
      1090       1100       1110       1120       1130       1140 
QASLARWRSC GQQGQEDGQQ KKARAPQAFP AWPVAPGMHH EAQQQAGESA GAQAAQCWTW 
      1150       1160       1170       1180       1190       1200 
CWALWVHESC RGQVSRAHAS WKPRAWVLEA SVQSAVRGGV QRAILTQLRP AELRRFLRTV 
      1210       1220       1230       1240       1250       1260 
QLRVRLGLPG AGKTRSCWTQ ATELVPPAPS LQCSLGGRRK PRGTAWAQRC REHSLCPAFQ 
      1270       1280       1290       1300       1310       1320 
LWPQWPGQSS WVPGLPLWTR DQGPRAHSSP EPRACKAQSK AHKRRLRILE KQAQAHGSAL 
      1330       1340       1350       1360       1370       1380 
LLALKGHDAL GHQEEVPAAP VPRGTASRAA GFPAGQVPGS GMAALGGCPR GRAAGADPAQ 
      1390       1400       1410       1420       1430       1440 
GVAPEMGLAD VVAADPATAS GSAVTAAGRW AFKKWHQRLA ARSPRRGAAS SPRPWSKPGP 
      1450       1460    
KGPESGQEAA RAPRGWGLGA EHGAQLQL
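
The sequence block above interleaves ruler lines (positions 10, 20, ...) with residue rows grouped in tens. As a minimal sketch, assuming that layout holds for the whole block, the residues can be recovered by dropping the ruler lines and the spacing. The helper name `parse_sequence_block` is hypothetical and is not part of TopFIND or UniProt; the sample uses only the final row of the block, which follows the row ending at residue 1440.

```python
def parse_sequence_block(text: str) -> str:
    """Join residue rows from a ruler-annotated sequence block (hypothetical helper)."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Ruler lines contain only position numbers (10, 20, ...); skip them and blanks.
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


# Final ruler line and residue row of the block above.
tail = """\
      1450       1460
KGPESGQEAA RAPRGWGLGA EHGAQLQL
"""

tail_residues = parse_sequence_block(tail)
# The preceding row ends at residue 1440, so the last residue sits at:
c_terminus = 1440 + len(tail_residues)
print(tail_residues[-5:], c_terminus)  # prints: AQLQL 1468
```

The result agrees with the C-terminus recorded for this entry (sequence ending ...AQLQL at position 1468).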

Isoforms

- Isoform 2 of Uncharacterized protein C1orf167





Domains & Features

1 N-termini - 1 C-termini - 0 Cleavages - 0 Substrates

N-termini

    Name | Sequence | Position | Modification | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMID)
    Q5SNV9-1-unknown | MTRMSC... | 1 | - | inferred from electronic annotation | electronic annotation | UniProtKB | - | inferred from uniprot | -
    Q5SNV9-1-unknown | MTRMSC... | 1 | - | inferred from isoform by sequence similarity | unknown | TopFIND | - | inferred from TNt68916 | -

C-termini

    Name | Sequence | Position | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs)
    - | ...AQLQL | 1468 | inferred from electronic annotation | electronic annotation | UniProtKB | - | inferred from uniprot | -
    - | ...AQLQL | 1468 | inferred from isoform by sequence similarity | unknown | TopFIND | - | inferred from TCt64534 | -

Cleavages

    Protease Position Sequence Evidence type Method Source (database) Source (Lab) Evidence name Publications (PMIDs)

Substrates

    Substrate Position Sequence Evidence type Method Source (database) Source (Lab) Evidence name Publications (PMIDs)