TopFIND 4.0

P15848: Arylsulfatase B

General Information

Protein names
- Arylsulfatase B
- ASB
- 3.1.6.12
- N-acetylgalactosamine-4-sulfatase
- G4S

Gene names ARSB
Organism Homo sapiens
Protease Family
Protease ID
Chromosome location
UniProt ID P15848

2

N-termini

1

C-termini

0

Cleavages

0

Substrates

Sequence

        10         20         30         40         50         60 
MGPRGAASLP RGPGPRRLLL PVVLPLLLLL LLAPPGSGAG ASRPPHLVFL LADDLGWNDV 
        70         80         90        100        110        120 
GFHGSRIRTP HLDALAAGGV LLDNYYTQPL CTPSRSQLLT GRYQIRTGLQ HQIIWPCQPS 
       130        140        150        160        170        180 
CVPLDEKLLP QLLKEAGYTT HMVGKWHLGM YRKECLPTRR GFDTYFGYLL GSEDYYSHER 
       190        200        210        220        230        240 
CTLIDALNVT RCALDFRDGE EVATGYKNMY STNIFTKRAI ALITNHPPEK PLFLYLALQS 
       250        260        270        280        290        300 
VHEPLQVPEE YLKPYDFIQD KNRHHYAGMV SLMDEAVGNV TAALKSSGLW NNTVFIFSTD 
       310        320        330        340        350        360 
NGGQTLAGGN NWPLRGRKWS LWEGGVRGVG FVASPLLKQK GVKNRELIHI SDWLPTLVKL 
       370        380        390        400        410        420 
ARGHTNGTKP LDGFDVWKTI SEGSPSPRIE LLHNIDPNFV DSSPCPRNSM APAKDDSSLP 
       430        440        450        460        470        480 
EYSAFNTSVH AAIRHGNWKL LTGYPGCGYW FPPPSQYNVS EIPSSDPPTK TLWLFDIDRD 
       490        500        510        520        530    
PEERHDLSRE YPHIVTKLLS RLQFYHKHSV PVYFPAQDPR CDPKATGVWG PWM

Isoforms

- Isoform 2 of Arylsulfatase B

Sequence View

        10         20         30         40         50         60 
MGPRGAASLP RGPGPRRLLL PVVLPLLLLL LLAPPGSGAG ASRPPHLVFL LADDLGWNDV 
        70         80         90        100        110        120 
GFHGSRIRTP HLDALAAGGV LLDNYYTQPL CTPSRSQLLT GRYQIRTGLQ HQIIWPCQPS 
       130        140        150        160        170        180 
CVPLDEKLLP QLLKEAGYTT HMVGKWHLGM YRKECLPTRR GFDTYFGYLL GSEDYYSHER 
       190        200        210        220        230        240 
CTLIDALNVT RCALDFRDGE EVATGYKNMY STNIFTKRAI ALITNHPPEK PLFLYLALQS 
       250        260        270        280        290        300 
VHEPLQVPEE YLKPYDFIQD KNRHHYAGMV SLMDEAVGNV TAALKSSGLW NNTVFIFSTD 
       310        320        330        340        350        360 
NGGQTLAGGN NWPLRGRKWS LWEGGVRGVG FVASPLLKQK GVKNRELIHI SDWLPTLVKL 
       370        380        390        400        410        420 
ARGHTNGTKP LDGFDVWKTI SEGSPSPRIE LLHNIDPNFV DSSPCPRNSM APAKDDSSLP 
       430        440        450        460        470        480 
EYSAFNTSVH AAIRHGNWKL LTGYPGCGYW FPPPSQYNVS EIPSSDPPTK TLWLFDIDRD 
       490        500        510        520        530    
PEERHDLSRE YPHIVTKLLS RLQFYHKHSV PVYFPAQDPR CDPKATGVWG PWM



Filter Information:


(REFRESH)

Directness:


Physiological Relevance:


Evidence Codes:


Methodology:


Perturbation of System:


Biological System:


Protease Assignment Confidence:


Evidence Names:


Database:


Lab:



Protein Neighborhood

Domains & Features

2 N-termini - 1 C-termini - 0 Cleavages - 0 Substrates

N-termini

    Name Sequence Position Modification Evidence type Method Source (database) Source (Lab) Evidence name Publications (PMID)
    P15848-1-unknown MGPRGA... 1 inferred from isoform by sequence similarity unknown TopFIND inferred from TNt67565
    P15848-37-unknown SGAGAS... 37 inferred from electronic annotation electronic annotation UniProtKB inferred from uniprot
    P15848-37-unknown SGAGAS... 37 inferred from isoform by sequence similarity unknown TopFIND inferred from TNt115311

C-termini

    Name Sequence Position Evidence type Method Source (database) Source (Lab) Evidence name Publications (PMIDs)
    ...WGPWM 533 inferred from electronic annotation electronic annotation UniProtKB inferred from uniprot

Cleavages

    Protease Position Sequence Evidence type Method Source (database) Source (Lab) Evidence name Publications (PMIDs)

Substrates

    Substrate Position Sequence Evidence type Method Source (database) Source (Lab) Evidence name Publications (PMIDs)