Q4LE39: AT-rich interactive domain-containing protein 4B
Protein names | AT-rich interactive domain-containing protein 4B; ARID domain-containing protein 4B; 180 kDa Sin3-associated polypeptide; Sin3-associated polypeptide p180; Breast cancer-associated antigen BRCAA1; Histone deacetylase complex subunit SAP180; Retinoblastoma-binding protein 1-like 1 |
---|---|
Gene names | ARID4B |
Organism | Homo sapiens |
Protease Family | |
Protease ID | |
Chromosome location | |
UniProt ID | Q4LE39 |
3 N-termini - 2 C-termini - 1 Cleavages - 0 Substrates
Sequence
10 20 30 40 50 60
MKALDEPPYL TVGTDVSAKY RGAFCEAKIK TAKRLVKVKV TFRHDSSTVE VQDDHIKGPL
70 80 90 100 110 120
KVGAIVEVKN LDGAYQEAVI NKLTDASWYT VVFDDGDEKT LRRSSLCLKG ERHFAESETL
130 140 150 160 170 180
DQLPLTNPEH FGTPVIGKKT NRGRRSNHIP EEESSSSSSD EDEDDRKQID ELLGKVVCVD
190 200 210 220 230 240
YISLDKKKAL WFPALVVCPD CSDEIAVKKD NILVRSFKDG KFTSVPRKDV HEITSDTAPK
250 260 270 280 290 300
PDAVLKQAFE QALEFHKSRT IPANWKTELK EDSSSSEAEE EEEEEDDEKE KEDNSSEEEE
310 320 330 340 350 360
EIEPFPEERE NFLQQLYKFM EDRGTPINKR PVLGYRNLNL FKLFRLVHKL GGFDNIESGA
370 380 390 400 410 420
VWKQVYQDLG IPVLNSAAGY NVKCAYKKYL YGFEEYCRSA NIEFQMALPE KVVNKQCKEC
430 440 450 460 470 480
ENVKEIKVKE ENETEIKEIK MEEERNIIPR EEKPIEDEIE RKENIKPSLG SKKNLLESIP
490 500 510 520 530 540
THSDQEKEVN IKKPEDNENL DDKDDDTTRV DESLNIKVEA EEEKAKSGDE TNKEEDEDDE
550 560 570 580 590 600
EAEEEEEEEE EEEDEDDDDN NEEEEFECYP PGMKVQVRYG RGKNQKMYEA SIKDSDVEGG
610 620 630 640 650 660
EVLYLVHYCG WNVRYDEWIK ADKIVRPADK NVPKIKHRKK IKNKLDKEKD KDEKYSPKNC
670 680 690 700 710 720
KLRRLSKPPF QTNPSPEMVS KLDLTDAKNS DTAHIKSIEI TSILNGLQAS ESSAEDSEQE
730 740 750 760 770 780
DERGAQDMDN NGKEESKIDH LTNNRNDLIS KEEQNSSSLL EENKVHADLV ISKPVSKSPE
790 800 810 820 830 840
RLRKDIEVLS EDTDYEEDEV TKKRKDVKKD TTDKSSKPQI KRGKRRYCNT EECLKTGSPG
850 860 870 880 890 900
KKEEKAKNKE SLCMENSSNS SSDEDEEETK AKMTPTKKYN GLEEKRKSLR TTGFYSGFSE
910 920 930 940 950 960
VAEKRIKLLN NSDERLQNSR AKDRKDVWSS IQGQWPKKTL KELFSDSDTE AAASPPHPAP
970 980 990 1000 1010 1020
EEGVAEESLQ TVAEEESCSP SVELEKPPPV NVDSKPIEEK TVEVNDRKAE FPSSGSNSVL
1030 1040 1050 1060 1070 1080
NTPPTTPESP SSVTVTEGSR QQSSVTVSEP LAPNQEEVRS IKSETDSTIE VDSVAGELQD
1090 1100 1110 1120 1130 1140
LQSEGNSSPA GFDASVSSSS SNQPEPEHPE KACTGQKRVK DAQGGGSSSK KQKRSHKATV
1150 1160 1170 1180 1190 1200
VNNKKKGKGT NSSDSEELSA GESITKSQPV KSVSTGMKSH STKSPARTQS PGKCGKNGDK
1210 1220 1230 1240 1250 1260
DPDLKEPSNR LPKVYKWSFQ MSDLENMTSA ERITILQEKL QEIRKHYLSL KSEVASIDRR
1270 1280 1290 1300 1310
RKRLKKKERE SAATSSSSSS PSSSSITAAV MLTLAEPSMS SASQNGMSVE CR
Isoforms
- Isoform 2 of AT-rich interactive domain-containing protein 4B
- Isoform 3 of AT-rich interactive domain-containing protein 4B
- Isoform 4 of AT-rich interactive domain-containing protein 4B
N-termini
Name | Sequence | Position | Modification | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMID) |
---|---|---|---|---|---|---|---|---|---|
Q4LE39-1-unknown | MKALDE... | 1 | | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot | |
Q4LE39-1-unknown | MKALDE... | 1 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt67451 | |
Q4LE39-1-unknown | MKALDE... | 1 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt67452 | |
Q4LE39-201-unknown | CSDEIA... | 201 | | inferred from cleavage | unknown | TopFIND | | Inferred from cleavage TC12454 | |
Q4LE39-201-unknown | CSDEIA... | 201 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt146608 | |
Q4LE39-201-unknown | CSDEIA... | 201 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt146609 | |
Q4LE39-320-unknown | MEDRGT... | 320 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt67453 | |
Q4LE39-320-unknown | MEDRGT... | 320 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt102975 | |
Q4LE39-320-unknown | MEDRGT... | 320 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt102976 | |
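
The positions in the N-termini table are 1-based indices into the canonical sequence, so each listed start can be checked directly against the Sequence section. The following is a minimal sketch of that check, assuming only the first 360 residues copied from this entry; the helper name `n_terminal_window` is hypothetical and not part of any TopFIND or UniProt tooling.

```python
# First 360 residues of Q4LE39, copied from the Sequence section of this entry.
# Whitespace between the 10-residue blocks is stripped by split()/join().
SEQ_1_360 = "".join("""
MKALDEPPYL TVGTDVSAKY RGAFCEAKIK TAKRLVKVKV TFRHDSSTVE VQDDHIKGPL
KVGAIVEVKN LDGAYQEAVI NKLTDASWYT VVFDDGDEKT LRRSSLCLKG ERHFAESETL
DQLPLTNPEH FGTPVIGKKT NRGRRSNHIP EEESSSSSSD EDEDDRKQID ELLGKVVCVD
YISLDKKKAL WFPALVVCPD CSDEIAVKKD NILVRSFKDG KFTSVPRKDV HEITSDTAPK
PDAVLKQAFE QALEFHKSRT IPANWKTELK EDSSSSEAEE EEEEEDDEKE KEDNSSEEEE
EIEPFPEERE NFLQQLYKFM EDRGTPINKR PVLGYRNLNL FKLFRLVHKL GGFDNIESGA
""".split())


def n_terminal_window(seq: str, position: int, length: int = 6) -> str:
    """Return `length` residues starting at the 1-based `position` (hypothetical helper)."""
    start = position - 1  # convert 1-based protein numbering to a 0-based index
    return seq[start:start + length]


# N-terminus positions listed in the table, with the sequence prefix shown there.
listed_n_termini = {1: "MKALDE", 201: "CSDEIA", 320: "MEDRGT"}

for position, expected in listed_n_termini.items():
    observed = n_terminal_window(SEQ_1_360, position)
    print(position, observed, observed == expected)
```

Each of the three listed positions reproduces the six-residue prefix shown in the table when read this way.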
C-termini
| Name | Sequence | Position | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
|---|---|---|---|---|---|---|---|---|
| | ...VVCPDC | 200 | inferred from cleavage | unknown | TopFIND | | Inferred from cleavage TC12454 | |
| | ...VVCPDC | 200 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt129915 | |
| | ...VVCPDC | 200 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt129916 | |
| | ...SVECR | 1312 | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot | |
| | ...SVECR | 1312 | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TCt63069 | |
Cleavages
Protease | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|
GRAB_HUMAN | 200 | CPDC.\|.CSDE | inferred from experiment | unknown | MEROPS | Gevaert K | Van Damme P et al.: Complementary positional proteo... (M14.017) | 20526345 |
Substrates
Substrate | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|