P23497: Nuclear autoantigen Sp-100
Protein names | Nuclear autoantigen Sp-100; Nuclear dot-associated Sp100 protein; Speckled 100 kDa |
---|---|
Gene names | SP100 |
Organism | Homo sapiens |
Protease Family | |
Protease ID | |
Chromosome location | |
UniProt ID | P23497 |
4 N-termini - 2 C-termini - 0 Cleavages - 0 Substrates
Sequence
10 20 30 40 50 60
MAGGGGDLST RRLNECISPV ANEMNHLPAH SHDLQRMFTE DQGVDDRLLY DIVFKHFKRN
70 80 90 100 110 120
KVEISNAIKK TFPFLEGLRD RDLITNKMFE DSQDSCRNLV PVQRVVYNVL SELEKTFNLP
130 140 150 160 170 180
VLEALFSDVN MQEYPDLIHI YKGFENVIHD KLPLQESEEE EREERSGLQL SLEQGTGENS
190 200 210 220 230 240
FRSLTWPPSG SPSHAGTTPP ENGLSEHPCE TEQINAKRKD TTSDKDDSLG SQQTNEQCAQ
250 260 270 280 290 300
KAEPTESCEQ IAVQVNNGDA GREMPCPLPC DEESPEAELH NHGIQINSCS VRLVDIKKEK
310 320 330 340 350 360
PFSNSKVECQ AQARTHHNQA SDIIVISSED SEGSTDVDEP LEVFISAPRS EPVINNDNPL
370 380 390 400 410 420
ESNDEKEGQE ATCSRPQIVP EPMDFRKLST FRESFKKRVI GQDHDFSESS EEEAPAEASS
430 440 450 460 470 480
GALRSKHGEK APMTSRSTST WRIPSRKRRF SSSDFSDLSN GEELQETCSS SLRRGSGSQP
490 500 510 520 530 540
QEPENKKCSC VMCFPKGVPR SQEARTESSQ ASDMMDTMDV ENNSTLEKHS GKRRKKRRHR
550 560 570 580 590 600
SKVNGLQRGR KKDRPRKHLT LNNKVQKKRW QQRGRKANTR PLKRRRKRGP RIPKDENINF
610 620 630 640 650 660
KQSELPVTCG EVKGTLYKER FKQGTSKKCI QSEDKKWFTP REFEIEGDRG ASKNWKLSIR
670 680 690 700 710 720
CGGYTLKVLM ENKFLPEPPS TRKKRILESH NNTLVDPCEE HKKKNPDASV KFSEFLKKCS
730 740 750 760 770 780
ETWKTIFAKE KGKFEDMAKA DKAHYEREMK TYIPPKGEKK KKFKDPNAPK RPPLAFFLFC
790 800 810 820 830 840
SEYRPKIKGE HPGLSIDDVV KKLAGMWNNT AAADKQFYEK KAAKLKEKYK KDIAAYRAKG
850 860 870
KPNSAKKRVV KAEKSKKKKE EEEDEEDEQE EENEEDDDK
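The listing above prints the sequence in blocks of ten residues, with a position ruler over each row of sixty. The following is a minimal Python sketch for turning such a listing into one continuous string and reading residues by 1-based position. The function names `parse_ruler_block` and `window` are illustrative assumptions, not part of any TopFIND or UniProt tooling, and only the first two rows are used as an excerpt.

```python
def parse_ruler_block(text: str) -> str:
    """Join a ruler-formatted sequence listing into one continuous string."""
    residues = []
    for line in text.strip().splitlines():
        tokens = line.split()
        # Ruler lines hold only position numbers (10 20 30 ...); skip them.
        if all(tok.isdigit() for tok in tokens):
            continue
        residues.extend(tokens)
    return "".join(residues)


def window(seq: str, position: int, length: int = 6) -> str:
    """Return `length` residues starting at the 1-based `position`."""
    start = position - 1
    return seq[start:start + length]


# Excerpt: the first two rows of the listing (residues 1-120).
excerpt = """
10 20 30 40 50 60
MAGGGGDLST RRLNECISPV ANEMNHLPAH SHDLQRMFTE DQGVDDRLLY DIVFKHFKRN
70 80 90 100 110 120
KVEISNAIKK TFPFLEGLRD RDLITNKMFE DSQDSCRNLV PVQRVVYNVL SELEKTFNLP
"""

seq = parse_ruler_block(excerpt)
print(len(seq))        # 120
print(window(seq, 1))  # MAGGGG
print(window(seq, 2))  # AGGGGD
```

Slicing with `position - 1` keeps the page's 1-based coordinates on the caller's side and confines the 0-based conversion to one place.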
Isoforms
- Isoform Sp100-A of Nuclear autoantigen Sp-100
- Isoform Sp100-B of Nuclear autoantigen Sp-100
- Isoform Sp100-C of Nuclear autoantigen Sp-100
- Isoform SpAlt-C of Nuclear autoantigen Sp-100
- Isoform 6 of Nuclear autoantigen Sp-100
- Isoform 7 of Nuclear autoantigen Sp-100
N-termini
Name | Sequence | Position | Modification | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMID) |
---|---|---|---|---|---|---|---|---|---|
P23497-1-unknown | MAGGGG... | 1 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt90290 | |
P23497-1-unknown | MAGGGG... | 1 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt90291 | |
P23497-1-unknown | MAGGGG... | 1 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt90292 | |
P23497-1-unknown | MAGGGG... | 1 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt90293 | |
P23497-2-unknown | MAGGGG... | 1 | | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot | |
P23497-2-Acetylation | AGGGGD... | 2 | acetylation- | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot | |
P23497-2-Acetylation | AGGGGD... | 2 | acetylation- | | COFRADIC | | Gevaert K. | Van Damme P et al.: Complementary positional proteomics for screening substrates... | 20526345 |
P23497-2-Acetylation | AGGGGD... | 2 | acetylation- | | COFRADIC | | Gevaert K. | Van Damme P et al.: PC3-cells, Complementary positional proteomics for screening substrates... | 20526345 |
P23497-2-Acetylation | AGGGGD... | 2 | acetylation- | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt111119 | |
P23497-2-Acetylation | AGGGGD... | 2 | acetylation- | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt111120 | |
P23497-2-Acetylation | AGGGGD... | 2 | acetylation- | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt111121 | |
P23497-2-Acetylation | AGGGGD... | 2 | acetylation- | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt111122 | |
P23497-336-unknown | DVDEPL... | 336 | | inferred from electronic annotation | unknown | Ensembl | | inferred from ensembl protein ENSP00000416563 | |
P23497-336-unknown | DVDEPL... | 336 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt199033 | |
P23497-336-unknown | DVDEPL... | 336 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt199034 | |
P23497-336-unknown | DVDEPL... | 336 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt199035 | |
P23497-336-unknown | DVDEPL... | 336 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt199036 | |
P23497-336-unknown | DVDEPL... | 336 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt199037 | |
P23497-336-unknown | DVDEPL... | 336 | | inferred from isoform by sequence similarity | unknown | TopFIND | | inferred from TNt199038 | |
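Each N-terminus record pairs a 1-based position with the first six residues found there, so the records can be cross-checked against the sequence. The sketch below does this for the three distinct positions in the table (1, 2 and 336). It assumes the first 360 residues copied from the Sequence section. The names `ROWS`, `N_TERMINI` and `matches` are hypothetical helpers for illustration only.

```python
# First 360 residues, copied row by row from the Sequence section.
ROWS = [
    "MAGGGGDLST RRLNECISPV ANEMNHLPAH SHDLQRMFTE DQGVDDRLLY DIVFKHFKRN",
    "KVEISNAIKK TFPFLEGLRD RDLITNKMFE DSQDSCRNLV PVQRVVYNVL SELEKTFNLP",
    "VLEALFSDVN MQEYPDLIHI YKGFENVIHD KLPLQESEEE EREERSGLQL SLEQGTGENS",
    "FRSLTWPPSG SPSHAGTTPP ENGLSEHPCE TEQINAKRKD TTSDKDDSLG SQQTNEQCAQ",
    "KAEPTESCEQ IAVQVNNGDA GREMPCPLPC DEESPEAELH NHGIQINSCS VRLVDIKKEK",
    "PFSNSKVECQ AQARTHHNQA SDIIVISSED SEGSTDVDEP LEVFISAPRS EPVINNDNPL",
]
# Drop the spaces between ten-residue blocks.
SEQ = "".join("".join(ROWS).split())

# (name, residues as listed in the table, 1-based position)
N_TERMINI = [
    ("P23497-1-unknown", "MAGGGG...", 1),
    ("P23497-2-Acetylation", "AGGGGD...", 2),
    ("P23497-336-unknown", "DVDEPL...", 336),
]


def matches(seq: str, listed: str, position: int) -> bool:
    """Check that the listed residues start at the 1-based position."""
    prefix = listed.rstrip(".")  # remove the trailing "..." from the table
    start = position - 1
    return seq[start:start + len(prefix)] == prefix


for name, listed, pos in N_TERMINI:
    print(name, pos, matches(SEQ, listed, pos))
# P23497-1-unknown 1 True
# P23497-2-Acetylation 2 True
# P23497-336-unknown 336 True
```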
C-termini
| Name | Sequence | Position | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
|---|---|---|---|---|---|---|---|---|
| | ...NGLQRG | 548 | inferred from electronic annotation | unknown | Ensembl | | inferred from ensembl protein ENSP00000416563 | |
| | ...EDDDK | 879 | inferred from electronic annotation | electronic annotation | UniProtKB | | inferred from uniprot | |
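C-terminus records list the residues that end at the given position, so a similar check can be run against the sequence listing. This sketch uses only two fragments copied from the Sequence section: the ten-residue block starting at 541 and the final row starting at 841. The helper `ends_with_at` and the variable names are illustrative assumptions.

```python
def ends_with_at(fragment: str, fragment_start: int, position: int, suffix: str) -> bool:
    """True if `suffix` ends exactly at the 1-based `position`.

    `fragment` is a stretch of the sequence whose first residue sits at
    the 1-based coordinate `fragment_start`.
    """
    end = position - fragment_start + 1  # exclusive slice end inside fragment
    if end < 1 or end > len(fragment):
        raise ValueError("position lies outside the fragment")
    return fragment[:end].endswith(suffix)


# Residues 541-550 and 841-879, copied from the sequence listing.
block_541 = "SKVNGLQRGR"
last_row = "".join("KPNSAKKRVV KAEKSKKKKE EEEDEEDEQE EENEEDDDK".split())

print(ends_with_at(last_row, 841, 879, "EDDDK"))    # True
print(ends_with_at(block_541, 541, 548, "NGLQRG"))  # False
print(ends_with_at(block_541, 541, 549, "NGLQRG"))  # True
```

In this sketch the 879 record agrees with the listed sequence, while `...NGLQRG` ends at residue 549 rather than 548 in the listing. That record is inferred from Ensembl protein ENSP00000416563, so its coordinate may refer to a different isoform. This is a guess, not something the table states.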
Cleavages
Protease | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|
Substrates
Substrate | Position | Sequence | Evidence type | Method | Source (database) | Source (Lab) | Evidence name | Publications (PMIDs) |
---|---|---|---|---|---|---|---|---|