TopFIND 4.0

Q96FV9: THO complex subunit 1

General Information

Protein names
- THO complex subunit 1
- Tho1
- Nuclear matrix protein p84
- p84N5
- hTREX84

Gene names THOC1
Organism Homo sapiens
Protease Family
Protease ID
Chromosome location
UniProt ID Q96FV9

4

N-termini

2

C-termini

1

Cleavages

0

Substrates

Sequence

        10         20         30         40         50         60 
MSPTPPLFSL PEARTRFTKS TREALNNKNI KPLLSTFSQV PGSENEKKCT LDQAFRGILE 
        70         80         90        100        110        120 
EEIINHSSCE NVLAIISLAI GGVTEGICTA STPFVLLGDV LDCLPLDQCD TIFTFVEKNV 
       130        140        150        160        170        180 
ATWKSNTFYS AGKNYLLRMC NDLLRRLSKS QNTVFCGRIQ LFLARLFPLS EKSGLNLQSQ 
       190        200        210        220        230        240 
FNLENVTVFN TNEQESTLGQ KHTEDREEGM DVEEGEMGDE EAPTTCSIPI DYNLYRKFWS 
       250        260        270        280        290        300 
LQDYFRNPVQ CYEKISWKTF LKYSEEVLAV FKSYKLDDTQ ASRKKMEELK TGGEHVYFAK 
       310        320        330        340        350        360 
FLTSEKLMDL QLSDSNFRRH ILLQYLILFQ YLKGQVKFKS SNYVLTDEQS LWIEDTTKSV 
       370        380        390        400        410        420 
YQLLSENPPD GERFSKMVEH ILNTEENWNS WKNEGCPSFV KERTSDTKPT RIIRKRTAPE 
       430        440        450        460        470        480 
DFLGKGPTKK ILMGNEELTR LWNLCPDNME ACKSETREHM PTLEEFFEEA IEQADPENMV 
       490        500        510        520        530        540 
ENEYKAVNNS NYGWRALRLL ARRSPHFFQP TNQQFKSLPE YLENMVIKLA KELPPPSEEI 
       550        560        570        580        590        600 
KTGEDEDEED NDALLKENES PDVRRDKPVT GEQIEVFANK LGEQWKILAP YLEMKDSEIR 
       610        620        630        640        650    
QIECDSEDMK MRAKQLLVAW QDQEGVHATP ENLINALNKS GLSDLAESLT NDNETNS

Isoforms

- Isoform 2 of THO complex subunit 1

Sequence View

        10         20         30         40         50         60 
MSPTPPLFSL PEARTRFTKS TREALNNKNI KPLLSTFSQV PGSENEKKCT LDQAFRGILE 
        70         80         90        100        110        120 
EEIINHSSCE NVLAIISLAI GGVTEGICTA STPFVLLGDV LDCLPLDQCD TIFTFVEKNV 
       130        140        150        160        170        180 
ATWKSNTFYS AGKNYLLRMC NDLLRRLSKS QNTVFCGRIQ LFLARLFPLS EKSGLNLQSQ 
       190        200        210        220        230        240 
FNLENVTVFN TNEQESTLGQ KHTEDREEGM DVEEGEMGDE EAPTTCSIPI DYNLYRKFWS 
       250        260        270        280        290        300 
LQDYFRNPVQ CYEKISWKTF LKYSEEVLAV FKSYKLDDTQ ASRKKMEELK TGGEHVYFAK 
       310        320        330        340        350        360 
FLTSEKLMDL QLSDSNFRRH ILLQYLILFQ YLKGQVKFKS SNYVLTDEQS LWIEDTTKSV 
       370        380        390        400        410        420 
YQLLSENPPD GERFSKMVEH ILNTEENWNS WKNEGCPSFV KERTSDTKPT RIIRKRTAPE 
       430        440        450        460        470        480 
DFLGKGPTKK ILMGNEELTR LWNLCPDNME ACKSETREHM PTLEEFFEEA IEQADPENMV 
       490        500        510        520        530        540 
ENEYKAVNNS NYGWRALRLL ARRSPHFFQP TNQQFKSLPE YLENMVIKLA KELPPPSEEI 
       550        560        570        580        590        600 
KTGEDEDEED NDALLKENES PDVRRDKPVT GEQIEVFANK LGEQWKILAP YLEMKDSEIR 
       610        620        630        640        650    
QIECDSEDMK MRAKQLLVAW QDQEGVHATP ENLINALNKS GLSDLAESLT NDNETNS



Filter Information:


(REFRESH)

Directness:


Physiological Relevance:


Evidence Codes:


Methodology:


Perturbation of System:


Biological System:


Protease Assignment Confidence:


Evidence Names:


Database:


Lab:



Protein Neighborhood

Domains & Features

4 N-termini - 2 C-termini - 1 Cleavages - 0 Substrates

N-termini

C-termini

Cleavages

Substrates

    Substrate Position Sequence Evidence type Method Source (database) Source (Lab) Evidence name Publications (PMIDs)