NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|564333910|ref|XP_006231513|]
View 

peroxisome proliferator-activated receptor gamma coactivator-related protein 1 isoform X1 [Rattus norvegicus]

Protein Classification

RNA-binding protein; RNA-binding protein 43( domain architecture ID 10190348)

RNA-binding protein containing an RNA recognition motif (RRM)| RNA-binding protein 43 (RBM43) is an RNA-binding protein containing an RNA recognition motif (RRM)

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
RRM_PRC cd12624
RNA recognition motif (RRM) found in peroxisome proliferator-activated receptor gamma ...
1516-1606 2.49e-61

RNA recognition motif (RRM) found in peroxisome proliferator-activated receptor gamma coactivator-related protein 1 (PRC) and similar proteins; This subgroup corresponds to the RRM of PRC, also termed PGC-1-related coactivator, one of the members of PGC-1 transcriptional coactivators family, including peroxisome proliferator-activated receptor gamma coactivators PGC-1alpha and PGC-1beta. Unlike PGC-1alpha and PGC-1beta, PRC is ubiquitous and more abundantly expressed in proliferating cells than in growth-arrested cells. PRC has been implicated in the regulation of several metabolic pathways, mitochondrial biogenesis, and cell growth. It functions as a growth-regulated transcriptional cofactor activating many nuclear genes specifying mitochondrial respiratory function. PRC directly interacts with nuclear transcriptional factors implicated in respiratory chain expression including nuclear respiratory factors 1 and 2 (NRF-1 and NRF-2), CREB (cAMP-response element-binding protein), and estrogen-related receptor alpha (ERRalpha). It interacts indirectly with the NRF-2beta subunit through host cell factor (HCF), a cellular protein involved in herpes simplex virus (HSV) infection and cell cycle regulation. Furthermore, like PGC-1alpha and PGC-1beta, PRC can transactivate a number of NRF-dependent nuclear genes required for mitochondrial respiratory function, including those encoding cytochrome c, 5-aminolevulinate synthase, Tfam, and TFB1M, and TFB2M. Further research indicates that PRC may also act as a sensor of metabolic stress that orchestrates a redox-sensitive program of inflammatory gene expression. PRC is a multi-domain protein containing an N-terminal activation domain, an LXXLL coactivator signature, a central proline-rich region, a tetrapeptide motif (DHDY) responsible for HCF binding, a C-terminal arginine/serine-rich (SR) domain, and an RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain).


:

Pssm-ID: 410035 [Multi-domain]  Cd Length: 91  Bit Score: 204.28  E-value: 2.49e-61
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910 1516 ERRVVFIGKIPGRMTRSELKQRFSVFGEIEECTIHFRVQGDNYGFVTYRYAEEAFAAIESGHKLRQADEQPFDLCFGGRR 1595
Cdd:cd12624     1 ERRVVYIGKIRGRMTRSELKDRFSVFGEIEECTIHFREEGDNYGFVTYRYTEDAFAAIENGHKLRRPDELPFDLCFGGRR 80
                          90
                  ....*....|.
gi 564333910 1596 QFCKRSYSDLD 1606
Cdd:cd12624    81 QFCKSSYADLD 91
PHA03247 super family cl33720
large tegument protein UL36; Provisional
631-1116 5.03e-10

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.96  E-value: 5.03e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  631 HLSPADAILANPGTADPAPNGLAPGDPVlvkcRPTDPRRaaaaataaqgPRPSLEPSD------HPKVIT-------PEV 697
Cdd:PHA03247 2480 YRRPAEARFPFAAGAAPDPGGGGPPDPD----APPAPSR----------LAPAILPDEpvgepvHPRMLTwirgleeLAS 2545
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  698 KDAGGP--LKVEGSTSATTQEARPRPlslsEYRQRRRQRQIEA-EDRNSQPPAGKWPSLPETPKGLADIPcLVPSAPTKK 774
Cdd:PHA03247 2546 DDAGDPppPLPPAAPPAAPDRSVPPP----RPAPRPSEPAVTSrARRPDAPPQSARPRAPVDDRGDPRGP-APPSPLPPD 2620
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  775 TAPQRSPIAVPETCSVSVGSNPVSPTPEP-----SASKLMVSTHSEQVSSHEMPLPVRLP--------PPTLQSMA---- 837
Cdd:PHA03247 2621 THAPDPPPPSPSPAANEPDPHPPPTVPPPerprdDPAPGRVSRPRRARRLGRAAQASSPPqrprrraaRPTVGSLTslad 2700
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  838 --PAGPTPST-----VPTPLPFPPSLPPLLPLPASGPGVPSLPPPPLQPPGLPVSMRQMPPDPYTQYAPVPP-------- 902
Cdd:PHA03247 2701 ppPPPPTPEPaphalVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPaapaagpp 2780
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  903 -----------------WPCYPSVSPSGYSCLPPPPAMPiVSGTPGTYAVPPTCNVPWVPPPAPVSPYSSSSAYGSLG-W 964
Cdd:PHA03247 2781 rrltrpavaslsesresLPSPWDPADPPAAVLAPAAALP-PAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVApG 2859
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  965 GPGLQQPPFWSTVSPPPLSSASTGRAVPPPSVEPSNSSAGPPEDTLPVPVTPSLSSRPASPIAPPAEPTKLEAQPVPVSP 1044
Cdd:PHA03247 2860 GDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 564333910 1045 QP--KHKVSPVVQSPQIKVLPTLSAEGVAIGESESERLK-PETQENRPREKPISTATKAVLLPKQSTVAKLPAVH 1116
Cdd:PHA03247 2940 QPplAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRvPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALH 3014
PHA03247 super family cl33720
large tegument protein UL36; Provisional
379-844 1.94e-04

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.47  E-value: 1.94e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  379 DPALQLLMPAMEPGTEDGVPKVAPCPDQELPLSSACLLEPRDIMESLAPKEPRNLPANASEGSQRVPRKGRKKKKKEQPT 458
Cdd:PHA03247 2550 DPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPP 2629
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  459 AclegcTRRLRSSSRGQSTVATEVNSQAGNLQKQPQEELQRETAAPqsrGKPRAWARAWAAALEKTGSEDPEDQNAGQDS 538
Cdd:PHA03247 2630 S-----PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRL---GRAAQASSPPQRPRRRAARPTVGSLTSLADP 2701
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  539 PAEEDVLELRPKLVETSQADPTLSLNDSARADSIPVNSVEgDSTAVDTAVPVDQASSCTElvgSLSGGPILSSPVLDDRT 618
Cdd:PHA03247 2702 PPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAP-PAVPAGPATPGGPARPARP---PTTAGPPAPAPPAAPAA 2777
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  619 GVEPAVAIPTSNHLSPADAILANPGTADPAPNGLAPGDPVLVKCRPTDPRRAAAAATAAQGPRPSLEPSDHPKVITPEVK 698
Cdd:PHA03247 2778 GPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVA 2857
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  699 dAGGPLKVEGST--SATTQEARPRPLSLSEYRQRRRQRQIEAEDRNSQPPAGKWPSLPETPKGLADIPCLVPSAPTKKTA 776
Cdd:PHA03247 2858 -PGGDVRRRPPSrsPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPP 2936
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 564333910  777 PQRSPIAVPETcsvsvgsnpvSPTPEPSASKLMVSTHSEQVSSHEMPLP-VRLPPPTLQSMAPAGPTPS 844
Cdd:PHA03247 2937 PRPQPPLAPTT----------DPAGAGEPSGAVPQPWLGALVPGRVAVPrFRVPQPAPSREAPASSTPP 2995
PHA03247 super family cl33720
large tegument protein UL36; Provisional
978-1362 2.43e-04

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 2.43e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  978 SPPPLSSASTGRA----VPPPS------VEPSNSSAGPPEDTLPVPVTPSLSSRPASPIAPPAEPTKLEAQPVPVSPQPK 1047
Cdd:PHA03247 2574 APRPSEPAVTSRArrpdAPPQSarprapVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPR 2653
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910 1048 HKVSPVVQSPQIKVlptlSAEGVAIGESESERlKPETQENRPREKPISTATKAVLLPKQ------STVAKLPAVHPARLR 1121
Cdd:PHA03247 2654 DDPAPGRVSRPRRA----RRLGRAAQASSPPQ-RPRRRAARPTVGSLTSLADPPPPPPTpepaphALVSATPLPPGPAAA 2728
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910 1122 RLSFLPTPRAQGSEDVVQAFISEIGiEASDLSSLLEQFEKSEAKKECPL--PAPADSVAVGNSGSIDTPQEKKPLD--RL 1197
Cdd:PHA03247 2729 RQASPALPAAPAPPAVPAGPATPGG-PARPARPPTTAGPPAPAPPAAPAagPPRRLTRPAVASLSESRESLPSPWDpaDP 2807
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910 1198 HAPELANVAGLTPPATPPHQLWKPLAAV---SLLAKAKSPKSTAQEGTLKPEGITEAKPPAaaclqegvRGPSPVHVgsg 1274
Cdd:PHA03247 2808 PAAVLAPAAALPPAASPAGPLPPPTSAQptaPPPPPGPPPPSLPLGGSVAPGGDVRRRPPS--------RSPAAKPA--- 2876
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910 1275 dhdycVRSRTPPKKMPALVISEMGSRWNVKRHQditIKPVLSLGSAAPPLPCTTTSQEPLDHRTSSEQADPSAPcFAPST 1354
Cdd:PHA03247 2877 -----APARPPVRRLARPAVSRSTESFALPPDQ---PERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP-LAPTT 2947

                  ....*...
gi 564333910 1355 LLSPEASP 1362
Cdd:PHA03247 2948 DPAGAGEP 2955
 
Name Accession Description Interval E-value
RRM_PRC cd12624
RNA recognition motif (RRM) found in peroxisome proliferator-activated receptor gamma ...
1516-1606 2.49e-61

RNA recognition motif (RRM) found in peroxisome proliferator-activated receptor gamma coactivator-related protein 1 (PRC) and similar proteins; This subgroup corresponds to the RRM of PRC, also termed PGC-1-related coactivator, one of the members of PGC-1 transcriptional coactivators family, including peroxisome proliferator-activated receptor gamma coactivators PGC-1alpha and PGC-1beta. Unlike PGC-1alpha and PGC-1beta, PRC is ubiquitous and more abundantly expressed in proliferating cells than in growth-arrested cells. PRC has been implicated in the regulation of several metabolic pathways, mitochondrial biogenesis, and cell growth. It functions as a growth-regulated transcriptional cofactor activating many nuclear genes specifying mitochondrial respiratory function. PRC directly interacts with nuclear transcriptional factors implicated in respiratory chain expression including nuclear respiratory factors 1 and 2 (NRF-1 and NRF-2), CREB (cAMP-response element-binding protein), and estrogen-related receptor alpha (ERRalpha). It interacts indirectly with the NRF-2beta subunit through host cell factor (HCF), a cellular protein involved in herpes simplex virus (HSV) infection and cell cycle regulation. Furthermore, like PGC-1alpha and PGC-1beta, PRC can transactivate a number of NRF-dependent nuclear genes required for mitochondrial respiratory function, including those encoding cytochrome c, 5-aminolevulinate synthase, Tfam, and TFB1M, and TFB2M. Further research indicates that PRC may also act as a sensor of metabolic stress that orchestrates a redox-sensitive program of inflammatory gene expression. PRC is a multi-domain protein containing an N-terminal activation domain, an LXXLL coactivator signature, a central proline-rich region, a tetrapeptide motif (DHDY) responsible for HCF binding, a C-terminal arginine/serine-rich (SR) domain, and an RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain).


Pssm-ID: 410035 [Multi-domain]  Cd Length: 91  Bit Score: 204.28  E-value: 2.49e-61
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910 1516 ERRVVFIGKIPGRMTRSELKQRFSVFGEIEECTIHFRVQGDNYGFVTYRYAEEAFAAIESGHKLRQADEQPFDLCFGGRR 1595
Cdd:cd12624     1 ERRVVYIGKIRGRMTRSELKDRFSVFGEIEECTIHFREEGDNYGFVTYRYTEDAFAAIENGHKLRRPDELPFDLCFGGRR 80
                          90
                  ....*....|.
gi 564333910 1596 QFCKRSYSDLD 1606
Cdd:cd12624    81 QFCKSSYADLD 91
PHA03247 PHA03247
large tegument protein UL36; Provisional
631-1116 5.03e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.96  E-value: 5.03e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  631 HLSPADAILANPGTADPAPNGLAPGDPVlvkcRPTDPRRaaaaataaqgPRPSLEPSD------HPKVIT-------PEV 697
Cdd:PHA03247 2480 YRRPAEARFPFAAGAAPDPGGGGPPDPD----APPAPSR----------LAPAILPDEpvgepvHPRMLTwirgleeLAS 2545
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  698 KDAGGP--LKVEGSTSATTQEARPRPlslsEYRQRRRQRQIEA-EDRNSQPPAGKWPSLPETPKGLADIPcLVPSAPTKK 774
Cdd:PHA03247 2546 DDAGDPppPLPPAAPPAAPDRSVPPP----RPAPRPSEPAVTSrARRPDAPPQSARPRAPVDDRGDPRGP-APPSPLPPD 2620
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  775 TAPQRSPIAVPETCSVSVGSNPVSPTPEP-----SASKLMVSTHSEQVSSHEMPLPVRLP--------PPTLQSMA---- 837
Cdd:PHA03247 2621 THAPDPPPPSPSPAANEPDPHPPPTVPPPerprdDPAPGRVSRPRRARRLGRAAQASSPPqrprrraaRPTVGSLTslad 2700
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  838 --PAGPTPST-----VPTPLPFPPSLPPLLPLPASGPGVPSLPPPPLQPPGLPVSMRQMPPDPYTQYAPVPP-------- 902
Cdd:PHA03247 2701 ppPPPPTPEPaphalVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPaapaagpp 2780
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  903 -----------------WPCYPSVSPSGYSCLPPPPAMPiVSGTPGTYAVPPTCNVPWVPPPAPVSPYSSSSAYGSLG-W 964
Cdd:PHA03247 2781 rrltrpavaslsesresLPSPWDPADPPAAVLAPAAALP-PAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVApG 2859
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  965 GPGLQQPPFWSTVSPPPLSSASTGRAVPPPSVEPSNSSAGPPEDTLPVPVTPSLSSRPASPIAPPAEPTKLEAQPVPVSP 1044
Cdd:PHA03247 2860 GDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 564333910 1045 QP--KHKVSPVVQSPQIKVLPTLSAEGVAIGESESERLK-PETQENRPREKPISTATKAVLLPKQSTVAKLPAVH 1116
Cdd:PHA03247 2940 QPplAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRvPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALH 3014
RRM smart00360
RNA recognition motif;
1519-1580 8.50e-09

RNA recognition motif;


Pssm-ID: 214636 [Multi-domain]  Cd Length: 73  Bit Score: 53.75  E-value: 8.50e-09
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 564333910   1519 VVFIGKIPGRMTRSELKQRFSVFGEIEECTIHF---RVQGDNYGFVTYRYAEEAFAAIES-------GHKLR 1580
Cdd:smart00360    1 TLFVGNLPPDTTEEELRELFSKFGKVESVRLVRdkeTGKSKGFAFVEFESEEDAEKALEAlngkeldGRPLK 72
RRM_1 pfam00076
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic ...
1520-1581 2.06e-07

RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have an N terminal rrm which is included in the seed. There is a second region towards the C terminus that has some features characteriztic of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.


Pssm-ID: 425453 [Multi-domain]  Cd Length: 70  Bit Score: 49.54  E-value: 2.06e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 564333910  1520 VFIGKIPGRMTRSELKQRFSVFGEIEECTIHFRVQGDN--YGFVTYRYAEEAFAAIES--GHKLRQ 1581
Cdd:pfam00076    1 LFVGNLPPDTTEEDLKDLFSKFGPIKSIRLVRDETGRSkgFAFVEFEDEEDAEKAIEAlnGKELGG 66
PHA03247 PHA03247
large tegument protein UL36; Provisional
379-844 1.94e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.47  E-value: 1.94e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  379 DPALQLLMPAMEPGTEDGVPKVAPCPDQELPLSSACLLEPRDIMESLAPKEPRNLPANASEGSQRVPRKGRKKKKKEQPT 458
Cdd:PHA03247 2550 DPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPP 2629
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  459 AclegcTRRLRSSSRGQSTVATEVNSQAGNLQKQPQEELQRETAAPqsrGKPRAWARAWAAALEKTGSEDPEDQNAGQDS 538
Cdd:PHA03247 2630 S-----PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRL---GRAAQASSPPQRPRRRAARPTVGSLTSLADP 2701
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  539 PAEEDVLELRPKLVETSQADPTLSLNDSARADSIPVNSVEgDSTAVDTAVPVDQASSCTElvgSLSGGPILSSPVLDDRT 618
Cdd:PHA03247 2702 PPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAP-PAVPAGPATPGGPARPARP---PTTAGPPAPAPPAAPAA 2777
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  619 GVEPAVAIPTSNHLSPADAILANPGTADPAPNGLAPGDPVLVKCRPTDPRRAAAAATAAQGPRPSLEPSDHPKVITPEVK 698
Cdd:PHA03247 2778 GPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVA 2857
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  699 dAGGPLKVEGST--SATTQEARPRPLSLSEYRQRRRQRQIEAEDRNSQPPAGKWPSLPETPKGLADIPCLVPSAPTKKTA 776
Cdd:PHA03247 2858 -PGGDVRRRPPSrsPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPP 2936
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 564333910  777 PQRSPIAVPETcsvsvgsnpvSPTPEPSASKLMVSTHSEQVSSHEMPLP-VRLPPPTLQSMAPAGPTPS 844
Cdd:PHA03247 2937 PRPQPPLAPTT----------DPAGAGEPSGAVPQPWLGALVPGRVAVPrFRVPQPAPSREAPASSTPP 2995
PHA03247 PHA03247
large tegument protein UL36; Provisional
978-1362 2.43e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 2.43e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  978 SPPPLSSASTGRA----VPPPS------VEPSNSSAGPPEDTLPVPVTPSLSSRPASPIAPPAEPTKLEAQPVPVSPQPK 1047
Cdd:PHA03247 2574 APRPSEPAVTSRArrpdAPPQSarprapVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPR 2653
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910 1048 HKVSPVVQSPQIKVlptlSAEGVAIGESESERlKPETQENRPREKPISTATKAVLLPKQ------STVAKLPAVHPARLR 1121
Cdd:PHA03247 2654 DDPAPGRVSRPRRA----RRLGRAAQASSPPQ-RPRRRAARPTVGSLTSLADPPPPPPTpepaphALVSATPLPPGPAAA 2728
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910 1122 RLSFLPTPRAQGSEDVVQAFISEIGiEASDLSSLLEQFEKSEAKKECPL--PAPADSVAVGNSGSIDTPQEKKPLD--RL 1197
Cdd:PHA03247 2729 RQASPALPAAPAPPAVPAGPATPGG-PARPARPPTTAGPPAPAPPAAPAagPPRRLTRPAVASLSESRESLPSPWDpaDP 2807
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910 1198 HAPELANVAGLTPPATPPHQLWKPLAAV---SLLAKAKSPKSTAQEGTLKPEGITEAKPPAaaclqegvRGPSPVHVgsg 1274
Cdd:PHA03247 2808 PAAVLAPAAALPPAASPAGPLPPPTSAQptaPPPPPGPPPPSLPLGGSVAPGGDVRRRPPS--------RSPAAKPA--- 2876
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910 1275 dhdycVRSRTPPKKMPALVISEMGSRWNVKRHQditIKPVLSLGSAAPPLPCTTTSQEPLDHRTSSEQADPSAPcFAPST 1354
Cdd:PHA03247 2877 -----APARPPVRRLARPAVSRSTESFALPPDQ---PERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP-LAPTT 2947

                  ....*...
gi 564333910 1355 LLSPEASP 1362
Cdd:PHA03247 2948 DPAGAGEP 2955
ELAV_HUD_SF TIGR01661
ELAV/HuD family splicing factor; This model describes the ELAV/HuD subfamily of splicing ...
1515-1580 2.86e-04

ELAV/HuD family splicing factor; This model describes the ELAV/HuD subfamily of splicing factors found in metazoa. HuD stands for the human paraneoplastic encephalomyelitis antigen D of which there are 4 variants in human. ELAV stnds for the Drosophila Embryonic lethal abnormal visual protein. ELAV-like splicing factors are also known in human as HuB (ELAV-like protein 2), HuC (ELAV-like protein 3, Paraneoplastic cerebellar degeneration-associated antigen) and HuR (ELAV-like protein 1). These genes are most closely related to the sex-lethal subfamily of splicing factors found in Dipteran insects (TIGR01659). These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).


Pssm-ID: 273741 [Multi-domain]  Cd Length: 352  Bit Score: 44.93  E-value: 2.86e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 564333910  1515 EERRVVFIGKIPGRMTRSELKQRFSVFGEIEECT-IHFRVQGDN--YGFVTYRYAEEAFAAIESGHKLR 1580
Cdd:TIGR01661    1 ESKTNLIVNYLPQTMTQEEIRSLFTSIGEIESCKlVRDKVTGQSlgYGFVNYVRPEDAEKAVNSLNGLR 69
RRM COG0724
RNA recognition motif (RRM) domain [Translation, ribosomal structure and biogenesis];
1521-1595 3.04e-04

RNA recognition motif (RRM) domain [Translation, ribosomal structure and biogenesis];


Pssm-ID: 440488 [Multi-domain]  Cd Length: 85  Bit Score: 41.24  E-value: 3.04e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910 1521 FIGKIPGRMTRSELKQRFSVFGEIEECTI---HF--RVQGdnYGFVTYRYAEEAFAAIEsghKLRQADeqpfdlcFGGRR 1595
Cdd:COG0724     5 YVGNLPYSVTEEDLRELFSEYGEVTSVKLitdREtgRSRG--FGFVEMPDDEEAQAAIE---ALNGAE-------LMGRT 72
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
707-1050 2.61e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.45  E-value: 2.61e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910   707 EGSTSATTQEARPRPLSLSEYRQRRRQRQIEAEDRNSQPPAGKWPSLPETPKGLADIPCLVPSAPTKKTAP--------Q 778
Cdd:pfam03154  157 DSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPhtliqqtpT 236
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910   779 RSPIAVPETCSVSVGSNPVSPTPEPSASKLMVSTHSEQVSSheMPLPVRLPPPTLQSMAPAGPTPSTVPTPLPFPPSLPP 858
Cdd:pfam03154  237 LHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPP--MPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPS 314
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910   859 LLPLPASGPGVPSLPPPPLQPPGLPVSMRQMPPDPYTQ-------YAPVPPWPCYPS-------VSPSGY---SCLPPPP 921
Cdd:pfam03154  315 PAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMphikpppTTPIPQLPNPQShkhpphlSGPSPFqmnSNLPPPP 394
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910   922 AM----------------PIVSGTPGTYAVPPTCNVPWVPPPAPVSPYSSSSAYGSLGWGPGLQQPPFWSTVSPPPLSSA 985
Cdd:pfam03154  395 ALkplsslsthhppsahpPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPP 474
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 564333910   986 STGRAVPPPSVEPSNSSAGPPED-----TLPVPVTPSLSSRPAS-PIAPPAEPTKLEAQPVPV-SPQPKHKV 1050
Cdd:pfam03154  475 ITPPSGPPTSTSSAMPGIQPPSSasvssSGPVPAAVSCPLPPVQiKEEALDEAEEPESPPPPPrSPSPEPTV 546
 
Name Accession Description Interval E-value
RRM_PRC cd12624
RNA recognition motif (RRM) found in peroxisome proliferator-activated receptor gamma ...
1516-1606 2.49e-61

RNA recognition motif (RRM) found in peroxisome proliferator-activated receptor gamma coactivator-related protein 1 (PRC) and similar proteins; This subgroup corresponds to the RRM of PRC, also termed PGC-1-related coactivator, one of the members of PGC-1 transcriptional coactivators family, including peroxisome proliferator-activated receptor gamma coactivators PGC-1alpha and PGC-1beta. Unlike PGC-1alpha and PGC-1beta, PRC is ubiquitous and more abundantly expressed in proliferating cells than in growth-arrested cells. PRC has been implicated in the regulation of several metabolic pathways, mitochondrial biogenesis, and cell growth. It functions as a growth-regulated transcriptional cofactor activating many nuclear genes specifying mitochondrial respiratory function. PRC directly interacts with nuclear transcriptional factors implicated in respiratory chain expression including nuclear respiratory factors 1 and 2 (NRF-1 and NRF-2), CREB (cAMP-response element-binding protein), and estrogen-related receptor alpha (ERRalpha). It interacts indirectly with the NRF-2beta subunit through host cell factor (HCF), a cellular protein involved in herpes simplex virus (HSV) infection and cell cycle regulation. Furthermore, like PGC-1alpha and PGC-1beta, PRC can transactivate a number of NRF-dependent nuclear genes required for mitochondrial respiratory function, including those encoding cytochrome c, 5-aminolevulinate synthase, Tfam, and TFB1M, and TFB2M. Further research indicates that PRC may also act as a sensor of metabolic stress that orchestrates a redox-sensitive program of inflammatory gene expression. PRC is a multi-domain protein containing an N-terminal activation domain, an LXXLL coactivator signature, a central proline-rich region, a tetrapeptide motif (DHDY) responsible for HCF binding, a C-terminal arginine/serine-rich (SR) domain, and an RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain).


Pssm-ID: 410035 [Multi-domain]  Cd Length: 91  Bit Score: 204.28  E-value: 2.49e-61
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910 1516 ERRVVFIGKIPGRMTRSELKQRFSVFGEIEECTIHFRVQGDNYGFVTYRYAEEAFAAIESGHKLRQADEQPFDLCFGGRR 1595
Cdd:cd12624     1 ERRVVYIGKIRGRMTRSELKDRFSVFGEIEECTIHFREEGDNYGFVTYRYTEDAFAAIENGHKLRRPDELPFDLCFGGRR 80
                          90
                  ....*....|.
gi 564333910 1596 QFCKRSYSDLD 1606
Cdd:cd12624    81 QFCKSSYADLD 91
RRM_PPARGC1A_like cd12357
RNA recognition motif (RRM) found in the peroxisome proliferator-activated receptor gamma ...
1516-1606 2.79e-54

RNA recognition motif (RRM) found in the peroxisome proliferator-activated receptor gamma coactivator 1A (PGC-1alpha) family of regulated coactivators; This subfamily corresponds to the RRM of PGC-1alpha, PGC-1beta, and PGC-1-related coactivator (PRC), which serve as mediators between environmental or endogenous signals and the transcriptional machinery governing mitochondrial biogenesis. They play an important integrative role in the control of respiratory gene expression through interacting with a number of transcription factors, such as NRF-1, NRF-2, ERR, CREB and YY1. All family members are multi-domain proteins containing the N-terminal activation domain, an LXXLL coactivator signature, a tetrapeptide motif (DHDY) responsible for HCF binding, and an RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). In contrast to PGC-1alpha and PRC, PGC-1beta possesses two glutamic/aspartic acid-rich acidic domains, but lacks most of the arginine/serine (SR)-rich domain that is responsible for the regulation of RNA processing.


Pssm-ID: 409793 [Multi-domain]  Cd Length: 91  Bit Score: 183.78  E-value: 2.79e-54
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910 1516 ERRVVFIGKIPGRMTRSELKQRFSVFGEIEECTIHFRVQGDNYGFVTYRYAEEAFAAIESGHKLRQADEQPFDLCFGGRR 1595
Cdd:cd12357     1 ERRVVYVGKLEQDTTRSELRRRFEVFGEIEECTVHFRERGDKYGFVTYRYSEDAFLALENGHDLRKRNEPMFDLSFGGRR 80
                          90
                  ....*....|.
gi 564333910 1596 QFCKRSYSDLD 1606
Cdd:cd12357    81 AFCKSSYADLD 91
RRM_PPARGC1A cd12623
RNA recognition motif (RRM) found in peroxisome proliferator-activated receptor gamma ...
1516-1606 3.18e-44

RNA recognition motif (RRM) found in peroxisome proliferator-activated receptor gamma coactivator 1-alpha (PGC-1alpha, or PPARGC-1-alpha) and similar proteins; This subgroup corresponds to the RRM of PGC-1alpha, also termed PPARGC-1-alpha, or ligand effect modulator 6, a member of a family of transcription coactivators that plays a central role in the regulation of cellular energy metabolism. As an inducible transcription coactivator, PGC-1alpha can interact with a broad range of transcription factors involved in a wide variety of biological responses, such as adaptive thermogenesis, skeletal muscle fiber type switching, glucose/fatty acid metabolism, and heart development. PGC-1alpha stimulates mitochondrial biogenesis and promotes oxidative metabolism. It participates in the regulation of both carbohydrate and lipid metabolism and plays a role in disorders such as obesity, diabetes, and cardiomyopathy. PGC-1alpha is a multi-domain protein containing an N-terminal activation domain region, a central region involved in the interaction with at least a nuclear receptor, and a C-terminal domain region. The N-terminal domain region consists of three leucine-rich motifs (L1, NR box 2 and 3), among which the two last are required for interaction with nuclear receptors, potential nuclear localization signals (NLS), and a proline-rich region overlapping a putative repression domain. The C-terminus of PGC-1alpha is composed of two arginine/serine-rich regions (SR domains), a putative dimerization domain, and an RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain). PGC-1alpha could interact favorably with single-stranded RNA.


Pssm-ID: 410034 [Multi-domain]  Cd Length: 91  Bit Score: 155.43  E-value: 3.18e-44
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910 1516 ERRVVFIGKIPGRMTRSELKQRFSVFGEIEECTIHFRVQGDNYGFVTYRYAEEAFAAIESGHKLRQADEQPFDLCFGGRR 1595
Cdd:cd12623     1 ERRVIYVGKIRPDITRTELKRRFEVFGEIEECTVNLRDDGDSYGFITYRYTCDAFAALENGYTLRRSNEPDFELCFGGRK 80
                          90
                  ....*....|.
gi 564333910 1596 QFCKRSYSDLD 1606
Cdd:cd12623    81 QFCKSNYADLD 91
RRM_PPARGC1B cd12356
RNA recognition motif (RRM) found in peroxisome proliferator-activated receptor gamma ...
1511-1606 7.95e-27

RNA recognition motif (RRM) found in peroxisome proliferator-activated receptor gamma coactivator 1-beta (PGC-1-beta) and similar proteins; This subfamily corresponds to the RRM of PGC-1beta, also termed PPAR-gamma coactivator 1-beta, or PPARGC-1-beta, or PGC-1-related estrogen receptor alpha coactivator, which is one of the members of PGC-1 transcriptional coactivators family, including PGC-1alpha and PGC-1-related coactivator (PRC). PGC-1beta plays a nonredundant role in controlling mitochondrial oxidative energy metabolism and affects both, insulin sensitivity and mitochondrial biogenesis, and functions in a number of oxidative tissues. It is involved in maintaining baseline mitochondrial function and cardiac contractile function following pressure overload hypertrophy by preserving glucose metabolism and preventing oxidative stress. PGC-1beta induces hypertriglyceridemia in response to dietary fats through activating hepatic lipogenesis and lipoprotein secretion. It can stimulate apolipoprotein C3 (APOC3) expression, further mediating hypolipidemic effect of nicotinic acid. PGC-1beta also drives nuclear respiratory factor 1 (NRF-1) target gene expression and NRF-1 and estrogen related receptor alpha (ERRalpha)-dependent mitochondrial biogenesis. The modulation of the expression of PGC-1beta can trigger ERRalpha-induced adipogenesis. PGC-1beta is also a potent regulator inducing angiogenesis in skeletal muscle. The transcriptional activity of PGC-1beta can be increased through binding to host cell factor (HCF), a cellular protein involved in herpes simplex virus (HSV) infection and cell cycle regulation. PGC-1beta is a multi-domain protein containing an N-terminal activation domain, an LXXLL coactivator signature, a tetrapeptide motif (DHDY) responsible for HCF binding, two glutamic/aspartic acid-rich acidic domains, and an RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). In contrast to PGC-1alpha, PGC-1beta lacks most of the arginine/serine (SR)-rich domain that is responsible for the regulation of RNA processing.


Pssm-ID: 409792 [Multi-domain]  Cd Length: 97  Bit Score: 105.81  E-value: 7.95e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910 1511 ERAIEERRVVFIGKIPGRMTRSELKQRFSVFGEIEECTIHFRV-QGDNYGFVTYRYAEEAFAAIESGHKLRQADEQPFDL 1589
Cdd:cd12356     1 EKAIGEGRVVYIRNLSSSMSSNELKRRFEVFGEITECCVLSRSnRGEKYGFITYRDSEHAALSLQKGASLRKRNEPSFQL 80
                          90
                  ....*....|....*..
gi 564333910 1590 CFGGRRQFCKRSYSDLD 1606
Cdd:cd12356    81 SYGGLRHFFWTRYTDLD 97
RRM_SF cd00590
RNA recognition motif (RRM) superfamily; RRM, also known as RBD (RNA binding domain) or RNP ...
1520-1577 1.57e-10

RNA recognition motif (RRM) superfamily; RRM, also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).


Pssm-ID: 409669 [Multi-domain]  Cd Length: 72  Bit Score: 58.45  E-value: 1.57e-10
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIEECTIHFRVQGDN--YGFVTYRYAEEAFAAIESGH 1577
Cdd:cd00590     1 LFVGNLPPDTTEEDLRELFSKFGEVVSVRIVRDRDGKSkgFAFVEFESPEDAEKALEALN 60
PHA03247 PHA03247
large tegument protein UL36; Provisional
631-1116 5.03e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.96  E-value: 5.03e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  631 HLSPADAILANPGTADPAPNGLAPGDPVlvkcRPTDPRRaaaaataaqgPRPSLEPSD------HPKVIT-------PEV 697
Cdd:PHA03247 2480 YRRPAEARFPFAAGAAPDPGGGGPPDPD----APPAPSR----------LAPAILPDEpvgepvHPRMLTwirgleeLAS 2545
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  698 KDAGGP--LKVEGSTSATTQEARPRPlslsEYRQRRRQRQIEA-EDRNSQPPAGKWPSLPETPKGLADIPcLVPSAPTKK 774
Cdd:PHA03247 2546 DDAGDPppPLPPAAPPAAPDRSVPPP----RPAPRPSEPAVTSrARRPDAPPQSARPRAPVDDRGDPRGP-APPSPLPPD 2620
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  775 TAPQRSPIAVPETCSVSVGSNPVSPTPEP-----SASKLMVSTHSEQVSSHEMPLPVRLP--------PPTLQSMA---- 837
Cdd:PHA03247 2621 THAPDPPPPSPSPAANEPDPHPPPTVPPPerprdDPAPGRVSRPRRARRLGRAAQASSPPqrprrraaRPTVGSLTslad 2700
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  838 --PAGPTPST-----VPTPLPFPPSLPPLLPLPASGPGVPSLPPPPLQPPGLPVSMRQMPPDPYTQYAPVPP-------- 902
Cdd:PHA03247 2701 ppPPPPTPEPaphalVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPaapaagpp 2780
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  903 -----------------WPCYPSVSPSGYSCLPPPPAMPiVSGTPGTYAVPPTCNVPWVPPPAPVSPYSSSSAYGSLG-W 964
Cdd:PHA03247 2781 rrltrpavaslsesresLPSPWDPADPPAAVLAPAAALP-PAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVApG 2859
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  965 GPGLQQPPFWSTVSPPPLSSASTGRAVPPPSVEPSNSSAGPPEDTLPVPVTPSLSSRPASPIAPPAEPTKLEAQPVPVSP 1044
Cdd:PHA03247 2860 GDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 564333910 1045 QP--KHKVSPVVQSPQIKVLPTLSAEGVAIGESESERLK-PETQENRPREKPISTATKAVLLPKQSTVAKLPAVH 1116
Cdd:PHA03247 2940 QPplAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRvPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALH 3014
RRM smart00360
RNA recognition motif;
1519-1580 8.50e-09

RNA recognition motif;


Pssm-ID: 214636 [Multi-domain]  Cd Length: 73  Bit Score: 53.75  E-value: 8.50e-09
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 564333910   1519 VVFIGKIPGRMTRSELKQRFSVFGEIEECTIHF---RVQGDNYGFVTYRYAEEAFAAIES-------GHKLR 1580
Cdd:smart00360    1 TLFVGNLPPDTTEEELRELFSKFGKVESVRLVRdkeTGKSKGFAFVEFESEEDAEKALEAlngkeldGRPLK 72
RRM1_2_CELF1-6_like cd12361
RNA recognition motif 1 (RRM1) and 2 (RRM2) found in CELF/Bruno-like family of RNA binding ...
1521-1577 1.05e-07

RNA recognition motif 1 (RRM1) and 2 (RRM2) found in CELF/Bruno-like family of RNA binding proteins and plant flowering time control protein FCA; This subfamily corresponds to the RRM1 and RRM2 domains of the CUGBP1 and ETR-3-like factors (CELF) as well as plant flowering time control protein FCA. CELF, also termed BRUNOL (Bruno-like) proteins, is a family of structurally related RNA-binding proteins involved in regulation of pre-mRNA splicing in the nucleus, and control of mRNA translation and deadenylation in the cytoplasm. The family contains six members: CELF-1 (also known as BRUNOL-2, CUG-BP1, NAPOR, EDEN-BP), CELF-2 (also known as BRUNOL-3, ETR-3, CUG-BP2, NAPOR-2), CELF-3 (also known as BRUNOL-1, TNRC4, ETR-1, CAGH4, ER DA4), CELF-4 (BRUNOL-4), CELF-5 (BRUNOL-5) and CELF-6 (BRUNOL-6). They all contain three highly conserved RNA recognition motifs (RRMs), also known as RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains): two consecutive RRMs (RRM1 and RRM2) situated in the N-terminal region followed by a linker region and the third RRM (RRM3) close to the C-terminus of the protein. The low sequence conservation of the linker region is highly suggestive of a large variety in the co-factors that associate with the various CELF family members. Based on both, sequence similarity and function, the CELF family can be divided into two subfamilies, the first containing CELFs 1 and 2, and the second containing CELFs 3, 4, 5, and 6. The different CELF proteins may act through different sites on at least some substrates. Furthermore, CELF proteins may interact with each other in varying combinations to influence alternative splicing in different contexts. This subfamily also includes plant flowering time control protein FCA that functions in the posttranscriptional regulation of transcripts involved in the flowering process. FCA contains two RRMs, and a WW protein interaction domain.


Pssm-ID: 409796 [Multi-domain]  Cd Length: 77  Bit Score: 50.70  E-value: 1.05e-07
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 564333910 1521 FIGKIPGRMTRSELKQRFSVFGEIEECTIhfrVQGDNYG------FVTYRYAEEAFAAIESGH 1577
Cdd:cd12361     3 FVGMIPKTASEEDVRPLFEQFGNIEEVQI---LRDKQTGqskgcaFVTFSTREEALRAIEALH 62
RRM_DAZL_BOULE cd12412
RNA recognition motif (RRM) found in AZoospermia (DAZ) autosomal homologs, DAZL (DAZ-like) and ...
1520-1580 1.06e-07

RNA recognition motif (RRM) found in AZoospermia (DAZ) autosomal homologs, DAZL (DAZ-like) and BOULE; This subfamily corresponds to the RRM domain of two Deleted in AZoospermia (DAZ) autosomal homologs, DAZL (DAZ-like) and BOULE. BOULE is the founder member of the family and DAZL arose from BOULE in an ancestor of vertebrates. The DAZ gene subsequently originated from a duplication transposition of the DAZL gene. Invertebrates contain a single DAZ homolog, BOULE, while vertebrates, other than catarrhine primates, possess both BOULE and DAZL genes. The catarrhine primates possess BOULE, DAZL, and DAZ genes. The family members encode closely related RNA-binding proteins that are required for fertility in numerous organisms. These proteins contain an RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and a varying number of copies of a DAZ motif, believed to mediate protein-protein interactions. DAZL and BOULE contain a single copy of the DAZ motif, while DAZ proteins can contain 8-24 copies of this repeat. Although their specific biochemical functions remain to be investigated, DAZL proteins may interact with poly(A)-binding proteins (PABPs), and act as translational activators of specific mRNAs during gametogenesis.


Pssm-ID: 409846 [Multi-domain]  Cd Length: 81  Bit Score: 50.69  E-value: 1.06e-07
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIEECTIHFRVQGDN--YGFVTYRYAEEAFAAIESGHKLR 1580
Cdd:cd12412     5 IFVGGIDWDTTEEELREFFSKFGKVKDVKIIKDRAGVSkgYGFVTFETQEDAEKIQKWGANLV 67
RRM_1 pfam00076
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic ...
1520-1581 2.06e-07

RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have an N terminal rrm which is included in the seed. There is a second region towards the C terminus that has some features characteriztic of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.


Pssm-ID: 425453 [Multi-domain]  Cd Length: 70  Bit Score: 49.54  E-value: 2.06e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 564333910  1520 VFIGKIPGRMTRSELKQRFSVFGEIEECTIHFRVQGDN--YGFVTYRYAEEAFAAIES--GHKLRQ 1581
Cdd:pfam00076    1 LFVGNLPPDTTEEDLKDLFSKFGPIKSIRLVRDETGRSkgFAFVEFEDEEDAEKAIEAlnGKELGG 66
RRM2_NsCP33_like cd21608
RNA recognition motif 2 (RRM2) found in Nicotiana sylvestris chloroplastic 33 kDa ...
1520-1574 6.30e-07

RNA recognition motif 2 (RRM2) found in Nicotiana sylvestris chloroplastic 33 kDa ribonucleoprotein (NsCP33) and similar proteins; The family includes NsCP33, Arabidopsis thaliana chloroplastic 31 kDa ribonucleoprotein (CP31A) and mitochondrial glycine-rich RNA-binding protein 2 (AtGR-RBP2). NsCP33 may be involved in splicing and/or processing of chloroplast RNA's. AtCP31A, also called RNA-binding protein 1/2/3 (AtRBP33), or RNA-binding protein CP31A, or RNA-binding protein RNP-T, or RNA-binding protein cp31, is required for specific RNA editing events in chloroplasts and stabilizes specific chloroplast mRNAs, as well as for normal chloroplast development under cold stress conditions by stabilizing transcripts of numerous mRNAs under these conditions. CP31A may modulate telomere replication through RNA binding domains. AtGR-RBP2, also called AtRBG2, or glycine-rich protein 2 (AtGRP2), or mitochondrial RNA-binding protein 1a (At-mRBP1a), plays a role in RNA transcription or processing during stress. It binds RNAs and DNAs sequence with a preference to single-stranded nucleic acids. AtGR-RBP2 displays strong affinity to poly(U) sequence. It exerts cold and freezing tolerance, probably by exhibiting an RNA chaperone activity during the cold and freezing adaptation process. Some members in this family contain two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). The model corresponds to the second RRM motif.


Pssm-ID: 410187 [Multi-domain]  Cd Length: 76  Bit Score: 48.32  E-value: 6.30e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIEECT-IHFRVQGDN--YGFVTYRYAEEAFAAIE 1574
Cdd:cd21608     2 LYVGNLSWDTTEDDLRDLFSEFGEVESAKvITDRETGRSrgFGFVTFSTAEAAEAAID 59
RRM2_FCA cd12637
RNA recognition motif 2 (RRM2) found in plant flowering time control protein FCA and similar ...
1521-1575 7.33e-07

RNA recognition motif 2 (RRM2) found in plant flowering time control protein FCA and similar proteins; This subgroup corresponds to the RRM2 of FCA, a gene controlling flowering time in Arabidopsis, which encodes a flowering time control protein that functions in the posttranscriptional regulation of transcripts involved in the flowering process. The flowering time control protein FCA contains two RNA recognition motifs (RRMs), also known as RBDs (RNA binding domains) or RNP (ribonucleoprotein domains), and a WW protein interaction domain.


Pssm-ID: 410045 [Multi-domain]  Cd Length: 81  Bit Score: 48.53  E-value: 7.33e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 564333910 1521 FIGKIPGRMTRSELKQRFSVFGEIEECTI---HFRVQGDNYGFVTYRYAEEAFAAIES 1575
Cdd:cd12637     3 FVGSLPKTATEQEVRDLFEAYGEVEEVYLmkdPVTQQGTGCAFVKFAYKEEALAAIRS 60
RRM_RBM24_RBM38_like cd12384
RNA recognition motif (RRM) found in eukaryotic RNA-binding protein RBM24, RBM38 and similar ...
1520-1574 1.05e-06

RNA recognition motif (RRM) found in eukaryotic RNA-binding protein RBM24, RBM38 and similar proteins; This subfamily corresponds to the RRM of RBM24 and RBM38 from vertebrate, SUPpressor family member SUP-12 from Caenorhabditis elegans and similar proteins. Both, RBM24 and RBM38, are preferentially expressed in cardiac and skeletal muscle tissues. They regulate myogenic differentiation by controlling the cell cycle in a p21-dependent or -independent manner. RBM24, also termed RNA-binding region-containing protein 6, interacts with the 3'-untranslated region (UTR) of myogenin mRNA and regulates its stability in C2C12 cells. RBM38, also termed CLL-associated antigen KW-5, or HSRNASEB, or RNA-binding region-containing protein 1(RNPC1), or ssDNA-binding protein SEB4, is a direct target of the p53 family. It is required for maintaining the stability of the basal and stress-induced p21 mRNA by binding to their 3'-UTRs. It also binds the AU-/U-rich elements in p63 3'-UTR and regulates p63 mRNA stability and activity. SUP-12 is a novel tissue-specific splicing factor that controls muscle-specific splicing of the ADF/cofilin pre-mRNA in C. elegans. All family members contain a conserved RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain).


Pssm-ID: 409818 [Multi-domain]  Cd Length: 76  Bit Score: 47.75  E-value: 1.05e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIEEC-TIHFRVQGDN--YGFVTYRYAEEAFAAIE 1574
Cdd:cd12384     3 IFVGGLPYHTTDDSLREYFEQFGEIEEAvVITDRQTGKSrgYGFVTMADREAAERACK 60
RRM2_RIM4_like cd12454
RNA recognition motif 2 (RRM2) found in yeast meiotic activator RIM4 and similar proteins; ...
1516-1575 2.03e-06

RNA recognition motif 2 (RRM2) found in yeast meiotic activator RIM4 and similar proteins; This subfamily corresponds to the RRM2 of RIM4, also termed regulator of IME2 protein 4, a putative RNA binding protein that is expressed at elevated levels early in meiosis. It functions as a meiotic activator required for both the IME1- and IME2-dependent pathways of meiotic gene expression, as well as early events of meiosis, such as meiotic division and recombination, in Saccharomyces cerevisiae. RIM4 contains two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). The family also includes a putative RNA-binding protein termed multicopy suppressor of sporulation protein Msa1. It is a putative RNA-binding protein encoded by a novel gene, msa1, from the fission yeast Schizosaccharomyces pombe. Msa1 may be involved in the inhibition of sexual differentiation by controlling the expression of Ste11-regulated genes, possibly through the pheromone-signaling pathway. Like RIM4, Msa1 also contains two RRMs, both of which are essential for the function of Msa1.


Pssm-ID: 409888 [Multi-domain]  Cd Length: 80  Bit Score: 47.08  E-value: 2.03e-06
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 564333910 1516 ERRVVFIGKIPGRMTRSELKQRFSVFGEIEECTIHFRVQGDN-YGFVTYRYAEEAFAAIES 1575
Cdd:cd12454     2 DKLSIFVGQLDPKTTDSELFRRFSKYGKIVDCKLIKRPEPVNaFAFLRFESEEAAEAAVEE 62
RRM2_CELF3_4_5_6 cd12635
RNA recognition motif 2 (RRM2) found in CUGBP Elav-like family member CELF-3, CELF-4, CELF-5, ...
1518-1577 3.11e-06

RNA recognition motif 2 (RRM2) found in CUGBP Elav-like family member CELF-3, CELF-4, CELF-5, CELF-6 and similar proteins; This subgroup corresponds to the RRM2 of CELF-3, CELF-4, CELF-5, and CELF-6, all of which belong to the CUGBP1 and ETR-3-like factors (CELF) or BRUNOL (Bruno-like) family of RNA-binding proteins that display dual nuclear and cytoplasmic localizations and have been implicated in the regulation of pre-mRNA splicing and in the control of mRNA translation and deadenylation. CELF-3, expressed in brain and testis only, is also known as bruno-like protein 1 (BRUNOL-1), or CAG repeat protein 4, or CUG-BP- and ETR-3-like factor 3, or embryonic lethal abnormal vision (ELAV)-type RNA-binding protein 1 (ETR-1), or expanded repeat domain protein CAG/CTG 4, or trinucleotide repeat-containing gene 4 protein (TNRC4). It plays an important role in the pathogenesis of tauopathies. CELF-3 contains three highly conserved RNA recognition motifs (RRMs), also known as RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains): two consecutive RRMs (RRM1 and RRM2) situated in the N-terminal region followed by a linker region and the third RRM (RRM3) close to the C-terminus of the protein. The effect of CELF-3 on tau splicing is mediated mainly by the RNA-binding activity of RRM2. The divergent linker region might mediate the interaction of CELF-3 with other proteins regulating its activity or involved in target recognition. CELF-4, being highly expressed throughout the brain and in glandular tissues, moderately expressed in heart, skeletal muscle, and liver, is also known as bruno-like protein 4 (BRUNOL-4), or CUG-BP- and ETR-3-like factor 4. Like CELF-3, CELF-4 also contain three highly conserved RRMs. The splicing activation or repression activity of CELF-4 on some specific substrates is mediated by its RRM1/RRM2. On the other hand, both RRM1 and RRM2 of CELF-4 can activate cardiac troponin T (cTNT) exon 5 inclusion. CELF-5, expressed in brain, is also known as bruno-like protein 5 (BRUNOL-5), or CUG-BP- and ETR-3-like factor 5. Although its biological role remains unclear, CELF-5 shares same domain architecture with CELF-3. CELF-6, being strongly expressed in kidney, brain, and testis, is also known as bruno-like protein 6 (BRUNOL-6), or CUG-BP- and ETR-3-like factor 6. It activates exon inclusion of a cardiac troponin T minigene in transient transfection assays in a muscle-specific splicing enhancer (MSE)-dependent manner and can activate inclusion via multiple copies of a single element, MSE2. CELF-6 also promotes skipping of exon 11 of insulin receptor, a known target of CELF activity that is expressed in kidney. In addition to three highly conserved RRMs, CELF-6 also possesses numerous potential phosphorylation sites, a potential nuclear localization signal (NLS) at the C terminus, and an alanine-rich region within the divergent linker region.


Pssm-ID: 410043 [Multi-domain]  Cd Length: 81  Bit Score: 46.64  E-value: 3.11e-06
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 564333910 1518 RVVFIGKIPGRMTRSELKQRFSVFGEIEECTIHFRVQGDNYG--FVTYRYAEEAFAAIESGH 1577
Cdd:cd12635     2 RKLFVGMLGKQQSEDDVRRLFEPFGSIEECTILRGPDGNSKGcaFVKFSSHAEAQAAINALH 63
RRM1_Hu cd12650
RNA recognition motif 1 (RRM1) found in the Hu proteins family; This subfamily corresponds to ...
1525-1580 4.42e-06

RNA recognition motif 1 (RRM1) found in the Hu proteins family; This subfamily corresponds to the RRM1 of the Hu proteins family which represents a group of RNA-binding proteins involved in diverse biological processes. Since the Hu proteins share high homology with the Drosophila embryonic lethal abnormal vision (ELAV) protein, the Hu family is sometimes referred to as the ELAV family. Drosophila ELAV is exclusively expressed in neurons and is required for the correct differentiation and survival of neurons in flies. The neuronal members of the Hu family include Hu-antigen B (HuB or ELAV-2 or Hel-N1), Hu-antigen C (HuC or ELAV-3 or PLE21), and Hu-antigen D (HuD or ELAV-4), which play important roles in neuronal differentiation, plasticity and memory. HuB is also expressed in gonads. Hu-antigen R (HuR or ELAV-1 or HuA) is the ubiquitously expressed Hu family member. It has a variety of biological functions mostly related to the regulation of cellular response to DNA damage and other types of stress. HuR has an anti-apoptotic function during early cell stress response. It binds to mRNAs and enhances the expression of several anti-apoptotic proteins, such as p21waf1, p53, and prothymosin alpha. HuR also has pro-apoptotic function by promoting apoptosis when cell death is unavoidable. Furthermore, HuR may be important in muscle differentiation, adipogenesis, suppression of inflammatory response and modulation of gene expression in response to chronic ethanol exposure and amino acid starvation. Hu proteins perform their cytoplasmic and nuclear molecular functions by coordinately regulating functionally related mRNAs. In the cytoplasm, Hu proteins recognize and bind to AU-rich RNA elements (AREs) in the 3' untranslated regions (UTRs) of certain target mRNAs, such as GAP-43, vascular epithelial growth factor (VEGF), the glucose transporter GLUT1, eotaxin and c-fos, and stabilize those ARE-containing mRNAs. They also bind and regulate the translation of some target mRNAs, such as neurofilament M, GLUT1, and p27. In the nucleus, Hu proteins function as regulators of polyadenylation and alternative splicing. Each Hu protein contains three RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). RRM1 and RRM2 may cooperate in binding to an ARE. RRM3 may help to maintain the stability of the RNA-protein complex, and might also bind to poly(A) tails or be involved in protein-protein interactions.


Pssm-ID: 410053 [Multi-domain]  Cd Length: 77  Bit Score: 46.24  E-value: 4.42e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 564333910 1525 IPGRMTRSELKQRFSVFGEIEECT-IHFRVQGDN--YGFVTYRYAEEAFAAIESGHKLR 1580
Cdd:cd12650     8 LPQNMTQDEIRSLFSSIGEIESCKlIRDKVTGQSlgYGFVNYVDPSDAEKAINTLNGLR 66
RRM_hnRNPC_like cd12341
RNA recognition motif (RRM) found in heterogeneous nuclear ribonucleoprotein C (hnRNP C) ...
1520-1590 9.85e-06

RNA recognition motif (RRM) found in heterogeneous nuclear ribonucleoprotein C (hnRNP C)-related proteins; This subfamily corresponds to the RRM in the hnRNP C-related protein family, including hnRNP C proteins, Raly, and Raly-like protein (RALYL). hnRNP C proteins, C1 and C2, are produced by a single coding sequence. They are the major constituents of the heterogeneous nuclear RNA (hnRNA) ribonucleoprotein (hnRNP) complex in vertebrates. They bind hnRNA tightly, suggesting a central role in the formation of the ubiquitous hnRNP complex; they are involved in the packaging of the hnRNA in the nucleus and in processing of pre-mRNA such as splicing and 3'-end formation. Raly, also termed autoantigen p542, is an RNA-binding protein that may play a critical role in embryonic development. The biological role of RALYL remains unclear. It shows high sequence homology with hnRNP C proteins and Raly. Members of this family are characterized by an N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and a C-terminal auxiliary domain. The Raly proteins contain a glycine/serine-rich stretch within the C-terminal regions, which is absent in the hnRNP C proteins. Thus, the Raly proteins represent a newly identified class of evolutionarily conserved autoepitopes.


Pssm-ID: 409778 [Multi-domain]  Cd Length: 68  Bit Score: 44.93  E-value: 9.85e-06
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 564333910 1520 VFIGKIP-GRMTRSELKQRFSVFGEIEECTIHfrvqgDNYGFVTYRYAEEAFAAIeSGHKLRQADEQPFDLC 1590
Cdd:cd12341     3 IFVGNLPtDQMTKEDLEEIFSKYGKILGISLH-----KGYGFVQFDNEEDARAAV-AGENGRTIKGQRLDIN 68
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
890-1090 3.81e-05

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 48.53  E-value: 3.81e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  890 PPDP-YTQYAPVPPWPCYPSVSPSGYSclPPPPAMPIVSGTPGTYAVPPTCNVPWVPPPApvspyssssaygslgWGPGL 968
Cdd:PTZ00449  609 PKSPkLPELLDIPKSPKRPESPKSPKR--PPPPQRPSSPERPEGPKIIKSPKPPKSPKPP---------------FDPKF 671
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  969 QQPpFWSTVSPPPLSSASTGRAVPPPSVEPSNSSAGPPED-----TLPVPVTPSLSSRPASPIAPPAEPTKLEAQPVPVS 1043
Cdd:PTZ00449  672 KEK-FYDDYLDAAAKSKETKTTVVLDESFESILKETLPETpgtpfTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFF 750
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*..
gi 564333910 1044 PQPKHKVSPVVQSPQIKVLPTLSAEGVAIGESESERLKPETQENRPR 1090
Cdd:PTZ00449  751 TPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPD 797
RRM1_hnRNPA_hnRNPD_like cd12325
RNA recognition motif 1 (RRM1) found in heterogeneous nuclear ribonucleoprotein hnRNP A and ...
1520-1583 4.31e-05

RNA recognition motif 1 (RRM1) found in heterogeneous nuclear ribonucleoprotein hnRNP A and hnRNP D subfamilies and similar proteins; This subfamily corresponds to the RRM1 in the hnRNP A subfamily which includes hnRNP A0, hnRNP A1, hnRNP A2/B1, hnRNP A3 and similar proteins. hnRNP A0 is a low abundance hnRNP protein that has been implicated in mRNA stability in mammalian cells. hnRNP A1 is an abundant eukaryotic nuclear RNA-binding protein that may modulate splice site selection in pre-mRNA splicing. hnRNP A2/B1 is an RNA trafficking response element-binding protein that interacts with the hnRNP A2 response element (A2RE). hnRNP A3 is also a RNA trafficking response element-binding protein that participates in the trafficking of A2RE-containing RNA. The hnRNP A subfamily is characterized by two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), followed by a long glycine-rich region at the C-terminus. The hnRNP D subfamily includes hnRNP D0, hnRNP A/B, hnRNP DL and similar proteins. hnRNP D0 is a UUAG-specific nuclear RNA binding protein that may be involved in pre-mRNA splicing and telomere elongation. hnRNP A/B is an RNA unwinding protein with a high affinity for G- followed by U-rich regions. hnRNP A/B has also been identified as an APOBEC1-binding protein that interacts with apolipoprotein B (apoB) mRNA transcripts around the editing site and thus, plays an important role in apoB mRNA editing. hnRNP DL (or hnRNP D-like) is a dual functional protein that possesses DNA- and RNA-binding properties. It has been implicated in mRNA biogenesis at the transcriptional and post-transcriptional levels. All members in this subfamily contain two putative RRMs and a glycine- and tyrosine-rich C-terminus. The family also contains DAZAP1 (Deleted in azoospermia-associated protein 1), RNA-binding protein Musashi homolog Musashi-1, Musashi-2 and similar proteins. They all harbor two RRMs.


Pssm-ID: 409763 [Multi-domain]  Cd Length: 72  Bit Score: 42.90  E-value: 4.31e-05
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIEECTIHF-----RVQGdnYGFVTYRYAEEAFAAIESG-HKL--RQAD 1583
Cdd:cd12325     1 LFVGGLSWETTEESLREYFSKYGEVVDCVVMKdpatgRSRG--FGFVTFKDPSSVDAVLAARpHTLdgRTID 70
RRM1_p54nrb_like cd12332
RNA recognition motif 1 (RRM1) found in the p54nrb/PSF/PSP1 family; This subfamily corresponds ...
1521-1574 4.67e-05

RNA recognition motif 1 (RRM1) found in the p54nrb/PSF/PSP1 family; This subfamily corresponds to the RRM1 of the p54nrb/PSF/PSP1 family, including 54 kDa nuclear RNA- and DNA-binding protein (p54nrb or NonO or NMT55), polypyrimidine tract-binding protein (PTB)-associated-splicing factor (PSF or POMp100), paraspeckle protein 1 (PSP1 or PSPC1), which are ubiquitously expressed and are conserved in vertebrates. p54nrb is a multi-functional protein involved in numerous nuclear processes including transcriptional regulation, splicing, DNA unwinding, nuclear retention of hyperedited double-stranded RNA, viral RNA processing, control of cell proliferation, and circadian rhythm maintenance. PSF is also a multi-functional protein that binds RNA, single-stranded DNA (ssDNA), double-stranded DNA (dsDNA) and many factors, and mediates diverse activities in the cell. PSP1 is a novel nucleolar factor that accumulates within a new nucleoplasmic compartment, termed paraspeckles, and diffusely distributes in the nucleoplasm. The cellular function of PSP1 remains unknown currently. This subfamily also includes some p54nrb/PSF/PSP1 homologs from invertebrate species, such as the Drosophila melanogaster gene no-ontransient A (nonA) encoding puff-specific protein Bj6 (also termed NONA) and Chironomus tentans hrp65 gene encoding protein Hrp65. D. melanogaster NONA is involved in eye development and behavior, and may play a role in circadian rhythm maintenance, similar to vertebrate p54nrb. C. tentans Hrp65 is a component of nuclear fibers associated with ribonucleoprotein particles in transit from the gene to the nuclear pore. All family members contain a DBHS domain (for Drosophila behavior, human splicing), which comprises two conserved RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a charged protein-protein interaction module. PSF has an additional large N-terminal domain that differentiates it from other family members.


Pssm-ID: 409769 [Multi-domain]  Cd Length: 71  Bit Score: 43.05  E-value: 4.67e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 564333910 1521 FIGKIPGRMTRSELKQRFSVFGEIEECTIHfrvQGDNYGFV--TYRY-AEEAFAAIE 1574
Cdd:cd12332     5 FVGNLPNDITEEEFKELFQKYGEVSEVFLN---KGKGFGFIrlDTRAnAEAAKAELD 58
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
918-1128 5.70e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 47.95  E-value: 5.70e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  918 PPPPAMPIVSGTPGTYAVPPTCNVPWVPPPAPVSPYSSSSAYGSLGWGPGLQQPPFWSTVSPPPLSSASTGRAVPPPSVE 997
Cdd:PRK12323  375 ATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAP 454
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  998 PSNSSAGPPEDTLPVPVTPSLSSRPASPIAPPAEPTkleaqPVPVSPQPKHKVSPVVQSPQikvlPTLSAEGVAIGESES 1077
Cdd:PRK12323  455 AAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPA-----PADDDPPPWEELPPEFASPA----PAQPDAAPAGWVAES 525
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|.
gi 564333910 1078 ERLKPETQENRPREKPISTATKAVLLPKQSTVAKLPAVHPARLRRLSFLPT 1128
Cdd:PRK12323  526 IPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDM 576
RRM1_2_CoAA_like cd12343
RNA recognition motif 1 (RRM1) and 2 (RRM2) found in RRM-containing coactivator activator ...
1520-1578 8.62e-05

RNA recognition motif 1 (RRM1) and 2 (RRM2) found in RRM-containing coactivator activator/modulator (CoAA) and similar proteins; This subfamily corresponds to the RRM in CoAA (also known as RBM14 or PSP2) and RNA-binding protein 4 (RBM4). CoAA is a heterogeneous nuclear ribonucleoprotein (hnRNP)-like protein identified as a nuclear receptor coactivator. It mediates transcriptional coactivation and RNA splicing effects in a promoter-preferential manner, and is enhanced by thyroid hormone receptor-binding protein (TRBP). CoAA contains two N-terminal RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a TRBP-interacting domain. RBM4 is a ubiquitously expressed splicing factor with two isoforms, RBM4A (also known as Lark homolog) and RBM4B (also known as RBM30), which are very similar in structure and sequence. RBM4 may also function as a translational regulator of stress-associated mRNAs as well as play a role in micro-RNA-mediated gene regulation. RBM4 contains two N-terminal RRMs, a CCHC-type zinc finger, and three alanine-rich regions within their C-terminal regions. This family also includes Drosophila RNA-binding protein lark (Dlark), a homolog of human RBM4. It plays an important role in embryonic development and in the circadian regulation of adult eclosion. Dlark shares high sequence similarity with RBM4 at the N-terminal region. However, Dlark has three proline-rich segments instead of three alanine-rich segments within the C-terminal region.


Pssm-ID: 409779 [Multi-domain]  Cd Length: 66  Bit Score: 42.22  E-value: 8.62e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIEECTIhfrVQgdNYGFVTYRYAEEAFAAIESGHK 1578
Cdd:cd12343     2 IFVGNLPDAATSEELRALFEKYGKVTECDI---VK--NYAFVHMEKEEDAEDAIKALNG 55
RRM_NOL8 cd12226
RNA recognition motif (RRM) found in nucleolar protein 8 (NOL8) and similar proteins; This ...
1519-1571 9.97e-05

RNA recognition motif (RRM) found in nucleolar protein 8 (NOL8) and similar proteins; This model corresponds to the RRM of NOL8 (also termed Nop132) encoded by a novel NOL8 gene that is up-regulated in the majority of diffuse-type, but not intestinal-type, gastric cancers. Thus, NOL8 may be a good molecular target for treatment of diffuse-type gastric cancer. Also, NOL8 is a phosphorylated protein that contains an N-terminal RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), suggesting NOL8 is likely to function as a novel RNA-binding protein. It may be involved in regulation of gene expression at the post-transcriptional level or in ribosome biogenesis in cancer cells.


Pssm-ID: 409673 [Multi-domain]  Cd Length: 77  Bit Score: 42.18  E-value: 9.97e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 564333910 1519 VVFIGKIPGRMTRSELKQRFSVFGEIEECTIHFR--VQGDNYGFVTYRYAEEAFA 1571
Cdd:cd12226     1 RLFVGGLSPSITEDDLERRFSRFGTVSDVEIIRKkdAPDRGFAYIDLRTSEAALQ 55
RRM3_PUB1 cd12622
RNA recognition motif 3 (RRM3) found in yeast nuclear and cytoplasmic polyadenylated ...
1520-1576 1.06e-04

RNA recognition motif 3 (RRM3) found in yeast nuclear and cytoplasmic polyadenylated RNA-binding protein PUB1 and similar proteins; This subfamily corresponds to the RRM3 of yeast protein PUB1, also termed ARS consensus-binding protein ACBP-60, or poly uridylate-binding protein, or poly(U)-binding protein. PUB1 has been identified as both, a heterogeneous nuclear RNA-binding protein (hnRNP) and a cytoplasmic mRNA-binding protein (mRNP), which may be stably bound to a translationally inactive subpopulation of mRNAs within the cytoplasm. PUB1 is distributed in both, the nucleus and the cytoplasm, and binds to poly(A)+ RNA (mRNA or pre-mRNA). Although it is one of the major cellular proteins cross-linked by UV light to polyadenylated RNAs in vivo, PUB1 is nonessential for cell growth in yeast. PUB1 also binds to T-rich single stranded DNA (ssDNA); however, there is no strong evidence implicating PUB1 in the mechanism of DNA replication. PUB1 contains three RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a GAR motif (glycine and arginine rich stretch) that is located between RRM2 and RRM3.


Pssm-ID: 410033 [Multi-domain]  Cd Length: 74  Bit Score: 42.05  E-value: 1.06e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIEEctihFRVQGDN-YGFVTYRYAEEAFAAIESG 1576
Cdd:cd12622     3 VYVGNLPPEVTQADLIPLFQNFGVIEE----VRVQRDKgFGFVKYDTHEEAALAIQQL 56
RRM2_Bruno_like cd12636
RNA recognition motif 2 (RRM2) found in Drosophila melanogaster Bruno protein and similar ...
1518-1577 1.09e-04

RNA recognition motif 2 (RRM2) found in Drosophila melanogaster Bruno protein and similar proteins; This subgroup corresponds to the RRM2 of Bruno, a Drosophila RNA recognition motif (RRM)-containing protein that plays a central role in regulation of Oskar (Osk) expression. It mediates repression by binding to regulatory Bruno response elements (BREs) in the Osk mRNA 3' UTR. The full-length Bruno protein contains three RRMs, two located in the N-terminal half of the protein and the third near the C-terminus, separated by a linker region.


Pssm-ID: 410044 [Multi-domain]  Cd Length: 81  Bit Score: 42.17  E-value: 1.09e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 564333910 1518 RVVFIGKIPGRMTRSELKQRFSVFGEIEECTIHFRVQGDNYG--FVTYRYAEEAFAAIESGH 1577
Cdd:cd12636     2 RKLFVGMLSKKCNESDVRIMFSPYGSIEECTVLRDQNGKSRGcaFVTFTSRQCAVNAIKAMH 63
RRM2_RBM23_RBM39 cd12284
RNA recognition motif 2 (RRM2) found in vertebrate RNA-binding protein RBM23, RBM39 and ...
1521-1574 1.35e-04

RNA recognition motif 2 (RRM2) found in vertebrate RNA-binding protein RBM23, RBM39 and similar proteins; This subfamily corresponds to the RRM2 of RBM39 (also termed HCC1), a nuclear autoantigen that contains an N-terminal arginine/serine rich (RS) motif and three RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). An octapeptide sequence called the RS-ERK motif is repeated six times in the RS region of RBM39. Although the cellular function of RBM23 remains unclear, it shows high sequence homology to RBM39 and contains two RRMs. It may possibly function as a pre-mRNA splicing factor.


Pssm-ID: 409726 [Multi-domain]  Cd Length: 78  Bit Score: 41.84  E-value: 1.35e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 564333910 1521 FIGKIPGRMTRSELKQRFSVFGEIEECTIHF-----RVQGdnYGFVTYRYAEEAFAAIE 1574
Cdd:cd12284     2 YVGSLHFNITEDMLRGIFEPFGKIEFVQLQKdpetgRSKG--YGFIQFRDAEDAKKALE 58
RRM3_TIA1_like cd12354
RNA recognition motif 2 (RRM2) found in granule-associated RNA binding proteins (p40-TIA-1 and ...
1520-1573 1.74e-04

RNA recognition motif 2 (RRM2) found in granule-associated RNA binding proteins (p40-TIA-1 and TIAR), and yeast nuclear and cytoplasmic polyadenylated RNA-binding protein PUB1; This subfamily corresponds to the RRM3 of TIA-1, TIAR, and PUB1. Nucleolysin TIA-1 isoform p40 (p40-TIA-1 or TIA-1) and nucleolysin TIA-1-related protein (TIAR) are granule-associated RNA binding proteins involved in inducing apoptosis in cytotoxic lymphocyte (CTL) target cells. They share high sequence similarity and are expressed in a wide variety of cell types. TIA-1 can be phosphorylated by a serine/threonine kinase that is activated during Fas-mediated apoptosis.TIAR is mainly localized in the nucleus of hematopoietic and nonhematopoietic cells. It is translocated from the nucleus to the cytoplasm in response to exogenous triggers of apoptosis. Both TIA-1 and TIAR bind specifically to poly(A) but not to poly(C) homopolymers. They are composed of three N-terminal highly homologous RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a glutamine-rich C-terminal auxiliary domain containing a lysosome-targeting motif. TIA-1 and TIAR interact with RNAs containing short stretches of uridylates and their RRM2 can mediate the specific binding to uridylate-rich RNAs. The C-terminal auxiliary domain may be responsible for interacting with other proteins. In addition, TIA-1 and TIAR share a potential serine protease-cleavage site (Phe-Val-Arg) localized at the junction between their RNA binding domains and their C-terminal auxiliary domains. This subfamily also includes a yeast nuclear and cytoplasmic polyadenylated RNA-binding protein PUB1, termed ARS consensus-binding protein ACBP-60, or poly uridylate-binding protein, or poly(U)-binding protein, which has been identified as both a heterogeneous nuclear RNA-binding protein (hnRNP) and a cytoplasmic mRNA-binding protein (mRNP). It may be stably bound to a translationally inactive subpopulation of mRNAs within the cytoplasm. PUB1 is distributed in both, the nucleus and the cytoplasm, and binds to poly(A)+ RNA (mRNA or pre-mRNA). Although it is one of the major cellular proteins cross-linked by UV light to polyadenylated RNAs in vivo, PUB1 is nonessential for cell growth in yeast. PUB1 also binds to T-rich single stranded DNA (ssDNA); however, there is no strong evidence implicating PUB1 in the mechanism of DNA replication. PUB1 contains three RRMs, and a GAR motif (glycine and arginine rich stretch) that is located between RRM2 and RRM3.


Pssm-ID: 409790 [Multi-domain]  Cd Length: 71  Bit Score: 41.50  E-value: 1.74e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIEECTIhFRVQGdnYGFVTYRYAEEAFAAI 1573
Cdd:cd12354     3 VYVGNITKGLTEALLQQTFSPFGQILEVRV-FPDKG--YAFIRFDSHEAATHAI 53
PHA03247 PHA03247
large tegument protein UL36; Provisional
379-844 1.94e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.47  E-value: 1.94e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  379 DPALQLLMPAMEPGTEDGVPKVAPCPDQELPLSSACLLEPRDIMESLAPKEPRNLPANASEGSQRVPRKGRKKKKKEQPT 458
Cdd:PHA03247 2550 DPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPP 2629
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  459 AclegcTRRLRSSSRGQSTVATEVNSQAGNLQKQPQEELQRETAAPqsrGKPRAWARAWAAALEKTGSEDPEDQNAGQDS 538
Cdd:PHA03247 2630 S-----PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRL---GRAAQASSPPQRPRRRAARPTVGSLTSLADP 2701
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  539 PAEEDVLELRPKLVETSQADPTLSLNDSARADSIPVNSVEgDSTAVDTAVPVDQASSCTElvgSLSGGPILSSPVLDDRT 618
Cdd:PHA03247 2702 PPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAP-PAVPAGPATPGGPARPARP---PTTAGPPAPAPPAAPAA 2777
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  619 GVEPAVAIPTSNHLSPADAILANPGTADPAPNGLAPGDPVLVKCRPTDPRRAAAAATAAQGPRPSLEPSDHPKVITPEVK 698
Cdd:PHA03247 2778 GPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVA 2857
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  699 dAGGPLKVEGST--SATTQEARPRPLSLSEYRQRRRQRQIEAEDRNSQPPAGKWPSLPETPKGLADIPCLVPSAPTKKTA 776
Cdd:PHA03247 2858 -PGGDVRRRPPSrsPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPP 2936
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 564333910  777 PQRSPIAVPETcsvsvgsnpvSPTPEPSASKLMVSTHSEQVSSHEMPLP-VRLPPPTLQSMAPAGPTPS 844
Cdd:PHA03247 2937 PRPQPPLAPTT----------DPAGAGEPSGAVPQPWLGALVPGRVAVPrFRVPQPAPSREAPASSTPP 2995
RBD_RRM1_NPL3 cd12340
RNA recognition motif 1 (RRM1) found in yeast nucleolar protein 3 (Npl3p) and similar proteins; ...
1521-1577 2.00e-04

RNA recognition motif 1 (RRM1) found in yeast nucleolar protein 3 (Npl3p) and similar proteins; This subfamily corresponds to the RRM1 of Npl3p, also termed mitochondrial targeting suppressor 1 protein, or nuclear polyadenylated RNA-binding protein 1. Npl3p is a major yeast RNA-binding protein that competes with 3'-end processing factors, such as Rna15, for binding to the nascent RNA, protecting the transcript from premature termination and coordinating transcription termination and the packaging of the fully processed transcript for export. It specifically recognizes a class of G/U-rich RNAs. Npl3p is a multi-domain protein containing two central RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), separated by a short linker and a C-terminal domain rich in glycine, arginine and serine residues.


Pssm-ID: 409777 [Multi-domain]  Cd Length: 69  Bit Score: 41.23  E-value: 2.00e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 564333910 1521 FIGKIPGRMTRSELKQRFSVFGEIEECTIhfrVQGDNYGFVTYRYAEEAFAAIESGH 1577
Cdd:cd12340     3 FVRPFPPDTSESAIREIFSPYGPVKEVKM---LSDSNFAFVEFEELEDAIRAKDSVH 56
RRM2_TDP43 cd12322
RNA recognition motif 2 (RRM2) found in TAR DNA-binding protein 43 (TDP-43) and similar ...
1518-1580 2.30e-04

RNA recognition motif 2 (RRM2) found in TAR DNA-binding protein 43 (TDP-43) and similar proteins; This subfamily corresponds to the RRM2 of TDP-43 (also termed TARDBP), a ubiquitously expressed pathogenic protein whose normal function and abnormal aggregation are directly linked to the genetic disease cystic fibrosis, and two neurodegenerative disorders: frontotemporal lobar degeneration (FTLD) and amyotrophic lateral sclerosis (ALS). TDP-43 binds both DNA and RNA, and has been implicated in transcriptional repression, pre-mRNA splicing and translational regulation. TDP-43 is a dimeric protein with two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a C-terminal glycine-rich domain. The RRMs are responsible for DNA and RNA binding; they bind to TAR DNA and RNA sequences with UG-repeats. The glycine-rich domain can interact with the hnRNP family proteins to form the hnRNP-rich complex involved in splicing inhibition. It is also essential for the cystic fibrosis transmembrane conductance regulator (CFTR) exon 9-skipping activity.


Pssm-ID: 409761 [Multi-domain]  Cd Length: 71  Bit Score: 41.15  E-value: 2.30e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 564333910 1518 RVVFIGKIPGRMTRSELKQRFSVFGEIEECTI--HFRvqgdNYGFVTYRYAEEAFAAIESGHKLR 1580
Cdd:cd12322     1 RKVFVGRCTEDMTEDDLRQYFSQFGEVTDVFIpkPFR----AFAFVTFADDEVAQSLCGEDHIIK 61
PHA03247 PHA03247
large tegument protein UL36; Provisional
978-1362 2.43e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 2.43e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  978 SPPPLSSASTGRA----VPPPS------VEPSNSSAGPPEDTLPVPVTPSLSSRPASPIAPPAEPTKLEAQPVPVSPQPK 1047
Cdd:PHA03247 2574 APRPSEPAVTSRArrpdAPPQSarprapVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPR 2653
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910 1048 HKVSPVVQSPQIKVlptlSAEGVAIGESESERlKPETQENRPREKPISTATKAVLLPKQ------STVAKLPAVHPARLR 1121
Cdd:PHA03247 2654 DDPAPGRVSRPRRA----RRLGRAAQASSPPQ-RPRRRAARPTVGSLTSLADPPPPPPTpepaphALVSATPLPPGPAAA 2728
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910 1122 RLSFLPTPRAQGSEDVVQAFISEIGiEASDLSSLLEQFEKSEAKKECPL--PAPADSVAVGNSGSIDTPQEKKPLD--RL 1197
Cdd:PHA03247 2729 RQASPALPAAPAPPAVPAGPATPGG-PARPARPPTTAGPPAPAPPAAPAagPPRRLTRPAVASLSESRESLPSPWDpaDP 2807
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910 1198 HAPELANVAGLTPPATPPHQLWKPLAAV---SLLAKAKSPKSTAQEGTLKPEGITEAKPPAaaclqegvRGPSPVHVgsg 1274
Cdd:PHA03247 2808 PAAVLAPAAALPPAASPAGPLPPPTSAQptaPPPPPGPPPPSLPLGGSVAPGGDVRRRPPS--------RSPAAKPA--- 2876
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910 1275 dhdycVRSRTPPKKMPALVISEMGSRWNVKRHQditIKPVLSLGSAAPPLPCTTTSQEPLDHRTSSEQADPSAPcFAPST 1354
Cdd:PHA03247 2877 -----APARPPVRRLARPAVSRSTESFALPPDQ---PERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP-LAPTT 2947

                  ....*...
gi 564333910 1355 LLSPEASP 1362
Cdd:PHA03247 2948 DPAGAGEP 2955
RRM_G3BP cd12229
RNA recognition motif (RRM) found in ras GTPase-activating protein-binding protein G3BP1, ...
1520-1575 2.57e-04

RNA recognition motif (RRM) found in ras GTPase-activating protein-binding protein G3BP1, G3BP2 and similar proteins; This subfamily corresponds to the RRM domain in the G3BP family of RNA-binding and SH3 domain-binding proteins. G3BP acts at the level of RNA metabolism in response to cell signaling, possibly as RNA transcript stabilizing factors or an RNase. Members include G3BP1, G3BP2 and similar proteins. These proteins associate directly with the SH3 domain of GTPase-activating protein (GAP), which functions as an inhibitor of Ras. They all contain an N-terminal nuclear transfer factor 2 (NTF2)-like domain, an acidic domain, a domain containing PXXP motif(s), an RNA recognition motif (RRM), and an Arg-Gly-rich region (RGG-rich region, or arginine methylation motif).


Pssm-ID: 409676 [Multi-domain]  Cd Length: 81  Bit Score: 41.24  E-value: 2.57e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIEECTIHFRVQGD---NYGFVTYRYAEEAFAAIES 1575
Cdd:cd12229     6 LFVGNLPHDITEDELKEFFSRFGNVLELRINSKGGGGrlpNFGFVVFDDPEAVQKILAN 64
RRM_RBM22 cd12224
RNA recognition motif (RRM) found in Pre-mRNA-splicing factor RBM22 and similar proteins; This ...
1521-1579 2.62e-04

RNA recognition motif (RRM) found in Pre-mRNA-splicing factor RBM22 and similar proteins; This subgroup corresponds to the RRM of RBM22 (also known as RNA-binding motif protein 22, or Zinc finger CCCH domain-containing protein 16), a newly discovered RNA-binding motif protein which belongs to the SLT11 gene family. SLT11 gene encoding protein (Slt11p) is a splicing factor in yeast, which is required for spliceosome assembly. Slt11p has two distinct biochemical properties: RNA-annealing and RNA-binding activities. RBM22 is the homolog of SLT11 in vertebrate. It has been reported to be involved in pre-splicesome assembly and to interact with the Ca2+-signaling protein ALG-2. It also plays an important role in embryogenesis. RBM22 contains a conserved RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), a zinc finger of the unusual type C-x8-C-x5-C-x3-H, and a C-terminus that is unusually rich in the amino acids Gly and Pro, including sequences of tetraprolines.


Pssm-ID: 409671 [Multi-domain]  Cd Length: 74  Bit Score: 41.12  E-value: 2.62e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 564333910 1521 FIGKIPGRMTRSELKQRFSVFGEIEECTIhfrVQGDNYGFVTY--RYAEEAfAAIESGHKL 1579
Cdd:cd12224     5 YVGGLGDKITEKDLRDHFYQFGEIRSITV---VARQQCAFVQFttRQAAER-AAERTFNKL 61
RRM2_RBM45 cd12367
RNA recognition motif 2 (RRM2) found in RNA-binding protein 45 (RBM45) and similar proteins; ...
1525-1574 2.71e-04

RNA recognition motif 2 (RRM2) found in RNA-binding protein 45 (RBM45) and similar proteins; This subfamily corresponds to the RRM2 of RBM45, also termed developmentally-regulated RNA-binding protein 1 (DRB1), a new member of RNA recognition motif (RRM)-type neural RNA-binding proteins, which expresses under spatiotemporal control. It is encoded by gene drb1 that is expressed in neurons, not in glial cells. RBM45 predominantly localizes in cytoplasm of cultured cells and specifically binds to poly(C) RNA. It could play an important role during neurogenesis. RBM45 carries four RRMs, also known as RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains).


Pssm-ID: 409802 [Multi-domain]  Cd Length: 74  Bit Score: 40.82  E-value: 2.71e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 564333910 1525 IPGRMTRSELKQRFSVFGEIEECTI-HFRVQGDN--YGFVTYRYAEEAFAAIE 1574
Cdd:cd12367     8 IPKSYTEEDLREKFKEFGDIEYCSIvKDKNTGESkgFGYVKFLKPSQAALAIE 60
RRM1_RBM4 cd12606
RNA recognition motif 1 (RRM1) found in vertebrate RNA-binding protein 4 (RBM4); This subgroup ...
1520-1577 2.76e-04

RNA recognition motif 1 (RRM1) found in vertebrate RNA-binding protein 4 (RBM4); This subgroup corresponds to the RRM1 of RBM4, a ubiquitously expressed splicing factor that has two isoforms, RBM4A (also known as Lark homolog) and RBM4B (also known as RBM30), which are very similar in structure and sequence. RBM4 may function as a translational regulator of stress-associated mRNAs and also plays a role in micro-RNA-mediated gene regulation. RBM4 contains two N-terminal RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), a CCHC-type zinc finger, and three alanine-rich regions within their C-terminal regions. The C-terminal region may be crucial for nuclear localization and protein-protein interaction. The RRMs, in combination with the C-terminal region, are responsible for the splicing function of RBM4.


Pssm-ID: 410018 [Multi-domain]  Cd Length: 67  Bit Score: 40.56  E-value: 2.76e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIEECTIHfrvqgDNYGFVTYRYAEEAFAAIESGH 1577
Cdd:cd12606     3 LFIGNLPREATEEEIRSLFEQYGKVTECDII-----KNYGFVHMEDKSAADEAIRNLH 55
ELAV_HUD_SF TIGR01661
ELAV/HuD family splicing factor; This model describes the ELAV/HuD subfamily of splicing ...
1515-1580 2.86e-04

ELAV/HuD family splicing factor; This model describes the ELAV/HuD subfamily of splicing factors found in metazoa. HuD stands for the human paraneoplastic encephalomyelitis antigen D of which there are 4 variants in human. ELAV stnds for the Drosophila Embryonic lethal abnormal visual protein. ELAV-like splicing factors are also known in human as HuB (ELAV-like protein 2), HuC (ELAV-like protein 3, Paraneoplastic cerebellar degeneration-associated antigen) and HuR (ELAV-like protein 1). These genes are most closely related to the sex-lethal subfamily of splicing factors found in Dipteran insects (TIGR01659). These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).


Pssm-ID: 273741 [Multi-domain]  Cd Length: 352  Bit Score: 44.93  E-value: 2.86e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 564333910  1515 EERRVVFIGKIPGRMTRSELKQRFSVFGEIEECT-IHFRVQGDN--YGFVTYRYAEEAFAAIESGHKLR 1580
Cdd:TIGR01661    1 ESKTNLIVNYLPQTMTQEEIRSLFTSIGEIESCKlVRDKVTGQSlgYGFVNYVRPEDAEKAVNSLNGLR 69
RRM COG0724
RNA recognition motif (RRM) domain [Translation, ribosomal structure and biogenesis];
1521-1595 3.04e-04

RNA recognition motif (RRM) domain [Translation, ribosomal structure and biogenesis];


Pssm-ID: 440488 [Multi-domain]  Cd Length: 85  Bit Score: 41.24  E-value: 3.04e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910 1521 FIGKIPGRMTRSELKQRFSVFGEIEECTI---HF--RVQGdnYGFVTYRYAEEAFAAIEsghKLRQADeqpfdlcFGGRR 1595
Cdd:COG0724     5 YVGNLPYSVTEEDLRELFSEYGEVTSVKLitdREtgRSRG--FGFVEMPDDEEAQAAIE---ALNGAE-------LMGRT 72
RRM1_hnRNPA0 cd12326
RNA recognition motif 1 (RRM1) found in heterogeneous nuclear ribonucleoprotein A0 (hnRNP A0) ...
1520-1575 3.34e-04

RNA recognition motif 1 (RRM1) found in heterogeneous nuclear ribonucleoprotein A0 (hnRNP A0) and similar proteins; This subfamily corresponds to the RRM1 of hnRNP A0 which is a low abundance hnRNP protein that has been implicated in mRNA stability in mammalian cells. It has been identified as the substrate for MAPKAP-K2 and may be involved in the lipopolysaccharide (LPS)-induced post-transcriptional regulation of tumor necrosis factor-alpha (TNF-alpha), cyclooxygenase 2 (COX-2) and macrophage inflammatory protein 2 (MIP-2). hnRNP A0 contains two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), followed by a long glycine-rich region at the C-terminus.


Pssm-ID: 409764 [Multi-domain]  Cd Length: 79  Bit Score: 40.67  E-value: 3.34e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIEECTIHFRVQGDN---YGFVTYRYAEEAFAAIES 1575
Cdd:cd12326     5 LFIGGLNVQTTEEGLRAHFEAYGQLTDCVVVVNPQTKRsrcFGFVTYSSAEEADAAMAA 63
RRM1_RBM46 cd12484
RNA recognition motif 1 (RRM1) found in vertebrate RNA-binding protein 46 (RBM46); This ...
1520-1574 4.04e-04

RNA recognition motif 1 (RRM1) found in vertebrate RNA-binding protein 46 (RBM46); This subgroup corresponds to the RRM1 of RBM46, also termed cancer/testis antigen 68 (CT68), a putative RNA-binding protein that shows high sequence homology with heterogeneous nuclear ribonucleoprotein R (hnRNP R) and heterogeneous nuclear ribonucleoprotein Q (hnRNP Q). Its biological function remains unclear. Like hnRNP R and hnRNP Q, RBM46 contains two well-defined and one degenerated RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains).


Pssm-ID: 409911 [Multi-domain]  Cd Length: 78  Bit Score: 40.64  E-value: 4.04e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIEECTIHFRVQGDN--YGFVTYRYAEEAFAAIE 1574
Cdd:cd12484     4 VFVGKIPRDMYEDELVPVFERAGKIYEFRLMMEFSGENrgYAFVMYTTKEEAQLAIK 60
RRM_SNP1_like cd21615
RNA recognition motif (RRM) found in Saccharomyces cerevisiae U1 small nuclear ...
1518-1580 4.51e-04

RNA recognition motif (RRM) found in Saccharomyces cerevisiae U1 small nuclear ribonucleoprotein SNP1 and similar proteins; SNP1, also called U1 snRNP protein SNP1, or U1 small nuclear ribonucleoprotein 70 kDa homolog, or U1 70K, or U1 snRNP 70 kDa homolog, interacts with mRNA and is involved in nuclear mRNA splicing. It is a component of the spliceosome, where it is associated with snRNP U1 by binding stem loop I of U1 snRNA. Members in this family contain an N-terminal U1snRNP70 domain and an RNA recognition motif (RRM), also called RBD (RNA binding domain) or RNP (ribonucleoprotein domain).


Pssm-ID: 410194 [Multi-domain]  Cd Length: 118  Bit Score: 41.53  E-value: 4.51e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 564333910 1518 RVVFIGKIPGRMTRSELKQRFSVFGEIEECTIHFRVQGDN---YGFVTYRYAEEAFAAIESGHKLR 1580
Cdd:cd21615    19 KTLFVGRLDYSLTELELQKKFSKFGEIEKIRIVRDKETGKsrgYAFIVFKSESDAKNAFKEGNGLR 84
RRM1_Hu_like cd12375
RNA recognition motif 1 (RRM1) found in the Hu proteins family, Drosophila sex-lethal (SXL), ...
1522-1580 5.34e-04

RNA recognition motif 1 (RRM1) found in the Hu proteins family, Drosophila sex-lethal (SXL), and similar proteins; This subfamily corresponds to the RRM1 of Hu proteins and SXL. The Hu proteins family represents a group of RNA-binding proteins involved in diverse biological processes. Since the Hu proteins share high homology with the Drosophila embryonic lethal abnormal vision (ELAV) protein, the Hu family is sometimes referred to as the ELAV family. Drosophila ELAV is exclusively expressed in neurons and is required for the correct differentiation and survival of neurons in flies. The neuronal members of the Hu family include Hu-antigen B (HuB or ELAV-2 or Hel-N1), Hu-antigen C (HuC or ELAV-3 or PLE21), and Hu-antigen D (HuD or ELAV-4), which play important roles in neuronal differentiation, plasticity and memory. HuB is also expressed in gonads. Hu-antigen R (HuR or ELAV-1 or HuA) is ubiquitously expressed Hu family member. It has a variety of biological functions mostly related to the regulation of cellular response to DNA damage and other types of stress. Hu proteins perform their cytoplasmic and nuclear molecular functions by coordinately regulating functionally related mRNAs. In the cytoplasm, Hu proteins recognize and bind to AU-rich RNA elements (AREs) in the 3' untranslated regions (UTRs) of certain target mRNAs, such as GAP-43, vascular epithelial growth factor (VEGF), the glucose transporter GLUT1, eotaxin and c-fos, and stabilize those ARE-containing mRNAs. They also bind and regulate the translation of some target mRNAs, such as neurofilament M, GLUT1, and p27. In the nucleus, Hu proteins function as regulators of polyadenylation and alternative splicing. Each Hu protein contains three RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). RRM1 and RRM2 may cooperate in binding to an ARE. RRM3 may help to maintain the stability of the RNA-protein complex, and might also bind to poly(A) tails or be involved in protein-protein interactions. This family also includes the sex-lethal protein (SXL) from Drosophila melanogaster. SXL governs sexual differentiation and X chromosome dosage compensation in flies. It induces female-specific alternative splicing of the transformer (tra) pre-mRNA by binding to the tra uridine-rich polypyrimidine tract at the non-sex-specific 3' splice site during the sex-determination process. SXL binds to its own pre-mRNA and promotes female-specific alternative splicing. It contains an N-terminal Gly/Asn-rich domain that may be responsible for the protein-protein interaction, and tandem RRMs that show high preference to bind single-stranded, uridine-rich target RNA transcripts.


Pssm-ID: 409810 [Multi-domain]  Cd Length: 76  Bit Score: 40.08  E-value: 5.34e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 564333910 1522 IGKIPGRMTRSELKQRFSVFGEIEECTIHF-RVQGDN--YGFVTYRYAEEAFAAIESGHKLR 1580
Cdd:cd12375     4 VNYLPQSMTQEELRSLFGAIGPIESCKLVRdKITGQSlgYGFVNYRDPNDARKAINTLNGLD 65
RRM1_HuB cd12771
RNA recognition motif 1 (RRM1) found in vertebrate Hu-antigen B (HuB); This subgroup ...
1520-1580 6.43e-04

RNA recognition motif 1 (RRM1) found in vertebrate Hu-antigen B (HuB); This subgroup corresponds to the RRM1 of HuB, also termed ELAV-like protein 2 (ELAV-2), or ELAV-like neuronal protein 1, or nervous system-specific RNA-binding protein Hel-N1 (Hel-N1), one of the neuronal members of the Hu family. The neuronal Hu proteins play important roles in neuronal differentiation, plasticity and memory. HuB is also expressed in gonads and is up-regulated during neuronal differentiation of embryonic carcinoma P19 cells. Like other Hu proteins, HuB contains three RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). RRM1 and RRM2 may cooperate in binding to an AU-rich RNA element (ARE). RRM3 may help to maintain the stability of the RNA-protein complex, and might also bind to poly(A) tails or be involved in protein-protein interactions.


Pssm-ID: 410164 [Multi-domain]  Cd Length: 83  Bit Score: 40.09  E-value: 6.43e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIEECT-IHFRVQGDN--YGFVTYRYAEEAFAAIESGHKLR 1580
Cdd:cd12771     7 LIVNYLPQNMTQEELKSLFGSIGEIESCKlVRDKITGQSlgYGFVNYIEPKDAEKAINTLNGLR 70
RRM1_SRSF1_like cd12338
RNA recognition motif 1 (RRM1) found in serine/arginine-rich splicing factor 1 (SRSF1) and ...
1520-1580 7.50e-04

RNA recognition motif 1 (RRM1) found in serine/arginine-rich splicing factor 1 (SRSF1) and similar proteins; This subgroup corresponds to the RRM1 in three serine/arginine (SR) proteins: serine/arginine-rich splicing factor 1 (SRSF1 or ASF-1), serine/arginine-rich splicing factor 9 (SRSF9 or SRp30C), and plant pre-mRNA-splicing factor SF2 (SR1). SRSF1 is a shuttling SR protein involved in constitutive and alternative splicing, nonsense-mediated mRNA decay (NMD), mRNA export and translation. It also functions as a splicing-factor oncoprotein that regulates apoptosis and proliferation to promote mammary epithelial cell transformation. SRSF9 has been implicated in the activity of many elements that control splice site selection, the alternative splicing of the glucocorticoid receptor beta in neutrophils and in the gonadotropin-releasing hormone pre-mRNA. It can also interact with other proteins implicated in alternative splicing, including YB-1, rSLM-1, rSLM-2, E4-ORF4, Nop30, and p32. Both, SRSF1 and SRSF9, contain two N-terminal RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a C-terminal RS domains rich in serine-arginine dipeptides. In contrast, SF2 contains two N-terminal RRMs and a C-terminal PSK domain rich in proline, serine and lysine residues.


Pssm-ID: 409775 [Multi-domain]  Cd Length: 72  Bit Score: 39.66  E-value: 7.50e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIEECTIHFRVQGDNYGFVTYRYAEEAFAAIES-------GHKLR 1580
Cdd:cd12338     2 IYVGNLPGDIRERDIEDLFYKYGPILAIDLKNRRRGPPFAFVEFEDPRDAEDAIRGrdgydfdGYRLR 69
RRM2_Spen cd12309
RNA recognition motif 2 (RRM2) found in the Spen (split end) protein family; This subfamily ...
1518-1563 1.13e-03

RNA recognition motif 2 (RRM2) found in the Spen (split end) protein family; This subfamily corresponds to the RRM2 domain in the Spen (split end) protein family which includes RNA binding motif protein 15 (RBM15), putative RNA binding motif protein 15B (RBM15B), and similar proteins found in Metazoa. RBM15, also termed one-twenty two protein 1 (OTT1), conserved in eukaryotes, is a novel mRNA export factor and component of the NXF1 pathway. It binds to NXF1 and serves as receptor for the RNA export element RTE. It also possess mRNA export activity and can facilitate the access of DEAD-box protein DBP5 to mRNA at the nuclear pore complex (NPC). RNA-binding protein 15B (RBM15B), also termed one twenty-two 3 (OTT3), is a paralog of RBM15 and therefore has post-transcriptional regulatory activity. It is a nuclear protein sharing with RBM15 the association with the splicing factor compartment and the nuclear envelope as well as the binding to mRNA export factors NXF1 and Aly/REF. Members in this family belong to the Spen (split end) protein family, which share a domain architecture comprising of three N-terminal RNA recognition motifs (RRMs), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and a C-terminal SPOC (Spen paralog and ortholog C-terminal) domain.


Pssm-ID: 240755 [Multi-domain]  Cd Length: 79  Bit Score: 39.31  E-value: 1.13e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 564333910 1518 RVVFIGKIPGRMTRSELKQRFSVFGEIEECTIH--FRVQGDNYGFVTY 1563
Cdd:cd12309     3 RTLFVGNLEITITEEELRRAFERYGVVEDVDIKrpPRGQGNAYAFVKF 50
RRM1_RBM34 cd12394
RNA recognition motif 1 (RRM1) found in RNA-binding protein 34 (RBM34) and similar proteins; ...
1518-1552 1.24e-03

RNA recognition motif 1 (RRM1) found in RNA-binding protein 34 (RBM34) and similar proteins; This subfamily corresponds to the RRM1 of RBM34, a putative RNA-binding protein containing two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). Although the function of RBM34 remains unclear currently, its RRM domains may participate in mRNA processing. RBM34 may act as an mRNA processing-related protein.


Pssm-ID: 409828 [Multi-domain]  Cd Length: 91  Bit Score: 39.50  E-value: 1.24e-03
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 564333910 1518 RVVFIGKIPGRMTRSELKQRFSVFGEIEecTIHFR 1552
Cdd:cd12394     1 RTVFVGNLPVTVKKKALKKLFKEFGKIE--SVRFR 33
RRM1_RBM19 cd12564
RNA recognition motif 1 (RRM1) found in RNA-binding protein 19 (RBM19) and similar proteins; ...
1525-1573 1.31e-03

RNA recognition motif 1 (RRM1) found in RNA-binding protein 19 (RBM19) and similar proteins; This subgroup corresponds to the RRM1 of RBM19, also termed RNA-binding domain-1 (RBD-1), a nucleolar protein conserved in eukaryotes. It is involved in ribosome biogenesis by processing rRNA. In addition, it is essential for preimplantation development. RBM19 has a unique domain organization containing 6 conserved RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains).


Pssm-ID: 409980 [Multi-domain]  Cd Length: 76  Bit Score: 39.22  E-value: 1.31e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 564333910 1525 IPGRMTRSELKQRFSVFGEIEECTIHFRVQGD--NYGFVTYRYAEEAFAAI 1573
Cdd:cd12564     8 LPSSITEDRLRKLFSAFGTITDVQLKYTKDGKfrRFGFVGFKSEEEAQKAL 58
RRM2_CELF1_2 cd12634
RNA recognition motif 2 (RRM2) found in CUGBP Elav-like family member CELF-1, CELF-2 and ...
1518-1578 1.37e-03

RNA recognition motif 2 (RRM2) found in CUGBP Elav-like family member CELF-1, CELF-2 and similar proteins; This subgroup corresponds to the RRM2 of CELF-1 (also termed BRUNOL-2, or CUG-BP1, or EDEN-BP), CELF-2 (also termed BRUNOL-3, or ETR-3, or CUG-BP2, or NAPOR), both of which belong to the CUGBP1 and ETR-3-like factors (CELF) or BRUNOL (Bruno-like) family of RNA-binding proteins that have been implicated in the regulation of pre-mRNA splicing and in the control of mRNA translation and deadenylation. CELF-1 is strongly expressed in all adult and fetal tissues tested. Human CELF-1 is a nuclear and cytoplasmic RNA-binding protein that regulates multiple aspects of nuclear and cytoplasmic mRNA processing, with implications for onset of type 1 myotonic dystrophy (DM1), a neuromuscular disease associated with an unstable CUG triplet expansion in the 3'-UTR (3'-untranslated region) of the DMPK (myotonic dystrophy protein kinase) gene; it preferentially targets UGU-rich mRNA elements. It has been shown to bind to a Bruno response element, a cis-element involved in translational control of oskar mRNA in Drosophila, and share sequence similarity to Bruno, the Drosophila protein that mediates this process. The Xenopus homolog embryo deadenylation element-binding protein (EDEN-BP) mediates sequence-specific deadenylation of Eg5 mRNA. It binds specifically to the EDEN motif in the 3'-untranslated regions of maternal mRNAs and targets these mRNAs for deadenylation and translational repression. CELF-1 contains three highly conserved RNA recognition motifs (RRMs), also known as RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains): two consecutive RRMs (RRM1 and RRM2) situated in the N-terminal region followed by a linker region and the third RRM (RRM3) close to the C-terminus of the protein. The two N-terminal RRMs of EDEN-BP are necessary for the interaction with EDEN as well as a part of the linker region (between RRM2 and RRM3). Oligomerization of EDEN-BP is required for specific mRNA deadenylation and binding. CELF-2 is expressed in all tissues at some level, but highest in brain, heart, and thymus. It has been implicated in the regulation of nuclear and cytoplasmic RNA processing events, including alternative splicing, RNA editing, stability and translation. CELF-2 shares high sequence identity with CELF-1, but shows different binding specificity; it preferentially binds to sequences with UG repeats and UGUU motifs. It has been shown to bind to a Bruno response element, a cis-element involved in translational control of oskar mRNA in Drosophila, and share sequence similarity to Bruno, the Drosophila protein that mediates this process. It also binds to the 3'-UTR of cyclooxygenase-2 messages, affecting both translation and mRNA stability, and binds to apoB mRNA, regulating its C to U editing. CELF-2 also contains three highly conserved RRMs. It binds to RNA via the first two RRMs, which are also important for localization in the cytoplasm. The splicing activation or repression activity of CELF-2 on some specific substrates is mediated by RRM1/RRM2. Both, RRM1 and RRM2 of CELF-2, can activate cardiac troponin T (cTNT) exon 5 inclusion. In addition, CELF-2 possesses a typical arginine and lysine-rich nuclear localization signal (NLS) in the C-terminus, within RRM3.


Pssm-ID: 410042 [Multi-domain]  Cd Length: 81  Bit Score: 39.27  E-value: 1.37e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 564333910 1518 RVVFIGKIPGRMTRSELKQRFSVFGEIEECTIHFRVQGDNYG--FVTYRYAEEAFAAIESGHK 1578
Cdd:cd12634     2 RKLFIGMVSKKCNENDIRVMFSPFGQIEECRILRGPDGLSRGcaFVTFSTRAMAQNAIKAMHQ 64
RRM1_HuR cd12769
RNA recognition motif 1 (RRM1) found in vertebrate Hu-antigen R (HuR); This subgroup ...
1520-1580 1.46e-03

RNA recognition motif 1 (RRM1) found in vertebrate Hu-antigen R (HuR); This subgroup corresponds to the RRM1 of HuR, also termed ELAV-like protein 1 (ELAV-1), a ubiquitously expressed Hu family member. It has a variety of biological functions mostly related to the regulation of cellular response to DNA damage and other types of stress. HuR has an anti-apoptotic function during early cell stress response; it binds to mRNAs and enhances the expression of several anti-apoptotic proteins, such as p21waf1, p53, and prothymosin alpha. Meanwhile, HuR also has pro-apoptotic function by promoting apoptosis when cell death is unavoidable. Furthermore, HuR may be important in muscle differentiation, adipogenesis, suppression of inflammatory response and modulation of gene expression in response to chronic ethanol exposure and amino acid starvation. Like other Hu proteins, HuR contains three RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). RRM1 and RRM2 may cooperate in binding to an AU-rich RNA element (ARE). RRM3 may help to maintain the stability of the RNA-protein complex, and might also bind to poly(A) tails or be involved in protein-protein interactions.


Pssm-ID: 410162 [Multi-domain]  Cd Length: 82  Bit Score: 39.25  E-value: 1.46e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIEECT-IHFRVQGDN--YGFVTYRYAEEAFAAIESGHKLR 1580
Cdd:cd12769     5 LIVNYLPQNMTQDELRSLFSSIGEVESAKlIRDKVAGHSlgYGFVNYVTAKDAERAINTLNGLR 68
RRM1_MRD1 cd12565
RNA recognition motif 1 (RRM1) found in yeast multiple RNA-binding domain-containing protein 1 ...
1520-1574 1.69e-03

RNA recognition motif 1 (RRM1) found in yeast multiple RNA-binding domain-containing protein 1 (MRD1) and similar proteins; This subgroup corresponds to the RRM1 of MRD1 which is encoded by a novel yeast gene MRD1 (multiple RNA-binding domain). It is well-conserved in yeast and its homologs exist in all eukaryotes. MRD1 is present in the nucleolus and the nucleoplasm. It interacts with the 35 S precursor rRNA (pre-rRNA) and U3 small nucleolar RNAs (snoRNAs). MRD1 is essential for the initial processing at the A0-A2 cleavage sites in the 35 S pre-rRNA. It contains 5 conserved RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), which may play an important structural role in organizing specific rRNA processing events.


Pssm-ID: 409981 [Multi-domain]  Cd Length: 76  Bit Score: 38.70  E-value: 1.69e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIEECTIHFRVQGDN--YGFVTYRYAEEAFAAIE 1574
Cdd:cd12565     3 IIVKNLPKYVTEKRLKEHFSKKGEITDVKVMRTKDGKSrrFGFIGFKSEEEAQKAVK 59
RRM3_CELF1-6 cd12362
RNA recognition motif 3 (RRM3) found in CELF/Bruno-like family of RNA binding proteins CELF1, ...
1520-1577 1.80e-03

RNA recognition motif 3 (RRM3) found in CELF/Bruno-like family of RNA binding proteins CELF1, CELF2, CELF3, CELF4, CELF5, CELF6 and similar proteins; This subgroup corresponds to the RRM3 of the CUGBP1 and ETR-3-like factors (CELF) or BRUNOL (Bruno-like) proteins, a family of structurally related RNA-binding proteins involved in the regulation of pre-mRNA splicing in the nucleus and in the control of mRNA translation and deadenylation in the cytoplasm. The family contains six members: CELF-1 (also termed BRUNOL-2, or CUG-BP1, or NAPOR, or EDEN-BP), CELF-2 (also termed BRUNOL-3, or ETR-3, or CUG-BP2, or NAPOR-2), CELF-3 (also termed BRUNOL-1, or TNRC4, or ETR-1, or CAGH4, or ER DA4), CELF-4 (also termed BRUNOL-4), CELF-5 (also termed BRUNOL-5), CELF-6 (also termed BRUNOL-6). They all contain three highly conserved RNA recognition motifs (RRMs), also known as RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains): two consecutive RRMs (RRM1 and RRM2) situated in the N-terminal region followed by a linker region and the third RRM (RRM3) close to the C-terminus of the protein. The low sequence conservation of the linker region is highly suggestive of a large variety in the co-factors that associate with the various CELF family members. Based on both sequence similarity and function, the CELF family can be divided into two subfamilies, the first containing CELFs 1 and 2, and the second containing CELFs 3, 4, 5, and 6. The different CELF proteins may act through different sites on at least some substrates. Furthermore, CELF proteins may interact with each other in varying combinations to influence alternative splicing in different contexts.


Pssm-ID: 409797 [Multi-domain]  Cd Length: 73  Bit Score: 38.37  E-value: 1.80e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIeectIHFRVQGDN-------YGFVTYRYAEEAFAAIESGH 1577
Cdd:cd12362     1 LFVYHLPNEFTDQDLYQLFAPFGNV----VSAKVFVDKntgrskgFGFVSYDNPLSAQAAIKAMN 61
RRM2_MSI cd12323
RNA recognition motif 2 (RRM2) found in RNA-binding protein Musashi homologs Musashi-1, ...
1520-1563 2.01e-03

RNA recognition motif 2 (RRM2) found in RNA-binding protein Musashi homologs Musashi-1, Musashi-2 and similar proteins; This subfamily corresponds to the RRM2.in Musashi-1 (also termed Msi1), a neural RNA-binding protein putatively expressed in central nervous system (CNS) stem cells and neural progenitor cells, and associated with asymmetric divisions in neural progenitor cells. It is evolutionarily conserved from invertebrates to vertebrates. Musashi-1 is a homolog of Drosophila Musashi and Xenopus laevis nervous system-specific RNP protein-1 (Nrp-1). It has been implicated in the maintenance of the stem-cell state, differentiation, and tumorigenesis. It translationally regulates the expression of a mammalian numb gene by binding to the 3'-untranslated region of mRNA of Numb, encoding a membrane-associated inhibitor of Notch signaling, and further influences neural development. Moreover, Musashi-1 represses translation by interacting with the poly(A)-binding protein and competes for binding of the eukaryotic initiation factor-4G (eIF-4G). Musashi-2 (also termed Msi2) has been identified as a regulator of the hematopoietic stem cell (HSC) compartment and of leukemic stem cells after transplantation of cells with loss and gain of function of the gene. It influences proliferation and differentiation of HSCs and myeloid progenitors, and further modulates normal hematopoiesis and promotes aggressive myeloid leukemia. Both, Musashi-1 and Musashi-2, contain two conserved N-terminal tandem RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), along with other domains of unknown function.


Pssm-ID: 240769 [Multi-domain]  Cd Length: 74  Bit Score: 38.57  E-value: 2.01e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIEECTIHF-----RVQGdnYGFVTY 1563
Cdd:cd12323     2 IFVGGLSANTTEDDVKKYFSQFGKVEDAMLMFdkqtnRHRG--FGFVTF 48
RRM2_CoAA cd12609
RNA recognition motif 2 (RRM2) found in vertebrate RRM-containing coactivator activator ...
1520-1575 2.06e-03

RNA recognition motif 2 (RRM2) found in vertebrate RRM-containing coactivator activator/modulator (CoAA); This subgroup corresponds to the RRM2 of CoAA, also termed RNA-binding protein 14 (RBM14), or paraspeckle protein 2 (PSP2), or synaptotagmin-interacting protein (SYT-interacting protein), a heterogeneous nuclear ribonucleoprotein (hnRNP)-like protein identified as a nuclear receptor coactivator. It mediates transcriptional coactivation and RNA splicing effects in a promoter-preferential manner and is enhanced by thyroid hormone receptor-binding protein (TRBP). CoAA contains two N-terminal RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a TRBP-interacting domain. It stimulates transcription through its interactions with coactivators, such as TRBP and CREB-binding protein CBP/p300, via the TRBP-interacting domain and interaction with an RNA-containing complex, such as DNA-dependent protein kinase-poly(ADP-ribose) polymerase complexes, via the RRMs.


Pssm-ID: 410021 [Multi-domain]  Cd Length: 68  Bit Score: 38.29  E-value: 2.06e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIEECTIhfrVQGdnYGFVTYRYAEEAFAAIES 1575
Cdd:cd12609     3 IFVGNVSATCTSDELRGLFEEFGRVVECDK---VKD--YAFVHMEREEEALAAIEA 53
RRM_Srp1p_AtRSp31_like cd12233
RNA recognition motif (RRM) found in fission yeast pre-mRNA-splicing factor Srp1p, Arabidopsis ...
1526-1580 2.51e-03

RNA recognition motif (RRM) found in fission yeast pre-mRNA-splicing factor Srp1p, Arabidopsis thaliana arginine/serine-rich-splicing factor RSp31 and similar proteins; This subfamily corresponds to the RRM of Srp1p and RRM2 of plant SR splicing factors. Srp1p is encoded by gene srp1 from fission yeast Schizosaccharomyces pombe. It plays a role in the pre-mRNA splicing process, but is not essential for growth. Srp1p is closely related to the SR protein family found in Metazoa. It contains an N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), a glycine hinge and a RS domain in the middle, and a C-terminal domain. The family also includes a novel group of arginine/serine (RS) or serine/arginine (SR) splicing factors existing in plants, such as A. thaliana RSp31, RSp35, RSp41 and similar proteins. Like vertebrate RS splicing factors, these proteins function as plant splicing factors and play crucial roles in constitutive and alternative splicing in plants. They all contain two RRMs at their N-terminus and an RS domain at their C-terminus.


Pssm-ID: 240679 [Multi-domain]  Cd Length: 70  Bit Score: 38.19  E-value: 2.51e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 564333910 1526 PGRMTRSELKQRFSVFGEIEECTIHfrvqgDNYGFVTYRYAEEAFAAIESGHKLR 1580
Cdd:cd12233     9 PGTTREEDIEKLFEPFGPLVRCDIR-----KTFAFVEFEDSEDATKALEALHGSR 58
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
707-1050 2.61e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.45  E-value: 2.61e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910   707 EGSTSATTQEARPRPLSLSEYRQRRRQRQIEAEDRNSQPPAGKWPSLPETPKGLADIPCLVPSAPTKKTAP--------Q 778
Cdd:pfam03154  157 DSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPhtliqqtpT 236
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910   779 RSPIAVPETCSVSVGSNPVSPTPEPSASKLMVSTHSEQVSSheMPLPVRLPPPTLQSMAPAGPTPSTVPTPLPFPPSLPP 858
Cdd:pfam03154  237 LHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPP--MPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPS 314
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910   859 LLPLPASGPGVPSLPPPPLQPPGLPVSMRQMPPDPYTQ-------YAPVPPWPCYPS-------VSPSGY---SCLPPPP 921
Cdd:pfam03154  315 PAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMphikpppTTPIPQLPNPQShkhpphlSGPSPFqmnSNLPPPP 394
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910   922 AM----------------PIVSGTPGTYAVPPTCNVPWVPPPAPVSPYSSSSAYGSLGWGPGLQQPPFWSTVSPPPLSSA 985
Cdd:pfam03154  395 ALkplsslsthhppsahpPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPP 474
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 564333910   986 STGRAVPPPSVEPSNSSAGPPED-----TLPVPVTPSLSSRPAS-PIAPPAEPTKLEAQPVPV-SPQPKHKV 1050
Cdd:pfam03154  475 ITPPSGPPTSTSSAMPGIQPPSSasvssSGPVPAAVSCPLPPVQiKEEALDEAEEPESPPPPPrSPSPEPTV 546
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
978-1132 2.66e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 42.55  E-value: 2.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  978 SPPPLSSASTGRAVPPPSVEPSNSSAGPPEDTLPVPVTPSLSSRPASPIAPPAEP--------TKLEAQPVPVSPQPkhk 1049
Cdd:PRK07994  367 EPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQllaarqqlQRAQGATKAKKSEP--- 443
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910 1050 VSPVVQSPQIKVLPTLSAEGVAIGESESERLKPETQENRPReKPISTATKAVLLPKQSTVAKLPAVHPARLRRLSFLPTP 1129
Cdd:PRK07994  444 AAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKAT-NPVEVKKEPVATPKALKKALEHEKTPELAAKLAAEAIE 522

                  ...
gi 564333910 1130 RAQ 1132
Cdd:PRK07994  523 RDP 525
RRM1_HuD cd12770
RNA recognition motif 1 (RRM1) found in vertebrate Hu-antigen D (HuD); This subgroup ...
1520-1580 3.18e-03

RNA recognition motif 1 (RRM1) found in vertebrate Hu-antigen D (HuD); This subgroup corresponds to the RRM1 of HuD, also termed ELAV-like protein 4 (ELAV-4), or paraneoplastic encephalomyelitis antigen HuD, one of the neuronal members of the Hu family. The neuronal Hu proteins play important roles in neuronal differentiation, plasticity and memory. HuD has been implicated in various aspects of neuronal function, such as the commitment and differentiation of neuronal precursors as well as synaptic remodeling in mature neurons. HuD also functions as an important regulator of mRNA expression in neurons by interacting with AU-rich RNA element (ARE) and stabilizing multiple transcripts. Moreover, HuD regulates the nuclear processing/stability of N-myc pre-mRNA in neuroblastoma cells, as well as the neurite elongation and morphological differentiation. HuD specifically binds poly(A) RNA. Like other Hu proteins, HuD contains three RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). RRM1 and RRM2 may cooperate in binding to an ARE. RRM3 may help to maintain the stability of the RNA-protein complex, and might also bind to poly(A) tails or be involved in protein-protein interactions.


Pssm-ID: 410163 [Multi-domain]  Cd Length: 81  Bit Score: 38.17  E-value: 3.18e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIEECT-IHFRVQGDN--YGFVTYRYAEEAFAAIESGHKLR 1580
Cdd:cd12770     4 LIVNYLPQNMTQEEFRSLFGSIGEIESCKlVRDKITGQSlgYGFVNYIDPKDAEKAINTLNGLR 67
RRM_CSTF2_RNA15_like cd12398
RNA recognition motif (RRM) found in cleavage stimulation factor subunit 2 (CSTF2), yeast ...
1518-1573 3.22e-03

RNA recognition motif (RRM) found in cleavage stimulation factor subunit 2 (CSTF2), yeast ortholog mRNA 3'-end-processing protein RNA15 and similar proteins; This subfamily corresponds to the RRM domain of CSTF2, its tau variant and eukaryotic homologs. CSTF2, also termed cleavage stimulation factor 64 kDa subunit (CstF64), is the vertebrate conterpart of yeast mRNA 3'-end-processing protein RNA15. It is expressed in all somatic tissues and is one of three cleavage stimulatory factor (CstF) subunits required for polyadenylation. CstF64 contains an N-terminal RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), a CstF77-binding domain, a repeated MEARA helical region and a conserved C-terminal domain reported to bind the transcription factor PC-4. During polyadenylation, CstF interacts with the pre-mRNA through the RRM of CstF64 at U- or GU-rich sequences within 10 to 30 nucleotides downstream of the cleavage site. CSTF2T, also termed tauCstF64, is a paralog of the X-linked cleavage stimulation factor CstF64 protein that supports polyadenylation in most somatic cells. It is expressed during meiosis and subsequent haploid differentiation in a more limited set of tissues and cell types, largely in meiotic and postmeiotic male germ cells, and to a lesser extent in brain. The loss of CSTF2T will cause male infertility, as it is necessary for spermatogenesis and fertilization. Moreover, CSTF2T is required for expression of genes involved in morphological differentiation of spermatids, as well as for genes having products that function during interaction of motile spermatozoa with eggs. It promotes germ cell-specific patterns of polyadenylation by using its RRM to bind to different sequence elements downstream of polyadenylation sites than does CstF64. The family also includes yeast ortholog mRNA 3'-end-processing protein RNA15 and similar proteins. RNA15 is a core subunit of cleavage factor IA (CFIA), an essential transcriptional 3'-end processing factor from Saccharomyces cerevisiae. RNA recognition by CFIA is mediated by an N-terminal RRM, which is contained in the RNA15 subunit of the complex. The RRM of RNA15 has a strong preference for GU-rich RNAs, mediated by a binding pocket that is entirely conserved in both yeast and vertebrate RNA15 orthologs.


Pssm-ID: 409832 [Multi-domain]  Cd Length: 77  Bit Score: 37.88  E-value: 3.22e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 564333910 1518 RVVFIGKIPGRMTRSELKQRFSVFGEIeectIHFRVQGDN-------YGFVTYRYAEEAFAAI 1573
Cdd:cd12398     1 RSVFVGNIPYDATEEQLKEIFSEVGPV----VSFRLVTDRetgkpkgYGFCEFRDAETALSAV 59
SF-CC1 TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
1520-1613 3.37e-03

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H.sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 41.83  E-value: 3.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  1520 VFIGKIPGRMTRSELKQRFSVFGEIEECTIH---FRVQGDNYGFVTYRYAEEAFAAIESGHKLRQADEQpfdLCFGGRRQ 1596
Cdd:TIGR01622  217 LYVGNLHFNITEQDLRQIFEPFGEIEFVQLQkdpETGRSKGYGFIQFRDAEQAKEALEKMNGFELAGRP---IKVGLGND 293
                           90
                   ....*....|....*..
gi 564333910  1597 FCKRSYSDLDSNREDFD 1613
Cdd:TIGR01622  294 FTPESDANLAQRFQDQD 310
RRM3_SHARP cd12350
RNA recognition motif 3 (RRM3) found in SMART/HDAC1-associated repressor protein (SHARP) and ...
1518-1563 3.40e-03

RNA recognition motif 3 (RRM3) found in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins; This subfamily corresponds to the RRM3 of SHARP, also termed Msx2-interacting protein (MINT), or SPEN homolog, an estrogen-inducible transcriptional repressor that interacts directly with the nuclear receptor corepressor SMRT, histone deacetylases (HDACs) and components of the NuRD complex. SHARP recruits HDAC activity and binds to the steroid receptor RNA coactivator SRA through four conserved N-terminal RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), further suppressing SRA-potentiated steroid receptor transcription activity. Thus, SHARP has the capacity to modulate both liganded and nonliganded nuclear receptors. SHARP also has been identified as a component of transcriptional repression complexes in Notch/RBP-Jkappa signaling pathways. In addition to the N-terminal RRMs, SHARP possesses a C-terminal SPOC domain (Spen paralog and ortholog C-terminal domain), which is highly conserved among Spen proteins.


Pssm-ID: 409786 [Multi-domain]  Cd Length: 74  Bit Score: 37.77  E-value: 3.40e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 564333910 1518 RVVFIGKIPGRMTRSELKQRFSVFGEIEECTIHFRVQGDNYGFVTY 1563
Cdd:cd12350     3 RTLFIGNLEKTTTYGDLRNIFERFGEIIDIDIKKQNGNPQYAFLQY 48
RRM_PPIE cd12347
RNA recognition motif (RRM) found in cyclophilin-33 (Cyp33) and similar proteins; This ...
1520-1574 3.43e-03

RNA recognition motif (RRM) found in cyclophilin-33 (Cyp33) and similar proteins; This subfamily corresponds to the RRM of Cyp33, also termed peptidyl-prolyl cis-trans isomerase E (PPIase E), or cyclophilin E, or rotamase E. Cyp33 is a nuclear RNA-binding cyclophilin with an N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and a C-terminal PPIase domain. Cyp33 possesses RNA-binding activity and preferentially binds to polyribonucleotide polyA and polyU, but hardly to polyG and polyC. It binds specifically to mRNA, which can stimulate its PPIase activity. Moreover, Cyp33 interacts with the third plant homeodomain (PHD3) zinc finger cassette of the mixed lineage leukemia (MLL) proto-oncoprotein and a poly-A RNA sequence through its RRM domain. It further mediates downregulation of the expression of MLL target genes HOXC8, HOXA9, CDKN1B, and C-MYC, in a proline isomerase-dependent manner. Cyp33 also possesses a PPIase activity that catalyzes cis-trans isomerization of the peptide bond preceding a proline, which has been implicated in the stimulation of folding and conformational changes in folded and unfolded proteins. The PPIase activity can be inhibited by the immunosuppressive drug cyclosporin A.


Pssm-ID: 409783 [Multi-domain]  Cd Length: 75  Bit Score: 37.97  E-value: 3.43e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIEECTIHFRVQGD---NYGFVTYRYAEEAFAAIE 1574
Cdd:cd12347     1 LYVGGLAEEVDEKVLHAAFIPFGDIVDIQIPLDYETEkhrGFAFVEFEEAEDAAAAID 58
RRM1_hnRNPD cd12756
RNA recognition motif 1 (RRM1) found in heterogeneous nuclear ribonucleoprotein D0 (hnRNP D0) ...
1520-1579 3.65e-03

RNA recognition motif 1 (RRM1) found in heterogeneous nuclear ribonucleoprotein D0 (hnRNP D0) and similar proteins; This subgroup corresponds to the RRM1 of hnRNP D0, also termed AU-rich element RNA-binding protein 1, which is a UUAG-specific nuclear RNA binding protein that may be involved in pre-mRNA splicing and telomere elongation. hnRNP D0 contains two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), in the middle and an RGG box rich in glycine and arginine residues in the C-terminal part. Each of RRMs can bind solely to the UUAG sequence specifically.


Pssm-ID: 410150 [Multi-domain]  Cd Length: 74  Bit Score: 37.67  E-value: 3.65e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIEECTIHF-----RVQGdnYGFVTYRYAEEAFAAIESG-HKL 1579
Cdd:cd12756     1 MFIGGLSWDTTKKDLKDYFSKFGEVVDCTLKLdpitgRSRG--FGFVLFKESESVDKVMDQKeHKL 64
PRK10263 PRK10263
DNA translocase FtsK; Provisional
890-1117 3.68e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 42.38  E-value: 3.68e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  890 PPDPYTQYAPVPPWPCYPSVSPSGYSCLP-PPPAMPIVSGTPGTYAVPPTCNVPWVPPPAPvspyssssaygslgwgpgL 968
Cdd:PRK10263  336 PVEPVTQTPPVASVDVPPAQPTVAWQPVPgPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEP------------------L 397
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  969 QQPpfwstvSPPPLSSASTGRAVPPPSVEPSNSSAGPPEDTLPVPVTpslsSRPASPIAPPAEPTKLEAQPVPVSPQPKH 1048
Cdd:PRK10263  398 QQP------VQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAP----EQPVAGNAWQAEEQQSTFAPQSTYQTEQT 467
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910 1049 KVSPVVQSPQIKVLPTLSAEGVAIGESESERLKP---------ETQENRPREK--------PISTATKAVLLPKQSTVAK 1111
Cdd:PRK10263  468 YQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKParpplyyfeEVEEKRAREReqlaawyqPIPEPVKEPEPIKSSLKAP 547

                  ....*.
gi 564333910 1112 LPAVHP 1117
Cdd:PRK10263  548 SVAAVP 553
RRM_G3BP1 cd12463
RNA recognition motif (RRM) found in ras GTPase-activating protein-binding protein 1 (G3BP1) ...
1520-1567 4.67e-03

RNA recognition motif (RRM) found in ras GTPase-activating protein-binding protein 1 (G3BP1) and similar proteins; This subgroup corresponds to the RRM of G3BP1, also termed ATP-dependent DNA helicase VIII (DH VIII), or GAP SH3 domain-binding protein 1, which has been identified as a phosphorylation-dependent endoribonuclease that interacts with the SH3 domain of RasGAP, a multi-functional protein controlling Ras activity. The acidic RasGAP binding domain of G3BP1 harbors an arsenite-regulated phosphorylation site and dominantly inhibits stress granule (SG) formation. G3BP1 also contains an N-terminal nuclear transfer factor 2 (NTF2)-like domain, an RNA recognition motif (RRM domain), and an Arg-Gly-rich region (RGG-rich region, or arginine methylation motif). The RRM domain and RGG-rich region are canonically associated with RNA binding. G3BP1 co-immunoprecipitates with mRNAs. It binds to and cleaves the 3'-untranslated region (3'-UTR) of the c-myc mRNA in a phosphorylation-dependent manner. Thus, G3BP1 may play a role in coupling extra-cellular stimuli to mRNA stability. It has been shown that G3BP1 is a novel Dishevelled-associated protein that is methylated upon Wnt3a stimulation and that arginine methylation of G3BP1 regulates both Ctnnb1 mRNA and canonical Wnt/beta-catenin signaling. Furthermore, G3BP1 can be associated with the 3'-UTR of beta-F1 mRNA in cytoplasmic RNA-granules, demonstrating that G3BP1 may specifically repress the translation of the transcript.


Pssm-ID: 409896 [Multi-domain]  Cd Length: 80  Bit Score: 37.54  E-value: 4.67e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIEECTIHFRVQGDNYGFVTYRYAE 1567
Cdd:cd12463     6 LFVGNLPHDVDKSELKEFFQGYGNVVELRINSGGKLPNFGFVVFDDPE 53
RRM2_Hrp1p cd12330
RNA recognition motif 2 (RRM2) found in yeast nuclear polyadenylated RNA-binding protein 4 ...
1520-1579 4.76e-03

RNA recognition motif 2 (RRM2) found in yeast nuclear polyadenylated RNA-binding protein 4 (Hrp1p or Nab4p) and similar proteins; This subfamily corresponds to the RRM1 of Hrp1p and similar proteins. Hrp1p or Nab4p, also termed cleavage factor IB (CFIB), is a sequence-specific trans-acting factor that is essential for mRNA 3'-end formation in yeast Saccharomyces cerevisiae. It can be UV cross-linked to RNA and specifically recognizes the (UA)6 RNA element required for both, the cleavage and poly(A) addition steps. Moreover, Hrp1p can shuttle between the nucleus and the cytoplasm, and play an additional role in the export of mRNAs to the cytoplasm. Hrp1p also interacts with Rna15p and Rna14p, two components of CF1A. In addition, Hrp1p functions as a factor directly involved in modulating the activity of the nonsense-mediated mRNA decay (NMD) pathway; it binds specifically to a downstream sequence element (DSE)-containing RNA and interacts with Upf1p, a component of the surveillance complex, further triggering the NMD pathway. Hrp1p contains two central RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and an arginine-glycine-rich region harboring repeats of the sequence RGGF/Y.


Pssm-ID: 409767 [Multi-domain]  Cd Length: 78  Bit Score: 37.69  E-value: 4.76e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIEECTIHF-----RVQGdnYGFVTYRyAEEAFAAIESGHKL 1579
Cdd:cd12330     2 IFVGGLAPDVTEEEFKEYFEQFGTVVDAVVMLdhdtgRSRG--FGFVTFD-SESAVEKVLSKGFH 63
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
738-1031 7.96e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 40.92  E-value: 7.96e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  738 AEDRNSQPPAGKWPSLPETPKGLADIPCLVPSAPTKKTAPQRSPIAVPETCSVSVGSNPVSPTPEPS-ASKLMVSTHSEQ 816
Cdd:PHA03307   28 PGDAADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTlAPASPAREGSPT 107
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  817 VSSHEMPLPvrlPPPTLQSMAPaGPTPSTVPTPLPFPPSLPPLLPLPASGPGVPSLPPPPLQPPGLPVSMRQMPPDPYTQ 896
Cdd:PHA03307  108 PPGPSSPDP---PPPTPPPASP-PPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETA 183
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564333910  897 YAPVPPWPCYP-SVSPSGYSCLPPPPAMPIVSGTPGTYAVPPTCNVpwvPPPAPVSPYSSSSAYGSLGWGPGLQQPPFWS 975
Cdd:PHA03307  184 RAPSSPPAEPPpSTPPAAASPRPPRRSSPISASASSPAPAPGRSAA---DDAGASSSDSSSSESSGCGWGPENECPLPRP 260
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 564333910  976 TVSPPPLSSASTGRAVPPPSVEPSNSSAGPPEDTLPVPVTPSLSSRPASPIAPPAE 1031
Cdd:PHA03307  261 APITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASS 316
RRM1_PUB1 cd12614
RNA recognition motif 1 (RRM1) found in yeast nuclear and cytoplasmic polyadenylated ...
1520-1575 9.95e-03

RNA recognition motif 1 (RRM1) found in yeast nuclear and cytoplasmic polyadenylated RNA-binding protein PUB1 and similar proteins; This subgroup corresponds to the RRM1 of yeast protein PUB1, also termed ARS consensus-binding protein ACBP-60, or poly uridylate-binding protein, or poly(U)-binding protein. PUB1 has been identified as both, a heterogeneous nuclear RNA-binding protein (hnRNP) and a cytoplasmic mRNA-binding protein (mRNP), which may be stably bound to a translationally inactive subpopulation of mRNAs within the cytoplasm. It is distributed in both, the nucleus and the cytoplasm, and binds to poly(A)+ RNA (mRNA or pre-mRNA). Although it is one of the major cellular proteins cross-linked by UV light to polyadenylated RNAs in vivo, PUB1 is nonessential for cell growth in yeast. PUB1 also binds to T-rich single stranded DNA (ssDNA); however, there is no strong evidence implicating PUB1 in the mechanism of DNA replication. PUB1 contains three RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a GAR motif (glycine and arginine rich stretch) that is located between RRM2 and RRM3.


Pssm-ID: 410026 [Multi-domain]  Cd Length: 74  Bit Score: 36.64  E-value: 9.95e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 564333910 1520 VFIGKIPGRMTRSELKQRFSVFGEIEECTI--HFRVQGDNYGFVTYRYAEEAFAAIES 1575
Cdd:cd12614     1 LYVGNLDPRVTEDLLQEIFAVTGPVENCKIipDKNSKGVNYGFVEYYDRRSAEIAIQT 58
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH