NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|166694|gb|AAA73382|]
View 

unnamed protein product [Arabidopsis thaliana]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
RRM_UHM_SPF45 cd12647
RNA recognition motif (RRM) found in UHM domain of 45 kDa-splicing factor (SPF45) and similar ...
277-373 1.38e-56

RNA recognition motif (RRM) found in UHM domain of 45 kDa-splicing factor (SPF45) and similar proteins; This subgroup corresponds to the RRM of SPF45, also termed RNA-binding motif protein 17 (RBM17), an RNA-binding protein consisting of an unstructured N-terminal region, followed by a G-patch motif and a C-terminal U2AF (U2 auxiliary factor) homology motifs (UHM) that harbors a RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain) and an Arg-Xaa-Phe sequence motif. SPF45 regulates alternative splicing of the apoptosis regulatory gene FAS (also known as CD95). It induces exon 6 skipping in FAS pre-mRNA through the UHM domain that binds to tryptophan-containing linear peptide motifs (UHM ligand motifs, ULMs) present in the 3' splice site-recognizing factors U2AF65, SF1 and SF3b155.


:

Pssm-ID: 410051 [Multi-domain]  Cd Length: 95  Bit Score: 180.17  E-value: 1.38e-56
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694   277 PTRVLLLRNMVGPGQVDDELEDEVGGECAKYGTVTRVLIFEITEPnfPVHEAVRIFVQFSRPEETTKALVDLDGRYFGGR 356
Cdd:cd12647   1 PSKVVLLRNMVGPGEVDEDLEPEVKEECEKYGKVTKVVIFEIPGA--PDDEAVRIFVEFERVESAIKAVVDLNGRFFGGR 78
                        90
                ....*....|....*..
gi 166694   357 TVRATFYDEEKFSKNEL 373
Cdd:cd12647  79 TVKASFYDLDRFRRLDL 95
half-pint super family cl31128
poly-U binding splicing factor, half-pint family; The proteins represented by this model ...
170-374 1.42e-21

poly-U binding splicing factor, half-pint family; The proteins represented by this model contain three RNA recognition motifs (rrm: pfam00076) and have been characterized as poly-pyrimidine tract binding proteins associated with RNA splicing factors. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.


The actual alignment was detected with superfamily member TIGR01645:

Pssm-ID: 130706 [Multi-domain]  Cd Length: 612  Bit Score: 96.29  E-value: 1.42e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694     170 PARVVVEVLGREDPRLLPGNVDGFSIGKSKPSGLGVGAGGqmtpaqrMMPKMGWKQ-GQGLGKSEQGIPTPLMAKKTdrr 248
Cdd:TIGR01645 386 AIPSLVAPPGLVAPTEINPSFLASPRKKMKREKLPVTFGA-------LDDTLAWKEpSKEDQTSEDGKMLAIMGEAA--- 455
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694     249 agvIVNASENKSSSAEK---KVVKSVNINGEP-----------------------------TRVLLLRNMVGPGQVDDEL 296
Cdd:TIGR01645 456 ---AALALEPKKKKKEKegeELQPKLVMNSEDaslasqegmsirgnsarhlvmqklmrtnrSNVIVLRNMVTPQDIDEFL 532
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 166694     297 EDEVGGECAKYGTVTRVLIF-EITEPNFPVHEAVRIFVQFSRPEETTKALVDLDGRYFGGRTVRATFYDEEKFSKNELA 374
Cdd:TIGR01645 533 EGEIREECGKFGVVDRVIINfEKQGEEEDAEIIVKIFVEFSDSMEVDRAKAALDGRFFGGRTVVAEAYDQILFDHADLS 611
PHA02682 super family cl31817
ORF080 virion core protein; Provisional
10-89 2.06e-03

ORF080 virion core protein; Provisional


The actual alignment was detected with superfamily member PHA02682:

Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 39.46  E-value: 2.06e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694     10 PPPTDDEKPSGNSSSVWSRSTKMAPPTLRKPPAfAPPQTILRPLNKPKPIVSAPYKPPPNSSQSVlIPANESAPSHQPAL 89
Cdd:PHA02682 117 PPATAPTCPPPAVCPAPARPAPACPPSTRQCPP-APPLPTPKPAPAAKPIFLHNQLPPPDYPAAS-CPTIETAPAASPVL 194
 
Name Accession Description Interval E-value
RRM_UHM_SPF45 cd12647
RNA recognition motif (RRM) found in UHM domain of 45 kDa-splicing factor (SPF45) and similar ...
277-373 1.38e-56

RNA recognition motif (RRM) found in UHM domain of 45 kDa-splicing factor (SPF45) and similar proteins; This subgroup corresponds to the RRM of SPF45, also termed RNA-binding motif protein 17 (RBM17), an RNA-binding protein consisting of an unstructured N-terminal region, followed by a G-patch motif and a C-terminal U2AF (U2 auxiliary factor) homology motifs (UHM) that harbors a RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain) and an Arg-Xaa-Phe sequence motif. SPF45 regulates alternative splicing of the apoptosis regulatory gene FAS (also known as CD95). It induces exon 6 skipping in FAS pre-mRNA through the UHM domain that binds to tryptophan-containing linear peptide motifs (UHM ligand motifs, ULMs) present in the 3' splice site-recognizing factors U2AF65, SF1 and SF3b155.


Pssm-ID: 410051 [Multi-domain]  Cd Length: 95  Bit Score: 180.17  E-value: 1.38e-56
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694   277 PTRVLLLRNMVGPGQVDDELEDEVGGECAKYGTVTRVLIFEITEPnfPVHEAVRIFVQFSRPEETTKALVDLDGRYFGGR 356
Cdd:cd12647   1 PSKVVLLRNMVGPGEVDEDLEPEVKEECEKYGKVTKVVIFEIPGA--PDDEAVRIFVEFERVESAIKAVVDLNGRFFGGR 78
                        90
                ....*....|....*..
gi 166694   357 TVRATFYDEEKFSKNEL 373
Cdd:cd12647  79 TVKASFYDLDRFRRLDL 95
half-pint TIGR01645
poly-U binding splicing factor, half-pint family; The proteins represented by this model ...
170-374 1.42e-21

poly-U binding splicing factor, half-pint family; The proteins represented by this model contain three RNA recognition motifs (rrm: pfam00076) and have been characterized as poly-pyrimidine tract binding proteins associated with RNA splicing factors. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.


Pssm-ID: 130706 [Multi-domain]  Cd Length: 612  Bit Score: 96.29  E-value: 1.42e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694     170 PARVVVEVLGREDPRLLPGNVDGFSIGKSKPSGLGVGAGGqmtpaqrMMPKMGWKQ-GQGLGKSEQGIPTPLMAKKTdrr 248
Cdd:TIGR01645 386 AIPSLVAPPGLVAPTEINPSFLASPRKKMKREKLPVTFGA-------LDDTLAWKEpSKEDQTSEDGKMLAIMGEAA--- 455
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694     249 agvIVNASENKSSSAEK---KVVKSVNINGEP-----------------------------TRVLLLRNMVGPGQVDDEL 296
Cdd:TIGR01645 456 ---AALALEPKKKKKEKegeELQPKLVMNSEDaslasqegmsirgnsarhlvmqklmrtnrSNVIVLRNMVTPQDIDEFL 532
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 166694     297 EDEVGGECAKYGTVTRVLIF-EITEPNFPVHEAVRIFVQFSRPEETTKALVDLDGRYFGGRTVRATFYDEEKFSKNELA 374
Cdd:TIGR01645 533 EGEIREECGKFGVVDRVIINfEKQGEEEDAEIIVKIFVEFSDSMEVDRAKAALDGRFFGGRTVVAEAYDQILFDHADLS 611
RRM_1 smart00361
RNA recognition motif;
292-361 1.97e-20

RNA recognition motif;


Pssm-ID: 214637 [Multi-domain]  Cd Length: 70  Bit Score: 84.38  E-value: 1.97e-20
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694      292 VDDELEDEVGGECAKYGTVTRVLIFEITEPNFPVHEAVRIFVQFSRPEETTKALVDLDGRYFGGRTVRAT 361
Cdd:smart00361   1 KDEDFERELKEEEEYFGEVGKINKIYIDDVGYENHKRGNVYITFERSEDAARAIVDLNGRYFDGRLVKAE 70
G-patch pfam01585
G-patch domain; This domain is found in a number of RNA binding proteins, and is also found in ...
214-251 1.68e-13

G-patch domain; This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines.


Pssm-ID: 396249 [Multi-domain]  Cd Length: 45  Bit Score: 64.07  E-value: 1.68e-13
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 166694     214 AQRMMPKMGWKQGQGLGKSEQGIPTPLMAKKTDRRAGV 251
Cdd:pfam01585   5 GFKLLQKMGWKEGQGLGKNEQGIAEPIEAKIKKDRRGL 42
G_patch smart00443
glycine rich nucleic binding domain; A predicted glycine rich nucleic binding domain found in ...
209-254 2.57e-13

glycine rich nucleic binding domain; A predicted glycine rich nucleic binding domain found in the splicing factor 45, SON DNA binding protein and D-type Retrovirus- polyproteins.


Pssm-ID: 197727 [Multi-domain]  Cd Length: 47  Bit Score: 63.72  E-value: 2.57e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 166694      209 GQMTPAQRMMPKMGWKQGQGLGKSEQGIPTPLMAKKTDRRAGVIVN 254
Cdd:smart00443   2 STSNIGAKLLRKMGWKEGQGLGKNEQGIVEPISAEIKKDRKGLGAV 47
U2AF_lg TIGR01642
U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of ...
269-372 6.02e-12

U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxilliary components of the U2 small nuclear ribonuclearprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 66.84  E-value: 6.02e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694     269 KSVNINGEPTRVLLLRNMVGPGQ--VDDELED---EVGGECAKYGTVTRVlifEITEPNfPVHEAV----RIFVQFSRPE 339
Cdd:TIGR01642 400 SILQIGGKPTKVVQLTNLVTGDDlmDDEEYEEiyeDVKTEFSKYGPLINI---VIPRPN-GDRNSTpgvgKVFLEYADVR 475
                          90       100       110
                  ....*....|....*....|....*....|...
gi 166694     340 ETTKALVDLDGRYFGGRTVRATFYDEEKFSKNE 372
Cdd:TIGR01642 476 SAEKAMEGMNGRKFNDRVVVAAFYGEDCYKAGD 508
RRM_1 pfam00076
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic ...
293-359 1.05e-04

RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have an N terminal rrm which is included in the seed. There is a second region towards the C terminus that has some features characteriztic of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.


Pssm-ID: 425453 [Multi-domain]  Cd Length: 70  Bit Score: 39.91  E-value: 1.05e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 166694     293 DDELEDEvggeCAKYGTVTRV-LIFEITEPNFPVheavrIFVQFSRPEETTKALVDLDGRYFGGRTVR 359
Cdd:pfam00076  12 EEDLKDL----FSKFGPIKSIrLVRDETGRSKGF-----AFVEFEDEEDAEKAIEALNGKELGGRELK 70
PHA02682 PHA02682
ORF080 virion core protein; Provisional
10-89 2.06e-03

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 39.46  E-value: 2.06e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694     10 PPPTDDEKPSGNSSSVWSRSTKMAPPTLRKPPAfAPPQTILRPLNKPKPIVSAPYKPPPNSSQSVlIPANESAPSHQPAL 89
Cdd:PHA02682 117 PPATAPTCPPPAVCPAPARPAPACPPSTRQCPP-APPLPTPKPAPAAKPIFLHNQLPPPDYPAAS-CPTIETAPAASPVL 194
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
10-106 5.15e-03

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 38.88  E-value: 5.15e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694      10 PPPTDDEKPSGNSSSVWSRST-KMAPPTLRKPPAFAPPQTILRPLNK-PKPIVSAPYKPPP---NSSQSVLIPANESAPS 84
Cdd:pfam05539 243 PSGSPQHPPSTTSQDQSTTGDgQEHTQRRKTPPATSNRRSPHSTATPpPTTKRQETGRPTPrptATTQSGSSPPHSSPPG 322
                          90       100
                  ....*....|....*....|..
gi 166694      85 HQPALVGVTSSVIEEYDPARPN 106
Cdd:pfam05539 323 VQANPTTQNLVDCKELDPPKPN 344
 
Name Accession Description Interval E-value
RRM_UHM_SPF45 cd12647
RNA recognition motif (RRM) found in UHM domain of 45 kDa-splicing factor (SPF45) and similar ...
277-373 1.38e-56

RNA recognition motif (RRM) found in UHM domain of 45 kDa-splicing factor (SPF45) and similar proteins; This subgroup corresponds to the RRM of SPF45, also termed RNA-binding motif protein 17 (RBM17), an RNA-binding protein consisting of an unstructured N-terminal region, followed by a G-patch motif and a C-terminal U2AF (U2 auxiliary factor) homology motifs (UHM) that harbors a RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain) and an Arg-Xaa-Phe sequence motif. SPF45 regulates alternative splicing of the apoptosis regulatory gene FAS (also known as CD95). It induces exon 6 skipping in FAS pre-mRNA through the UHM domain that binds to tryptophan-containing linear peptide motifs (UHM ligand motifs, ULMs) present in the 3' splice site-recognizing factors U2AF65, SF1 and SF3b155.


Pssm-ID: 410051 [Multi-domain]  Cd Length: 95  Bit Score: 180.17  E-value: 1.38e-56
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694   277 PTRVLLLRNMVGPGQVDDELEDEVGGECAKYGTVTRVLIFEITEPnfPVHEAVRIFVQFSRPEETTKALVDLDGRYFGGR 356
Cdd:cd12647   1 PSKVVLLRNMVGPGEVDEDLEPEVKEECEKYGKVTKVVIFEIPGA--PDDEAVRIFVEFERVESAIKAVVDLNGRFFGGR 78
                        90
                ....*....|....*..
gi 166694   357 TVRATFYDEEKFSKNEL 373
Cdd:cd12647  79 TVKASFYDLDRFRRLDL 95
RRM_UHM_SPF45_PUF60 cd12374
RNA recognition motif (RRM) found in UHM domain of 45 kDa-splicing factor (SPF45) and similar ...
278-364 1.39e-37

RNA recognition motif (RRM) found in UHM domain of 45 kDa-splicing factor (SPF45) and similar proteins; This subfamily corresponds to the RRM found in UHM domain of 45 kDa-splicing factor (SPF45 or RBM17), poly(U)-binding-splicing factor PUF60 (FIR or Hfp or RoBP1 or Siah-BP1), and similar proteins. SPF45 is an RNA-binding protein consisting of an unstructured N-terminal region, followed by a G-patch motif and a C-terminal U2AF (U2 auxiliary factor) homology motifs (UHM) that harbors a RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain) and an Arg-Xaa-Phe sequence motif. SPF45 regulates alternative splicing of the apoptosis regulatory gene FAS (also known as CD95). It induces exon 6 skipping in FAS pre-mRNA through the UHM domain that binds to tryptophan-containing linear peptide motifs (UHM ligand motifs, ULMs) present in the 3' splice site-recognizing factors U2AF65, SF1 and SF3b155. PUF60 is an essential splicing factor that functions as a poly-U RNA-binding protein required to reconstitute splicing in depleted nuclear extracts. Its function is enhanced through interaction with U2 auxiliary factor U2AF65. PUF60 also controls human c-myc gene expression by binding and inhibiting the transcription factor far upstream sequence element (FUSE)-binding-protein (FBP), an activator of c-myc promoters. PUF60 contains two central RRMs and a C-terminal UHM domain.


Pssm-ID: 409809 [Multi-domain]  Cd Length: 85  Bit Score: 130.42  E-value: 1.39e-37
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694   278 TRVLLLRNMVGPGQVDDELEDEVGGECAKYGTVTRVLIFEitEPNFPVHEAVRIFVQFSRPEETTKALVDLDGRYFGGRT 357
Cdd:cd12374   1 TKILVLRNMVGPGEIDEDLKDEIKEECSKYGKVLNVIIHE--VASSEADDAVRVFVEFEDADEAIKAFRALNGRFFGGRK 78

                ....*..
gi 166694   358 VRATFYD 364
Cdd:cd12374  79 VKARFYD 85
RRM3_UHM_PUF60 cd12648
RNA recognition motif 3 (RRM3) found in UHM domain of poly(U)-binding-splicing factor PUF60 ...
278-373 9.02e-34

RNA recognition motif 3 (RRM3) found in UHM domain of poly(U)-binding-splicing factor PUF60 and similar proteins; This subgroup corresponds to the RRM3 of PUF60, also termed FUSE-binding protein-interacting repressor (FBP-interacting repressor or FIR), or Ro-binding protein 1 (RoBP1), or Siah-binding protein 1 (Siah-BP1), an essential splicing factor that functions as a poly-U RNA-binding protein required to reconstitute splicing in depleted nuclear extracts. Its function is enhanced through interaction with U2 auxiliary factor U2AF65. PUF60 also controls human c-myc gene expression by binding and inhibiting the transcription factor far upstream sequence element (FUSE)-binding-protein (FBP), an activator of c-myc promoters. PUF60 contains two central RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a C-terminal U2AF (U2 auxiliary factor) homology motifs (UHM) that harbors another RRM and binds to tryptophan-containing linear peptide motifs (UHM ligand motifs, ULMs) in several nuclear proteins. The research indicates that PUF60 binds FUSE as a dimer, and only the first two RRM domains participate in the single-stranded DNA recognition.


Pssm-ID: 410052 [Multi-domain]  Cd Length: 98  Bit Score: 120.98  E-value: 9.02e-34
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694   278 TRVLLLRNMVGPGQVDDELEDEVGGECAKYGTVTRVLIF---EITEPNFPVheAVRIFVQFSRPEETTKALVDLDGRYFG 354
Cdd:cd12648   1 SRVMVLRNMVGPEDIDDDLEGEVTEECGKFGAVNRVIIYqekQGEEEDAEI--IVKIFVEFSMPSEAEKAIQALNGRWFG 78
                        90
                ....*....|....*....
gi 166694   355 GRTVRATFYDEEKFSKNEL 373
Cdd:cd12648  79 GRKVVAELYDQTRFDNSDL 97
half-pint TIGR01645
poly-U binding splicing factor, half-pint family; The proteins represented by this model ...
170-374 1.42e-21

poly-U binding splicing factor, half-pint family; The proteins represented by this model contain three RNA recognition motifs (rrm: pfam00076) and have been characterized as poly-pyrimidine tract binding proteins associated with RNA splicing factors. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.


Pssm-ID: 130706 [Multi-domain]  Cd Length: 612  Bit Score: 96.29  E-value: 1.42e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694     170 PARVVVEVLGREDPRLLPGNVDGFSIGKSKPSGLGVGAGGqmtpaqrMMPKMGWKQ-GQGLGKSEQGIPTPLMAKKTdrr 248
Cdd:TIGR01645 386 AIPSLVAPPGLVAPTEINPSFLASPRKKMKREKLPVTFGA-------LDDTLAWKEpSKEDQTSEDGKMLAIMGEAA--- 455
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694     249 agvIVNASENKSSSAEK---KVVKSVNINGEP-----------------------------TRVLLLRNMVGPGQVDDEL 296
Cdd:TIGR01645 456 ---AALALEPKKKKKEKegeELQPKLVMNSEDaslasqegmsirgnsarhlvmqklmrtnrSNVIVLRNMVTPQDIDEFL 532
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 166694     297 EDEVGGECAKYGTVTRVLIF-EITEPNFPVHEAVRIFVQFSRPEETTKALVDLDGRYFGGRTVRATFYDEEKFSKNELA 374
Cdd:TIGR01645 533 EGEIREECGKFGVVDRVIINfEKQGEEEDAEIIVKIFVEFSDSMEVDRAKAALDGRFFGGRTVVAEAYDQILFDHADLS 611
RRM_1 smart00361
RNA recognition motif;
292-361 1.97e-20

RNA recognition motif;


Pssm-ID: 214637 [Multi-domain]  Cd Length: 70  Bit Score: 84.38  E-value: 1.97e-20
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694      292 VDDELEDEVGGECAKYGTVTRVLIFEITEPNFPVHEAVRIFVQFSRPEETTKALVDLDGRYFGGRTVRAT 361
Cdd:smart00361   1 KDEDFERELKEEEEYFGEVGKINKIYIDDVGYENHKRGNVYITFERSEDAARAIVDLNGRYFDGRLVKAE 70
RRM3_U2AF65 cd12232
RNA recognition motif 3 (RRM3) found in U2 large nuclear ribonucleoprotein auxiliary factor ...
278-361 6.61e-17

RNA recognition motif 3 (RRM3) found in U2 large nuclear ribonucleoprotein auxiliary factor U2AF 65 kDa subunit (U2AF65) and similar proteins; This subfamily corresponds to the RRM3 of U2AF65 and dU2AF50. U2AF65, also termed U2AF2, is the large subunit of U2 small nuclear ribonucleoprotein (snRNP) auxiliary factor (U2AF), which has been implicated in the recruitment of U2 snRNP to pre-mRNAs and is a highly conserved heterodimer composed of large and small subunits. U2AF65 specifically recognizes the intron polypyrimidine tract upstream of the 3' splice site and promotes binding of U2 snRNP to the pre-mRNA branchpoint. U2AF65 also plays an important role in the nuclear export of mRNA. It facilitates the formation of a messenger ribonucleoprotein export complex, containing both the NXF1 receptor and the RNA substrate. Moreover, U2AF65 interacts directly and specifically with expanded CAG RNA, and serves as an adaptor to link expanded CAG RNA to NXF1 for RNA export. U2AF65 contains an N-terminal RS domain rich in arginine and serine, followed by a proline-rich segment and three C-terminal RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). The N-terminal RS domain stabilizes the interaction of U2 snRNP with the branch point (BP) by contacting the branch region, and further promotes base pair interactions between U2 snRNA and the BP. The proline-rich segment mediates protein-protein interactions with the RRM domain of the small U2AF subunit (U2AF35 or U2AF1). The RRM1 and RRM2 are sufficient for specific RNA binding, while RRM3 is responsible for protein-protein interactions. The family also includes Splicing factor U2AF 50 kDa subunit (dU2AF50), the Drosophila ortholog of U2AF65. dU2AF50 functions as an essential pre-mRNA splicing factor in flies. It associates with intronless mRNAs and plays a significant and unexpected role in the nuclear export of a large number of intronless mRNAs.


Pssm-ID: 409679 [Multi-domain]  Cd Length: 89  Bit Score: 74.93  E-value: 6.61e-17
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694   278 TRVLLLRNMVGPGQVDD-----ELEDEVGGECAKYGTVTRVLIFEITEPNFPVHEAVRIFVQFSRPEETTKALVDLDGRY 352
Cdd:cd12232   1 SRVLCLLNMVTPEELEDdeeyeEILEDVKEECSKYGKVLSVVIPRPEAEGVDVPGVGKVFVEFEDVEDAQKAQKALAGRK 80

                ....*....
gi 166694   353 FGGRTVRAT 361
Cdd:cd12232  81 FDGRTVVAS 89
RRM3_RBM39_like cd12285
RNA recognition motif 3 (RRM3) found in vertebrate RNA-binding protein 39 (RBM39) and similar ...
278-364 2.20e-14

RNA recognition motif 3 (RRM3) found in vertebrate RNA-binding protein 39 (RBM39) and similar proteins; This subfamily corresponds to the RRM3 of RBM39, also termed hepatocellular carcinoma protein 1, or RNA-binding region-containing protein 2, or splicing factor HCC1, ia nuclear autoantigen that contains an N-terminal arginine/serine rich (RS) motif and three RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). An octapeptide sequence called the RS-ERK motif is repeated six times in the RS region of RBM39. Based on the specific domain composition, RBM39 has been classified into a family of non-snRNP (small nuclear ribonucleoprotein) splicing factors that are usually not complexed to snRNAs.


Pssm-ID: 409727 [Multi-domain]  Cd Length: 85  Bit Score: 67.96  E-value: 2.20e-14
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694   278 TRVLLLRNMVGPGQ-----VDDELEDEVGGECAKYGTVTRVLIFEitepnfpVHEAVRIFVQFSRPEETTKALVDLDGRY 352
Cdd:cd12285   1 SRCVILKNMFDPAEetednWDDEIKEDVIEECSKYGPVLHIYVDK-------NSPQGNVYVKFKTIEAAQKCVQAMNGRW 73
                        90
                ....*....|..
gi 166694   353 FGGRTVRATFYD 364
Cdd:cd12285  74 FDGRQITAAYVP 85
G-patch pfam01585
G-patch domain; This domain is found in a number of RNA binding proteins, and is also found in ...
214-251 1.68e-13

G-patch domain; This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines.


Pssm-ID: 396249 [Multi-domain]  Cd Length: 45  Bit Score: 64.07  E-value: 1.68e-13
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 166694     214 AQRMMPKMGWKQGQGLGKSEQGIPTPLMAKKTDRRAGV 251
Cdd:pfam01585   5 GFKLLQKMGWKEGQGLGKNEQGIAEPIEAKIKKDRRGL 42
G_patch smart00443
glycine rich nucleic binding domain; A predicted glycine rich nucleic binding domain found in ...
209-254 2.57e-13

glycine rich nucleic binding domain; A predicted glycine rich nucleic binding domain found in the splicing factor 45, SON DNA binding protein and D-type Retrovirus- polyproteins.


Pssm-ID: 197727 [Multi-domain]  Cd Length: 47  Bit Score: 63.72  E-value: 2.57e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 166694      209 GQMTPAQRMMPKMGWKQGQGLGKSEQGIPTPLMAKKTDRRAGVIVN 254
Cdd:smart00443   2 STSNIGAKLLRKMGWKEGQGLGKNEQGIVEPISAEIKKDRKGLGAV 47
RRM2_TatSF1_like cd12282
RNA recognition motif 2 (RRM2) found in HIV Tat-specific factor 1 (Tat-SF1) and similar ...
279-364 2.26e-12

RNA recognition motif 2 (RRM2) found in HIV Tat-specific factor 1 (Tat-SF1) and similar proteins; This subfamily corresponds to the RRM2 of Tat-SF1 and CUS2. Tat-SF1 is the cofactor for stimulation of transcriptional elongation by human immunodeficiency virus-type 1 (HIV-1) Tat. It is a substrate of an associated cellular kinase. Tat-SF1 contains two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a highly acidic carboxyl-terminal half. The family also includes CUS2, a yeast homolog of human Tat-SF1. CUS2 interacts with U2 RNA in splicing extracts and functions as a splicing factor that aids assembly of the splicing-competent U2 snRNP in vivo. CUS2 also associates with PRP11 that is a subunit of the conserved splicing factor SF3a. Like Tat-SF1, CUS2 contains two RRMs as well.


Pssm-ID: 409724 [Multi-domain]  Cd Length: 91  Bit Score: 62.26  E-value: 2.26e-12
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694   279 RVLLLRNMVGPGQVDD------ELEDEVGGECAKYGTVTRVLIFeitepnfPVHEAVRIFVQFSRPEETTKALVDLDGRY 352
Cdd:cd12282   2 KVVILKNMFHPKEFEEdpelinEIKEDLREECEKFGQVKKVVVF-------DRHPDGVASVKFKEPEEADKCIQALNGRW 74
                        90
                ....*....|..
gi 166694   353 FGGRTVRATFYD 364
Cdd:cd12282  75 FAGRKLEAETWD 86
U2AF_lg TIGR01642
U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of ...
269-372 6.02e-12

U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxilliary components of the U2 small nuclear ribonuclearprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 66.84  E-value: 6.02e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694     269 KSVNINGEPTRVLLLRNMVGPGQ--VDDELED---EVGGECAKYGTVTRVlifEITEPNfPVHEAV----RIFVQFSRPE 339
Cdd:TIGR01642 400 SILQIGGKPTKVVQLTNLVTGDDlmDDEEYEEiyeDVKTEFSKYGPLINI---VIPRPN-GDRNSTpgvgKVFLEYADVR 475
                          90       100       110
                  ....*....|....*....|....*....|...
gi 166694     340 ETTKALVDLDGRYFGGRTVRATFYDEEKFSKNE 372
Cdd:TIGR01642 476 SAEKAMEGMNGRKFNDRVVVAAFYGEDCYKAGD 508
SF-CC1 TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
245-374 1.50e-08

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H.sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 56.08  E-value: 1.50e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694     245 TDRRAGVIVNASENKSSSAEKKVVKSVNIngePTRVLLLRNMVGPGQ-----VDDELEDEVGGECAKYGTVTRVLIfeiT 319
Cdd:TIGR01622 369 TDEPPAVIPETQILKPKAETSFVPVNVNL---ASRCLVLSNMFDPATeeepnWDKEIEDDVREECSKYGGVVHIYV---D 442
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 166694     320 EPNfpvhEAVRIFVQFSRPEETTKALVDLDGRYFGGRTVRATFYDEEKFSKNELA 374
Cdd:TIGR01622 443 DKN----SAGDIYLKFDSVQAAEAAIKALNGRYFGGKMITAAFVVDAVYSKSRLP 493
RRM_UHMK1 cd12465
RNA recognition motif (RRM) found in U2AF homology motif kinase 1 (UHMK1) and similar proteins; ...
277-363 4.29e-06

RNA recognition motif (RRM) found in U2AF homology motif kinase 1 (UHMK1) and similar proteins; This subgroup corresponds to the RRM of UHMK1. UHMK1, also termed kinase interacting with stathmin (KIS) or P-CIP2, is a serine/threonine protein kinase functionally related to RNA metabolism and neurite outgrowth. It contains an N-terminal kinase domain and a C-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), with high homology to the corresponding motif of the mammalian U2 small nuclear ribonucleoprotein auxiliary factor U2AF 65 kDa subunit (U2AF65 or U2AF2). UHMK1 targets two key regulators of cell proliferation and migration, the cyclin-dependent kinase (CDK) inhibitor p27Kip1 and the microtubule-destabilizing protein stathmin. It plays a critical role during vascular wound repair by preventing excessive vascular smooth muscle cell (VSMC) migration into the vascular lesion. Moreover, UHMK1 may control cell migration and neurite outgrowth by interacting with and phosphorylating the splicing factor SF1, thereby probably contributing to the control of protein expression. Furthermore, UHMK1 may be functionally related to microtubule dynamics and axon development. It localizes to RNA granules, interacts with three proteins found in RNA granules (KIF3A, NonO, and eEF1A), and further enhances the local translation. UHMK1 is highly expressed in regions of the brain implicated in schizophrenia and may play a role in susceptibility to schizophrenia.


Pssm-ID: 409898 [Multi-domain]  Cd Length: 88  Bit Score: 44.56  E-value: 4.29e-06
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694   277 PTRVLLLRNMVGPG--QVDDELED---EVGGECAKYGTVTRVLIfeitepnfPVHEAVR--IFVQFSRPEETTKALVDLD 349
Cdd:cd12465   2 PTPVLRLLNVIDDSylQSEEEYEDiveDVREECQKYGPVVSLLI--------PKENPGKgqVFVEYANAGDSKAAQKMLT 73
                        90
                ....*....|....
gi 166694   350 GRYFGGRTVRATFY 363
Cdd:cd12465  74 GRMFDGKFVVATFY 87
RRM_SF cd00590
RNA recognition motif (RRM) superfamily; RRM, also known as RBD (RNA binding domain) or RNP ...
297-359 1.53e-05

RNA recognition motif (RRM) superfamily; RRM, also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).


Pssm-ID: 409669 [Multi-domain]  Cd Length: 72  Bit Score: 42.66  E-value: 1.53e-05
                        10        20        30        40        50        60
                ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 166694   297 EDEVGGECAKYGTVTRVLIFEITEPNFPVHeavrIFVQFSRPEETTKALVDLDGRYFGGRTVR 359
Cdd:cd00590  12 EEDLRELFSKFGEVVSVRIVRDRDGKSKGF----AFVEFESPEDAEKALEALNGTELGGRPLK 70
RRM_CNOT4 cd12438
RNA recognition motif (RRM) found in Eukaryotic CCR4-NOT transcription complex subunit 4 (NOT4) ...
306-362 4.70e-05

RNA recognition motif (RRM) found in Eukaryotic CCR4-NOT transcription complex subunit 4 (NOT4) and similar proteins; This subfamily corresponds to the RRM of NOT4, also termed CCR4-associated factor 4, or E3 ubiquitin-protein ligase CNOT4, or potential transcriptional repressor NOT4Hp, a component of the CCR4-NOT complex, a global negative regulator of RNA polymerase II transcription. NOT4 functions as an ubiquitin-protein ligase (E3). It contains an N-terminal C4C4 type RING finger motif, followed by a RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). The RING fingers may interact with a subset of ubiquitin-conjugating enzymes (E2s), including UbcH5B, and mediate protein-protein interactions. T


Pssm-ID: 409872 [Multi-domain]  Cd Length: 98  Bit Score: 41.75  E-value: 4.70e-05
                        10        20        30        40        50
                ....*....|....*....|....*....|....*....|....*....|....*..
gi 166694   306 KYGTVTRVLIFEITEPNFPVHEAVRIFVQFSRPEETTKALVDLDGRYFGGRTVRATF 362
Cdd:cd12438  31 QYGKIKKIVINRSTSYAGSQGPSASAYVTYSRKEDALRAIQAVDGFVLDGRTLKASF 87
RRM_U2AF35B cd12539
RNA recognition motif (RRM) found in splicing factor U2AF 35 kDa subunit B (U2AF35B); This ...
277-362 6.59e-05

RNA recognition motif (RRM) found in splicing factor U2AF 35 kDa subunit B (U2AF35B); This subgroup corresponds to the RRM of U2AF35B, also termed zinc finger CCCH domain-containing protein 60 (C3H60), which is one of the small subunits of U2 small nuclear ribonucleoprotein (snRNP) auxiliary factor (U2AF). It has been implicated in the recruitment of U2 snRNP to pre-mRNAs and is a highly conserved heterodimer composed of large and small subunits. Members in this family are mainly found in plant. They show high sequence homology to vertebrates U2AF35 that directly binds to the 3' splice site of the conserved AG dinucleotide and performs multiple functions in the splicing process in a substrate-specific manner. U2AF35B contains two N-terminal zinc fingers, a central RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and a C-terminal arginine/serine (SR)-rich domain. In contrast to U2AF35, U2AF35B has a plant-specific conserved C-terminal region containing SERE motif(s), which may have an important function specific to higher plants.


Pssm-ID: 409955 [Multi-domain]  Cd Length: 102  Bit Score: 41.62  E-value: 6.59e-05
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694   277 PTrvLLLRNM-----------------VGPGQVDDELED---EVGGECAKYGTVTRVLIFEitepNFPVHEAVRIFVQFS 336
Cdd:cd12539   1 PT--ILLSNMyqnpimnaplgaaqgipLDPRELQEHFEDfyeDVFEELSKFGEVEALNVCD----NLGDHMVGNVYVKFR 74
                        90       100
                ....*....|....*....|....*.
gi 166694   337 RPEETTKALVDLDGRYFGGRTVRATF 362
Cdd:cd12539  75 DEEHAAAALKALQGRFYAGRPIIVEF 100
RRM smart00360
RNA recognition motif;
297-359 9.18e-05

RNA recognition motif;


Pssm-ID: 214636 [Multi-domain]  Cd Length: 73  Bit Score: 40.27  E-value: 9.18e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 166694      297 EDEVGGECAKYGTVTRVLIFEITEPNFPVHEAvriFVQFSRPEETTKALVDLDGRYFGGRTVR 359
Cdd:smart00360  13 EEELRELFSKFGKVESVRLVRDKETGKSKGFA---FVEFESEEDAEKALEALNGKELDGRPLK 72
RRM_1 pfam00076
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic ...
293-359 1.05e-04

RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have an N terminal rrm which is included in the seed. There is a second region towards the C terminus that has some features characteriztic of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.


Pssm-ID: 425453 [Multi-domain]  Cd Length: 70  Bit Score: 39.91  E-value: 1.05e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 166694     293 DDELEDEvggeCAKYGTVTRV-LIFEITEPNFPVheavrIFVQFSRPEETTKALVDLDGRYFGGRTVR 359
Cdd:pfam00076  12 EEDLKDL----FSKFGPIKSIrLVRDETGRSKGF-----AFVEFEDEEDAEKAIEALNGKELGGRELK 70
RRM4_I_PABPs cd12381
RNA recognition motif 4 (RRM4) found in type I polyadenylate-binding proteins; This subfamily ...
293-356 1.64e-04

RNA recognition motif 4 (RRM4) found in type I polyadenylate-binding proteins; This subfamily corresponds to the RRM4 of type I poly(A)-binding proteins (PABPs), highly conserved proteins that bind to the poly(A) tail present at the 3' ends of most eukaryotic mRNAs. They have been implicated in theThe CD corresponds to the RRM. regulation of poly(A) tail length during the polyadenylation reaction, translation initiation, mRNA stabilization by influencing the rate of deadenylation and inhibition of mRNA decapping. The family represents type I polyadenylate-binding proteins (PABPs), including polyadenylate-binding protein 1 (PABP-1 or PABPC1), polyadenylate-binding protein 3 (PABP-3 or PABPC3), polyadenylate-binding protein 4 (PABP-4 or APP-1 or iPABP), polyadenylate-binding protein 5 (PABP-5 or PABPC5), polyadenylate-binding protein 1-like (PABP-1-like or PABPC1L), polyadenylate-binding protein 1-like 2 (PABPC1L2 or RBM32), polyadenylate-binding protein 4-like (PABP-4-like or PABPC4L), yeast polyadenylate-binding protein, cytoplasmic and nuclear (PABP or ACBP-67), and similar proteins. PABP-1 is an ubiquitously expressed multifunctional protein that may play a role in 3' end formation of mRNA, translation initiation, mRNA stabilization, protection of poly(A) from nuclease activity, mRNA deadenylation, inhibition of mRNA decapping, and mRNP maturation. Although PABP-1 is thought to be a cytoplasmic protein, it is also found in the nucleus. PABP-1 may be involved in nucleocytoplasmic trafficking and utilization of mRNP particles. PABP-1 contains four copies of RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), a less well conserved linker region, and a proline-rich C-terminal conserved domain (CTD). PABP-3 is a testis-specific poly(A)-binding protein specifically expressed in round spermatids. It is mainly found in mammalian and may play an important role in the testis-specific regulation of mRNA homeostasis. PABP-3 shows significant sequence similarity to PABP-1. However, it binds to poly(A) with a lower affinity than PABP-1. Moreover, PABP-1 possesses an A-rich sequence in its 5'-UTR and allows binding of PABP and blockage of translation of its own mRNA. In contrast, PABP-3 lacks the A-rich sequence in its 5'-UTR. PABP-4 is an inducible poly(A)-binding protein (iPABP) that is primarily localized to the cytoplasm. It shows significant sequence similarity to PABP-1 as well. The RNA binding properties of PABP-1 and PABP-4 appear to be identical. PABP-5 is encoded by PABPC5 gene within the X-specific subinterval, and expressed in fetal brain and in a range of adult tissues in mammalian, such as ovary and testis. It may play an important role in germ cell development. Moreover, unlike other PABPs, PABP-5 contains only four RRMs, but lacks both the linker region and the CTD. PABP-1-like and PABP-1-like 2 are the orthologs of PABP-1. PABP-4-like is the ortholog of PABP-5. Their cellular functions remain unclear. The family also includes the yeast PABP, a conserved poly(A) binding protein containing poly(A) tails that can be attached to the 3'-ends of mRNAs. The yeast PABP and its homologs may play important roles in the initiation of translation and in mRNA decay. Like vertebrate PABP-1, the yeast PABP contains four RRMs, a linker region, and a proline-rich CTD as well. The first two RRMs are mainly responsible for specific binding to poly(A). The proline-rich region may be involved in protein-protein interactions.


Pssm-ID: 409815 [Multi-domain]  Cd Length: 79  Bit Score: 39.95  E-value: 1.64e-04
                        10        20        30        40        50        60
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 166694   293 DDELEDEVggecAKYGTVTRVLIfeITEPN-----FPvheavriFVQFSRPEETTKALVDLDGRYFGGR 356
Cdd:cd12381  15 DEKLREEF----SPFGTITSAKV--MTDEGgrskgFG-------FVCFSSPEEATKAVTEMNGRIIGGK 70
RRM_HP0827_like cd12399
RNA recognition motif (RRM) found in Helicobacter pylori HP0827 protein and similar proteins; ...
332-362 1.11e-03

RNA recognition motif (RRM) found in Helicobacter pylori HP0827 protein and similar proteins; This subfamily corresponds to the RRM of H. pylori HP0827, a putative ssDNA-binding protein 12rnp2 precursor, containing one RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). The ssDNA binding may be important in activation of HP0827.


Pssm-ID: 409833 [Multi-domain]  Cd Length: 75  Bit Score: 37.50  E-value: 1.11e-03
                        10        20        30
                ....*....|....*....|....*....|.
gi 166694   332 FVQFSRPEETTKALVDLDGRYFGGRTVRATF 362
Cdd:cd12399  44 FVELQEEESAEKAIAKLDGTDFMGRTIRVNE 74
RRM_U2AFBPL cd12540
RNA recognition motif (RRM) found in U2 small nuclear ribonucleoprotein auxiliary factor 35 ...
294-362 1.87e-03

RNA recognition motif (RRM) found in U2 small nuclear ribonucleoprotein auxiliary factor 35 kDa subunit-related protein 1 (U2AFBPL) and similar proteins; This subgroup corresponds to the RRM of U2AFBPL, a human homolog of the imprinted mouse gene U2afbp-rs, which encodes a U2 small nuclear ribonucleoprotein auxiliary factor 35 kDa subunit-related protein 1 (U2AFBPL), also termed CCCH type zinc finger, RNA-binding motif and serine/arginine rich protein 1 (U2AF1RS1), or U2 small nuclear RNA auxiliary factor 1-like 1 (U2AF1L1). Although the biological role of U2AFBPL remains unclear, it shows high sequence homology to splicing factor U2AF 35 kDa subunit (U2AF35 or U2AF1) that directly binds to the 3' splice site of the conserved AG dinucleotide and performs multiple functions in the splicing process in a substrate-specific manner. Like U2AF35, U2AFBPL contains two N-terminal zinc fingers, a central RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and a C-terminal arginine/serine (SR)-rich domain.


Pssm-ID: 409956 [Multi-domain]  Cd Length: 105  Bit Score: 37.63  E-value: 1.87e-03
                        10        20        30        40        50        60
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 166694   294 DELEDEVGGECAKYGTVTRVLIFEITEPnfpvHEAVRIFVQFSRPEETTKALVDLDGRYFGGRTVRATF 362
Cdd:cd12540  39 LEFYEDVLPEFKKFGKVVQFKVCCNSEP----HLRGNVYVQYQSEEEALKAFTSFNGRWYAGKQLQCEF 103
PHA02682 PHA02682
ORF080 virion core protein; Provisional
10-89 2.06e-03

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 39.46  E-value: 2.06e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694     10 PPPTDDEKPSGNSSSVWSRSTKMAPPTLRKPPAfAPPQTILRPLNKPKPIVSAPYKPPPNSSQSVlIPANESAPSHQPAL 89
Cdd:PHA02682 117 PPATAPTCPPPAVCPAPARPAPACPPSTRQCPP-APPLPTPKPAPAAKPIFLHNQLPPPDYPAAS-CPTIETAPAASPVL 194
RRM_U2AF35_like cd12287
RNA recognition motif (RRM) found in U2 small nuclear ribonucleoprotein auxiliary factor U2AF ...
294-362 3.32e-03

RNA recognition motif (RRM) found in U2 small nuclear ribonucleoprotein auxiliary factor U2AF 35 kDa subunit (U2AF35) and similar proteins; This subfamily corresponds to the RRM in U2 small nuclear ribonucleoprotein (snRNP) auxiliary factor (U2AF) which has been implicated in the recruitment of U2 snRNP to pre-mRNAs. It is a highly conserved heterodimer composed of large and small subunits; this family includes the small subunit of U2AF (U2AF35 or U2AF1) and U2AF 35 kDa subunit B (U2AF35B or C3H60). U2AF35 directly binds to the 3' splice site of the conserved AG dinucleotide and performs multiple functions in the splicing process in a substrate-specific manner. It promotes U2 snRNP binding to the branch-point sequences of introns through association with the large subunit of U2AF (U2AF65 or U2AF2). Although the biological role of U2AF35B remains unclear, it shows high sequence homolgy to U2AF35, which contains two N-terminal zinc fingers, a central RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and a C-terminal arginine/serine (SR) -rich segment interrupted by glycines. In contrast to U2AF35, U2AF35B has a plant-specific conserved C-terminal region containing SERE motif(s), which may have an important function specific to higher plants.


Pssm-ID: 409729 [Multi-domain]  Cd Length: 101  Bit Score: 36.86  E-value: 3.32e-03
                        10        20        30        40        50        60
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 166694   294 DELEDEVGGECAKYGTVTRVLIFEitepNFPVHEAVRIFVQFSRPEETTKALVDLDGRYFGGRTVRATF 362
Cdd:cd12287  35 DEFYEDVFLELSRFGEIEDLVVCS----NLNDHLLGNVYVKFESEEDAEAALQALNGRYYAGRPLYPEL 99
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
10-106 5.15e-03

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 38.88  E-value: 5.15e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 166694      10 PPPTDDEKPSGNSSSVWSRST-KMAPPTLRKPPAFAPPQTILRPLNK-PKPIVSAPYKPPP---NSSQSVLIPANESAPS 84
Cdd:pfam05539 243 PSGSPQHPPSTTSQDQSTTGDgQEHTQRRKTPPATSNRRSPHSTATPpPTTKRQETGRPTPrptATTQSGSSPPHSSPPG 322
                          90       100
                  ....*....|....*....|..
gi 166694      85 HQPALVGVTSSVIEEYDPARPN 106
Cdd:pfam05539 323 VQANPTTQNLVDCKELDPPKPN 344
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH