NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|149022357|gb|EDL79251|]

rCG26274, isoform CRA_b [Rattus norvegicus]

Protein Classification

CWC22 family protein (domain architecture ID 10650471)

CWC22 family protein similar to Candida albicans pre-mRNA-splicing factor CWC22, which is a component of the CWC complex (or CEF1-associated complex) that may be involved in pre-mRNA splicing

CATH:  3.30.70.330
Gene Ontology:  GO:0003723

List of domain hits

Name Accession Description Interval E-value
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
165-346 6.02e-23

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS), "Novel eIF4G domain homologues" (in press).



Pssm-ID: 214713  Cd Length: 200  Bit Score: 97.43  E-value: 6.02e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 149022357   165 KSINGLINKVNISNISIIIQELLQENIVRG--RGLLSRSVLQAQSASPIFTHVYAALVAIINSKFPQIGELILKRLILNF 242
Cdd:smart00543   2 KKVKGLINKLSPSNFESIIKELLKLNNSDKnlRKYILELIFEKAVEEPNFIPAYARLCALLNAKNPDFGSLLLERLQEEF 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 149022357   243 RKG---YRRNDKQLCLTASKFVAHLINQNVAHEVLCLEMLTLLLERPT-------DDSVEVAIGFLKECGLKLT-QVSPR 311
Cdd:smart00543  82 EKGlesEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLKELLNDLTkldpprsDFSVECLLSLLPTCGKDLErEKSPK 161
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 149022357   312 GINAIFERLRNILHES---EIDKRVQYMIEVMFAVRKD 346
Cdd:smart00543 162 LLDEILERLQDYLLKKdktELSSRLRFMLELLIELRKN 199
MA3 super family cl02653
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
457-563 5.28e-20

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


The actual alignment was detected with superfamily member pfam02847:

Pssm-ID: 413424  Cd Length: 113  Bit Score: 86.18  E-value: 5.28e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 149022357  457 FRRTIYLAIQSSL---DFEECAHKLLKMEfAESQTKELCNMILDCCAQQ-RTYEKFFGLLAGRFCMLKKEYMESFESIFK 532
Cdd:pfam02847   1 LKRKIFLILEEYLssgDYDEAARCLLKLG-LPSQHHEVVKVLIECALEEsKTYREFYGLLLERLCEFNLISTKQFEKGFW 79
                          90       100       110
                  ....*....|....*....|....*....|....
gi 149022357  533 EQYDTIHRLET---NKLRNVAKMFAHLLYTDSLP 563
Cdd:pfam02847  80 RVLEDLEDLELdipNAWRNLAEFVARLISDDGLP 113
U2AF_lg super family cl36941
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of ...
825-904 1.50e-06

U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxiliary components of the U2 small nuclear ribonucleoprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


The actual alignment was detected with superfamily member TIGR01642:

Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 51.82  E-value: 1.50e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 149022357  825 DSFSENEKQRARNQDSDNVRRKDGSKSRERSRKHSG--HKGDDDDRYQNGAERRWEKPSRYAEHSRESKRSQDRRREKSP 902
Cdd:TIGR01642   6 DREREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSreRSYREDSRPRDRRRYDSRSPRSLRYSSVRRSRDRPRRRSRSV 85

                  ..
gi 149022357  903 TK 904
Cdd:TIGR01642  86 RS 87
PRK12678 super family cl36163
transcription termination factor Rho; Provisional
729-898 2.84e-05

transcription termination factor Rho; Provisional


The actual alignment was detected with superfamily member PRK12678:

Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 47.98  E-value: 2.84e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 149022357 729 EEVDKLARSHQARDRRREGGREDQRHQEGRTERARSERHRAQHSRDADWRDPPAKHMEDRSHENSyNRVGNGREQGSHRE 808
Cdd:PRK12678 121 APEAAQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEA-ERGERGRREERGRD 199
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 149022357 809 PEDRhgepRKKRRERRDSFSENEKQRARNQDSDNVRRKDGSKSRERSRKHSGHKGDDDDRYQNGAERRwekpsryaehsr 888
Cdd:PRK12678 200 GDDR----DRRDRREQGDRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRG------------ 263
                        170
                 ....*....|
gi 149022357 889 esKRSQDRRR 898
Cdd:PRK12678 264 --RRFRDRDR 271
U2AF_lg super family cl36941
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of ...
9-124 4.35e-05

U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxiliary components of the U2 small nuclear ribonucleoprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


The actual alignment was detected with superfamily member TIGR01642:

Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 47.20  E-value: 4.35e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 149022357    9 KQSSGHDRRESHNSyHRRSSSPEDRYTEQDRSPRDRDYSDYSRSDYERSRRGYSYDDSMESRSRDREKRRererdaDHRK 88
Cdd:TIGR01642  10 EKSRGRDRDRSSER-PRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSRSPRSLRYSSVRRSRDRP------RRRS 82
                          90       100       110
                  ....*....|....*....|....*....|....*.
gi 149022357   89 RSRKSPSPERRspdRGVSQSSTQEEPTSKKKKDEQD 124
Cdd:TIGR01642  83 RSVRSIEQHRR---RLRDRSPSNQWRKDDKKRSLWD 115
 
Name Accession Description Interval E-value
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
165-346 6.02e-23

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS), "Novel eIF4G domain homologues" (in press).


Pssm-ID: 214713  Cd Length: 200  Bit Score: 97.43  E-value: 6.02e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 149022357   165 KSINGLINKVNISNISIIIQELLQENIVRG--RGLLSRSVLQAQSASPIFTHVYAALVAIINSKFPQIGELILKRLILNF 242
Cdd:smart00543   2 KKVKGLINKLSPSNFESIIKELLKLNNSDKnlRKYILELIFEKAVEEPNFIPAYARLCALLNAKNPDFGSLLLERLQEEF 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 149022357   243 RKG---YRRNDKQLCLTASKFVAHLINQNVAHEVLCLEMLTLLLERPT-------DDSVEVAIGFLKECGLKLT-QVSPR 311
Cdd:smart00543  82 EKGlesEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLKELLNDLTkldpprsDFSVECLLSLLPTCGKDLErEKSPK 161
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 149022357   312 GINAIFERLRNILHES---EIDKRVQYMIEVMFAVRKD 346
Cdd:smart00543 162 LLDEILERLQDYLLKKdktELSSRLRFMLELLIELRKN 199
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
457-563 5.28e-20

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 86.18  E-value: 5.28e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 149022357  457 FRRTIYLAIQSSL---DFEECAHKLLKMEfAESQTKELCNMILDCCAQQ-RTYEKFFGLLAGRFCMLKKEYMESFESIFK 532
Cdd:pfam02847   1 LKRKIFLILEEYLssgDYDEAARCLLKLG-LPSQHHEVVKVLIECALEEsKTYREFYGLLLERLCEFNLISTKQFEKGFW 79
                          90       100       110
                  ....*....|....*....|....*....|....
gi 149022357  533 EQYDTIHRLET---NKLRNVAKMFAHLLYTDSLP 563
Cdd:pfam02847  80 RVLEDLEDLELdipNAWRNLAEFVARLISDDGLP 113
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
457-563 2.12e-18

Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains. Ponting (TiBS), "Novel eIF4G domain homologues" (in press).


Pssm-ID: 214714  Cd Length: 113  Bit Score: 81.52  E-value: 2.12e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 149022357   457 FRRTIYLAIQSSL---DFEECAHKLLKMEFAEsQTKELCNMILDCCAQQ-RTYEKFFGLLAGRFCMLKKEYMESFESIFK 532
Cdd:smart00544   1 LKKKIFLIIEEYLssgDTDEAVHCLLELKLPE-QHHEVVKVLLTCALEEkRTYREMYSVLLSRLCQANVISTKQFEKGFW 79
                           90       100       110
                   ....*....|....*....|....*....|....
gi 149022357   533 EQYDTIHRLET---NKLRNVAKMFAHLLYTDSLP 563
Cdd:smart00544  80 RLLEDIEDLELdipNAWRNLAEFVARLISDGILP 113
U2AF_lg TIGR01642
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of ...
825-904 1.50e-06

U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxiliary components of the U2 small nuclear ribonucleoprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 51.82  E-value: 1.50e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 149022357  825 DSFSENEKQRARNQDSDNVRRKDGSKSRERSRKHSG--HKGDDDDRYQNGAERRWEKPSRYAEHSRESKRSQDRRREKSP 902
Cdd:TIGR01642   6 DREREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSreRSYREDSRPRDRRRYDSRSPRSLRYSSVRRSRDRPRRRSRSV 85

                  ..
gi 149022357  903 TK 904
Cdd:TIGR01642  86 RS 87
PRK12678 PRK12678
transcription termination factor Rho; Provisional
729-898 2.84e-05

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 47.98  E-value: 2.84e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 149022357 729 EEVDKLARSHQARDRRREGGREDQRHQEGRTERARSERHRAQHSRDADWRDPPAKHMEDRSHENSyNRVGNGREQGSHRE 808
Cdd:PRK12678 121 APEAAQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEA-ERGERGRREERGRD 199
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 149022357 809 PEDRhgepRKKRRERRDSFSENEKQRARNQDSDNVRRKDGSKSRERSRKHSGHKGDDDDRYQNGAERRwekpsryaehsr 888
Cdd:PRK12678 200 GDDR----DRRDRREQGDRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRG------------ 263
                        170
                 ....*....|
gi 149022357 889 esKRSQDRRR 898
Cdd:PRK12678 264 --RRFRDRDR 271
U2AF_lg TIGR01642
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of ...
9-124 4.35e-05

U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxiliary components of the U2 small nuclear ribonucleoprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 47.20  E-value: 4.35e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 149022357    9 KQSSGHDRRESHNSyHRRSSSPEDRYTEQDRSPRDRDYSDYSRSDYERSRRGYSYDDSMESRSRDREKRRererdaDHRK 88
Cdd:TIGR01642  10 EKSRGRDRDRSSER-PRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSRSPRSLRYSSVRRSRDRP------RRRS 82
                          90       100       110
                  ....*....|....*....|....*....|....*.
gi 149022357   89 RSRKSPSPERRspdRGVSQSSTQEEPTSKKKKDEQD 124
Cdd:TIGR01642  83 RSVRSIEQHRR---RLRDRSPSNQWRKDDKKRSLWD 115
SF-CC1 TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
825-900 1.75e-04

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H. sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 45.30  E-value: 1.75e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 149022357  825 DSFSENEKQRARNQDSDNVRRKdgSKSRERSRKHSGHKGDDDDRYQNGAERRWEKPSRYAEHSRESKRSQDRRREK 900
Cdd:TIGR01622  12 DSSSAGDRDRRRDKGRERSRDR--SRDRERSRSRRRDRHRDRDYYRGRERRSRSRRPNRRYRPREKRRRRGDSYRR 85
PRK12678 PRK12678
transcription termination factor Rho; Provisional
716-868 2.53e-04

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 44.89  E-value: 2.53e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 149022357 716 KGSRKKRQGKARGEEVDKLARSHQARDRRREGGREDQRHQEGRTERARSERHRAQHSRDADWRDPPAKHMEDRSHENSYN 795
Cdd:PRK12678 140 GAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERG 219
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 149022357 796 RVGNGREQGSHREPEDRHGEprkkrrerrDSFSENEKQRARNQDSDNVRRKDGSKSRERSRKHSGHKGDDDDR 868
Cdd:PRK12678 220 RRDGGDRRGRRRRRDRRDAR---------GDDNREDRGDRDGDDGEGRGGRRGRRFRDRDRRGRRGGDGGNER 283
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
165-346 2.86e-04

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 43.12  E-value: 2.86e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 149022357  165 KSINGLINKVNISNISIIIQELLQENI--VRGRGLLSRSVLQAQSASPIFTHVYAALVAIINSKFPQ-IGELILKRLILN 241
Cdd:pfam02854   2 KKVKGILNKLSPENFEKLIKELLKLIMsdPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLRNPTdFGIHLLNRLQEE 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 149022357  242 FRKGY--RRNDKQLCLTASKFVA--------HLINQNVAHEVLCL--EMLTLLLERPTDDSVEVAIGFLKECGLKL-TQV 308
Cdd:pfam02854  82 FEKRFelEENEQGNRRRRLGLVRflgelykfGLLTEKILFECLKEllSSLTKEDLKRDLFNLECLLTLLTTIGKLLeNEK 161
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|.
gi 149022357  309 SPRGINAIFERLRNIL---HESEIDKRVQYMIEVMFAVRKD 346
Cdd:pfam02854 162 LPKLMDQFLDEIQKYVlskDDPKLSSRLRFMLQDLIELRKN 202
U2AF_lg TIGR01642
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of ...
786-906 1.59e-03

U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxiliary components of the U2 small nuclear ribonucleoprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 42.19  E-value: 1.59e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 149022357  786 EDRSHENSYNRVGNGREQGSHREPEDRhgeprkkrrerrdsfsenEKQRARNQDSDNVRRKDGSKSRERSRKHSGHKGDD 865
Cdd:TIGR01642   5 PDREREKSRGRDRDRSSERPRRRSRDR------------------SRFRDRHRRSRERSYREDSRPRDRRRYDSRSPRSL 66
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 149022357  866 DDRYQNGA---ERRWEKPSRYAEHSRESKRSQDRRREKSPTKHK 906
Cdd:TIGR01642  67 RYSSVRRSrdrPRRRSRSVRSIEQHRRRLRDRSPSNQWRKDDKK 110
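
Each alignment row in the hits above appears to follow one layout: a sequence label, a start coordinate, the aligned string (with `-` marking a gap), and an end coordinate. The Python sketch below assumes that layout holds for every row and checks that the number of aligned residues matches the reported coordinates. The names `ROW_PATTERN`, `parse_row` and `row_is_consistent` are illustrative only and are not part of any NCBI tool.

```python
import re

# Assumed row layout, inferred from the alignment rows on this page:
#   <label> <start> <aligned string, '-' marks a gap> <end>
ROW_PATTERN = re.compile(
    r"^(?P<label>.+?)\s+(?P<start>\d+)\s+(?P<aligned>[A-Za-z-]+)\s+(?P<end>\d+)$"
)


def parse_row(line):
    """Split one alignment row into (label, start, aligned, end)."""
    match = ROW_PATTERN.match(line.strip())
    if match is None:
        raise ValueError(f"not an alignment row: {line!r}")
    return (
        match["label"],
        int(match["start"]),
        match["aligned"],
        int(match["end"]),
    )


def row_is_consistent(line):
    """Return True when the non-gap residue count equals end - start + 1."""
    _, start, aligned, end = parse_row(line)
    residues = len(aligned.replace("-", ""))
    return residues == end - start + 1


if __name__ == "__main__":
    # Last query row of the MIF4G (smart00543) alignment, copied from this page.
    row = "gi 149022357   312 GINAIFERLRNILHES---EIDKRVQYMIEVMFAVRKD 346"
    print(parse_row(row)[0])       # gi 149022357
    print(row_is_consistent(row))  # True: 35 residues span positions 312-346
```

Lowercase letters in a row (for example `dkt`) are counted as residues here, since they still consume sequence positions; only `-` is treated as a gap.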
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
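
To illustrate the E-value threshold listed above, the following Python sketch records the hits from the full list on this page as plain records and filters them against a cutoff. `Hit`, `HITS` and `filter_hits` are hypothetical names, the values were copied by hand from this page rather than fetched from NCBI, and the `<=` comparison is an assumption about how the threshold is applied.

```python
from typing import List, NamedTuple


class Hit(NamedTuple):
    name: str
    accession: str
    start: int      # first query residue of the hit
    end: int        # last query residue of the hit
    e_value: float

    @property
    def length(self) -> int:
        """Number of query residues covered by the interval."""
        return self.end - self.start + 1


# Hits transcribed from the domain hit list on this page.
HITS = [
    Hit("MIF4G", "smart00543", 165, 346, 6.02e-23),
    Hit("MA3", "pfam02847", 457, 563, 5.28e-20),
    Hit("MA3", "smart00544", 457, 563, 2.12e-18),
    Hit("U2AF_lg", "TIGR01642", 825, 904, 1.50e-06),
    Hit("PRK12678", "PRK12678", 729, 898, 2.84e-05),
    Hit("U2AF_lg", "TIGR01642", 9, 124, 4.35e-05),
    Hit("SF-CC1", "TIGR01622", 825, 900, 1.75e-04),
    Hit("PRK12678", "PRK12678", 716, 868, 2.53e-04),
    Hit("MIF4G", "pfam02854", 165, 346, 2.86e-04),
    Hit("U2AF_lg", "TIGR01642", 786, 906, 1.59e-03),
]


def filter_hits(hits: List[Hit], threshold: float) -> List[Hit]:
    """Keep hits at or below the threshold, most significant first."""
    kept = [hit for hit in hits if hit.e_value <= threshold]
    return sorted(kept, key=lambda hit: hit.e_value)


if __name__ == "__main__":
    # A stricter cutoff than the page's 0.01 leaves only the strongest hits.
    for hit in filter_hits(HITS, 1e-10):
        print(hit.name, hit.accession, f"{hit.start}-{hit.end}", hit.e_value)
```

With the page's own threshold of 0.01 every listed hit passes, which is expected because the page only reports hits that met that cutoff.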
