NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|18409250|ref|NP_564961|]
View 

Tetratricopeptide repeat (TPR)-like superfamily protein [Arabidopsis thaliana]

Protein Classification

pentatricopeptide repeat-containing protein( domain architecture ID 1000585)

pentatricopeptide repeat (PPR)-containing protein may form anti-parallel alpha helices and bind single-stranded RNA in a sequence-specific and modular manner

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PLN03077 super family cl33629
Protein ECB2; Provisional
102-768 3.33e-154

Protein ECB2; Provisional


The actual alignment was detected with superfamily member PLN03077:

Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 471.26  E-value: 3.33e-154
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  102 VFPSVLRACAGSREHlSVGGKVHGRIIKGGVDDDAVIETSLLCMYGQTGNLSDAEKVFDGMPVRDLVAWSTLVSSCLENG 181
Cdd:PLN03077  88 AYVALFRLCEWKRAV-EEGSRVCSRALSSHPSLGVRLGNAMLSMFVRFGELVHAWYVFGKMPERDLFSWNVLVGGYAKAG 166
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  182 EVVKALRMFKCMVDDGVEPDAVTMISVVEGCAELGCLRIARSVHGQITRKMFDLDETLCNSLLTMYSKCGDLLSSERIFE 261
Cdd:PLN03077 167 YFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGREVHAHVVRFGFELDVDVVNALITMYVKCGDVVSARLVFD 246
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  262 KIAKKNAVSWTAMISSYNRGEFSEKALRSFSEMIKSGIEPNLVTLYSVLSSCGLIGLIREGKSVHGFAVRRELDPNYeSL 341
Cdd:PLN03077 247 RMPRRDCISWNAMISGYFENGECLEGLELFFTMRELSVDPDLMTITSVISACELLGDERLGREMHGYVVKTGFAVDV-SV 325
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  342 SLALVELYAECGKLSDCETVLRVVSDRNIVAWNSLISLYAHRGMVIQALGLFRQMVTQRIKPDAFTLASSISACENAGLV 421
Cdd:PLN03077 326 CNSLIQMYLSLGSWGEAEKVFSRMETKDAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSACACLGDL 405
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  422 PLGKQIHGHVIRTD-VSDEFVQNSLIDMYSKSGSVDSASTVFNQIKHRSVVTWNSMLCGFSQNGNSVEAISLFDYMYHSy 500
Cdd:PLN03077 406 DVGVKLHELAERKGlISYVVVANALIEMYSKCKCIDKALEVFHNIPEKDVISWTSIIAGLRLNNRCFEALIFFRQMLLT- 484
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  501 LEMNEVTFLAVIQACSSIGSLEKGKWVHHKLIISGLK-DLFTDTALIDMYAKCGDLNAAETVFRAmSSRSIVSWSSMINA 579
Cdd:PLN03077 485 LKPNSVTLIAALSACARIGALMCGKEIHAHVLRTGIGfDGFLPNALLDLYVRCGRMNYAWNQFNS-HEKDVVSWNILLTG 563
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  580 YGMHGRIGSAISTFNQMVESGTKPNEVVFMNVLSACGHSGSVEEGKYYFNLMK-SFGVSPNSEHFACFIDLLSRSGDLKE 658
Cdd:PLN03077 564 YVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEeKYSITPNLKHYACVVDLLGRAGKLTE 643
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  659 AYRTIKEMPFLADASVWGSLVNGCRIHQKMDIIKAIKNDLSDIVTDDTGYYTLLSNIYAEEGEWEEFRRLRSAMKSSNLK 738
Cdd:PLN03077 644 AYNFINKMPITPDPAVWGALLNACRIHRHVELGELAAQHIFELDPNSVGYYILLCNLYADAGKWDEVARVRKTMRENGLT 723
                        650       660       670
                 ....*....|....*....|....*....|
gi 18409250  739 KVPGYSAIEIDQKVFRFGAGEENRIQTDEI 768
Cdd:PLN03077 724 VDPGCSWVEVKGKVHAFLTDDESHPQIKEI 753
 
Name Accession Description Interval E-value
PLN03077 PLN03077
Protein ECB2; Provisional
102-768 3.33e-154

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 471.26  E-value: 3.33e-154
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  102 VFPSVLRACAGSREHlSVGGKVHGRIIKGGVDDDAVIETSLLCMYGQTGNLSDAEKVFDGMPVRDLVAWSTLVSSCLENG 181
Cdd:PLN03077  88 AYVALFRLCEWKRAV-EEGSRVCSRALSSHPSLGVRLGNAMLSMFVRFGELVHAWYVFGKMPERDLFSWNVLVGGYAKAG 166
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  182 EVVKALRMFKCMVDDGVEPDAVTMISVVEGCAELGCLRIARSVHGQITRKMFDLDETLCNSLLTMYSKCGDLLSSERIFE 261
Cdd:PLN03077 167 YFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGREVHAHVVRFGFELDVDVVNALITMYVKCGDVVSARLVFD 246
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  262 KIAKKNAVSWTAMISSYNRGEFSEKALRSFSEMIKSGIEPNLVTLYSVLSSCGLIGLIREGKSVHGFAVRRELDPNYeSL 341
Cdd:PLN03077 247 RMPRRDCISWNAMISGYFENGECLEGLELFFTMRELSVDPDLMTITSVISACELLGDERLGREMHGYVVKTGFAVDV-SV 325
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  342 SLALVELYAECGKLSDCETVLRVVSDRNIVAWNSLISLYAHRGMVIQALGLFRQMVTQRIKPDAFTLASSISACENAGLV 421
Cdd:PLN03077 326 CNSLIQMYLSLGSWGEAEKVFSRMETKDAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSACACLGDL 405
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  422 PLGKQIHGHVIRTD-VSDEFVQNSLIDMYSKSGSVDSASTVFNQIKHRSVVTWNSMLCGFSQNGNSVEAISLFDYMYHSy 500
Cdd:PLN03077 406 DVGVKLHELAERKGlISYVVVANALIEMYSKCKCIDKALEVFHNIPEKDVISWTSIIAGLRLNNRCFEALIFFRQMLLT- 484
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  501 LEMNEVTFLAVIQACSSIGSLEKGKWVHHKLIISGLK-DLFTDTALIDMYAKCGDLNAAETVFRAmSSRSIVSWSSMINA 579
Cdd:PLN03077 485 LKPNSVTLIAALSACARIGALMCGKEIHAHVLRTGIGfDGFLPNALLDLYVRCGRMNYAWNQFNS-HEKDVVSWNILLTG 563
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  580 YGMHGRIGSAISTFNQMVESGTKPNEVVFMNVLSACGHSGSVEEGKYYFNLMK-SFGVSPNSEHFACFIDLLSRSGDLKE 658
Cdd:PLN03077 564 YVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEeKYSITPNLKHYACVVDLLGRAGKLTE 643
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  659 AYRTIKEMPFLADASVWGSLVNGCRIHQKMDIIKAIKNDLSDIVTDDTGYYTLLSNIYAEEGEWEEFRRLRSAMKSSNLK 738
Cdd:PLN03077 644 AYNFINKMPITPDPAVWGALLNACRIHRHVELGELAAQHIFELDPNSVGYYILLCNLYADAGKWDEVARVRKTMRENGLT 723
                        650       660       670
                 ....*....|....*....|....*....|
gi 18409250  739 KVPGYSAIEIDQKVFRFGAGEENRIQTDEI 768
Cdd:PLN03077 724 VDPGCSWVEVKGKVHAFLTDDESHPQIKEI 753
E_motif pfam20431
E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) ...
685-747 4.32e-13

E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) proteins which contain a DYW deaminase domain. The DYW domain is required for RNA editing, a process that deaminates specific cytidines to uridines. This motif, together with the E+ motif, precedes the DYW domain and, although their role is not clear, they are essential in the RNA editing reaction. The E/E+ motifs may contain two degenerate PPR motifs that could be involved in RNA or protein binding.


Pssm-ID: 466580 [Multi-domain]  Cd Length: 63  Bit Score: 64.49  E-value: 4.32e-13
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 18409250   685 HQKMDIIKAIKNDLSDIVTDDTGYYTLLSNIYAEEGEWEEFRRLRSAMKSSNLKKVPGYSAIE 747
Cdd:pfam20431   1 YSNVELAEKAANILLELEKTNDGNYTLLSNIYAYAGRWKDVERIRKLMKSSGIKKRPGCSWIE 63
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
371-405 1.76e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 39.36  E-value: 1.76e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 18409250   371 VAWNSLISLYAHRGMVIQALGLFRQMVTQRIKPDA 405
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
 
Name Accession Description Interval E-value
PLN03077 PLN03077
Protein ECB2; Provisional
102-768 3.33e-154

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 471.26  E-value: 3.33e-154
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  102 VFPSVLRACAGSREHlSVGGKVHGRIIKGGVDDDAVIETSLLCMYGQTGNLSDAEKVFDGMPVRDLVAWSTLVSSCLENG 181
Cdd:PLN03077  88 AYVALFRLCEWKRAV-EEGSRVCSRALSSHPSLGVRLGNAMLSMFVRFGELVHAWYVFGKMPERDLFSWNVLVGGYAKAG 166
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  182 EVVKALRMFKCMVDDGVEPDAVTMISVVEGCAELGCLRIARSVHGQITRKMFDLDETLCNSLLTMYSKCGDLLSSERIFE 261
Cdd:PLN03077 167 YFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGREVHAHVVRFGFELDVDVVNALITMYVKCGDVVSARLVFD 246
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  262 KIAKKNAVSWTAMISSYNRGEFSEKALRSFSEMIKSGIEPNLVTLYSVLSSCGLIGLIREGKSVHGFAVRRELDPNYeSL 341
Cdd:PLN03077 247 RMPRRDCISWNAMISGYFENGECLEGLELFFTMRELSVDPDLMTITSVISACELLGDERLGREMHGYVVKTGFAVDV-SV 325
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  342 SLALVELYAECGKLSDCETVLRVVSDRNIVAWNSLISLYAHRGMVIQALGLFRQMVTQRIKPDAFTLASSISACENAGLV 421
Cdd:PLN03077 326 CNSLIQMYLSLGSWGEAEKVFSRMETKDAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSACACLGDL 405
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  422 PLGKQIHGHVIRTD-VSDEFVQNSLIDMYSKSGSVDSASTVFNQIKHRSVVTWNSMLCGFSQNGNSVEAISLFDYMYHSy 500
Cdd:PLN03077 406 DVGVKLHELAERKGlISYVVVANALIEMYSKCKCIDKALEVFHNIPEKDVISWTSIIAGLRLNNRCFEALIFFRQMLLT- 484
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  501 LEMNEVTFLAVIQACSSIGSLEKGKWVHHKLIISGLK-DLFTDTALIDMYAKCGDLNAAETVFRAmSSRSIVSWSSMINA 579
Cdd:PLN03077 485 LKPNSVTLIAALSACARIGALMCGKEIHAHVLRTGIGfDGFLPNALLDLYVRCGRMNYAWNQFNS-HEKDVVSWNILLTG 563
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  580 YGMHGRIGSAISTFNQMVESGTKPNEVVFMNVLSACGHSGSVEEGKYYFNLMK-SFGVSPNSEHFACFIDLLSRSGDLKE 658
Cdd:PLN03077 564 YVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEeKYSITPNLKHYACVVDLLGRAGKLTE 643
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  659 AYRTIKEMPFLADASVWGSLVNGCRIHQKMDIIKAIKNDLSDIVTDDTGYYTLLSNIYAEEGEWEEFRRLRSAMKSSNLK 738
Cdd:PLN03077 644 AYNFINKMPITPDPAVWGALLNACRIHRHVELGELAAQHIFELDPNSVGYYILLCNLYADAGKWDEVARVRKTMRENGLT 723
                        650       660       670
                 ....*....|....*....|....*....|
gi 18409250  739 KVPGYSAIEIDQKVFRFGAGEENRIQTDEI 768
Cdd:PLN03077 724 VDPGCSWVEVKGKVHAFLTDDESHPQIKEI 753
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
305-775 7.31e-91

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 300.25  E-value: 7.31e-91
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  305 TLYSVLSSCGLIGLIREGKSVHGFAVRRELDPNYESLSLALVeLYAECGKLSDCETVLRVVSDRNIVAWNSLISLYAHRG 384
Cdd:PLN03081 125 TYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLL-MHVKCGMLIDARRLFDEMPERNLASWGTIIGGLVDAG 203
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  385 MVIQALGLFRQMVTQRIKPDAFTLASSISACENAGLVPLGKQIHGHVIRTDV-SDEFVQNSLIDMYSKSGSVDSASTVFN 463
Cdd:PLN03081 204 NYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGSARAGQQLHCCVLKTGVvGDTFVSCALIDMYSKCGDIEDARCVFD 283
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  464 QIKHRSVVTWNSMLCGFSQNGNSVEAISLFDYMYHSYLEMNEVTFLAVIQACSSIGSLEKGKWVHHKLIISGLK-DLFTD 542
Cdd:PLN03081 284 GMPEKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPlDIVAN 363
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  543 TALIDMYAKCGDLNAAETVFRAMSSRSIVSWSSMINAYGMHGRIGSAISTFNQMVESGTKPNEVVFMNVLSACGHSGSVE 622
Cdd:PLN03081 364 TALVDLYSKWGRMEDARNVFDRMPRKNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGLSE 443
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  623 EGKYYFNLM-KSFGVSPNSEHFACFIDLLSRSGDLKEAYRTIKEMPFLADASVWGSLVNGCRIHQKMDIIKAIKNDLSDI 701
Cdd:PLN03081 444 QGWEIFQSMsENHRIKPRAMHYACMIELLGREGLLDEAYAMIRRAPFKPTVNMWAALLTACRIHKNLELGRLAAEKLYGM 523
                        410       420       430       440       450       460       470
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 18409250  702 VTDDTGYYTLLSNIYAEEGEWEEFRRLRSAMKSSNLKKVPGYSAIEIDQKVFRFGAGEENRIQTDEIYRFLGNL 775
Cdd:PLN03081 524 GPEKLNNYVVLLNLYNSSGRQAEAAKVVETLKRKGLSMHPACTWIEVKKQDHSFFSGDRLHPQSREIYQKLDEL 597
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
53-468 2.78e-38

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 152.33  E-value: 2.78e-38
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250   53 SRLVFEAFPYPDSFMYGVLIKCNVWCHLLDAAIDLYHRLVSETTQISKFVFPSVLRACAGSREhLSVGGKVHGRIIKGGV 132
Cdd:PLN03081 177 ARRLFDEMPERNLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGS-ARAGQQLHCCVLKTGV 255
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  133 DDDAVIETSLLCMYGQTGNLSDAEKVFDGMPVRDLVAWSTLVSSCLENGEVVKALRMFKCMVDDGVEPDAVTMISVVEGC 212
Cdd:PLN03081 256 VGDTFVSCALIDMYSKCGDIEDARCVFDGMPEKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIF 335
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  213 AELGCLRIARSVHGQITRKMFDLDETLCNSLLTMYSKCGDLLSSERIFEKIAKKNAVSWTAMISSYNRGEFSEKALRSFS 292
Cdd:PLN03081 336 SRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMPRKNLISWNALIAGYGNHGRGTKAVEMFE 415
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  293 EMIKSGIEPNLVTLYSVLSSCGLIGLIREGKSVhgfavrreldpnYESLSlalvelyaecgklsdceTVLRVvsDRNIVA 372
Cdd:PLN03081 416 RMIAEGVAPNHVTFLAVLSACRYSGLSEQGWEI------------FQSMS-----------------ENHRI--KPRAMH 464
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  373 WNSLISLYAHRGMVIQALGLFRQmvtQRIKPDAFTLASSISACENAGLVPLGK----QIHGhvirtdVSDEFVQN--SLI 446
Cdd:PLN03081 465 YACMIELLGREGLLDEAYAMIRR---APFKPTVNMWAALLTACRIHKNLELGRlaaeKLYG------MGPEKLNNyvVLL 535
                        410       420
                 ....*....|....*....|..
gi 18409250  447 DMYSKSGSVDSASTVFNQIKHR 468
Cdd:PLN03081 536 NLYNSSGRQAEAAKVVETLKRK 557
PLN03077 PLN03077
Protein ECB2; Provisional
12-461 5.99e-36

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 146.15  E-value: 5.99e-36
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250   12 SSLRLVSQLHAHLLVTGrLRRDPLPVTKLIESYAFMGSPDSSRLVFEAFPYPDSFMYGVLIKCNVWCHLLDAAIDLYHRL 91
Cdd:PLN03077 302 GDERLGREMHGYVVKTG-FAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETKDAVSWTAMISGYEKNGLPDKALETYALM 380
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250   92 VSETTQISKFVFPSVLRACAGSREhLSVGGKVHGRIIKGGVDDDAVIETSLLCMYGQTGNLSDAEKVFDGMPVRDLVAWS 171
Cdd:PLN03077 381 EQDNVSPDEITIASVLSACACLGD-LDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNIPEKDVISWT 459
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  172 TLVSSCLENGEVVKALRMFKCMVDDgVEPDAVTMISVVEGCAELGCLRIARSVHGQITRKMFDLDETLCNSLLTMYSKCG 251
Cdd:PLN03077 460 SIIAGLRLNNRCFEALIFFRQMLLT-LKPNSVTLIAALSACARIGALMCGKEIHAHVLRTGIGFDGFLPNALLDLYVRCG 538
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  252 DLLSSERIFeKIAKKNAVSWTAMISSYNRGEFSEKALRSFSEMIKSGIEPNLVTLYSVLSSCGLIGLIREG-KSVHGFAV 330
Cdd:PLN03077 539 RMNYAWNQF-NSHEKDVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGlEYFHSMEE 617
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  331 RRELDPNYESLSlALVELYAECGKLSDcetvlrvvsdrnivawnslislyAHRgmviqalgLFRQMvtqRIKPDAFTLAS 410
Cdd:PLN03077 618 KYSITPNLKHYA-CVVDLLGRAGKLTE-----------------------AYN--------FINKM---PITPDPAVWGA 662
                        410       420       430       440       450
                 ....*....|....*....|....*....|....*....|....*....|.
gi 18409250  411 SISACENAGLVPLGKQIHGHVIRTDVSDEFVQNSLIDMYSKSGSVDSASTV 461
Cdd:PLN03077 663 LLNACRIHRHVELGELAAQHIFELDPNSVGYYILLCNLYADAGKWDEVARV 713
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
13-213 9.88e-21

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 97.25  E-value: 9.88e-21
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250   13 SLRLVSQLHAHLLVTGrLRRDPLPVTKLIESYAFMGSPDSSRLVFEAFPYPDSFMYGVLIKCNVWCHLLDAAIDLYHRLV 92
Cdd:PLN03081 239 SARAGQQLHCCVLKTG-VVGDTFVSCALIDMYSKCGDIEDARCVFDGMPEKTTVAWNSMLAGYALHGYSEEALCLYYEMR 317
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250   93 SETTQISKFVFPSVLRACA--GSREHLSvggKVHGRIIKGGVDDDAVIETSLLCMYGQTGNLSDAEKVFDGMPVRDLVAW 170
Cdd:PLN03081 318 DSGVSIDQFTFSIMIRIFSrlALLEHAK---QAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMPRKNLISW 394
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|...
gi 18409250  171 STLVSSCLENGEVVKALRMFKCMVDDGVEPDAVTMISVVEGCA 213
Cdd:PLN03081 395 NALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACR 437
E_motif pfam20431
E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) ...
685-747 4.32e-13

E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) proteins which contain a DYW deaminase domain. The DYW domain is required for RNA editing, a process that deaminates specific cytidines to uridines. This motif, together with the E+ motif, precedes the DYW domain and, although their role is not clear, they are essential in the RNA editing reaction. The E/E+ motifs may contain two degenerate PPR motifs that could be involved in RNA or protein binding.


Pssm-ID: 466580 [Multi-domain]  Cd Length: 63  Bit Score: 64.49  E-value: 4.32e-13
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 18409250   685 HQKMDIIKAIKNDLSDIVTDDTGYYTLLSNIYAEEGEWEEFRRLRSAMKSSNLKKVPGYSAIE 747
Cdd:pfam20431   1 YSNVELAEKAANILLELEKTNDGNYTLLSNIYAYAGRWKDVERIRKLMKSSGIKKRPGCSWIE 63
PLN03218 PLN03218
maturation of RBCL 1; Provisional
361-666 6.56e-13

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 72.60  E-value: 6.56e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250   361 VLRVVSDRNIVA----WNSLISLYAHRGMVIQALGLFRQMVTQRIKPDAFTLASSISACENAGLVPLGKQIHGHVIRTDV 436
Cdd:PLN03218  459 VLRLVQEAGLKAdcklYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNV 538
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250   437 S-DEFVQNSLIDMYSKSGSVDSASTVFNQIKHRSvvtwnsmlcgfsqngnsveaislfdymyhSYLEMNEVTFLAVIQAC 515
Cdd:PLN03218  539 KpDRVVFNALISACGQSGAVDRAFDVLAEMKAET-----------------------------HPIDPDHITVGALMKAC 589
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250   516 SSIGSLEKGKWVH---HKLIISGLKDLFtdTALIDMYAKCGDLNAAETVFRAMSSRSIV----SWSSMINAYGMHGRIGS 588
Cdd:PLN03218  590 ANAGQVDRAKEVYqmiHEYNIKGTPEVY--TIAVNSCSQKGDWDFALSIYDDMKKKGVKpdevFFSALVDVAGHAGDLDK 667
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 18409250   589 AISTFNQMVESGTKPNEVVFMNVLSACGHSGSVEEGKYYFNLMKSFGVSPNSEHFACFIDLLSRSGDLKEAYRTIKEM 666
Cdd:PLN03218  668 AFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEM 745
PLN03218 PLN03218
maturation of RBCL 1; Provisional
166-419 1.80e-10

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 64.90  E-value: 1.80e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250   166 DLVAWSTLVSSCLENGEVVKALRMFKCMVDDGVEPDAVTMISVVEGCAELGCLRIARSVHGQITRKMFDLDETLCNSLLt 245
Cdd:PLN03218  471 DCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALI- 549
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250   246 mySKCGDLLSSERIFEKIAKKNA---------VSWTAMISSYNRGEFSEKALRSFSEMIKSGIEPNLVTLYSVLSSCGLI 316
Cdd:PLN03218  550 --SACGQSGAVDRAFDVLAEMKAethpidpdhITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQK 627
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250   317 GLIREGKSVHGFAVRRELDPNYESLSlALVElyaecgklsdcetvlrvvsdrniVAwnslislyAHRGMVIQALGLFRQM 396
Cdd:PLN03218  628 GDWDFALSIYDDMKKKGVKPDEVFFS-ALVD-----------------------VA--------GHAGDLDKAFEILQDA 675
                         250       260
                  ....*....|....*....|...
gi 18409250   397 VTQRIKPDAFTLASSISACENAG 419
Cdd:PLN03218  676 RKQGIKLGTVSYSSLMGACSNAK 698
PLN03218 PLN03218
maturation of RBCL 1; Provisional
54-310 1.58e-09

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 61.82  E-value: 1.58e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250    54 RLVFEAFPYPDSFMYGVLIKCNVWCHLLDAAIDLYHRLVSETTQISKFVFPSVLRACAGSREHLSVGGkVHGRIIKGGVD 133
Cdd:PLN03218  461 RLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFG-AYGIMRSKNVK 539
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250   134 DDAVIETSLLCMYGQTGNLSDAEKVFDGM-----PVR-DLVAWSTLVSSCLENGEVVKALRMFKCMVDDGVE--PDAVTM 205
Cdd:PLN03218  540 PDRVVFNALISACGQSGAVDRAFDVLAEMkaethPIDpDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKgtPEVYTI 619
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250   206 isVVEGCAELGCLRIARSVHGQITRKMFDLDETLCNSLLTMYSKCGDLlssERIFE--KIAKK----------------- 266
Cdd:PLN03218  620 --AVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDL---DKAFEilQDARKqgiklgtvsysslmgac 694
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 18409250   267 -NAVSW----------------------TAMISSYNRGEFSEKALRSFSEMIKSGIEPNLVTlYSVL 310
Cdd:PLN03218  695 sNAKNWkkalelyediksiklrptvstmNALITALCEGNQLPKALEVLSEMKRLGLCPNTIT-YSIL 760
PLN03077 PLN03077
Protein ECB2; Provisional
474-681 3.39e-08

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 57.17  E-value: 3.39e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  474 NSMLCGFSQNGNSVEAISLFDYMYHSYLEMNEVTFLAVIQACSSIGSLEKGKWVhHKLIISGLKDLFTD--TALIDMYAK 551
Cdd:PLN03077  55 NSQLRALCSHGQLEQALKLLESMQELRVPVDEDAYVALFRLCEWKRAVEEGSRV-CSRALSSHPSLGVRlgNAMLSMFVR 133
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250  552 CGDLNAAETVFRAMSSRSIVSWSSMINAYGMHGRIGSAISTFNQMVESGTKPNEVVFMNVLSACGHSGSVEEGKYYFNLM 631
Cdd:PLN03077 134 FGELVHAWYVFGKMPERDLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGREVHAHV 213
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|
gi 18409250  632 KSFGVSPNSEHFACFIDLLSRSGDLKEAYRTIKEMPfLADASVWGSLVNG 681
Cdd:PLN03077 214 VRFGFELDVDVVNALITMYVKCGDVVSARLVFDRMP-RRDCISWNAMISG 262
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
468-515 9.36e-08

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 48.90  E-value: 9.36e-08
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 18409250   468 RSVVTWNSMLCGFSQNGNSVEAISLFDYMYHSYLEMNEVTFLAVIQAC 515
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
368-407 3.03e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 47.36  E-value: 3.03e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|
gi 18409250   368 RNIVAWNSLISLYAHRGMVIQALGLFRQMVTQRIKPDAFT 407
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYT 40
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
266-310 3.99e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 47.36  E-value: 3.99e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 18409250   266 KNAVSWTAMISSY-NRGEFsEKALRSFSEMIKSGIEPNLVTlYSVL 310
Cdd:pfam13041   1 PDVVTYNTLINGYcKKGKV-EEAFKLFNEMKKRGVKPNVYT-YTIL 44
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
165-213 1.32e-06

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 45.82  E-value: 1.32e-06
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 18409250   165 RDLVAWSTLVSSCLENGEVVKALRMFKCMVDDGVEPDAVTMISVVEGCA 213
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PLN03218 PLN03218
maturation of RBCL 1; Provisional
369-652 3.14e-05

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 47.56  E-value: 3.14e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250   369 NIVAWNSLISLYAHRGMVIQALGLFRQMVTQRIKPDAFTLASSISACENAGLVPLGKQI------HGHVIRtdvSDEFVQ 442
Cdd:PLN03218  506 NVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVlaemkaETHPID---PDHITV 582
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250   443 NSLIDMYSKSGSVDSASTVFNQIKHRSV----VTWNSMLCGFSQNGNSVEAISLFDYMYHSYLEMNEVTFLAVIQACSSI 518
Cdd:PLN03218  583 GALMKACANAGQVDRAKEVYQMIHEYNIkgtpEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHA 662
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250   519 GSLEKGKWVHHKLIISGLK-DLFTDTALIDMYAKCGDLNAAETVFRAMSSRSIVSWSSMINAYgMHG-----RIGSAIST 592
Cdd:PLN03218  663 GDLDKAFEILQDARKQGIKlGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNAL-ITAlcegnQLPKALEV 741
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250   593 FNQMVESGTKPNEVVFMNVLSACGHSGSVEEGKYYFNLMKSFGVSPNSEHFACFIDLLSR 652
Cdd:PLN03218  742 LSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCITGLCLR 801
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
371-405 1.76e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 39.36  E-value: 1.76e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 18409250   371 VAWNSLISLYAHRGMVIQALGLFRQMVTQRIKPDA 405
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
576-617 2.10e-04

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 39.65  E-value: 2.10e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|..
gi 18409250   576 MINAYGMHGRIGSAISTFNQMVESGTKPNEVVFMNVLSACGH 617
Cdd:pfam13041   9 LINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03218 PLN03218
maturation of RBCL 1; Provisional
576-682 3.29e-04

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 44.48  E-value: 3.29e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18409250   576 MINAYGMHGRIGSAISTFNQMVESGTKPNEVVFMNVLSACGHSGSVEEGKYYFNLMKSFGVSPNSEHFACFIDLLSRSGD 655
Cdd:PLN03218  478 LISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGA 557
                          90       100       110
                  ....*....|....*....|....*....|..
gi 18409250   656 LKEAYRTIKEM-----PFLADASVWGSLVNGC 682
Cdd:PLN03218  558 VDRAFDVLAEMkaethPIDPDHITVGALMKAC 589
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
471-496 4.86e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.83  E-value: 4.86e-04
                          10        20
                  ....*....|....*....|....*.
gi 18409250   471 VTWNSMLCGFSQNGNSVEAISLFDYM 496
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEM 26
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
371-401 7.71e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.44  E-value: 7.71e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 18409250   371 VAWNSLISLYAHRGMVIQALGLFRQMVTQRI 401
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
269-299 1.07e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.06  E-value: 1.07e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 18409250   269 VSWTAMISSYNRGEFSEKALRSFSEMIKSGI 299
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
168-202 1.21e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.05  E-value: 1.21e-03
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 18409250   168 VAWSTLVSSCLENGEVVKALRMFKCMVDDGVEPDA 202
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
443-480 1.60e-03

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 36.96  E-value: 1.60e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|..
gi 18409250   443 NSLIDMYSKSGSVDSASTVFNQIKHR----SVVTWNSMLCGF 480
Cdd:pfam13041   7 NTLINGYCKKGKVEEAFKLFNEMKKRgvkpNVYTYTILINGL 48
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
576-605 1.82e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 36.28  E-value: 1.82e-03
                          10        20        30
                  ....*....|....*....|....*....|
gi 18409250   576 MINAYGMHGRIGSAISTFNQMVESGTKPNE 605
Cdd:TIGR00756   6 LIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
168-198 1.90e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 36.29  E-value: 1.90e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 18409250   168 VAWSTLVSSCLENGEVVKALRMFKCMVDDGV 198
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
471-496 2.95e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 35.89  E-value: 2.95e-03
                          10        20
                  ....*....|....*....|....*.
gi 18409250   471 VTWNSMLCGFSQNGNSVEAISLFDYM 496
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEM 26
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
269-302 3.67e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 35.51  E-value: 3.67e-03
                          10        20        30
                  ....*....|....*....|....*....|....
gi 18409250   269 VSWTAMISSYNRGEFSEKALRSFSEMIKSGIEPN 302
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
470-496 4.40e-03

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 35.40  E-value: 4.40e-03
                          10        20
                  ....*....|....*....|....*..
gi 18409250   470 VVTWNSMLCGFSQNGNSVEAISLFDYM 496
Cdd:pfam12854   7 VVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
258-314 7.51e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 35.41  E-value: 7.51e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 18409250   258 RIFEKIAKK----NAVSWTAMISSYNRGEFSEKALRSFSEMIKSGIEPNLVTLYSVLSSCG 314
Cdd:pfam13812   1 SILREMVRDgiqlNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIG 61
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH