NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1958706156|ref|XP_038951678|]
View 

disco-interacting protein 2 homolog C isoform X4 [Rattus norvegicus]

Protein Classification

disco-interacting protein 2( domain architecture ID 10534274)

disco-interacting protein 2 (DIP2) such as human DIP2 homolog A that catalyzes the de novo synthesis of acetyl-CoA in vitro, and binds to follistatin-related protein FSTL1 and may act as a cell surface receptor for FSTL1

EC:  6.2.1.-
Gene Ontology:  GO:0005524|GO:0120225|GO:0016405

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Dip2 cd05905
Disco-interacting protein 2 (Dip2); Dip2 proteins show sequence similarity to other members of ...
1033-1607 0e+00

Disco-interacting protein 2 (Dip2); Dip2 proteins show sequence similarity to other members of the adenylate forming enzyme family, including insect luciferase, acetyl CoA ligases and the adenylation domain of nonribosomal peptide synthetases (NRPS). However, its function may have diverged from other members of the superfamily. In mouse embryo, Dip2 homolog A plays an important role in the development of both vertebrate and invertebrate nervous systems. Dip2A appears to regulate cell growth and the arrangement of cells in organs. Biochemically, Dip2A functions as a receptor of FSTL1, an extracellular glycoprotein, and may play a role as a cardiovascular protective agent.


:

Pssm-ID: 341231 [Multi-domain]  Cd Length: 571  Bit Score: 786.16  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1033 LYTLLNCRGTIANSLTCVQLHKRAEKIAVMLMERGHLQDGDHVALVYPPGIDLIAAFYGCLYAGCVPITVRPPH-PQNIA 1111
Cdd:cd05905      1 AYTLLDSKGKEATTLTWGKLLSRAEKIAAVLQKKVGLKPGDRVALMYPDPLDFVAAFYGCLYAGVVPIPIEPPDiSQQLG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1112 TTLPTVKMivEVSRSACLMTTQLICKLLRSREAAAAVDVRTWPLILDTDDLPKKRPA-----QIYKPSNPDTLAYLDFSV 1186
Cdd:cd05905     81 FLLGTCKV--RVALTVEACLKGLPKKLLKSKTAAEIAKKKGWPKILDFVKIPKSKRSklkkwGPHPPTRDGDTAYIEYSF 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1187 STTGMLAGVKMSHAATSAFCRSIKLQCELYPSREVAICLDPYCGLGFVLWCLCSVYSGHQSILIPPSELETNPALWLLAV 1266
Cdd:cd05905    159 SSDGSLSGVAVSHSSLLAHCRALKEACELYESRPLVTVLDFKSGLGLWHGCLLSVYSGHHTILIPPELMKTNPLLWLQTL 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1267 SQYKVRDTFCSYSVMELCTKGLGSQTESLKARGLDLSRVRTCVVVAEERPRIALTQSFSKLFKDLGLHPRAVSTSFGCRV 1346
Cdd:cd05905    239 SQYKVRDAYVKLRTLHWCLKDLSSTLASLKNRDVNLSSLRMCMVPCENRPRISSCDSFLKLFQTLGLSPRAVSTEFGTRV 318
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1347 NLAICLqphrlwtlaeQGTSGPDPTTVYVDMRALRHDRVRLVERGSPHSLPLMESGKILPGVRIIIANPETKGPLGDSHL 1426
Cdd:cd05905    319 NPFICW----------QGTSGPEPSRVYLDMRALRHGVVRLDERDKPNSLPLQDSGKVLPGAQVAIVNPETKGLCKDGEI 388
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1427 GEIWVHSAHNASGYFTIYGDESLQSDHFNS-RLSFGDTQTIWARTGYLGFLRRTELTDANGERHDALYVVGALDEAMELR 1505
Cdd:cd05905    389 GEIWVNSPANASGYFLLDGETNDTFKVFPStRLSTGITNNSYARTGLLGFLRPTKCTDLNVEEHDLLFVVGSIDETLEVR 468
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1506 GMRYHPIDIETSVIRAHKSVTECAVFTWTNLLVVVVELD-GSEQEALDLVPLVTNVVLEEHYLIVGVVVVVDIGVIPINS 1584
Cdd:cd05905    469 GLRHHPSDIEATVMRVHPYRGRCAVFSITGLVVVVAEQPpGSEEEALDLVPLVLNAILEEHQVIVDCVALVPPGSLPKNP 548
                          570       580
                   ....*....|....*....|...
gi 1958706156 1585 RGEKQRMHLRDGFLADQLDPIYV 1607
Cdd:cd05905    549 LGEKQRMEIRQAFLAGKLHPIYV 571
Dip2 cd05905
Disco-interacting protein 2 (Dip2); Dip2 proteins show sequence similarity to other members of ...
377-953 0e+00

Disco-interacting protein 2 (Dip2); Dip2 proteins show sequence similarity to other members of the adenylate forming enzyme family, including insect luciferase, acetyl CoA ligases and the adenylation domain of nonribosomal peptide synthetases (NRPS). However, its function may have diverged from other members of the superfamily. In mouse embryo, Dip2 homolog A plays an important role in the development of both vertebrate and invertebrate nervous systems. Dip2A appears to regulate cell growth and the arrangement of cells in organs. Biochemically, Dip2A functions as a receptor of FSTL1, an extracellular glycoprotein, and may play a role as a cardiovascular protective agent.


:

Pssm-ID: 341231 [Multi-domain]  Cd Length: 571  Bit Score: 737.62  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  377 CLTTMDTNGKPLYILTYGKLWTRSMKVAYNILHKLGTKqepmvrPGDRVALVFPnnDPAAFMVAFYGCLLAEVVPVPIEV 456
Cdd:cd05905      1 AYTLLDSKGKEATTLTWGKLLSRAEKIAAVLQKKVGLK------PGDRVALMYP--DPLDFVAAFYGCLYAGVVPIPIEP 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  457 PLtrkdaGSQQIGFLLGSCGVTVALTSDACHKGLPKS-----PTGEIPQFKGWPKLLWFVTESKHLSKPPRDWFPHIKDA 531
Cdd:cd05905     73 PD-----ISQQLGFLLGTCKVRVALTVEACLKGLPKKllkskTAAEIAKKKGWPKILDFVKIPKSKRSKLKKWGPHPPTR 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  532 NNDTAYIEYKTCKDGSVLGVTVTRIALLTHCQALTQACGYTEAETIVNVLDFKKDVGLWHGILTSVMNMMHVISIPYALM 611
Cdd:cd05905    148 DGDTAYIEYSFSSDGSLSGVAVSHSSLLAHCRALKEACELYESRPLVTVLDFKSGLGLWHGCLLSVYSGHHTILIPPELM 227
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  612 KVNPLSWIQKVCQYKAKVACVKSRDMHWAL------VAHRDQRDVNLSSLRMLIVADGaNPWSISSCDAFLNVFQSKGLR 685
Cdd:cd05905    228 KTNPLLWLQTLSQYKVRDAYVKLRTLHWCLkdlsstLASLKNRDVNLSSLRMCMVPCE-NRPRISSCDSFLKLFQTLGLS 306
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  686 QEvicpcASSPEALTVAIRRPTD--DSNQPPGRGVLSMHGLTYGVIRVDSEEKLSVLTVQDVGLVMPGAIMCSVKPDGVP 763
Cdd:cd05905    307 PR-----AVSTEFGTRVNPFICWqgTSGPEPSRVYLDMRALRHGVVRLDERDKPNSLPLQDSGKVLPGAQVAIVNPETKG 381
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  764 qLCRTDEIGELCVCAVATGTSYYGLSGMTKNTFEVFPMTSSGAPISEYPFIRTGLLGFVGPG----------GLVFVVGK 833
Cdd:cd05905    382 -LCKDGEIGEIWVNSPANASGYFLLDGETNDTFKVFPSTRLSTGITNNSYARTGLLGFLRPTkctdlnveehDLLFVVGS 460
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  834 MDGLMVVSGRRHNADDIVATALAVEPmkfvYRGRIAVFSVTvlhdERIVIVAEQRPDStEEDSFQWMSRVLQAIDSIHQV 913
Cdd:cd05905    461 IDETLEVRGLRHHPSDIEATVMRVHP----YRGRCAVFSIT----GLVVVVAEQPPGS-EEEALDLVPLVLNAILEEHQV 531
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|
gi 1958706156  914 GVYCLALVPANTLPKTPLGGIHLSETKQLFLEGSLHPCNV 953
Cdd:cd05905    532 IVDCVALVPPGSLPKNPLGEKQRMEIRQAFLAGKLHPIYV 571
DMAP_binding pfam06464
DMAP1-binding Domain; This domain binds DMAP1, a transcriptional co-repressor.
14-159 1.17e-21

DMAP1-binding Domain; This domain binds DMAP1, a transcriptional co-repressor.


:

Pssm-ID: 368923 [Multi-domain]  Cd Length: 104  Bit Score: 91.33  E-value: 1.17e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156   14 GDITQKGYEKKRSKLIGAYLPQPptangaavvrcrlqpgegvprrpfrsahigvcdireaaareraasaagnrplfyfrf 93
Cdd:pfam06464   23 GDITEKGYEKKKLKLLRKFLLHP--------------------------------------------------------- 45
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1958706156   94 gvdqalpqerRAPVTPSSASRYHR-RRSSGSRDERYRSDVHTEAVQAALAKHKERKMAVPMPSKRRS 159
Cdd:pfam06464   46 ----------ETPTKLSAEAQNQLaSLETKLRDEELSEEVYLEKVKALLAKELERENGLNAPTKEQS 102
 
Name Accession Description Interval E-value
Dip2 cd05905
Disco-interacting protein 2 (Dip2); Dip2 proteins show sequence similarity to other members of ...
1033-1607 0e+00

Disco-interacting protein 2 (Dip2); Dip2 proteins show sequence similarity to other members of the adenylate forming enzyme family, including insect luciferase, acetyl CoA ligases and the adenylation domain of nonribosomal peptide synthetases (NRPS). However, its function may have diverged from other members of the superfamily. In mouse embryo, Dip2 homolog A plays an important role in the development of both vertebrate and invertebrate nervous systems. Dip2A appears to regulate cell growth and the arrangement of cells in organs. Biochemically, Dip2A functions as a receptor of FSTL1, an extracellular glycoprotein, and may play a role as a cardiovascular protective agent.


Pssm-ID: 341231 [Multi-domain]  Cd Length: 571  Bit Score: 786.16  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1033 LYTLLNCRGTIANSLTCVQLHKRAEKIAVMLMERGHLQDGDHVALVYPPGIDLIAAFYGCLYAGCVPITVRPPH-PQNIA 1111
Cdd:cd05905      1 AYTLLDSKGKEATTLTWGKLLSRAEKIAAVLQKKVGLKPGDRVALMYPDPLDFVAAFYGCLYAGVVPIPIEPPDiSQQLG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1112 TTLPTVKMivEVSRSACLMTTQLICKLLRSREAAAAVDVRTWPLILDTDDLPKKRPA-----QIYKPSNPDTLAYLDFSV 1186
Cdd:cd05905     81 FLLGTCKV--RVALTVEACLKGLPKKLLKSKTAAEIAKKKGWPKILDFVKIPKSKRSklkkwGPHPPTRDGDTAYIEYSF 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1187 STTGMLAGVKMSHAATSAFCRSIKLQCELYPSREVAICLDPYCGLGFVLWCLCSVYSGHQSILIPPSELETNPALWLLAV 1266
Cdd:cd05905    159 SSDGSLSGVAVSHSSLLAHCRALKEACELYESRPLVTVLDFKSGLGLWHGCLLSVYSGHHTILIPPELMKTNPLLWLQTL 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1267 SQYKVRDTFCSYSVMELCTKGLGSQTESLKARGLDLSRVRTCVVVAEERPRIALTQSFSKLFKDLGLHPRAVSTSFGCRV 1346
Cdd:cd05905    239 SQYKVRDAYVKLRTLHWCLKDLSSTLASLKNRDVNLSSLRMCMVPCENRPRISSCDSFLKLFQTLGLSPRAVSTEFGTRV 318
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1347 NLAICLqphrlwtlaeQGTSGPDPTTVYVDMRALRHDRVRLVERGSPHSLPLMESGKILPGVRIIIANPETKGPLGDSHL 1426
Cdd:cd05905    319 NPFICW----------QGTSGPEPSRVYLDMRALRHGVVRLDERDKPNSLPLQDSGKVLPGAQVAIVNPETKGLCKDGEI 388
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1427 GEIWVHSAHNASGYFTIYGDESLQSDHFNS-RLSFGDTQTIWARTGYLGFLRRTELTDANGERHDALYVVGALDEAMELR 1505
Cdd:cd05905    389 GEIWVNSPANASGYFLLDGETNDTFKVFPStRLSTGITNNSYARTGLLGFLRPTKCTDLNVEEHDLLFVVGSIDETLEVR 468
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1506 GMRYHPIDIETSVIRAHKSVTECAVFTWTNLLVVVVELD-GSEQEALDLVPLVTNVVLEEHYLIVGVVVVVDIGVIPINS 1584
Cdd:cd05905    469 GLRHHPSDIEATVMRVHPYRGRCAVFSITGLVVVVAEQPpGSEEEALDLVPLVLNAILEEHQVIVDCVALVPPGSLPKNP 548
                          570       580
                   ....*....|....*....|...
gi 1958706156 1585 RGEKQRMHLRDGFLADQLDPIYV 1607
Cdd:cd05905    549 LGEKQRMEIRQAFLAGKLHPIYV 571
Dip2 cd05905
Disco-interacting protein 2 (Dip2); Dip2 proteins show sequence similarity to other members of ...
377-953 0e+00

Disco-interacting protein 2 (Dip2); Dip2 proteins show sequence similarity to other members of the adenylate forming enzyme family, including insect luciferase, acetyl CoA ligases and the adenylation domain of nonribosomal peptide synthetases (NRPS). However, its function may have diverged from other members of the superfamily. In mouse embryo, Dip2 homolog A plays an important role in the development of both vertebrate and invertebrate nervous systems. Dip2A appears to regulate cell growth and the arrangement of cells in organs. Biochemically, Dip2A functions as a receptor of FSTL1, an extracellular glycoprotein, and may play a role as a cardiovascular protective agent.


Pssm-ID: 341231 [Multi-domain]  Cd Length: 571  Bit Score: 737.62  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  377 CLTTMDTNGKPLYILTYGKLWTRSMKVAYNILHKLGTKqepmvrPGDRVALVFPnnDPAAFMVAFYGCLLAEVVPVPIEV 456
Cdd:cd05905      1 AYTLLDSKGKEATTLTWGKLLSRAEKIAAVLQKKVGLK------PGDRVALMYP--DPLDFVAAFYGCLYAGVVPIPIEP 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  457 PLtrkdaGSQQIGFLLGSCGVTVALTSDACHKGLPKS-----PTGEIPQFKGWPKLLWFVTESKHLSKPPRDWFPHIKDA 531
Cdd:cd05905     73 PD-----ISQQLGFLLGTCKVRVALTVEACLKGLPKKllkskTAAEIAKKKGWPKILDFVKIPKSKRSKLKKWGPHPPTR 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  532 NNDTAYIEYKTCKDGSVLGVTVTRIALLTHCQALTQACGYTEAETIVNVLDFKKDVGLWHGILTSVMNMMHVISIPYALM 611
Cdd:cd05905    148 DGDTAYIEYSFSSDGSLSGVAVSHSSLLAHCRALKEACELYESRPLVTVLDFKSGLGLWHGCLLSVYSGHHTILIPPELM 227
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  612 KVNPLSWIQKVCQYKAKVACVKSRDMHWAL------VAHRDQRDVNLSSLRMLIVADGaNPWSISSCDAFLNVFQSKGLR 685
Cdd:cd05905    228 KTNPLLWLQTLSQYKVRDAYVKLRTLHWCLkdlsstLASLKNRDVNLSSLRMCMVPCE-NRPRISSCDSFLKLFQTLGLS 306
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  686 QEvicpcASSPEALTVAIRRPTD--DSNQPPGRGVLSMHGLTYGVIRVDSEEKLSVLTVQDVGLVMPGAIMCSVKPDGVP 763
Cdd:cd05905    307 PR-----AVSTEFGTRVNPFICWqgTSGPEPSRVYLDMRALRHGVVRLDERDKPNSLPLQDSGKVLPGAQVAIVNPETKG 381
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  764 qLCRTDEIGELCVCAVATGTSYYGLSGMTKNTFEVFPMTSSGAPISEYPFIRTGLLGFVGPG----------GLVFVVGK 833
Cdd:cd05905    382 -LCKDGEIGEIWVNSPANASGYFLLDGETNDTFKVFPSTRLSTGITNNSYARTGLLGFLRPTkctdlnveehDLLFVVGS 460
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  834 MDGLMVVSGRRHNADDIVATALAVEPmkfvYRGRIAVFSVTvlhdERIVIVAEQRPDStEEDSFQWMSRVLQAIDSIHQV 913
Cdd:cd05905    461 IDETLEVRGLRHHPSDIEATVMRVHP----YRGRCAVFSIT----GLVVVVAEQPPGS-EEEALDLVPLVLNAILEEHQV 531
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|
gi 1958706156  914 GVYCLALVPANTLPKTPLGGIHLSETKQLFLEGSLHPCNV 953
Cdd:cd05905    532 IVDCVALVPPGSLPKNPLGEKQRMEIRQAFLAGKLHPIYV 571
AMP-binding pfam00501
AMP-binding enzyme;
1021-1505 4.80e-57

AMP-binding enzyme;


Pssm-ID: 459834 [Multi-domain]  Cd Length: 417  Bit Score: 204.08  E-value: 4.80e-57
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1021 LQWRAQTTPDHLLYTllNCRGTianSLTCVQLHKRAEKIAVMLMERGhLQDGDHVALVYPPGIDLIAAFYGCLYAGCVPI 1100
Cdd:pfam00501    1 LERQAARTPDKTALE--VGEGR---RLTYRELDERANRLAAGLRALG-VGKGDRVAILLPNSPEWVVAFLACLKAGAVYV 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1101 TVRPphpqniATTLPTVKMIVEVSRSACLMTTQlICKLLRSREAAAAVDVRTWPLILDTDDLPKKRPAQIYKPSN----- 1175
Cdd:pfam00501   75 PLNP------RLPAEELAYILEDSGAKVLITDD-ALKLEELLEALGKLEVVKLVLVLDRDPVLKEEPLPEEAKPAdvppp 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1176 ------PDTLAYLDFSVSTTGMLAGVKMSHAATSAFCRSIKLQCE----LYPSREVAICLDPYCGLGFVLWCLCSVYSGH 1245
Cdd:pfam00501  148 pppppdPDDLAYIIYTSGTTGKPKGVMLTHRNLVANVLSIKRVRPrgfgLGPDDRVLSTLPLFHDFGLSLGLLGPLLAGA 227
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1246 QSILIPPSELeTNPALWLLAVSQYKVRDTFCSYSVMELCtkglgsqTESLKARGLDLSRVRTCVVVAeERPRIALTQSFS 1325
Cdd:pfam00501  228 TVVLPPGFPA-LDPAALLELIERYKVTVLYGVPTLLNML-------LEAGAPKRALLSSLRLVLSGG-APLPPELARRFR 298
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1326 KLFkdlglhPRAVSTSFGcrvnlaiclqphrlwtLAEqgTSGPdptTVYVDMRALRHDRVRLVergsphslplmesGKIL 1405
Cdd:pfam00501  299 ELF------GGALVNGYG----------------LTE--TTGV---VTTPLPLDEDLRSLGSV-------------GRPL 338
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1406 PGVRIIIANPETKGPLGDSHLGEIWVHSAHNASGYftiYGDESLQSDHFnsrlsfgdTQTIWARTGYLGFLrrteltDAN 1485
Cdd:pfam00501  339 PGTEVKIVDDETGEPVPPGEPGELCVRGPGVMKGY---LNDPELTAEAF--------DEDGWYRTGDLGRR------DED 401
                          490       500
                   ....*....|....*....|
gi 1958706156 1486 GErhdaLYVVGALDEAMELR 1505
Cdd:pfam00501  402 GY----LEIVGRKKDQIKLG 417
PRK05691 PRK05691
peptide synthase; Validated
1017-1547 4.39e-35

peptide synthase; Validated


Pssm-ID: 235564 [Multi-domain]  Cd Length: 4334  Bit Score: 147.62  E-value: 4.39e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1017 LSEVLQWRAQTTPDHLLYTLLNCRGTIANSLTCVQLHKRAEKIAVMLMERGHLqdGDHVALVYPPGIDLIAAFYGCLYAG 1096
Cdd:PRK05691    11 LVQALQRRAAQTPDRLALRFLADDPGEGVVLSYRDLDLRARTIAAALQARASF--GDRAVLLFPSGPDYVAAFFGCLYAG 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1097 CV------PITVRPPHPQNIATtlptvkmIVEVSRSACLMTTQLICKLLRSREAAAAVDVrtwPLILDTDDLPKKRPAQI 1170
Cdd:PRK05691    89 VIavpaypPESARRHHQERLLS-------IIADAEPRLLLTVADLRDSLLQMEELAAANA---PELLCVDTLDPALAEAW 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1171 YKPS-NPDTLAYLDFSVSTTGMLAGVKMSHAATSAfcrsiklqCELYPSREVAICLDP----------YCGLGFVLWCLC 1239
Cdd:PRK05691   159 QEPAlQPDDIAFLQYTSGSTALPKGVQVSHGNLVA--------NEQLIRHGFGIDLNPddvivswlplYHDMGLIGGLLQ 230
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1240 SVYSGHQSILIPPSELETNPALWLLAVSQYKVRDTFCSYSVMELCTKGLGsqtESLKARgLDLSRVRtcVVVAEERP-RI 1318
Cdd:PRK05691   231 PIFSGVPCVLMSPAYFLERPLRWLEAISEYGGTISGGPDFAYRLCSERVS---ESALER-LDLSRWR--VAYSGSEPiRQ 304
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1319 ALTQSFSKLFKDLGLHPRAVSTSFGcrvnlaiclqphrlwtLAEQ------GTSGPDPTTVYVDMRALRHDRVRLVErGS 1392
Cdd:PRK05691   305 DSLERFAEKFAACGFDPDSFFASYG----------------LAEAtlfvsgGRRGQGIPALELDAEALARNRAEPGT-GS 367
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1393 phslPLMESGKILPGVRIIIANPETKGPLGDSHLGEIWVHSAHNASGYftiYGDESLQSDHFNSRlsfgDTQTiWARTGY 1472
Cdd:PRK05691   368 ----VLMSCGRSQPGHAVLIVDPQSLEVLGDNRVGEIWASGPSIAHGY---WRNPEASAKTFVEH----DGRT-WLRTGD 435
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1958706156 1473 LGFLRRTEltdangerhdaLYVVGALDEAMELRGMRYHPIDIETSVIRahksvtECAVFTWTNLLVVVVELDGSE 1547
Cdd:PRK05691   436 LGFLRDGE-----------LFVTGRLKDMLIVRGHNLYPQDIEKTVER------EVEVVRKGRVAAFAVNHQGEE 493
MenE/FadK COG0318
O-succinylbenzoic acid-CoA ligase MenE or related acyl-CoA synthetase (AMP-forming) [Lipid ...
1017-1554 2.39e-31

O-succinylbenzoic acid-CoA ligase MenE or related acyl-CoA synthetase (AMP-forming) [Lipid transport and metabolism]; O-succinylbenzoic acid-CoA ligase MenE or related acyl-CoA synthetase (AMP-forming) is part of the Pathway/BioSystem: Menaquinone biosynthesis


Pssm-ID: 440087 [Multi-domain]  Cd Length: 452  Bit Score: 129.55  E-value: 2.39e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1017 LSEVLQWRAQTTPDHllyTLLNCRGTianSLTCVQLHKRAEKIAVMLMERGhLQDGDHVALVYPPGIDLIAAFYGCLYAG 1096
Cdd:COG0318      1 LADLLRRAAARHPDR---PALVFGGR---RLTYAELDARARRLAAALRALG-VGPGDRVALLLPNSPEFVVAFLAALRAG 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1097 CVPITVrpphpqNIATTLPTVKMIVEVSRSACLMTtqlickllrsreaaaavdvrtwplildtddlpkkrpaqiykpsnp 1176
Cdd:COG0318     74 AVVVPL------NPRLTAEELAYILEDSGARALVT--------------------------------------------- 102
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1177 dtlAYLDFSvS-TTGMLAGVKMSHAATSAFCRSIKLQCELYPSREVAICLDPYCGLGFVLWCLCSVYSGHQSILIPpsel 1255
Cdd:COG0318    103 ---ALILYT-SgTTGRPKGVMLTHRNLLANAAAIAAALGLTPGDVVLVALPLFHVFGLTVGLLAPLLAGATLVLLP---- 174
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1256 ETNPALWLLAVSQYKVrdTFCSYS---VMELCtkglgsqtESLKARGLDLSRVRTCVVVAEerpriALTQSFSKLFKDLg 1332
Cdd:COG0318    175 RFDPERVLELIERERV--TVLFGVptmLARLL--------RHPEFARYDLSSLRLVVSGGA-----PLPPELLERFEER- 238
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1333 lhpravstsFGCRVnlaiclqpHRLWTLAEqgtSGPDPTTVYVDMRALRHDRVrlvergsphslplmesGKILPGVRIII 1412
Cdd:COG0318    239 ---------FGVRI--------VEGYGLTE---TSPVVTVNPEDPGERRPGSV----------------GRPLPGVEVRI 282
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1413 ANPETKgPLGDSHLGEIWVHSAHNASGYftiYGDESLqsdhfnSRLSFGDTqtiWARTGYLGFLrrteltDANGErhdaL 1492
Cdd:COG0318    283 VDEDGR-ELPPGEVGEIVVRGPNVMKGY---WNDPEA------TAEAFRDG---WLRTGDLGRL------DEDGY----L 339
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958706156 1493 YVVGALDEAMELRGMRYHPIDIEtSVIRAHKSVTECAVF-----TWTNLLVVVVEL-DGSEQEALDLV 1554
Cdd:COG0318    340 YIVGRKKDMIISGGENVYPAEVE-EVLAAHPGVAEAAVVgvpdeKWGERVVAFVVLrPGAELDAEELR 406
MenE/FadK COG0318
O-succinylbenzoic acid-CoA ligase MenE or related acyl-CoA synthetase (AMP-forming) [Lipid ...
361-948 2.31e-29

O-succinylbenzoic acid-CoA ligase MenE or related acyl-CoA synthetase (AMP-forming) [Lipid transport and metabolism]; O-succinylbenzoic acid-CoA ligase MenE or related acyl-CoA synthetase (AMP-forming) is part of the Pathway/BioSystem: Menaquinone biosynthesis


Pssm-ID: 440087 [Multi-domain]  Cd Length: 452  Bit Score: 123.38  E-value: 2.31e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  361 LEAALQRWGTISPKAPCLTTMDTNgkplyiLTYGKLWTRSMKVAyNILHKLGtkqepmVRPGDRVALVFPNNdpAAFMVA 440
Cdd:COG0318      1 LADLLRRAAARHPDRPALVFGGRR------LTYAELDARARRLA-AALRALG------VGPGDRVALLLPNS--PEFVVA 65
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  441 FYGCLLAEVVPVPIEVPLTRKdagsqQIGFLLGSCGVTVALTSDACH----KGLPKsptgeipqfkgwpkllwfvteskh 516
Cdd:COG0318     66 FLAALRAGAVVVPLNPRLTAE-----ELAYILEDSGARALVTALILYtsgtTGRPK------------------------ 116
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  517 lskpprdwfphikdanndtayieyktckdgsvlGVTVTRIALLTHCQALTQACGYTEAETIVNVLDFKKDVGLWHGILTS 596
Cdd:COG0318    117 ---------------------------------GVMLTHRNLLANAAAIAAALGLTPGDVVLVALPLFHVFGLTVGLLAP 163
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  597 VMNMMHVISIPyalmKVNPLSWIQKVCQYKA-KVACVKSrdMHWALVAHRDQRDVNLSSLRMLIVadGANPWSISSCDAF 675
Cdd:COG0318    164 LLAGATLVLLP----RFDPERVLELIERERVtVLFGVPT--MLARLLRHPEFARYDLSSLRLVVS--GGAPLPPELLERF 235
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  676 LNVFQSkglrqeVICPC-ASSpEALTVAIRRPTDDSNQPPGRgvlsmhgltygvirvdseeklsvltvqdVGLVMPGAIM 754
Cdd:COG0318    236 EERFGV------RIVEGyGLT-ETSPVVTVNPEDPGERRPGS----------------------------VGRPLPGVEV 280
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  755 CSVKPDGVPqlCRTDEIGELCVC--AVATGtsYYGLSGMTKNTFEVfpmtssgapiseyPFIRTGLLGFVGPGGLVFVVG 832
Cdd:COG0318    281 RIVDEDGRE--LPPGEVGEIVVRgpNVMKG--YWNDPEATAEAFRD-------------GWLRTGDLGRLDEDGYLYIVG 343
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  833 KMDGLMVVSGRRHNADDIVATALAVEPMKfvyrgRIAVFSVTV-LHDERIV--IVAEQRPDSTEEDSFQWMSRVL---QA 906
Cdd:COG0318    344 RKKDMIISGGENVYPAEVEEVLAAHPGVA-----EAAVVGVPDeKWGERVVafVVLRPGAELDAEELRAFLRERLaryKV 418
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|..
gi 1958706156  907 IDSIHQVGvyclalvpanTLPKTPLGGIHLSETKQLFLEGSL 948
Cdd:COG0318    419 PRRVEFVD----------ELPRTASGKIDRRALRERYAAGAL 450
PRK09192 PRK09192
fatty acyl-AMP ligase;
391-950 9.96e-29

fatty acyl-AMP ligase;


Pssm-ID: 236403 [Multi-domain]  Cd Length: 579  Bit Score: 123.58  E-value: 9.96e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  391 LTYGKLWTRSMKVAYNILhKLGtkqepmVRPGDRVALVfPNNDPAaFMVAFYGCLLAEVVPVPIEVP--LTRKDAGSQQI 468
Cdd:PRK09192    50 LPYQTLRARAEAGARRLL-ALG------LKPGDRVALI-AETDGD-FVEAFFACQYAGLVPVPLPLPmgFGGRESYIAQL 120
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  469 GFLLGSCGVTVALTSDACHKGLPKSPTGEIPQFKGWPKLLwfvteskHLSKPPRDWFPHIKDanNDTAYIEYKTckdGSV 548
Cdd:PRK09192   121 RGMLASAQPAAIITPDELLPWVNEATHGNPLLHVLSHAWF-------KALPEADVALPRPTP--DDIAYLQYSS---GST 188
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  549 ---LGVTVTRIALLTHCQALTQ-ACGYTEAETIVNVLDFKKDVGLWHGILTSVMNMMHVISIPYALMKVNPLSWIQKVCQ 624
Cdd:PRK09192   189 rfpRGVIITHRALMANLRAISHdGLKVRPGDRCVSWLPFYHDMGLVGFLLTPVATQLSVDYLPTRDFARRPLQWLDLISR 268
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  625 YKAKVAcvKSRDMHWALVAHR----DQRDVNLSSLRmlIVADGANPWSISSCDAFLNVFQSKGLRQEVICPCASSPEAlT 700
Cdd:PRK09192   269 NRGTIS--YSPPFGYELCARRvnskDLAELDLSCWR--VAGIGADMIRPDVLHQFAEAFAPAGFDDKAFMPSYGLAEA-T 343
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  701 VAIrrptddSNQPPGRGV----LSMHGLTYGVIRVDSEEK-LSVLTVQDVGLVMPGAIMCSVKPDG--VPQLcrtdEIGE 773
Cdd:PRK09192   344 LAV------SFSPLGSGIvveeVDRDRLEYQGKAVAPGAEtRRVRTFVNCGKALPGHEIEIRNEAGmpLPER----VVGH 413
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  774 LCVcavatgtsyYGLSGMTkntfEVFPMTSSGAPISEYPFIRTGLLGFVGPGGLVfVVGKMDGLMVVSGRRHNADDIVAT 853
Cdd:PRK09192   414 ICV---------RGPSLMS----GYFRDEESQDVLAADGWLDTGDLGYLLDGYLY-ITGRAKDLIIINGRNIWPQDIEWI 479
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  854 AlavEPMKFVYRGRIAVFSVTVLHDERIVIVAEQRPdSTEEDSFQWMSRVLQAIDSIHqvGVYCL-ALVPANTLPKTPLG 932
Cdd:PRK09192   480 A---EQEPELRSGDAAAFSIAQENGEKIVLLVQCRI-SDEERRGQLIHALAALVRSEF--GVEAAvELVPPHSLPRTSSG 553
                          570
                   ....*....|....*...
gi 1958706156  933 GIHLSETKQLFLEGSLHP 950
Cdd:PRK09192   554 KLSRAKAKKRYLSGAFAS 571
AMP-binding pfam00501
AMP-binding enzyme;
365-841 1.40e-23

AMP-binding enzyme;


Pssm-ID: 459834 [Multi-domain]  Cd Length: 417  Bit Score: 105.47  E-value: 1.40e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  365 LQRWGTISPKAPCLTTMDTngkplYILTYGKLWTRSMKVAyNILHKLGtkqepmVRPGDRVALVFPNNdpAAFMVAFYGC 444
Cdd:pfam00501    1 LERQAARTPDKTALEVGEG-----RRLTYRELDERANRLA-AGLRALG------VGKGDRVAILLPNS--PEWVVAFLAC 66
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  445 LLAEVVPVPIEVpltrkDAGSQQIGFLLGSCGVTVALTSD--------ACHKGLPKSPTGEIPQFKGWPKLLWFVTESKH 516
Cdd:pfam00501   67 LKAGAVYVPLNP-----RLPAEELAYILEDSGAKVLITDDalkleellEALGKLEVVKLVLVLDRDPVLKEEPLPEEAKP 141
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  517 LSKPPRdwfPHIKDANNDTAYIEY--------KtckdgsvlGVTVTRIALLTHCQALTQAC----GYTEAETIVNVLDFK 584
Cdd:pfam00501  142 ADVPPP---PPPPPDPDDLAYIIYtsgttgkpK--------GVMLTHRNLVANVLSIKRVRprgfGLGPDDRVLSTLPLF 210
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  585 KDVGLWHGILTSVMNMMHVISIPYALMKvNPLSWIQKVCQYKAKVACVKSRDMHwALVAHRDQRDVNLSSLRMLIVadGA 664
Cdd:pfam00501  211 HDFGLSLGLLGPLLAGATVVLPPGFPAL-DPAALLELIERYKVTVLYGVPTLLN-MLLEAGAPKRALLSSLRLVLS--GG 286
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  665 NPWSISSCDAFLNVFqskglrqevicpcasspealtvairrptddsnqppGRGVLSMHGLT--YGVIRVDSEEKLSVLTV 742
Cdd:pfam00501  287 APLPPELARRFRELF-----------------------------------GGALVNGYGLTetTGVVTTPLPLDEDLRSL 331
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  743 QDVGLVMPGAIMCSVKPDGVpQLCRTDEIGELCV--CAVATGtsYYGLSGMTKNTFEvfpmtssgapisEYPFIRTGLLG 820
Cdd:pfam00501  332 GSVGRPLPGTEVKIVDDETG-EPVPPGEPGELCVrgPGVMKG--YLNDPELTAEAFD------------EDGWYRTGDLG 396
                          490       500
                   ....*....|....*....|.
gi 1958706156  821 FVGPGGLVFVVGKMDGLMVVS 841
Cdd:pfam00501  397 RRDEDGYLEIVGRKKDQIKLG 417
AA-adenyl-dom TIGR01733
amino acid adenylation domain; This model represents a domain responsible for the specific ...
1051-1530 3.77e-23

amino acid adenylation domain; This model represents a domain responsible for the specific recognition of amino acids and activation as adenylyl amino acids. The reaction catalyzed is aa + ATP -> aa-AMP + PPi. These domains are usually found as components of multi-domain non-ribosomal peptide synthetases and are usually called "A-domains" in that context. A-domains are almost invariably followed by "T-domains" (thiolation domains, pfam00550) to which the amino acid adenylate is transferred as a thiol-ester to a bound pantetheine cofactor with the release of AMP (these are also called peptide carrier proteins, or PCPs. When the A-domain does not represent the first module (corresponding to the first amino acid in the product molecule) it is usually preceded by a "C-domain" (condensation domain, pfam00668) which catalyzes the ligation of two amino acid thiol-esters from neighboring modules. This domain is a subset of the AMP-binding domain found in Pfam (pfam00501) which also hits substrate--CoA ligases and luciferases. Sequences scoring in between trusted and noise for this model may be ambiguous as to whether they activate amino acids or other molecules lacking an alpha amino group.


Pssm-ID: 273779 [Multi-domain]  Cd Length: 409  Bit Score: 103.88  E-value: 3.77e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1051 QLHKRAEKIAVMLMERGHLQDGDHVALVYPPGIDLIAAFYGCLYAGC--VPITvrPPHPQN-IATtlptvkmIVEVSRSA 1127
Cdd:TIGR01733    4 ELDERANRLARHLRAAGGVGPGDRVAVLLERSAELVVAILAVLKAGAayVPLD--PAYPAErLAF-------ILEDAGAR 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1128 CLMTTQLICKLLRSREAAAAVDVRTWPLILDTDDLPKKRPAqiykPSNPDTLAYLDFSVSTTGMLAGVKMSHAATSAFCR 1207
Cdd:TIGR01733   75 LLLTDSALASRLAGLVLPVILLDPLELAALDDAPAPPPPDA----PSGPDDLAYVIYTSGSTGRPKGVVVTHRSLVNLLA 150
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1208 SIklqCELYPSREVAICLDpYCGLGF------VLWCLcsvYSGHQSILIPPSELETNPALWLLAVSQYKVRDTFCSYSVM 1281
Cdd:TIGR01733  151 WL---ARRYGLDPDDRVLQ-FASLSFdasveeIFGAL---LAGATLVVPPEDEERDDAALLAALIAEHPVTVLNLTPSLL 223
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1282 ELCtkglgsqtesLKARGLDLSRVRTcVVVAEERPRIALTQSFSKLFKDLglhpravstsfgcrvnlaiclqphRLWTla 1361
Cdd:TIGR01733  224 ALL----------AAALPPALASLRL-VILGGEALTPALVDRWRARGPGA------------------------RLIN-- 266
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1362 eqgTSGPDPTTVYVDMRALRHDRVRlvergSPHSLPLmesGKILPGVRIIIANPETKgPLGDSHLGEIWVHSAHNASGYF 1441
Cdd:TIGR01733  267 ---LYGPTETTVWSTATLVDPDDAP-----RESPVPI---GRPLANTRLYVLDDDLR-PVPVGVVGELYIGGPGVARGYL 334
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1442 tiyGDESLQSDHFNSRLSFGDTQTIWARTGYLGflRRteltDANGErhdaLYVVGALDEAMELRGMRYHPIDIEtSVIRA 1521
Cdd:TIGR01733  335 ---NRPELTAERFVPDPFAGGDGARLYRTGDLV--RY----LPDGN----LEFLGRIDDQVKIRGYRIELGEIE-AALLR 400

                   ....*....
gi 1958706156 1522 HKSVTECAV 1530
Cdd:TIGR01733  401 HPGVREAVV 409
DMAP_binding pfam06464
DMAP1-binding Domain; This domain binds DMAP1, a transcriptional co-repressor.
14-159 1.17e-21

DMAP1-binding Domain; This domain binds DMAP1, a transcriptional co-repressor.


Pssm-ID: 368923 [Multi-domain]  Cd Length: 104  Bit Score: 91.33  E-value: 1.17e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156   14 GDITQKGYEKKRSKLIGAYLPQPptangaavvrcrlqpgegvprrpfrsahigvcdireaaareraasaagnrplfyfrf 93
Cdd:pfam06464   23 GDITEKGYEKKKLKLLRKFLLHP--------------------------------------------------------- 45
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1958706156   94 gvdqalpqerRAPVTPSSASRYHR-RRSSGSRDERYRSDVHTEAVQAALAKHKERKMAVPMPSKRRS 159
Cdd:pfam06464   46 ----------ETPTKLSAEAQNQLaSLETKLRDEELSEEVYLEKVKALLAKELERENGLNAPTKEQS 102
AA-adenyl-dom TIGR01733
amino acid adenylation domain; This model represents a domain responsible for the specific ...
392-540 1.70e-04

amino acid adenylation domain; This model represents a domain responsible for the specific recognition of amino acids and activation as adenylyl amino acids. The reaction catalyzed is aa + ATP -> aa-AMP + PPi. These domains are usually found as components of multi-domain non-ribosomal peptide synthetases and are usually called "A-domains" in that context. A-domains are almost invariably followed by "T-domains" (thiolation domains, pfam00550) to which the amino acid adenylate is transferred as a thiol-ester to a bound pantetheine cofactor with the release of AMP (these are also called peptide carrier proteins, or PCPs. When the A-domain does not represent the first module (corresponding to the first amino acid in the product molecule) it is usually preceded by a "C-domain" (condensation domain, pfam00668) which catalyzes the ligation of two amino acid thiol-esters from neighboring modules. This domain is a subset of the AMP-binding domain found in Pfam (pfam00501) which also hits substrate--CoA ligases and luciferases. Sequences scoring in between trusted and noise for this model may be ambiguous as to whether they activate amino acids or other molecules lacking an alpha amino group.


Pssm-ID: 273779 [Multi-domain]  Cd Length: 409  Bit Score: 46.10  E-value: 1.70e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  392 TYGKLWTRSMKVAYNILHKLGtkqepmVRPGDRVALVFPNNdpAAFMVAFYGCLLAEVVPVPIEV--PLTRkdagsqqIG 469
Cdd:TIGR01733    1 TYRELDERANRLARHLRAAGG------VGPGDRVAVLLERS--AELVVAILAVLKAGAAYVPLDPayPAER-------LA 65
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1958706156  470 FLLGSCGVTVALTSDACHKGLPKSPTGEIPqfkgwPKLLWFVTESKHLSKPPRDWFPHikdaNNDTAYIEY 540
Cdd:TIGR01733   66 FILEDAGARLLLTDSALASRLAGLVLPVIL-----LDPLELAALDDAPAPPPPDAPSG----PDDLAYVIY 127
 
Name Accession Description Interval E-value
Dip2 cd05905
Disco-interacting protein 2 (Dip2); Dip2 proteins show sequence similarity to other members of ...
1033-1607 0e+00

Disco-interacting protein 2 (Dip2); Dip2 proteins show sequence similarity to other members of the adenylate forming enzyme family, including insect luciferase, acetyl CoA ligases and the adenylation domain of nonribosomal peptide synthetases (NRPS). However, its function may have diverged from other members of the superfamily. In mouse embryo, Dip2 homolog A plays an important role in the development of both vertebrate and invertebrate nervous systems. Dip2A appears to regulate cell growth and the arrangement of cells in organs. Biochemically, Dip2A functions as a receptor of FSTL1, an extracellular glycoprotein, and may play a role as a cardiovascular protective agent.


Pssm-ID: 341231 [Multi-domain]  Cd Length: 571  Bit Score: 786.16  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1033 LYTLLNCRGTIANSLTCVQLHKRAEKIAVMLMERGHLQDGDHVALVYPPGIDLIAAFYGCLYAGCVPITVRPPH-PQNIA 1111
Cdd:cd05905      1 AYTLLDSKGKEATTLTWGKLLSRAEKIAAVLQKKVGLKPGDRVALMYPDPLDFVAAFYGCLYAGVVPIPIEPPDiSQQLG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1112 TTLPTVKMivEVSRSACLMTTQLICKLLRSREAAAAVDVRTWPLILDTDDLPKKRPA-----QIYKPSNPDTLAYLDFSV 1186
Cdd:cd05905     81 FLLGTCKV--RVALTVEACLKGLPKKLLKSKTAAEIAKKKGWPKILDFVKIPKSKRSklkkwGPHPPTRDGDTAYIEYSF 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1187 STTGMLAGVKMSHAATSAFCRSIKLQCELYPSREVAICLDPYCGLGFVLWCLCSVYSGHQSILIPPSELETNPALWLLAV 1266
Cdd:cd05905    159 SSDGSLSGVAVSHSSLLAHCRALKEACELYESRPLVTVLDFKSGLGLWHGCLLSVYSGHHTILIPPELMKTNPLLWLQTL 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1267 SQYKVRDTFCSYSVMELCTKGLGSQTESLKARGLDLSRVRTCVVVAEERPRIALTQSFSKLFKDLGLHPRAVSTSFGCRV 1346
Cdd:cd05905    239 SQYKVRDAYVKLRTLHWCLKDLSSTLASLKNRDVNLSSLRMCMVPCENRPRISSCDSFLKLFQTLGLSPRAVSTEFGTRV 318
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1347 NLAICLqphrlwtlaeQGTSGPDPTTVYVDMRALRHDRVRLVERGSPHSLPLMESGKILPGVRIIIANPETKGPLGDSHL 1426
Cdd:cd05905    319 NPFICW----------QGTSGPEPSRVYLDMRALRHGVVRLDERDKPNSLPLQDSGKVLPGAQVAIVNPETKGLCKDGEI 388
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1427 GEIWVHSAHNASGYFTIYGDESLQSDHFNS-RLSFGDTQTIWARTGYLGFLRRTELTDANGERHDALYVVGALDEAMELR 1505
Cdd:cd05905    389 GEIWVNSPANASGYFLLDGETNDTFKVFPStRLSTGITNNSYARTGLLGFLRPTKCTDLNVEEHDLLFVVGSIDETLEVR 468
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1506 GMRYHPIDIETSVIRAHKSVTECAVFTWTNLLVVVVELD-GSEQEALDLVPLVTNVVLEEHYLIVGVVVVVDIGVIPINS 1584
Cdd:cd05905    469 GLRHHPSDIEATVMRVHPYRGRCAVFSITGLVVVVAEQPpGSEEEALDLVPLVLNAILEEHQVIVDCVALVPPGSLPKNP 548
                          570       580
                   ....*....|....*....|...
gi 1958706156 1585 RGEKQRMHLRDGFLADQLDPIYV 1607
Cdd:cd05905    549 LGEKQRMEIRQAFLAGKLHPIYV 571
Dip2 cd05905
Disco-interacting protein 2 (Dip2); Dip2 proteins show sequence similarity to other members of ...
377-953 0e+00

Disco-interacting protein 2 (Dip2); Dip2 proteins show sequence similarity to other members of the adenylate forming enzyme family, including insect luciferase, acetyl CoA ligases and the adenylation domain of nonribosomal peptide synthetases (NRPS). However, its function may have diverged from other members of the superfamily. In mouse embryo, Dip2 homolog A plays an important role in the development of both vertebrate and invertebrate nervous systems. Dip2A appears to regulate cell growth and the arrangement of cells in organs. Biochemically, Dip2A functions as a receptor of FSTL1, an extracellular glycoprotein, and may play a role as a cardiovascular protective agent.


Pssm-ID: 341231 [Multi-domain]  Cd Length: 571  Bit Score: 737.62  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  377 CLTTMDTNGKPLYILTYGKLWTRSMKVAYNILHKLGTKqepmvrPGDRVALVFPnnDPAAFMVAFYGCLLAEVVPVPIEV 456
Cdd:cd05905      1 AYTLLDSKGKEATTLTWGKLLSRAEKIAAVLQKKVGLK------PGDRVALMYP--DPLDFVAAFYGCLYAGVVPIPIEP 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  457 PLtrkdaGSQQIGFLLGSCGVTVALTSDACHKGLPKS-----PTGEIPQFKGWPKLLWFVTESKHLSKPPRDWFPHIKDA 531
Cdd:cd05905     73 PD-----ISQQLGFLLGTCKVRVALTVEACLKGLPKKllkskTAAEIAKKKGWPKILDFVKIPKSKRSKLKKWGPHPPTR 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  532 NNDTAYIEYKTCKDGSVLGVTVTRIALLTHCQALTQACGYTEAETIVNVLDFKKDVGLWHGILTSVMNMMHVISIPYALM 611
Cdd:cd05905    148 DGDTAYIEYSFSSDGSLSGVAVSHSSLLAHCRALKEACELYESRPLVTVLDFKSGLGLWHGCLLSVYSGHHTILIPPELM 227
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  612 KVNPLSWIQKVCQYKAKVACVKSRDMHWAL------VAHRDQRDVNLSSLRMLIVADGaNPWSISSCDAFLNVFQSKGLR 685
Cdd:cd05905    228 KTNPLLWLQTLSQYKVRDAYVKLRTLHWCLkdlsstLASLKNRDVNLSSLRMCMVPCE-NRPRISSCDSFLKLFQTLGLS 306
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  686 QEvicpcASSPEALTVAIRRPTD--DSNQPPGRGVLSMHGLTYGVIRVDSEEKLSVLTVQDVGLVMPGAIMCSVKPDGVP 763
Cdd:cd05905    307 PR-----AVSTEFGTRVNPFICWqgTSGPEPSRVYLDMRALRHGVVRLDERDKPNSLPLQDSGKVLPGAQVAIVNPETKG 381
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  764 qLCRTDEIGELCVCAVATGTSYYGLSGMTKNTFEVFPMTSSGAPISEYPFIRTGLLGFVGPG----------GLVFVVGK 833
Cdd:cd05905    382 -LCKDGEIGEIWVNSPANASGYFLLDGETNDTFKVFPSTRLSTGITNNSYARTGLLGFLRPTkctdlnveehDLLFVVGS 460
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  834 MDGLMVVSGRRHNADDIVATALAVEPmkfvYRGRIAVFSVTvlhdERIVIVAEQRPDStEEDSFQWMSRVLQAIDSIHQV 913
Cdd:cd05905    461 IDETLEVRGLRHHPSDIEATVMRVHP----YRGRCAVFSIT----GLVVVVAEQPPGS-EEEALDLVPLVLNAILEEHQV 531
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|
gi 1958706156  914 GVYCLALVPANTLPKTPLGGIHLSETKQLFLEGSLHPCNV 953
Cdd:cd05905    532 IVDCVALVPPGSLPKNPLGEKQRMEIRQAFLAGKLHPIYV 571
FAAL cd05931
Fatty acyl-AMP ligase (FAAL); FAAL belongs to the class I adenylate forming enzyme family and ...
1023-1565 1.69e-69

Fatty acyl-AMP ligase (FAAL); FAAL belongs to the class I adenylate forming enzyme family and is homologous to fatty acyl-coenzyme A (CoA) ligases (FACLs). However, FAALs produce only the acyl adenylate and are unable to perform the thioester-forming reaction, while FACLs perform a two-step catalytic reaction; AMP ligation followed by CoA ligation using ATP and CoA as cofactors. FAALs have insertion motifs between the N-terminal and C-terminal subdomains that distinguish them from the FACLs. This insertion motif precludes the binding of CoA, thus preventing CoA ligation. It has been suggested that the acyl adenylates serve as substrates for multifunctional polyketide synthases to permit synthesis of complex lipids such as phthiocerol dimycocerosate, sulfolipids, mycolic acids, and mycobactin.


Pssm-ID: 341254 [Multi-domain]  Cd Length: 547  Bit Score: 244.46  E-value: 1.69e-69
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1023 WRAQTTPDHLLYTLLNCRGTIANSLTCVQLHKRAEKIAVMLmeRGHLQDGDHVALVYPPGIDLIAAFYGCLYAGCVPITV 1102
Cdd:cd05931      1 RRAAARPDRPAYTFLDDEGGREETLTYAELDRRARAIAARL--QAVGKPGDRVLLLAPPGLDFVAAFLGCLYAGAIAVPL 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1103 RPPHPqniATTLPTVKMIVEVSRSACLMTTQLICKLLRSREAAAAVDVRTWPLILDTDDLPKKRPAQIYKPSnPDTLAYL 1182
Cdd:cd05931     79 PPPTP---GRHAERLAAILADAGPRVVLTTAAALAAVRAFAASRPAAGTPRLLVVDLLPDTSAADWPPPSPD-PDDIAYL 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1183 DFSVSTTGMLAGVKMSHAATSAFCRSIKLQCELYPSREVAICLDPYCGLGFVLWCLCSVYSGHQSILIPPSELETNPALW 1262
Cdd:cd05931    155 QYTSGSTGTPKGVVVTHRNLLANVRQIRRAYGLDPGDVVVSWLPLYHDMGLIGGLLTPLYSGGPSVLMSPAAFLRRPLRW 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1263 LLAVSQYkvRDTFcsySVM-----ELCTKglgsQTESLKARGLDLSRVRTCVVVAEeRPRIALTQSFSKLFKDLGLHPRA 1337
Cdd:cd05931    235 LRLISRY--RATI---SAApnfayDLCVR----RVRDEDLEGLDLSSWRVALNGAE-PVRPATLRRFAEAFAPFGFRPEA 304
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1338 VSTSFGcrvnlaiclqphrlwtLAEQ------GTSGPDPTTVYVDMRALRHdRVRLVERGSPHSLPLMESGKILPGVRII 1411
Cdd:cd05931    305 FRPSYG----------------LAEAtlfvsgGPPGTGPVVLRVDRDALAG-RAVAVAADDPAARELVSCGRPLPDQEVR 367
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1412 IANPETKGPLGDSHLGEIWVHSAHNASGYFtiyGDESLQSDHFNSRLSFGDtqTIWARTGYLGFLRRteltdanGErhda 1491
Cdd:cd05931    368 IVDPETGRELPDGEVGEIWVRGPSVASGYW---GRPEATAETFGALAATDE--GGWLRTGDLGFLHD-------GE---- 431
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1492 LYVVGALDEAMELRGMRYHPIDIETSVIRAHKSVTE--CAVFTW----TNLLVVVVELDGSeQEALDLVPLVTNV---VL 1562
Cdd:cd05931    432 LYITGRLKDLIIVRGRNHYPQDIEATAEEAHPALRPgcVAAFSVpddgEERLVVVAEVERG-ADPADLAAIAAAIraaVA 510

                   ...
gi 1958706156 1563 EEH 1565
Cdd:cd05931    511 REH 513
FAAL cd05931
Fatty acyl-AMP ligase (FAAL); FAAL belongs to the class I adenylate forming enzyme family and ...
391-944 7.03e-60

Fatty acyl-AMP ligase (FAAL); FAAL belongs to the class I adenylate forming enzyme family and is homologous to fatty acyl-coenzyme A (CoA) ligases (FACLs). However, FAALs produce only the acyl adenylate and are unable to perform the thioester-forming reaction, while FACLs perform a two-step catalytic reaction; AMP ligation followed by CoA ligation using ATP and CoA as cofactors. FAALs have insertion motifs between the N-terminal and C-terminal subdomains that distinguish them from the FACLs. This insertion motif precludes the binding of CoA, thus preventing CoA ligation. It has been suggested that the acyl adenylates serve as substrates for multifunctional polyketide synthases to permit synthesis of complex lipids such as phthiocerol dimycocerosate, sulfolipids, mycolic acids, and mycobactin.


Pssm-ID: 341254 [Multi-domain]  Cd Length: 547  Bit Score: 216.34  E-value: 7.03e-60
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  391 LTYGKLWTRSMKVAYNILHKLgtkqepmvRPGDRVALVFPNNdpAAFMVAFYGCLLAEVVPVPIEVPLTRKDAgsQQIGF 470
Cdd:cd05931     25 LTYAELDRRARAIAARLQAVG--------KPGDRVLLLAPPG--LDFVAAFLGCLYAGAIAVPLPPPTPGRHA--ERLAA 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  471 LLGSCGVTVALTSDACHKGLPKSptgeIPQFKGWPKLLWFVTESKHLSkPPRDWFPHIKDANnDTAYIEYKTCKDGSVLG 550
Cdd:cd05931     93 ILADAGPRVVLTTAAALAAVRAF----AASRPAAGTPRLLVVDLLPDT-SAADWPPPSPDPD-DIAYLQYTSGSTGTPKG 166
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  551 VTVTRIALLTHCQALTQACGYTEAETIVNVLDFKKDVGLWHGILTSVMNMMHVISI-PYALMKvNPLSWIQKVCQYKAKV 629
Cdd:cd05931    167 VVVTHRNLLANVRQIRRAYGLDPGDVVVSWLPLYHDMGLIGGLLTPLYSGGPSVLMsPAAFLR-RPLRWLRLISRYRATI 245
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  630 ACVKsrDMHWALVAHR----DQRDVNLSSLRMLIVadGANPWSISSCDAFLNVFQSKGLRQEVICPCASSPEA-LTVAIR 704
Cdd:cd05931    246 SAAP--NFAYDLCVRRvrdeDLEGLDLSSWRVALN--GAEPVRPATLRRFAEAFAPFGFRPEAFRPSYGLAEAtLFVSGG 321
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  705 RPtddsNQPPGRGVLSMHGLTYGVIRVDSEEKLSVLTVQdVGLVMPG---AIMCsvkPDGVpQLCRTDEIGELCVC--AV 779
Cdd:cd05931    322 PP----GTGPVVLRVDRDALAGRAVAVAADDPAARELVS-CGRPLPDqevRIVD---PETG-RELPDGEVGEIWVRgpSV 392
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  780 ATGtsYYGLSGMTKNTFEVFpmtssgAPISEYPFIRTGLLGFVGPGGLvFVVGKMDGLMVVSGRRHNADDIVATALAVEP 859
Cdd:cd05931    393 ASG--YWGRPEATAETFGAL------AATDEGGWLRTGDLGFLHDGEL-YITGRLKDLIIVRGRNHYPQDIEATAEEAHP 463
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  860 MkfVYRGRIAVFSVTVLHDERIVIVAEQRPDSTEEDSFQWMSRVLQAIDSIHQVGVYCLALVPANTLPKTPLGGIHLSET 939
Cdd:cd05931    464 A--LRPGCVAAFSVPDDGEERLVVVAEVERGADPADLAAIAAAIRAAVAREHGVAPADVVLVRPGSIPRTSSGKIQRRAC 541

                   ....*
gi 1958706156  940 KQLFL 944
Cdd:cd05931    542 RAAYL 546
AMP-binding pfam00501
AMP-binding enzyme;
1021-1505 4.80e-57

AMP-binding enzyme;


Pssm-ID: 459834 [Multi-domain]  Cd Length: 417  Bit Score: 204.08  E-value: 4.80e-57
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1021 LQWRAQTTPDHLLYTllNCRGTianSLTCVQLHKRAEKIAVMLMERGhLQDGDHVALVYPPGIDLIAAFYGCLYAGCVPI 1100
Cdd:pfam00501    1 LERQAARTPDKTALE--VGEGR---RLTYRELDERANRLAAGLRALG-VGKGDRVAILLPNSPEWVVAFLACLKAGAVYV 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1101 TVRPphpqniATTLPTVKMIVEVSRSACLMTTQlICKLLRSREAAAAVDVRTWPLILDTDDLPKKRPAQIYKPSN----- 1175
Cdd:pfam00501   75 PLNP------RLPAEELAYILEDSGAKVLITDD-ALKLEELLEALGKLEVVKLVLVLDRDPVLKEEPLPEEAKPAdvppp 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1176 ------PDTLAYLDFSVSTTGMLAGVKMSHAATSAFCRSIKLQCE----LYPSREVAICLDPYCGLGFVLWCLCSVYSGH 1245
Cdd:pfam00501  148 pppppdPDDLAYIIYTSGTTGKPKGVMLTHRNLVANVLSIKRVRPrgfgLGPDDRVLSTLPLFHDFGLSLGLLGPLLAGA 227
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1246 QSILIPPSELeTNPALWLLAVSQYKVRDTFCSYSVMELCtkglgsqTESLKARGLDLSRVRTCVVVAeERPRIALTQSFS 1325
Cdd:pfam00501  228 TVVLPPGFPA-LDPAALLELIERYKVTVLYGVPTLLNML-------LEAGAPKRALLSSLRLVLSGG-APLPPELARRFR 298
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1326 KLFkdlglhPRAVSTSFGcrvnlaiclqphrlwtLAEqgTSGPdptTVYVDMRALRHDRVRLVergsphslplmesGKIL 1405
Cdd:pfam00501  299 ELF------GGALVNGYG----------------LTE--TTGV---VTTPLPLDEDLRSLGSV-------------GRPL 338
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1406 PGVRIIIANPETKGPLGDSHLGEIWVHSAHNASGYftiYGDESLQSDHFnsrlsfgdTQTIWARTGYLGFLrrteltDAN 1485
Cdd:pfam00501  339 PGTEVKIVDDETGEPVPPGEPGELCVRGPGVMKGY---LNDPELTAEAF--------DEDGWYRTGDLGRR------DED 401
                          490       500
                   ....*....|....*....|
gi 1958706156 1486 GErhdaLYVVGALDEAMELR 1505
Cdd:pfam00501  402 GY----LEIVGRKKDQIKLG 417
AFD_class_I cd04433
Adenylate forming domain, Class I, also known as the ANL superfamily; This family is known as ...
1178-1552 2.72e-38

Adenylate forming domain, Class I, also known as the ANL superfamily; This family is known as the ANL (acyl-CoA synthetases, the NRPS adenylation domains, and the Luciferase enzymes) superfamily. It includes acyl- and aryl-CoA ligases, as well as the adenylation domain of nonribosomal peptide synthetases and firefly luciferases.The adenylate-forming enzymes catalyze an ATP-dependent two-step reaction to first activate a carboxylate substrate as an adenylate and then transfer the carboxylate to the pantetheine group of either coenzyme A or an acyl-carrier protein. The active site of the domain is located at the interface of a large N-terminal subdomain and a smaller C-terminal subdomain.


Pssm-ID: 341228 [Multi-domain]  Cd Length: 336  Bit Score: 147.05  E-value: 2.72e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1178 TLAYLDFSVSTTGMLAGVKMSH---AATSAFCRSIKlqceLYPSREVAICLDPYCGLGFVLWCLCSVYSGHQSILIPPSe 1254
Cdd:cd04433      1 DPALILYTSGTTGKPKGVVLSHrnlLAAAAALAASG----GLTEGDVFLSTLPLFHIGGLFGLLGALLAGGTVVLLPKF- 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1255 letNPALWLLAVSQYKVRDTFCSYSVMELCTKglgsqteSLKARGLDLSRVRTCVVVAEERPrIALTQSFSKLfkdlglh 1334
Cdd:cd04433     76 ---DPEAALELIEREKVTILLGVPTLLARLLK-------APESAGYDLSSLRALVSGGAPLP-PELLERFEEA------- 137
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1335 pravstsFGCRVnlaiclqpHRLWTLAEqgtSGPDPTTVYVDMRALRhdrvrlveRGSphslplmeSGKILPGVRIIIAN 1414
Cdd:cd04433    138 -------PGIKL--------VNGYGLTE---TGGTVATGPPDDDARK--------PGS--------VGRPVPGVEVRIVD 183
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1415 PETkGPLGDSHLGEIWVHSAHNASGYFTiygdeslqsdhfNSRLSFGDTQTIWARTGYLGFLrrteltDANGErhdaLYV 1494
Cdd:cd04433    184 PDG-GELPPGEIGELVVRGPSVMKGYWN------------NPEATAAVDEDGWYRTGDLGRL------DEDGY----LYI 240
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1958706156 1495 VGALDEAMELRGMRYHPIDIETsVIRAHKSVTECAVF-----TWTNLLVVVVELDGSEQEALD 1552
Cdd:cd04433    241 VGRLKDMIKSGGENVYPAEVEA-VLLGHPGVAEAAVVgvpdpEWGERVVAVVVLRPGADLDAE 302
PRK05691 PRK05691
peptide synthase; Validated
1017-1547 4.39e-35

peptide synthase; Validated


Pssm-ID: 235564 [Multi-domain]  Cd Length: 4334  Bit Score: 147.62  E-value: 4.39e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1017 LSEVLQWRAQTTPDHLLYTLLNCRGTIANSLTCVQLHKRAEKIAVMLMERGHLqdGDHVALVYPPGIDLIAAFYGCLYAG 1096
Cdd:PRK05691    11 LVQALQRRAAQTPDRLALRFLADDPGEGVVLSYRDLDLRARTIAAALQARASF--GDRAVLLFPSGPDYVAAFFGCLYAG 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1097 CV------PITVRPPHPQNIATtlptvkmIVEVSRSACLMTTQLICKLLRSREAAAAVDVrtwPLILDTDDLPKKRPAQI 1170
Cdd:PRK05691    89 VIavpaypPESARRHHQERLLS-------IIADAEPRLLLTVADLRDSLLQMEELAAANA---PELLCVDTLDPALAEAW 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1171 YKPS-NPDTLAYLDFSVSTTGMLAGVKMSHAATSAfcrsiklqCELYPSREVAICLDP----------YCGLGFVLWCLC 1239
Cdd:PRK05691   159 QEPAlQPDDIAFLQYTSGSTALPKGVQVSHGNLVA--------NEQLIRHGFGIDLNPddvivswlplYHDMGLIGGLLQ 230
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1240 SVYSGHQSILIPPSELETNPALWLLAVSQYKVRDTFCSYSVMELCTKGLGsqtESLKARgLDLSRVRtcVVVAEERP-RI 1318
Cdd:PRK05691   231 PIFSGVPCVLMSPAYFLERPLRWLEAISEYGGTISGGPDFAYRLCSERVS---ESALER-LDLSRWR--VAYSGSEPiRQ 304
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1319 ALTQSFSKLFKDLGLHPRAVSTSFGcrvnlaiclqphrlwtLAEQ------GTSGPDPTTVYVDMRALRHDRVRLVErGS 1392
Cdd:PRK05691   305 DSLERFAEKFAACGFDPDSFFASYG----------------LAEAtlfvsgGRRGQGIPALELDAEALARNRAEPGT-GS 367
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1393 phslPLMESGKILPGVRIIIANPETKGPLGDSHLGEIWVHSAHNASGYftiYGDESLQSDHFNSRlsfgDTQTiWARTGY 1472
Cdd:PRK05691   368 ----VLMSCGRSQPGHAVLIVDPQSLEVLGDNRVGEIWASGPSIAHGY---WRNPEASAKTFVEH----DGRT-WLRTGD 435
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1958706156 1473 LGFLRRTEltdangerhdaLYVVGALDEAMELRGMRYHPIDIETSVIRahksvtECAVFTWTNLLVVVVELDGSE 1547
Cdd:PRK05691   436 LGFLRDGE-----------LFVTGRLKDMLIVRGHNLYPQDIEKTVER------EVEVVRKGRVAAFAVNHQGEE 493
PRK05850 PRK05850
acyl-CoA synthetase; Validated
1020-1561 1.42e-32

acyl-CoA synthetase; Validated


Pssm-ID: 235624 [Multi-domain]  Cd Length: 578  Bit Score: 135.46  E-value: 1.42e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1020 VLQWRAQTTPDHLLYTLLNCR---GTIANSLTCVQLHKRAEKIAVMLMERGhlQDGDHVALVYPPGIDLIAAFYGCLYAG 1096
Cdd:PRK05850     6 LLRERASLQPDDAAFTFIDYEqdpAGVAETLTWSQLYRRTLNVAEELRRHG--STGDRAVILAPQGLEYIVAFLGALQAG 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1097 CVPITVRPPHPQN--------IATTLPTVkmivevsrsacLMTTQLICKLLRSREAAAAVDVRTWPLILDTDDLPKKRPA 1168
Cdd:PRK05850    84 LIAVPLSVPQGGAhdervsavLRDTSPSV-----------VLTTSAVVDDVTEYVAPQPGQSAPPVIEVDLLDLDSPRGS 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1169 QIYKPSNPDTlAYLDFSVSTTGMLAGVKMSHAATSAFCRsiKLQCELYPSREVAICLD-------P-YCGLGFVLWCLCS 1240
Cdd:PRK05850   153 DARPRDLPST-AYLQYTSGSTRTPAGVMVSHRNVIANFE--QLMSDYFGDTGGVPPPDttvvswlPfYHDMGLVLGVCAP 229
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1241 VYSGHQSILIPPSELETNPALW--LLAVSQYkvrdtfcSYSV-----MELCTKglgsQTESLKARGLDLSRVRTcVVVAE 1313
Cdd:PRK05850   230 ILGGCPAVLTSPVAFLQRPARWmqLLASNPH-------AFSAapnfaFELAVR----KTSDDDMAGLDLGGVLG-IISGS 297
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1314 ERPRIALTQSFSKLFKDLGLHPRAVSTSFGcrvnlaiclqphrlwtLAE------QGTSGPDPTTVYVDMRALRHDRVR- 1386
Cdd:PRK05850   298 ERVHPATLKRFADRFAPFNLRETAIRPSYG----------------LAEatvyvaTREPGQPPESVRFDYEKLSAGHAKr 361
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1387 --------LVERGSPHSlplmesgkilPGVRIIiaNPETKGPLGDSHLGEIWVHSAHNASGYFTiyGDESLQSDhFNSRL 1458
Cdd:PRK05850   362 cetgggtpLVSYGSPRS----------PTVRIV--DPDTCIECPAGTVGEIWVHGDNVAAGYWQ--KPEETERT-FGATL 426
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1459 ---SFGDTQTIWARTGYLGFLrrteltdANGErhdaLYVVGALDEAMELRGMRYHPIDIETSVirahKSVT--ECAVFT- 1532
Cdd:PRK05850   427 vdpSPGTPEGPWLRTGDLGFI-------SEGE----LFIVGRIKDLLIVDGRNHYPDDIEATI----QEITggRVAAISv 491
                          570       580       590
                   ....*....|....*....|....*....|....*
gi 1958706156 1533 ---WTNLLVVVVEL---DGSEQEALDLVPLVTNVV 1561
Cdd:PRK05850   492 pddGTEKLVAIIELkkrGDSDEEAMDRLRTVKREV 526
MenE/FadK COG0318
O-succinylbenzoic acid-CoA ligase MenE or related acyl-CoA synthetase (AMP-forming) [Lipid ...
1017-1554 2.39e-31

O-succinylbenzoic acid-CoA ligase MenE or related acyl-CoA synthetase (AMP-forming) [Lipid transport and metabolism]; O-succinylbenzoic acid-CoA ligase MenE or related acyl-CoA synthetase (AMP-forming) is part of the Pathway/BioSystem: Menaquinone biosynthesis


Pssm-ID: 440087 [Multi-domain]  Cd Length: 452  Bit Score: 129.55  E-value: 2.39e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1017 LSEVLQWRAQTTPDHllyTLLNCRGTianSLTCVQLHKRAEKIAVMLMERGhLQDGDHVALVYPPGIDLIAAFYGCLYAG 1096
Cdd:COG0318      1 LADLLRRAAARHPDR---PALVFGGR---RLTYAELDARARRLAAALRALG-VGPGDRVALLLPNSPEFVVAFLAALRAG 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1097 CVPITVrpphpqNIATTLPTVKMIVEVSRSACLMTtqlickllrsreaaaavdvrtwplildtddlpkkrpaqiykpsnp 1176
Cdd:COG0318     74 AVVVPL------NPRLTAEELAYILEDSGARALVT--------------------------------------------- 102
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1177 dtlAYLDFSvS-TTGMLAGVKMSHAATSAFCRSIKLQCELYPSREVAICLDPYCGLGFVLWCLCSVYSGHQSILIPpsel 1255
Cdd:COG0318    103 ---ALILYT-SgTTGRPKGVMLTHRNLLANAAAIAAALGLTPGDVVLVALPLFHVFGLTVGLLAPLLAGATLVLLP---- 174
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1256 ETNPALWLLAVSQYKVrdTFCSYS---VMELCtkglgsqtESLKARGLDLSRVRTCVVVAEerpriALTQSFSKLFKDLg 1332
Cdd:COG0318    175 RFDPERVLELIERERV--TVLFGVptmLARLL--------RHPEFARYDLSSLRLVVSGGA-----PLPPELLERFEER- 238
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1333 lhpravstsFGCRVnlaiclqpHRLWTLAEqgtSGPDPTTVYVDMRALRHDRVrlvergsphslplmesGKILPGVRIII 1412
Cdd:COG0318    239 ---------FGVRI--------VEGYGLTE---TSPVVTVNPEDPGERRPGSV----------------GRPLPGVEVRI 282
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1413 ANPETKgPLGDSHLGEIWVHSAHNASGYftiYGDESLqsdhfnSRLSFGDTqtiWARTGYLGFLrrteltDANGErhdaL 1492
Cdd:COG0318    283 VDEDGR-ELPPGEVGEIVVRGPNVMKGY---WNDPEA------TAEAFRDG---WLRTGDLGRL------DEDGY----L 339
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958706156 1493 YVVGALDEAMELRGMRYHPIDIEtSVIRAHKSVTECAVF-----TWTNLLVVVVEL-DGSEQEALDLV 1554
Cdd:COG0318    340 YIVGRKKDMIISGGENVYPAEVE-EVLAAHPGVAEAAVVgvpdeKWGERVVAFVVLrPGAELDAEELR 406
A_NRPS_TubE_like cd05906
The adenylation domain (A domain) of a family of nonribosomal peptide synthetases (NRPSs) ...
358-946 6.43e-31

The adenylation domain (A domain) of a family of nonribosomal peptide synthetases (NRPSs) synthesizing toxins and antitumor agents; The adenylation (A) domain of NRPS recognizes a specific amino acid or hydroxy acid and activates it as an (amino)-acyl adenylate by hydrolysis of ATP. The activated acyl moiety then forms a thioester to the enzyme-bound cofactor phosphopantetheine of a peptidyl carrier protein domain. This family includes NRPSs that synthesize toxins and antitumor agents; for example, TubE for Tubulysine, CrpA for cryptophycin, TdiA for terrequinone A, KtzG for kutzneride, and Vlm1/Vlm2 for Valinomycin. Nonribosomal peptide synthetases are large multifunctional enzymes which synthesize many therapeutically useful peptides. NRPS has a distinct modular structure in which each module is responsible for the recognition, activation, and, in some cases, modification of a single amino acid residue of the final peptide product. The modules can be subdivided into domains that catalyze specific biochemical reactions.


Pssm-ID: 341232 [Multi-domain]  Cd Length: 540  Bit Score: 129.71  E-value: 6.43e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  358 PPSLEAALQRWGTISPKAPClTTMDTNGKPlYILTYGKLWTRsmkvAYNILHKLGTKQepmVRPGDRVALVFPNNDpaAF 437
Cdd:cd05906      9 PRTLLELLLRAAERGPTKGI-TYIDADGSE-EFQSYQDLLED----ARRLAAGLRQLG---LRPGDSVILQFDDNE--DF 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  438 MVAFYGCLLAEVVPVPIEVPLTRKDAGSQ-----QIGFLLGSCgvtVALTSDACHKGLpksptgeIPQFKGWPKLLWFVT 512
Cdd:cd05906     78 IPAFWACVLAGFVPAPLTVPPTYDEPNARlrklrHIWQLLGSP---VVLTDAELVAEF-------AGLETLSGLPGIRVL 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  513 ESKHLSKPPRDWFPHIKDAnNDTAYIEYKTCKDGSVLGVTVTRIALLTHCQALTQACGYTEAETIVNVLDFKKDVGLWHG 592
Cdd:cd05906    148 SIEELLDTAADHDLPQSRP-DDLALLMLTSGSTGFPKAVPLTHRNILARSAGKIQHNGLTPQDVFLNWVPLDHVGGLVEL 226
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  593 ILTSVMNMMHVISIPYALMKVNPLSWIQKVCQYKAKV------ACVKSRDmhwaLVAHRDQRDVNLSSLRMLIVADGANp 666
Cdd:cd05906    227 HLRAVYLGCQQVHVPTEEILADPLRWLDLIDRYRVTItwapnfAFALLND----LLEEIEDGTWDLSSLRYLVNAGEAV- 301
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  667 wSISSCDAFLNVFQSKGLRQEVICPCASSPEalTVAirrptddsnqppgrgvlsmhGLTYGviRVDSEEKLS-VLTVQDV 745
Cdd:cd05906    302 -VAKTIRRLLRLLEPYGLPPDAIRPAFGMTE--TCS--------------------GVIYS--RSFPTYDHSqALEFVSL 356
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  746 GLVMPGAIMCSVKPDGvpQLCRTDEIGELCVCavatGTS----YYGLSGMTKNTFevfpmTSSGapiseypFIRTGLLGF 821
Cdd:cd05906    357 GRPIPGVSMRIVDDEG--QLLPEGEVGRLQVR----GPVvtkgYYNNPEANAEAF-----TEDG-------WFRTGDLGF 418
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  822 VGPGGLVFvVGKMDGLMVVSGRRHNADDIVAtalAVEPMKFVYRGRIAVFSVTVLHD--ERIVIVAeqrpdSTEEDSFQW 899
Cdd:cd05906    419 LDNGNLTI-TGRTKDTIIVNGVNYYSHEIEA---AVEEVPGVEPSFTAAFAVRDPGAetEELAIFF-----VPEYDLQDA 489
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1958706156  900 MSRVLQAIDSI--HQVGVYCLALVP--ANTLPKTPLGGIHLSETKQLFLEG 946
Cdd:cd05906    490 LSETLRAIRSVvsREVGVSPAYLIPlpKEEIPKTSLGKIQRSKLKAAFEAG 540
MenE/FadK COG0318
O-succinylbenzoic acid-CoA ligase MenE or related acyl-CoA synthetase (AMP-forming) [Lipid ...
361-948 2.31e-29

O-succinylbenzoic acid-CoA ligase MenE or related acyl-CoA synthetase (AMP-forming) [Lipid transport and metabolism]; O-succinylbenzoic acid-CoA ligase MenE or related acyl-CoA synthetase (AMP-forming) is part of the Pathway/BioSystem: Menaquinone biosynthesis


Pssm-ID: 440087 [Multi-domain]  Cd Length: 452  Bit Score: 123.38  E-value: 2.31e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  361 LEAALQRWGTISPKAPCLTTMDTNgkplyiLTYGKLWTRSMKVAyNILHKLGtkqepmVRPGDRVALVFPNNdpAAFMVA 440
Cdd:COG0318      1 LADLLRRAAARHPDRPALVFGGRR------LTYAELDARARRLA-AALRALG------VGPGDRVALLLPNS--PEFVVA 65
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  441 FYGCLLAEVVPVPIEVPLTRKdagsqQIGFLLGSCGVTVALTSDACH----KGLPKsptgeipqfkgwpkllwfvteskh 516
Cdd:COG0318     66 FLAALRAGAVVVPLNPRLTAE-----ELAYILEDSGARALVTALILYtsgtTGRPK------------------------ 116
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  517 lskpprdwfphikdanndtayieyktckdgsvlGVTVTRIALLTHCQALTQACGYTEAETIVNVLDFKKDVGLWHGILTS 596
Cdd:COG0318    117 ---------------------------------GVMLTHRNLLANAAAIAAALGLTPGDVVLVALPLFHVFGLTVGLLAP 163
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  597 VMNMMHVISIPyalmKVNPLSWIQKVCQYKA-KVACVKSrdMHWALVAHRDQRDVNLSSLRMLIVadGANPWSISSCDAF 675
Cdd:COG0318    164 LLAGATLVLLP----RFDPERVLELIERERVtVLFGVPT--MLARLLRHPEFARYDLSSLRLVVS--GGAPLPPELLERF 235
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  676 LNVFQSkglrqeVICPC-ASSpEALTVAIRRPTDDSNQPPGRgvlsmhgltygvirvdseeklsvltvqdVGLVMPGAIM 754
Cdd:COG0318    236 EERFGV------RIVEGyGLT-ETSPVVTVNPEDPGERRPGS----------------------------VGRPLPGVEV 280
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  755 CSVKPDGVPqlCRTDEIGELCVC--AVATGtsYYGLSGMTKNTFEVfpmtssgapiseyPFIRTGLLGFVGPGGLVFVVG 832
Cdd:COG0318    281 RIVDEDGRE--LPPGEVGEIVVRgpNVMKG--YWNDPEATAEAFRD-------------GWLRTGDLGRLDEDGYLYIVG 343
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  833 KMDGLMVVSGRRHNADDIVATALAVEPMKfvyrgRIAVFSVTV-LHDERIV--IVAEQRPDSTEEDSFQWMSRVL---QA 906
Cdd:COG0318    344 RKKDMIISGGENVYPAEVEEVLAAHPGVA-----EAAVVGVPDeKWGERVVafVVLRPGAELDAEELRAFLRERLaryKV 418
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|..
gi 1958706156  907 IDSIHQVGvyclalvpanTLPKTPLGGIHLSETKQLFLEGSL 948
Cdd:COG0318    419 PRRVEFVD----------ELPRTASGKIDRRALRERYAAGAL 450
PRK09192 PRK09192
fatty acyl-AMP ligase;
391-950 9.96e-29

fatty acyl-AMP ligase;


Pssm-ID: 236403 [Multi-domain]  Cd Length: 579  Bit Score: 123.58  E-value: 9.96e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  391 LTYGKLWTRSMKVAYNILhKLGtkqepmVRPGDRVALVfPNNDPAaFMVAFYGCLLAEVVPVPIEVP--LTRKDAGSQQI 468
Cdd:PRK09192    50 LPYQTLRARAEAGARRLL-ALG------LKPGDRVALI-AETDGD-FVEAFFACQYAGLVPVPLPLPmgFGGRESYIAQL 120
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  469 GFLLGSCGVTVALTSDACHKGLPKSPTGEIPQFKGWPKLLwfvteskHLSKPPRDWFPHIKDanNDTAYIEYKTckdGSV 548
Cdd:PRK09192   121 RGMLASAQPAAIITPDELLPWVNEATHGNPLLHVLSHAWF-------KALPEADVALPRPTP--DDIAYLQYSS---GST 188
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  549 ---LGVTVTRIALLTHCQALTQ-ACGYTEAETIVNVLDFKKDVGLWHGILTSVMNMMHVISIPYALMKVNPLSWIQKVCQ 624
Cdd:PRK09192   189 rfpRGVIITHRALMANLRAISHdGLKVRPGDRCVSWLPFYHDMGLVGFLLTPVATQLSVDYLPTRDFARRPLQWLDLISR 268
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  625 YKAKVAcvKSRDMHWALVAHR----DQRDVNLSSLRmlIVADGANPWSISSCDAFLNVFQSKGLRQEVICPCASSPEAlT 700
Cdd:PRK09192   269 NRGTIS--YSPPFGYELCARRvnskDLAELDLSCWR--VAGIGADMIRPDVLHQFAEAFAPAGFDDKAFMPSYGLAEA-T 343
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  701 VAIrrptddSNQPPGRGV----LSMHGLTYGVIRVDSEEK-LSVLTVQDVGLVMPGAIMCSVKPDG--VPQLcrtdEIGE 773
Cdd:PRK09192   344 LAV------SFSPLGSGIvveeVDRDRLEYQGKAVAPGAEtRRVRTFVNCGKALPGHEIEIRNEAGmpLPER----VVGH 413
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  774 LCVcavatgtsyYGLSGMTkntfEVFPMTSSGAPISEYPFIRTGLLGFVGPGGLVfVVGKMDGLMVVSGRRHNADDIVAT 853
Cdd:PRK09192   414 ICV---------RGPSLMS----GYFRDEESQDVLAADGWLDTGDLGYLLDGYLY-ITGRAKDLIIINGRNIWPQDIEWI 479
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  854 AlavEPMKFVYRGRIAVFSVTVLHDERIVIVAEQRPdSTEEDSFQWMSRVLQAIDSIHqvGVYCL-ALVPANTLPKTPLG 932
Cdd:PRK09192   480 A---EQEPELRSGDAAAFSIAQENGEKIVLLVQCRI-SDEERRGQLIHALAALVRSEF--GVEAAvELVPPHSLPRTSSG 553
                          570
                   ....*....|....*...
gi 1958706156  933 GIHLSETKQLFLEGSLHP 950
Cdd:PRK09192   554 KLSRAKAKKRYLSGAFAS 571
PRK07769 PRK07769
long-chain-fatty-acid--CoA ligase; Validated
420-948 1.17e-26

long-chain-fatty-acid--CoA ligase; Validated


Pssm-ID: 181109 [Multi-domain]  Cd Length: 631  Bit Score: 117.52  E-value: 1.17e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  420 RPGDRVALVFPNNdpAAFMVAFYGCLLAEVVPVPIEVPltrKDAG-SQQIGFLLGSCGVTVALTSDACHKGLPKSptgei 498
Cdd:PRK07769    77 KPGDRVAILAPQN--LDYLIAFFGALYAGRIAVPLFDP---AEPGhVGRLHAVLDDCTPSAILTTTDSAEGVRKF----- 146
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  499 pqFKGWPKllwfvteskhlSKPPR-------------DWFPhiKDANNDT-AYIEYKTckdGSV---LGVTVTRIALLTH 561
Cdd:PRK07769   147 --FRARPA-----------KERPRviavdavpdevgaTWVP--PEANEDTiAYLQYTS---GSTripAGVQITHLNLPTN 208
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  562 CQALTQACGYTEAETIVNVLDFKKDVGLWHGILTSVMNmmHVISI--PYALMKvNPLSWIQKVCQYKAKVACVKSRDMHW 639
Cdd:PRK07769   209 VLQVIDALEGQEGDRGVSWLPFFHDMGLITVLLPALLG--HYITFmsPAAFVR-RPGRWIRELARKPGGTGGTFSAAPNF 285
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  640 A--LVAHR-----DQRDVNLSSLRMLIvaDGANPWSISSCDAFLNVFQSKGLRQEVICPCASSPEALTVAIRRPTDDSNQ 712
Cdd:PRK07769   286 AfeHAAARglpkdGEPPLDLSNVKGLL--NGSEPVSPASMRKFNEAFAPYGLPPTAIKPSYGMAEATLFVSTTPMDEEPT 363
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  713 ppgrgvlsmhgltygVIRVDSEEkLSVLTVQDVGLVMPGAI-------------MCSVKPDGVPQLcRTDEIGELCVCAV 779
Cdd:PRK07769   364 ---------------VIYVDRDE-LNAGRFVEVPADAPNAVaqvsagkvgvsewAVIVDPETASEL-PDGQIGEIWLHGN 426
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  780 ATGTSYYGLSGMTKNTFE------VFPMTSSGAPiSEYPFIRTGLLGFVGPGGLvFVVGKMDGLMVVSGRRHNADDIVAT 853
Cdd:PRK07769   427 NIGTGYWGKPEETAATFQnilksrLSESHAEGAP-DDALWVRTGDYGVYFDGEL-YITGRVKDLVIIDGRNHYPQDLEYT 504
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  854 ALavEPMKFVYRGRIAVFSV-------TVLHD-------------ERIVIVAEQRPDSTEEDSFQWMSRVLQAIDSIHQV 913
Cdd:PRK07769   505 AQ--EATKALRTGYVAAFSVpanqlpqVVFDDshaglkfdpedtsEQLVIVAERAPGAHKLDPQPIADDIRAAIAVRHGV 582
                          570       580       590
                   ....*....|....*....|....*....|....*
gi 1958706156  914 GVYCLALVPANTLPKTPLGGIHLSETKQLFLEGSL 948
Cdd:PRK07769   583 TVRDVLLVPAGSIPRTSSGKIARRACRAAYLDGSL 617
PRK12476 PRK12476
putative fatty-acid--CoA ligase; Provisional
1044-1525 1.19e-25

putative fatty-acid--CoA ligase; Provisional


Pssm-ID: 171527 [Multi-domain]  Cd Length: 612  Bit Score: 114.07  E-value: 1.19e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1044 ANSLTCVQLHKRAEKIAVMLMErgHLQDGDHVALVYPPGIDLIAAFYGCLYAGCVPITVRPPHPQNIATTLPTVkmiVEV 1123
Cdd:PRK12476    66 AVELTWTQLGVRLRAVGARLQQ--VAGPGDRVAILAPQGIDYVAGFFAAIKAGTIAVPLFAPELPGHAERLDTA---LRD 140
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1124 SRSACLMTTQLICKLLRSREAAAAVDVRtwPLILDTDDLPKkRPAQIYKPSNPDT--LAYLDFSVSTTGMLAGVKMSHAA 1201
Cdd:PRK12476   141 AEPTVVLTTTAAAEAVEGFLRNLPRLRR--PRVIAIDAIPD-SAGESFVPVELDTddVSHLQYTSGSTRPPVGVEITHRA 217
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1202 TSA----FCRSIKLqceLYPSREVAICLDPYCGLGFVLWCLCSVYSGHqSILIPPSELETNPALWLLAVSQ-YKVRDTFC 1276
Cdd:PRK12476   218 VGTnlvqMILSIDL---LDRNTHGVSWLPLYHDMGLSMIGFPAVYGGH-STLMSPTAFVRRPQRWIKALSEgSRTGRVVT 293
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1277 SYS--VMELCT-KGLGSQTEslkarGLDLSRVrtCVVVAEERPRIALTQSFSKLFKDLGLHPRAVSTSFGcrvnlaiclq 1353
Cdd:PRK12476   294 AAPnfAYEWAAqRGLPAEGD-----DIDLSNV--VLIIGSEPVSIDAVTTFNKAFAPYGLPRTAFKPSYG---------- 356
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1354 phrlwtLAEQ----GTSGPD--PTTVYVDMRALRHDRVRLVERGSPHSLPLMESGKILPGVRIIIANPETKGPLGDSHLG 1427
Cdd:PRK12476   357 ------IAEAtlfvATIAPDaePSVVYLDREQLGAGRAVRVAADAPNAVAHVSCGQVARSQWAVIVDPDTGAELPDGEVG 430
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1428 EIWVHSAHNASGYFTiYGDESLQSDH--FNSRLSFG------DTQTIWARTGYLGFLRrteltdaNGErhdaLYVVGALD 1499
Cdd:PRK12476   431 EIWLHGDNIGRGYWG-RPEETERTFGakLQSRLAEGshadgaADDGTWLRTGDLGVYL-------DGE----LYITGRIA 498
                          490       500
                   ....*....|....*....|....*.
gi 1958706156 1500 EAMELRGMRYHPIDIETSVIRAHKSV 1525
Cdd:PRK12476   499 DLIVIDGRNHYPQDIEATVAEASPMV 524
PRK07769 PRK07769
long-chain-fatty-acid--CoA ligase; Validated
1021-1525 1.18e-24

long-chain-fatty-acid--CoA ligase; Validated


Pssm-ID: 181109 [Multi-domain]  Cd Length: 631  Bit Score: 111.36  E-value: 1.18e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1021 LQWRAQTTPDHLLYTLLNC---RGTIANSLTCVQLHKRAEKIAVMLMERGhlQDGDHVALVYPPGIDLIAAFYGCLYAGC 1097
Cdd:PRK07769    27 VERWAKVRGDKLAYRFLDFsteRDGVARDLTWSQFGARNRAVGARLQQVT--KPGDRVAILAPQNLDYLIAFFGALYAGR 104
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1098 VPITVRPPHPQNIATTLPTVkmIVEVSRSACLMTT---QLICKLLRSREAAAAvdvrtwPLILDTDDLPKKRPAQIYKPS 1174
Cdd:PRK07769   105 IAVPLFDPAEPGHVGRLHAV--LDDCTPSAILTTTdsaEGVRKFFRARPAKER------PRVIAVDAVPDEVGATWVPPE 176
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1175 -NPDTLAYLDFSVSTTGMLAGVKMSH--AATSAF--CRSIKLQcelYPSREVAiCLDPYCGLGFVLWCLCSVYSGHQSIL 1249
Cdd:PRK07769   177 aNEDTIAYLQYTSGSTRIPAGVQITHlnLPTNVLqvIDALEGQ---EGDRGVS-WLPFFHDMGLITVLLPALLGHYITFM 252
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1250 IPPSELEtNPALWLLAVSQyKVRDTFCSYSV-----MELCT-KGLGSQTESlkarGLDLSRVRtCVVVAEERPRIALTQS 1323
Cdd:PRK07769   253 SPAAFVR-RPGRWIRELAR-KPGGTGGTFSAapnfaFEHAAaRGLPKDGEP----PLDLSNVK-GLLNGSEPVSPASMRK 325
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1324 FSKLFKDLGLHPRAVSTSFGcrvnlaiclqphrlwtLAEQ----GTSGPD--PTTVYVDMRALRHDRVRLVERGSPHSLP 1397
Cdd:PRK07769   326 FNEAFAPYGLPPTAIKPSYG----------------MAEAtlfvSTTPMDeePTVIYVDRDELNAGRFVEVPADAPNAVA 389
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1398 LMESGKILPGVRIIIANPETKGPLGDSHLGEIWVHSAHNASGYftiYG--DESLQSDH--FNSRLSFGDTQ-----TIWA 1468
Cdd:PRK07769   390 QVSAGKVGVSEWAVIVDPETASELPDGQIGEIWLHGNNIGTGY---WGkpEETAATFQniLKSRLSESHAEgapddALWV 466
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1958706156 1469 RTGYLGflrrtelTDANGErhdaLYVVGALDEAMELRGMRYHPIDIETSVIRAHKSV 1525
Cdd:PRK07769   467 RTGDYG-------VYFDGE----LYITGRVKDLVIIDGRNHYPQDLEYTAQEATKAL 512
PRK09192 PRK09192
fatty acyl-AMP ligase;
1003-1550 1.43e-24

fatty acyl-AMP ligase;


Pssm-ID: 236403 [Multi-domain]  Cd Length: 579  Bit Score: 110.48  E-value: 1.43e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1003 DNDQARKvRTEFLFLSEVLQWRAQTTPDHLLYTLlncRGTIANSLTCVQLHKRAEKIAVMLMERGhLQDGDHVALVYPPG 1082
Cdd:PRK09192    10 TSSLPRR-YADFPTLVEALDYAALGEAGMNFYDR---RGQLEEALPYQTLRARAEAGARRLLALG-LKPGDRVALIAETD 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1083 IDLIAAFYGCLYAGCVPITVrpPHPQNI---ATTLPTVKMIVEVSRSACLMTTQLICKLLrsREAAAAVDVRtWPLILDT 1159
Cdd:PRK09192    85 GDFVEAFFACQYAGLVPVPL--PLPMGFggrESYIAQLRGMLASAQPAAIITPDELLPWV--NEATHGNPLL-HVLSHAW 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1160 DDLPKKRPAQIYKPSnPDTLAYLDFSVSTTGMLAGVKMSHAATSAFCRSIKLQ-CELYPSREVAICLDPYCGLGFVlWCL 1238
Cdd:PRK09192   160 FKALPEADVALPRPT-PDDIAYLQYSSGSTRFPRGVIITHRALMANLRAISHDgLKVRPGDRCVSWLPFYHDMGLV-GFL 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1239 CSVYSGHQSI-LIPPSELETNPALWLLAVSqyKVRDTFcSYSV---MELCTKGLGSQTESlkarGLDLSRVRTCVVVAEE 1314
Cdd:PRK09192   238 LTPVATQLSVdYLPTRDFARRPLQWLDLIS--RNRGTI-SYSPpfgYELCARRVNSKDLA----ELDLSCWRVAGIGADM 310
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1315 -RPRIalTQSFSKLFKDLGLHPRAVSTSFG-CRVNLAICLQPHrlwtlaEQGtsgpdpttvyvdMRALRHDRVRLVERGs 1392
Cdd:PRK09192   311 iRPDV--LHQFAEAFAPAGFDDKAFMPSYGlAEATLAVSFSPL------GSG------------IVVEEVDRDRLEYQG- 369
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1393 pHSLPLMES----------GKILPGVRIIIANpETKGPLGDSHLGEIWVHSAHNASGYFtiyGDESlqsdhfnsrlsfgD 1462
Cdd:PRK09192   370 -KAVAPGAEtrrvrtfvncGKALPGHEIEIRN-EAGMPLPERVVGHICVRGPSLMSGYF---RDEE-------------S 431
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1463 TQTI----WARTGYLGFLrrteltdANGErhdaLYVVGALDEAMELRGMRYHPIDIETSV-----IRAHksvtECAVFTW 1533
Cdd:PRK09192   432 QDVLaadgWLDTGDLGYL-------LDGY----LYITGRAKDLIIINGRNIWPQDIEWIAeqepeLRSG----DAAAFSI 496
                          570       580
                   ....*....|....*....|.
gi 1958706156 1534 TN----LLVVVVELDGSEQEA 1550
Cdd:PRK09192   497 AQengeKIVLLVQCRISDEER 517
A_NRPS_TubE_like cd05906
The adenylation domain (A domain) of a family of nonribosomal peptide synthetases (NRPSs) ...
1017-1540 2.01e-24

The adenylation domain (A domain) of a family of nonribosomal peptide synthetases (NRPSs) synthesizing toxins and antitumor agents; The adenylation (A) domain of NRPS recognizes a specific amino acid or hydroxy acid and activates it as an (amino)-acyl adenylate by hydrolysis of ATP. The activated acyl moiety then forms a thioester to the enzyme-bound cofactor phosphopantetheine of a peptidyl carrier protein domain. This family includes NRPSs that synthesize toxins and antitumor agents; for example, TubE for Tubulysine, CrpA for cryptophycin, TdiA for terrequinone A, KtzG for kutzneride, and Vlm1/Vlm2 for Valinomycin. Nonribosomal peptide synthetases are large multifunctional enzymes which synthesize many therapeutically useful peptides. NRPS has a distinct modular structure in which each module is responsible for the recognition, activation, and, in some cases, modification of a single amino acid residue of the final peptide product. The modules can be subdivided into domains that catalyze specific biochemical reactions.


Pssm-ID: 341232 [Multi-domain]  Cd Length: 540  Bit Score: 109.68  E-value: 2.01e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1017 LSEVLQWRAQTTPDHLLYTllncrgtIANSLTCV-----QLHKRAEKIAVMLMERGhLQDGDHVALVYPPGIDLIAAFYG 1091
Cdd:cd05906     12 LLELLLRAAERGPTKGITY-------IDADGSEEfqsyqDLLEDARRLAAGLRQLG-LRPGDSVILQFDDNEDFIPAFWA 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1092 CLYAGCVPITVRPPHP-QNIATTLPTVKMIVEVSRSA-CLMTTQLICKLLRSREAA--AAVDVRTWPLILDTDDLPKKRP 1167
Cdd:cd05906     84 CVLAGFVPAPLTVPPTyDEPNARLRKLRHIWQLLGSPvVLTDAELVAEFAGLETLSglPGIRVLSIEELLDTAADHDLPQ 163
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1168 AQiykpsnPDTLAYLDFSVSTTGMLAGVKMSHAATSAFCRSiKLQCELYPSREVA---ICLDPYCGLGFVlwCLCSVYSG 1244
Cdd:cd05906    164 SR------PDDLALLMLTSGSTGFPKAVPLTHRNILARSAG-KIQHNGLTPQDVFlnwVPLDHVGGLVEL--HLRAVYLG 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1245 HQSILIPPSELETNPALWLLAVSQYKVRDTFCSYSvmeLCTKgLGSQTESLKARGLDLSRVRtCVVVAEERPRIALTQSF 1324
Cdd:cd05906    235 CQQVHVPTEEILADPLRWLDLIDRYRVTITWAPNF---AFAL-LNDLLEEIEDGTWDLSSLR-YLVNAGEAVVAKTIRRL 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1325 SKLFKDLGLHPRAVSTSFGcrvnlaiclqphrlwtLAEQGtSGpdpTTVYVDMRALRHdrvrlvergsPHSLPLMESGKI 1404
Cdd:cd05906    310 LRLLEPYGLPPDAIRPAFG----------------MTETC-SG---VIYSRSFPTYDH----------SQALEFVSLGRP 359
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1405 LPGVRIIIANPETKGpLGDSHLGEIWVHSAHNASGYftiYGDESLQSDHFnsrlsfgdTQTIWARTGYLGFLRrteltda 1484
Cdd:cd05906    360 IPGVSMRIVDDEGQL-LPEGEVGRLQVRGPVVTKGY---YNNPEANAEAF--------TEDGWFRTGDLGFLD------- 420
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1958706156 1485 NGErhdaLYVVGALDEAMELRGMRYHPIDIETSV----IRAHKSVTECAVF---TWTNLLVVV 1540
Cdd:cd05906    421 NGN----LTITGRTKDTIIVNGVNYYSHEIEAAVeevpGVEPSFTAAFAVRdpgAETEELAIF 479
A_NRPS_MycA_like cd05908
The adenylation domain of nonribosomal peptide synthetases (NRPS) similar to mycosubtilin ...
384-946 2.13e-24

The adenylation domain of nonribosomal peptide synthetases (NRPS) similar to mycosubtilin synthase subunit A (MycA); The adenylation (A) domain of NRPS recognizes a specific amino acid or hydroxy acid and activates it as (amino)-acyl adenylate by hydrolysis of ATP. The activated acyl moiety then forms thioester to the enzyme-bound cofactor phosphopantetheine of a peptidyl carrier protein domain. This family includes NRPS similar to mycosubtilin synthase subunit A (MycA). Mycosubtilin, which is characterized by a beta-amino fatty acid moiety linked to the circular heptapeptide Asn-Tyr-Asn-Gln-Pro-Ser-Asn, belongs to the iturin family of lipopeptide antibiotics. The mycosubtilin synthase subunit A (MycA) combines functional domains derived from peptide synthetases, amino transferases, and fatty acid synthases. Nonribosomal peptide synthetases are large multifunction enzymes that synthesize many therapeutically useful peptides. NRPS has a distinct modular structure in which each module is responsible for the recognition, activation, and, in some cases, modification of a single amino acid residue of the final peptide product. The modules can be subdivided into domains that catalyze specific biochemical reactions.


Pssm-ID: 341234 [Multi-domain]  Cd Length: 499  Bit Score: 109.11  E-value: 2.13e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  384 NGKPLYIlTYGKLWTrsmkvayNILHKLGTKQEPMVRPGDRValVFPNNDPAAFMVAFYGCLLAEVVPVPievpltrkda 463
Cdd:cd05908     10 DKKEKFV-SYRHLRE-------EALGYLGALQELGIKPGQEV--VFQITHNNKFLYLFWACLLGGMIAVP---------- 69
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  464 gsqqigfllgscgvtVALTSDACHKglpksptgeIPQFKGWPKLL--WFVTESKHLSKPPrdwfphikdanNDTAYIEYK 541
Cdd:cd05908     70 ---------------VSIGSNEEHK---------LKLNKVWNTLKnpYLITEEEVLCELA-----------DELAFIQFS 114
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  542 TCKDGSVLGVTVTRIALLTHCQALTQACGYTEAETIVNVLDFKKDVGLWHGILTSVMNMMHVISIPYALMKVNPLSWIQK 621
Cdd:cd05908    115 SGSTGDPKGVMLTHENLVHNMFAILNSTEWKTKDRILSWMPLTHDMGLIAFHLAPLIAGMNQYLMPTRLFIRRPILWLKK 194
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  622 VCQYKAKVACVKSRDMHWALVAHRDQR--DVNLSSLRMLIvaDGANPWSISSCDAFLNVFQSKGLRQEVICPCASSPEAl 699
Cdd:cd05908    195 ASEHKATIVSSPNFGYKYFLKTLKPEKanDWDLSSIRMIL--NGAEPIDYELCHEFLDHMSKYGLKRNAILPVYGLAEA- 271
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  700 TVAIRRPTDDSNQPPgrgvLSMH--GLTYG--VIRVDSEEKlSVLTVQDVGLVMPgaiMCSVK-PDGVPQLCRTDEIGEL 774
Cdd:cd05908    272 SVGASLPKAQSPFKT----ITLGrrHVTHGepEPEVDKKDS-ECLTFVEVGKPID---ETDIRiCDEDNKILPDGYIGHI 343
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  775 CVCAVATGTSYYGLSGMTKNTfevfpMTSSGapiseypFIRTGLLGFVGPGGLVfVVGKMDGLMVVSGRRHNADDIVATA 854
Cdd:cd05908    344 QIRGKNVTPGYYNNPEATAKV-----FTDDG-------WLKTGDLGFIRNGRLV-ITGREKDIIFVNGQNVYPHDIERIA 410
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  855 LAVEPmkfVYRGRIAVFSV--TVLHDERIVIVAEQRpdsTEEDSFQWMSRVLQAID------SIHQVgvyclalVPANTL 926
Cdd:cd05908    411 EELEG---VELGRVVACGVnnSNTRNEEIFCFIEHR---KSEDDFYPLGKKIKKHLnkrggwQINEV-------LPIRRI 477
                          570       580
                   ....*....|....*....|
gi 1958706156  927 PKTPLGGIHLSETKQLFLEG 946
Cdd:cd05908    478 PKTTSGKVKRYELAQRYQSG 497
AMP-binding pfam00501
AMP-binding enzyme;
365-841 1.40e-23

AMP-binding enzyme;


Pssm-ID: 459834 [Multi-domain]  Cd Length: 417  Bit Score: 105.47  E-value: 1.40e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  365 LQRWGTISPKAPCLTTMDTngkplYILTYGKLWTRSMKVAyNILHKLGtkqepmVRPGDRVALVFPNNdpAAFMVAFYGC 444
Cdd:pfam00501    1 LERQAARTPDKTALEVGEG-----RRLTYRELDERANRLA-AGLRALG------VGKGDRVAILLPNS--PEWVVAFLAC 66
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  445 LLAEVVPVPIEVpltrkDAGSQQIGFLLGSCGVTVALTSD--------ACHKGLPKSPTGEIPQFKGWPKLLWFVTESKH 516
Cdd:pfam00501   67 LKAGAVYVPLNP-----RLPAEELAYILEDSGAKVLITDDalkleellEALGKLEVVKLVLVLDRDPVLKEEPLPEEAKP 141
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  517 LSKPPRdwfPHIKDANNDTAYIEY--------KtckdgsvlGVTVTRIALLTHCQALTQAC----GYTEAETIVNVLDFK 584
Cdd:pfam00501  142 ADVPPP---PPPPPDPDDLAYIIYtsgttgkpK--------GVMLTHRNLVANVLSIKRVRprgfGLGPDDRVLSTLPLF 210
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  585 KDVGLWHGILTSVMNMMHVISIPYALMKvNPLSWIQKVCQYKAKVACVKSRDMHwALVAHRDQRDVNLSSLRMLIVadGA 664
Cdd:pfam00501  211 HDFGLSLGLLGPLLAGATVVLPPGFPAL-DPAALLELIERYKVTVLYGVPTLLN-MLLEAGAPKRALLSSLRLVLS--GG 286
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  665 NPWSISSCDAFLNVFqskglrqevicpcasspealtvairrptddsnqppGRGVLSMHGLT--YGVIRVDSEEKLSVLTV 742
Cdd:pfam00501  287 APLPPELARRFRELF-----------------------------------GGALVNGYGLTetTGVVTTPLPLDEDLRSL 331
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  743 QDVGLVMPGAIMCSVKPDGVpQLCRTDEIGELCV--CAVATGtsYYGLSGMTKNTFEvfpmtssgapisEYPFIRTGLLG 820
Cdd:pfam00501  332 GSVGRPLPGTEVKIVDDETG-EPVPPGEPGELCVrgPGVMKG--YLNDPELTAEAFD------------EDGWYRTGDLG 396
                          490       500
                   ....*....|....*....|.
gi 1958706156  821 FVGPGGLVFVVGKMDGLMVVS 841
Cdd:pfam00501  397 RRDEDGYLEIVGRKKDQIKLG 417
PRK05850 PRK05850
acyl-CoA synthetase; Validated
360-934 3.08e-23

acyl-CoA synthetase; Validated


Pssm-ID: 235624 [Multi-domain]  Cd Length: 578  Bit Score: 106.18  E-value: 3.08e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  360 SLEAALQRWGTISPKAPCLTTMDTNGKPLYI---LTYGKLWTRSMKVAYNiLHKLGtkqepmvRPGDRVALVFPNNdpAA 436
Cdd:PRK05850     2 SVPSLLRERASLQPDDAAFTFIDYEQDPAGVaetLTWSQLYRRTLNVAEE-LRRHG-------STGDRAVILAPQG--LE 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  437 FMVAFYGCLLAEVVPVPIEVPLTRkdAGSQQIGFLLGSCGVTVALTS----DACHKGLPKSPTGEIPqfkgwpkllWFVT 512
Cdd:PRK05850    72 YIVAFLGALQAGLIAVPLSVPQGG--AHDERVSAVLRDTSPSVVLTTsavvDDVTEYVAPQPGQSAP---------PVIE 140
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  513 ------ESKHLSKPPRDWFPhikdannDTAYIEYKTCKDGSVLGVTVTRIALLTHC-QALTQACGYTEAE-----TIVNV 580
Cdd:PRK05850   141 vdlldlDSPRGSDARPRDLP-------STAYLQYTSGSTRTPAGVMVSHRNVIANFeQLMSDYFGDTGGVpppdtTVVSW 213
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  581 LDFKKDVGLWHGILTSVMNMMH-VISIPYALMKvNPLSWIQKVCQYKAKVACVKSrdmhWAL-VAHRDQRDVNLSSL--- 655
Cdd:PRK05850   214 LPFYHDMGLVLGVCAPILGGCPaVLTSPVAFLQ-RPARWMQLLASNPHAFSAAPN----FAFeLAVRKTSDDDMAGLdlg 288
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  656 RMLIVADGANPWSISSCDAFLNVFQSKGLRQEVICPCASSPEA-LTVAIRRPtddsNQPPGrgvlsmhgltygVIRVDSE 734
Cdd:PRK05850   289 GVLGIISGSERVHPATLKRFADRFAPFNLRETAIRPSYGLAEAtVYVATREP----GQPPE------------SVRFDYE 352
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  735 eKLSVLTVQ----DVG--LVMPGAIMCS----VKPDGVPQlCRTDEIGELCVCA--VATGtsYYGLSGMTKNTFE---VF 799
Cdd:PRK05850   353 -KLSAGHAKrcetGGGtpLVSYGSPRSPtvriVDPDTCIE-CPAGTVGEIWVHGdnVAAG--YWQKPEETERTFGatlVD 428
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  800 PmtSSGAPISeyPFIRTGLLGFVGPGGLvFVVGKMDGLMVVSGRRHNADDIVATalavepMKFVYRGRIAVFSVTVLHDE 879
Cdd:PRK05850   429 P--SPGTPEG--PWLRTGDLGFISEGEL-FIVGRIKDLLIVDGRNHYPDDIEAT------IQEITGGRVAAISVPDDGTE 497
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  880 RIVIVAE-QRPDSTEEDSFQWM----SRVLQAIDSIHQVGVYCLALVPANTLPKTPLGGI 934
Cdd:PRK05850   498 KLVAIIElKKRGDSDEEAMDRLrtvkREVTSAISKSHGLSVADLVLVAPGSIPITTSGKI 557
AA-adenyl-dom TIGR01733
amino acid adenylation domain; This model represents a domain responsible for the specific ...
1051-1530 3.77e-23

amino acid adenylation domain; This model represents a domain responsible for the specific recognition of amino acids and activation as adenylyl amino acids. The reaction catalyzed is aa + ATP -> aa-AMP + PPi. These domains are usually found as components of multi-domain non-ribosomal peptide synthetases and are usually called "A-domains" in that context. A-domains are almost invariably followed by "T-domains" (thiolation domains, pfam00550) to which the amino acid adenylate is transferred as a thiol-ester to a bound pantetheine cofactor with the release of AMP (these are also called peptide carrier proteins, or PCPs. When the A-domain does not represent the first module (corresponding to the first amino acid in the product molecule) it is usually preceded by a "C-domain" (condensation domain, pfam00668) which catalyzes the ligation of two amino acid thiol-esters from neighboring modules. This domain is a subset of the AMP-binding domain found in Pfam (pfam00501) which also hits substrate--CoA ligases and luciferases. Sequences scoring in between trusted and noise for this model may be ambiguous as to whether they activate amino acids or other molecules lacking an alpha amino group.


Pssm-ID: 273779 [Multi-domain]  Cd Length: 409  Bit Score: 103.88  E-value: 3.77e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1051 QLHKRAEKIAVMLMERGHLQDGDHVALVYPPGIDLIAAFYGCLYAGC--VPITvrPPHPQN-IATtlptvkmIVEVSRSA 1127
Cdd:TIGR01733    4 ELDERANRLARHLRAAGGVGPGDRVAVLLERSAELVVAILAVLKAGAayVPLD--PAYPAErLAF-------ILEDAGAR 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1128 CLMTTQLICKLLRSREAAAAVDVRTWPLILDTDDLPKKRPAqiykPSNPDTLAYLDFSVSTTGMLAGVKMSHAATSAFCR 1207
Cdd:TIGR01733   75 LLLTDSALASRLAGLVLPVILLDPLELAALDDAPAPPPPDA----PSGPDDLAYVIYTSGSTGRPKGVVVTHRSLVNLLA 150
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1208 SIklqCELYPSREVAICLDpYCGLGF------VLWCLcsvYSGHQSILIPPSELETNPALWLLAVSQYKVRDTFCSYSVM 1281
Cdd:TIGR01733  151 WL---ARRYGLDPDDRVLQ-FASLSFdasveeIFGAL---LAGATLVVPPEDEERDDAALLAALIAEHPVTVLNLTPSLL 223
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1282 ELCtkglgsqtesLKARGLDLSRVRTcVVVAEERPRIALTQSFSKLFKDLglhpravstsfgcrvnlaiclqphRLWTla 1361
Cdd:TIGR01733  224 ALL----------AAALPPALASLRL-VILGGEALTPALVDRWRARGPGA------------------------RLIN-- 266
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1362 eqgTSGPDPTTVYVDMRALRHDRVRlvergSPHSLPLmesGKILPGVRIIIANPETKgPLGDSHLGEIWVHSAHNASGYF 1441
Cdd:TIGR01733  267 ---LYGPTETTVWSTATLVDPDDAP-----RESPVPI---GRPLANTRLYVLDDDLR-PVPVGVVGELYIGGPGVARGYL 334
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1442 tiyGDESLQSDHFNSRLSFGDTQTIWARTGYLGflRRteltDANGErhdaLYVVGALDEAMELRGMRYHPIDIEtSVIRA 1521
Cdd:TIGR01733  335 ---NRPELTAERFVPDPFAGGDGARLYRTGDLV--RY----LPDGN----LEFLGRIDDQVKIRGYRIELGEIE-AALLR 400

                   ....*....
gi 1958706156 1522 HKSVTECAV 1530
Cdd:TIGR01733  401 HPGVREAVV 409
PRK12476 PRK12476
putative fatty-acid--CoA ligase; Provisional
382-950 8.92e-23

putative fatty-acid--CoA ligase; Provisional


Pssm-ID: 171527 [Multi-domain]  Cd Length: 612  Bit Score: 105.21  E-value: 8.92e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  382 DTNGKPLYiLTYGKLWTRsmkvayniLHKLGTKQEPMVRPGDRVALVFPNNdpAAFMVAFYGCLLAEVVPVPI---EVPl 458
Cdd:PRK12476    61 SAAGCAVE-LTWTQLGVR--------LRAVGARLQQVAGPGDRVAILAPQG--IDYVAGFFAAIKAGTIAVPLfapELP- 128
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  459 trkdAGSQQIGFLLGSCGVTVALTSDAChkglpkspTGEIPQFkgwpkllwfvteskhLSKPPRDWFPH------IKDA- 531
Cdd:PRK12476   129 ----GHAERLDTALRDAEPTVVLTTTAA--------AEAVEGF---------------LRNLPRLRRPRviaidaIPDSa 181
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  532 ----------NNDTAYIEYKTCKDGSVLGVTVTRIALLTHCQALTQACGYTEAETI-VNVLDFKKDVGLWHGILTSVMNM 600
Cdd:PRK12476   182 gesfvpveldTDDVSHLQYTSGSTRPPVGVEITHRAVGTNLVQMILSIDLLDRNTHgVSWLPLYHDMGLSMIGFPAVYGG 261
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  601 MHVISIPYALMKvNPLSWIQKV---CQYKAKVACVKSRDMHWAlvAHR----DQRDVNLSSLRMLIvadGANPWSISSCD 673
Cdd:PRK12476   262 HSTLMSPTAFVR-RPQRWIKALsegSRTGRVVTAAPNFAYEWA--AQRglpaEGDDIDLSNVVLII---GSEPVSIDAVT 335
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  674 AFLNVFQSKGLRQEVICPCASSPEA-LTVAirrpTDDSNQPPGRGVLSMHGLTYG-VIRVDSEEKLSVLTVQdVGLVMPG 751
Cdd:PRK12476   336 TFNKAFAPYGLPRTAFKPSYGIAEAtLFVA----TIAPDAEPSVVYLDREQLGAGrAVRVAADAPNAVAHVS-CGQVARS 410
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  752 AIMCSVKPDGVPQLcRTDEIGELCVCAVATGTSYYGLSGMTKNTFEV-----FPMTS--SGAPISEyPFIRTGLLGFVGP 824
Cdd:PRK12476   411 QWAVIVDPDTGAEL-PDGEVGEIWLHGDNIGRGYWGRPEETERTFGAklqsrLAEGShaDGAADDG-TWLRTGDLGVYLD 488
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  825 GGLvFVVGKMDGLMVVSGRRHNADDIVATALAVEPMkfVYRGRIAVFSVTVLHDERIVIVAEQRPDSTEEDSFQWMSRVL 904
Cdd:PRK12476   489 GEL-YITGRIADLIVIDGRNHYPQDIEATVAEASPM--VRRGYVTAFTVPAEDNERLVIVAERAAGTSRADPAPAIDAIR 565
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|....*.
gi 1958706156  905 QAIDSIHQVGVYCLALVPANTLPKTPLGGIHLSETKQLFLEGSLHP 950
Cdd:PRK12476   566 AAVSRRHGLAVADVRLVPAGAIPRTTSGKLARRACRAQYLDGRLGV 611
Firefly_Luc_like cd05911
Firefly luciferase of light emitting insects and 4-Coumarate-CoA Ligase (4CL); This family ...
1041-1531 2.44e-22

Firefly luciferase of light emitting insects and 4-Coumarate-CoA Ligase (4CL); This family contains insect firefly luciferases that share significant sequence similarity to plant 4-coumarate:coenzyme A ligases, despite their functional diversity. Luciferase catalyzes the production of light in the presence of MgATP, molecular oxygen, and luciferin. In the first step, luciferin is activated by acylation of its carboxylate group with ATP, resulting in an enzyme-bound luciferyl adenylate. In the second step, luciferyl adenylate reacts with molecular oxygen, producing an enzyme-bound excited state product (Luc=O*) and releasing AMP. This excited-state product then decays to the ground state (Luc=O), emitting a quantum of visible light.


Pssm-ID: 341237 [Multi-domain]  Cd Length: 486  Bit Score: 102.68  E-value: 2.44e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1041 GTIANSLTCVQLHKRAEKIAVMLMERGhLQDGDHVALVYPPGIDLIAAFYGCLYAGCvpitvrPPHPQNIATTLPTVKMI 1120
Cdd:cd05911      5 ADTGKELTYAQLRTLSRRLAAGLRKLG-LKKGDVVGIISPNSTYYPPVFLGCLFAGG------IFSAANPIYTADELAHQ 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1121 VEVSRSACLMTTQliCKLLRSREAAAAVDVRTWPLILDTDDLPKKRPAQIYKPSN--------------PDTLAYLDFSV 1186
Cdd:cd05911     78 LKISKPKVIFTDP--DGLEKVKEAAKELGPKDKIIVLDDKPDGVLSIEDLLSPTLgeededlppplkdgKDDTAAILYSS 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1187 STTGMLAGVKMSHaatSAFCRSIKLQC----ELYPSREVAICLDPY---CGLgfvLWCLCSVYSGHQSILIPpselETNP 1259
Cdd:cd05911    156 GTTGLPKGVCLSH---RNLIANLSQVQtflyGNDGSNDVILGFLPLyhiYGL---FTTLASLLNGATVIIMP----KFDS 225
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1260 ALWLLAVSQYKVRDTFCSYSVMELctkgLGSQTESLKArglDLSRVRTCVVVAEerpriALTQSFSKLFKdlglhpravs 1339
Cdd:cd05911    226 ELFLDLIEKYKITFLYLVPPIAAA----LAKSPLLDKY---DLSSLRVILSGGA-----PLSKELQELLA---------- 283
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1340 tsfgCRVNLAICLQphrLWTLAEqgTSGPDPTTVYVDmralrhdrvrlVERGSphslplmeSGKILPGVRIIIANPETKG 1419
Cdd:cd05911    284 ----KRFPNATIKQ---GYGMTE--TGGILTVNPDGD-----------DKPGS--------VGRLLPNVEAKIVDDDGKD 335
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1420 PLGDSHLGEIWVHSAHNASGYftiYGDESLQSDHFnsrlsfgdTQTIWARTGYLGFLrrteltDANGErhdaLYVVGALD 1499
Cdd:cd05911    336 SLGPNEPGEICVRGPQVMKGY---YNNPEATKETF--------DEDGWLHTGDIGYF------DEDGY----LYIVDRKK 394
                          490       500       510
                   ....*....|....*....|....*....|..
gi 1958706156 1500 EAMELRGMRYHPIDIEtSVIRAHKSVTECAVF 1531
Cdd:cd05911    395 ELIKYKGFQVAPAELE-AVLLEHPGVADAAVI 425
DMAP_binding pfam06464
DMAP1-binding Domain; This domain binds DMAP1, a transcriptional co-repressor.
14-159 1.17e-21

DMAP1-binding Domain; This domain binds DMAP1, a transcriptional co-repressor.


Pssm-ID: 368923 [Multi-domain]  Cd Length: 104  Bit Score: 91.33  E-value: 1.17e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156   14 GDITQKGYEKKRSKLIGAYLPQPptangaavvrcrlqpgegvprrpfrsahigvcdireaaareraasaagnrplfyfrf 93
Cdd:pfam06464   23 GDITEKGYEKKKLKLLRKFLLHP--------------------------------------------------------- 45
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1958706156   94 gvdqalpqerRAPVTPSSASRYHR-RRSSGSRDERYRSDVHTEAVQAALAKHKERKMAVPMPSKRRS 159
Cdd:pfam06464   46 ----------ETPTKLSAEAQNQLaSLETKLRDEELSEEVYLEKVKALLAKELERENGLNAPTKEQS 102
BCL_4HBCL cd05959
Benzoate CoA ligase (BCL) and 4-Hydroxybenzoate-Coenzyme A Ligase (4-HBA-CoA ligase); Benzoate ...
1044-1533 7.77e-19

Benzoate CoA ligase (BCL) and 4-Hydroxybenzoate-Coenzyme A Ligase (4-HBA-CoA ligase); Benzoate CoA ligase and 4-hydroxybenzoate-coenzyme A ligase catalyze the first activating step for benzoate and 4-hydroxybenzoate catabolic pathways, respectively. Although these two enzymes share very high sequence homology, they have their own substrate preference. The reaction proceeds via a two-step process; the first ATP-dependent step forms the substrate-AMP intermediate, while the second step forms the acyl-CoA ester, releasing the AMP. Aromatic compounds represent the second most abundant class of organic carbon compounds after carbohydrates. Some bacteria can use benzoic acid or benzenoid compounds as the sole source of carbon and energy through degradation. Benzoate CoA ligase and 4-hydroxybenzoate-Coenzyme A ligase are key enzymes of this process.


Pssm-ID: 341269 [Multi-domain]  Cd Length: 508  Bit Score: 92.05  E-value: 7.77e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1044 ANSLTCVQLHKRAEKIAVMLMERGhLQDGDHVALVYPPGIDLIAAFYGCLYAGCVPItvrpphPQNIATTLPTVKMIVEV 1123
Cdd:cd05959     27 AGSLTYAELEAEARRVAGALRALG-VKREERVLLIMLDTVDFPTAFLGAIRAGIVPV------PVNTLLTPDDYAYYLED 99
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1124 SRSACLMTTQLICKLLRSREAAAAVDVRT----------WPLILDTDDLPKKRPAQIYKPSNPDTLAYLDFSVSTTGMLA 1193
Cdd:cd05959    100 SRARVVVVSGELAPVLAAALTKSEHTLVVlivsggagpeAGALLLAELVAAEAEQLKPAATHADDPAFWLYSSGSTGRPK 179
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1194 GVKMSHAatsafcrSIKLQCELYPSREVAICLDPYC----------GLGFVLWCLCSVysGHQSILIPpsELETnPALWL 1263
Cdd:cd05959    180 GVVHLHA-------DIYWTAELYARNVLGIREDDVCfsaaklffayGLGNSLTFPLSV--GATTVLMP--ERPT-PAAVF 247
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1264 LAVSQYKvRDTFcsYSVMELCTKGLGSqtESLKARglDLSRVRTCVVVAEerpriALTQSFSKLFKDLglhpravstsFG 1343
Cdd:cd05959    248 KRIRRYR-PTVF--FGVPTLYAAMLAA--PNLPSR--DLSSLRLCVSAGE-----ALPAEVGERWKAR----------FG 305
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1344 CRVnlaiclqphrlwtlaEQGTSGPDPTTVYVDMRAlrhDRVRLverGSphslplmeSGKILPGVRIIIANpETKGPLGD 1423
Cdd:cd05959    306 LDI---------------LDGIGSTEMLHIFLSNRP---GRVRY---GT--------TGKPVPGYEVELRD-EDGGDVAD 355
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1424 SHLGEIWVHSAHNASGYFTIYGDeslqsdhfnSRLSFgdtQTIWARTGYlGFLRrteltDANGerhdALYVVGALDEAME 1503
Cdd:cd05959    356 GEPGELYVRGPSSATMYWNNRDK---------TRDTF---QGEWTRTGD-KYVR-----DDDG----FYTYAGRADDMLK 413
                          490       500       510
                   ....*....|....*....|....*....|
gi 1958706156 1504 LRGMRYHPIDIEtSVIRAHKSVTECAVFTW 1533
Cdd:cd05959    414 VSGIWVSPFEVE-SALVQHPAVLEAAVVGV 442
A_NRPS cd05930
The adenylation domain of nonribosomal peptide synthetases (NRPS); The adenylation (A) domain ...
1046-1553 3.16e-17

The adenylation domain of nonribosomal peptide synthetases (NRPS); The adenylation (A) domain of NRPS recognizes a specific amino acid or hydroxy acid and activates it as an (amino) acyl adenylate by hydrolysis of ATP. The activated acyl moiety then forms a thioester bond to the enzyme-bound cofactor phosphopantetheine of a peptidyl carrier protein domain. NRPSs are large multifunctional enzymes which synthesize many therapeutically useful peptides in bacteria and fungi via a template-directed, nucleic acid independent nonribosomal mechanism. These natural products include antibiotics, immunosuppressants, plant and animal toxins, and enzyme inhibitors. NRPS has a distinct modular structure in which each module is responsible for the recognition, activation, and in some cases, modification of a single amino acid residue of the final peptide product. The modules can be subdivided into domains that catalyze specific biochemical reactions.


Pssm-ID: 341253 [Multi-domain]  Cd Length: 444  Bit Score: 86.43  E-value: 3.16e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1046 SLTCVQLHKRAEKIAVMLMERGhLQDGDHVALVYPPGIDLIAAFYGCLYAGCVPITVRPPHPQNiattlpTVKMIVEVSR 1125
Cdd:cd05930     12 SLTYAELDARANRLARYLRERG-VGPGDLVAVLLERSLEMVVAILAVLKAGAAYVPLDPSYPAE------RLAYILEDSG 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1126 SACLMTTqlickllrsreaaaavdvrtwplildtddlpkkrpaqiykpsnPDTLAYLDFSVSTTGMLAGVKMSHAATSAF 1205
Cdd:cd05930     85 AKLVLTD-------------------------------------------PDDLAYVIYTSGSTGKPKGVMVEHRGLVNL 121
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1206 CRSIKlqcELYP--SREVAICLDPYcglGF------VLWCLCsvySGHQsILIPPSELETNPALWLLAVSQYKVRDTFCS 1277
Cdd:cd05930    122 LLWMQ---EAYPltPGDRVLQFTSF---SFdvsvweIFGALL---AGAT-LVVLPEEVRKDPEALADLLAEEGITVLHLT 191
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1278 YSVMELCTKGLGSQteslkarglDLSRVRTcVVVAEERPRIALTQSFSKLFKDlglhpravstsfgcrvnlaiclqpHRL 1357
Cdd:cd05930    192 PSLLRLLLQELELA---------ALPSLRL-VLVGGEALPPDLVRRWRELLPG------------------------ARL 237
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1358 WTLaeqgtSGPDPTTVYVDMRALRHDRVRlvergsPHSLPLmesGKILPGVRIIIANPETKgPLGDSHLGEIWVHSAHNA 1437
Cdd:cd05930    238 VNL-----YGPTEATVDATYYRVPPDDEE------DGRVPI---GRPIPNTRVYVLDENLR-PVPPGVPGELYIGGAGLA 302
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1438 SGYftiYGDESLQSDHFNsRLSFGDTQTIWaRTGYLGflRRteltDANGErhdaLYVVGALDEAMELRGMRYHPIDIETs 1517
Cdd:cd05930    303 RGY---LNRPELTAERFV-PNPFGPGERMY-RTGDLV--RW----LPDGN----LEFLGRIDDQVKIRGYRIELGEIEA- 366
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|..
gi 1958706156 1518 VIRAHKSVTECAVFTWTN------LLVVVVELDGSEQEALDL 1553
Cdd:cd05930    367 ALLAHPGVREAAVVAREDgdgekrLVAYVVPDEGGELDEEEL 408
PRK05691 PRK05691
peptide synthase; Validated
358-978 2.10e-16

peptide synthase; Validated


Pssm-ID: 235564 [Multi-domain]  Cd Length: 4334  Bit Score: 85.99  E-value: 2.10e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  358 PPSLEAALQRWGTISPKAPCLTTMDTNGKPLYILTYGKLWTRSMKVAYNIlhklgtkqEPMVRPGDRVALVFPNNdpAAF 437
Cdd:PRK05691     8 PLTLVQALQRRAAQTPDRLALRFLADDPGEGVVLSYRDLDLRARTIAAAL--------QARASFGDRAVLLFPSG--PDY 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  438 MVAFYGCLLAEVVPVPIEVPLTRKDAGSQQIGFLLGSCGVTVALTSDACHKGLPKSPTGEIPqfkGWPKLLwfvTESKHL 517
Cdd:PRK05691    78 VAAFFGCLYAGVIAVPAYPPESARRHHQERLLSIIADAEPRLLLTVADLRDSLLQMEELAAA---NAPELL---CVDTLD 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  518 SKPPRDWF-PHIKDanNDTAYIEYKTCKDGSVLGVTVTRIALLTHCQALTQACG--YTEAETIVNVLDFKKDVGLWHGIL 594
Cdd:PRK05691   152 PALAEAWQePALQP--DDIAFLQYTSGSTALPKGVQVSHGNLVANEQLIRHGFGidLNPDDVIVSWLPLYHDMGLIGGLL 229
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  595 TSVMNmmhviSIPYALMKVN-----PLSWIQKVCQYKAKVAcvKSRDMHWALVAHRdQRDVNLSSL---RMLIVADGANP 666
Cdd:PRK05691   230 QPIFS-----GVPCVLMSPAyflerPLRWLEAISEYGGTIS--GGPDFAYRLCSER-VSESALERLdlsRWRVAYSGSEP 301
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  667 WSISSCDAFLNVFQSKGLRQEVICPCASSPEA-LTVAirrptddsNQPPGRGVlsmhgltyGVIRVDSE-------EKLS 738
Cdd:PRK05691   302 IRQDSLERFAEKFAACGFDPDSFFASYGLAEAtLFVS--------GGRRGQGI--------PALELDAEalarnraEPGT 365
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  739 VLTVQDVGLVMPGAIMCSVKPDGVPQLcRTDEIGELCVCAVATGTSYYGLSGMTKNTFevfpMTSSGApiseyPFIRTGL 818
Cdd:PRK05691   366 GSVLMSCGRSQPGHAVLIVDPQSLEVL-GDNRVGEIWASGPSIAHGYWRNPEASAKTF----VEHDGR-----TWLRTGD 435
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  819 LGFVGPGGLvFVVGKMDGLMVVSGRRHNADDIVATalAVEPMKFVYRGRIAVFSVTVLHDERIVIVAE-----QRPDSTE 893
Cdd:PRK05691   436 LGFLRDGEL-FVTGRLKDMLIVRGHNLYPQDIEKT--VEREVEVVRKGRVAAFAVNHQGEEGIGIAAEisrsvQKILPPQ 512
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  894 EdsfqWMSRVLQAIDSIHQVGVYCLALVPANTLPKTPLGGIHLSETKQLFLEGSLhpcnvlmcphTCVTNLPKPRQKQPE 973
Cdd:PRK05691   513 A----LIKSIRQAVAEACQEAPSVVLLLNPGALPKTSSGKLQRSACRLRLADGSL----------DSYALFPALQAVEAA 578

                   ....*
gi 1958706156  974 IGPAS 978
Cdd:PRK05691   579 QTAAS 583
PRK05851 PRK05851
long-chain-fatty acid--ACP ligase MbtM;
1051-1530 9.08e-14

long-chain-fatty acid--ACP ligase MbtM;


Pssm-ID: 180289 [Multi-domain]  Cd Length: 525  Bit Score: 75.96  E-value: 9.08e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1051 QLHKRAEKIAVMLMERGhlqDGDHVALVYPPGIDLIAAFYGCLYAG----CVPITVRPPHPQNIA-TTLptvkmivevSR 1125
Cdd:PRK05851    36 EVHGRAENVAARLLDRD---RPGAVGLVGEPTVELVAAIQGAWLAGaavsILPGPVRGADDGRWAdATL---------TR 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1126 SACLMTTQLIC-----KLLRSREAAAAV-DVRTWPlildtddlpKKRPAQIYKPSNPDTLAYLDFSVSTTGMLAGVKMSH 1199
Cdd:PRK05851   104 FAGIGVRTVLShgshlERLRAVDSSVTVhDLATAA---------HTNRSASLTPPDSGGPAVLQGTAGSTGTPRTAILSP 174
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1200 AATSAFCRSIKLQCELYPSREVAICLDPY---CGLGFVLwclCSVYSGHQSILIPPSELETNPALWLLAVSQykVRDTFC 1276
Cdd:PRK05851   175 GAVLSNLRGLNARVGLDAATDVGCSWLPLyhdMGLAFLL---TAALAGAPLWLAPTTAFSASPFRWLSWLSD--SRATLT 249
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1277 SYSVMELCTKGLGSQteslKARGLDLSRVRTCVVVAEerP-RIALTQSFSKLFKDLGLHPRAVSTSFGcrvnlaiclqph 1355
Cdd:PRK05851   250 AAPNFAYNLIGKYAR----RVSDVDLGALRVALNGGE--PvDCDGFERFATAMAPFGFDAGAAAPSYG------------ 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1356 rlwtLAEQ--GTSGPDPTTvyvdmrALRHDRVRLVERGSPHSLPLMesGKILPGVRIIIANPETKGPLGDSHLGEIWVHS 1433
Cdd:PRK05851   312 ----LAEStcAVTVPVPGI------GLRVDEVTTDDGSGARRHAVL--GNPIPGMEVRISPGDGAAGVAGREIGEIEIRG 379
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1434 AHNASGYFtiyGDESLQSDHfnsrlsfgdtqtiWARTGYLGFlrrteLTDangerhDALYVVGALDEAMELRGMRYHPID 1513
Cdd:PRK05851   380 ASMMSGYL---GQAPIDPDD-------------WFPTGDLGY-----LVD------GGLVVCGRAKELITVAGRNIFPTE 432
                          490
                   ....*....|....*..
gi 1958706156 1514 IETSVIRAhKSVTECAV 1530
Cdd:PRK05851   433 IERVAAQV-RGVREGAV 448
A_NRPS_MycA_like cd05908
The adenylation domain of nonribosomal peptide synthetases (NRPS) similar to mycosubtilin ...
1175-1515 1.14e-12

The adenylation domain of nonribosomal peptide synthetases (NRPS) similar to mycosubtilin synthase subunit A (MycA); The adenylation (A) domain of NRPS recognizes a specific amino acid or hydroxy acid and activates it as (amino)-acyl adenylate by hydrolysis of ATP. The activated acyl moiety then forms thioester to the enzyme-bound cofactor phosphopantetheine of a peptidyl carrier protein domain. This family includes NRPS similar to mycosubtilin synthase subunit A (MycA). Mycosubtilin, which is characterized by a beta-amino fatty acid moiety linked to the circular heptapeptide Asn-Tyr-Asn-Gln-Pro-Ser-Asn, belongs to the iturin family of lipopeptide antibiotics. The mycosubtilin synthase subunit A (MycA) combines functional domains derived from peptide synthetases, amino transferases, and fatty acid synthases. Nonribosomal peptide synthetases are large multifunction enzymes that synthesize many therapeutically useful peptides. NRPS has a distinct modular structure in which each module is responsible for the recognition, activation, and, in some cases, modification of a single amino acid residue of the final peptide product. The modules can be subdivided into domains that catalyze specific biochemical reactions.


Pssm-ID: 341234 [Multi-domain]  Cd Length: 499  Bit Score: 72.52  E-value: 1.14e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1175 NPDTLAYLDFSVSTTGMLAGVKMSHAATSAFCRSIKLQCELYPSREVAICLDPYCGLGFVLWCLCSVYSGHQSILIPPSE 1254
Cdd:cd05908    104 LADELAFIQFSSGSTGDPKGVMLTHENLVHNMFAILNSTEWKTKDRILSWMPLTHDMGLIAFHLAPLIAGMNQYLMPTRL 183
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1255 LETNPALWLLAVSQYKVRDTFCSysvmELCTKGLGSQTESLKARGLDLSRVRTCVVVAEErprIA--LTQSFSKLFKDLG 1332
Cdd:cd05908    184 FIRRPILWLKKASEHKATIVSSP----NFGYKYFLKTLKPEKANDWDLSSIRMILNGAEP---IDyeLCHEFLDHMSKYG 256
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1333 LHPRAVSTSFGcrvnlaiclqphrlwtLAEQ--GTSGPDP----TTVYVDMRALRH-DRVRLVERGSPHSLPLMESGKIL 1405
Cdd:cd05908    257 LKRNAILPVYG----------------LAEAsvGASLPKAqspfKTITLGRRHVTHgEPEPEVDKKDSECLTFVEVGKPI 320
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1406 PGVRIIIANPETKGpLGDSHLGEIWVHSAHNASGYftiYGDESLQSDHFnsrlsfgdTQTIWARTGYLGFLRrteltdaN 1485
Cdd:cd05908    321 DETDIRICDEDNKI-LPDGYIGHIQIRGKNVTPGY---YNNPEATAKVF--------TDDGWLKTGDLGFIR-------N 381
                          330       340       350
                   ....*....|....*....|....*....|
gi 1958706156 1486 GErhdaLYVVGALDEAMELRGMRYHPIDIE 1515
Cdd:cd05908    382 GR----LVITGREKDIIFVNGQNVYPHDIE 407
FACL_like_6 cd05922
Uncharacterized subfamily of fatty acid CoA ligase (FACL); Fatty acyl-CoA ligases catalyze the ...
420-932 2.79e-12

Uncharacterized subfamily of fatty acid CoA ligase (FACL); Fatty acyl-CoA ligases catalyze the ATP-dependent activation of fatty acids in a two-step reaction. The carboxylate substrate first reacts with ATP to form an acyl-adenylate intermediate, which then reacts with CoA to produce an acyl-CoA ester. This is a required step before free fatty acids can participate in most catabolic and anabolic reactions.


Pssm-ID: 341246 [Multi-domain]  Cd Length: 457  Bit Score: 70.93  E-value: 2.79e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  420 RPGDRVALVFPNNDPA---AFMVAFYGCLLAEVVpvpieVPLTrKDAGSQQIGFLLGSCGVTVALTS----DACHKGLPK 492
Cdd:cd05922     16 VRGERVVLILPNRFTYielSFAVAYAGGRLGLVF-----VPLN-PTLKESVLRYLVADAGGRIVLADagaaDRLRDALPA 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  493 SPTGEipqfkgwpklLWFVTESKHLSKPPRDWFPHIKDannDTAYIEYKTCKDGSVLGVTVTRIALLTHCQALTQACGYT 572
Cdd:cd05922     90 SPDPG----------TVLDADGIRAARASAPAHEVSHE---DLALLLYTSGSTGSPKLVRLSHQNLLANARSIAEYLGIT 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  573 EAETIVNVLDFKKDVGLwhgiltSVMNMmH-------VISIPYALmkvnPLSWIQKVCQYKAK-VACVKSrdmHWALVAH 644
Cdd:cd05922    157 ADDRALTVLPLSYDYGL------SVLNT-HllrgatlVLTNDGVL----DDAFWEDLREHGATgLAGVPS---TYAMLTR 222
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  645 RDQRDVNLSSLRMLIVADGANPwsisscDAFLNVFQSKGlrqevicpcasspealtvairrptddsnqpPGRGVLSMHGL 724
Cdd:cd05922    223 LGFDPAKLPSLRYLTQAGGRLP------QETIARLRELL------------------------------PGAQVYVMYGQ 266
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  725 TYGVIR---VDSEEKLSVLTvqDVGLVMPGAIMCSVKPDGvpQLCRTDEIGELcvcaVATGTSYYgLSGMTKNTFEVFPM 801
Cdd:cd05922    267 TEATRRmtyLPPERILEKPG--SIGLAIPGGEFEILDDDG--TPTPPGEPGEI----VHRGPNVM-KGYWNDPPYRRKEG 337
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  802 TSSGApiseypfIRTGLLGFVGPGGLVFVVGKMDGLMVVSGRRHNADDIVATALAVEPMkfvyrGRIAVFSVTVLHDERI 881
Cdd:cd05922    338 RGGGV-------LHTGDLARRDEDGFLFIVGRRDRMIKLFGNRISPTEIEAAARSIGLI-----IEAAAVGLPDPLGEKL 405
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1958706156  882 VIVAEqRPDSTEEDSfqwMSRVLQAIDSIHQVGVYClalVPANTLPKTPLG 932
Cdd:cd05922    406 ALFVT-APDKIDPKD---VLRSLAERLPPYKVPATV---RVVDELPLTASG 449
FC-FACS_FadD_like cd05936
Prokaryotic long-chain fatty acid CoA synthetases similar to Escherichia coli FadD; This ...
1017-1251 5.95e-12

Prokaryotic long-chain fatty acid CoA synthetases similar to Escherichia coli FadD; This subfamily of the AMP-forming adenylation family contains Escherichia coli FadD and similar prokaryotic fatty acid CoA synthetases. FadD was characterized as a long-chain fatty acid CoA synthetase. The gene fadD is regulated by the fatty acid regulatory protein FadR. Fatty acid CoA synthetase catalyzes the formation of fatty acyl-CoA in a two-step reaction: the formation of a fatty acyl-AMP molecule as an intermediate, followed by the formation of a fatty acyl-CoA. This is a required step before free fatty acids can participate in most catabolic and anabolic reactions.


Pssm-ID: 341259 [Multi-domain]  Cd Length: 468  Bit Score: 69.90  E-value: 5.95e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1017 LSEVLQWRAQTTPDHllyTLLNCRGTianSLTCVQLHKRAEKIAVMLMERGhLQDGDHVALVYPPGIDLIAAFYGCLYAG 1096
Cdd:cd05936      1 LADLLEEAARRFPDK---TALIFMGR---KLTYRELDALAEAFAAGLQNLG-VQPGDRVALMLPNCPQFPIAYFGALKAG 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1097 CVPITVRPphpqniattlptvkmivevsrsacLMTTQLICKLLRSREAAAAVDVRTWplildTDDLPKKRPAQIYKPSNP 1176
Cdd:cd05936     74 AVVVPLNP------------------------LYTPRELEHILNDSGAKALIVAVSF-----TDLLAAGAPLGERVALTP 124
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1177 DTLAYLDFSVSTTGMLAGVKMSHAATSAfcrsIKLQC-----ELYPSREVAICLDP-YCGLGFVLWCLCSVYSGHQSILI 1250
Cdd:cd05936    125 EDVAVLQYTSGTTGVPKGAMLTHRNLVA----NALQIkawleDLLEGDDVVLAALPlFHVFGLTVALLLPLALGATIVLI 200

                   .
gi 1958706156 1251 P 1251
Cdd:cd05936    201 P 201
4CL cd05904
4-Coumarate-CoA Ligase (4CL); 4-Coumarate:coenzyme A ligase is a key enzyme in the ...
1046-1530 1.55e-11

4-Coumarate-CoA Ligase (4CL); 4-Coumarate:coenzyme A ligase is a key enzyme in the phenylpropanoid metabolic pathway for monolignol and flavonoid biosynthesis. It catalyzes the synthesis of hydroxycinnamate-CoA thioesters in a two-step reaction, involving the formation of hydroxycinnamate-AMP anhydride and the nucleophilic substitution of AMP by CoA. The phenylpropanoid pathway is one of the most important secondary metabolism pathways in plants and hydroxycinnamate-CoA thioesters are the precursors of lignin and other important phenylpropanoids.


Pssm-ID: 341230 [Multi-domain]  Cd Length: 505  Bit Score: 68.80  E-value: 1.55e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1046 SLTCVQLHKRAEKIAVMLMERGHLQdGDHVALVYPPGIDLIAAFYGCLYAGCVPITVRPphpqniATTLPTVKMIVEVSR 1125
Cdd:cd05904     32 ALTYAELERRVRRLAAGLAKRGGRK-GDVVLLLSPNSIEFPVAFLAVLSLGAVVTTANP------LSTPAEIAKQVKDSG 104
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1126 SACLMTT-QLICKLlrsreAAAAVDVrtwpLILDTDD---LPKKRPAQIYKPSNP-------DTLAYLDFSVSTTGMLAG 1194
Cdd:cd05904    105 AKLAFTTaELAEKL-----ASLALPV----VLLDSAEfdsLSFSDLLFEADEAEPpvvvikqDDVAALLYSSGTTGRSKG 175
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1195 VKMSH----AATSAFCRSIKLQCElypSREVAICLDPYCGL-GFVLWCLCSVYSGHQSILIPPSELETnpalWLLAVSQY 1269
Cdd:cd05904    176 VMLTHrnliAMVAQFVAGEGSNSD---SEDVFLCVLPMFHIyGLSSFALGLLRLGATVVVMPRFDLEE----LLAAIERY 248
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1270 KVRDTFCSYSVMELCTKglgsqteSLKARGLDLSRVRTCVVVAEERPRiALTQSFSKLFKDlglhpravstsfgcrVNLa 1349
Cdd:cd05904    249 KVTHLPVVPPIVLALVK-------SPIVDKYDLSSLRQIMSGAAPLGK-ELIEAFRAKFPN---------------VDL- 304
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1350 icLQPHRLWTLAEQGTSGPDPTtvyvdmralrHDRVRlveRGSphslplmeSGKILPGVRIIIANPETKGPLGDSHLGEI 1429
Cdd:cd05904    305 --GQGYGMTESTGVVAMCFAPE----------KDRAK---YGS--------VGRLVPNVEAKIVDPETGESLPPNQTGEL 361
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1430 WVHSAHNASGYFtiyGDESlqsdhfnsrlsfgDTQ-TI----WARTGYLGFLrrteltDANGErhdaLYVVGALDEAMEL 1504
Cdd:cd05904    362 WIRGPSIMKGYL---NNPE-------------ATAaTIdkegWLHTGDLCYI------DEDGY----LFIVDRLKELIKY 415
                          490       500
                   ....*....|....*....|....*.
gi 1958706156 1505 RGMRYHPIDIEtSVIRAHKSVTECAV 1530
Cdd:cd05904    416 KGFQVAPAELE-ALLLSHPEILDAAV 440
DltA cd05945
D-alanine:D-alanyl carrier protein ligase (DltA) and similar proteins; This family includes ...
1025-1556 9.18e-11

D-alanine:D-alanyl carrier protein ligase (DltA) and similar proteins; This family includes D-alanyl carrier protein ligase DltA and aliphatic beta-amino acid adenylation enzymes IdnL1 and CmiS6. DltA incorporates D-ala in techoic acids in gram-positive bacteria via a two-step process, starting with adenylation of D-alanine that transfers D-alanine to the D-alanyl carrier protein. IdnL1, a short-chain aliphatic beta-amino acid adenylation enzyme, recognizes 3-aminobutanoic acid, and is involved in the synthesis of the macrolactam antibiotic incednine. CmiS6 is a medium-chain beta-amino acid adenylation enzyme that recognizes 3-aminononanoic acid, and is involved in the synthesis of cremimycin, also a macrolactam antibiotic. The adenylation (A) domain of NRPS recognizes a specific amino acid or hydroxy acid and activates it as an (amino) acyl adenylate by hydrolysis of ATP. The activated acyl moiety then forms a thioester bond to the enzyme-bound cofactor phosphopantetheine of a peptidyl carrier protein domain. NRPSs are large multifunctional enzymes which synthesize many therapeutically useful peptides in bacteria and fungi via a template-directed, nucleic acid independent nonribosomal mechanism. These natural products include antibiotics, immunosuppressants, plant and animal toxins, and enzyme inhibitors. NRPS has a distinct modular structure in which each module is responsible for the recognition, activation, and in some cases, modification of a single amino acid residue of the final peptide product. The modules can be subdivided into domains that catalyze specific biochemical reactions.


Pssm-ID: 341267 [Multi-domain]  Cd Length: 449  Bit Score: 66.12  E-value: 9.18e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1025 AQTTPDHLLYTllnCRGTianSLTCVQLHKRAEKIAVMLMERGhLQDGDHVALVYPPGIDLIAAFYGCLYAGC--VPITV 1102
Cdd:cd05945      1 AAANPDRPAVV---EGGR---TLTYRELKERADALAAALASLG-LDAGDPVVVYGHKSPDAIAAFLAALKAGHayVPLDA 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1103 RPPHPQniattlptVKMIVEVSRSACLMTTqlickllrsreaaaavdvrtwplildtddlpkkrpaqiykpsnPDTLAYL 1182
Cdd:cd05945     74 SSPAER--------IREILDAAKPALLIAD-------------------------------------------GDDNAYI 102
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1183 DFSVSTTGMLAGVKMSHAATSAFCRSIkLQCELYPSREVAICLDPYcglGF---VLWCLCSVYSGhQSILIPPSELETNP 1259
Cdd:cd05945    103 IFTSGSTGRPKGVQISHDNLVSFTNWM-LSDFPLGPGDVFLNQAPF---SFdlsVMDLYPALASG-ATLVPVPRDATADP 177
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1260 ALWLLAVSQYKVRDTFCSYSVMELCTkGLGSQTESLkargldLSRVRTCVVVAEERPrIALTQSFSKLFkdlglhPravs 1339
Cdd:cd05945    178 KQLFRFLAEHGITVWVSTPSFAAMCL-LSPTFTPES------LPSLRHFLFCGEVLP-HKTARALQQRF------P---- 239
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1340 tsfGCRV-NlaiclqphrlwtlaeqgTSGPDPTTVYVdmraLRHDRVRLVERGSPhSLPLmesGKILPGVRIIIANPETK 1418
Cdd:cd05945    240 ---DARIyN-----------------TYGPTEATVAV----TYIEVTPEVLDGYD-RLPI---GYAKPGAKLVILDEDGR 291
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1419 gPLGDSHLGEIWVHSAHNASGYFtiyGDEslqsDHFNSRLSFGDTQTiWARTGYLGFLrrteltDANGErhdaLYVVGAL 1498
Cdd:cd05945    292 -PVPPGEKGELVISGPSVSKGYL---NNP----EKTAAAFFPDEGQR-AYRTGDLVRL------EADGL----LFYRGRL 352
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1958706156 1499 DEAMELRGMRYHPIDIETSViRAHKSVTECAVFTWTNL-----LVVVVELDGSEqEALDLVPL 1556
Cdd:cd05945    353 DFQVKLNGYRIELEEIEAAL-RQVPGVKEAVVVPKYKGekvteLIAFVVPKPGA-EAGLTKAI 413
A_NRPS_VisG_like cd17651
similar to adenylation domain of virginiamycin S synthetase; This family of the adenylation (A) ...
1025-1530 2.24e-10

similar to adenylation domain of virginiamycin S synthetase; This family of the adenylation (A) domain of nonribosomal peptide synthases (NRPS) includes virginiamycin S synthetase (VisG) in Streptomyces virginiae; VisG is involved in virginiamycin S (VS) biosynthesis as the provider of an L-pheGly molecule, a highly specific substrate for the last condensation step by VisF. This family also includes linear gramicidin synthetase B (LgrB) in Brevibacillus brevis. Substrate specificity analysis using residues of the substrate-binding pockets of all 16 adenylation domains has shown good agreement of the substrate amino acids predicted with the sequence of linear gramicidin. The adenylation (A) domain of NRPS recognizes a specific amino acid or hydroxy acid and activates it as an (amino) acyl adenylate by hydrolysis of ATP. The activated acyl moiety then forms a thioester bond to the enzyme-bound cofactor phosphopantetheine of a peptidyl carrier protein domain. NRPSs are large multifunctional enzymes which synthesize many therapeutically useful peptides in bacteria and fungi via a template-directed, nucleic acid independent nonribosomal mechanism. These natural products include antibiotics, immunosuppressants, plant and animal toxins, and enzyme inhibitors. NRPS has a distinct modular structure in which each module is responsible for the recognition, activation, and in some cases, modification of a single amino acid residue of the final peptide product. The modules can be subdivided into domains that catalyze specific biochemical reactions.


Pssm-ID: 341306 [Multi-domain]  Cd Length: 491  Bit Score: 65.06  E-value: 2.24e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1025 AQTTPDHllyTLLNCRGTianSLTCVQLHKRAEKIAVMLMERGhLQDGDHVALVYPPGIDLIAAFYGCLYAG--CVPITV 1102
Cdd:cd17651      5 AARTPDA---PALVAEGR---RLTYAELDRRANRLAHRLRARG-VGPGDLVALCARRSAELVVALLAILKAGaaYVPLDP 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1103 RPPhPQNIAttlptvkMIVEVSRSACLMTTQlickllrsREAAAAVDVRTWPLILDTDDLPKKRPAQIYKPSNPDTLAYL 1182
Cdd:cd17651     78 AYP-AERLA-------FMLADAGPVLVLTHP--------ALAGELAVELVAVTLLDQPGAAAGADAEPDPALDADDLAYV 141
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1183 DFSVSTTGMLAGVKMSHAATSAFCRSIKLQCELYPSREVAicldPYCGLGF--VLWCLCSVYSGHQSILIPPSELETNPA 1260
Cdd:cd17651    142 IYTSGSTGRPKGVVMPHRSLANLVAWQARASSLGPGARTL----QFAGLGFdvSVQEIFSTLCAGATLVLPPEEVRTDPP 217
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1261 LWLLAVSQYKVRDTFCSYSVME-LCtkglgsqtESLKARGLDLSRVRtCVVVAEERprialtqsfsklfkdLGLHPRavs 1339
Cdd:cd17651    218 ALAAWLDEQRISRVFLPTVALRaLA--------EHGRPLGVRLAALR-YLLTGGEQ---------------LVLTED--- 270
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1340 tsfgcrvnlaiclqpHRLWTLAEQG-----TSGPDPTTVyVDMRALRHDRVRlveRGSPHSLplmesGKILPGVRIIIAN 1414
Cdd:cd17651    271 ---------------LREFCAGLPGlrlhnHYGPTETHV-VTALSLPGDPAA---WPAPPPI-----GRPIDNTRVYVLD 326
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1415 PETKgPLGDSHLGEIWVHSAHNASGYFTIYG--DESLQSDHF--NSRLSfgdtqtiwaRTGYLGflRRteltDANGErhd 1490
Cdd:cd17651    327 AALR-PVPPGVPGELYIGGAGLARGYLNRPEltAERFVPDPFvpGARMY---------RTGDLA--RW----LPDGE--- 387
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|
gi 1958706156 1491 aLYVVGALDEAMELRGMRYHPIDIEtSVIRAHKSVTECAV 1530
Cdd:cd17651    388 -LEFLGRADDQVKIRGFRIELGEIE-AALARHPGVREAVV 425
PRK07768 PRK07768
long-chain-fatty-acid--CoA ligase; Validated
551-943 4.47e-10

long-chain-fatty-acid--CoA ligase; Validated


Pssm-ID: 236091 [Multi-domain]  Cd Length: 545  Bit Score: 64.25  E-value: 4.47e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  551 VTVTRIALLTHCQALTQACGYT-EAETIVNVLDFKKDVGLWhGILTSVMNM-MHVISIPYALMKVNPLSWIQKVCQYKAK 628
Cdd:PRK07768   170 VQITHGNLYANAEAMFVAAEFDvETDVMVSWLPLFHDMGMV-GFLTVPMYFgAELVKVTPMDFLRDPLLWAELISKYRGT 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  629 VACvkSRDMHWALVAHR-----DQRDVNLSSLRmlIVADGANPWSISSCDAFLNVFQSKGLRQEVICPCASSPEAlTVAI 703
Cdd:PRK07768   249 MTA--APNFAYALLARRlrrqaKPGAFDLSSLR--FALNGAEPIDPADVEDLLDAGARFGLRPEAILPAYGMAEA-TLAV 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  704 rrptddSNQPPGRGvlsmhgLTYGVIRVD--SEEKLSVLTVQD-------VGLVMPGAIMCSVKPDGvpQLCRTDEIGEL 774
Cdd:PRK07768   324 ------SFSPCGAG------LVVDEVDADllAALRRAVPATKGntrrlatLGPPLPGLEVRVVDEDG--QVLPPRGVGVI 389
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  775 CVCAVATgTSYYglsgmtkntfevfpMTSSG--APISEYPFIRTGLLGFVGPGGLVFVVGKMDGLMVVSGRRHNADDIVA 852
Cdd:PRK07768   390 ELRGESV-TPGY--------------LTMDGfiPAQDADGWLDTGDLGYLTEEGEVVVCGRVKDVIIMAGRNIYPTDIER 454
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  853 TALAVEPmkfVYRGRIAVFSVTVLHD-ERIVIVAEQRPDSTEEDSFQWMSRVLQAIDSihQVGV--YCLALVPANTLPKT 929
Cdd:PRK07768   455 AAARVEG---VRPGNAVAVRLDAGHSrEGFAVAVESNAFEDPAEVRRIRHQVAHEVVA--EVGVrpRNVVVLGPGSIPKT 529
                          410
                   ....*....|....
gi 1958706156  930 PLGGIHLSETKQLF 943
Cdd:PRK07768   530 PSGKLRRANAAELV 543
AFD_class_I cd04433
Adenylate forming domain, Class I, also known as the ANL superfamily; This family is known as ...
550-934 4.49e-10

Adenylate forming domain, Class I, also known as the ANL superfamily; This family is known as the ANL (acyl-CoA synthetases, the NRPS adenylation domains, and the Luciferase enzymes) superfamily. It includes acyl- and aryl-CoA ligases, as well as the adenylation domain of nonribosomal peptide synthetases and firefly luciferases.The adenylate-forming enzymes catalyze an ATP-dependent two-step reaction to first activate a carboxylate substrate as an adenylate and then transfer the carboxylate to the pantetheine group of either coenzyme A or an acyl-carrier protein. The active site of the domain is located at the interface of a large N-terminal subdomain and a smaller C-terminal subdomain.


Pssm-ID: 341228 [Multi-domain]  Cd Length: 336  Bit Score: 63.07  E-value: 4.49e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  550 GVTVTRIALLTHCQALTQACGYTEAETIVNVLDFKKDVGLWhGILTSVMNMMHVISIPyalmKVNPLSWIQKVCQYKAKV 629
Cdd:cd04433     17 GVVLSHRNLLAAAAALAASGGLTEGDVFLSTLPLFHIGGLF-GLLGALLAGGTVVLLP----KFDPEAALELIEREKVTI 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  630 ACVkSRDMHWALVAHRDQRDVNLSSLRMLIVadGANPWSISSCDAFLNVFqskglrqevicpcasspealtvairrptdd 709
Cdd:cd04433     92 LLG-VPTLLARLLKAPESAGYDLSSLRALVS--GGAPLPPELLERFEEAP------------------------------ 138
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  710 snqppGRGVLSMHGLT-----YGVIRVDSEEKLSVltvqDVGLVMPGAIMCSVKPDGVPqlCRTDEIGELCVcavatgTS 784
Cdd:cd04433    139 -----GIKLVNGYGLTetggtVATGPPDDDARKPG----SVGRPVPGVEVRIVDPDGGE--LPPGEIGELVV------RG 201
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  785 YYGLSGMTKNTFEVFPMTSSGapiseypFIRTGLLGFVGPGGLVFVVGKMDGLMVVSGRRHNADDIVATALAVEPMKfvy 864
Cdd:cd04433    202 PSVMKGYWNNPEATAAVDEDG-------WYRTGDLGRLDEDGYLYIVGRLKDMIKSGGENVYPAEVEAVLLGHPGVA--- 271
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1958706156  865 rgRIAVFSVTvlhDER------IVIVAEQRPDSTEEDSFQWMSRVLQAIDSIHQVgvyclalVPANTLPKTPLGGI 934
Cdd:cd04433    272 --EAAVVGVP---DPEwgervvAVVVLRPGADLDAEELRAHVRERLAPYKVPRRV-------VFVDALPRTASGKI 335
A_NRPS_AB3403-like cd17646
Peptide Synthetase; The adenylation (A) domain of NRPS recognizes a specific amino acid or ...
1023-1530 6.57e-10

Peptide Synthetase; The adenylation (A) domain of NRPS recognizes a specific amino acid or hydroxy acid and activates it as an (amino) acyl adenylate by hydrolysis of ATP. The activated acyl moiety then forms a thioester bond to the enzyme-bound cofactor phosphopantetheine of a peptidyl carrier protein domain. NRPSs are large multifunctional enzymes which synthesize many therapeutically useful peptides in bacteria and fungi via a template-directed, nucleic acid independent nonribosomal mechanism. These natural products include antibiotics, immunosuppressants, plant and animal toxins, and enzyme inhibitors. NRPS has a distinct modular structure in which each module is responsible for the recognition, activation, and in some cases, modification of a single amino acid residue of the final peptide product. The modules can be subdivided into domains that catalyze specific biochemical reactions.


Pssm-ID: 341301 [Multi-domain]  Cd Length: 488  Bit Score: 63.45  E-value: 6.57e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1023 WRAQT--TPDH--LLYTllncrgtiANSLTCVQLHKRAEKIAVMLMERGhLQDGDHVALVYPPGIDLIAAFYGCLYAGCV 1098
Cdd:cd17646      4 VAEQAarTPDApaVVDE--------GRTLTYRELDERANRLAHLLRARG-VGPEDRVAVLLPRSADLVVALLAVLKAGAA 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1099 PITVRPPHPQniattlPTVKMIVEVSRSACLMTTQlickllRSREAAAAVDVRTWPLILDTDDLPKKRPAQiykPSNPDT 1178
Cdd:cd17646     75 YLPLDPGYPA------DRLAYMLADAGPAVVLTTA------DLAARLPAGGDVALLGDEALAAPPATPPLV---PPRPDN 139
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1179 LAYLDFSVSTTGMLAGVKMSHAatsAFCRSIKLQCELYP--SREVAICLDPycgLGF------VLWCLCsvySGhQSILI 1250
Cdd:cd17646    140 LAYVIYTSGSTGRPKGVMVTHA---GIVNRLLWMQDEYPlgPGDRVLQKTP---LSFdvsvweLFWPLV---AG-ARLVV 209
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1251 PPSELETNPALWLLAVSQYKVrdTFCSY--SVMELCtkglgsqteslkargLDLSRVRTC-----VVVAEErpriALTQS 1323
Cdd:cd17646    210 ARPGGHRDPAYLAALIREHGV--TTCHFvpSMLRVF---------------LAEPAAGSCaslrrVFCSGE----ALPPE 268
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1324 FSKLFKDLglhpravstsFGCRVnlaiclqpHRLWtlaeqgtsGPDPTTVYVDmralrHDRVRlvERGSPHSLPLmesGK 1403
Cdd:cd17646    269 LAARFLAL----------PGAEL--------HNLY--------GPTEAAIDVT-----HWPVR--GPAETPSVPI---GR 312
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1404 ILPGVRIIIANPETKgPLGDSHLGEIWVHSAHNASGYftiYGDESLQSDHF-------NSRLSfgdtqtiwaRTGYLGfl 1476
Cdd:cd17646    313 PVPNTRLYVLDDALR-PVPVGVPGELYLGGVQLARGY---LGRPALTAERFvpdpfgpGSRMY---------RTGDLA-- 377
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1958706156 1477 RRTeltdANGerhdALYVVGALDEAMELRGMRYHPIDIETsVIRAHKSVTECAV 1530
Cdd:cd17646    378 RWR----PDG----ALEFLGRSDDQVKIRGFRVEPGEIEA-ALAAHPAVTHAVV 422
A_NRPS_TlmIV_like cd12114
The adenylation domain of nonribosomal peptide synthetases (NRPS), including ...
1046-1271 9.72e-10

The adenylation domain of nonribosomal peptide synthetases (NRPS), including Streptoalloteichus tallysomycin biosynthesis genes; The adenylation (A) domain of NRPS recognizes a specific amino acid or hydroxy acid and activates it as an (amino) acyl adenylate by hydrolysis of ATP. The activated acyl moiety then forms a thioester to the enzyme-bound cofactor phosphopantetheine of a peptidyl carrier protein domain. NRPSs are large multifunctional enzymes which synthesize many therapeutically useful peptides in bacteria and fungi via a template-directed, nucleic acid independent nonribosomal mechanism. These natural products include antibiotics, immunosuppressants, plant and animal toxins, and enzyme inhibitors. NRPS has a distinct modular structure in which each module is responsible for the recognition, activation, and in some cases, modification of a single amino acid residue of the final peptide product. The modules can be subdivided into domains that catalyze specific biochemical reactions. This family includes the TLM biosynthetic gene cluster from Streptoalloteichus that consists of nine NRPS genes; the N-terminal module of TlmVI (NRPS-5) and the starter module of BlmVI (NRPS-5) are comprised of the acyl CoA ligase (AL) and acyl carrier protein (ACP)-like domains, which are thought to be involved in the biosynthesis of the beta-aminoalaninamide moiety.


Pssm-ID: 341279 [Multi-domain]  Cd Length: 477  Bit Score: 63.06  E-value: 9.72e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1046 SLTCVQLHKRAEKIAVMLMERGhLQDGDHVALVYPPGIDLIAAFYGCLYAGC--VPITVRPPhPQNIATtlptvkmIVEV 1123
Cdd:cd12114     12 TLTYGELAERARRVAGALKAAG-VRPGDLVAVTLPKGPEQVVAVLGILAAGAayVPVDIDQP-AARREA-------ILAD 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1124 SRSACLMTTQLICKLLRSREAAAAVDvrtwpliLDTDDLPKKRPAQiykPSNPDTLAYLDFSVSTTGMLAGVKMSHAATS 1203
Cdd:cd12114     83 AGARLVLTDGPDAQLDVAVFDVLILD-------LDALAAPAPPPPV---DVAPDDLAYVIFTSGSTGTPKGVMISHRAAL 152
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1958706156 1204 AFCRSIKLQCELYPSREVaICLDPycgLGFVLwclcSVY------SGHQSILIPPSELETNPALWLLAVSQYKV 1271
Cdd:cd12114    153 NTILDINRRFAVGPDDRV-LALSS---LSFDL----SVYdifgalSAGATLVLPDEARRRDPAHWAELIERHGV 218
A_NRPS_Ta1_like cd12116
The adenylation domain of nonribosomal peptide synthetases (NRPS), including salinosporamide A ...
1046-1547 1.24e-09

The adenylation domain of nonribosomal peptide synthetases (NRPS), including salinosporamide A polyketide synthase; The adenylation (A) domain of NRPS recognizes a specific amino acid or hydroxy acid and activates it as an (amino) acyl adenylate by hydrolysis of ATP. The activated acyl moiety then forms a thioester to the enzyme-bound cofactor phosphopantetheine of a peptidyl carrier protein domain. NRPSs are large multifunctional enzymes which synthesize many therapeutically useful peptides in bacteria and fungi via a template-directed, nucleic acid independent nonribosomal mechanism. These natural products include antibiotics, immunosuppressants, plant and animal toxins, and enzyme inhibitors. NRPS has a distinct modular structure in which each module is responsible for the recognition, activation, and in some cases, modification of a single amino acid residue of the final peptide product. The modules can be subdivided into domains that catalyze specific biochemical reactions. This family includes the myxovirescin (TA) antibiotic biosynthetic gene in Myxococcus xanthus; TA production plays a role in predation. It also includes the salinosporamide A polyketide synthase which is involved in the biosynthesis of salinosporamide A, a marine microbial metabolite whose chlorine atom is crucial for potent proteasome inhibition and anticancer activity.


Pssm-ID: 341281 [Multi-domain]  Cd Length: 470  Bit Score: 62.69  E-value: 1.24e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1046 SLTCVQLHKRAEKIAVMLMERGhLQDGDHVALVYPPGIDLIAAFYGCLYAGCVPITVRPPHPqniattLPTVKMIVEVSR 1125
Cdd:cd12116     12 SLSYAELDERANRLAARLRARG-VGPGDRVAVYLPRSARLVAAMLAVLKAGAAYVPLDPDYP------ADRLRYILEDAE 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1126 SACLMTTQlickllrSREAAAAVDVRTWPLILDTDDLPkkrPAQIYKPSNPDTLAYLDFSVSTTGMLAGVKMSHAATSAF 1205
Cdd:cd12116     85 PALVLTDD-------ALPDRLPAGLPVLLLALAAAAAA---PAAPRTPVSPDDLAYVIYTSGSTGRPKGVVVSHRNLVNF 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1206 CRSIklqcelypsREvaicldpycglgfvlwclcsvysghqsilippsELETNPALWLLAVSQYkvrdTFcSYSVMEL-- 1283
Cdd:cd12116    155 LHSM---------RE---------------------------------RLGLGPGDRLLAVTTY----AF-DISLLELll 187
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1284 ----------CTKGLGSQTESLKARgldLSRVRTCVVVAeerprialTQSFSKLFKDLGLHPRAvstsfGCRV------- 1346
Cdd:cd12116    188 pllagarvviAPRETQRDPEALARL---IEAHSITVMQA--------TPATWRMLLDAGWQGRA-----GLTAlcggeal 251
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1347 --NLA--ICLQPHRLWTLaeqgtSGPDPTTVYVDMRALrhdrvrlveRGSPHSLPLmesGKILPGVRIIIANPETKgPLG 1422
Cdd:cd12116    252 ppDLAarLLSRVGSLWNL-----YGPTETTIWSTAARV---------TAAAGPIPI---GRPLANTQVYVLDAALR-PVP 313
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1423 DSHLGEIWVHSAHNASGYftiYGDESLQSDHFnSRLSFGDTQTIWARTGYLgfLRRteltDANGErhdaLYVVGALDEAM 1502
Cdd:cd12116    314 PGVPGELYIGGDGVAQGY---LGRPALTAERF-VPDPFAGPGSRLYRTGDL--VRR----RADGR----LEYLGRADGQV 379
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*....
gi 1958706156 1503 ELRGMRYHPIDIETsVIRAHKSVTECAVFTWTN----LLVVVVELDGSE 1547
Cdd:cd12116    380 KIRGHRIELGEIEA-ALAAHPGVAQAAVVVREDggdrRLVAYVVLKAGA 427
EntF COG1020
EntF, seryl-AMP synthase component of non-ribosomal peptide synthetase [Secondary metabolites ...
1024-1553 1.42e-09

EntF, seryl-AMP synthase component of non-ribosomal peptide synthetase [Secondary metabolites biosynthesis, transport and catabolism];


Pssm-ID: 440643 [Multi-domain]  Cd Length: 1329  Bit Score: 63.34  E-value: 1.42e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1024 RAQTTPDHllyTLLNCRGTianSLTCVQLHKRAEKIAVMLMERGhLQDGDHVALVYPPGIDLIAAFYGCLYAGC--VPIT 1101
Cdd:COG1020    485 QAARTPDA---VAVVFGDQ---SLTYAELNARANRLAHHLRALG-VGPGDLVGVCLERSLEMVVALLAVLKAGAayVPLD 557
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1102 vrPPHPQN-IAttlptvkMIVEVSRSACLMTTQlickLLRSREAAAAVDVrtwpLILDTDDLPKKRPAQIYKPSNPDTLA 1180
Cdd:COG1020    558 --PAYPAErLA-------YMLEDAGARLVLTQS----ALAARLPELGVPV----LALDALALAAEPATNPPVPVTPDDLA 620
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1181 YLDFSVSTTGMLAGVKMSHAATSAFCRSIKLQCELYPSREVAicldPYCGLGF------VLWCLCsvySGHQSILIPPsE 1254
Cdd:COG1020    621 YVIYTSGSTGRPKGVMVEHRALVNLLAWMQRRYGLGPGDRVL----QFASLSFdasvweIFGALL---SGATLVLAPP-E 692
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1255 LETNPALWLLAVSQYKVrdtfcsySVMELcTKGLGSQTesLKARGLDLSRVRTcVVVAEERPRIALTQSFSKLFKDLGLH 1334
Cdd:COG1020    693 ARRDPAALAELLARHRV-------TVLNL-TPSLLRAL--LDAAPEALPSLRL-VLVGGEALPPELVRRWRARLPGARLV 761
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1335 pravstsfgcrvNLAiclqphrlwtlaeqgtsGPDPTTVYVDMRALRHDRVrlvergSPHSLPLmesGKILPGVRIIIAn 1414
Cdd:COG1020    762 ------------NLY-----------------GPTETTVDSTYYEVTPPDA------DGGSVPI---GRPIANTRVYVL- 802
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1415 petkgplgDSHL--------GEIWVHSAHNASGYFtiyGDESLQSDHF--NsrlSFGDTQTIWARTGYLGflRRTeltdA 1484
Cdd:COG1020    803 --------DAHLqpvpvgvpGELYIGGAGLARGYL---NRPELTAERFvaD---PFGFPGARLYRTGDLA--RWL----P 862
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1958706156 1485 NGErhdaLYVVGALDEAMELRGMRYHPIDIEtSVIRAHKSVTECAVFTWTN-----LLVVVVELDGSEQEALDL 1553
Cdd:COG1020    863 DGN----LEFLGRADDQVKIRGFRIELGEIE-AALLQHPGVREAVVVAREDapgdkRLVAYVVPEAGAAAAAAL 931
PRK06187 PRK06187
long-chain-fatty-acid--CoA ligase; Validated
392-842 1.19e-08

long-chain-fatty-acid--CoA ligase; Validated


Pssm-ID: 235730 [Multi-domain]  Cd Length: 521  Bit Score: 59.43  E-value: 1.19e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  392 TYGKLWTRSMKVAyNILHKLGtkqepmVRPGDRVALVFPNNDpaAFMVAFYGCLLAEVVPVPIEVPLTrkdagSQQIGFL 471
Cdd:PRK06187    33 TYAELDERVNRLA-NALRALG------VKKGDRVAVFDWNSH--EYLEAYFAVPKIGAVLHPINIRLK-----PEEIAYI 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  472 LGSCGVTVALTSDAcHKGLPKSPTGEIPQFKGW--------PKLLWFVTESKHL--SKPPRDWFPHIKDanNDTAYIEYK 541
Cdd:PRK06187    99 LNDAEDRVVLVDSE-FVPLLAAILPQLPTVRTVivegdgpaAPLAPEVGEYEELlaAASDTFDFPDIDE--NDAAAMLYT 175
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  542 TCKDGSVLGVTVTRIALLTHCQALTQACGYTEaetivnvldfkKDVGLwhgiltSVMNMMHV--ISIPY-ALM------- 611
Cdd:PRK06187   176 SGTTGHPKGVVLSHRNLFLHSLAVCAWLKLSR-----------DDVYL------VIVPMFHVhaWGLPYlALMagakqvi 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  612 --KVNPlswiQKVCQY--KAKV---ACVKSrdMHWALVAHRDQRDVNLSSLRMLIVadGANPWSISSCDAFLNVFqskgl 684
Cdd:PRK06187   239 prRFDP----ENLLDLieTERVtffFAVPT--IWQMLLKAPRAYFVDFSSLRLVIY--GGAALPPALLREFKEKF----- 305
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  685 rqevicpcasspealtvairrptddsnqppGRGVLSMHGL--TYGVIRVD--SEEKLSVLTVQ-DVGLVMPG---AImcs 756
Cdd:PRK06187   306 ------------------------------GIDLVQGYGMteTSPVVSVLppEDQLPGQWTKRrSAGRPLPGveaRI--- 352
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  757 VKPDGVPQLCRTDEIGELCVCAVATGTSYYGLSGMTKNTFEvfpmtsSGapiseypFIRTGLLGFVGPGGLVFVVGKMDG 836
Cdd:PRK06187   353 VDDDGDELPPDGGEVGEIIVRGPWLMQGYWNRPEATAETID------GG-------WLHTGDVGYIDEDGYLYITDRIKD 419

                   ....*.
gi 1958706156  837 lMVVSG 842
Cdd:PRK06187   420 -VIISG 424
PRK07656 PRK07656
long-chain-fatty-acid--CoA ligase; Validated
354-659 5.02e-08

long-chain-fatty-acid--CoA ligase; Validated


Pssm-ID: 236072 [Multi-domain]  Cd Length: 513  Bit Score: 57.61  E-value: 5.02e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  354 VTNWPPSLEAALQRWgtisPKAPCLTTMDTNgkplyiLTYGKLWTRSMKVAyNILHKLGtkqepmVRPGDRVALVFPNNd 433
Cdd:PRK07656     4 WMTLPELLARAARRF----GDKEAYVFGDQR------LTYAELNARVRRAA-AALAALG------IGKGDRVAIWAPNS- 65
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  434 pAAFMVAFYGCLLAEVVPVPIEvplTRKDAGsqQIGFLLGSCGVTVALTSD-------ACHKGLPK----------SPTG 496
Cdd:PRK07656    66 -PHWVIAALGALKAGAVVVPLN---TRYTAD--EAAYILARGDAKALFVLGlflgvdySATTRLPAlehvviceteEDDP 139
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  497 EIPQFKGWPKLLwfvteskhLSKPPRDWFPHIKDanNDTAYIEYKTCKDGSVLGVtvtriaLLTHCQALTQA------CG 570
Cdd:PRK07656   140 HTEKMKTFTDFL--------AAGDPAERAPEVDP--DDVADILFTSGTTGRPKGA------MLTHRQLLSNAadwaeyLG 203
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  571 YTEAETIVNVLDFKKDVGLWHGILTSVMNMMHVISIPyalmKVNPLSWIQKVCQYKAKV-ACVKSrdMHWALVAHRDQRD 649
Cdd:PRK07656   204 LTEGDRYLAANPFFHVFGYKAGVNAPLMRGATILPLP----VFDPDEVFRLIETERITVlPGPPT--MYNSLLQHPDRSA 277
                          330
                   ....*....|
gi 1958706156  650 VNLSSLRMLI 659
Cdd:PRK07656   278 EDLSSLRLAV 287
PRK09274 PRK09274
peptide synthase; Provisional
1021-1551 6.23e-08

peptide synthase; Provisional


Pssm-ID: 236443 [Multi-domain]  Cd Length: 552  Bit Score: 57.22  E-value: 6.23e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1021 LQWRAQTTPDHLLYTLLNCRGTIAN----SLTCVQLHKRAEKIAVMLMERGhLQDGDHVALVYPPGIDLIAAFYGCLYAG 1096
Cdd:PRK09274    12 LPRAAQERPDQLAVAVPGGRGADGKlaydELSFAELDARSDAIAHGLNAAG-IGRGMRAVLMVTPSLEFFALTFALFKAG 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1097 CVPITVRP-------------PHPQ--------NIATTL-----PTVKMIVEVSRSACLMTTQLickllrsreaaaavdv 1150
Cdd:PRK09274    91 AVPVLVDPgmgiknlkqclaeAQPDafigipkaHLARRLfgwgkPSVRRLVTVGGRLLWGGTTL---------------- 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1151 rtwplildtDDLPKKRPAQIYKP--SNPDTLAYLDFSVSTTGMLAGVKMSHaatSAFCRSIKLQCELYPSR--EVAIC-- 1224
Cdd:PRK09274   155 ---------ATLLRDGAAAPFPMadLAPDDMAAILFTSGSTGTPKGVVYTH---GMFEAQIEALREDYGIEpgEIDLPtf 222
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1225 -----LDPYCGlgfvlwcLCSVysghqsilIPPSEL----ETNPALWLLAVSQYKVRDTFCSYSVMELCTKglgsqteSL 1295
Cdd:PRK09274   223 plfalFGPALG-------MTSV--------IPDMDPtrpaTVDPAKLFAAIERYGVTNLFGSPALLERLGR-------YG 280
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1296 KARGLDLSRVRTcVVVAEERPRIALTQSFSKLfkdlgLHPRA-VSTSFGCRVNLAICLqphrlwtlaeqgtsgpdpttvy 1374
Cdd:PRK09274   281 EANGIKLPSLRR-VISAGAPVPIAVIERFRAM-----LPPDAeILTPYGATEALPISS---------------------- 332
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1375 VDMRALRHDRVRLVERGSPHSLplmesGKILPGVRI-IIA---NP-----ETKgPLGDSHLGEIWVHSAHNASGYFtiyg 1445
Cdd:PRK09274   333 IESREILFATRAATDNGAGICV-----GRPVDGVEVrIIAisdAPipewdDAL-RLATGEIGEIVVAGPMVTRSYY---- 402
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1446 deslQSDHFN--SRLSFGDTQtIWARTGYLGFLrrteltDANGErhdaLYVVGALDEAMELRGMRYHPIDIEtSVIRAHK 1523
Cdd:PRK09274   403 ----NRPEATrlAKIPDGQGD-VWHRMGDLGYL------DAQGR----LWFCGRKAHRVETAGGTLYTIPCE-RIFNTHP 466
                          570       580
                   ....*....|....*....|....*...
gi 1958706156 1524 SVTECAvftwtnllVVVVELDGSEQEAL 1551
Cdd:PRK09274   467 GVKRSA--------LVGVGVPGAQRPVL 486
FC-FACS_FadD_like cd05936
Prokaryotic long-chain fatty acid CoA synthetases similar to Escherichia coli FadD; This ...
361-482 1.16e-07

Prokaryotic long-chain fatty acid CoA synthetases similar to Escherichia coli FadD; This subfamily of the AMP-forming adenylation family contains Escherichia coli FadD and similar prokaryotic fatty acid CoA synthetases. FadD was characterized as a long-chain fatty acid CoA synthetase. The gene fadD is regulated by the fatty acid regulatory protein FadR. Fatty acid CoA synthetase catalyzes the formation of fatty acyl-CoA in a two-step reaction: the formation of a fatty acyl-AMP molecule as an intermediate, followed by the formation of a fatty acyl-CoA. This is a required step before free fatty acids can participate in most catabolic and anabolic reactions.


Pssm-ID: 341259 [Multi-domain]  Cd Length: 468  Bit Score: 56.42  E-value: 1.16e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  361 LEAALQRWgtisPKAPCLTTMDTNgkplyiLTYGKLWTRSMKVAyNILHKLGtkqepmVRPGDRVALVFPNNdpAAFMVA 440
Cdd:cd05936      5 LEEAARRF----PDKTALIFMGRK------LTYRELDALAEAFA-AGLQNLG------VQPGDRVALMLPNC--PQFPIA 65
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1958706156  441 FYGCLLAEVVPVPIEVPLTrkdagSQQIGFLLGSCGVTVALT 482
Cdd:cd05936     66 YFGALKAGAVVVPLNPLYT-----PRELEHILNDSGAKALIV 102
PRK12316 PRK12316
peptide synthase; Provisional
1047-1555 1.37e-07

peptide synthase; Provisional


Pssm-ID: 237054 [Multi-domain]  Cd Length: 5163  Bit Score: 56.89  E-value: 1.37e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1047 LTCVQLHKRAEKIAVMLMERGHLQDgDHVALVYPPGIDLIAAFYGCLYAGCVPITVRPPHPQNiattlpTVKMIVEVSRS 1126
Cdd:PRK12316  4577 LTYAELNRRANRLAHALIARGVGPE-VLVGIAMERSAEMMVGLLAVLKAGGAYVPLDPEYPRE------RLAYMMEDSGA 4649
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1127 ACLMTTQLICKLLRSREAAAAVDV---RTWplildtDDLPKKRPAqiyKPSNPDTLAYLDFSVSTTGMLAGVKMSHAATS 1203
Cdd:PRK12316  4650 ALLLTQSHLLQRLPIPDGLASLALdrdEDW------EGFPAHDPA---VRLHPDNLAYVIYTSGSTGRPKGVAVSHGSLV 4720
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1204 AFCRSIKLQCELYPSREVaICLDPYCGLGFVL---WCLCSvysgHQSILIPPSELeTNPALWLLAVSQYKVRDTFCSYSV 1280
Cdd:PRK12316  4721 NHLHATGERYELTPDDRV-LQFMSFSFDGSHEglyHPLIN----GASVVIRDDSL-WDPERLYAEIHEHRVTVLVFPPVY 4794
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1281 MELCTKGlgsqteslKARGLDLSRVRTCVVVAEERPRIALTQSFSKlfkdlglhpravstsfgcrvnlaicLQPHRLWTl 1360
Cdd:PRK12316  4795 LQQLAEH--------AERDGEPPSLRVYCFGGEAVAQASYDLAWRA-------------------------LKPVYLFN- 4840
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1361 aeqgTSGPDPTTVYVDMRALRhdrvrlveRGSPHSLPLMESGKILPGVRIII----ANPETKGPLGDSHLGEIWVhsahn 1436
Cdd:PRK12316  4841 ----GYGPTETTVTVLLWKAR--------DGDACGAAYMPIGTPLGNRSGYVldgqLNPLPVGVAGELYLGGEGV----- 4903
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1437 ASGYFTiygDESLQSDHFNSRlSFGDTQTIWARTGYLGFLRRTELTDangerhdalyVVGALDEAMELRGMRYHPIDIET 1516
Cdd:PRK12316  4904 ARGYLE---RPALTAERFVPD-PFGAPGGRLYRTGDLARYRADGVID----------YLGRVDHQVKIRGFRIELGEIEA 4969
                          490       500       510
                   ....*....|....*....|....*....|....*....
gi 1958706156 1517 SvIRAHKSVTECavftwtnlLVVVVELDGSEQEALDLVP 1555
Cdd:PRK12316  4970 R-LREHPAVREA--------VVIAQEGAVGKQLVGYVVP 4999
PRK12316 PRK12316
peptide synthase; Provisional
1051-1218 2.13e-07

peptide synthase; Provisional


Pssm-ID: 237054 [Multi-domain]  Cd Length: 5163  Bit Score: 56.12  E-value: 2.13e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1051 QLHKRAEKIAVMLMERGhLQDGDHVALVYPPGIDLIAAFYGCLYAGCVPITVRPPHPQNiattlpTVKMIVEVSRSACLM 1130
Cdd:PRK12316  2033 ELDSRANRLAHRLRARG-VGPEVRVAIAAERSFELVVALLAVLKAGGAYVPLDPNYPAE------RLAYMLEDSGAALLL 2105
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1131 T-TQLICKLLRSREAAAavdvrtwpLILDTD----DLPKKRPAQiykPSNPDTLAYLDFSVSTTGMLAGVKMSHAATSAF 1205
Cdd:PRK12316  2106 TqRHLLERLPLPAGVAR--------LPLDRDaewaDYPDTAPAV---QLAGENLAYVIYTSGSTGLPKGVAVSHGALVAH 2174
                          170
                   ....*....|...
gi 1958706156 1206 CRSIKLQCELYPS 1218
Cdd:PRK12316  2175 CQAAGERYELSPA 2187
BCL_4HBCL cd05959
Benzoate CoA ligase (BCL) and 4-Hydroxybenzoate-Coenzyme A Ligase (4-HBA-CoA ligase); Benzoate ...
385-487 2.23e-07

Benzoate CoA ligase (BCL) and 4-Hydroxybenzoate-Coenzyme A Ligase (4-HBA-CoA ligase); Benzoate CoA ligase and 4-hydroxybenzoate-coenzyme A ligase catalyze the first activating step for benzoate and 4-hydroxybenzoate catabolic pathways, respectively. Although these two enzymes share very high sequence homology, they have their own substrate preference. The reaction proceeds via a two-step process; the first ATP-dependent step forms the substrate-AMP intermediate, while the second step forms the acyl-CoA ester, releasing the AMP. Aromatic compounds represent the second most abundant class of organic carbon compounds after carbohydrates. Some bacteria can use benzoic acid or benzenoid compounds as the sole source of carbon and energy through degradation. Benzoate CoA ligase and 4-hydroxybenzoate-Coenzyme A ligase are key enzymes of this process.


Pssm-ID: 341269 [Multi-domain]  Cd Length: 508  Bit Score: 55.45  E-value: 2.23e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  385 GKPLYI-----LTYGKLWTRSMKVAyNILHKLGtkqepmVRPGDRVALVFpnNDPAAFMVAFYGCLLAEVVPVPIEVPLT 459
Cdd:cd05959     19 DKTAFIddagsLTYAELEAEARRVA-GALRALG------VKREERVLLIM--LDTVDFPTAFLGAIRAGIVPVPVNTLLT 89
                           90       100
                   ....*....|....*....|....*...
gi 1958706156  460 rkdagSQQIGFLLGSCGVTVALTSDACH 487
Cdd:cd05959     90 -----PDDYAYYLEDSRARVVVVSGELA 112
FACL_FadD13-like cd17631
fatty acyl-CoA synthetase, including FadD13; This family contains fatty acyl-CoA synthetases, ...
1021-1554 4.38e-07

fatty acyl-CoA synthetase, including FadD13; This family contains fatty acyl-CoA synthetases, including Mycobacterium tuberculosis acid-induced operon MymA encoding the fatty acyl-CoA synthetase FadD13 which is essential for virulence and intracellular growth of the pathogen. The fatty acyl-CoA synthetase activates lipids before entering into the metabolic pathways and is also involved in transmembrane lipid transport. However, unlike soluble fatty acyl-CoA synthetases, but like the mammalian integral-membrane very-long-chain acyl-CoA synthetases, FadD13 accepts lipid substrates up to the maximum length of C26, and this is facilitated by an extensive hydrophobic tunnel from the active site to a positively charged patch. Also included is feruloyl-CoA synthetase (Fcs) in Rhodococcus strains where it is involved in biotechnological vanillin production from eugenol and ferulic acid via a non-beta-oxidative pathway.


Pssm-ID: 341286 [Multi-domain]  Cd Length: 435  Bit Score: 54.15  E-value: 4.38e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1021 LQWRAQTTPDHllyTLLNCRGTianSLTCVQLHKRAEKIAVMLMERGhLQDGDHVALVYPPGIDLIAAFYGCLYAGCVPi 1100
Cdd:cd17631      1 LRRRARRHPDR---TALVFGGR---SLTYAELDERVNRLAHALRALG-VAKGDRVAVLSKNSPEFLELLFAAARLGAVF- 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1101 tvrppHPQNIATTLPTVKMIVEVSRSACLMttqlickllrsreaaaavdvrtwplildtddlpkkrpaqiykpsnpDTLA 1180
Cdd:cd17631     73 -----VPLNFRLTPPEVAYILADSGAKVLF----------------------------------------------DDLA 101
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1181 YLDFSVSTTGMLAGVKMSHAATSAFCRSIKLQCELyPSREVAICLDPYC---GLGfvLWCLCSVYSGHQSILIPpselET 1257
Cdd:cd17631    102 LLMYTSGTTGRPKGAMLTHRNLLWNAVNALAALDL-GPDDVLLVVAPLFhigGLG--VFTLPTLLRGGTVVILR----KF 174
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1258 NPALWLLAVSQYKVRDTFCSYSVME-LCTKGlgsqteslKARGLDLSRVRtCVVVAEERPRIALTQSFSKlfkdlglhpr 1336
Cdd:cd17631    175 DPETVLDLIERHRVTSFFLVPTMIQaLLQHP--------RFATTDLSSLR-AVIYGGAPMPERLLRALQA---------- 235
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1337 avstsFGCRvnlaiclqphrLWTLAEQGTSGPdPTTVyvdMRALRHDRvRLVERGSPHslplmesgkilPGVRIIIANPE 1416
Cdd:cd17631    236 -----RGVK-----------FVQGYGMTETSP-GVTF---LSPEDHRR-KLGSAGRPV-----------FFVEVRIVDPD 283
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1417 TKgPLGDSHLGEIWVHSAHNASGYftiYGDESLQSDhfnsrlSFGDTqtiWARTGYLGFLrrteltDANGerhdALYVVG 1496
Cdd:cd17631    284 GR-EVPPGEVGEIVVRGPHVMAGY---WNRPEATAA------AFRDG---WFHTGDLGRL------DEDG----YLYIVD 340
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1958706156 1497 ALDEAMELRGMRYHPIDIEtSVIRAHKSVTECAVF-----TWTNLLV-VVVELDGSEQEALDLV 1554
Cdd:cd17631    341 RKKDMIISGGENVYPAEVE-DVLYEHPAVAEVAVIgvpdeKWGEAVVaVVVPRPGAELDEDELI 403
PRK12316 PRK12316
peptide synthase; Provisional
322-858 4.47e-07

peptide synthase; Provisional


Pssm-ID: 237054 [Multi-domain]  Cd Length: 5163  Bit Score: 55.35  E-value: 4.47e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  322 FEELLEVQQPDPNQPKPEGAqmLATRGEQLGVVTNWPPSLEAALQRWGT---------ISPKAPCLTTMDTNgkplyiLT 392
Cdd:PRK12316  1959 LLHLLEQMAEDAQAALGELA--LLDAGERQRILADWDRTPEAYPRGPGVhqriaeqaaRAPEAIAVVFGDQH------LS 2030
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  393 YGKLWTRSMKVAYNILhKLGtkqepmVRPGDRVALVFPNNdpAAFMVAFYGCLLA--EVVPVPIEVPLTRkdagsqqIGF 470
Cdd:PRK12316  2031 YAELDSRANRLAHRLR-ARG------VGPEVRVAIAAERS--FELVVALLAVLKAggAYVPLDPNYPAER-------LAY 2094
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  471 LLGSCGVTVALTSDACHKGLPksPTGEIPQFKGWPKLLWFVTESKHlskpprdwfPHIKDANNDTAYIEYKTCKDGSVLG 550
Cdd:PRK12316  2095 MLEDSGAALLLTQRHLLERLP--LPAGVARLPLDRDAEWADYPDTA---------PAVQLAGENLAYVIYTSGSTGLPKG 2163
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  551 VTVTRIALLTHCQALTQACGYTEAETIVNVLDFKKDVGLWhGILTSVMNMMHVISIPYALMkvNPLSWIQKVCQYKAKVA 630
Cdd:PRK12316  2164 VAVSHGALVAHCQAAGERYELSPADCELQFMSFSFDGAHE-QWFHPLLNGARVLIRDDELW--DPEQLYDEMERHGVTIL 2240
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  631 CVKSrdMHWALVAHRDQRDVNLSSLRMLIVadGANPWSISSCDAFLN------VFQSKGLRQEVICP-----CASSPE-A 698
Cdd:PRK12316  2241 DFPP--VYLQQLAEHAERDGRPPAVRVYCF--GGEAVPAASLRLAWEalrpvyLFNGYGPTEAVVTPllwkcRPQDPCgA 2316
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  699 LTVAIRRPTDDsnqppgrgvlsmhgltygvirvdseEKLSVLtvqDVGLvmpgaimcsvkpdgvpQLCRTDEIGELCVCA 778
Cdd:PRK12316  2317 AYVPIGRALGN-------------------------RRAYIL---DADL----------------NLLAPGMAGELYLGG 2352
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  779 VATGTSYYGLSGMTKNTFEVFPMTSSGAPIseypfIRTGLLGFVGPGGLVFVVGKMDGLMVVSGRRHNADDIVATALAVE 858
Cdd:PRK12316  2353 EGLARGYLNRPGLTAERFVPDPFSASGERL-----YRTGDLARYRADGVVEYLGRIDHQVKIRGFRIELGEIEARLQAHP 2427
A_NRPS_Cytc1-like cd17643
similar to adenylation domain of cytotrienin synthetase CytC1; This family of the adenylation ...
1046-1530 5.44e-07

similar to adenylation domain of cytotrienin synthetase CytC1; This family of the adenylation (A) domain of nonribosomal peptide synthases (NRPS) includes Streptomyces sp. cytotrienin synthetase (CytC1), a relatively promiscuous adenylation enzyme that installs the aminoacyl moieties on the phosphopantetheinyl arm of the holo carrier protein CytC2. Also included are Streptomyces sp Thr1, involved in the biosynthesis of 4-chlorothreonine, Pseudomonas aeruginosa pyoverdine synthetase D (PvdD), involved in the biosynthesis of the siderophore pyoverdine and Pseudomonas syringae syringopeptin synthetase, where syringpeptin is a necrosis-inducing phytotoxin that functions as a virulence determinant in the plant-pathogen interaction. The adenylation (A) domain of NRPS recognizes a specific amino acid or hydroxy acid and activates it as an (amino) acyl adenylate by hydrolysis of ATP. The activated acyl moiety then forms a thioester bond to the enzyme-bound cofactor phosphopantetheine of a peptidyl carrier protein domain. NRPSs are large multifunctional enzymes which synthesize many therapeutically useful peptides in bacteria and fungi via a template-directed, nucleic acid independent nonribosomal mechanism. These natural products include antibiotics, immunosuppressants, plant and animal toxins, and enzyme inhibitors. NRPS has a distinct modular structure in which each module is responsible for the recognition, activation, and in some cases, modification of a single amino acid residue of the final peptide product. The modules can be subdivided into domains that catalyze specific biochemical reactions.


Pssm-ID: 341298 [Multi-domain]  Cd Length: 450  Bit Score: 54.24  E-value: 5.44e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1046 SLTCVQLHKRAEKIAVMLMERGhLQDGDHVALVYPPGIDLIAAFYGCLYAGC--VPITVRPPHPQNIAttlptvkmIVEV 1123
Cdd:cd17643     12 RLTYGELDARANRLARTLRAEG-VGPGDRVALALPRSAELIVALLAILKAGGayVPIDPAYPVERIAF--------ILAD 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1124 SRSACLMTTqlickllrsreaaaavdvrtwplildtddlpkkrpaqiykpsnPDTLAYLDFSVSTTGMLAGVKMSHAATS 1203
Cdd:cd17643     83 SGPSLLLTD-------------------------------------------PDDLAYVIYTSGSTGRPKGVVVSHANVL 119
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1204 AFCRSIKLQCELYPSREVAICldPYCGLGFVLWCLCSVYSGHQSILIPPSELETNPALWLLAVSQYKVrdtfcsySVMel 1283
Cdd:cd17643    120 ALFAATQRWFGFNEDDVWTLF--HSYAFDFSVWEIWGALLHGGRLVVVPYEVARSPEDFARLLRDEGV-------TVL-- 188
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1284 ctkglgSQT--------ESLKARGLDLSRVRTcVVVAEERPRIALTQSFSKLFKDlgLHPRAvstsfgcrVNLaiclqph 1355
Cdd:cd17643    189 ------NQTpsafyqlvEAADRDGRDPLALRY-VIFGGEALEAAMLRPWAGRFGL--DRPQL--------VNM------- 244
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1356 rlwtlaeqgtSGPDPTTVYVDMRALRHDRVRLVErGSPhslplmeSGKILPGVRIIIANpETKGPLGDSHLGEIWVHSAH 1435
Cdd:cd17643    245 ----------YGITETTVHVTFRPLDAADLPAAA-ASP-------IGRPLPGLRVYVLD-ADGRPVPPGVVGELYVSGAG 305
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1436 NASGYFtiyGDESLQSDHFNSrLSFGDTQTIWARTGYLGflRRTeltdANGErhdaLYVVGALDEAMELRGMRYHPIDIE 1515
Cdd:cd17643    306 VARGYL---GRPELTAERFVA-NPFGGPGSRMYRTGDLA--RRL----PDGE----LEYLGRADEQVKIRGFRIELGEIE 371
                          490
                   ....*....|....*
gi 1958706156 1516 tSVIRAHKSVTECAV 1530
Cdd:cd17643    372 -AALATHPSVRDAAV 385
A_NRPS_Srf_like cd12117
The adenylation domain of nonribosomal peptide synthetases (NRPS), including Bacillus subtilis ...
1019-1564 7.54e-07

The adenylation domain of nonribosomal peptide synthetases (NRPS), including Bacillus subtilis termination module Surfactin (SrfA-C); The adenylation (A) domain of NRPS recognizes a specific amino acid or hydroxy acid and activates it as an (amino) acyl adenylate by hydrolysis of ATP. The activated acyl moiety then forms a thioester to the enzyme-bound cofactor phosphopantetheine of a peptidyl carrier protein domain. NRPSs are large multifunctional enzymes which synthesize many therapeutically useful peptides in bacteria and fungi via a template-directed, nucleic acid independent nonribosomal mechanism. These natural products include antibiotics, immunosuppressants, plant and animal toxins, and enzyme inhibitors. NRPS has a distinct modular structure in which each module is responsible for the recognition, activation, and, in some cases, modification of a single amino acid residue of the final peptide product. The modules can be subdivided into domains that catalyze specific biochemical reactions. This family includes the adenylation domain of the Bacillus subtilis termination module (Surfactin domain, SrfA-C) which recognizes a specific amino acid building block, which is then activated and transferred to the terminal thiol of the 4'-phosphopantetheine (Ppan) arm of the downstream peptidyl carrier protein (PCP) domain.


Pssm-ID: 341282 [Multi-domain]  Cd Length: 483  Bit Score: 53.74  E-value: 7.54e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1019 EVLQWRAQTTPDHllyTLLNCRGTianSLTCVQLHKRAEKIAVMLMERGhLQDGDHVALVYPPGIDLIAAFYGCLYAGCV 1098
Cdd:cd12117      1 ELFEEQAARTPDA---VAVVYGDR---SLTYAELNERANRLARRLRAAG-VGPGDVVGVLAERSPELVVALLAVLKAGAA 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1099 PITVRPPHPQNiattlpTVKMIVEVSRSACLMTtqlickllrSREAAAAVDVRTWPLILDtDDLPKKRPAQIYKPSNPDT 1178
Cdd:cd12117     74 YVPLDPELPAE------RLAFMLADAGAKVLLT---------DRSLAGRAGGLEVAVVID-EALDAGPAGNPAVPVSPDD 137
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1179 LAYLDFSVSTTGMLAGVKMSHAATSAFCRS---IKLQcelypSREVAICLDPYC--GLGFVLWclCSVYSGHQSILIPPS 1253
Cdd:cd12117    138 LAYVMYTSGSTGRPKGVAVTHRGVVRLVKNtnyVTLG-----PDDRVLQTSPLAfdASTFEIW--GALLNGARLVLAPKG 210
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1254 ELETNPALwLLAVSQYKVrdtfcsySVMELcTKGLGSQteslkargldlsrvrtcvvVAEERPrialtQSFSKLfkdlgl 1333
Cdd:cd12117    211 TLLDPDAL-GALIAEEGV-------TVLWL-TAALFNQ-------------------LADEDP-----ECFAGL------ 251
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1334 hpRAVSTSfGCRVNLAIClqphRLWTLAEQGTS-----GPDPTTVYvdmrALRHdrvrLVERG--SPHSLPLmesGKILP 1406
Cdd:cd12117    252 --RELLTG-GEVVSPPHV----RRVLAACPGLRlvngyGPTENTTF----TTSH----VVTELdeVAGSIPI---GRPIA 313
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1407 GVRIIIANpETKGPLGDSHLGEIWVHSAHNASGYftiYGDESLQSDHFnSRLSFGDTQTIWaRTGYLGFLRrteltdANG 1486
Cdd:cd12117    314 NTRVYVLD-EDGRPVPPGVPGELYVGGDGLALGY---LNRPALTAERF-VADPFGPGERLY-RTGDLARWL------PDG 381
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958706156 1487 ErhdaLYVVGALDEAMELRGMRYHPIDIETsVIRAHKSVTECAvftwtnllVVVVELDGSEQEaldlvpLVTNVVLEE 1564
Cdd:cd12117    382 R----LEFLGRIDDQVKIRGFRIELGEIEA-ALRAHPGVREAV--------VVVREDAGGDKR------LVAYVVAEG 440
PRK08316 PRK08316
acyl-CoA synthetase; Validated
391-485 8.94e-07

acyl-CoA synthetase; Validated


Pssm-ID: 181381 [Multi-domain]  Cd Length: 523  Bit Score: 53.40  E-value: 8.94e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  391 LTYGKLWTRSMKVAyNILHKLGtkqepmVRPGDRVALVFPNNDpaAFMVAFYGCLLAEVVPVPIEVPLTRKDagsqqIGF 470
Cdd:PRK08316    37 WTYAELDAAVNRVA-AALLDLG------LKKGDRVAALGHNSD--AYALLWLACARAGAVHVPVNFMLTGEE-----LAY 102
                           90
                   ....*....|....*
gi 1958706156  471 LLGSCGVTVALTSDA 485
Cdd:PRK08316   103 ILDHSGARAFLVDPA 117
A_NRPS_Bac cd17655
bacitracin synthetase and related proteins; This family of the adenylation (A) domain of ...
1019-1199 1.93e-06

bacitracin synthetase and related proteins; This family of the adenylation (A) domain of nonribosomal peptide synthases (NRPS) includes bacitracin synthetases 1, 2, and 3 (BA1, also known as ATP-dependent cysteine adenylase or cysteine activase, BA2, also known as ATP-dependent lysine adenylase or lysine activase, and BA3, also known as ATP-dependent isoleucine adenylase or isoleucine activase) in Bacilli. Bacitracin is a mixture of related cyclic peptides used as a polypeptide antibiotic. This family also includes gramicidin synthetase 1 involved in synthesis of the cyclic peptide antibiotic gramicidin S via activation of phenylalanine. NRPSs are large multifunctional enzymes which synthesize many therapeutically useful peptides in bacteria and fungi via a template-directed, nucleic acid independent nonribosomal mechanism. These natural products include antibiotics, immunosuppressants, plant and animal toxins, and enzyme inhibitors. NRPS has a distinct modular structure in which each module is responsible for the recognition, activation, and in some cases, modification of a single amino acid residue of the final peptide product. The modules can be subdivided into domains that catalyze specific biochemical reactions.


Pssm-ID: 341310 [Multi-domain]  Cd Length: 490  Bit Score: 52.33  E-value: 1.93e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1019 EVLQWRAQTTPDHllyTLLNCRGtiaNSLTCVQLHKRAEKIAVMLMERGhLQDGDHVALVYPPGIDLIAAFYGCLYAGC- 1097
Cdd:cd17655      1 ELFEEQAEKTPDH---TAVVFED---QTLTYRELNERANQLARTLREKG-VGPDTIVGIMAERSLEMIVGILGILKAGGa 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1098 -VPITvrPPHPQNiattlpTVKMIVEVSRSACLMTTQLICKLLRSREAAaavdvrtwpLILDTDDLPKKRPAQIYKPSNP 1176
Cdd:cd17655     74 yLPID--PDYPEE------RIQYILEDSGADILLTQSHLQPPIAFIGLI---------DLLDEDTIYHEESENLEPVSKS 136
                          170       180
                   ....*....|....*....|...
gi 1958706156 1177 DTLAYLDFSVSTTGMLAGVKMSH 1199
Cdd:cd17655    137 DDLAYVIYTSGSTGKPKGVMIEH 159
PRK07768 PRK07768
long-chain-fatty-acid--CoA ligase; Validated
1173-1530 4.36e-06

long-chain-fatty-acid--CoA ligase; Validated


Pssm-ID: 236091 [Multi-domain]  Cd Length: 545  Bit Score: 51.15  E-value: 4.36e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1173 PSNPDTLAYLDFSVSTTGMLAGVKMSHAATSAFCRSIKLQCELYPSREVAICLDPYC-GLGFVLWCLCSVYSGHQSILIP 1251
Cdd:PRK07768   148 ETGEDDLALMQLTSGSTGSPKAVQITHGNLYANAEAMFVAAEFDVETDVMVSWLPLFhDMGMVGFLTVPMYFGAELVKVT 227
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1252 PSELETNPALWLLAVSQYKVRDTFCSYSVMELCTKGLGSQTESlkaRGLDLSRVRtCVVVAEERPRIALTQSFSKLFKDL 1331
Cdd:PRK07768   228 PMDFLRDPLLWAELISKYRGTMTAAPNFAYALLARRLRRQAKP---GAFDLSSLR-FALNGAEPIDPADVEDLLDAGARF 303
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1332 GLHPRAVSTSFG-CRVNLAICLQPhrlwtlaeqgtSGPDPTTVYVDMRALRHDRvRLVERGSPHSLPLMESGKILPG--V 1408
Cdd:PRK07768   304 GLRPEAILPAYGmAEATLAVSFSP-----------CGAGLVVDEVDADLLAALR-RAVPATKGNTRRLATLGPPLPGleV 371
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1409 RIIIANPEtkgPLGDSHLGEIWVHSAHNASGYFTIYGDESLQSDHfnsrlsfGdtqtiWARTGYLGFLrrTEltdaNGEr 1488
Cdd:PRK07768   372 RVVDEDGQ---VLPPRGVGVIELRGESVTPGYLTMDGFIPAQDAD-------G-----WLDTGDLGYL--TE----EGE- 429
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|..
gi 1958706156 1489 hdaLYVVGALDEAMELRGMRYHPIDIETSVIRAHKSVTECAV 1530
Cdd:PRK07768   430 ---VVVCGRVKDVIIMAGRNIYPTDIERAAARVEGVRPGNAV 468
Acs COG0365
Acyl-coenzyme A synthetase/AMP-(fatty) acid ligase [Lipid transport and metabolism];
362-485 5.41e-06

Acyl-coenzyme A synthetase/AMP-(fatty) acid ligase [Lipid transport and metabolism];


Pssm-ID: 440134 [Multi-domain]  Cd Length: 565  Bit Score: 50.88  E-value: 5.41e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  362 EAALQRWGTISPKAPCLTTMDTNGKPlYILTYGKLWTRSMKVAyNILHKLGtkqepmVRPGDRVALVFPNNdPAAfMVAF 441
Cdd:COG0365     12 YNCLDRHAEGRGDKVALIWEGEDGEE-RTLTYAELRREVNRFA-NALRALG------VKKGDRVAIYLPNI-PEA-VIAM 81
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1958706156  442 YGCLLAEVVPVPIeVPLTRKDAgsqqIGFLLGSCGVTVALTSDA 485
Cdd:COG0365     82 LACARIGAVHSPV-FPGFGAEA----LADRIEDAEAKVLITADG 120
PRK08314 PRK08314
long-chain-fatty-acid--CoA ligase; Validated
391-482 6.47e-06

long-chain-fatty-acid--CoA ligase; Validated


Pssm-ID: 236235 [Multi-domain]  Cd Length: 546  Bit Score: 50.73  E-value: 6.47e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  391 LTYGKLWTRSMKVAYNILHKLGtkqepmVRPGDRVALvFPNNDPAaFMVAFYGCLLAEVVPVPIEvPLTRkdagSQQIGF 470
Cdd:PRK08314    36 ISYRELLEEAERLAGYLQQECG------VRKGDRVLL-YMQNSPQ-FVIAYYAILRANAVVVPVN-PMNR----EEELAH 102
                           90
                   ....*....|..
gi 1958706156  471 LLGSCGVTVALT 482
Cdd:PRK08314   103 YVTDSGARVAIV 114
PRK05851 PRK05851
long-chain-fatty acid--ACP ligase MbtM;
581-946 8.13e-06

long-chain-fatty acid--ACP ligase MbtM;


Pssm-ID: 180289 [Multi-domain]  Cd Length: 525  Bit Score: 50.53  E-value: 8.13e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  581 LDFKKDVG-----LWHG-----ILTSVMNMMHVISIPYALMKVNPLSWIQKVCQYKAKVacVKSRDMHWALVAH--RDQR 648
Cdd:PRK05851   190 LDAATDVGcswlpLYHDmglafLLTAALAGAPLWLAPTTAFSASPFRWLSWLSDSRATL--TAAPNFAYNLIGKyaRRVS 267
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  649 DVNLSSLRMLIvaDGANPWSISSCDAFLNVFQSKGLRQEVICPCASSPEAlTVAIRRPTddsnqpPGRGVLsmhgltygV 728
Cdd:PRK05851   268 DVDLGALRVAL--NGGEPVDCDGFERFATAMAPFGFDAGAAAPSYGLAES-TCAVTVPV------PGIGLR--------V 330
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  729 IRVDSEEKLSVLTVQDVGLVMPGA---IMCSVKPDGVPQlcrtDEIGELCVCAVATGTSYYGlsgmtkntfevfpmtssG 805
Cdd:PRK05851   331 DEVTTDDGSGARRHAVLGNPIPGMevrISPGDGAAGVAG----REIGEIEIRGASMMSGYLG-----------------Q 389
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  806 APISEYPFIRTGLLGFVGPGGLVfVVGKMDGLMVVSGRRHNADDIVATALAVEPmkfVYRGRIavfsVTVLHDE-----R 880
Cdd:PRK05851   390 APIDPDDWFPTGDLGYLVDGGLV-VCGRAKELITVAGRNIFPTEIERVAAQVRG---VREGAV----VAVGTGEgsarpG 461
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1958706156  881 IVIVAEQRpdSTEEDSFQwmSRVLQAIDSihQVGVyclalVPAN-------TLPKTPLGGIHLSETKQLFLEG 946
Cdd:PRK05851   462 LVIAAEFR--GPDEAGAR--SEVVQRVAS--ECGV-----VPSDvvfvapgSLPRTSSGKLRRLAVKRSLEAA 523
FACL_FadD13-like cd17631
fatty acyl-CoA synthetase, including FadD13; This family contains fatty acyl-CoA synthetases, ...
365-481 1.16e-05

fatty acyl-CoA synthetase, including FadD13; This family contains fatty acyl-CoA synthetases, including Mycobacterium tuberculosis acid-induced operon MymA encoding the fatty acyl-CoA synthetase FadD13 which is essential for virulence and intracellular growth of the pathogen. The fatty acyl-CoA synthetase activates lipids before entering into the metabolic pathways and is also involved in transmembrane lipid transport. However, unlike soluble fatty acyl-CoA synthetases, but like the mammalian integral-membrane very-long-chain acyl-CoA synthetases, FadD13 accepts lipid substrates up to the maximum length of C26, and this is facilitated by an extensive hydrophobic tunnel from the active site to a positively charged patch. Also included is feruloyl-CoA synthetase (Fcs) in Rhodococcus strains where it is involved in biotechnological vanillin production from eugenol and ferulic acid via a non-beta-oxidative pathway.


Pssm-ID: 341286 [Multi-domain]  Cd Length: 435  Bit Score: 49.92  E-value: 1.16e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  365 LQRWGTISPKAPCLTTMDTngkplyILTYGKLWTRSMKVAyNILHKLGtkqepmVRPGDRVALVFPNNDpaAFMVAFYGC 444
Cdd:cd17631      1 LRRRARRHPDRTALVFGGR------SLTYAELDERVNRLA-HALRALG------VAKGDRVAVLSKNSP--EFLELLFAA 65
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1958706156  445 LLAEVVPVPIEVPLTRKDagsqqIGFLLGSCGVTVAL 481
Cdd:cd17631     66 ARLGAVFVPLNFRLTPPE-----VAYILADSGAKVLF 97
PRK06178 PRK06178
acyl-CoA synthetase; Validated
357-485 1.59e-05

acyl-CoA synthetase; Validated


Pssm-ID: 235724 [Multi-domain]  Cd Length: 567  Bit Score: 49.65  E-value: 1.59e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  357 WPP-------------SLEAALQRWGTISPKAPCLttmDTNGkplYILTYGKLWTRSMKVAyNILHKLGtkqepmVRPGD 423
Cdd:PRK06178    18 WPAgiprepeyphgerPLTEYLRAWARERPQRPAI---IFYG---HVITYAELDELSDRFA-ALLRQRG------VGAGD 84
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1958706156  424 RVALVFPNNdpAAFMVAFYGCLLAEVVPVPIEvPLTRKdagsQQIGFLLGSCGVTVALTSDA 485
Cdd:PRK06178    85 RVAVFLPNC--PQFHIVFFGILKLGAVHVPVS-PLFRE----HELSYELNDAGAEVLLALDQ 139
entF PRK10252
enterobactin non-ribosomal peptide synthetase EntF;
1017-1201 2.47e-05

enterobactin non-ribosomal peptide synthetase EntF;


Pssm-ID: 236668 [Multi-domain]  Cd Length: 1296  Bit Score: 49.27  E-value: 2.47e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1017 LSEVLQWRAQTTPDHllyTLLNCRGTianSLTCVQLHKRAEKIAVMLMERGhLQDGDHVALVYPPGIDLIAAFYGCLYAG 1096
Cdd:PRK10252   460 LSALVAQQAAKTPDA---PALADARY---QFSYREMREQVVALANLLRERG-VKPGDSVAVALPRSVFLTLALHAIVEAG 532
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1097 CVPITVRPPHPQNiattlpTVKMIVEVSRSACLMTTQlickLLRSREAAAAVDVRTWPLILDTDdlPKKRPAQIYKPSNP 1176
Cdd:PRK10252   533 AAWLPLDTGYPDD------RLKMMLEDARPSLLITTA----DQLPRFADVPDLTSLCYNAPLAP--QGAAPLQLSQPHHT 600
                          170       180
                   ....*....|....*....|....*
gi 1958706156 1177 dtlAYLDFSVSTTGMLAGVKMSHAA 1201
Cdd:PRK10252   601 ---AYIIFTSGSTGRPKGVMVGQTA 622
A_NRPS_LgrA-like cd17645
adenylation (A) domain of linear gramicidin synthetase (LgrA) and similar proteins; This ...
1019-1206 3.03e-05

adenylation (A) domain of linear gramicidin synthetase (LgrA) and similar proteins; This family of the adenylation (A) domain of nonribosomal peptide synthases (NRPS) includes linear gramicidin synthetase (LgrA) in Brevibacillus brevis. LgrA has a formylation domain fused to the N-terminal end that formylates its substrate for linear gramicidin synthesis to proceed. This formyl group is essential for the clinically important antibacterial activity of gramicidin by enabling head-to-head gramicidin dimers to make a beta-helical pore in gram-positive bacterial membranes, allowing free passage of monovalent cations, destroying the ion gradient and killing bacteria. This family also includes bacitracin synthetase 1 (known as ATP-dependent cysteine adenylase or BA1); it activates cysteine, incorporates two D-amino acids, releases and cyclizes the mature bacitracin, an antibiotic that is a mixture of related cyclic peptides that disrupt gram positive bacteria by interfering with cell wall and peptidoglycan synthesis. Also included is surfactin synthetase which activates and polymerizes the amino acids Leu, Glu, Asp, and Val to form the antibiotic surfactin.


Pssm-ID: 341300 [Multi-domain]  Cd Length: 440  Bit Score: 48.32  E-value: 3.03e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1019 EVLQWRAQTTPDHLLYTLlncRGtiaNSLTCVQLHKRAEKIAVMLMERGHLQDgDHVALVYPPGIDLIAAFYGCLYAGCV 1098
Cdd:cd17645      2 QLFEEQVERTPDHVAVVD---RG---QSLTYKQLNEKANQLARHLRGKGVKPD-DQVGIMLDKSLDMIAAILGVLKAGGA 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1099 PITVRPPHPQniattlptvkmivevsrsaclmttQLICKLLRSREAAaavdvrtwpLILdtddlpkkrpaqiykpSNPDT 1178
Cdd:cd17645     75 YVPIDPDYPG------------------------ERIAYMLADSSAK---------ILL----------------TNPDD 105
                          170       180
                   ....*....|....*....|....*...
gi 1958706156 1179 LAYLDFSVSTTGMLAGVKMSHAATSAFC 1206
Cdd:cd17645    106 LAYVIYTSGSTGLPKGVMIEHHNLVNLC 133
PRK07656 PRK07656
long-chain-fatty-acid--CoA ligase; Validated
1046-1205 4.45e-05

long-chain-fatty-acid--CoA ligase; Validated


Pssm-ID: 236072 [Multi-domain]  Cd Length: 513  Bit Score: 47.98  E-value: 4.45e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1046 SLTCVQLHKRAEKIAVMLMERGhLQDGDHVALVYPPGIDLIAAFYGCLYAGCVPITVRPPH-PQNIATTLPT--VKMI-- 1120
Cdd:PRK07656    30 RLTYAELNARVRRAAAALAALG-IGKGDRVAIWAPNSPHWVIAALGALKAGAVVVPLNTRYtADEAAYILARgdAKALfv 108
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1121 ------VEVSRSACLMTTQLICKLLRSREAAAAVDVRTWPLILDTDDlPKKRPAQIykpsNPDTLAYLDFSVSTTGMLAG 1194
Cdd:PRK07656   109 lglflgVDYSATTRLPALEHVVICETEEDDPHTEKMKTFTDFLAAGD-PAERAPEV----DPDDVADILFTSGTTGRPKG 183
                          170
                   ....*....|..
gi 1958706156 1195 VKMSHAAT-SAF 1205
Cdd:PRK07656   184 AMLTHRQLlSNA 195
EntE COG1021
EntE, 2,3-dihydroxybenzoate-AMP synthase component of non-ribosomal peptide synthetase ...
1017-1106 6.43e-05

EntE, 2,3-dihydroxybenzoate-AMP synthase component of non-ribosomal peptide synthetase [Secondary metabolites biosynthesis, transport and catabolism];


Pssm-ID: 440644 [Multi-domain]  Cd Length: 533  Bit Score: 47.45  E-value: 6.43e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1017 LSEVLQWRAQTTPDHllyTLLNCRGTianSLTCVQLHKRAEKIAVMLMERGhLQDGDHVALVYPPGIDLIAAFYGCLYAG 1096
Cdd:COG1021     27 LGDLLRRRAERHPDR---IAVVDGER---RLSYAELDRRADRLAAGLLALG-LRPGDRVVVQLPNVAEFVIVFFALFRAG 99
                           90
                   ....*....|
gi 1958706156 1097 CVPITVRPPH 1106
Cdd:COG1021    100 AIPVFALPAH 109
EntE COG1021
EntE, 2,3-dihydroxybenzoate-AMP synthase component of non-ribosomal peptide synthetase ...
391-452 9.14e-05

EntE, 2,3-dihydroxybenzoate-AMP synthase component of non-ribosomal peptide synthetase [Secondary metabolites biosynthesis, transport and catabolism];


Pssm-ID: 440644 [Multi-domain]  Cd Length: 533  Bit Score: 47.06  E-value: 9.14e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1958706156  391 LTYGKLWTRSMKVAYNiLHKLGtkqepmVRPGDRVALVFPNNdpAAFMVAFYGCLLAEVVPV 452
Cdd:COG1021     51 LSYAELDRRADRLAAG-LLALG------LRPGDRVVVQLPNV--AEFVIVFFALFRAGAIPV 103
AA-adenyl-dom TIGR01733
amino acid adenylation domain; This model represents a domain responsible for the specific ...
392-540 1.70e-04

amino acid adenylation domain; This model represents a domain responsible for the specific recognition of amino acids and activation as adenylyl amino acids. The reaction catalyzed is aa + ATP -> aa-AMP + PPi. These domains are usually found as components of multi-domain non-ribosomal peptide synthetases and are usually called "A-domains" in that context. A-domains are almost invariably followed by "T-domains" (thiolation domains, pfam00550) to which the amino acid adenylate is transferred as a thiol-ester to a bound pantetheine cofactor with the release of AMP (these are also called peptide carrier proteins, or PCPs. When the A-domain does not represent the first module (corresponding to the first amino acid in the product molecule) it is usually preceded by a "C-domain" (condensation domain, pfam00668) which catalyzes the ligation of two amino acid thiol-esters from neighboring modules. This domain is a subset of the AMP-binding domain found in Pfam (pfam00501) which also hits substrate--CoA ligases and luciferases. Sequences scoring in between trusted and noise for this model may be ambiguous as to whether they activate amino acids or other molecules lacking an alpha amino group.


Pssm-ID: 273779 [Multi-domain]  Cd Length: 409  Bit Score: 46.10  E-value: 1.70e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  392 TYGKLWTRSMKVAYNILHKLGtkqepmVRPGDRVALVFPNNdpAAFMVAFYGCLLAEVVPVPIEV--PLTRkdagsqqIG 469
Cdd:TIGR01733    1 TYRELDERANRLARHLRAAGG------VGPGDRVAVLLERS--AELVVAILAVLKAGAAYVPLDPayPAER-------LA 65
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1958706156  470 FLLGSCGVTVALTSDACHKGLPKSPTGEIPqfkgwPKLLWFVTESKHLSKPPRDWFPHikdaNNDTAYIEY 540
Cdd:TIGR01733   66 FILEDAGARLLLTDSALASRLAGLVLPVIL-----LDPLELAALDDAPAPPPPDAPSG----PDDLAYVIY 127
A_NRPS_TlmIV_like cd12114
The adenylation domain of nonribosomal peptide synthetases (NRPS), including ...
373-583 1.98e-04

The adenylation domain of nonribosomal peptide synthetases (NRPS), including Streptoalloteichus tallysomycin biosynthesis genes; The adenylation (A) domain of NRPS recognizes a specific amino acid or hydroxy acid and activates it as an (amino) acyl adenylate by hydrolysis of ATP. The activated acyl moiety then forms a thioester to the enzyme-bound cofactor phosphopantetheine of a peptidyl carrier protein domain. NRPSs are large multifunctional enzymes which synthesize many therapeutically useful peptides in bacteria and fungi via a template-directed, nucleic acid independent nonribosomal mechanism. These natural products include antibiotics, immunosuppressants, plant and animal toxins, and enzyme inhibitors. NRPS has a distinct modular structure in which each module is responsible for the recognition, activation, and in some cases, modification of a single amino acid residue of the final peptide product. The modules can be subdivided into domains that catalyze specific biochemical reactions. This family includes the TLM biosynthetic gene cluster from Streptoalloteichus that consists of nine NRPS genes; the N-terminal module of TlmVI (NRPS-5) and the starter module of BlmVI (NRPS-5) are comprised of the acyl CoA ligase (AL) and acyl carrier protein (ACP)-like domains, which are thought to be involved in the biosynthesis of the beta-aminoalaninamide moiety.


Pssm-ID: 341279 [Multi-domain]  Cd Length: 477  Bit Score: 45.72  E-value: 1.98e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  373 PKAPCLTTMDTNgkplyiLTYGKLWTRSMKVAyNILHKLGtkqepmVRPGDRVALVFPNNDPAafMVAFYGCLLAEVVPV 452
Cdd:cd12114      1 PDATAVICGDGT------LTYGELAERARRVA-GALKAAG------VRPGDLVAVTLPKGPEQ--VVAVLGILAAGAAYV 65
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  453 PIEV--PLTRKDAgsqqigfLLGSCGVTVALTSDACHKGLPKSPTgeipqfkgwpkllwFVTESKHLSKPPRDwFPHIKD 530
Cdd:cd12114     66 PVDIdqPAARREA-------ILADAGARLVLTDGPDAQLDVAVFD--------------VLILDLDALAAPAP-PPPVDV 123
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1958706156  531 ANNDTAYIEYKTCKDGSVLGVTVTRIALLTHCQALTQACGYTEAETIVNV--LDF 583
Cdd:cd12114    124 APDDLAYVIFTSGSTGTPKGVMISHRAALNTILDINRRFAVGPDDRVLALssLSF 178
EntF COG1020
EntF, seryl-AMP synthase component of non-ribosomal peptide synthetase [Secondary metabolites ...
391-590 2.54e-04

EntF, seryl-AMP synthase component of non-ribosomal peptide synthetase [Secondary metabolites biosynthesis, transport and catabolism];


Pssm-ID: 440643 [Multi-domain]  Cd Length: 1329  Bit Score: 46.00  E-value: 2.54e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  391 LTYGKLWTRSMKVAyNILHKLGtkqepmVRPGDRVALVFPNNdpAAFMVAFYGCLLAEVVPVPI--EVPLTRkdagsqqI 468
Cdd:COG1020    502 LTYAELNARANRLA-HHLRALG------VGPGDLVGVCLERS--LEMVVALLAVLKAGAAYVPLdpAYPAER-------L 565
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  469 GFLLGSCGVTVALTSDACHKGLPKSptgeipqfkgwpKLLWFVTESKHLSKPPRDWfPHIKDANNDTAYIEY-------- 540
Cdd:COG1020    566 AYMLEDAGARLVLTQSALAARLPEL------------GVPVLALDALALAAEPATN-PPVPVTPDDLAYVIYtsgstgrp 632
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1958706156  541 KtckdgsvlGVTVTRIALLTHCQALTQACGYTEAETIVNV--LDFkkDVGLW 590
Cdd:COG1020    633 K--------GVMVEHRALVNLLAWMQRRYGLGPGDRVLQFasLSF--DASVW 674
A_NRPS_ProA cd17656
gramicidin S synthase 2, also known as ATP-dependent proline adenylase; This family of the ...
1028-1199 3.34e-04

gramicidin S synthase 2, also known as ATP-dependent proline adenylase; This family of the adenylation (A) domain of nonribosomal peptide synthases (NRPS) contains gramicidin S synthase 2 (also known as ATP-dependent proline adenylase or proline activase or ProA). ProA is a multifunctional enzyme involved in synthesis of the cyclic peptide antibiotic gramicidin S and able to activate and polymerize the amino acids proline, valine, ornithine and leucine. NRPSs are large multifunctional enzymes which synthesize many therapeutically useful peptides in bacteria and fungi via a template-directed, nucleic acid independent nonribosomal mechanism. These natural products include antibiotics, immunosuppressants, plant and animal toxins, and enzyme inhibitors. NRPS has a distinct modular structure in which each module is responsible for the recognition, activation, and in some cases, modification of a single amino acid residue of the final peptide product. The modules can be subdivided into domains that catalyze specific biochemical reactions.


Pssm-ID: 341311 [Multi-domain]  Cd Length: 479  Bit Score: 45.16  E-value: 3.34e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1028 TPDHLLYTLLNCrgtianSLTCVQLHKRAEKIAVMLMERGHLQDgDHVALVYPPGIDLIAAFYGCLYAG--CVPITvrpP 1105
Cdd:cd17656      1 TPDAVAVVFENQ------KLTYRELNERSNQLARFLREKGVKKD-SIVAIMMERSAEMIVGILGILKAGgaFVPID---P 70
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1106 HpqniattLPTVKMIVEVSRSAC-LMTTQLICKLLRSREAAaaVDVRTWPLILDTDDlpkkrpAQIYKPSNPDTLAYLDF 1184
Cdd:cd17656     71 E-------YPEERRIYIMLDSGVrVVLTQRHLKSKLSFNKS--TILLEDPSISQEDT------SNIDYINNSDDLLYIIY 135
                          170
                   ....*....|....*
gi 1958706156 1185 SVSTTGMLAGVKMSH 1199
Cdd:cd17656    136 TSGTTGKPKGVQLEH 150
VL_LC_FACS_like cd05907
Long-chain fatty acid CoA synthetases and Bubblegum-like very long-chain fatty acid CoA ...
391-485 3.55e-04

Long-chain fatty acid CoA synthetases and Bubblegum-like very long-chain fatty acid CoA synthetases; This family includes long-chain fatty acid (C12-C20) CoA synthetases and Bubblegum-like very long-chain (>C20) fatty acid CoA synthetases. FACS catalyzes the formation of fatty acyl-CoA in a two-step reaction: the formation of a fatty acyl-AMP molecule as an intermediate, and the formation of a fatty acyl-CoA. Eukaryotes generally have multiple isoforms of LC-FACS genes with multiple splice variants. For example, nine genes are found in Arabidopsis and six genes are expressed in mammalian cells. Drosophila melanogaster mutant bubblegum (BGM) have elevated levels of very-long-chain fatty acids (VLCFA) caused by a defective gene later named bubblegum. The human homolog (hsBG) of bubblegum has been characterized as a very long chain fatty acid CoA synthetase that functions specifically in the brain; hsBG may play a central role in brain VLCFA metabolism and myelinogenesis. Free fatty acids must be "activated" to their CoA thioesters before participating in most catabolic and anabolic reactions.


Pssm-ID: 341233 [Multi-domain]  Cd Length: 452  Bit Score: 44.89  E-value: 3.55e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  391 LTYGKLWTRSMKVAyNILHKLGtkqepmVRPGDRVALvFPNNDPaAFMVAFYGCLLAEVVPVPIevpltRKDAGSQQIGF 470
Cdd:cd05907      6 ITWAEFAEEVRALA-KGLIALG------VEPGDRVAI-LSRNRP-EWTIADLAILAIGAVPVPI-----YPTSSAEQIAY 71
                           90
                   ....*....|....*
gi 1958706156  471 LLGSCGVTVALTSDA 485
Cdd:cd05907     72 ILNDSEAKALFVEDP 86
PRK12467 PRK12467
peptide synthase; Provisional
1025-1530 3.88e-04

peptide synthase; Provisional


Pssm-ID: 237108 [Multi-domain]  Cd Length: 3956  Bit Score: 45.54  E-value: 3.88e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1025 AQTTPDHLLYTLLNCRgtiansLTCVQLHKRAEKIAVMLMERGHLQDgDHVALVYPPGIDLIAAFYGCLYAGCVPITVRP 1104
Cdd:PRK12467   522 ARQHPERPALVFGEQV------LSYAELNRQANRLAHVLIAAGVGPD-VLVGIAVERSIEMVVGLLAVLKAGGAYVPLDP 594
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1105 PHPQNiattlpTVKMIVEVSRSACLMTTQLICKLLrsreaAAAVDVRTWPLILDTDDLPKKRPAQIYKPSNPDTLAYLDF 1184
Cdd:PRK12467   595 EYPQD------RLAYMLDDSGVRLLLTQSHLLAQL-----PVPAGLRSLCLDEPADLLCGYSGHNPEVALDPDNLAYVIY 663
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1185 SVSTTGMLAGVKMSHAATSAFCRSIKLQCELYPSREVAIcLDPYCGLGFVLWCLCSVYSGHQSILIPPSEletnpalwll 1264
Cdd:PRK12467   664 TSGSTGQPKGVAISHGALANYVCVIAERLQLAADDSMLM-VSTFAFDLGVTELFGALASGATLHLLPPDC---------- 732
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1265 avsqykVRDTFCSYSVMelCTKGLG--SQTES-----LKARGLDLSRVRTCVVVAEErpriALTQSFSKLFKDLGLhpra 1337
Cdd:PRK12467   733 ------ARDAEAFAALM--ADQGVTvlKIVPShlqalLQASRVALPRPQRALVCGGE----ALQVDLLARVRALGP---- 796
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1338 vstsfGCrvnlaiclqphRLWTLaeqgtSGPDPTTVYVDMRALRHDRVrlVERGSPHSLPLMESgkilpGVRIIIA--NP 1415
Cdd:PRK12467   797 -----GA-----------RLINH-----YGPTETTVGVSTYELSDEER--DFGNVPIGQPLANL-----GLYILDHylNP 848
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1416 ETKGPLGDSHLGeiwvhSAHNASGYftiYGDESLQSDHFNSRLSFGDTQTIWaRTGYLGflRRTeltdANGErhdaLYVV 1495
Cdd:PRK12467   849 VPVGVVGELYIG-----GAGLARGY---HRRPALTAERFVPDPFGADGGRLY-RTGDLA--RYR----ADGV----IEYL 909
                          490       500       510
                   ....*....|....*....|....*....|....*
gi 1958706156 1496 GALDEAMELRGMRYHPIDIETSvIRAHKSVTECAV 1530
Cdd:PRK12467   910 GRMDHQVKIRGFRIELGEIEAR-LLAQPGVREAVV 943
PRK04813 PRK04813
D-alanine--poly(phosphoribitol) ligase subunit DltA;
1025-1200 4.83e-04

D-alanine--poly(phosphoribitol) ligase subunit DltA;


Pssm-ID: 235313 [Multi-domain]  Cd Length: 503  Bit Score: 44.50  E-value: 4.83e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1025 AQTTPDHLLYtllNCRGTianSLTCVQLHKRAEKIAVMLMERgHLQDGDHVaLVY----PpgiDLIAAFYGCLYAGC--V 1098
Cdd:PRK04813    12 AQTQPDFPAY---DYLGE---KLTYGQLKEDSDALAAFIDSL-KLPDKSPI-IVFghmsP---EMLATFLGAVKAGHayI 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1099 PITVRPPhpqniattLPTVKMIVEVSRSACLMTTqlickllrSREAAAAVDVRTwpLILD--TDDLPKKRPAQIYKPSNP 1176
Cdd:PRK04813    81 PVDVSSP--------AERIEMIIEVAKPSLIIAT--------EELPLEILGIPV--ITLDelKDIFATGNPYDFDHAVKG 142
                          170       180
                   ....*....|....*....|....
gi 1958706156 1177 DTLAYLDFSVSTTGMLAGVKMSHA 1200
Cdd:PRK04813   143 DDNYYIIFTSGTTGKPKGVQISHD 166
FACL_fum10p_like cd05926
Subfamily of fatty acid CoA ligase (FACL) similar to Fum10p of Gibberella moniliformis; FACL ...
391-484 7.21e-04

Subfamily of fatty acid CoA ligase (FACL) similar to Fum10p of Gibberella moniliformis; FACL catalyzes the formation of fatty acyl-CoA in a two-step reaction: the formation of a fatty acyl-AMP molecule as an intermediate, followed by the formation of a fatty acyl-CoA. This is a required step before free fatty acids can participate in most catabolic and anabolic reactions. Fum10p is a fatty acid CoA ligase involved in the synthesis of fumonisin, a polyketide mycotoxin, in Gibberella moniliformis.


Pssm-ID: 341249 [Multi-domain]  Cd Length: 493  Bit Score: 44.23  E-value: 7.21e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  391 LTYGKLWTRSMKVAyNILHKLGtkqepmVRPGDRVALVFPNNdpAAFMVAFYGCLLAEVVPVPIEvPLTRKDagsqQIGF 470
Cdd:cd05926     15 LTYADLAELVDDLA-RQLAALG------IKKGDRVAIALPNG--LEFVVAFLAAARAGAVVAPLN-PAYKKA----EFEF 80
                           90
                   ....*....|....
gi 1958706156  471 LLGSCGVTVALTSD 484
Cdd:cd05926     81 YLADLGSKLVLTPK 94
FAA1 COG1022
Long-chain acyl-CoA synthetase (AMP-forming) [Lipid transport and metabolism];
391-485 9.58e-04

Long-chain acyl-CoA synthetase (AMP-forming) [Lipid transport and metabolism];


Pssm-ID: 440645 [Multi-domain]  Cd Length: 603  Bit Score: 43.94  E-value: 9.58e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  391 LTYGKLWTRSMKVAYNiLHKLGtkqepmVRPGDRVALvFPNNDPaAFMVAFYGCLLAEVVPVPIEVpltrkDAGSQQIGF 470
Cdd:COG1022     41 LTWAEFAERVRALAAG-LLALG------VKPGDRVAI-LSDNRP-EWVIADLAILAAGAVTVPIYP-----TSSAEEVAY 106
                           90
                   ....*....|....*
gi 1958706156  471 LLGSCGVTVALTSDA 485
Cdd:COG1022    107 ILNDSGAKVLFVEDQ 121
BCL_like cd05919
Benzoate CoA ligase (BCL) and similar adenylate forming enzymes; This family contains benzoate ...
391-462 9.81e-04

Benzoate CoA ligase (BCL) and similar adenylate forming enzymes; This family contains benzoate CoA ligase (BCL) and related ligases that catalyze the acylation of benzoate derivatives, 2-aminobenzoate and 4-hydroxybenzoate. Aromatic compounds represent the second most abundant class of organic carbon compounds after carbohydrates. Xenobiotic aromatic compounds are also a major class of man-made pollutants. Some bacteria use benzoate as the sole source of carbon and energy through benzoate degradation. Benzoate degradation starts with its activation to benzoyl-CoA by benzoate CoA ligase. The reaction catalyzed by benzoate CoA ligase proceeds via a two-step process; the first ATP-dependent step forms an acyl-AMP intermediate, and the second step forms the acyl-CoA ester with release of the AMP.


Pssm-ID: 341243 [Multi-domain]  Cd Length: 436  Bit Score: 43.60  E-value: 9.81e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1958706156  391 LTYGKLWTRSMKVAyNILHKLGtkqepmVRPGDRVALVFPnnDPAAFMVAFYGCLLAEVVPVPIEVPLTRKD 462
Cdd:cd05919     11 VTYGQLHDGANRLG-SALRNLG------VSSGDRVLLLML--DSPELVQLFLGCLARGAIAVVINPLLHPDD 73
A_NRPS cd05930
The adenylation domain of nonribosomal peptide synthetases (NRPS); The adenylation (A) domain ...
391-930 1.11e-03

The adenylation domain of nonribosomal peptide synthetases (NRPS); The adenylation (A) domain of NRPS recognizes a specific amino acid or hydroxy acid and activates it as an (amino) acyl adenylate by hydrolysis of ATP. The activated acyl moiety then forms a thioester bond to the enzyme-bound cofactor phosphopantetheine of a peptidyl carrier protein domain. NRPSs are large multifunctional enzymes which synthesize many therapeutically useful peptides in bacteria and fungi via a template-directed, nucleic acid independent nonribosomal mechanism. These natural products include antibiotics, immunosuppressants, plant and animal toxins, and enzyme inhibitors. NRPS has a distinct modular structure in which each module is responsible for the recognition, activation, and in some cases, modification of a single amino acid residue of the final peptide product. The modules can be subdivided into domains that catalyze specific biochemical reactions.


Pssm-ID: 341253 [Multi-domain]  Cd Length: 444  Bit Score: 43.29  E-value: 1.11e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  391 LTYGKLWTRSMKVAyNILHKLGtkqepmVRPGDRVALVFPNNdpAAFMVAFYGCLLAEVVPVPIEV--PLTRkdagsqqI 468
Cdd:cd05930     13 LTYAELDARANRLA-RYLRERG------VGPGDLVAVLLERS--LEMVVAILAVLKAGAAYVPLDPsyPAER-------L 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  469 GFLLGSCGVTVALTSDachkglpksptgeipqfkgwpkllwfvteskhlskpprdwfphikdanNDTAYIEY-------- 540
Cdd:cd05930     77 AYILEDSGAKLVLTDP------------------------------------------------DDLAYVIYtsgstgkp 108
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  541 KtckdgsvlGVTVTRIALLTHCQALTQACGYTEAETIVNVLDFKKDVGLWhGILTSVMN--MMHVISipyALMKVNPLSW 618
Cdd:cd05930    109 K--------GVMVEHRGLVNLLLWMQEAYPLTPGDRVLQFTSFSFDVSVW-EIFGALLAgaTLVVLP---EEVRKDPEAL 176
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  619 IQKVCQYKAKVA-CVKSrdmHW-ALVAHRDQRDvnLSSLRMLIVAdGanpwsisscdaflnvfqskglrqevicpcassp 696
Cdd:cd05930    177 ADLLAEEGITVLhLTPS---LLrLLLQELELAA--LPSLRLVLVG-G--------------------------------- 217
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  697 EALTVAIRRPTDDSNqpPGRGVLSMHGLT-------YGVIRVDSEEKLSVltvqDVGLVMPGAIMCSVKPDGvpQLCRTD 769
Cdd:cd05930    218 EALPPDLVRRWRELL--PGARLVNLYGPTeatvdatYYRVPPDDEEDGRV----PIGRPIPNTRVYVLDENL--RPVPPG 289
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  770 EIGELCVC--AVATGtsYYGLSGMTKntfEVFPMTSSGAPISEYpfiRTGLLGFVGPGGLVFVVGKMDGLMVVSGRRHNA 847
Cdd:cd05930    290 VPGELYIGgaGLARG--YLNRPELTA---ERFVPNPFGPGERMY---RTGDLVRWLPDGNLEFLGRIDDQVKIRGYRIEL 361
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  848 DDIVATALAVEPMkfvyrgRIAVfsVTVLHD----ERIV--IVAEQRPDSTEEDSFQWMSRVLQAidsihqvgvYCL--A 919
Cdd:cd05930    362 GEIEAALLAHPGV------REAA--VVAREDgdgeKRLVayVVPDEGGELDEEELRAHLAERLPD---------YMVpsA 424
                          570
                   ....*....|.
gi 1958706156  920 LVPANTLPKTP 930
Cdd:cd05930    425 FVVLDALPLTP 435
A_NRPS_Ta1_like cd12116
The adenylation domain of nonribosomal peptide synthetases (NRPS), including salinosporamide A ...
391-580 1.40e-03

The adenylation domain of nonribosomal peptide synthetases (NRPS), including salinosporamide A polyketide synthase; The adenylation (A) domain of NRPS recognizes a specific amino acid or hydroxy acid and activates it as an (amino) acyl adenylate by hydrolysis of ATP. The activated acyl moiety then forms a thioester to the enzyme-bound cofactor phosphopantetheine of a peptidyl carrier protein domain. NRPSs are large multifunctional enzymes which synthesize many therapeutically useful peptides in bacteria and fungi via a template-directed, nucleic acid independent nonribosomal mechanism. These natural products include antibiotics, immunosuppressants, plant and animal toxins, and enzyme inhibitors. NRPS has a distinct modular structure in which each module is responsible for the recognition, activation, and in some cases, modification of a single amino acid residue of the final peptide product. The modules can be subdivided into domains that catalyze specific biochemical reactions. This family includes the myxovirescin (TA) antibiotic biosynthetic gene in Myxococcus xanthus; TA production plays a role in predation. It also includes the salinosporamide A polyketide synthase which is involved in the biosynthesis of salinosporamide A, a marine microbial metabolite whose chlorine atom is crucial for potent proteasome inhibition and anticancer activity.


Pssm-ID: 341281 [Multi-domain]  Cd Length: 470  Bit Score: 43.05  E-value: 1.40e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  391 LTYGKLWTRSMKVAyNILHKLGtkqepmVRPGDRVALVFPNNdpAAFMVAFYGCLLAEVVPVPIEvpltrKDAGSQQIGF 470
Cdd:cd12116     13 LSYAELDERANRLA-ARLRARG------VGPGDRVAVYLPRS--ARLVAAMLAVLKAGAAYVPLD-----PDYPADRLRY 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  471 LLGSCGVTVALTSDACHKGLPKSPTgeipqfkGWPKLLWFVTESKHLSKPPrdwfphikDANNDTAYIEYKTCKDGSVLG 550
Cdd:cd12116     79 ILEDAEPALVLTDDALPDRLPAGLP-------VLLLALAAAAAAPAAPRTP--------VSPDDLAYVIYTSGSTGRPKG 143
                          170       180       190
                   ....*....|....*....|....*....|
gi 1958706156  551 VTVTRIALLTHCQALTQACGYTEAETIVNV 580
Cdd:cd12116    144 VVVSHRNLVNFLHSMRERLGLGPGDRLLAV 173
LC_FACS_like cd05935
Putative long-chain fatty acid CoA ligase; The members of this family are putative long-chain ...
391-482 1.46e-03

Putative long-chain fatty acid CoA ligase; The members of this family are putative long-chain fatty acyl-CoA synthetases, which catalyze the ATP-dependent activation of fatty acids in a two-step reaction. The carboxylate substrate first reacts with ATP to form an acyl-adenylate intermediate, which then reacts with CoA to produce an acyl-CoA ester. Fatty acyl-CoA synthetases are responsible for fatty acid degradation as well as physiological regulation of cellular functions via the production of fatty acyl-CoA esters.


Pssm-ID: 341258 [Multi-domain]  Cd Length: 430  Bit Score: 42.85  E-value: 1.46e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  391 LTYGKLWTRSMKVAyNILHKLGtkqepmVRPGDRVALVFPNNdpAAFMVAFYGCLLAEVVPVPIEvPLTRKDagsqQIGF 470
Cdd:cd05935      2 LTYLELLEVVKKLA-SFLSNKG------VRKGDRVGICLQNS--PQYVIAYFAIWRANAVVVPIN-PMLKER----ELEY 67
                           90
                   ....*....|..
gi 1958706156  471 LLGSCGVTVALT 482
Cdd:cd05935     68 ILNDSGAKVAVV 79
FACL_like_6 cd05922
Uncharacterized subfamily of fatty acid CoA ligase (FACL); Fatty acyl-CoA ligases catalyze the ...
1402-1544 2.68e-03

Uncharacterized subfamily of fatty acid CoA ligase (FACL); Fatty acyl-CoA ligases catalyze the ATP-dependent activation of fatty acids in a two-step reaction. The carboxylate substrate first reacts with ATP to form an acyl-adenylate intermediate, which then reacts with CoA to produce an acyl-CoA ester. This is a required step before free fatty acids can participate in most catabolic and anabolic reactions.


Pssm-ID: 341246 [Multi-domain]  Cd Length: 457  Bit Score: 42.04  E-value: 2.68e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1402 GKILPGVRIIIANPEtKGPLGDSHLGEIWVHSAHNASGYFTIYGDEsLQSDHFNSRLsfgdtqtiwaRTGYLGFLrrtel 1481
Cdd:cd05922    289 GLAIPGGEFEILDDD-GTPTPPGEPGEIVHRGPNVMKGYWNDPPYR-RKEGRGGGVL----------HTGDLARR----- 351
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1958706156 1482 tDANGErhdaLYVVGALDEAMELRGMRYHPIDIETSvIRAHKSVTECAVF----TWTNLLVVVVELD 1544
Cdd:cd05922    352 -DEDGF----LFIVGRRDRMIKLFGNRISPTEIEAA-ARSIGLIIEAAAVglpdPLGEKLALFVTAP 412
PRK05605 PRK05605
long-chain-fatty-acid--CoA ligase; Validated
392-452 2.86e-03

long-chain-fatty-acid--CoA ligase; Validated


Pssm-ID: 235531 [Multi-domain]  Cd Length: 573  Bit Score: 42.29  E-value: 2.86e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1958706156  392 TYGKLWTRSMKVAYNiLHKLGtkqepmVRPGDRVALVFPNNdPaAFMVAFYGCLLAEVVPV 452
Cdd:PRK05605    59 TYAELGKQVRRAAAG-LRALG------VRPGDRVAIVLPNC-P-QHIVAFYAVLRLGAVVV 110
FADD10 cd17635
adenylate forming domain, fatty acid CoA ligase (FadD10); This family contains long chain ...
1402-1530 3.70e-03

adenylate forming domain, fatty acid CoA ligase (FadD10); This family contains long chain fatty acid CoA ligases, including FadD10 which is involved in the synthesis of a virulence-related lipopeptide. FadD10 is a fatty acyl-AMP ligase (FAAL) that transfers fatty acids to an acyl carrier protein. Structures of FadD10 in apo- and complexed form with dodecanoyl-AMP, show a novel open conformation, facilitated by its unique inter-domain and intermolecular interactions, which is critical for the enzyme to carry out the acyl transfer onto the acyl carrier protein (Rv0100) rather than coenzyme A.


Pssm-ID: 341290 [Multi-domain]  Cd Length: 340  Bit Score: 41.48  E-value: 3.70e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1402 GKILPGVRIIIANPETKGPLGDSHlGEIWVHSAHNASGYftiYGDESLQSDHFNSRlsfgdtqtiWARTGYLGFLRrtel 1481
Cdd:cd17635    173 GRPYPGVDVYLAATDGIAGPSASF-GTIWIKSPANMLGY---WNNPERTAEVLIDG---------WVNTGDLGERR---- 235
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 1958706156 1482 tdangeRHDALYVVGALDEAMELRGMRYHPIDIEtSVIRAHKSVTECAV 1530
Cdd:cd17635    236 ------EDGFLFITGRSSESINCGGVKIAPDEVE-RIAEGVSGVQECAC 277
PRK05852 PRK05852
fatty acid--CoA ligase family protein;
1407-1531 5.55e-03

fatty acid--CoA ligase family protein;


Pssm-ID: 235625 [Multi-domain]  Cd Length: 534  Bit Score: 41.41  E-value: 5.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1407 GVRIIIANPEtKGPLGDSHLGEIWVHSAHNASGYFtiyGDESLQSDHFNSRlsfgdtqtiWARTGYLGFLrrteltDANG 1486
Cdd:PRK05852   362 GAQIRIVGSD-GLPLPAGAVGEVWLRGTTVVRGYL---GDPTITAANFTDG---------WLRTGDLGSL------SAAG 422
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*
gi 1958706156 1487 ErhdaLYVVGALDEAMELRGMRYHPIDIEtSVIRAHKSVTECAVF 1531
Cdd:PRK05852   423 D----LSIRGRIKELINRGGEKISPERVE-GVLASHPNVMEAAVF 462
FACL_DitJ_like cd05934
Uncharacterized subfamily of fatty acid CoA ligase (FACL); Fatty acyl-CoA ligases catalyze the ...
392-486 6.26e-03

Uncharacterized subfamily of fatty acid CoA ligase (FACL); Fatty acyl-CoA ligases catalyze the ATP-dependent activation of fatty acids in a two-step reaction. The carboxylate substrate first reacts with ATP to form an acyl-adenylate intermediate, which then reacts with CoA to produce an acyl-CoA ester. This is a required step before free fatty acids can participate in most catabolic and anabolic reactions. Members of this family include DitJ from Pseudomonas and similar proteins.


Pssm-ID: 341257 [Multi-domain]  Cd Length: 422  Bit Score: 40.74  E-value: 6.26e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  392 TYGKLWTRSMKVAyNILHKLGtkqepmVRPGDRVALVFPNNdpAAFMVAFYGCLLAEVVPVPIEVplTRKDAGSQQIgfl 471
Cdd:cd05934      5 TYAELLRESARIA-AALAALG------IRPGDRVALMLDNC--PEFLFAWFALAKLGAVLVPINT--ALRGDELAYI--- 70
                           90
                   ....*....|....*
gi 1958706156  472 LGSCGVTVALTSDAC 486
Cdd:cd05934     71 IDHSGAQLVVVDPAS 85
PRK05691 PRK05691
peptide synthase; Validated
1051-1195 6.46e-03

peptide synthase; Validated


Pssm-ID: 235564 [Multi-domain]  Cd Length: 4334  Bit Score: 41.31  E-value: 6.46e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156 1051 QLHKRAEKIAVMLMERGHLQDgDHVALVYPPGIDLIAAFYGCLYAGCVPITVRPPHPqniattLPTVKMIVEVSRSACLM 1130
Cdd:PRK05691  3750 ELNRAANRLGHALRAAGVGVD-QPVALLAERGLDLLGMIVGSFKAGAGYLPLDPGLP------AQRLQRIIELSRTPVLV 3822
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1958706156 1131 TTQlICkLLRSREAAAAVDVRTWPLILDTDDLPKKRPAQ----IYkpSNPDTLAYLDFSVSTTGMLAGV 1195
Cdd:PRK05691  3823 CSA-AC-REQARALLDELGCANRPRLLVWEEVQAGEVAShnpgIY--SGPDNLAYVIYTSGSTGLPKGV 3887
A_NRPS_Srf_like cd12117
The adenylation domain of nonribosomal peptide synthetases (NRPS), including Bacillus subtilis ...
391-540 7.15e-03

The adenylation domain of nonribosomal peptide synthetases (NRPS), including Bacillus subtilis termination module Surfactin (SrfA-C); The adenylation (A) domain of NRPS recognizes a specific amino acid or hydroxy acid and activates it as an (amino) acyl adenylate by hydrolysis of ATP. The activated acyl moiety then forms a thioester to the enzyme-bound cofactor phosphopantetheine of a peptidyl carrier protein domain. NRPSs are large multifunctional enzymes which synthesize many therapeutically useful peptides in bacteria and fungi via a template-directed, nucleic acid independent nonribosomal mechanism. These natural products include antibiotics, immunosuppressants, plant and animal toxins, and enzyme inhibitors. NRPS has a distinct modular structure in which each module is responsible for the recognition, activation, and, in some cases, modification of a single amino acid residue of the final peptide product. The modules can be subdivided into domains that catalyze specific biochemical reactions. This family includes the adenylation domain of the Bacillus subtilis termination module (Surfactin domain, SrfA-C) which recognizes a specific amino acid building block, which is then activated and transferred to the terminal thiol of the 4'-phosphopantetheine (Ppan) arm of the downstream peptidyl carrier protein (PCP) domain.


Pssm-ID: 341282 [Multi-domain]  Cd Length: 483  Bit Score: 40.65  E-value: 7.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  391 LTYGKLWTRSMKVAYNILHKlGtkqepmVRPGDRVALVFPNNdpAAFMVAFYGCLLA--EVVPVPIEVPLTRkdagsqqI 468
Cdd:cd12117     23 LTYAELNERANRLARRLRAA-G------VGPGDVVGVLAERS--PELVVALLAVLKAgaAYVPLDPELPAER-------L 86
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1958706156  469 GFLLGSCGVTVALTsdacHKGLPKSPTGEIPqfkgwpkLLWFVTESKHLSKPPrdwfPHIKDANNDTAYIEY 540
Cdd:cd12117     87 AFMLADAGAKVLLT----DRSLAGRAGGLEV-------AVVIDEALDAGPAGN----PAVPVSPDDLAYVMY 143
PRK08276 PRK08276
long-chain-fatty-acid--CoA ligase; Validated
391-508 7.41e-03

long-chain-fatty-acid--CoA ligase; Validated


Pssm-ID: 236215 [Multi-domain]  Cd Length: 502  Bit Score: 40.66  E-value: 7.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958706156  391 LTYGKLWTRSMKVAyNILHKLGtkqepmVRPGDRVALVFPNNdpAAFMVAFYGCLLAEVVPVPIEVPLTRKDagsqqIGF 470
Cdd:PRK08276    12 VTYGELEARSNRLA-HGLRALG------LREGDVVAILLENN--PEFFEVYWAARRSGLYYTPINWHLTAAE-----IAY 77
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1958706156  471 LLGSCGVTVALTS----DACHKGLPKSP---------TGEIPQFKGWPKLL 508
Cdd:PRK08276    78 IVDDSGAKVLIVSaalaDTAAELAAELPagvplllvvAGPVPGFRSYEEAL 128
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH