Conserved domains on  [gi|1958645893|ref|XP_038969105|]

teneurin-4 isoform X10 [Rattus norvegicus]

Protein Classification

List of domain hits

Name Accession Description Interval E-value
Ten_N pfam06484
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ...
11-410 2.99e-179

Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, probably neural patterning in particular. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. There it probably carries out some transcriptional regulatory activity. The length of this region and its conservation suggest that there may be two structural domains here (personal obs: C Yeats).



Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  Bit Score: 554.59  E-value: 2.99e-179
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893   11 SLT-RRRDAERRYTSSSADSEEGKGP-QKSYSSSETLKAYDQDARLAYGSRVKDMVPQESEEFCRTGTNFTLRELGLGEM 88
Cdd:pfam06484    1 SLTkRRRDKERRYTSSSADSEECRVPtQKSYSSSETLKAFDHDSRMLYGNRVKDMVHKEADEFSRQGQNFSLRELGICEP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893   89 TPPHGTLYRTDIGLPHCGYSMGASSDADLEADTVLSPEHPVRLWGRSTRSGRSSCLSSRANSNLTLTDTEHENT---ETG 165
Cdd:pfam06484   81 SPRHGLAYCTEMGLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKsdnENG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893  166 APLHCSSASSTPIEQSPSPPPSPpaNESQRRLLGNGVAQPTPDSDSEEEFVPNSFLVKSGSASLGVAAnDHPSGLQNHPR 245
Cdd:pfam06484  161 PPIPPSSSSSSPVEQHSPPPPSL--NENQRPLLGNNASHPILDSDPDEEFSPNSYLVRTGSGPQSAPS-EQPPNFQNHSR 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893  246 LRTPPPPLPHAHTPNQHHAASINSLNRGNFTPRSNPSPAPTdHSLSGEPPagSAQEPTHAQDNWLLNSNIPLETRnlgkq 325
Cdd:pfam06484  238 LRTPPPPLPPPHKQNQHHHPSINSLNRSSLTNRRNPSPAPT-ASLPAELQ--STQESVQLQDSWVLNSNVPLETR----- 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893  326 pflgtlqdnliemdilsasrhdgaysdgHFLFKPG-GTSPLFCTTSPGYPLTSSTVYSPPPRPLPRSTFSRPAFNLKKPS 404
Cdd:pfam06484  310 ----------------------------HFLFKTGtGTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPY 361

                   ....*.
gi 1958645893  405 KYCNWK 410
Cdd:pfam06484  362 KYCSWK 367
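
The paired rows above (query `gi 1958645893` over the `Cdd:` model row) can be compared column by column. The following is a minimal sketch, not part of the CD-Search output: the function name `percent_identity` is hypothetical, and it assumes that `-` marks a gap and that lowercase letters can be compared case-insensitively.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent identity over aligned columns in which neither row has a gap."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    compared = 0
    matches = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-":
            continue  # gap column: excluded from the comparison
        compared += 1
        if q.upper() == s.upper():
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


# Final alignment block above: query residues 405-410 vs. pfam06484 362-367.
print(round(percent_identity("KYCNWK", "KYCSWK"), 1))  # 83.3
```

Applying the same function to every block of a hit and pooling the counts would give an overall identity for that hit; the value printed here covers only the last six columns.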
NHL super family cl18310
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1292-1633 3.18e-47

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


The actual alignment was detected with superfamily member cd14953:

Pssm-ID: 302697 [Multi-domain]  Cd Length: 323  Bit Score: 173.49  E-value: 3.18e-47
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1292 GLADGNKLLA----PVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILEMRNKDFRHSHSPAHKYY----LATDPmSGA 1361
Cdd:cd14953     11 GFSGGGGTAArfnsPSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGN 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1362 VFLSDTNSRRVFKIKSTTVVKdlvknseVVAGTGDQclpfddtRCGDGGKATEATLTNPRGITVDKFGLIYFVDGT--MI 1439
Cdd:cd14953     90 LYVADTGNHRIRKITPDGVVS-------TLAGTGTA-------GFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRI 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1440 RRVDQNGIISTLLGsndlTSARPLSCDSVMdiSQVRLEWPTDLAINPMDNsLYVLD--NNVVLQISENHQVRIVAGRpmh 1517
Cdd:cd14953    156 RKITPDGVVTTVAG----TGGAGYAGDGPA--TAAQFNNPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGT--- 225
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1518 cqvpGIDHFLLSKVAIHATLESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGAPSGcdckndancdcFSGD 1597
Cdd:cd14953    226 ----GTAGFSGDGGATAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGD 287
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 1958645893 1598 DGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFI 1633
Cdd:cd14953    288 GGPATSAQFNNPTGVAVDAAGNLYVADTGNNRIRKI 323
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2755-2832 3.22e-37

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteristic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive version of this domain at their C-terminus.



Pssm-ID: 464783  Cd Length: 78  Bit Score: 135.43  E-value: 3.22e-37
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958645893 2755 EEKVRVLELARQRAVRQAWAREQQRLREGEEGLRAWTDGEKQQVLNTGRVQGYDGFFVTSVEQYPELSDSANNIHFMR 2832
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
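
Each hit carries a one-line header of the form `Pssm-ID: … Cd Length: … Bit Score: … E-value: …`, sometimes with a `[Multi-domain]` tag. As a rough sketch of how that line could be read programmatically, the snippet below uses a regular expression; the names `HEADER_RE`, `parse_hit_header` and `model_coverage` are illustrative assumptions, not part of any NCBI tool.

```python
import re

# Pattern for the per-hit header line; the bracketed tag is optional.
HEADER_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_header(line: str) -> dict:
    """Extract the numeric fields from one header line."""
    m = HEADER_RE.search(line)
    if m is None:
        raise ValueError(f"not a hit header: {line!r}")
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def model_coverage(model_start: int, model_end: int, cd_length: int) -> float:
    """Fraction of the domain model spanned by the aligned interval."""
    return (model_end - model_start + 1) / cd_length


hit = parse_hit_header(
    "Pssm-ID: 464783  Cd Length: 78  Bit Score: 135.43  E-value: 3.22e-37"
)
# The Cdd row above runs from model position 1 to 78 of 78.
print(hit["pssm_id"], model_coverage(1, 78, hit["cd_length"]))  # 464783 1.0
```

Coverage here is measured on the model side (the `Cdd:` row), so a value of 1.0 means the alignment spans the whole pfam15636 model from its first to its last position.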
RhsA COG3209
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction ...
1666-2531 1.96e-33

Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction only];



Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 142.20  E-value: 1.96e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1666 LYTQSLPTGDYLYNFTYTGDGDITHITDNNGNMVNVRRDSTGMPLWLVVPDGQVYWVTMGTNSALRSVTTQGHELAMMTY 1745
Cdd:COG3209    191 LATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTGAGTGAS 270
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1746 HGNSGLLATKSNENGW--TTFYEYDSFGRLTNVTFPTGQVSSFRSDTDSSVHVQVETSSKDDVTITTNLSASGAFYTLLQ 1823
Cdd:COG3209    271 GAGLDASTGTGGAGGSnaAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTTTTVG 350
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1824 DQVRNSYYIGADGSLRLLLANGMEVALQSEPHLLAGTVNPTVGKRNVTLPIDNGLNLVEWRQRKEQARGQVTVFGRRLRV 1903
Cdd:COG3209    351 GGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGG 430
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1904 HNRNLLSLDFDRVTRTEKIYDDHRKFTLRILYDQAGRPSLWSPSSRLNGVNVTYSPGGHIAGIQRGIMSERMEYDQAGRI 1983
Cdd:COG3209    431 TATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTLGGTT 510
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1984 TSRIFADgkmWSYTYLEKSMVLHLHSQRQYIFEFDKNDRLSSVTMPNVARQTLETIRSVGYYRNIYQPPEGNASVIQDFT 2063
Cdd:COG3209    511 TTTAGAR---GLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGG 587
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2064 EDGHLLHTFYLGTGRRVIYKYGKLSKLAETLYDTTKVSFTYDESAGMLKTVNLQNEGFTCTIRYRQIGPLIDRQIFRFTE 2143
Cdd:COG3209    588 TATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGT 667
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2144 EGMVNARFDYNYDNSFRVTSmQAVINETPLPIDLYRYDDVSGKTEQFGKFGVIYYDINQIITTAVMTHTKHFDAYGRMKE 2223
Cdd:COG3209    668 GVTAGLTTLATGGTTVGGGT-GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTT 746
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2224 VQYEIFRSLMYWmTVQYDNMGRVVKKELKVGPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNLHLLSPGNSAR 2303
Cdd:COG3209    747 TSTTTTTTAGAL-TYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTSVITVGSGG 825
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2304 LTPL-----RYDLRDRITRLgdvqykmdEDGFLRQRGGDVFEYNSAGLLIKAynRASGWSVRYRYDGLGRRVSSKSSHSH 2378
Cdd:COG3209    826 GTDLqdrtyTYDAAGNITSI--------TDALRAGTLTQTYTYDALGRLTSA--TDPGTTESYTYDANGNLTSRTDGGTT 895
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2379 HLQFFYADLtnPTKVTHlynhSSSEITSLYYDLQGHlfamelssgdefyiaCDNIGTPLAVFSGTGLMIKQILYTAYGEI 2458
Cdd:COG3209    896 TYTYDALGR--LVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDPFGNL 954
                          810       820       830       840       850       860       870
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1958645893 2459 YMDTNPNFQIIIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDhelwkrlsssSIVP---FHLYMFKNNNPISNS 2531
Cdd:COG3209    955 LAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD----------PIGLaggLNLYAYVGNNPVNYV 1020
acid_disulf_rpt NF033662
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ...
903-933 2.34e-09

acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.



Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 54.44  E-value: 2.34e-09
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1958645893  903 SMETACGDSKDNDGDGLVDCMDPDCCLQPLC 933
Cdd:NF033662     2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
DSL super family cl19567
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
816-859 5.18e-05

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


The actual alignment was detected with superfamily member pfam01414:

Pssm-ID: 473190  Cd Length: 46  Bit Score: 42.61  E-value: 5.18e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1958645893  816 CEDGWMGAACDqRACHPRCAE--HGTC-RDGKCECSPGWNGEHCTIA 859
Cdd:pfam01414    1 CDENYYGSTCS-KFCRPRDDKfgHYTCdANGNKVCLPGWTGPYCDKP 46
Keratin_B2 super family cl37504
Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized ...
697-845 1.37e-03

Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibres in association with hair keratin intermediate filaments. This family has been divided up into four regions, with the second region containing 8 copies of a short repeat. This family is also known as B2 or KAP1.


The actual alignment was detected with superfamily member pfam01500:

Pssm-ID: 366678 [Multi-domain]  Cd Length: 161  Bit Score: 42.09  E-value: 1.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893  697 TNQCIDVACSSHGTCimGTCICNPGYKGESCEEVDCMDPTCS----SRGVCVRGECHCSVgwgGTNCETPraTCLDQCS- 771
Cdd:pfam01500    6 TSFCGFPTCSTGGTC--GSGCCQPCCCQSSCCRPSCCQTSCCqpttFQSSCCRPTCQPCC---QTSCCQP--TCCQTSSc 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893  772 -------GHGTfLPDTGLCNCDPSWTGHDCSIEICAADCGGHGVCVGGTCrCEDGWMGAACdqraCHPRCAEHGTCRDGK 844
Cdd:pfam01500   79 qtgcggiGYGQ-EGSSGAVSSRTRWCRPDCRVEGTCLPPCCVVSCTPPTC-CQLHHAQASC----CRPSYCGQSCCRPAC 152

                   .
gi 1958645893  845 C 845
Cdd:pfam01500  153 C 153
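
The hits above are listed by E-value, not by position. To see how they are laid out along the protein, the intervals can be sorted and checked for overlap. This is only a sketch: the tuple layout and the helper names `by_position` and `overlapping_pairs` are assumptions made for illustration, and the numbers are copied from the hit list above.

```python
from itertools import combinations

# (name, start, end, e_value) as reported in the hit list above.
HITS = [
    ("Ten_N", 11, 410, 2.99e-179),
    ("NHL super family", 1292, 1633, 3.18e-47),
    ("Tox-GHH", 2755, 2832, 3.22e-37),
    ("RhsA", 1666, 2531, 1.96e-33),
    ("acid_disulf_rpt", 903, 933, 2.34e-09),
    ("DSL super family", 816, 859, 5.18e-05),
    ("Keratin_B2 super family", 697, 845, 1.37e-03),
]


def by_position(hits):
    """Return the hits ordered by start residue on the query."""
    return sorted(hits, key=lambda h: h[1])


def overlapping_pairs(hits):
    """Return (first, second, overlap_start, overlap_end) for overlapping hits."""
    pairs = []
    for a, b in combinations(by_position(hits), 2):
        # a starts no later than b, so they overlap if a ends at or after b's start.
        if a[2] >= b[1]:
            pairs.append((a[0], b[0], b[1], min(a[2], b[2])))
    return pairs


for name, start, end, _ in by_position(HITS):
    print(f"{start:>5}-{end:<5} {name}")
print(overlapping_pairs(HITS))
```

With these intervals, the only overlap is between the two weakest hits, Keratin_B2 (697-845) and DSL (816-859), which share residues 816-845 of the cysteine-rich region.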
 
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1292-1633 3.18e-47

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 173.49  E-value: 3.18e-47
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1292 GLADGNKLLA----PVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILEMRNKDFRHSHSPAHKYY----LATDPmSGA 1361
Cdd:cd14953     11 GFSGGGGTAArfnsPSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGN 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1362 VFLSDTNSRRVFKIKSTTVVKdlvknseVVAGTGDQclpfddtRCGDGGKATEATLTNPRGITVDKFGLIYFVDGT--MI 1439
Cdd:cd14953     90 LYVADTGNHRIRKITPDGVVS-------TLAGTGTA-------GFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRI 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1440 RRVDQNGIISTLLGsndlTSARPLSCDSVMdiSQVRLEWPTDLAINPMDNsLYVLD--NNVVLQISENHQVRIVAGRpmh 1517
Cdd:cd14953    156 RKITPDGVVTTVAG----TGGAGYAGDGPA--TAAQFNNPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGT--- 225
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1518 cqvpGIDHFLLSKVAIHATLESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGAPSGcdckndancdcFSGD 1597
Cdd:cd14953    226 ----GTAGFSGDGGATAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGD 287
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 1958645893 1598 DGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFI 1633
Cdd:cd14953    288 GGPATSAQFNNPTGVAVDAAGNLYVADTGNNRIRKI 323
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
2452-2531 2.88e-10

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It occurs occasionally in the Archaea (Methanosarcina barkeri) but is common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 58.67  E-value: 2.88e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2452 YTAYGEIyMDTNPNFQIIIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDhelwkrlssssivPF------HLYMFKNN 2525
Cdd:TIGR03696    1 YDPYGEV-LSESGAAPNPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD-------------PIglggglNLYAYVGN 66

                   ....*.
gi 1958645893 2526 NPISNS 2531
Cdd:TIGR03696   67 NPVNWV 72
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1302-1633 1.86e-09

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 61.19  E-value: 1.86e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1302 PVALTCGSDGSLYVGDF--NYIRRIFP-SGNVTnilEMRNKDFRHSHSpahkyyLATDPmSGAVFLSDTNSRRVFKIKST 1378
Cdd:COG4257     19 PRDVAVDPDGAVWFTDQggGRIGRLDPaTGEFT---EYPLGGGSGPHG------IAVDP-DGNLWFTDNGNNRIGRIDPK 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1379 TvvkdlvKNSEVVAGTGDQCLPFddtrcgdggkateatltnprGITVDKFGLIYFVDGT--MIRRVD-QNGIISTLLGsn 1455
Cdd:COG4257     89 T------GEITTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLDpATGEVTEFPL-- 140
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1456 DLTSARplscdsvmdisqvrlewPTDLAINPmDNSLYVLDNnvvlqisENHQVRIVAGRPMHcqvpgidhflLSKVAIHA 1535
Cdd:COG4257    141 PTGGAG-----------------PYGIAVDP-DGNLWVTDF-------GANAIGRIDPDTGT----------LTEYALPT 185
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1536 TLESATALAVSHNGVLYIAETDEKKINRIRqvTTSGEISLVAGAPSGCDckndancdcfsgddgyakdaklntPSSLAVC 1615
Cdd:COG4257    186 PGAGPRGLAVDPDGNLWVADTGSGRIGRFD--PKTGTVTEYPLPGGGAR------------------------PYGVAVD 239
                          330
                   ....*....|....*...
gi 1958645893 1616 ADGELYVADLGNIRIRFI 1633
Cdd:COG4257    240 GDGRVWFAESGANRIVRF 257
RHS_core NF041261
RHS element core protein;
1950-2368 2.89e-09

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 63.10  E-value: 2.89e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1950 LNGVNVTYSPGGhiAGIQRGIMSE-------RMEYDQAGRITSRIFADGKMWSYTyleksmvLHLHSqrqyifefdknDR 2022
Cdd:NF041261   401 LNRREVLHTEGE--GGLKRVVKKEhadgsvtRSGYDAAGRLTAQTDAAGRRTEYS-------LNVVS-----------GD 460
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2023 LSSVTMPNVarqtletiRSVGYYRNiyqppegnasviqdfteDGHLLHTFYLGTGRRVIYKYGKLSKL-AETLYDTTKVS 2101
Cdd:NF041261   461 ITDITTPDG--------RETKFYYN-----------------DGNQLTSVTSPDGLESRREYDEPGRLvSETSRSGETTR 515
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2102 FTYDESAGMLKTVNLQNEGFTCTIRYRQIGplidrQIFRFTEEGMVNARFDYNydnsfRVTSMQAVINETPlpIDLYR-Y 2180
Cdd:NF041261   516 YRYDDPHSELPATTTDATGSTKQMTWSRYG-----QLLAFTDCSGYQTRYEYD-----RFGQMTAVHREEG--ISTYRrY 583
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2181 DD----VSGKTEQfGKFGVIYY----DINQIITTAVMTHTKHFDAYGR-MKEVQYEIFRSLmywmtvQYDNMGRVVKKEL 2251
Cdd:NF041261   584 DNrgqlTSVKDAQ-GRETRYEYnaagDLTAVITPDGNRSETQYDAWGKaVSTTQGGLTRSM------EYDAAGRITTLTN 656
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2252 KvgpyaNTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNlhLLSPGNSARLTPLRYDLRDRITRL---GDV--QYKMD 2326
Cdd:NF041261   657 E-----NGSHSTFLYDALDRLVQQRGFDGRTQRYHYDLTGK--LTQSEDEGLVTLWHYDESDRITHRtvnGEPaeQWQYD 729
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|..
gi 1958645893 2327 EDGFLRQrggdvFEYNSAGLLIkaynrasgwSVRYRYDGLGR 2368
Cdd:NF041261   730 EHGWLTD-----ISHLSEGHRV---------AVHYGYDDKGR 757
RHS_core NF041261
RHS element core protein;
2178-2506 2.86e-06

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 53.08  E-value: 2.86e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2178 YRYDDVSGKTEQFGKFGVIY---YDINQIITTAVMT-----HT----------KHFDAYGRMKEVQYEIFRSLmywmTVQ 2239
Cdd:NF041261   367 YRYDDTGRVTEQLNPAGLSYryqYEQDRITITDSLNrrevlHTegegglkrvvKKEHADGSVTRSGYDAAGRL----TAQ 442
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2240 YDNMGRVVKKELKV---------GPYANTTRYSYeyDADGQLQTVSINDKPLWRYSYDLNGNLhLLSPGNSARLTPLRYD 2310
Cdd:NF041261   443 TDAAGRRTEYSLNVvsgditditTPDGRETKFYY--NDGNQLTSVTSPDGLESRREYDEPGRL-VSETSRSGETTRYRYD 519
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2311 lrDRITRLGDVqyKMDEDGFLRQrggdvFEYNSAGLLIkAYNRASGWSVRYRYDGLGRRVSSKSSHSHHLqffYADLTNP 2390
Cdd:NF041261   520 --DPHSELPAT--TTDATGSTKQ-----MTWSRYGQLL-AFTDCSGYQTRYEYDRFGQMTAVHREEGIST---YRRYDNR 586
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2391 TKVTHLYNHSSSEiTSLYYDLQGHLFAMELSSGDEFYIACDNIGTPLAVFSGtGLMiKQILYTAYGEIYMDTNPNfqiii 2470
Cdd:NF041261   587 GQLTSVKDAQGRE-TRYEYNAAGDLTAVITPDGNRSETQYDAWGKAVSTTQG-GLT-RSMEYDAAGRITTLTNEN----- 658
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*...
gi 1958645893 2471 GYHGG-LYDPLTKLVHMG-------RRDYDvLAGRWTSPDHE----LW 2506
Cdd:NF041261   659 GSHSTfLYDALDRLVQQRgfdgrtqRYHYD-LTGKLTQSEDEglvtLW 705
PLN02919 PLN02919
haloacid dehalogenase-like hydrolase family protein
1337-1637 1.07e-05

haloacid dehalogenase-like hydrolase family protein


Pssm-ID: 215497 [Multi-domain]  Cd Length: 1057  Bit Score: 51.39  E-value: 1.07e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1337 RNKDFRHSHSPAhKY--YLATDPMSGAVFLSDTNSRRVfkiksttVVKDLVKNSEV-VAGTGDQCL---PFDDtrcgdgg 1410
Cdd:PLN02919   556 KDNDPRLLTSPL-KFpgKLAIDLLNNRLFISDSNHNRI-------VVTDLDGNFIVqIGSTGEEGLrdgSFED------- 620
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1411 kateATLTNPRGITVDKFGLIYFVDGT---MIRRVD-QNGIISTLLGS----NDLTSARPLScdsvmdiSQVrLEWPTDL 1482
Cdd:PLN02919   621 ----ATFNRPQGLAYNAKKNLLYVADTenhALREIDfVNETVRTLAGNgtkgSDYQGGKKGT-------SQV-LNSPWDV 688
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1483 AINPMDNSLYV------------LDNNVVLQISENHQVRIVAGR----PMHCQVPGI------DHFLLSK---------- 1530
Cdd:PLN02919   689 CFEPVNEKVYIamagqhqiweynISDGVTRVFSGDGYERNLNGSsgtsTSFAQPSGIslspdlKELYIADsesssirald 768
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1531 ----------------------------VAIHATLESATALAVSHNGVLYIAETDEKKINRIRQVTtsGEISLVAGAPSG 1582
Cdd:PLN02919   769 lktggsrllaggdptfsdnlfkfgdhdgVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIKKLDPAT--KRVTTLAGTGKA 846
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1958645893 1583 cdckndancdcfSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFIRKNK 1637
Cdd:PLN02919   847 ------------GFKDGKALKAQLSEPAGLALGENGRLFVADTNNSLIRYLDLNK 889
DSL pfam01414
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
816-859 5.18e-05

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


Pssm-ID: 460202  Cd Length: 46  Bit Score: 42.61  E-value: 5.18e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1958645893  816 CEDGWMGAACDqRACHPRCAE--HGTC-RDGKCECSPGWNGEHCTIA 859
Cdd:pfam01414    1 CDENYYGSTCS-KFCRPRDDKfgHYTCdANGNKVCLPGWTGPYCDKP 46
NHL_like_2 cd14957
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ...
1543-1729 1.87e-04

Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271327 [Multi-domain]  Cd Length: 280  Bit Score: 46.11  E-value: 1.87e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1543 LAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGapsgcdckndancdcfSGDDGyakDAKLNTPSSLAVCADGELYV 1622
Cdd:cd14957     23 IAVDSAGNIYVADTGN---NRIQVFTSSGVYSYSIG----------------SGGTG---SGQFNSPYGIAVDSNGNIYV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1623 ADLGNIRIRfirknkpVLNTQNMYElsspidqelYLFDTSGkhlytQSLPTGDYLYNFTYTGDGDItHITDNNGNMVNVr 1702
Cdd:cd14957     81 ADTDNNRIQ-------VFNSSGVYQ---------YSIGTGG-----SGDGQFNGPYGIAVDSNGNI-YVADTGNHRIQV- 137
                          170       180
                   ....*....|....*....|....*..
gi 1958645893 1703 RDSTGmplwlvvpdgqVYWVTMGTNSA 1729
Cdd:cd14957    138 FTSSG-----------TFSYSIGSGGT 153
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
1750-1782 5.95e-04

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 39.50  E-value: 5.95e-04
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1958645893 1750 GLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQ 1782
Cdd:pfam05593    5 GRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
Keratin_B2 pfam01500
Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized ...
697-845 1.37e-03

Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibres in association with hair keratin intermediate filaments. This family has been divided up into four regions, with the second region containing 8 copies of a short repeat. This family is also known as B2 or KAP1.


Pssm-ID: 366678 [Multi-domain]  Cd Length: 161  Bit Score: 42.09  E-value: 1.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893  697 TNQCIDVACSSHGTCimGTCICNPGYKGESCEEVDCMDPTCS----SRGVCVRGECHCSVgwgGTNCETPraTCLDQCS- 771
Cdd:pfam01500    6 TSFCGFPTCSTGGTC--GSGCCQPCCCQSSCCRPSCCQTSCCqpttFQSSCCRPTCQPCC---QTSCCQP--TCCQTSSc 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893  772 -------GHGTfLPDTGLCNCDPSWTGHDCSIEICAADCGGHGVCVGGTCrCEDGWMGAACdqraCHPRCAEHGTCRDGK 844
Cdd:pfam01500   79 qtgcggiGYGQ-EGSSGAVSSRTRWCRPDCRVEGTCLPPCCVVSCTPPTC-CQLHHAQASC----CRPSYCGQSCCRPAC 152

                   .
gi 1958645893  845 C 845
Cdd:pfam01500  153 C 153
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
705-728 4.33e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.23  E-value: 4.33e-03
                           10        20
                   ....*....|....*....|....*...
gi 1958645893  705 CSSHGTCIMG----TCICNPGYKGESCE 728
Cdd:cd00054     11 CQNGGTCVNTvgsyRCSCPPGYTGRNCE 38
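
Each hit in this listing follows the same pattern: a name and accession line, a truncated description, an "Interval E-value" line such as `705-728 4.33e-03`, and a `Pssm-ID: ...` summary line. The following is a minimal Python sketch of how those two line types could be read programmatically. The helper names `parse_interval` and `parse_summary` are hypothetical and not part of any NCBI tool, and the regular expressions assume the exact text layout seen in this listing.

```python
import re

# Hypothetical pattern for the per-hit summary line, e.g.
# "Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 63.10  E-value: 2.89e-09"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional "[Multi-domain]" tag
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)

# Hypothetical pattern for the "Interval E-value" line, e.g. "705-728 4.33e-03"
INTERVAL_RE = re.compile(r"\s*(\d+)-(\d+)\s+([\d.eE+-]+)\s*")


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def parse_interval(line):
    """Return (start, end, e_value) for an interval line, or None if it does not match."""
    m = INTERVAL_RE.fullmatch(line)
    if m is None:
        return None
    return int(m.group(1)), int(m.group(2)), float(m.group(3))


if __name__ == "__main__":
    print(parse_interval("705-728 4.33e-03"))
    print(parse_summary(
        "Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.23  E-value: 4.33e-03"
    ))
```

The interval is given in query (protein) coordinates, so `end - start + 1` is the number of query residues covered by the hit, which can differ from the Cd Length of the domain model.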
 
Ten_N pfam06484
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ...
11-410 2.99e-179

Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).


Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  Bit Score: 554.59  E-value: 2.99e-179
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893   11 SLT-RRRDAERRYTSSSADSEEGKGP-QKSYSSSETLKAYDQDARLAYGSRVKDMVPQESEEFCRTGTNFTLRELGLGEM 88
Cdd:pfam06484    1 SLTkRRRDKERRYTSSSADSEECRVPtQKSYSSSETLKAFDHDSRMLYGNRVKDMVHKEADEFSRQGQNFSLRELGICEP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893   89 TPPHGTLYRTDIGLPHCGYSMGASSDADLEADTVLSPEHPVRLWGRSTRSGRSSCLSSRANSNLTLTDTEHENT---ETG 165
Cdd:pfam06484   81 SPRHGLAYCTEMGLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKsdnENG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893  166 APLHCSSASSTPIEQSPSPPPSPpaNESQRRLLGNGVAQPTPDSDSEEEFVPNSFLVKSGSASLGVAAnDHPSGLQNHPR 245
Cdd:pfam06484  161 PPIPPSSSSSSPVEQHSPPPPSL--NENQRPLLGNNASHPILDSDPDEEFSPNSYLVRTGSGPQSAPS-EQPPNFQNHSR 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893  246 LRTPPPPLPHAHTPNQHHAASINSLNRGNFTPRSNPSPAPTdHSLSGEPPagSAQEPTHAQDNWLLNSNIPLETRnlgkq 325
Cdd:pfam06484  238 LRTPPPPLPPPHKQNQHHHPSINSLNRSSLTNRRNPSPAPT-ASLPAELQ--STQESVQLQDSWVLNSNVPLETR----- 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893  326 pflgtlqdnliemdilsasrhdgaysdgHFLFKPG-GTSPLFCTTSPGYPLTSSTVYSPPPRPLPRSTFSRPAFNLKKPS 404
Cdd:pfam06484  310 ----------------------------HFLFKTGtGTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPY 361

                   ....*.
gi 1958645893  405 KYCNWK 410
Cdd:pfam06484  362 KYCSWK 367
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1292-1633 3.18e-47

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 173.49  E-value: 3.18e-47
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1292 GLADGNKLLA----PVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILEMRNKDFRHSHSPAHKYY----LATDPmSGA 1361
Cdd:cd14953     11 GFSGGGGTAArfnsPSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGN 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1362 VFLSDTNSRRVFKIKSTTVVKdlvknseVVAGTGDQclpfddtRCGDGGKATEATLTNPRGITVDKFGLIYFVDGT--MI 1439
Cdd:cd14953     90 LYVADTGNHRIRKITPDGVVS-------TLAGTGTA-------GFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRI 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1440 RRVDQNGIISTLLGsndlTSARPLSCDSVMdiSQVRLEWPTDLAINPMDNsLYVLD--NNVVLQISENHQVRIVAGRpmh 1517
Cdd:cd14953    156 RKITPDGVVTTVAG----TGGAGYAGDGPA--TAAQFNNPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGT--- 225
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1518 cqvpGIDHFLLSKVAIHATLESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGAPSGcdckndancdcFSGD 1597
Cdd:cd14953    226 ----GTAGFSGDGGATAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGD 287
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 1958645893 1598 DGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFI 1633
Cdd:cd14953    288 GGPATSAQFNNPTGVAVDAAGNLYVADTGNNRIRKI 323
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2755-2832 3.22e-37

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteristic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive version of this domain at their C-terminus.


Pssm-ID: 464783  Cd Length: 78  Bit Score: 135.43  E-value: 3.22e-37
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958645893 2755 EEKVRVLELARQRAVRQAWAREQQRLREGEEGLRAWTDGEKQQVLNTGRVQGYDGFFVTSVEQYPELSDSANNIHFMR 2832
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
1666-2531 1.96e-33

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 142.20  E-value: 1.96e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1666 LYTQSLPTGDYLYNFTYTGDGDITHITDNNGNMVNVRRDSTGMPLWLVVPDGQVYWVTMGTNSALRSVTTQGHELAMMTY 1745
Cdd:COG3209    191 LATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTGAGTGAS 270
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1746 HGNSGLLATKSNENGW--TTFYEYDSFGRLTNVTFPTGQVSSFRSDTDSSVHVQVETSSKDDVTITTNLSASGAFYTLLQ 1823
Cdd:COG3209    271 GAGLDASTGTGGAGGSnaAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTTTTVG 350
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1824 DQVRNSYYIGADGSLRLLLANGMEVALQSEPHLLAGTVNPTVGKRNVTLPIDNGLNLVEWRQRKEQARGQVTVFGRRLRV 1903
Cdd:COG3209    351 GGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGG 430
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1904 HNRNLLSLDFDRVTRTEKIYDDHRKFTLRILYDQAGRPSLWSPSSRLNGVNVTYSPGGHIAGIQRGIMSERMEYDQAGRI 1983
Cdd:COG3209    431 TATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTLGGTT 510
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1984 TSRIFADgkmWSYTYLEKSMVLHLHSQRQYIFEFDKNDRLSSVTMPNVARQTLETIRSVGYYRNIYQPPEGNASVIQDFT 2063
Cdd:COG3209    511 TTTAGAR---GLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGG 587
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2064 EDGHLLHTFYLGTGRRVIYKYGKLSKLAETLYDTTKVSFTYDESAGMLKTVNLQNEGFTCTIRYRQIGPLIDRQIFRFTE 2143
Cdd:COG3209    588 TATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGT 667
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2144 EGMVNARFDYNYDNSFRVTSmQAVINETPLPIDLYRYDDVSGKTEQFGKFGVIYYDINQIITTAVMTHTKHFDAYGRMKE 2223
Cdd:COG3209    668 GVTAGLTTLATGGTTVGGGT-GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTT 746
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2224 VQYEIFRSLMYWmTVQYDNMGRVVKKELKVGPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNLHLLSPGNSAR 2303
Cdd:COG3209    747 TSTTTTTTAGAL-TYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTSVITVGSGG 825
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2304 LTPL-----RYDLRDRITRLgdvqykmdEDGFLRQRGGDVFEYNSAGLLIKAynRASGWSVRYRYDGLGRRVSSKSSHSH 2378
Cdd:COG3209    826 GTDLqdrtyTYDAAGNITSI--------TDALRAGTLTQTYTYDALGRLTSA--TDPGTTESYTYDANGNLTSRTDGGTT 895
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2379 HLQFFYADLtnPTKVTHlynhSSSEITSLYYDLQGHlfamelssgdefyiaCDNIGTPLAVFSGTGLMIKQILYTAYGEI 2458
Cdd:COG3209    896 TYTYDALGR--LVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDPFGNL 954
                          810       820       830       840       850       860       870
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1958645893 2459 YMDTNPNFQIIIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDhelwkrlsssSIVP---FHLYMFKNNNPISNS 2531
Cdd:COG3209    955 LAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD----------PIGLaggLNLYAYVGNNPVNYV 1020
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1297-1633 4.17e-19

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 90.07  E-value: 4.17e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1297 NKLLAPVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILEMRNKDFRHSHSPAHkyyLATDPmSGAVFLSDTNSRRVFK 1374
Cdd:cd05819      5 GELNNPQGIAVDSSGNIYVADTgnNRIQVFDPDGNFITSFGSFGSGDGQFNEPAG---VAVDS-DGNLYVADTGNHRIQK 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1375 IKSTTVVKDlvknseVVAGTGDQCLPFDdtrcgdggkateatltNPRGITVDKFGLIYFVDgTM---IRRVDQNGIISTL 1451
Cdd:cd05819     81 FDPDGNFLA------SFGGSGDGDGEFN----------------GPRGIAVDSSGNIYVAD-TGnhrIQKFDPDGEFLTT 137
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1452 LGSNDLTSARplscdsvmdisqvrLEWPTDLAINPmDNSLYVLDnnvvlqiSENHQVRIVAgrpmhcqvPGiDHFLL--- 1528
Cdd:cd05819    138 FGSGGSGPGQ--------------FNGPTGVAVDS-DGNIYVAD-------TGNHRIQVFD--------PD-GNFLTtfg 186
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1529 SKVAIHATLESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGAPSGcdckndancdcfsgddgyaKDAKLNT 1608
Cdd:cd05819    187 STGTGPGQFNYPTGIAVDSDGNIYVADSGN---NRVQVFDPDGAGFGGNGNFLG-------------------SDGQFNR 244
                          330       340
                   ....*....|....*....|....*
gi 1958645893 1609 PSSLAVCADGELYVADLGNIRIRFI 1633
Cdd:cd05819    245 PSGLAVDSDGNLYVADTGNNRIQVF 269
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1417-1727 4.63e-17

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 83.91  E-value: 4.63e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1417 LTNPRGITVDKFGLIYFVDGTM--IRRVDQNGIISTLLGSNDltsarplscdsvmdISQVRLEWPTDLAINPmDNSLYVL 1494
Cdd:cd05819      7 LNNPQGIAVDSSGNIYVADTGNnrIQVFDPDGNFITSFGSFG--------------SGDGQFNEPAGVAVDS-DGNLYVA 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1495 D--NNVVLQISENHQVRIVAGRPmhcqvpGIDHFLLSkvaihatleSATALAVSHNGVLYIAETDEkkiNRIRQVTTSGE 1572
Cdd:cd05819     72 DtgNHRIQKFDPDGNFLASFGGS------GDGDGEFN---------GPRGIAVDSSGNIYVADTGN---HRIQKFDPDGE 133
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1573 ISLVAGAPSGCDckndancdcfsgddgyakdAKLNTPSSLAVCADGELYVADLGNIRIRFIrknkpvlntqnmyelsSPI 1652
Cdd:cd05819    134 FLTTFGSGGSGP-------------------GQFNGPTGVAVDSDGNIYVADTGNHRIQVF----------------DPD 178
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1653 DQELYLFDTSGKHLYTQSLPTG------DYLYnFTYTGDGDITHITDN------NGNmVNVRRDSTGMPLWLVV-PDGQV 1719
Cdd:cd05819    179 GNFLTTFGSTGTGPGQFNYPTGiavdsdGNIY-VADSGNNRVQVFDPDgagfggNGN-FLGSDGQFNRPSGLAVdSDGNL 256

                   ....*...
gi 1958645893 1720 YWVTMGTN 1727
Cdd:cd05819    257 YVADTGNN 264
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1289-1502 3.05e-14

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 76.41  E-value: 3.05e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1289 SCNGLADGNKLLAPVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNilemrnkdfrhshspahkyylatdpmsgavflsd 1366
Cdd:cd14953    176 AGDGPATAAQFNNPTGVAVDAAGNLYVADRgnHRIRKITPDGVVTT---------------------------------- 221
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1367 tnsrrvfkiksttvvkdlvknsevVAGTGDQclPFddtrcGDGGKATEATLTNPRGITVDKFGLIYFVD---GTmIRRVD 1443
Cdd:cd14953    222 ------------------------VAGTGTA--GF-----SGDGGATAAQLNNPTGVAVDAAGNLYVADsgnHR-IRKIT 269
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1958645893 1444 QNGIISTLLGSndlTSARPLSCDSVmdiSQVRLEWPTDLAINPmDNSLYVLD--NNVVLQI 1502
Cdd:cd14953    270 PAGVVTTVAGG---GAGFSGDGGPA---TSAQFNNPTGVAVDA-AGNLYVADtgNNRIRKI 323
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1295-1564 2.23e-13

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 73.12  E-value: 2.23e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1295 DGNKLLAPVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILEMRNKDFRHSHSPahkYYLATDPmSGAVFLSDTNSRRV 1372
Cdd:cd05819     50 GDGQFNEPAGVAVDSDGNLYVADTgnHRIQKFDPDGNFLASFGGSGDGDGEFNGP---RGIAVDS-SGNIYVADTGNHRI 125
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1373 FKIKSttvvkdlvkNSEVVAGTGdqclpfddtrcgdGGKATEATLTNPRGITVDKFGLIYFVDGT--MIRRVDQNGIIST 1450
Cdd:cd05819    126 QKFDP---------DGEFLTTFG-------------SGGSGPGQFNGPTGVAVDSDGNIYVADTGnhRIQVFDPDGNFLT 183
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1451 LLGSNDLTSArplscdsvmdisqvRLEWPTDLAINPMDNsLYVLD--NNVVLQISENHQVRIVAGrpmhcqvpgidhfll 1528
Cdd:cd05819    184 TFGSTGTGPG--------------QFNYPTGIAVDSDGN-IYVADsgNNRVQVFDPDGAGFGGNG--------------- 233
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 1958645893 1529 SKVAIHATLESATALAVSHNGVLYIAETDEKKINRI 1564
Cdd:cd05819    234 NFLGSDGQFNRPSGLAVDSDGNLYVADTGNNRIQVF 269
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
2452-2531 2.88e-10

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea (Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 58.67  E-value: 2.88e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2452 YTAYGEIyMDTNPNFQIIIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDhelwkrlssssivPF------HLYMFKNN 2525
Cdd:TIGR03696    1 YDPYGEV-LSESGAAPNPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD-------------PIglggglNLYAYVGN 66

                   ....*.
gi 1958645893 2526 NPISNS 2531
Cdd:TIGR03696   67 NPVNWV 72
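
Each hit in this listing carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", followed by an alignment whose "Cdd:" rows give the matched positions in the domain model. The following is a minimal sketch, assuming only the line layout visible in this listing, of how such a summary line could be parsed and how the fraction of the model covered by an alignment could be computed. The names parse_summary and model_coverage are hypothetical helpers introduced for illustration; they are not part of any NCBI tool or API.

```python
import re

# Hypothetical helper pattern: matches one per-hit summary line as it
# appears in this listing. The "[Multi-domain]" flag is optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Parse a summary line into a dict of typed fields."""
    m = SUMMARY_RE.search(line)
    if m is None:
        raise ValueError("not a CD-Search summary line: %r" % line)
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def model_coverage(cd_from, cd_to, cd_length):
    """Fraction of the domain model spanned by the aligned Cdd positions."""
    return (cd_to - cd_from + 1) / cd_length


hit = parse_summary(
    "Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  "
    "Bit Score: 58.67  E-value: 2.88e-10"
)
# The TIGR03696 alignment above spans model positions 1 to 72 of 77.
coverage = model_coverage(1, 72, hit["cd_length"])
```

For the TIGR03696 hit, the alignment spans 72 of the 77 model positions, so most of the model is covered even though the hit is short relative to the full protein.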
NHL_PKND_like cd14952
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ...
1353-1630 1.81e-09

NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271322 [Multi-domain]  Cd Length: 247  Bit Score: 61.07  E-value: 1.81e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1353 LATDPmSGAVFLSDTNSRRVFKiksttvvkdlvknseVVAGTGDQC-LPFDDtrcgdggkateatLTNPRGITVDKFGLI 1431
Cdd:cd14952     15 VAVDA-AGNVYVADSGNNRVLK---------------LAAGSTTQTvLPFTG-------------LYQPQGVAVDAAGTV 65
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1432 YFVDGtmirrvDQNGIISTLLGSNDLTsarPLSCDSvmdisqvrLEWPTDLAINPMDNsLYVLD--NNVVLqisenhqvR 1509
Cdd:cd14952     66 YVTDF------GNNRVLKLAAGSTTQT---VLPFTG--------LNDPTGVAVDAAGN-VYVADtgNNRVL--------K 119
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1510 IVAGRPMHCQVPgidhFllskvaihATLESATALAVSHNGVLYIAETDEkkiNRIRQvttsgeisLVAGA------Psgc 1583
Cdd:cd14952    120 LAAGSNTQTVLP----F--------TGLSNPDGVAVDGAGNVYVTDTGN---NRVLK--------LAAGSttqtvlP--- 173
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*..
gi 1958645893 1584 dckndancdcFSGddgyakdakLNTPSSLAVCADGELYVADLGNIRI 1630
Cdd:cd14952    174 ----------FTG---------LNSPSGVAVDTAGNVYVTDHGNNRV 201
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1302-1633 1.86e-09

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 61.19  E-value: 1.86e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1302 PVALTCGSDGSLYVGDF--NYIRRIFP-SGNVTnilEMRNKDFRHSHSpahkyyLATDPmSGAVFLSDTNSRRVFKIKST 1378
Cdd:COG4257     19 PRDVAVDPDGAVWFTDQggGRIGRLDPaTGEFT---EYPLGGGSGPHG------IAVDP-DGNLWFTDNGNNRIGRIDPK 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1379 TvvkdlvKNSEVVAGTGDQCLPFddtrcgdggkateatltnprGITVDKFGLIYFVDGT--MIRRVD-QNGIISTLLGsn 1455
Cdd:COG4257     89 T------GEITTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLDpATGEVTEFPL-- 140
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1456 DLTSARplscdsvmdisqvrlewPTDLAINPmDNSLYVLDNnvvlqisENHQVRIVAGRPMHcqvpgidhflLSKVAIHA 1535
Cdd:COG4257    141 PTGGAG-----------------PYGIAVDP-DGNLWVTDF-------GANAIGRIDPDTGT----------LTEYALPT 185
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1536 TLESATALAVSHNGVLYIAETDEKKINRIRqvTTSGEISLVAGAPSGCDckndancdcfsgddgyakdaklntPSSLAVC 1615
Cdd:COG4257    186 PGAGPRGLAVDPDGNLWVADTGSGRIGRFD--PKTGTVTEYPLPGGGAR------------------------PYGVAVD 239
                          330
                   ....*....|....*...
gi 1958645893 1616 ADGELYVADLGNIRIRFI 1633
Cdd:COG4257    240 GDGRVWFAESGANRIVRF 257
acid_disulf_rpt NF033662
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ...
903-933 2.34e-09

acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.


Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 54.44  E-value: 2.34e-09
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1958645893  903 SMETACGDSKDNDGDGLVDCMDPDCCLQPLC 933
Cdd:NF033662     2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
RHS_core NF041261
RHS element core protein;
1950-2368 2.89e-09

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 63.10  E-value: 2.89e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1950 LNGVNVTYSPGGhiAGIQRGIMSE-------RMEYDQAGRITSRIFADGKMWSYTyleksmvLHLHSqrqyifefdknDR 2022
Cdd:NF041261   401 LNRREVLHTEGE--GGLKRVVKKEhadgsvtRSGYDAAGRLTAQTDAAGRRTEYS-------LNVVS-----------GD 460
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2023 LSSVTMPNVarqtletiRSVGYYRNiyqppegnasviqdfteDGHLLHTFYLGTGRRVIYKYGKLSKL-AETLYDTTKVS 2101
Cdd:NF041261   461 ITDITTPDG--------RETKFYYN-----------------DGNQLTSVTSPDGLESRREYDEPGRLvSETSRSGETTR 515
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2102 FTYDESAGMLKTVNLQNEGFTCTIRYRQIGplidrQIFRFTEEGMVNARFDYNydnsfRVTSMQAVINETPlpIDLYR-Y 2180
Cdd:NF041261   516 YRYDDPHSELPATTTDATGSTKQMTWSRYG-----QLLAFTDCSGYQTRYEYD-----RFGQMTAVHREEG--ISTYRrY 583
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2181 DD----VSGKTEQfGKFGVIYY----DINQIITTAVMTHTKHFDAYGR-MKEVQYEIFRSLmywmtvQYDNMGRVVKKEL 2251
Cdd:NF041261   584 DNrgqlTSVKDAQ-GRETRYEYnaagDLTAVITPDGNRSETQYDAWGKaVSTTQGGLTRSM------EYDAAGRITTLTN 656
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2252 KvgpyaNTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNlhLLSPGNSARLTPLRYDLRDRITRL---GDV--QYKMD 2326
Cdd:NF041261   657 E-----NGSHSTFLYDALDRLVQQRGFDGRTQRYHYDLTGK--LTQSEDEGLVTLWHYDESDRITHRtvnGEPaeQWQYD 729
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|..
gi 1958645893 2327 EDGFLRQrggdvFEYNSAGLLIkaynrasgwSVRYRYDGLGR 2368
Cdd:NF041261   730 EHGWLTD-----ISHLSEGHRV---------AVHYGYDDKGR 757
NHL_like_3 cd14956
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1409-1630 4.98e-07

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271326 [Multi-domain]  Cd Length: 274  Bit Score: 53.83  E-value: 4.98e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1409 GGKATEA-TLTNPRGITVDKFGLIYFVDGT--MIRRVDQNGIISTLLGSN-----DLTSARPLSCDS-----VMD----- 1470
Cdd:cd14956     50 GTTGDGPgQFGRPRGLAVDKDGWLYVADYWgdRIQVFTLTGELQTIGGSSgsgpgQFNAPRGVAVDAdgnlyVADfgnqr 129
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1471 ISQVRLE------------------WPTDLAINPmDNSLYVLDnnvvlqiSENHQVrivagrpmhcQVPGIDHFLLSKVA 1532
Cdd:cd14956    130 IQKFDPDgsflrqwggtgiepgsfnYPRGVAVDP-DGTLYVAD-------TYNDRI----------QVFDNDGAFLRKWG 191
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1533 IHAT----LESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGAPSGcdckndancdcfsgddgyaKDAKLNT 1608
Cdd:cd14956    192 GRGTgpgqFNYPYGIAIDPDGNVFVADFGN---NRIQKFTADGTFLTSWGSPGT-------------------GPGQFKN 249
                          250       260
                   ....*....|....*....|..
gi 1958645893 1609 PSSLAVCADGELYVADLGNIRI 1630
Cdd:cd14956    250 PWGVVVDADGTVYVADSNNNRV 271
RHS_core NF041261
RHS element core protein;
2178-2506 2.86e-06

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 53.08  E-value: 2.86e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2178 YRYDDVSGKTEQFGKFGVIY---YDINQIITTAVMT-----HT----------KHFDAYGRMKEVQYEIFRSLmywmTVQ 2239
Cdd:NF041261   367 YRYDDTGRVTEQLNPAGLSYryqYEQDRITITDSLNrrevlHTegegglkrvvKKEHADGSVTRSGYDAAGRL----TAQ 442
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2240 YDNMGRVVKKELKV---------GPYANTTRYSYeyDADGQLQTVSINDKPLWRYSYDLNGNLhLLSPGNSARLTPLRYD 2310
Cdd:NF041261   443 TDAAGRRTEYSLNVvsgditditTPDGRETKFYY--NDGNQLTSVTSPDGLESRREYDEPGRL-VSETSRSGETTRYRYD 519
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2311 lrDRITRLGDVqyKMDEDGFLRQrggdvFEYNSAGLLIkAYNRASGWSVRYRYDGLGRRVSSKSSHSHHLqffYADLTNP 2390
Cdd:NF041261   520 --DPHSELPAT--TTDATGSTKQ-----MTWSRYGQLL-AFTDCSGYQTRYEYDRFGQMTAVHREEGIST---YRRYDNR 586
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 2391 TKVTHLYNHSSSEiTSLYYDLQGHLFAMELSSGDEFYIACDNIGTPLAVFSGtGLMiKQILYTAYGEIYMDTNPNfqiii 2470
Cdd:NF041261   587 GQLTSVKDAQGRE-TRYEYNAAGDLTAVITPDGNRSETQYDAWGKAVSTTQG-GLT-RSMEYDAAGRITTLTNEN----- 658
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*...
gi 1958645893 2471 GYHGG-LYDPLTKLVHMG-------RRDYDvLAGRWTSPDHE----LW 2506
Cdd:NF041261   659 GSHSTfLYDALDRLVQQRgfdgrtqRYHYD-LTGKLTQSEDEglvtLW 705
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1415-1633 6.66e-06

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 50.40  E-value: 6.66e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1415 ATLTNPRGITVDKFGLIYFVD--GTMIRRVD-QNGIISTllgsndltsarplscdsvmdISQVRLEWPTDLAINPmDNSL 1491
Cdd:COG4257     14 APGSGPRDVAVDPDGAVWFTDqgGGRIGRLDpATGEFTE--------------------YPLGGGSGPHGIAVDP-DGNL 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1492 YVLD--NNVVLQIS-ENHQVRIVAGrpmhcqvPGIDHFLlskvaihatlesaTALAVSHNGVLYIAETDekkINRIRQVT 1568
Cdd:COG4257     73 WFTDngNNRIGRIDpKTGEITTFAL-------PGGGSNP-------------HGIAFDPDGNLWFTDQG---GNRIGRLD 129
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1569 T-SGEISLV-----AGAPSGCDCKND---------ANC-DCFSGDDG----YAKDAKLNTPSSLAVCADGELYVADLGNI 1628
Cdd:COG4257    130 PaTGEVTEFplptgGAGPYGIAVDPDgnlwvtdfgANAiGRIDPDTGtlteYALPTPGAGPRGLAVDPDGNLWVADTGSG 209

                   ....*
gi 1958645893 1629 RIRFI 1633
Cdd:COG4257    210 RIGRF 214
PLN02919 PLN02919
haloacid dehalogenase-like hydrolase family protein
1337-1637 1.07e-05

haloacid dehalogenase-like hydrolase family protein


Pssm-ID: 215497 [Multi-domain]  Cd Length: 1057  Bit Score: 51.39  E-value: 1.07e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1337 RNKDFRHSHSPAhKY--YLATDPMSGAVFLSDTNSRRVfkiksttVVKDLVKNSEV-VAGTGDQCL---PFDDtrcgdgg 1410
Cdd:PLN02919   556 KDNDPRLLTSPL-KFpgKLAIDLLNNRLFISDSNHNRI-------VVTDLDGNFIVqIGSTGEEGLrdgSFED------- 620
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1411 kateATLTNPRGITVDKFGLIYFVDGT---MIRRVD-QNGIISTLLGS----NDLTSARPLScdsvmdiSQVrLEWPTDL 1482
Cdd:PLN02919   621 ----ATFNRPQGLAYNAKKNLLYVADTenhALREIDfVNETVRTLAGNgtkgSDYQGGKKGT-------SQV-LNSPWDV 688
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1483 AINPMDNSLYV------------LDNNVVLQISENHQVRIVAGR----PMHCQVPGI------DHFLLSK---------- 1530
Cdd:PLN02919   689 CFEPVNEKVYIamagqhqiweynISDGVTRVFSGDGYERNLNGSsgtsTSFAQPSGIslspdlKELYIADsesssirald 768
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1531 ----------------------------VAIHATLESATALAVSHNGVLYIAETDEKKINRIRQVTtsGEISLVAGAPSG 1582
Cdd:PLN02919   769 lktggsrllaggdptfsdnlfkfgdhdgVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIKKLDPAT--KRVTTLAGTGKA 846
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1958645893 1583 cdckndancdcfSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFIRKNK 1637
Cdd:PLN02919   847 ------------GFKDGKALKAQLSEPAGLALGENGRLFVADTNNSLIRYLDLNK 889
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1302-1443 2.59e-05

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 48.48  E-value: 2.59e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1302 PVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILEMRNKDFRhshsPahkYYLATDPmSGAVFLSDTNSRRVFKIKSTT 1379
Cdd:COG4257    147 PYGIAVDPDGNLWVTDFgaNAIGRIDPDTGTLTEYALPTPGAG----P---RGLAVDP-DGNLWVADTGSGRIGRFDPKT 218
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1380 vvkdlvknsevvagtgdqclpfddtrcgdgGKATEATLTN----PRGITVDKFGLIYFVDGT--MIRRVD 1443
Cdd:COG4257    219 ------------------------------GTVTEYPLPGggarPYGVAVDGDGRVWFAESGanRIVRFD 258
NHL_like_2 cd14957
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ...
1420-1727 3.47e-05

Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271327 [Multi-domain]  Cd Length: 280  Bit Score: 48.42  E-value: 3.47e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1420 PRGITVDKFGLIYFVD--GTMIRRVDQNGIISTLLGSNDltsarplscdsvmdISQVRLEWPTDLAINPMDNsLYVLDnn 1497
Cdd:cd14957     20 PRGIAVDSAGNIYVADtgNNRIQVFTSSGVYSYSIGSGG--------------TGSGQFNSPYGIAVDSNGN-IYVAD-- 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1498 vvlqiSENHQVRIvagrpmhcqvpgidhFLLSKVAIHA---------TLESATALAVSHNGVLYIAETDEkkiNRIrQVT 1568
Cdd:cd14957     83 -----TDNNRIQV---------------FNSSGVYQYSigtggsgdgQFNGPYGIAVDSNGNIYVADTGN---HRI-QVF 138
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1569 TSgeislvAGAPsgcdckndancdCFSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRfirknkpVLNTQNMYel 1648
Cdd:cd14957    139 TS------SGTF------------SYSIGSGGTGPGQFNGPQGIAVDSDGNIYVADTGNHRIQ-------VFTSSGTF-- 191
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1649 sspidqeLYLFDTSGkhlytqslpTGDYLYNFTY----TGDGDItHITDNNGNMVNVrRDSTGmplwlvvpdgqVYWVTM 1724
Cdd:cd14957    192 -------QYTFGSSG---------SGPGQFSDPYgiavDSDGNI-YVADTGNHRIQV-FTSSG-----------AYQYSI 242

                   ...
gi 1958645893 1725 GTN 1727
Cdd:cd14957    243 GTS 245
DSL pfam01414
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
816-859 5.18e-05

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


Pssm-ID: 460202  Cd Length: 46  Bit Score: 42.61  E-value: 5.18e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1958645893  816 CEDGWMGAACDqRACHPRCAE--HGTC-RDGKCECSPGWNGEHCTIA 859
Cdd:pfam01414    1 CDENYYGSTCS-KFCRPRDDKfgHYTCdANGNKVCLPGWTGPYCDKP 46
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1573-1633 7.12e-05

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 47.52  E-value: 7.12e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1958645893 1573 ISLVAGAPSGcdckndancdcfSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFI 1633
Cdd:cd14953      1 VSTVAGSGTA------------GFSGGGGTAARFNSPSGVAVDAAGNLYVADRGNHRIRKI 49
NHL_like_2 cd14957
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ...
1543-1729 1.87e-04

Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271327 [Multi-domain]  Cd Length: 280  Bit Score: 46.11  E-value: 1.87e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1543 LAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGapsgcdckndancdcfSGDDGyakDAKLNTPSSLAVCADGELYV 1622
Cdd:cd14957     23 IAVDSAGNIYVADTGN---NRIQVFTSSGVYSYSIG----------------SGGTG---SGQFNSPYGIAVDSNGNIYV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1623 ADLGNIRIRfirknkpVLNTQNMYElsspidqelYLFDTSGkhlytQSLPTGDYLYNFTYTGDGDItHITDNNGNMVNVr 1702
Cdd:cd14957     81 ADTDNNRIQ-------VFNSSGVYQ---------YSIGTGG-----SGDGQFNGPYGIAVDSNGNI-YVADTGNHRIQV- 137
                          170       180
                   ....*....|....*....|....*..
gi 1958645893 1703 RDSTGmplwlvvpdgqVYWVTMGTNSA 1729
Cdd:cd14957    138 FTSSG-----------TFSYSIGSGGT 153
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
1746-1786 2.88e-04

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 40.65  E-value: 2.88e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1958645893 1746 HGNSGLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQVSSF 1786
Cdd:TIGR01643    1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRY 41
NHL-2_like cd14951
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ...
1544-1634 3.36e-04

NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins; members of this eukaryotic and bacterial family are uncharacterized; the NHL repeat domain is found C-terminal to a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271321 [Multi-domain]  Cd Length: 334  Bit Score: 45.65  E-value: 3.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1544 AVSHNGVLYIAETdekKINRIRQV-TTSGEISLVAGapsgcdckndancdcfSGDDGYA-KDAKLNTPSSLAVCADGELY 1621
Cdd:cd14951    202 AALPDGSVYVADT---YNHKIKRVdPATGEVSTLAG----------------TGKAGYKdLEAQFSEPSGLVVDGDGRLY 262
                           90
                   ....*....|...
gi 1958645893 1622 VADLGNIRIRFIR 1634
Cdd:cd14951    263 VADTNNHRIRRLD 275
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
1750-1782 5.95e-04

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 39.50  E-value: 5.95e-04
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1958645893 1750 GLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQ 1782
Cdd:pfam05593    5 GRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
Keratin_B2 pfam01500
Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized ...
697-845 1.37e-03

Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibres in association with hair keratin intermediate filaments. This family has been divided up into four regions, with the second region containing 8 copies of a short repeat. This family is also known as B2 or KAP1.


Pssm-ID: 366678 [Multi-domain]  Cd Length: 161  Bit Score: 42.09  E-value: 1.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893  697 TNQCIDVACSSHGTCimGTCICNPGYKGESCEEVDCMDPTCS----SRGVCVRGECHCSVgwgGTNCETPraTCLDQCS- 771
Cdd:pfam01500    6 TSFCGFPTCSTGGTC--GSGCCQPCCCQSSCCRPSCCQTSCCqpttFQSSCCRPTCQPCC---QTSCCQP--TCCQTSSc 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893  772 -------GHGTfLPDTGLCNCDPSWTGHDCSIEICAADCGGHGVCVGGTCrCEDGWMGAACdqraCHPRCAEHGTCRDGK 844
Cdd:pfam01500   79 qtgcggiGYGQ-EGSSGAVSSRTRWCRPDCRVEGTCLPPCCVVSCTPPTC-CQLHHAQASC----CRPSYCGQSCCRPAC 152

                   .
gi 1958645893  845 C 845
Cdd:pfam01500  153 C 153
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1297-1381 1.48e-03

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 43.08  E-value: 1.48e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1297 NKLLAPVALTCGSDGSLYVGDF--NYIRRIFP-SGNVTNilemrnkdFRHSHSPAHKYYLATDPmSGAVFLSDTNSRRVF 1373
Cdd:COG4257    185 TPGAGPRGLAVDPDGNLWVADTgsGRIGRFDPkTGTVTE--------YPLPGGGARPYGVAVDG-DGRVWFAESGANRIV 255

                   ....*...
gi 1958645893 1374 KIKSTTVV 1381
Cdd:COG4257    256 RFDPDTEL 263
NHL_like_5 cd14963
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1298-1556 1.58e-03

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271333 [Multi-domain]  Cd Length: 268  Bit Score: 43.05  E-value: 1.58e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1298 KLLAPVALTCGSDGSLYVGDFnYIRRI--F-PSGNVTNILEmRNKDFRHSHSPAHkyyLATDpmSGAVFLSDTNSRRVfk 1374
Cdd:cd14963     54 EFKYPYGIAVDSDGNIYVADL-YNGRIqvFdPDGKFLKYFP-EKKDRVKLISPAG---LAID--DGKLYVSDVKKHKV-- 124
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1375 iksttVVKDLvknsevvagTGDQCLPFddtrcGDGGKAtEATLTNPRGITVDKFGLIYFVDgTMIRRV---DQNG-IIST 1450
Cdd:cd14963    125 -----IVFDL---------EGKLLLEF-----GKPGSE-PGELSYPNGIAVDEDGNIYVAD-SGNGRIqvfDKNGkFIKE 183
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1451 LLGSNDLTSArplscdsvmdisqvrLEWPTDLAINPmDNSLYVLDN--NVVLQISENHQVRIVAGRpmhcqvPGIDhfll 1528
Cdd:cd14963    184 LNGSPDGKSG---------------FVNPRGIAVDP-DGNLYVVDNlsHRVYVFDEQGKELFTFGG------RGKD---- 237
                          250       260
                   ....*....|....*....|....*...
gi 1958645893 1529 skvaiHATLESATALAVSHNGVLYIAET 1556
Cdd:cd14963    238 -----DGQFNLPNGLFIDDDGRLYVTDR 260
NHL-2_like cd14951
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ...
1410-1512 3.61e-03

NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins; members of this eukaryotic and bacterial family are uncharacterized; the NHL repeat domain is found C-terminal to a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271321 [Multi-domain]  Cd Length: 334  Bit Score: 42.18  E-value: 3.61e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1410 GKATEATLTNPRGITVDKFGLIYFVDgTM---IRRVD-QNGIISTLLGSNDLTSarplscdsvmDISQVRLEWPTDLAIN 1485
Cdd:cd14951    188 GPGAEALLQHPLGVAALPDGSVYVAD-TYnhkIKRVDpATGEVSTLAGTGKAGY----------KDLEAQFSEPSGLVVD 256
                           90       100
                   ....*....|....*....|....*..
gi 1958645893 1486 PmDNSLYVLDNNvvlqiseNHQVRIVA 1512
Cdd:cd14951    257 G-DGRLYVADTN-------NHRIRRLD 275
NHL_PKND_like cd14952
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ...
1302-1503 4.01e-03

NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271322 [Multi-domain]  Cd Length: 247  Bit Score: 41.81  E-value: 4.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1302 PVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILemrnkDFRHSHSPahkYYLATDPmSGAVFLSDTNSRRVFKIkstt 1379
Cdd:cd14952     96 PTGVAVDAAGNVYVADTgnNRVLKLAAGSNTQTVL-----PFTGLSNP---DGVAVDG-AGNVYVTDTGNNRVLKL---- 162
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1380 vvkdlvknsevVAGTGDQC-LPFDDtrcgdggkateatLTNPRGITVDKFGLIYFVDGtmirrvDQNGIISTLLGSNDLT 1458
Cdd:cd14952    163 -----------AAGSTTQTvLPFTG-------------LNSPSGVAVDTAGNVYVTDH------GNNRVLKLAAGSTTPT 212
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*..
gi 1958645893 1459 sARPLScdsvmdisqvRLEWPTDLAINPmDNSLYVLD--NNVVLQIS 1503
Cdd:cd14952    213 -VLPFT----------GLNGPLGVAVDA-AGNVYVADrgNDRVVKLP 247
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
834-856 4.12e-03

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 36.94  E-value: 4.12e-03
                           10        20
                   ....*....|....*....|....*
gi 1958645893  834 CAEHGTCRD--GKCECSPGWNGEHC 856
Cdd:pfam07974    2 CSGRGTCVNqcGKCVCDSGYQGATC 26
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
705-728 4.33e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.23  E-value: 4.33e-03
                           10        20
                   ....*....|....*....|....*...
gi 1958645893  705 CSSHGTCIMG----TCICNPGYKGESCE 728
Cdd:cd00054     11 CQNGGTCVNTvgsyRCSCPPGYTGRNCE 38
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
704-727 6.87e-03


Pssm-ID: 400365  Cd Length: 26  Bit Score: 36.17  E-value: 6.87e-03
                           10        20
                   ....*....|....*....|....*.
gi 1958645893  704 ACSSHGTCIM--GTCICNPGYKGESC 727
Cdd:pfam07974    1 ICSGRGTCVNqcGKCVCDSGYQGATC 26
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1602-1701 9.28e-03

NHL repeat unit of beta-propeller proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically six. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have catalytic activity in peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat; this interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 40.76  E-value: 9.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958645893 1602 KDAKLNTPSSLAVCADGELYVADLGNIRIRfirknkpVLNTQNMYelsspidqeLYLFDTSGKHLYTQSLPTGDYLynft 1681
Cdd:cd05819      3 GPGELNNPQGIAVDSSGNIYVADTGNNRIQ-------VFDPDGNF---------ITSFGSFGSGDGQFNEPAGVAV---- 62
                           90       100
                   ....*....|....*....|
gi 1958645893 1682 yTGDGDItHITDNNGNMVNV 1701
Cdd:cd05819     63 -DSDGNL-YVADTGNHRIQK 80
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
2341-2370 9.45e-03

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 36.04  E-value: 9.45e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 1958645893 2341 YNSAGLLIKAYNrASGWSVRYRYDGLGRRV 2370
Cdd:pfam05593    1 YDAAGRLTSVTD-PDGRVTTYTYDAAGRLT 29
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
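
Each domain hit on this page carries a summary line with the same fields (Pssm-ID, an optional [Multi-domain] flag, Cd Length, Bit Score, E-value), so the hits can be checked against the 0.01 E-value threshold in a few lines of code. The following is a minimal sketch that assumes the plain-text layout shown on this page; `HIT_RE`, `parse_hit`, and `THRESHOLD` are hypothetical names chosen for illustration and are not part of any NCBI tool, and treating the threshold as inclusive is also an assumption.

```python
import re

# Matches summary lines as they appear on this page, e.g.
# "Pssm-ID: 400365  Cd Length: 26  Bit Score: 36.94  E-value: 4.12e-03"
# The "[Multi-domain]" flag is optional.
HIT_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit(line):
    """Return the fields of one summary line as a dict, or None if it does not match."""
    m = HIT_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


# Three summary lines copied from the hit list above.
lines = [
    "Pssm-ID: 271322 [Multi-domain]  Cd Length: 247  Bit Score: 41.81  E-value: 4.01e-03",
    "Pssm-ID: 400365  Cd Length: 26  Bit Score: 36.94  E-value: 4.12e-03",
    "Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 36.04  E-value: 9.45e-03",
]

THRESHOLD = 0.01  # E-value threshold from the preset options (inclusive comparison is an assumption)

hits = [parse_hit(line) for line in lines]
passing = [h for h in hits if h is not None and h["e_value"] <= THRESHOLD]

for h in passing:
    print(h["pssm_id"], h["bit_score"], h["e_value"])
```

All three sample hits fall under the threshold, which is consistent with their being reported on the page at all.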

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.