Conserved domains on [gi|148699169|gb|EDL31116|]

procollagen, type XXVII, alpha 1, isoform CRA_a [Mus musculus]

List of domain hits

Name Accession Description Interval E-value
COLFI super family cl02436
Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
1646-1844 1.71e-55

Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.


The actual alignment was detected with superfamily member pfam01410:

Pssm-ID: 470578  Cd Length: 233  Bit Score: 193.33  E-value: 1.71e-55
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  1646 EIFKTLHYLSNLIQSIKTPLGTKENPARVCRDLMDCEQRMADGTYWVDPNLGCSSDTIEVSCNFTQGgQTCLKPITAS-- 1723
Cdd:pfam01410    4 EVMATLKSLSQQIENIRSPDGSKKNPARTCRDLKLCHPDWKSGEYWIDPNQGCTRDAIKVFCNFETG-ETCIYPTKASip 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  1724 --------------------KAEF---------AVSRVQMNFLHLLSSEGTQHITIHCLNMTVWQEGPGrSSARQAVRFR 1774
Cdd:pfam01410   83 rknwwtkeskhvwfgefmngGSQFsygvdgvgpSVAAVQLTFLRLLSTEASQNITYHCKNSVAYMDQAT-GNLKKALLLQ 161
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 148699169  1775 AWNGQVFEAGG--QFRPEVSMDGCKVHDGRWHQTLFTFRTQDPQQLPIVsvdNLPPVSSGK---QYRLEVGPACF 1844
Cdd:pfam01410  162 GSNDEEIRAEGnsRFTYTVLEDGCTKRTGQWGKTVIEYRTQKVSRLPIV---DIAPMDIGGadqEFGVEVGPVCF 233
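
Each hit in this listing carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal sketch of how such a line could be turned into typed fields. It assumes the layout seen in this listing, including the optional "[Multi-domain]" tag. The function name `parse_summary` and the dictionary keys are illustrative and not part of any NCBI tool.

```python
import re

# Pattern for the per-hit summary line as it appears in this listing.
# The "[Multi-domain]" tag is optional (assumption based on the lines shown here).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Parse one summary line into a dict of typed fields (hypothetical helper)."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("multi") is not None,
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


# Summary line of the COLFI hit, copied from the listing.
colfi = parse_summary(
    "Pssm-ID: 470578  Cd Length: 233  Bit Score: 193.33  E-value: 1.71e-55"
)
print(colfi)
```

Keeping the E-value as a float makes it straightforward to sort or filter hits by significance.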
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1354-1584 5.44e-35

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 140.43  E-value: 5.44e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1354 EGVQGLRGEpGQQGQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGLQGLPGPRGVVGRQGPegtagsdgipg 1433
Cdd:NF038329  108 EGLQQLKGD-GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGP----------- 175
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1434 rdgrPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQGPPGFKGESGLPGQLGPPGKRGtEGGTGLPGNQGEPGSKGQP 1513
Cdd:NF038329  176 ----AGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQ 250
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 148699169 1514 GDSGEMGFPGVAGLFGPKGPPGDIGFKGIQGPRGPPGLMGKEGIIGPPGMLGPSGLPGPKGDRGSRGDLGL 1584
Cdd:NF038329  251 GPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQ 321
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1168-1455 1.08e-34

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 139.66  E-value: 1.08e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1168 GQRGEPGLEgdhGPVGPDGLKGDRGDPGPDGEHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQGEKGQRGAKGAKGH 1247
Cdd:NF038329  117 GEKGEPGPA---GPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGP 193
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1248 QGylgemgipgepgppgtpgpkgSRGTLGPTGAPGRMGAQGEPGLAGYNGhkgitgplgppgpkgEKGDQGEDGktegpp 1327
Cdd:NF038329  194 QG---------------------PRGETGPAGEQGPAGPAGPDGEAGPAG---------------EDGPAGPAG------ 231
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1328 gppgdrgpvgdRGDRGEPGDPGYPGQEGVQGLRGEPGQQGQpghPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGL 1407
Cdd:NF038329  232 -----------DGQQGPDGDPGPTGEDGPQGPDGPAGKDGP---RGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGL 297
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*...
gi 148699169 1408 QGLPGPRGVVGRQGPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGP 1455
Cdd:NF038329  298 PGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPKTP 345
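
The paired rows above can be compared column by column to get a rough sense of how similar the query is to the domain model. The sketch below computes the fraction of identical residues over columns where neither row has a gap. It assumes that "-" marks a gap and that lowercase query letters are residues the search left unaligned. The function name `aligned_identity` is illustrative, and the result is not a score reported by CD-Search.

```python
def aligned_identity(query_row, subject_row):
    """Fraction of identical residues over columns where neither row has a gap."""
    if len(query_row) != len(subject_row):
        raise ValueError("aligned rows must have equal length")
    compared = 0
    identical = 0
    for q, s in zip(query_row, subject_row):
        if q == "-" or s == "-":
            continue  # skip gap columns
        compared += 1
        if q.upper() == s.upper():
            identical += 1
    return identical / compared if compared else 0.0


# First row pair of the 1168-1455 hit, copied from the listing.
query_row = "GQRGEPGLEgdhGPVGPDGLKGDRGDPGPDGEHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQGEKGQRGAKGAKGH"
subject_row = "GEKGEPGPA---GPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGP"
print(f"identity over ungapped columns: {aligned_identity(query_row, subject_row):.2f}")
```

For repeat-rich regions like this one, much of the identity comes from the periodic glycines, so the figure says little about homology on its own.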
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
959-1234 1.51e-33

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 136.19  E-value: 1.51e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  959 LDGSKGEPGDPGRPGPVGEQGLMGFIGLVGEPGIVGEKGDRGvmgppgapgpkgsmghpgtpggignpgepgpwgPPGSR 1038
Cdd:NF038329  115 GDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQG---------------------------------ERGEK 161
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1039 GLPGMRGAKGHRGPRGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAAGERGHSGAKGflgipgpsgppgAKGLPGEPGs 1118
Cdd:NF038329  162 GPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAG------------PAGEDGPAG- 228
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1119 qgpqgpVGPPGEMGPKGPPGAVGEPGLPGDSGMKGDLGPLGPPGEQGLIGQRGEPGLEGDHGPVGPDGLKGDRGDPGPDG 1198
Cdd:NF038329  229 ------PAGDGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDG 302
                         250       260       270
                  ....*....|....*....|....*....|....*.
gi 148699169 1199 EHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQG 1234
Cdd:NF038329  303 KDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
728-974 2.04e-28

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 120.78  E-value: 2.04e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  728 FPGPVGDPGPKGSRGYIGLPGLFGLPGSDGERGLPGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGLPGPPGVLGLI 807
Cdd:NF038329  115 GDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQ 194
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  808 GDTGALGPIGYPGPKGMKGLMGGVGEPGLKGDKGEQGvpgvSGDPGFQGDKGSHGLPGLPGGRGKPGPLGKAGDKGslgf 887
Cdd:NF038329  195 GPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG----DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRG---- 266
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  888 pgppgPEGFPGDIGPPGDNGPEGMKGKPGARGLPGPPGQLGPEGDEgpmgppGVPGLEGQPGRKGFPGRPGLDGSKGEPG 967
Cdd:NF038329  267 -----EAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQN------GKDGLPGKDGKDGQPGKDGLPGKDGKDG 335

                  ....*..
gi 148699169  968 DPGRPGP 974
Cdd:NF038329  336 QPGKPAP 342
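
The hit description refers to collagen-like Gly-Xaa-Xaa repeats, and the query rows above show glycine at every third position. A minimal sketch for checking that periodicity on a segment follows. The function name `glycine_periodicity` is illustrative, and trying all three starting offsets is a simplification that ignores any interruptions within the repeat.

```python
def glycine_periodicity(segment):
    """Return (best_frame, fraction): the offset 0-2 whose every-third
    positions contain the highest fraction of glycine."""
    seq = segment.upper()
    best_frame, best_fraction = 0, 0.0
    for frame in range(3):
        positions = seq[frame::3]  # every third residue starting at `frame`
        if not positions:
            continue
        fraction = positions.count("G") / len(positions)
        if fraction > best_fraction:
            best_frame, best_fraction = frame, fraction
    return best_frame, best_fraction


# Query residues 728-807, copied from the first row of the alignment above.
segment = "FPGPVGDPGPKGSRGYIGLPGLFGLPGSDGERGLPGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGLPGPPGVLGLI"
frame, fraction = glycine_periodicity(segment)
print(frame, fraction)  # prints "2 1.0"
```

A fraction of 1.0 in one frame means the segment is an uninterrupted run of Xaa-Xaa-Gly triplets, which is the pattern the triple-helix description relies on.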
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
610-822 4.86e-22

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 101.52  E-value: 4.86e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  610 GPPGSKGDCGLPGPPGLPGLPGSPGARGPRGPPGPYGNPGPPGPPGAKGQKGDPGLSPGQAHDGAKGNMGLPGLSGNPGP 689
Cdd:NF038329  123 GPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGP 202
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  690 LGRKGHKGHPGAAGHPGEQGQPGPEGSPGA--------KGYPGRQGFPGPVGDPGPKGSRGYIGLPGLFGLPGSDGERGL 761
Cdd:NF038329  203 AGEQGPAGPAGPDGEAGPAGEDGPAGPAGDgqqgpdgdPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGP 282
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 148699169  762 PGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGLPGPPGVLGLIGDTGALGPIGYPGPK 822
Cdd:NF038329  283 VGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPK 343
LamG super family cl22861
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
43-221 2.29e-13

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


The actual alignment was detected with superfamily member smart00210:

Pssm-ID: 473984  Cd Length: 184  Bit Score: 70.46  E-value: 2.29e-13
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169     43 DVDVLQRLGL-----SWTKAGGGRSPTPpgvipfpsGFIFTQRAKLQAPTANVLPTTLGRELALVLSLCSHRVNHAFLFA 117
Cdd:smart00210    1 GQDLLQVFDLpslsfAIRQVVGPEPGSP--------AYRLGDPALVPQPTRDLFPSGLPEDFSLLTTFRQTPKSRGVLFA 72
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169    118 IRSRKHKLQLGLQFLPGRTIIHL------GPRQSVAF-DLDVHDGRWHHLALELRGRTVTMVTACGQHRVpVPLPSRRDS 190
Cdd:smart00210   73 IYDAQNVRQFGLEVDGRANTLLLryqgvdGKQHTVSFrNLPLADGQWHKLALSVSGSSATLYVDCNEIDS-RPLDRPGQP 151
                           170       180       190
                    ....*....|....*....|....*....|.
gi 148699169    191 MLDPQGSFLLGKVNPRAVQFEGALCQFSIHP 221
Cdd:smart00210  152 PIDTDGIEVRGAQAADRKPFQGDLQQLKIVC 182
Herpes_BLLF1 super family cl37540
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
269-567 7.57e-07

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


The actual alignment was detected with superfamily member pfam05109:

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 54.54  E-value: 7.57e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   269 GLGNLTRTPATLGArPVSRALAVTLAPAMPTKPLRTvhpdvsehSSSQTPLSPAKQSARKTPSPSSSASLANSTRVYRP- 347
Cdd:pfam05109  447 GLPSSTHVPTNLTA-PASTGPTVSTADVTSPTPAGT--------TSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPt 517
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   348 --AAAQPRQITATSPTKRSPTKPSVSPLSVTPMKSPHATQKTgvPSFTKPVP----PT-QKPAPFTSYLAPS-KASSPTV 419
Cdd:pfam05109  518 pnATSPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPT--PAVTTPTPnatiPTlGKTSPTSAVTTPTpNATSPTV 595
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   420 rpvqktfmtprppvpspqplrPTTGLSKKFTNPTVAKSkSKTTSWASKPVLARSSVpKTLQQTVLSQSPVSY-LGSQTLA 498
Cdd:pfam05109  596 ---------------------GETSPQANTTNHTLGGT-SSTPVVTSPPKNATSAV-TTGQHNITSSSTSSMsLRPSSIS 652
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 148699169   499 PALPPLGVGNPRTMPPTRDSAlTPAGSKKFTGRETSKKTRQKSSPRKPEPlSPGKSARDASPRD--LTTKP 567
Cdd:pfam05109  653 ETLSPSTSDNSTSHMPLLTSA-HPTGGENITQVTPASTSTHHVSTSSPAP-RPGTTSQASGPGNssTSTKP 721
 
Name Accession Description Interval E-value
COLFI pfam01410
Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
1646-1844 1.71e-55

Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.


Pssm-ID: 460199  Cd Length: 233  Bit Score: 193.33  E-value: 1.71e-55
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  1646 EIFKTLHYLSNLIQSIKTPLGTKENPARVCRDLMDCEQRMADGTYWVDPNLGCSSDTIEVSCNFTQGgQTCLKPITAS-- 1723
Cdd:pfam01410    4 EVMATLKSLSQQIENIRSPDGSKKNPARTCRDLKLCHPDWKSGEYWIDPNQGCTRDAIKVFCNFETG-ETCIYPTKASip 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  1724 --------------------KAEF---------AVSRVQMNFLHLLSSEGTQHITIHCLNMTVWQEGPGrSSARQAVRFR 1774
Cdd:pfam01410   83 rknwwtkeskhvwfgefmngGSQFsygvdgvgpSVAAVQLTFLRLLSTEASQNITYHCKNSVAYMDQAT-GNLKKALLLQ 161
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 148699169  1775 AWNGQVFEAGG--QFRPEVSMDGCKVHDGRWHQTLFTFRTQDPQQLPIVsvdNLPPVSSGK---QYRLEVGPACF 1844
Cdd:pfam01410  162 GSNDEEIRAEGnsRFTYTVLEDGCTKRTGQWGKTVIEYRTQKVSRLPIV---DIAPMDIGGadqEFGVEVGPVCF 233
COLFI smart00038
Fibrillar collagens C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
1645-1845 4.03e-52

Fibrillar collagens C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.


Pssm-ID: 197483  Cd Length: 232  Bit Score: 183.44  E-value: 4.03e-52
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   1645 GEIFKTLHYLSNLIQSIKTPLGTKENPARVCRDLMDCEQRMADGTYWVDPNLGCSSDTIEVSCNFTqGGQTCLKPITASK 1724
Cdd:smart00038    2 EEVFASLKSLNNQIEQLKSPTGSRKNPARTCKDLKLCHPEWKSGEYWVDPNQGCIRDAIKVFCNFE-TGETCVSPSPSSI 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   1725 A----------------------EFA--------VSRVQMNFLHLLSSEGTQHITIHCLNMTVWQEgPGRSSARQAVRFR 1774
Cdd:smart00038   81 PrktwysgkskhvwfgetmnggfKFSygdsegppVGVVQLTFLRLLSTEAHQNITYHCKNSVAYMD-EATGNLKKALRLR 159
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 148699169   1775 AWNGQVFEAGGQFRP--EVSMDGCKVHDGRWHQTLFTFRTQDPQQLPIVsvdNLPPVSSGKQYR---LEVGPACFL 1845
Cdd:smart00038  160 GSNDVELSAEGNSKFtyEVLEDGCQKRTGKWGKTVIEYRTKKTERLPIV---DIAPSDIGGPDQefgVEIGPVCFS 232
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1354-1584 5.44e-35

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 140.43  E-value: 5.44e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1354 EGVQGLRGEpGQQGQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGLQGLPGPRGVVGRQGPegtagsdgipg 1433
Cdd:NF038329  108 EGLQQLKGD-GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGP----------- 175
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1434 rdgrPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQGPPGFKGESGLPGQLGPPGKRGtEGGTGLPGNQGEPGSKGQP 1513
Cdd:NF038329  176 ----AGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQ 250
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 148699169 1514 GDSGEMGFPGVAGLFGPKGPPGDIGFKGIQGPRGPPGLMGKEGIIGPPGMLGPSGLPGPKGDRGSRGDLGL 1584
Cdd:NF038329  251 GPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQ 321
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1168-1455 1.08e-34

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 139.66  E-value: 1.08e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1168 GQRGEPGLEgdhGPVGPDGLKGDRGDPGPDGEHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQGEKGQRGAKGAKGH 1247
Cdd:NF038329  117 GEKGEPGPA---GPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGP 193
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1248 QGylgemgipgepgppgtpgpkgSRGTLGPTGAPGRMGAQGEPGLAGYNGhkgitgplgppgpkgEKGDQGEDGktegpp 1327
Cdd:NF038329  194 QG---------------------PRGETGPAGEQGPAGPAGPDGEAGPAG---------------EDGPAGPAG------ 231
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1328 gppgdrgpvgdRGDRGEPGDPGYPGQEGVQGLRGEPGQQGQpghPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGL 1407
Cdd:NF038329  232 -----------DGQQGPDGDPGPTGEDGPQGPDGPAGKDGP---RGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGL 297
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*...
gi 148699169 1408 QGLPGPRGVVGRQGPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGP 1455
Cdd:NF038329  298 PGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPKTP 345
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1313-1548 1.33e-34

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 139.27  E-value: 1.33e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1313 EKGDQGEDGktegPPGPPGDRGPVGDRGDRGEPGDPGYPGQEGVQGLRGEPGQQGQPGHPGPRGRPGPKGSKGEEGPKGK 1392
Cdd:NF038329  118 EKGEPGPAG----PAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGP 193
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1393 PGKAGPSGRRGTQGLQGLPGPRGVVGRQGPEGTAGSDGiPGRDGRPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQG 1472
Cdd:NF038329  194 QGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDG 272
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 148699169 1473 PPGFKGESGLPgqlGPPGKRGTEGGTGLPGNQGEPGSKGQPGDSGEMGFPGVAGLFGPKGPPGDIGFKGIQGPRGP 1548
Cdd:NF038329  273 PDGKDGERGPV---GPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPKTP 345
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
959-1234 1.51e-33

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 136.19  E-value: 1.51e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  959 LDGSKGEPGDPGRPGPVGEQGLMGFIGLVGEPGIVGEKGDRGvmgppgapgpkgsmghpgtpggignpgepgpwgPPGSR 1038
Cdd:NF038329  115 GDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQG---------------------------------ERGEK 161
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1039 GLPGMRGAKGHRGPRGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAAGERGHSGAKGflgipgpsgppgAKGLPGEPGs 1118
Cdd:NF038329  162 GPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAG------------PAGEDGPAG- 228
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1119 qgpqgpVGPPGEMGPKGPPGAVGEPGLPGDSGMKGDLGPLGPPGEQGLIGQRGEPGLEGDHGPVGPDGLKGDRGDPGPDG 1198
Cdd:NF038329  229 ------PAGDGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDG 302
                         250       260       270
                  ....*....|....*....|....*....|....*.
gi 148699169 1199 EHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQG 1234
Cdd:NF038329  303 KDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1033-1249 9.08e-31

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 127.71  E-value: 9.08e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1033 GPPGSRGLPGMRGAKGHRGPRGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAAGERGHSGAKGFLGIPGPSGPPGAKGL 1112
Cdd:NF038329  126 GPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGE 205
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1113 PGEPGSQGPQGPVGPPGEMGPKGPPGA--VGEPGLPGDSGMKGDLGPLGPPGEQGLIGQRGEPglegdhGPVGPDGLKGD 1190
Cdd:NF038329  206 QGPAGPAGPDGEAGPAGEDGPAGPAGDgqQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEA------GPDGPDGKDGE 279
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 148699169 1191 RGDPGPDGEHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQGEKGQRGAKGAKGHQG 1249
Cdd:NF038329  280 RGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
728-974 2.04e-28

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 120.78  E-value: 2.04e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  728 FPGPVGDPGPKGSRGYIGLPGLFGLPGSDGERGLPGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGLPGPPGVLGLI 807
Cdd:NF038329  115 GDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQ 194
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  808 GDTGALGPIGYPGPKGMKGLMGGVGEPGLKGDKGEQGvpgvSGDPGFQGDKGSHGLPGLPGGRGKPGPLGKAGDKGslgf 887
Cdd:NF038329  195 GPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG----DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRG---- 266
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  888 pgppgPEGFPGDIGPPGDNGPEGMKGKPGARGLPGPPGQLGPEGDEgpmgppGVPGLEGQPGRKGFPGRPGLDGSKGEPG 967
Cdd:NF038329  267 -----EAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQN------GKDGLPGKDGKDGQPGKDGLPGKDGKDG 335

                  ....*..
gi 148699169  968 DPGRPGP 974
Cdd:NF038329  336 QPGKPAP 342
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
640-859 1.01e-24

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 109.61  E-value: 1.01e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  640 GPPGPYGNPGPpgppgakgqKGDPGLSPGQAHDGAKGNMGLPGLSGNPGPLGRKGHKGHPGAAGHPGEQGQPGPEGSPGA 719
Cdd:NF038329  126 GPAGPAGEQGP---------RGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  720 KGYPGRQGFPGPVGDPGPKGSRGYIGLPGLFGLPGS--DGERGLPGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGL 797
Cdd:NF038329  197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDgqQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGK 276
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 148699169  798 PGPPGVLGLIGDTGALGPIGYPGPKGMKglmGGVGEPGLKGDKGEQGVPGVSGDPGFQGDKG 859
Cdd:NF038329  277 DGERGPVGPAGKDGQNGKDGLPGKDGKD---GQNGKDGLPGKDGKDGQPGKDGLPGKDGKDG 335
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
808-1093 2.11e-22

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 102.68  E-value: 2.11e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  808 GDTGALGPIGYPGPKGMKGLMGGVGEPGLKGDKGEQGVPGVSGDPGFQGdkgshglpglpggrgKPGPLGKAGDKgslgf 887
Cdd:NF038329  120 GEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQG---------------EAGPQGPAGKD----- 179
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  888 pgppgpegfpgdiGPPGDNGPEGMKGKPGARGLPGPPGQLGPEGDEGPMGPPGVPGLEGQPGRKGfPGRPGLDGSKGEPG 967
Cdd:NF038329  180 -------------GEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTG 245
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  968 DPGRPGPVGEQGLMGFIGLVGEPGIVGEKGDRGvmgppgapgpkgsmghpgtpggignpgepgpwgPPGSRGLPGMRGAK 1047
Cdd:NF038329  246 EDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDG---------------------------------ERGPVGPAGKDGQN 292
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*.
gi 148699169 1048 GHRGPRGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAAGERGHSG 1093
Cdd:NF038329  293 GKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
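Each hit record in this listing uses the same layout: name and accession, a truncated description, an "Interval E-value" line, the full description, and a "Pssm-ID" statistics line. The sketch below shows one way such lines could be read programmatically. The regular expression and the names `HitStats`, `parse_stats` and `parse_interval` are illustrative assumptions based only on how the lines look in this text dump; they are not part of any NCBI tool or API.

```python
import re
from typing import NamedTuple, Tuple


class HitStats(NamedTuple):
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Hypothetical pattern for the "Pssm-ID: ..." line as printed in this dump.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(\d+)\s*(\[Multi-domain\])?\s*"
    r"Cd Length:\s*(\d+)\s*"
    r"Bit Score:\s*([\d.]+)\s*"
    r"E-value:\s*(\S+)"
)


def parse_stats(line: str) -> HitStats:
    """Extract the numeric fields from one statistics line."""
    m = STATS_RE.search(line)
    if m is None:
        raise ValueError(f"not a Pssm-ID statistics line: {line!r}")
    return HitStats(
        pssm_id=int(m.group(1)),
        multi_domain=m.group(2) is not None,
        cd_length=int(m.group(3)),
        bit_score=float(m.group(4)),
        e_value=float(m.group(5)),
    )


def parse_interval(line: str) -> Tuple[int, int, float]:
    """Split an 'start-end e-value' line into (start, end, e_value)."""
    span, e_value = line.split()
    start, end = (int(x) for x in span.split("-"))
    return start, end, float(e_value)


stats = parse_stats(
    "Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 102.68  E-value: 2.11e-22"
)
start, end, e_value = parse_interval("808-1093 2.11e-22")
query_span = end - start + 1  # residues of the query covered by this hit
print(stats, query_span)
```

For the hit above, the query interval 808-1093 spans 286 residues, against a model (Cd) length of 440.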
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
610-822 4.86e-22

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2, streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 101.52  E-value: 4.86e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  610 GPPGSKGDCGLPGPPGLPGLPGSPGARGPRGPPGPYGNPGPPGPPGAKGQKGDPGLSPGQAHDGAKGNMGLPGLSGNPGP 689
Cdd:NF038329  123 GPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGP 202
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  690 LGRKGHKGHPGAAGHPGEQGQPGPEGSPGA--------KGYPGRQGFPGPVGDPGPKGSRGYIGLPGLFGLPGSDGERGL 761
Cdd:NF038329  203 AGEQGPAGPAGPDGEAGPAGEDGPAGPAGDgqqgpdgdPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGP 282
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 148699169  762 PGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGLPGPPGVLGLIGDTGALGPIGYPGPK 822
Cdd:NF038329  283 VGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPK 343
TSPN smart00210
Thrombospondin N-terminal-like domains; Heparin-binding and cell adhesion domain of ...
43-221 2.29e-13

Thrombospondin N-terminal-like domains; Heparin-binding and cell adhesion domain of thrombospondin


Pssm-ID: 214560  Cd Length: 184  Bit Score: 70.46  E-value: 2.29e-13
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169     43 DVDVLQRLGL-----SWTKAGGGRSPTPpgvipfpsGFIFTQRAKLQAPTANVLPTTLGRELALVLSLCSHRVNHAFLFA 117
Cdd:smart00210    1 GQDLLQVFDLpslsfAIRQVVGPEPGSP--------AYRLGDPALVPQPTRDLFPSGLPEDFSLLTTFRQTPKSRGVLFA 72
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169    118 IRSRKHKLQLGLQFLPGRTIIHL------GPRQSVAF-DLDVHDGRWHHLALELRGRTVTMVTACGQHRVpVPLPSRRDS 190
Cdd:smart00210   73 IYDAQNVRQFGLEVDGRANTLLLryqgvdGKQHTVSFrNLPLADGQWHKLALSVSGSSATLYVDCNEIDS-RPLDRPGQP 151
                           170       180       190
                    ....*....|....*....|....*....|.
gi 148699169    191 MLDPQGSFLLGKVNPRAVQFEGALCQFSIHP 221
Cdd:smart00210  152 PIDTDGIEVRGAQAADRKPFQGDLQQLKIVC 182
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1485-1585 2.47e-11

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2, streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 68.01  E-value: 2.47e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1485 QLGPPGKRGTEGGTGLPGNQGEPGSKGQPGDSGEMGFPGVAGLFGPKGPPGDIGFKGIQGPRGPPGLMGKEGIIGPPGML 1564
Cdd:NF038329  112 QLKGDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEK 191
                          90       100
                  ....*....|....*....|.
gi 148699169 1565 GPSGLPGPKGDRGSRGDLGLQ 1585
Cdd:NF038329  192 GPQGPRGETGPAGEQGPAGPA 212
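In the alignment blocks, each row carries a start coordinate, the aligned string and an end coordinate. Dashes mark gaps, and lowercase letters appear to mark query residues that are not aligned to the model. A minimal sketch for checking that the printed coordinates agree with the number of residues in a row, and for counting identical columns, is given below. The helper names are hypothetical, and the whitespace-based row parsing is an assumption about this text layout only.

```python
from typing import Tuple


def parse_row(line: str) -> Tuple[int, str, int]:
    """Return (start, aligned_string, end) from one alignment row.

    Assumes the last three whitespace-separated fields are
    start coordinate, aligned string, end coordinate.
    """
    parts = line.split()
    return int(parts[-3]), parts[-2], int(parts[-1])


def residue_count(aligned: str) -> int:
    """Count residues, ignoring gap characters."""
    return sum(1 for c in aligned if c != "-")


def coords_consistent(line: str) -> bool:
    """True if end - start + 1 equals the number of residues in the row."""
    start, aligned, end = parse_row(line)
    return end - start + 1 == residue_count(aligned)


def identical_columns(query: str, subject: str) -> int:
    """Count alignment columns where both rows carry the same residue."""
    return sum(
        1
        for q, s in zip(query, subject)
        if q != "-" and q.upper() == s.upper()
    )


query_row = "gi 148699169 1565 GPSGLPGPKGDRGSRGDLGLQ 1585"
model_row = "Cdd:NF038329  192 GPQGPRGETGPAGEQGPAGPA 212"

_, q_seq, _ = parse_row(query_row)
_, s_seq, _ = parse_row(model_row)
print(coords_consistent(query_row), coords_consistent(model_row))
print(identical_columns(q_seq, s_seq), "identical columns of", len(q_seq))
```

In this final 21-column block, most of the identical columns are the glycines at every third position, which is what one would expect when two Gly-Xaa-Xaa repeat regions are aligned.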
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
269-567 7.57e-07

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 54.54  E-value: 7.57e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   269 GLGNLTRTPATLGArPVSRALAVTLAPAMPTKPLRTvhpdvsehSSSQTPLSPAKQSARKTPSPSSSASLANSTRVYRP- 347
Cdd:pfam05109  447 GLPSSTHVPTNLTA-PASTGPTVSTADVTSPTPAGT--------TSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPt 517
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   348 --AAAQPRQITATSPTKRSPTKPSVSPLSVTPMKSPHATQKTgvPSFTKPVP----PT-QKPAPFTSYLAPS-KASSPTV 419
Cdd:pfam05109  518 pnATSPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPT--PAVTTPTPnatiPTlGKTSPTSAVTTPTpNATSPTV 595
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   420 rpvqktfmtprppvpspqplrPTTGLSKKFTNPTVAKSkSKTTSWASKPVLARSSVpKTLQQTVLSQSPVSY-LGSQTLA 498
Cdd:pfam05109  596 ---------------------GETSPQANTTNHTLGGT-SSTPVVTSPPKNATSAV-TTGQHNITSSSTSSMsLRPSSIS 652
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 148699169   499 PALPPLGVGNPRTMPPTRDSAlTPAGSKKFTGRETSKKTRQKSSPRKPEPlSPGKSARDASPRD--LTTKP 567
Cdd:pfam05109  653 ETLSPSTSDNSTSHMPLLTSA-HPTGGENITQVTPASTSTHHVSTSSPAP-RPGTTSQASGPGNssTSTKP 721
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
1409-1583 3.63e-06

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 51.57  E-value: 3.63e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1409 GLPGPRGVVGRQGPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQGPPGFKGESGLPGQLGP 1488
Cdd:COG5164    10 GPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQNQGG 89
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1489 PGKRGTEGGTGLPGNQGEPGSKGQPGDSGEMGFPGVAGLF--GPKGPPGDIGfkgiQGPRGpPGLMGKEGIIGPPGMLGP 1566
Cdd:COG5164    90 TRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPsgGSTTPPGDGG----STPPG-PGSTGPGGSTTPPGDGGS 164
                         170
                  ....*....|....*..
gi 148699169 1567 SGLPGPKGDRGSRGDLG 1583
Cdd:COG5164   165 TTPPGPGGSTTPPDDGG 181
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
745-801 2.03e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in the formation of connective tissue. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine; the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post-translationally modified by prolyl hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple-helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.64  E-value: 2.03e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 148699169   745 GLPGLFGLPGSDGERGLPGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGLPGPP 801
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
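The G-X-Y periodicity described for pfam01391 can be checked directly on the query segment aligned above (residues 745-801). The sketch below splits a segment into consecutive triplets and reports the fraction that begin with glycine. The function names and the `frame` parameter are illustrative choices, not part of any collagen-analysis library.

```python
from typing import List


def triplets(segment: str, frame: int = 0) -> List[str]:
    """Split a segment into complete, non-overlapping triplets from `frame`."""
    return [segment[i:i + 3] for i in range(frame, len(segment) - 2, 3)]


def gly_first_fraction(segment: str, frame: int = 0) -> float:
    """Fraction of triplets whose first residue is glycine."""
    trips = triplets(segment, frame)
    if not trips:
        return 0.0
    return sum(1 for t in trips if t[0] == "G") / len(trips)


# Query residues 745-801 as shown in the pfam01391 alignment above.
SEGMENT = "GLPGLFGLPGSDGERGLPGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGLPGPP"

for frame in range(3):
    print(frame, len(triplets(SEGMENT, frame)), gly_first_fraction(SEGMENT, frame))
```

Only one of the three reading frames should place glycine at the first position of every triplet. A break in that pattern within a longer region would indicate an interruption of the repeat.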
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1042-1096 2.70e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in the formation of connective tissue. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine; the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post-translationally modified by prolyl hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple-helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.25  E-value: 2.70e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 148699169  1042 GMRGAKGHRGPRGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAAGERGHSGAKG 1096
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
682-737 4.00e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in the formation of connective tissue. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine; the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post-translationally modified by prolyl hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple-helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.87  E-value: 4.00e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 148699169   682 GLSGNPGPLGRKGHKGHPGAAGHPGEQGQPGPEGSPGAKGYPGRQGFPGPVGDPGP 737
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
1177-1476 4.25e-05

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 48.10  E-value: 4.25e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1177 GDHGPVGPDGLKGDRGDPGPDGEHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQGEKGQRGAKGAKGHQGYLGEMGI 1256
Cdd:COG5164     7 GKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQN 86
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1257 PGEPGPPGTPGPKGSRGTLGPTGAPGRMGAQGEPGLAGYNGHKGiTGPLGPPgpkgekGDQGEDGKTEGPPGPPGDRGPV 1336
Cdd:COG5164    87 QGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPS-GGSTTPP------GDGGSTPPGPGSTGPGGSTTPP 159
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1337 GDRGDRGEPGDPGYPGQEGVQGlRGEPGQQGQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGtqGLQGLPGPRGV 1416
Cdd:COG5164   160 GDGGSTTPPGPGGSTTPPDDGG-STTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRG--GKTGPKDQRPK 236
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 148699169 1417 VGRQGPEGTAGSDGIPGRDGRPGYQGDQ--GNDGDPGPVGPAGRRGNPGVAGLPGAQGPPGF 1476
Cdd:COG5164   237 TNPIERRGPERPEAAALPAELTALEAENraANPEPATKTIPETTTVKDLATVLGKKGSDLVT 298
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
1340-1553 4.37e-05

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70-residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators uses the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 48.47  E-value: 4.37e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  1340 GDRGEPGDPGYPGQEGVQGLRGEPGQQGQPGHPGPrgrpgpkgskgEEGPKGKPGKAGPSGRRGTQGLQGlpGPRGVVGR 1419
Cdd:pfam09606  178 GGPGQGQAGGMNGGQQGPMGGQMPPQMGVPGMPGP-----------ADAGAQMGQQAQANGGMNPQQMGG--APNQVAMQ 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  1420 QGPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGPAGRRGNPGVAglPGAQGPPGFKGESGLPGQLGPPGKRGTEGGTG 1499
Cdd:pfam09606  245 QQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPMGPPGQQ--PGAMPNVMSIGDQNNYQQQQTRQQQQQQGGNH 322
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 148699169  1500 LPGNQGEPGSKGQPGdsGEMGFPGVAGLFGPkGPPGDIGFKGIQG-PRGPPGLMG 1553
Cdd:pfam09606  323 PAAHQQQMNQSVGQG--GQVVALGGLNHLET-WNPGNFGGLGANPmQRGQPGMMS 374
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
699-932 6.70e-05

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 47.72  E-value: 6.70e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  699 PGAAGHPGEQGQPGPEGSPGAKGYPGRQGFPGPVGDPGPKGSRGYiglpglfglPGSDGERGLPGVPGKRGEMGRPGFPG 778
Cdd:COG5164     6 PGKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQN---------QGSTTPAGNTGGTRPAGNQGATGPAQ 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  779 DFGERGPPGldgNPGEIGLPGPPGVLGLIGDTGALGPIGYPGPKGMKGLMGGVGEPGLKGDKGEQGVPGVSGDPGFQGDK 858
Cdd:COG5164    77 NQGGTTPAQ---NQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTPPGPGSTGPG 153
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 148699169  859 GSHGLPGLPGGRGKPGPLGKAGDKGSlgfpGPPGPEGFPGDIGPPGDNGPEGMKGKPGARGLPGPPGQLGPEGD 932
Cdd:COG5164   154 GSTTPPGDGGSTTPPGPGGSTTPPDD----GGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDD 223
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
285-577 7.87e-05

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 47.61  E-value: 7.87e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  285 VSRALAVTLAPAMPTKPLRTVHPdvsehssSQTPLSPAKQSARKTPSPSSSASLANSTRVyrPAAAQPRQITATSPtkrS 364
Cdd:PLN03209  302 VVEVIAETTAPLTPMEELLAKIP-------SQRVPPKESDAADGPKPVPTKPVTPEAPSP--PIEEEPPQPKAVVP---R 369
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  365 PTKPSVSPLSVTPMKSPHATQKTGVPSFTKPVPPTQKPAPFTSYLAPSKASS---PTVRPVQKTFMTPRPPVPSPQPLRP 441
Cdd:PLN03209  370 PLSPYTAYEDLKPPTSPIPTPPSSSPASSKSVDAVAKPAEPDVVPSPGSASNvpeVEPAQVEAKKTRPLSPYARYEDLKP 449
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  442 TTGLSKK---FTNPTVAKSKSKTTSWASKPVLARSSvpKTLQQTVLSQSPVSYLGSQTLAPALPPLGVGNPRTMPPTRDS 518
Cdd:PLN03209  450 PTSPSPTaptGVSPSVSSTSSVPAVPDTAPATAATD--AAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTN 527
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 148699169  519 ALTPAGSkkfTGRETSKKTRQKSSPRKPEPLSPGKSARDASPrdlttkPSRPsTPALVL 577
Cdd:PLN03209  528 EVVKVGN---SAPPTALADEQHHAQPKPRPLSPYTMYEDLKP------PTSP-TPSPVL 576
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
872-1087 1.66e-04

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 46.18  E-value: 1.66e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  872 KPGPLGKAGDKGSLGFPGPPGPEGFPGDIGPPGDNGPEGMKGKPGARGLPGPPGQLGPEGDEGPMGPPGVPGLEGQPGRK 951
Cdd:COG5164     8 KTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQNQ 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  952 GFPGRPGLDGSKGEPGDPGRPGPVGEQGLMGFIGLVG-----EPGIVGEKGDRGVMGPPGAPGPKGSMGHPGTPGGIGNP 1026
Cdd:COG5164    88 GGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGsttppSGGSTTPPGDGGSTPPGPGSTGPGGSTTPPGDGGSTTP 167
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 148699169 1027 GEPGPWGPPGSRGLPGMRGAKGHRGPRGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAAG 1087
Cdd:COG5164   168 PGPGGSTTPPDDGGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRGGKT 228
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1180-1235 2.10e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in the formation of connective tissue. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine; the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post-translationally modified by prolyl hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple-helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.94  E-value: 2.10e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 148699169  1180 GPVGPDGLKGDRGDPGPDGEHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQGE 1235
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca2+-mediated receptors that can have ...
124-205 7.63e-04

Laminin G domain; Laminin G-like domains are usually Ca2+-mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes, including signal transduction via cell-surface steroid receptors, and adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 42.02  E-value: 7.63e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  124 KLQLGLQFLPGRTIIHLGPRqsvafdldVHDGRWHHLALELRGRTVTMVTACGQHrVPVPLPsRRDSMLDPQGSFLLGKV 203
Cdd:cd00110    57 RLVLRYDLGSGSLVLSSKTP--------LNDGQWHSVSVERNGRSVTLSVDGERV-VESGSP-GGSALLNLDGPLYLGGL 126

                  ..
gi 148699169  204 NP 205
Cdd:cd00110   127 PE 128
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1361-1550 7.91e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.59  E-value: 7.91e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1361 GEPGQQGQPGHPGPRGRPGPKGS---KGEEGPKGKPGKAGPSGRRGTQGLQGLPGPRGVVGRQGPEGTAGSDGIPGRDGR 1437
Cdd:PRK07764  590 PAPGAAGGEGPPAPASSGPPEEAarpAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGW 669
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1438 PGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQGPPGfkgesglpgqlgPPGKRGTEGGTGLPGNQGEPGSKGQPGDSG 1517
Cdd:PRK07764  670 PAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAA------------TPPAGQADDPAAQPPQAAQGASAPSPAADD 737
                         170       180       190
                  ....*....|....*....|....*....|....
gi 148699169 1518 EMGFPGVAGLF-GPKGPPGDIGFKGIQGPRGPPG 1550
Cdd:PRK07764  738 PVPLPPEPDDPpDPAGAPAQPPPPPAPAPAAAPA 771
PHA03169 PHA03169
hypothetical protein; Provisional
1074-1235 1.86e-03

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 42.65  E-value: 1.86e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1074 RGRPGQPGQQGAAGErGHSGAKGFLGIPGPSGPPGAKGLPGEPGSqgpqgpvgPPGEMGPKGPPGAVGEPGLPGDsGMKG 1153
Cdd:PHA03169   81 HGEKEERGQGGPSGS-GSESVGSPTPSPSGSAEELASGLSPENTS--------GSSPESPASHSPPPSPPSHPGP-HEPA 150
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1154 DLGPLGPPGEQGLIGQRGEPGLEGDHGPVGPDGLKGDRGDPGPDGEHGEKGQEGLKG--EDGSPGPPGITGVPGREGKPG 1231
Cdd:PHA03169  151 PPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPpdEPGEPQSPTPQQAPSPNTQQA 230

                  ....
gi 148699169 1232 KQGE 1235
Cdd:PHA03169  231 VEHE 234
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
124-206 3.15e-03

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 39.33  E-value: 3.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   124 KLQLGLQFLPGRTIIHLGPRQsvafdldVHDGRWHHLALELRGRTVTMVTaCGQHRVPVPLPSRRDsMLDPQGSFLLGKV 203
Cdd:pfam02210   29 RLVLRYDLGSGPESLLSSGKN-------LNDGQWHSVRVERNGNTLTLSV-DGQTVVSSLPPGESL-LLNLNGPLYLGGL 99

                   ...
gi 148699169   204 NPR 206
Cdd:pfam02210  100 PPL 102
 
Name Accession Description Interval E-value
COLFI pfam01410
Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
1646-1844 1.71e-55

Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.


Pssm-ID: 460199  Cd Length: 233  Bit Score: 193.33  E-value: 1.71e-55
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  1646 EIFKTLHYLSNLIQSIKTPLGTKENPARVCRDLMDCEQRMADGTYWVDPNLGCSSDTIEVSCNFTQGgQTCLKPITAS-- 1723
Cdd:pfam01410    4 EVMATLKSLSQQIENIRSPDGSKKNPARTCRDLKLCHPDWKSGEYWIDPNQGCTRDAIKVFCNFETG-ETCIYPTKASip 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  1724 --------------------KAEF---------AVSRVQMNFLHLLSSEGTQHITIHCLNMTVWQEGPGrSSARQAVRFR 1774
Cdd:pfam01410   83 rknwwtkeskhvwfgefmngGSQFsygvdgvgpSVAAVQLTFLRLLSTEASQNITYHCKNSVAYMDQAT-GNLKKALLLQ 161
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 148699169  1775 AWNGQVFEAGG--QFRPEVSMDGCKVHDGRWHQTLFTFRTQDPQQLPIVsvdNLPPVSSGK---QYRLEVGPACF 1844
Cdd:pfam01410  162 GSNDEEIRAEGnsRFTYTVLEDGCTKRTGQWGKTVIEYRTQKVSRLPIV---DIAPMDIGGadqEFGVEVGPVCF 233
COLFI smart00038
Fibrillar collagens C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
1645-1845 4.03e-52

Fibrillar collagens C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.


Pssm-ID: 197483  Cd Length: 232  Bit Score: 183.44  E-value: 4.03e-52
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   1645 GEIFKTLHYLSNLIQSIKTPLGTKENPARVCRDLMDCEQRMADGTYWVDPNLGCSSDTIEVSCNFTqGGQTCLKPITASK 1724
Cdd:smart00038    2 EEVFASLKSLNNQIEQLKSPTGSRKNPARTCKDLKLCHPEWKSGEYWVDPNQGCIRDAIKVFCNFE-TGETCVSPSPSSI 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   1725 A----------------------EFA--------VSRVQMNFLHLLSSEGTQHITIHCLNMTVWQEgPGRSSARQAVRFR 1774
Cdd:smart00038   81 PrktwysgkskhvwfgetmnggfKFSygdsegppVGVVQLTFLRLLSTEAHQNITYHCKNSVAYMD-EATGNLKKALRLR 159
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 148699169   1775 AWNGQVFEAGGQFRP--EVSMDGCKVHDGRWHQTLFTFRTQDPQQLPIVsvdNLPPVSSGKQYR---LEVGPACFL 1845
Cdd:smart00038  160 GSNDVELSAEGNSKFtyEVLEDGCQKRTGKWGKTVIEYRTKKTERLPIV---DIAPSDIGGPDQefgVEIGPVCFS 232
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1354-1584 5.44e-35

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2, streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 140.43  E-value: 5.44e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1354 EGVQGLRGEpGQQGQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGLQGLPGPRGVVGRQGPegtagsdgipg 1433
Cdd:NF038329  108 EGLQQLKGD-GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGP----------- 175
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1434 rdgrPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQGPPGFKGESGLPGQLGPPGKRGtEGGTGLPGNQGEPGSKGQP 1513
Cdd:NF038329  176 ----AGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQ 250
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 148699169 1514 GDSGEMGFPGVAGLFGPKGPPGDIGFKGIQGPRGPPGLMGKEGIIGPPGMLGPSGLPGPKGDRGSRGDLGL 1584
Cdd:NF038329  251 GPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQ 321
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1168-1455 1.08e-34

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2, streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 139.66  E-value: 1.08e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1168 GQRGEPGLEgdhGPVGPDGLKGDRGDPGPDGEHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQGEKGQRGAKGAKGH 1247
Cdd:NF038329  117 GEKGEPGPA---GPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGP 193
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1248 QGylgemgipgepgppgtpgpkgSRGTLGPTGAPGRMGAQGEPGLAGYNGhkgitgplgppgpkgEKGDQGEDGktegpp 1327
Cdd:NF038329  194 QG---------------------PRGETGPAGEQGPAGPAGPDGEAGPAG---------------EDGPAGPAG------ 231
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1328 gppgdrgpvgdRGDRGEPGDPGYPGQEGVQGLRGEPGQQGQpghPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGL 1407
Cdd:NF038329  232 -----------DGQQGPDGDPGPTGEDGPQGPDGPAGKDGP---RGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGL 297
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*...
gi 148699169 1408 QGLPGPRGVVGRQGPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGP 1455
Cdd:NF038329  298 PGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPKTP 345
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1313-1548 1.33e-34

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 139.27  E-value: 1.33e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1313 EKGDQGEDGktegPPGPPGDRGPVGDRGDRGEPGDPGYPGQEGVQGLRGEPGQQGQPGHPGPRGRPGPKGSKGEEGPKGK 1392
Cdd:NF038329  118 EKGEPGPAG----PAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGP 193
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1393 PGKAGPSGRRGTQGLQGLPGPRGVVGRQGPEGTAGSDGiPGRDGRPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQG 1472
Cdd:NF038329  194 QGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDG 272
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 148699169 1473 PPGFKGESGLPgqlGPPGKRGTEGGTGLPGNQGEPGSKGQPGDSGEMGFPGVAGLFGPKGPPGDIGFKGIQGPRGP 1548
Cdd:NF038329  273 PDGKDGERGPV---GPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPKTP 345
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
959-1234 1.51e-33

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 136.19  E-value: 1.51e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  959 LDGSKGEPGDPGRPGPVGEQGLMGFIGLVGEPGIVGEKGDRGvmgppgapgpkgsmghpgtpggignpgepgpwgPPGSR 1038
Cdd:NF038329  115 GDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQG---------------------------------ERGEK 161
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1039 GLPGMRGAKGHRGPRGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAAGERGHSGAKGflgipgpsgppgAKGLPGEPGs 1118
Cdd:NF038329  162 GPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAG------------PAGEDGPAG- 228
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1119 qgpqgpVGPPGEMGPKGPPGAVGEPGLPGDSGMKGDLGPLGPPGEQGLIGQRGEPGLEGDHGPVGPDGLKGDRGDPGPDG 1198
Cdd:NF038329  229 ------PAGDGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDG 302
                         250       260       270
                  ....*....|....*....|....*....|....*.
gi 148699169 1199 EHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQG 1234
Cdd:NF038329  303 KDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1033-1249 9.08e-31

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 127.71  E-value: 9.08e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1033 GPPGSRGLPGMRGAKGHRGPRGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAAGERGHSGAKGFLGIPGPSGPPGAKGL 1112
Cdd:NF038329  126 GPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGE 205
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1113 PGEPGSQGPQGPVGPPGEMGPKGPPGA--VGEPGLPGDSGMKGDLGPLGPPGEQGLIGQRGEPglegdhGPVGPDGLKGD 1190
Cdd:NF038329  206 QGPAGPAGPDGEAGPAGEDGPAGPAGDgqQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEA------GPDGPDGKDGE 279
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 148699169 1191 RGDPGPDGEHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQGEKGQRGAKGAKGHQG 1249
Cdd:NF038329  280 RGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
728-974 2.04e-28

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 120.78  E-value: 2.04e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  728 FPGPVGDPGPKGSRGYIGLPGLFGLPGSDGERGLPGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGLPGPPGVLGLI 807
Cdd:NF038329  115 GDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQ 194
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  808 GDTGALGPIGYPGPKGMKGLMGGVGEPGLKGDKGEQGvpgvSGDPGFQGDKGSHGLPGLPGGRGKPGPLGKAGDKGslgf 887
Cdd:NF038329  195 GPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG----DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRG---- 266
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  888 pgppgPEGFPGDIGPPGDNGPEGMKGKPGARGLPGPPGQLGPEGDEgpmgppGVPGLEGQPGRKGFPGRPGLDGSKGEPG 967
Cdd:NF038329  267 -----EAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQN------GKDGLPGKDGKDGQPGKDGLPGKDGKDG 335

                  ....*..
gi 148699169  968 DPGRPGP 974
Cdd:NF038329  336 QPGKPAP 342
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
640-859 1.01e-24

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 109.61  E-value: 1.01e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  640 GPPGPYGNPGPpgppgakgqKGDPGLSPGQAHDGAKGNMGLPGLSGNPGPLGRKGHKGHPGAAGHPGEQGQPGPEGSPGA 719
Cdd:NF038329  126 GPAGPAGEQGP---------RGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  720 KGYPGRQGFPGPVGDPGPKGSRGYIGLPGLFGLPGS--DGERGLPGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGL 797
Cdd:NF038329  197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDgqQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGK 276
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 148699169  798 PGPPGVLGLIGDTGALGPIGYPGPKGMKglmGGVGEPGLKGDKGEQGVPGVSGDPGFQGDKG 859
Cdd:NF038329  277 DGERGPVGPAGKDGQNGKDGLPGKDGKD---GQNGKDGLPGKDGKDGQPGKDGLPGKDGKDG 335
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
808-1093 2.11e-22

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 102.68  E-value: 2.11e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  808 GDTGALGPIGYPGPKGMKGLMGGVGEPGLKGDKGEQGVPGVSGDPGFQGdkgshglpglpggrgKPGPLGKAGDKgslgf 887
Cdd:NF038329  120 GEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQG---------------EAGPQGPAGKD----- 179
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  888 pgppgpegfpgdiGPPGDNGPEGMKGKPGARGLPGPPGQLGPEGDEGPMGPPGVPGLEGQPGRKGfPGRPGLDGSKGEPG 967
Cdd:NF038329  180 -------------GEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTG 245
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  968 DPGRPGPVGEQGLMGFIGLVGEPGIVGEKGDRGvmgppgapgpkgsmghpgtpggignpgepgpwgPPGSRGLPGMRGAK 1047
Cdd:NF038329  246 EDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDG---------------------------------ERGPVGPAGKDGQN 292
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*.
gi 148699169 1048 GHRGPRGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAAGERGHSG 1093
Cdd:NF038329  293 GKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
610-822 4.86e-22

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 101.52  E-value: 4.86e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  610 GPPGSKGDCGLPGPPGLPGLPGSPGARGPRGPPGPYGNPGPPGPPGAKGQKGDPGLSPGQAHDGAKGNMGLPGLSGNPGP 689
Cdd:NF038329  123 GPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGP 202
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  690 LGRKGHKGHPGAAGHPGEQGQPGPEGSPGA--------KGYPGRQGFPGPVGDPGPKGSRGYIGLPGLFGLPGSDGERGL 761
Cdd:NF038329  203 AGEQGPAGPAGPDGEAGPAGEDGPAGPAGDgqqgpdgdPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGP 282
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 148699169  762 PGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGLPGPPGVLGLIGDTGALGPIGYPGPK 822
Cdd:NF038329  283 VGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPK 343
TSPN smart00210
Thrombospondin N-terminal-like domains; Heparin-binding and cell adhesion domain of ...
43-221 2.29e-13

Thrombospondin N-terminal-like domains; Heparin-binding and cell adhesion domain of thrombospondin


Pssm-ID: 214560  Cd Length: 184  Bit Score: 70.46  E-value: 2.29e-13
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169     43 DVDVLQRLGL-----SWTKAGGGRSPTPpgvipfpsGFIFTQRAKLQAPTANVLPTTLGRELALVLSLCSHRVNHAFLFA 117
Cdd:smart00210    1 GQDLLQVFDLpslsfAIRQVVGPEPGSP--------AYRLGDPALVPQPTRDLFPSGLPEDFSLLTTFRQTPKSRGVLFA 72
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169    118 IRSRKHKLQLGLQFLPGRTIIHL------GPRQSVAF-DLDVHDGRWHHLALELRGRTVTMVTACGQHRVpVPLPSRRDS 190
Cdd:smart00210   73 IYDAQNVRQFGLEVDGRANTLLLryqgvdGKQHTVSFrNLPLADGQWHKLALSVSGSSATLYVDCNEIDS-RPLDRPGQP 151
                           170       180       190
                    ....*....|....*....|....*....|.
gi 148699169    191 MLDPQGSFLLGKVNPRAVQFEGALCQFSIHP 221
Cdd:smart00210  152 PIDTDGIEVRGAQAADRKPFQGDLQQLKIVC 182
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1485-1585 2.47e-11

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 68.01  E-value: 2.47e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1485 QLGPPGKRGTEGGTGLPGNQGEPGSKGQPGDSGEMGFPGVAGLFGPKGPPGDIGFKGIQGPRGPPGLMGKEGIIGPPGML 1564
Cdd:NF038329  112 QLKGDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEK 191
                          90       100
                  ....*....|....*....|.
gi 148699169 1565 GPSGLPGPKGDRGSRGDLGLQ 1585
Cdd:NF038329  192 GPQGPRGETGPAGEQGPAGPA 212
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
269-567 7.57e-07

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 54.54  E-value: 7.57e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   269 GLGNLTRTPATLGArPVSRALAVTLAPAMPTKPLRTvhpdvsehSSSQTPLSPAKQSARKTPSPSSSASLANSTRVYRP- 347
Cdd:pfam05109  447 GLPSSTHVPTNLTA-PASTGPTVSTADVTSPTPAGT--------TSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPt 517
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   348 --AAAQPRQITATSPTKRSPTKPSVSPLSVTPMKSPHATQKTgvPSFTKPVP----PT-QKPAPFTSYLAPS-KASSPTV 419
Cdd:pfam05109  518 pnATSPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPT--PAVTTPTPnatiPTlGKTSPTSAVTTPTpNATSPTV 595
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   420 rpvqktfmtprppvpspqplrPTTGLSKKFTNPTVAKSkSKTTSWASKPVLARSSVpKTLQQTVLSQSPVSY-LGSQTLA 498
Cdd:pfam05109  596 ---------------------GETSPQANTTNHTLGGT-SSTPVVTSPPKNATSAV-TTGQHNITSSSTSSMsLRPSSIS 652
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 148699169   499 PALPPLGVGNPRTMPPTRDSAlTPAGSKKFTGRETSKKTRQKSSPRKPEPlSPGKSARDASPRD--LTTKP 567
Cdd:pfam05109  653 ETLSPSTSDNSTSHMPLLTSA-HPTGGENITQVTPASTSTHHVSTSSPAP-RPGTTSQASGPGNssTSTKP 721
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
1409-1583 3.63e-06

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 51.57  E-value: 3.63e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1409 GLPGPRGVVGRQGPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQGPPGFKGESGLPGQLGP 1488
Cdd:COG5164    10 GPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQNQGG 89
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1489 PGKRGTEGGTGLPGNQGEPGSKGQPGDSGEMGFPGVAGLF--GPKGPPGDIGfkgiQGPRGpPGLMGKEGIIGPPGMLGP 1566
Cdd:COG5164    90 TRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPsgGSTTPPGDGG----STPPG-PGSTGPGGSTTPPGDGGS 164
                         170
                  ....*....|....*..
gi 148699169 1567 SGLPGPKGDRGSRGDLG 1583
Cdd:COG5164   165 TTPPGPGGSTTPPDDGG 181
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
1427-1583 4.66e-06

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 51.57  E-value: 4.66e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1427 GSDGIPGRDGRPGYQGDQGNDGDPGPVG---PAGRRGNPGVAGLPGAQGPPGFKGESGLPGQLGPPGKRGTEGGTGLPGN 1503
Cdd:COG5164     7 GKTGPSDPGGVTTPAGSQGSTKPAQNQGstrPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQN 86
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1504 qgePGSKGQPGDSGEMGFPGVAGLFGPKGPPGDIGFKGIQGPRGPP--GLMGKEGIIGP----PGMLGPSGLPGPKGDRG 1577
Cdd:COG5164    87 ---QGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPsgGSTTPPGDGGStppgPGSTGPGGSTTPPGDGG 163

                  ....*.
gi 148699169 1578 SRGDLG 1583
Cdd:COG5164   164 STTPPG 169
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
308-579 1.56e-05

Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 49.57  E-value: 1.56e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   308 DVSEHSSSQTPLSPAKQSARKTPSPSSSASLANstrvyrpAAAQPRQITATSPTKRSPTKP---SVSPLSVTPMKSPHAT 384
Cdd:pfam17823  108 DGAASRALAAAASSSPSSAAQSLPAAIAALPSE-------AFSAPRAAACRANASAAPRAAiaaASAPHAASPAPRTAAS 180
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   385 QKTGVPSFT-KPVPPTQKPAPFTSYLAP----SKASSPTVRPVQKTFMTP-RPPVPSPQPLRPTTGLSKKFTNPTVAKSK 458
Cdd:pfam17823  181 STTAASSTTaASSAPTTAASSAPATLTPargiSTAATATGHPAAGTALAAvGNSSPAAGTVTAAVGTVTPAALATLAAAA 260
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   459 SKTTSWA-----SKPVLARSSVPKTLQQTVLSQSPVSYLGSQTLAPAL-----PPLGVGNPRTMPPTRDSALTPAGSKKF 528
Cdd:pfam17823  261 GTVASAAgtinmGDPHARRLSPAKHMPSDTMARNPAAPMGAQAQGPIIqvstdQPVHNTAGEPTPSPSNTTLEPNTPKSV 340
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 148699169   529 TGRE----TSKKTRQKSSPRKPEPLSPGKSARDASPRDLTTKPS------RPSTPALVLAP 579
Cdd:pfam17823  341 ASTNlavvTTTKAQAKEPSASPVPVLHTSMIPEVEATSPTTQPSpllptqGAAGPGILLAP 401
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
745-801 2.03e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine; the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post-translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.64  E-value: 2.03e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 148699169   745 GLPGLFGLPGSDGERGLPGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGLPGPP 801
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
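
The G-X-Y periodicity described above can be checked directly on the aligned query segment. The following is a minimal Python sketch, not part of the CD-Search output; the helper names gxy_triplets and glycine_fraction are hypothetical, and the sequence is copied from the query row (residues 745-801) of the alignment above. It assumes the segment starts in frame, with glycine at position 1 of each triplet.

```python
def gxy_triplets(seq):
    """Split a sequence into consecutive, non-overlapping triplets.

    Trailing residues that do not fill a whole triplet are ignored.
    """
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def glycine_fraction(seq):
    """Return the fraction of triplets whose first residue is glycine (G)."""
    triplets = gxy_triplets(seq)
    if not triplets:
        return 0.0
    return sum(t[0] == "G" for t in triplets) / len(triplets)


# Query segment 745-801 of gi 148699169, as shown in the alignment above.
query = "GLPGLFGLPGSDGERGLPGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGLPGPP"

if __name__ == "__main__":
    print(len(gxy_triplets(query)), glycine_fraction(query))
```

A fraction close to 1.0 indicates an uninterrupted Gly-X-Y repeat. Segments that begin out of frame, or that contain interruptions, would need the frame shifted before the same check is applied.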
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1042-1096 2.70e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine; the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post-translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.25  E-value: 2.70e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 148699169  1042 GMRGAKGHRGPRGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAAGERGHSGAKG 1096
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
682-737 4.00e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine; the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post-translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.87  E-value: 4.00e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 148699169   682 GLSGNPGPLGRKGHKGHPGAAGHPGEQGQPGPEGSPGAKGYPGRQGFPGPVGDPGP 737
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
1177-1476 4.25e-05

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 48.10  E-value: 4.25e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1177 GDHGPVGPDGLKGDRGDPGPDGEHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQGEKGQRGAKGAKGHQGYLGEMGI 1256
Cdd:COG5164     7 GKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQN 86
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1257 PGEPGPPGTPGPKGSRGTLGPTGAPGRMGAQGEPGLAGYNGHKGiTGPLGPPgpkgekGDQGEDGKTEGPPGPPGDRGPV 1336
Cdd:COG5164    87 QGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPS-GGSTTPP------GDGGSTPPGPGSTGPGGSTTPP 159
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1337 GDRGDRGEPGDPGYPGQEGVQGlRGEPGQQGQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGtqGLQGLPGPRGV 1416
Cdd:COG5164   160 GDGGSTTPPGPGGSTTPPDDGG-STTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRG--GKTGPKDQRPK 236
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 148699169 1417 VGRQGPEGTAGSDGIPGRDGRPGYQGDQ--GNDGDPGPVGPAGRRGNPGVAGLPGAQGPPGF 1476
Cdd:COG5164   237 TNPIERRGPERPEAAALPAELTALEAENraANPEPATKTIPETTTVKDLATVLGKKGSDLVT 298
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
1340-1553 4.37e-05

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 48.47  E-value: 4.37e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  1340 GDRGEPGDPGYPGQEGVQGLRGEPGQQGQPGHPGPrgrpgpkgskgEEGPKGKPGKAGPSGRRGTQGLQGlpGPRGVVGR 1419
Cdd:pfam09606  178 GGPGQGQAGGMNGGQQGPMGGQMPPQMGVPGMPGP-----------ADAGAQMGQQAQANGGMNPQQMGG--APNQVAMQ 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  1420 QGPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGPAGRRGNPGVAglPGAQGPPGFKGESGLPGQLGPPGKRGTEGGTG 1499
Cdd:pfam09606  245 QQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPMGPPGQQ--PGAMPNVMSIGDQNNYQQQQTRQQQQQQGGNH 322
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 148699169  1500 LPGNQGEPGSKGQPGdsGEMGFPGVAGLFGPkGPPGDIGFKGIQG-PRGPPGLMG 1553
Cdd:pfam09606  323 PAAHQQQMNQSVGQG--GQVVALGGLNHLET-WNPGNFGGLGANPmQRGQPGMMS 374
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1358-1414 6.53e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine; the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post-translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.10  E-value: 6.53e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 148699169  1358 GLRGEPGQQGQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGLQGLPGPR 1414
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
699-932 6.70e-05

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 47.72  E-value: 6.70e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  699 PGAAGHPGEQGQPGPEGSPGAKGYPGRQGFPGPVGDPGPKGSRGYiglpglfglPGSDGERGLPGVPGKRGEMGRPGFPG 778
Cdd:COG5164     6 PGKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQN---------QGSTTPAGNTGGTRPAGNQGATGPAQ 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  779 DFGERGPPGldgNPGEIGLPGPPGVLGLIGDTGALGPIGYPGPKGMKGLMGGVGEPGLKGDKGEQGVPGVSGDPGFQGDK 858
Cdd:COG5164    77 NQGGTTPAQ---NQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTPPGPGSTGPG 153
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 148699169  859 GSHGLPGLPGGRGKPGPLGKAGDKGSlgfpGPPGPEGFPGDIGPPGDNGPEGMKGKPGARGLPGPPGQLGPEGD 932
Cdd:COG5164   154 GSTTPPGDGGSTTPPGPGGSTTPPDD----GGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDD 223
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1346-1400 7.87e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine; the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post-translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.10  E-value: 7.87e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 148699169  1346 GDPGYPGQEGVQGLRGEPGQQGQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSG 1400
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
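
The G-X-Y periodicity described for pfam01391 can be checked directly on the aligned query segment above (residues 1346-1400). The following is a minimal sketch, not part of the CD-Search output; the function name `gxy_fraction` and the frame-scanning approach are illustrative assumptions.

```python
def gxy_fraction(seq: str) -> float:
    """Return the highest fraction of triplets that start with glycine,
    taken over the three possible reading frames of the sequence."""
    best = 0.0
    for frame in range(3):
        # Cut the sequence into complete, non-overlapping triplets from this frame.
        triplets = [seq[i:i + 3] for i in range(frame, len(seq) - 2, 3)]
        if not triplets:
            continue
        glycine_first = sum(1 for t in triplets if t[0] == "G")
        best = max(best, glycine_first / len(triplets))
    return best


# Query residues 1346-1400, copied from the alignment above.
query_segment = "GDPGYPGQEGVQGLRGEPGQQGQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSG"
print(gxy_fraction(query_segment))
```

For this segment every complete triplet in the first frame begins with glycine, which matches the repeat pattern the family description gives.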
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
285-577 7.87e-05

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 47.61  E-value: 7.87e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  285 VSRALAVTLAPAMPTKPLRTVHPdvsehssSQTPLSPAKQSARKTPSPSSSASLANSTRVyrPAAAQPRQITATSPtkrS 364
Cdd:PLN03209  302 VVEVIAETTAPLTPMEELLAKIP-------SQRVPPKESDAADGPKPVPTKPVTPEAPSP--PIEEEPPQPKAVVP---R 369
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  365 PTKPSVSPLSVTPMKSPHATQKTGVPSFTKPVPPTQKPAPFTSYLAPSKASS---PTVRPVQKTFMTPRPPVPSPQPLRP 441
Cdd:PLN03209  370 PLSPYTAYEDLKPPTSPIPTPPSSSPASSKSVDAVAKPAEPDVVPSPGSASNvpeVEPAQVEAKKTRPLSPYARYEDLKP 449
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  442 TTGLSKK---FTNPTVAKSKSKTTSWASKPVLARSSvpKTLQQTVLSQSPVSYLGSQTLAPALPPLGVGNPRTMPPTRDS 518
Cdd:PLN03209  450 PTSPSPTaptGVSPSVSSTSSVPAVPDTAPATAATD--AAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTN 527
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 148699169  519 ALTPAGSkkfTGRETSKKTRQKSSPRKPEPLSPGKSARDASPrdlttkPSRPsTPALVL 577
Cdd:PLN03209  528 EVVKVGN---SAPPTALADEQHHAQPKPRPLSPYTMYEDLKP------PTSP-TPSPVL 576
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
712-768 9.77e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 41.71  E-value: 9.77e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 148699169   712 GPEGSPGAKGYPGRQGFPGPVGDPGPKGSRGYIGLPGLFGLPGSDGERGLPGVPGKR 768
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
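
Each hit in this listing carries a summary line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. The sketch below pulls those fields out of such a line. It assumes the layout seen in this listing only; the regular expression and the `parse_summary` helper are hypothetical and are not an NCBI API.

```python
import re

# Pattern inferred from the summary lines in this listing (an assumption,
# not an official format specification). The bracketed flag is optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str):
    """Parse one hit summary line into a dict, or return None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "flag": m.group("flag"),  # e.g. "Multi-domain", or None when absent
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


example = "Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 41.71  E-value: 9.77e-05"
print(parse_summary(example))
```

Parsed records can then be sorted or filtered by `e_value`, for example to separate the overlapping pfam01391 hits from the weaker low-complexity matches.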
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1433-1489 1.11e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 41.71  E-value: 1.11e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 148699169  1433 GRDGRPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQGPPGFKGESGLPGQLGPP 1489
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
676-732 1.27e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 41.33  E-value: 1.27e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 148699169   676 GNMGLPGLSGNPGPLGRKGHKGHPGAAGHPGEQGQPGPEGSPGAKGYPGRQGFPGPV 732
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1361-1415 1.36e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 41.33  E-value: 1.36e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 148699169  1361 GEPGQQGQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGLQGLPGPRG 1415
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1370-1424 1.45e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 41.33  E-value: 1.45e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 148699169  1370 GHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGLQGLPGPRGVVGRQGPEG 1424
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1421-1475 1.61e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.94  E-value: 1.61e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 148699169  1421 GPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQGPPG 1475
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1367-1422 1.65e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.94  E-value: 1.65e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 148699169  1367 GQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGLQGLPGPRGVVGRQGP 1422
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
872-1087 1.66e-04

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 46.18  E-value: 1.66e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  872 KPGPLGKAGDKGSLGFPGPPGPEGFPGDIGPPGDNGPEGMKGKPGARGLPGPPGQLGPEGDEGPMGPPGVPGLEGQPGRK 951
Cdd:COG5164     8 KTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQNQ 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  952 GFPGRPGLDGSKGEPGDPGRPGPVGEQGLMGFIGLVG-----EPGIVGEKGDRGVMGPPGAPGPKGSMGHPGTPGGIGNP 1026
Cdd:COG5164    88 GGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGsttppSGGSTTPPGDGGSTPPGPGSTGPGGSTTPPGDGGSTTP 167
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 148699169 1027 GEPGPWGPPGSRGLPGMRGAKGHRGPRGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAAG 1087
Cdd:COG5164   168 PGPGGSTTPPDDGGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRGGKT 228
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1180-1235 2.10e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.94  E-value: 2.10e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 148699169  1180 GPVGPDGLKGDRGDPGPDGEHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQGE 1235
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1448-1502 2.19e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.94  E-value: 2.19e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 148699169  1448 GDPGPVGPAGRRGNPGVAGLPGAQGPPGFKGESGLPGQLGPPGKRGTEGGTGLPG 1502
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
PHA03247 PHA03247
large tegument protein UL36; Provisional
294-573 2.68e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 2.68e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  294 APAMPTK-PLRTVHPDVSEHSSSQTPLSPAKQSARKTPSPSSSASLANSTRVYRP--AAAQPRQITATSPT---KRSPTK 367
Cdd:PHA03247 2610 GPAPPSPlPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPrrARRLGRAAQASSPPqrpRRRAAR 2689
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  368 PSVSPLS---------VTPMKSPHATQkTGVPSFTKPV-------PPTQKPAPFTSYLAPSKASSPTVRPVQKTFMTPRP 431
Cdd:PHA03247 2690 PTVGSLTsladpppppPTPEPAPHALV-SATPLPPGPAaarqaspALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPA 2768
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  432 PVPSPQplrPTTGLSKKFTNPTVAKSKSKTTSWASKPVLARSSVPKTLQQTVL--SQSPVSYL----GSQTLAPALPPLG 505
Cdd:PHA03247 2769 PAPPAA---PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALppAASPAGPLppptSAQPTAPPPPPGP 2845
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  506 VGNPRTM-------------PPTRDSALTPAGSKKFTGRETSKKTRQKSS------PRKPEPLSPGKSARDASPRDLTTK 566
Cdd:PHA03247 2846 PPPSLPLggsvapggdvrrrPPSRSPAAKPAAPARPPVRRLARPAVSRSTesfalpPDQPERPPQPQAPPPPQPQPQPPP 2925

                  ....*..
gi 148699169  567 PSRPSTP 573
Cdd:PHA03247 2926 PPQPQPP 2932
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
694-748 2.83e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.55  E-value: 2.83e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 148699169   694 GHKGHPGAAGHPGEQGQPGPEGSPGAKGYPGRQGFPGPVGDPGPKGSRGYIGLPG 748
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1487-1541 2.94e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.55  E-value: 2.94e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 148699169  1487 GPPGKRGTEGGTGLPGNQGEPGSKGQPGDSGEMGFPGVAGLFGPKGPPGDIGFKG 1541
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1466-1520 3.18e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.17  E-value: 3.18e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 148699169  1466 GLPGAQGPPGFKGESGLPGQLGPPGKRGTEGGTGLPGNQGEPGSKGQPGDSGEMG 1520
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1460-1514 3.72e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.17  E-value: 3.72e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 148699169  1460 GNPGVAGLPGAQGPPGFKGESGLPGQLGPPGKRGTEGGTGLPGNQGEPGSKGQPG 1514
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
966-1230 4.01e-04

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 45.02  E-value: 4.01e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  966 PGDPGRPGPVGEQGLMGFIGLVGEPGIVGEKGDRGVMGPPGAPGPKGSMGHPGTPGGignpgePGPWGPPGSRGLPGMRG 1045
Cdd:COG5164     6 PGKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGG------TRPAGNQGATGPAQNQG 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1046 AKGHRGPRGPDGPAGEQGSKGlkgRVGPRGRPGQPGQQGAAGERGHSGAKGFLGIPGPSGPPGAKGLPGEPGSQGPQGPV 1125
Cdd:COG5164    80 GTTPAQNQGGTRPAGNTGGTT---PAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTPPGPGSTGPGGST 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1126 GPPGEMGPKGPPGAVGEPGLPGD--SGMKGDLGPLGPPGEQGLIGQRGEPGLEGDHGPVGPDGLKGDR-GDPGPDGEHGE 1202
Cdd:COG5164   157 TPPGDGGSTTPPGPGGSTTPPDDggSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRgGKTGPKDQRPK 236
                         250       260
                  ....*....|....*....|....*...
gi 148699169 1203 KGQEGLKGEDGSPGPPGITGVPGREGKP 1230
Cdd:COG5164   237 TNPIERRGPERPEAAALPAELTALEAEN 264
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
685-741 4.27e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.78  E-value: 4.27e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 148699169   685 GNPGPLGRKGHKGHPGAAGHPGEQGQPGPEGSPGAKGYPGRQGFPGPVGDPGPKGSR 741
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1505-1561 6.65e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.40  E-value: 6.65e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 148699169  1505 GEPGSKGQPGDSGEMGFPGVAGLFGPKGPPGDIGFKGIQGPRGPPGLMGKEGIIGPP 1561
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
124-205 7.63e-04

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 42.02  E-value: 7.63e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  124 KLQLGLQFLPGRTIIHLGPRqsvafdldVHDGRWHHLALELRGRTVTMVTACGQHrVPVPLPsRRDSMLDPQGSFLLGKV 203
Cdd:cd00110    57 RLVLRYDLGSGSLVLSSKTP--------LNDGQWHSVSVERNGRSVTLSVDGERV-VESGSP-GGSALLNLDGPLYLGGL 126

                  ..
gi 148699169  204 NP 205
Cdd:cd00110   127 PE 128
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1361-1550 7.91e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.59  E-value: 7.91e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1361 GEPGQQGQPGHPGPRGRPGPKGS---KGEEGPKGKPGKAGPSGRRGTQGLQGLPGPRGVVGRQGPEGTAGSDGIPGRDGR 1437
Cdd:PRK07764  590 PAPGAAGGEGPPAPASSGPPEEAarpAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGW 669
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1438 PGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQGPPGfkgesglpgqlgPPGKRGTEGGTGLPGNQGEPGSKGQPGDSG 1517
Cdd:PRK07764  670 PAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAA------------TPPAGQADDPAAQPPQAAQGASAPSPAADD 737
                         170       180       190
                  ....*....|....*....|....*....|....
gi 148699169 1518 EMGFPGVAGLF-GPKGPPGDIGFKGIQGPRGPPG 1550
Cdd:PRK07764  738 PVPLPPEPDDPpDPAGAPAQPPPPPAPAPAAAPA 771
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
273-572 8.01e-04

Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 44.18  E-value: 8.01e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   273 LTRTPATLGARPVSR-ALAVTLAPAMPTKPLRTVHPDVSEHSSSQTPLSPAKQSARKTPSPSSSASLANSTRVyrpaAAQ 351
Cdd:pfam17823  162 IAAASAPHAASPAPRtAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNS----SPA 237
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   352 PRQITATSPTKRSPTKPSVSPlSVTPMKSPHATQKTGVPSFTKPVPPTQKPAPfTSYLAPSKASSP----------TVRP 421
Cdd:pfam17823  238 AGTVTAAVGTVTPAALATLAA-AAGTVASAAGTINMGDPHARRLSPAKHMPSD-TMARNPAAPMGAqaqgpiiqvsTDQP 315
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   422 VQKTfMTPRPPVPSPQPLRPTTGLSKKFTNPTV---AKSKSKTTSWASKPVLARSSVPKtlqqtVLSQSPVSylgsqtla 498
Cdd:pfam17823  316 VHNT-AGEPTPSPSNTTLEPNTPKSVASTNLAVvttTKAQAKEPSASPVPVLHTSMIPE-----VEATSPTT-------- 381
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 148699169   499 palpplgvgNPRTMPPTRDSAltpagskkftGRETSKKTRQKSSPRKPEPLSPGKSARDA-SPRDLTTKPSRPST 572
Cdd:pfam17823  382 ---------QPSPLLPTQGAA----------GPGILLAPEQVATEATAGTASAGPTPRSSgDPKTLAMASCQLST 437
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
688-742 8.02e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.01  E-value: 8.02e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 148699169   688 GPLGRKGHKGHPGAAGHPGEQGQPGPEGSPGAKGYPGRQGFPGPVGDPGPKGSRG 742
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
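
The G-X-Y periodicity described for this family can be checked directly on the query segment shown in the alignment above. The following is a minimal sketch, not part of the CD-Search output: the function name `gly_x_y_triplets` and its `frame` parameter are hypothetical, and the code assumes the segment starts in frame with the repeat, as this one does.

```python
def gly_x_y_triplets(segment: str, frame: int = 0) -> tuple[int, int]:
    """Return (triplets starting with Gly, complete triplets) for one reading frame."""
    # Remove alignment gap characters and normalise case; the alignments on
    # this page print unaligned residues in lowercase.
    residues = segment.replace("-", "").upper()[frame:]
    n_triplets = len(residues) // 3  # a trailing partial triplet is ignored
    n_gly = sum(1 for i in range(n_triplets) if residues[3 * i] == "G")
    return n_gly, n_triplets


# Query residues 688-742, copied from the pfam01391 alignment above.
query_688_742 = "GPLGRKGHKGHPGAAGHPGEQGQPGPEGSPGAKGYPGRQGFPGPVGDPGPKGSRG"

n_gly, n_triplets = gly_x_y_triplets(query_688_742)
print(f"{n_gly} of {n_triplets} triplets start with glycine")
```

For a segment whose repeat is out of phase with its first residue, the other two values of `frame` (1 and 2) can be tried in the same way.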
PHA03247 PHA03247
large tegument protein UL36; Provisional
245-686 8.55e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.54  E-value: 8.55e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  245 PQVGTLFPWDSGPAFALHPEPALLGLgnLTRTPATLGARPVSRAL-AVTLAPAMPTKPLRTVHPdvsehSSSQTPLSPAK 323
Cdd:PHA03247 2690 PTVGSLTSLADPPPPPPTPEPAPHAL--VSATPLPPGPAAARQASpALPAAPAPPAVPAGPATP-----GGPARPARPPT 2762
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  324 QSARKTPSPSSSASLANSTRVYRPAAAQPRQITATSPTKRSPTKPSVSPLSVTPMKSPHATQKTGVPSFTK--PVPPTQK 401
Cdd:PHA03247 2763 TAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSaqPTAPPPP 2842
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  402 PAPFTSYLAPSKASSPTvRPVQKtfmtprppvpspqplRPTTGLskkfTNPTVAKSKSKTTSWASKPVLARSSVPKTLQQ 481
Cdd:PHA03247 2843 PGPPPPSLPLGGSVAPG-GDVRR---------------RPPSRS----PAAKPAAPARPPVRRLARPAVSRSTESFALPP 2902
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  482 TVLSQSPVSYLGSQTLAPALPPLgvgNPRTMPPTRDSALTPAGSKKFTGRETSKKTRQKSSPRKPEPLSPGksaRDASPR 561
Cdd:PHA03247 2903 DQPERPPQPQAPPPPQPQPQPPP---PPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPG---RVAVPR 2976
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  562 DLTTKPsRPSTPALVLAPAYLLSSSPQPTSSSFPFFHL-LGPTPfpmlmgPPGSKGDCGLPGPPGLPGLPGSPGARGPRG 640
Cdd:PHA03247 2977 FRVPQP-APSREAPASSTPPLTGHSLSRVSSWASSLALhEETDP------PPVSLKQTLWPPDDTEDSDADSLFDSDSER 3049
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|....*.
gi 148699169  641 PPGPYGNPGPPGPPGAKGQKGDPGLSPGQAHDGAKGNMGLPGLSGN 686
Cdd:PHA03247 3050 SDLEALDPLPPEPHDPFAHEPDPATPEAGARESPSSQFGPPPLSAN 3095
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1339-1382 8.59e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.01  E-value: 8.59e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 148699169  1339 RGDRGEPGDPGYPGQEGVQGLRGEPGQQGQPGHPGPRGRPGPKG 1382
Cdd:pfam01391   12 PGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1424-1479 9.02e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.01  E-value: 9.02e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 148699169  1424 GTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQGPPGFKGE 1479
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
754-802 1.04e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.01  E-value: 1.04e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 148699169   754 GSDGERGLPGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGLPGPPG 802
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPG 49
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1156-1218 1.09e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 38.63  E-value: 1.09e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 148699169  1156 GPLGPPGEQGLIGQRGEPGlegdhgPVGPDGLKGDRGDPGPDGEHGEKGQEGLKGEDGSPGPP 1218
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPG------PPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
262-573 1.13e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 43.91  E-value: 1.13e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  262 HPEPALLGLGNLTRTPATLGARPVSRALAVTLAPAMPTKPlrtVHP-DVSEHSSSQTPLSPAKQSARKTpspsssaslan 340
Cdd:PTZ00449  546 GGKPGETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDP---KHPkDPEEPKKPKRPRSAQRPTRPKS----------- 611
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  341 strvyrPAAAQPRQITATSPTKRSPTKPSvSPLSVTPMKSPHATQKTGVPSFTKPvPPTQKPaPFtsylapskasSPTVR 420
Cdd:PTZ00449  612 ------PKLPELLDIPKSPKRPESPKSPK-RPPPPQRPSSPERPEGPKIIKSPKP-PKSPKP-PF----------DPKFK 672
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  421 pvqktfmtprppvpspqplrpttglSKKFTNPTVAKSKSKTTswasKPVLARSSVPKTLQQTVLSQSPVSYLGSQTLAPA 500
Cdd:PTZ00449  673 -------------------------EKFYDDYLDAAAKSKET----KTTVVLDESFESILKETLPETPGTPFTTPRPLPP 723
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 148699169  501 LPPLGVGNPRTmPPTRDSALTPAGSKKFTGREtSKKTRQKSSPrkPEPLSPGKSARDASPRDLTTKPSRPSTP 573
Cdd:PTZ00449  724 KLPRDEEFPFE-PIGDPDAEQPDDIEFFTPPE-EERTFFHETP--ADTPLPDILAEEFKEEDIHAETGEPDEA 792
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1385-1439 1.42e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 38.63  E-value: 1.42e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 148699169  1385 GEEGPKGKPGKAGPSGRRGTQGLQGLPGPRGVVGRQGPEGTAGSDGIPGRDGRPG 1439
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1349-1403 1.45e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 38.63  E-value: 1.45e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 148699169  1349 GYPGQEGVQGLRGEPGQQGQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRG 1403
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
PRK12678 PRK12678
transcription termination factor Rho; Provisional
1283-1463 1.49e-03

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 43.35  E-value: 1.49e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1283 RMGAQGEPGLAGYNGHKGITGPLGPPGPKGEKGDQGEDGKTEGPPGPPGDRGPVGDRGDRGEPGDPGYPGQEGVQglRGE 1362
Cdd:PRK12678   58 ARGGGAAAAAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRE--RGE 135
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1363 PGQQGQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGLQGlpgPRGVVGRQGPEGTAGSDGIPGRDGRPGYQG 1442
Cdd:PRK12678  136 AARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQ---AEAERGERGRREERGRDGDDRDRRDRREQG 212
                         170       180
                  ....*....|....*....|.
gi 148699169 1443 DQGNDGDPGPVGpaGRRGNPG 1463
Cdd:PRK12678  213 DRREERGRRDGG--DRRGRRR 231
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
1053-1520 1.50e-03

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 43.40  E-value: 1.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  1053 RGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAAGERGHSGAKGFLGIPGPSGPPGAKGLPGEPGSQGPQGPVGPPGEMG 1132
Cdd:pfam03157  217 QGQQGQQPERGQQGQQPGQGQQPGQGQQGQQPGQPQQLGQGQQGYYPISPQQPRQWQQSGQGQQGYYPTSLQQPGQGQSG 296
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  1133 PKgpPGAVGEPGLPGDSGMKGDLGPLGPPGeQGLIGQRGEPGLEGDHGPVGPDGLKGDRGDPGPDGEHGEKGQEGLKGED 1212
Cdd:pfam03157  297 YY--PTSQQQAGQLQQEQQLGQEQQDQQPG-QGRQGQQPGQGQQGQQPAQGQQPGQGQPGYYPTSPQQPGQGQPGYYPTS 373
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  1213 GSPGPPGITGVPGREGKPGKQGEKGQRGAKG---AKGHQGYLGEMGIPGEPGPPGTPGPKGSRGTLGPTGAPGRMGAQGE 1289
Cdd:pfam03157  374 QQQPQQGQQPEQGQQGQQQGQGQQGQQPGQGqqpGQGQPGYYPTSPQQSGQGQPGYYPTSPQQSGQGQQPGQGQQPGQEQ 453
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  1290 PGLAGYNGHkgitgplgppGPKGEKGDQGEDGKTEGPPGPPGDRGPVGDRGDRGEPGDPGYPGQEGVQGLRGEPGQ--QG 1367
Cdd:pfam03157  454 PGQGQQPGQ----------GQQGQQPGQPEQGQQPGQGQPGYYPTSPQQSGQGQQLGQWQQQGQGQPGYYPTSPLQpgQG 523
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  1368 QPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGLQGLPGPRGVVGRQGPEGTAGSDGIPGRDGRPGYQgdqgnd 1447
Cdd:pfam03157  524 QPGYYPTSPQQPGQGQQLGQLQQPTQGQQGQQSGQGQQGQQPGQGQQGQQPGQGQQGQQPGQGQQPGQGQPGYY------ 597
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 148699169  1448 gdPGPVGPAGRRGNPGVAGLPGAQGPPGFKGESGLPGQlgppGKRGTeggtgLPGNQGEPGSKGQPGDSGEMG 1520
Cdd:pfam03157  598 --PTSPQQSGQGQQPGQWQQPGQGQPGYYPTSSLQLGQ----GQQGY-----YPTSPQQPGQGQQPGQWQQSG 659
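
The three repeated motifs named in the family description (PGQGQQ, GYYPTS[P/L]QQ, GQQ) can be counted in any sequence with regular expressions. This is a hedged sketch: the names `GLUTENIN_MOTIFS` and `count_motifs` are hypothetical, the example string is synthetic rather than taken from the alignment, and occurrences are counted with overlap, so a GQQ inside PGQGQQ is also counted as a GQQ.

```python
import re

# Motifs as listed in the pfam03157 description; [P/L] becomes the class [PL].
GLUTENIN_MOTIFS = {
    "PGQGQQ": r"PGQGQQ",
    "GYYPTS[P/L]QQ": r"GYYPTS[PL]QQ",
    "GQQ": r"GQQ",
}


def count_motifs(sequence: str) -> dict[str, int]:
    """Count overlapping occurrences of each motif in an ungapped sequence."""
    seq = sequence.replace("-", "").upper()
    counts = {}
    for name, pattern in GLUTENIN_MOTIFS.items():
        # A zero-width lookahead lets matches overlap one another.
        counts[name] = sum(1 for _ in re.finditer(f"(?={pattern})", seq))
    return counts


# Synthetic example: one copy of each longer motif followed by a bare GQQ.
example = "PGQGQQGYYPTSLQQGQQ"
print(count_motifs(example))
```

Counting with overlap keeps the sketch simple. A non-overlapping tally would need a rule for which motif takes precedence, and the source does not state one.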
PHA03169 PHA03169
hypothetical protein; Provisional
1338-1481 1.52e-03

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 43.04  E-value: 1.52e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1338 DRGDRGEPGDPGyPGQEGVQGLRGEPGQQGQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGLQGLPgprgvv 1417
Cdd:PHA03169   98 ESVGSPTPSPSG-SAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPS------ 170
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 148699169 1418 GRQGPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGPAgrrgNPGVAGLPGAQGPPGFKGESG 1481
Cdd:PHA03169  171 HEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPD----EPGEPQSPTPQQAPSPNTQQA 230
PHA03169 PHA03169
hypothetical protein; Provisional
1313-1452 1.54e-03

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 43.04  E-value: 1.54e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1313 EKGDQGEDGKTEGPPGPPGDRGPVGDRGDRGEPGDPGYPGQEGVQGLRGEPGQQGQPGHPGPrGRPGPKGSKGEEGPKGK 1392
Cdd:PHA03169   85 EERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGP-HEPAPPESHNPSPNQQP 163
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 148699169 1393 PGKAGPSGRRGTQ---GLQGLPGPRGVVGRQGPEGTAGSDGIPGRDgRPGYQGDQGNDGDPGP 1452
Cdd:PHA03169  164 SSFLQPSHEDSPEepePPTSEPEPDSPGPPQSETPTSSPPPQSPPD-EPGEPQSPTPQQAPSP 225
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
751-802 1.57e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 38.24  E-value: 1.57e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 148699169   751 GLPGSDGERGLPGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGLPGPPG 802
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPG 52
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1141-1196 1.58e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 38.24  E-value: 1.58e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 148699169  1141 GEPGLPGDSGMKGDLGPLGPPGEQGLIGQRGEPGLEGDHGPVGPDGLKGDRGDPGP 1196
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
266-421 1.63e-03

Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 43.02  E-value: 1.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   266 ALLGLGNLTRTPATLGAR--PVSRALAVTLAPAMPTkpLRTVHPDVSEHSSSQTPLSPAKQSARKTPSPSSSASLANSTR 343
Cdd:pfam17823  227 ALAAVGNSSPAAGTVTAAvgTVTPAALATLAAAAGT--VASAAGTINMGDPHARRLSPAKHMPSDTMARNPAAPMGAQAQ 304
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   344 --VYRPAAAQPRQITATSPTKrSPTKPSVSPLSVTPMKSPHATQKTGVPSFTKPVPPTQKPAPFTSYLAPSKASSPTVRP 421
Cdd:pfam17823  305 gpIIQVSTDQPVHNTAGEPTP-SPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVLHTSMIPEVEATSPTTQP 383
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1376-1430 1.80e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 38.24  E-value: 1.80e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 148699169  1376 GRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGLQGLPGPRGVVGRQGPEGTAGSDG 1430
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
PHA03169 PHA03169
hypothetical protein; Provisional
1074-1235 1.86e-03

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 42.65  E-value: 1.86e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1074 RGRPGQPGQQGAAGErGHSGAKGFLGIPGPSGPPGAKGLPGEPGSqgpqgpvgPPGEMGPKGPPGAVGEPGLPGDsGMKG 1153
Cdd:PHA03169   81 HGEKEERGQGGPSGS-GSESVGSPTPSPSGSAEELASGLSPENTS--------GSSPESPASHSPPPSPPSHPGP-HEPA 150
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1154 DLGPLGPPGEQGLIGQRGEPGLEGDHGPVGPDGLKGDRGDPGPDGEHGEKGQEGLKG--EDGSPGPPGITGVPGREGKPG 1231
Cdd:PHA03169  151 PPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPpdEPGEPQSPTPQQAPSPNTQQA 230

                  ....
gi 148699169 1232 KQGE 1235
Cdd:PHA03169  231 VEHE 234
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1397-1453 1.91e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 38.24  E-value: 1.91e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 148699169  1397 GPSGRRGTQGLQGLPGPRGVVGRQGPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPV 1453
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1379-1433 2.49e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 37.86  E-value: 2.49e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 148699169  1379 GPKGSKGEEGPKGKPGKAGPSGRRGTQGLQGLPGPRGVVGRQGPEGTAGSDGIPG 1433
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
901-975 2.94e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 37.47  E-value: 2.94e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 148699169   901 GPPGDNGPEGMKGKPGARGLPGPPGQLGPegdegpmgppgvpglegqPGRKGFPGRPGLDGSKGEPGDPGRPGPV 975
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGP------------------PGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1496-1550 3.06e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 37.47  E-value: 3.06e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 148699169  1496 GGTGLPGNQGEPGSKGQPGDSGEMGFPGVAGLFGPKGPPGDIGFKGIQGPRGPPG 1550
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
124-206 3.15e-03

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 39.33  E-value: 3.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   124 KLQLGLQFLPGRTIIHLGPRQsvafdldVHDGRWHHLALELRGRTVTMVTaCGQHRVPVPLPSRRDsMLDPQGSFLLGKV 203
Cdd:pfam02210   29 RLVLRYDLGSGPESLLSSGKN-------LNDGQWHSVRVERNGNTLTLSV-DGQTVVSSLPPGESL-LLNLNGPLYLGGL 99

                   ...
gi 148699169   204 NPR 206
Cdd:pfam02210  100 PPL 102
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1343-1535 3.51e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.28  E-value: 3.51e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1343 GEPGDPGYPGQEGVQGlRGEPGQQGQPGHPG-PRGRPGPKGSkgeegpkgkPGKAGPSGRRGTQGLQGLPGPRGVVGRQG 1421
Cdd:PRK07764  590 PAPGAAGGEGPPAPAS-SGPPEEAARPAAPAaPAAPAAPAPA---------GAAAAPAEASAAPAPGVAAPEHHPKHVAV 659
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169 1422 PEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQGPPGFKGESGLPGQLGPPGKRGTEGGTGLP 1501
Cdd:PRK07764  660 PDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPV 739
                         170       180       190
                  ....*....|....*....|....*....|....
gi 148699169 1502 GNQGEPGSKGQPGDSGEMGfPGVAGLFGPKGPPG 1535
Cdd:PRK07764  740 PLPPEPDDPPDPAGAPAQP-PPPPAPAPAAAPAA 772
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
254-505 3.55e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.17  E-value: 3.55e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  254 DSGPAFALHPEPAllglgnlTRTPATLGARPVSRALAVTLAPAMPTKPLRTVHPDVSEHSSSQTPLSPAKQSARKTPSPS 333
Cdd:PRK12323  371 GAGPATAAAAPVA-------QPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARG 443
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  334 SSASLANSTRV-------YRPAAAQPRQITATSPTKRSPTKPSVSPLSVTPMKSPHATQKTGVPSFTkPVPPTQKPAPFT 406
Cdd:PRK12323  444 PGGAPAPAPAPaaapaaaARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPA-PAQPDAAPAGWV 522
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  407 SYLAPskasSPTVRPVQKTFMTPRPPVPSPQPLRPTTGLSKKF--TNPTVAKSKSKTTSWASKPVLARSSVPKTLQQTVL 484
Cdd:PRK12323  523 AESIP----DPATADPDDAFETLAPAPAAAPAPRAAAATEPVVapRPPRASASGLPDMFDGDWPALAARLPVRGLAQQLA 598
                         250       260
                  ....*....|....*....|...
gi 148699169  485 SQSPVSYLGSQT--LAPALPPLG 505
Cdd:PRK12323  599 RQSELAGVEGDTvrLRVPVPALA 621
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
647-834 3.56e-03

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 41.94  E-value: 3.56e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  647 NPGPPGPPGAKGQKGDPGLSPGQAHDGAKGNMGLPGLSGNPGPLGRKGHKGHPGAAGHPGEQGQPGPEGSPGAKGYPGRQ 726
Cdd:COG5164    35 STRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQNQGGTRPAGNTGGTTPAGDGGATGPPDDG 114
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  727 GFPGPVGDPGPKG--SRGYIGLPGLFG-LPGSDGERGLPGVPGKRGEMGRPGFPGDFGERGPPGLDG-----NPGEIGLP 798
Cdd:COG5164   115 GATGPPDDGGSTTppSGGSTTPPGDGGsTPPGPGSTGPGGSTTPPGDGGSTTPPGPGGSTTPPDDGGsttppNKGETGTD 194
                         170       180       190
                  ....*....|....*....|....*....|....*.
gi 148699169  799 GPPGVLGLIGDTGALGPIGYPGPKGMKGLMGGVGEP 834
Cdd:COG5164   195 IPTGGTPRQGPDGPVKKDDKNGKGNPPDDRGGKTGP 230
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1189-1244 3.80e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine; the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post-translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 37.09  E-value: 3.80e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 148699169  1189 GDRGDPGPDGEHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQGEKGQRGAKGA 1244
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1192-1246 6.09e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine; the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post-translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 36.70  E-value: 6.09e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 148699169  1192 GDPGPDGEHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQGEKGQRGAKGAKG 1246
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1382-1438 6.59e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine; the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post-translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 36.70  E-value: 6.59e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 148699169  1382 GSKGEEGPKGKPGKAGPSGRRGTQGLQGLPGPRGVVGRQGPEGTAGSDGIPGRDGRP 1438
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
261-403 6.85e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 41.30  E-value: 6.85e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169  261 LHPEPALLGLGNLTRT--PATLGARPVSRALAVTLAPAMPTKPLRTVHPDVSEHSSSQTPLSPAKQSArktpspsssasl 338
Cdd:PRK14971  350 LLVELTLIQLAQLTQKgdDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSA------------ 417
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 148699169  339 anstrvyRPAAAQPRQITaTSPTKRSPTKPSVSPLSVTPMKSPHATQKTGVPSFTKPVPPTQKPA 403
Cdd:PRK14971  418 -------TQPAGTPPTVS-VDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPI 474
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
673-724 6.86e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine; the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post-translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 36.70  E-value: 6.86e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 148699169   673 GAKGNMGLPGLSGNPGPLGRKGHKGHPGAAGHPGEQGQPGPEGSPGAKGYPG 724
Cdd:pfam01391    4 GPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1132-1182 7.95e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine; the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post-translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 36.32  E-value: 7.95e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 148699169  1132 GPKGPPGAVGEPGLPGDSGMKGDLGPLGPPGEQGLIGQRGEPGLEGDHGPV 1182
Cdd:pfam01391    7 GPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
271-613 8.05e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 41.10  E-value: 8.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   271 GNLTRTPATLGARPVSralAVTLAPAMPTkplrtvhpdvsehssSQTPLSPAKQSARKTPSPSSSASLANSTRVYR-PAA 349
Cdd:pfam17823   94 GTDLSEPATREGAADG---AASRALAAAA---------------SSSPSSAAQSLPAAIAALPSEAFSAPRAAACRaNAS 155
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   350 AQPRQITAT-------SPTKRSPTKPSVSPLSVTPMKSPHATQKTGVPSFTKPVPPTQKPAPFTSYLAPSKASS--PTVR 420
Cdd:pfam17823  156 AAPRAAIAAasaphaaSPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAavGNSS 235
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   421 PVQKTfmtprppvpspqpLRPTTGLSKKFTNPTVAKSKSKTTSWA-----SKPVLARSSVPKTLQQTVLSQSPVSYLGSQ 495
Cdd:pfam17823  236 PAAGT-------------VTAAVGTVTPAALATLAAAAGTVASAAgtinmGDPHARRLSPAKHMPSDTMARNPAAPMGAQ 302
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699169   496 TLAPalpplgvgnprTMPPTRDSALTPAgskkfTGRETSKKTRQKSSPRKPEPLSPGKSARDASPRDLTTKPSRPSTPAL 575
Cdd:pfam17823  303 AQGP-----------IIQVSTDQPVHNT-----AGEPTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVL 366
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....
gi 148699169   576 vlapayllssspqpTSSSFPFFHLLGPT--PFPML----MGPPG 613
Cdd:pfam17823  367 --------------HTSMIPEVEATSPTtqPSPLLptqgAAGPG 396
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1038-1086 8.19e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine; the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post-translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 36.32  E-value: 8.19e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 148699169  1038 RGLPGMRGAKGHRGPRGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAA 1086
Cdd:pfam01391    9 PGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1406-1462 8.77e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine; the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post-translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 36.32  E-value: 8.77e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 148699169  1406 GLQGLPGPRGVVGRQGPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGPAGRRGNP 1462
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1412-1468 9.13e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine; the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post-translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 36.32  E-value: 9.13e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 148699169  1412 GPRGVVGRQGPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLP 1468
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
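
The collagen hits above are described as G-X-Y repeats with glycine in the first position of every triplet. A minimal sketch of how that periodicity could be checked on an ungapped query segment follows. The function names gxy_triplets and glycine_fraction are illustrative only and are not part of any NCBI tool; the segment is the query row of the last alignment above (residues 1412-1468).

```python
def gxy_triplets(segment: str) -> list[str]:
    """Split an ungapped segment into consecutive three-residue triplets.

    Trailing residues that do not fill a complete triplet are ignored.
    """
    return [segment[i:i + 3] for i in range(0, len(segment) - 2, 3)]


def glycine_fraction(segment: str) -> float:
    """Return the fraction of triplets whose first residue is glycine (G)."""
    triplets = gxy_triplets(segment)
    if not triplets:
        return 0.0
    return sum(t[0] == "G" for t in triplets) / len(triplets)


# Query row of the pfam01391 alignment covering residues 1412-1468.
query = "GPRGVVGRQGPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLP"

if __name__ == "__main__":
    # Number of complete triplets and the share that start with glycine.
    print(len(gxy_triplets(query)), glycine_fraction(query))  # prints: 19 1.0
```

Shifting the reading frame by one residue (query[1:]) drops the fraction to 0.0 for this segment, which is a quick way to confirm that the glycines really sit at every third position rather than being merely abundant.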
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.