Conserved domains on [gi|2037036700|ref|XP_041415823|]

uncharacterized protein UHO2_03932 [Ustilago hordei]

List of domain hits

Name Accession Description Interval E-value
RT_LTR cd01647
RT_LTR: Reverse transcriptases (RTs) from retrotransposons and retroviruses which have long ...
573-749 5.26e-83

RT_LTR: Reverse transcriptases (RTs) from retrotransposons and retroviruses which have long terminal repeats (LTRs) in their DNA copies but not in their RNA template. RT catalyzes DNA replication from an RNA template, and is responsible for the replication of retroelements. An RT gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. RTs are present in a variety of mobile elements, including retrotransposons, retroviruses, group II introns, bacterial msDNAs, hepadnaviruses, and Caulimoviruses.


Pssm-ID: 238825  Cd Length: 177  Bit Score: 269.47  E-value: 5.26e-83
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700  573 GFIRPSKSPARSPVLFVPKKDGGLRLCVDYRGLNEITVKNRAPLPLIEEQLFLLRKARIYTKLDLRAAYNLIRIAKGDEW 652
Cdd:cd01647      1 GIIEPSSSPYASPVVVVKKKDGKLRLCVDYRKLNKVTIKDRYPLPTIDELLEELAGAKVFSKLDLRSGYHQIPLAEESRP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700  653 KTAFGTQLGLYEYLVMPFGLANAPAHFQSFINDIFRDIIGIYVVVYLDDFLIFSDTEEAHVKHVTEVLTRLRSNRLFAKL 732
Cdd:cd01647     81 KTAFRTPFGLYEYTRMPFGLKNAPATFQRLMNKILGDLLGDFVEVYLDDILVYSKTEEEHLEHLREVLERLREAGLKLNP 160
                          170
                   ....*....|....*..
gi 2037036700  733 SKCEFHTKTVEFLGYII 749
Cdd:cd01647    161 EKCEFGVPEVEFLGHIV 177
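
The bit score and E-value summarize how well the query matches the cd01647 model, but the alignment rows can also be inspected directly. The following is a minimal sketch, not part of the CD-Search output, that counts identical residues in one pair of aligned rows. The function name `count_identities` is hypothetical, and treating dashes and lowercase letters as unaligned columns is an assumption about this display format.

```python
def count_identities(query_row: str, model_row: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for two equal-length rows."""
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have the same length")
    identical = aligned = 0
    for q, m in zip(query_row, model_row):
        # Skip gap columns and lowercase residues (assumed to be unaligned).
        if q == "-" or m == "-" or q.islower() or m.islower():
            continue
        aligned += 1
        if q == m:
            identical += 1
    return identical, aligned


# Last row of the RT_LTR alignment: query residues 733-749, model 161-177.
query = "SKCEFHTKTVEFLGYII"
model = "EKCEFGVPEVEFLGHIV"
identical, aligned = count_identities(query, model)
print(f"{identical}/{aligned} identical columns")  # prints "10/17 identical columns"
```

The same function can be applied row by row to the longer alignment blocks and the counts summed to estimate identity over the whole interval.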
RNase_HI_RT_Ty3 cd09274
Ty3/Gypsy family of RNase HI in long terminal repeat retroelements; Ribonuclease H (RNase H) ...
846-966 2.24e-52

Ty3/Gypsy family of RNase HI in long terminal repeat retroelements; Ribonuclease H (RNase H) enzymes are divided into two major families, Type 1 and Type 2, based on amino acid sequence similarities and biochemical properties. RNase H is an endonuclease that cleaves the RNA strand of an RNA/DNA hybrid in a sequence non-specific manner in the presence of divalent cations. RNase H is widely present in various organisms, including bacteria, archaea and eukaryotes. RNase HI has also been observed as an adjunct domain to the reverse transcriptase gene in retroviruses, in long terminal repeat (LTR)-bearing retrotransposons and in non-LTR retrotransposons. RNase HI in LTR retrotransposons performs degradation of the original RNA template, generation of a polypurine tract (the primer for plus-strand DNA synthesis), and final removal of RNA primers from newly synthesized minus and plus strands. The catalytic residues for RNase H enzymatic activity, three aspartic acids and one glutamic acid residue (DEDD), are invariant across all RNase H domains. Phylogenetically, RNase HI of LTR retroelements is classified into five major families: Ty3/Gypsy, Ty1/Copia, Bel/Pao, DIRS1 and the vertebrate retroviruses. The Ty3/Gypsy family is widely distributed among the genomes of plants, fungi and animals. RNase H has been explored as an anti-HIV drug target because RNase H inactivation inhibits reverse transcription.


Pssm-ID: 260006 [Multi-domain]  Cd Length: 121  Bit Score: 179.61  E-value: 2.24e-52
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700  846 RLETDASDFAIAGVLKQEHE-GRWHPVAFYSRKMSSAEKNYEIHDKELLAVVACLTQWRHMLAGLPnqLVILTDHEALKY 924
Cdd:cd09274      1 ILETDASDYGIGAVLSQEDDdGKERPIAFFSRKLTPAERNYSTTEKELLAIVWALKKFRHYLLGRP--FTVYTDHKALKY 78
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 2037036700  925 FKSQRRITGRQARWAILLADFDFILQYRPGDKGGEPDALTRR 966
Cdd:cd09274     79 LLTQKDLNGRLARWLLLLSEFDFEIEYRPGKENVVADALSRL 120
Integrase_H2C2 pfam17921
Integrase zinc binding domain; This zinc binding domain is found in a wide variety of ...
1052-1111 2.02e-19

Integrase zinc binding domain; This zinc binding domain is found in a wide variety of integrase proteins.


Pssm-ID: 465569 [Multi-domain]  Cd Length: 58  Bit Score: 83.06  E-value: 2.02e-19
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700 1052 IPKHLRFMVMTQCHDGitAGHVGRDATIKAAQRHYWWPNMTAWIADYVASCPVCARYKAP 1111
Cdd:pfam17921    1 VPKSLRKEILKEAHDS--GGHLGIEKTLARLRRRYWWPGMRKDVKKYVKSCETCQRRKPS 58
CD_CSD cd00024
CHROMO (CHRomatin Organization MOdifier) domains and chromo shadow domains; Members of this ...
1431-1478 2.27e-17

CHROMO (CHRomatin Organization MOdifier) domains and chromo shadow domains; Members of this group are chromodomains or chromo shadow domains; these are SH3-fold-beta-barrel domains of the chromo-like superfamily. Chromodomains lack the first strand of the SH3-fold-beta-barrel; this first strand is altered by insertion in the chromo shadow domains. The chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in binding the proteins in which it is found to methylated histone tails and possibly RNA. Chromodomain-containing proteins include: i) those having an N-terminal chromodomain followed by a related chromo shadow domain, such as Drosophila and human heterochromatin protein Su(var)205 (HP1), and mammalian modifier 1 and 2; ii) those having a single chromodomain, such as Drosophila protein Polycomb (Pc), mammalian modifier 3, human Mi-2 autoantigen, and several yeast and Caenorhabditis elegans proteins of unknown function; iii) those having paired tandem chromodomains, such as mammalian DNA-binding/helicase proteins CHD-1 to CHD-4 and yeast protein CHD1; and iv) elongation factor eEF3, a member of the ATP-binding cassette (ABC) family of proteins, which serves an essential function in the translation cycle of fungi. eEF3 is a soluble factor lacking a transmembrane domain and having two ABC domains arranged in tandem, with a unique chromodomain inserted within the ABC2 domain.


Pssm-ID: 349274 [Multi-domain]  Cd Length: 50  Bit Score: 77.13  E-value: 2.27e-17
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 2037036700 1431 FEVEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENL-NCPDLIQEYE 1478
Cdd:cd00024      1 YEVEKILDHRVRKGKLEYLVKWKGYPPEENTWEPEENLtNAPELIKEYE 49
retropepsin_like cd00303
Retropepsins; pepsin-like aspartate proteases; The family includes pepsin-like aspartate ...
314-404 1.74e-11

Retropepsins; pepsin-like aspartate proteases; The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements, as well as eukaryotic DNA-damage-inducible proteins (DDIs) and bacterial aspartate peptidases. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-terminal lobes, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule, and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein, which contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.


Pssm-ID: 133136  Cd Length: 92  Bit Score: 61.97  E-value: 1.74e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700  314 LSGMVQDHPARILADTGAGLSIVSDSFISKYQIPTKPIKT-RSIHGVTGHQLSINSSASmQVSIGTHNL-GVVEASVADT 391
Cdd:cd00303      1 LKGKINGVPVRALVDSGASVNFISESLAKKLGLPPRLLPTpLKVKGANGSSVKTLGVIL-PVTIGIGGKtFTVDFYVLDL 79
                           90
                   ....*....|...
gi 2037036700  392 ADYDLILGFTELR 404
Cdd:cd00303     80 LSYDVILGRPWLE 92
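
The description above places the active-site aspartate within an Asp-Thr/Ser-Gly motif. As a minimal sketch (the function name `find_motif` and the regular expression `D[TS]G` are illustrative choices, not part of the CD-Search output), the query rows of this alignment can be scanned for that motif and the match mapped back to residue numbers:

```python
import re

# Query rows of the retropepsin_like alignment; the first row starts at residue 314.
QUERY_ROWS = [
    "LSGMVQDHPARILADTGAGLSIVSDSFISKYQIPTKPIKT-RSIHGVTGHQLSINSSASmQVSIGTHNL-GVVEASVADT",
    "ADYDLILGFTELR",
]
QUERY_START = 314


def find_motif(rows, start, pattern=r"D[TS]G"):
    """Return (residue_number, matched_text) for each motif occurrence."""
    # Drop gap characters and normalize case so string offsets equal residue offsets.
    sequence = "".join(rows).replace("-", "").upper()
    return [(start + m.start(), m.group()) for m in re.finditer(pattern, sequence)]


matches = find_motif(QUERY_ROWS, QUERY_START)
print(matches)  # prints [(328, 'DTG')]
```

The single match, DTG at query residues 328-330, sits in the same alignment columns as DSG at positions 15-17 of the cd00303 row. A motif match alone does not demonstrate catalytic activity.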
rve pfam00665
Integrase core domain; Integrase mediates integration of a DNA copy of the viral genome into ...
1135-1229 5.95e-11

Integrase core domain; Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc-binding domain (pfam02022). This domain is the central catalytic domain. The carboxyl-terminal domain is a non-specific DNA-binding domain (pfam00552). The catalytic domain acts as an endonuclease when two nucleotides are removed from the 3' ends of the blunt-ended viral DNA made by reverse transcription. This domain also catalyzes the DNA strand transfer reaction of the 3' ends of the viral DNA to the 5' ends of the integration site.


Pssm-ID: 459897 [Multi-domain]  Cd Length: 98  Bit Score: 60.41  E-value: 5.95e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700 1135 DFIEGlpPSKKYDSKTYdsILVIVDRLTKFAILAPTHKTVTAKQT-AVLLYghMVRLFGY-PDHMVSDRGRQFISGAWKA 1212
Cdd:pfam00665    8 DFTYI--RIPGGGGKLY--LLVIVDDFSREILAWALSSEMDAELVlDALER--AIAFRGGvPLIIHSDNGSEYTSKAFRE 81
                           90
                   ....*....|....*..
gi 2037036700 1213 FAEQMGVKHSLSTAYHP 1229
Cdd:pfam00665   82 FLKDLGIKPSFSRPGNP 98
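
The hit list is ordered by E-value, which obscures where each domain sits along the protein. A minimal sketch, assuming only the intervals and E-values shown in the list (the `Hit` type and `by_position` helper are hypothetical names), reorders the six hits from N- to C-terminus:

```python
from typing import NamedTuple


class Hit(NamedTuple):
    name: str
    start: int
    end: int
    evalue: float


# Intervals and E-values copied from the domain hit list.
HITS = [
    Hit("RT_LTR", 573, 749, 5.26e-83),
    Hit("RNase_HI_RT_Ty3", 846, 966, 2.24e-52),
    Hit("Integrase_H2C2", 1052, 1111, 2.02e-19),
    Hit("CD_CSD", 1431, 1478, 2.27e-17),
    Hit("retropepsin_like", 314, 404, 1.74e-11),
    Hit("rve", 1135, 1229, 5.95e-11),
]


def by_position(hits):
    """Order hits by their start coordinate on the query."""
    return sorted(hits, key=lambda h: h.start)


for hit in by_position(HITS):
    print(f"{hit.start:>5}-{hit.end:<5} {hit.name}")
```

Read in this order, the hits run protease, reverse transcriptase, RNase H, integrase (zinc-binding and core domains) and chromodomain, which agrees with the POL polyprotein composition given in the retropepsin description.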
 
RVT_1 pfam00078
Reverse transcriptase (RNA-dependent DNA polymerase); A reverse transcriptase gene is usually ...
589-749 5.20e-42

Reverse transcriptase (RNA-dependent DNA polymerase); A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, retroviruses, group II introns, bacterial msDNAs, hepadnaviruses, and caulimoviruses.


Pssm-ID: 395031 [Multi-domain]  Cd Length: 189  Bit Score: 152.46  E-value: 5.20e-42
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700  589 VPKKD-GGLRLC----VDYRGLNEITVK-------NRAPLPLIEEQLFLLRKARIYTKLDLRAAYNLIRIAKGDEWKTAF 656
Cdd:pfam00078    1 IPKKGkGKYRPIsllsIDYKALNKIIVKrlkpenlDSPPQPGFRPGLAKLKKAKWFLKLDLKKAFDQVPLDELDRKLTAF 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700  657 GT-----------QLGLYEYLVMPFGLANAPAHFQSFINDIFRDI---IGIYVVVYLDDFLIFSDTEEAHVKHVTEVLTR 722
Cdd:pfam00078   81 TTppininwngelSGGRYEWKGLPQGLVLSPALFQLFMNELLRPLrkrAGLTLVRYADDILIFSKSEEEHQEALEEVLEW 160
                          170       180
                   ....*....|....*....|....*....
gi 2037036700  723 LRSNRLFAKLSKCEF--HTKTVEFLGYII 749
Cdd:pfam00078  161 LKESGLKINPEKTQFflKSKEVKYLGVTL 189
RT_RNaseH pfam17917
RNase H-like domain found in reverse transcriptase; DNA polymerase and ribonuclease H (RNase H) ...
840-944 1.67e-41

RNase H-like domain found in reverse transcriptase; DNA polymerase and ribonuclease H (RNase H) activities allow reverse transcriptases to convert the single-stranded retroviral RNA genome into double-stranded DNA, which is integrated into the host chromosome during infection. This entry represents the RNase H like domain.


Pssm-ID: 465565  Cd Length: 104  Bit Score: 147.66  E-value: 1.67e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700  840 DYHLPTRLETDASDFAIAGVLKQ-EHEGRWHPVAFYSRKMSSAEKNYEIHDKELLAVVACLTQWRHMLAGlpNQLVILTD 918
Cdd:pfam17917    1 DPSKPFILETDASDYGIGAVLSQkDEDGKERPIAYASRKLTPAERNYSTTEKELLAIVWALKKFRHYLLG--RKFTVYTD 78
                           90       100
                   ....*....|....*....|....*.
gi 2037036700  919 HEALKYFKSQRRITGRQARWAILLAD 944
Cdd:pfam17917   79 HKPLKYLFTPKELNGRLARWALFLQE 104
Chromo pfam00385
Chromo (CHRomatin Organization MOdifier) domain
1431-1478 1.44e-12

Chromo (CHRomatin Organization MOdifier) domain


Pssm-ID: 459793 [Multi-domain]  Cd Length: 52  Bit Score: 63.37  E-value: 1.44e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 2037036700 1431 FEVEALIDKR-SHNGTTEYKVLWRGYSEEAASWEPVENL-NCPDLIQEYE 1478
Cdd:pfam00385    1 YEVERILDHRkDKGGKEEYLVKWKGYPYDENTWEPEENLsKCPELIEEFK 50
CHROMO smart00298
Chromatin organization modifier domain;
1430-1478 1.34e-09

Chromatin organization modifier domain;


Pssm-ID: 214605 [Multi-domain]  Cd Length: 55  Bit Score: 55.30  E-value: 1.34e-09
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|.
gi 2037036700  1430 DFEVEALIDKRSH-NGTTEYKVLWRGYSEEAASWEPVENL-NCPDLIQEYE 1478
Cdd:smart00298    1 EYEVEKILDHRWKkKGELEYLVKWKGYSYSEDTWEPEENLlNCSKKLDNYK 51
transpos_IS481 NF033577
IS481 family transposase
1145-1254 3.94e-09

IS481 family transposase


Pssm-ID: 468094 [Multi-domain]  Cd Length: 283  Bit Score: 59.53  E-value: 3.94e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700 1145 KYDSKTYdsILVIVDRLTKFAILA--PTHKTVTAKQTAVLLYghmvRLFGYPDHMV-SDRGRQFIS--GAWKAFAEQMGV 1219
Cdd:NF033577   142 PDVGRLY--LHTAIDDHSRFAYAElyPDETAETAADFLRRAF----AEHGIPIRRVlTDNGSEFRSraHGFELALAELGI 215
                           90       100       110
                   ....*....|....*....|....*....|....*...
gi 2037036700 1220 KHSLSTAYHPQTDGQTERVNQVIEQ---YLRMYCNYEQ 1254
Cdd:NF033577   216 EHRRTRPYHPQTNGKVERFHRTLKDefaYARPYESLAE 253
gag-asp_proteas pfam13975
gag-polyprotein putative aspartyl protease; This family of putative aspartyl proteases is ...
314-405 8.27e-07

gag-polyprotein putative aspartyl protease; This family of putative aspartyl proteases is found predominantly in retroviral proteins.


Pssm-ID: 464060  Cd Length: 92  Bit Score: 48.34  E-value: 8.27e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700  314 LSGMVQDHPARILADTGAGLSIVSDSFISKYQIPTKP-IKTRSIHG----VTGHQLSINSsasmqVSIGTHNLGVVEASV 388
Cdd:pfam13975    1 VDVTINGRPVRFLVDTGASVTVISEALAERLGLDRLVdAYPVTVRTangtVRAARVRLDS-----VKIGGIELRNVPAVV 75
                           90
                   ....*....|....*..
gi 2037036700  389 ADTADYDLILGFTELRR 405
Cdd:pfam13975   76 LPGDLDDVLLGMDFLKR 92
Tra5 COG2801
Transposase InsO and inactivated derivatives [Mobilome: prophages, transposons];
1154-1280 2.93e-04

Transposase InsO and inactivated derivatives [Mobilome: prophages, transposons];


Pssm-ID: 442053 [Multi-domain]  Cd Length: 309  Bit Score: 44.76  E-value: 2.93e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700 1154 ILVIVDRLTKFAI---LAPTHKTVTAKQT---AVLLYGHMVRLFgypdhMVSDRGRQFISGAWKAFAEQMGVKHSLSTAY 1227
Cdd:COG2801    168 LAAVIDLFSREIVgwsVSDSMDAELVVDAlemAIERRGPPKPLI-----LHSDNGSQYTSKAYQELLKKLGITQSMSRPG 242
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700 1228 HPQTDGQTERVNQVIEQ---YLRMYCNYEQndwANlLDTAAFV--YNNT-VHNSIG-VSP 1280
Cdd:COG2801    243 NPQDNAFIESFFGTLKYellYRRRFESLEE---AR-EAIEEYIefYNHErPHSSLGyLTP 298
transpos_IS30 NF033563
IS30 family transposase;
1149-1246 1.27e-03

IS30 family transposase;


Pssm-ID: 468088 [Multi-domain]  Cd Length: 267  Bit Score: 42.59  E-value: 1.27e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700 1149 KTYDSILVIVDRLTKFAILA--PTHKTVTAKQTAVLLyghmvrLFGYPDHMV----SDRGRQFisGAWKAFAEQMGVKHS 1222
Cdd:NF033563   145 KHKSALLTLVERKSRFVILVklPDKTAESVNKALIKL------LKPLPKHLRksitADNGKEF--ARHSEIEEALGIDVY 216
                           90       100
                   ....*....|....*....|....
gi 2037036700 1223 LSTAYHPQTDGQTERVNQVIEQYL 1246
Cdd:NF033563   217 FADPYSPWQRGTNENTNGLLRQYL 240
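
Several hits in this document cover the same residues, for example RT_LTR and RVT_1, or rve and the transposase models. The following sketch shows one way to reduce such a list to the best-scoring hit per region. It uses a simple greedy rule (lowest E-value first, skip anything that overlaps a kept hit), which is an assumption made for illustration and not a description of how CD-Search itself selects hits; the function names are hypothetical.

```python
# (name, start, end, E-value) for every distinct hit reported for this protein.
HITS = [
    ("RT_LTR", 573, 749, 5.26e-83),
    ("RNase_HI_RT_Ty3", 846, 966, 2.24e-52),
    ("RVT_1", 589, 749, 5.20e-42),
    ("RT_RNaseH", 840, 944, 1.67e-41),
    ("Integrase_H2C2", 1052, 1111, 2.02e-19),
    ("CD_CSD", 1431, 1478, 2.27e-17),
    ("Chromo", 1431, 1478, 1.44e-12),
    ("retropepsin_like", 314, 404, 1.74e-11),
    ("rve", 1135, 1229, 5.95e-11),
    ("CHROMO", 1430, 1478, 1.34e-09),
    ("transpos_IS481", 1145, 1254, 3.94e-09),
    ("gag-asp_proteas", 314, 405, 8.27e-07),
    ("Tra5", 1154, 1280, 2.93e-04),
    ("transpos_IS30", 1149, 1246, 1.27e-03),
]


def overlaps(a, b):
    """True if the two inclusive residue intervals share at least one position."""
    return a[1] <= b[2] and b[1] <= a[2]


def best_non_overlapping(hits):
    """Greedily keep the lowest-E-value hit for each region of the query."""
    kept = []
    for hit in sorted(hits, key=lambda h: h[3]):
        if not any(overlaps(hit, k) for k in kept):
            kept.append(hit)
    return kept


for name, start, end, evalue in best_non_overlapping(HITS):
    print(f"{name:<18}{start:>5}-{end:<5}{evalue:.2e}")
```

With these inputs the rule keeps six hits: RT_LTR, RNase_HI_RT_Ty3, Integrase_H2C2, CD_CSD, retropepsin_like and rve.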
 
                           90       100
                   ....*....|....*....|....*.
gi 2037036700  919 HEALKYFKSQRRITGRQARWAILLAD 944
Cdd:pfam17917   79 HKPLKYLFTPKELNGRLARWALFLQE 104
RT_RNaseH_2 pfam17919
RNase H-like domain found in reverse transcriptase;
817-908 1.00e-38

RNase H-like domain found in reverse transcriptase;


Pssm-ID: 465567 [Multi-domain]  Cd Length: 100  Bit Score: 139.56  E-value: 1.00e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700  817 EEAQQAFHKLIQAFTSAGVLQHFDYHLPTRLETDASDFAIAGVLKQEHE-GRWHPVAFYSRKMSSAEKNYEIHDKELLAV 895
Cdd:pfam17919    3 EECQKAFEKLKQALTSAPVLAHPDPDKPFILETDASDYGIGAVLSQEDDdGGERPIAYASRKLSPAERNYSTTEKELLAI 82
                           90
                   ....*....|...
gi 2037036700  896 VACLTQWRHMLAG 908
Cdd:pfam17919   83 VFALKKFRHYLLG 95
RT_ZFREV_like cd03715
RT_ZFREV_like: A subfamily of reverse transcriptases (RTs) found in sequences similar to the ...
566-749 2.63e-20

RT_ZFREV_like: A subfamily of reverse transcriptases (RTs) found in sequences similar to the intact endogenous retrovirus ZFERV from zebrafish and to Moloney murine leukemia virus RT. An RT gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. RTs occur in a variety of mobile elements, including retrotransposons, retroviruses, group II introns, bacterial msDNAs, hepadnaviruses, and caulimoviruses. These elements can be divided into two major groups. One group contains retroviruses and DNA viruses whose propagation involves an RNA intermediate. They are grouped together with transposable elements containing long terminal repeats (LTRs). The other group, also called poly(A)-type retrotransposons, contains fungal mitochondrial introns and transposable elements that lack LTRs. Phylogenetic analysis suggests that ZFERV belongs to a distinct group of retroviruses.


Pssm-ID: 239685 [Multi-domain]  Cd Length: 210  Bit Score: 90.87  E-value: 2.63e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700  566 LDENLEKGFIRPSKSPARSPVLFVPKKDGG-LRLCVDYRGLNEITVKNRAPLPLIEEQLFLL-RKARIYTKLDLRAAYNL 643
Cdd:cd03715     21 IQELLEAGILVPCQSPWNTPILPVKKPGGNdYRMVQDLRLVNQAVLPIHPAVPNPYTLLSLLpPKHQWYTVLDLANAFFS 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700  644 IRIAKGDEWKTAF---GTQlglYEYLVMPFGLANAPAHFQsfiNDIFRDI-------IGIYVVVYLDDFLIFSDTEEAHV 713
Cdd:cd03715    101 LPLAPDSQPLFAFeweGQQ---YTFTRLPQGFKNSPTLFH---EALARDLapfplehEGTILLQYVDDLLLAADSEEDCL 174
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 2037036700  714 KHVTEVLTRLRSNRLFAKLSKCEFHTKTVEFLGYII 749
Cdd:cd03715    175 KGTDALLTHLGELGYKVSPKKAQICRAEVKFLGVVW 210
Integrase_H2C2 pfam17921
Integrase zinc binding domain; This zinc binding domain is found in a wide variety of ...
1052-1111 2.02e-19

Integrase zinc binding domain; This zinc binding domain is found in a wide variety of integrase proteins.


Pssm-ID: 465569 [Multi-domain]  Cd Length: 58  Bit Score: 83.06  E-value: 2.02e-19
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700 1052 IPKHLRFMVMTQCHDGitAGHVGRDATIKAAQRHYWWPNMTAWIADYVASCPVCARYKAP 1111
Cdd:pfam17921    1 VPKSLRKEILKEAHDS--GGHLGIEKTLARLRRRYWWPGMRKDVKKYVKSCETCQRRKPS 58
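
The alignment blocks pair the query row (gi) with the Cdd row, using "-" for gaps. A rough percent identity over the aligned columns can be computed as sketched below. The `percent_identity` helper is a hypothetical illustration, not the scoring CD-Search itself uses: the reported bit score and E-value come from a position-specific scoring matrix and cannot be derived from identity alone. Lowercase query letters are uppercased before comparison, on the assumption that they mark residues not aligned to the domain model.

```python
def percent_identity(query, subject):
    """Percent of gap-free alignment columns where both rows carry the same residue."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    aligned = identical = 0
    for q, s in zip(query.upper(), subject.upper()):
        if q == "-" or s == "-":
            continue  # skip columns with a gap in either row
        aligned += 1
        if q == s:
            identical += 1
    return 100.0 * identical / aligned if aligned else 0.0


# Rows of the pfam17921 (Integrase_H2C2) alignment, copied from this page.
query = "IPKHLRFMVMTQCHDGitAGHVGRDATIKAAQRHYWWPNMTAWIADYVASCPVCARYKAP"
subject = "VPKSLRKEILKEAHDS--GGHLGIEKTLARLRRRYWWPGMRKDVKKYVKSCETCQRRKPS"
print(f"{percent_identity(query, subject):.1f}% identity over aligned columns")
```
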
RT_Rtv cd01645
RT_Rtv: Reverse transcriptases (RTs) from retroviruses (Rtvs). RTs catalyze the conversion of ...
547-723 5.02e-19

RT_Rtv: Reverse transcriptases (RTs) from retroviruses (Rtvs). RTs catalyze the conversion of single-stranded RNA into double-stranded viral DNA for integration into host chromosomes. Proteins in this subfamily are encoded by elements that contain long terminal repeats (LTRs) and are multifunctional enzymes with RNA-directed DNA polymerase, DNA-directed DNA polymerase, and ribonuclease H (RNase H) activities. The viral RNA genome enters the cytoplasm as part of a nucleoprotein complex, and reverse transcription in the cytoplasm generates a linear DNA duplex via an intricate series of steps. This duplex DNA is colinear with its RNA template, but contains terminal duplications known as LTRs that are not present in viral RNA. It has been proposed that two specialized template switches, known as strand-transfer reactions or "jumps", are required to generate the LTRs.


Pssm-ID: 238823 [Multi-domain]  Cd Length: 213  Bit Score: 87.34  E-value: 5.02e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700  547 PQGPLylkgPKEMSE-LRRYLDENLEKGFIRPSKSPARSPVLFVPKKDGGLRLCVDYRGLNEITVknraplPLIEEQLFL 625
Cdd:cd01645      5 KQWPL----TEEKLEaLTELVTEQLKEGHIEPSTSPWNTPVFVIKKKSGKWRLLHDLRAVNAQTQ------DMGALQPGL 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700  626 -----LRKARIYTKLDLRAAYNLIRIAKGDEWKTAF----------GTQlglYEYLVMPFGLANAPAHFQSFINDIFRDI 690
Cdd:cd01645     75 phpaaLPKGWPLIVLDLKDCFFSIPLHPDDRERFAFtvpsinnkgpAKR---YQWKVLPQGMKNSPTICQSFVAQALEPF 151
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 2037036700  691 IG----IYVVVYLDDFLIFSDTEEAHVKHVTEVLTRL 723
Cdd:cd01645    152 RKqypdIVIYHYMDDILIASDLEGQLREIYEELRQTL 188
CD_CSD cd00024
CHROMO (CHRomatin Organization MOdifier) domains and chromo shadow domains; Members of this ...
1431-1478 2.27e-17

CHROMO (CHRomatin Organization MOdifier) domains and chromo shadow domains; Members of this group are chromodomains or chromo shadow domains; these are SH3-fold-beta-barrel domains of the chromo-like superfamily. Chromodomains lack the first strand of the SH3-fold-beta-barrel; this first strand is altered by insertion in the chromo shadow domains. The chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in binding the proteins in which it is found to methylated histone tails and possibly RNA. Chromodomain-containing proteins include: i) those having an N-terminal chromodomain followed by a related chromo shadow domain, such as Drosophila and human heterochromatin protein Su(var)205 (HP1), and mammalian modifier 1 and 2; ii) those having a single chromodomain, such as Drosophila protein Polycomb (Pc), mammalian modifier 3, human Mi-2 autoantigen, and several yeast and Caenorhabditis elegans proteins of unknown function; iii) those having paired tandem chromodomains, such as mammalian DNA-binding/helicase proteins CHD-1 to CHD-4 and yeast protein CHD1; and iv) elongation factor eEF3, a member of the ATP-binding cassette (ABC) family of proteins, that serves an essential function in the translation cycle of fungi. eEF3 is a soluble factor lacking a transmembrane domain and having two ABC domains arranged in tandem, with a unique chromodomain inserted within the ABC2 domain.


Pssm-ID: 349274 [Multi-domain]  Cd Length: 50  Bit Score: 77.13  E-value: 2.27e-17
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 2037036700 1431 FEVEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENL-NCPDLIQEYE 1478
Cdd:cd00024      1 YEVEKILDHRVRKGKLEYLVKWKGYPPEENTWEPEENLtNAPELIKEYE 49
CD_HP1a_insect cd18653
chromodomain of insect HP1a; CHRomatin Organization Modifier (chromo) domain of insect HP1a. ...
1430-1479 8.39e-15

chromodomain of insect HP1a; CHRomatin Organization Modifier (chromo) domain of insect HP1a. HP1a is a member of the heterochromatin protein family, and is enriched in the heterochromatin and associated with centromeres. HP1 has diverse functions in heterochromatin formation and impacts both gene expression and gene silencing. HP1 has two conserved protein-protein interaction domains, a single N-terminal chromodomain (CD) which can bind to histone proteins via methylated lysine residues, and a related C-terminal chromo shadow domain (CSD) which is responsible for the homodimerization and interaction with a number of chromatin-associated non-histone proteins; a flexible hinge region separates the CD and CSD and may bind nucleic acid. HP1 is a highly conserved non-histone chromosomal protein that is evolutionarily conserved from fission yeast to plants and animals. In Drosophila, there are at least five HP1 family proteins; this subgroup includes the CD of Drosophila melanogaster HP1a.


Pssm-ID: 349300  Cd Length: 50  Bit Score: 69.68  E-value: 8.39e-15
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 2037036700 1430 DFEVEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENLNCPDLIQEYEV 1479
Cdd:cd18653      1 EYAVEKICDRRVRKGKVEYYLKWKGYPETENTWEPEENLDCQDLIQQYEA 50
CD_HP1_like cd18631
chromodomain of heterochromatin protein 1 proteins, including HP1alpha, HP1beta, and HP1gamma; ...
1430-1478 3.24e-14

chromodomain of heterochromatin protein 1 proteins, including HP1alpha, HP1beta, and HP1gamma; CHRomatin Organization Modifier (chromo) domain of mammalian HP1alpha (Cbx5), HP1beta (Cbx1), HP1gamma (Cbx3), and similar proteins. HP1 has diverse functions in heterochromatin formation and impacts both gene expression and gene silencing. HP1 has two conserved protein-protein interaction domains, a single N-terminal chromodomain (CD) which can bind to histone proteins via methylated lysine residues, and a related C-terminal chromo shadow domain (CSD) which is responsible for the homodimerization and interaction with a number of chromatin-associated non-histone proteins; a flexible hinge region separates the CD and CSD and may bind nucleic acid. HP1 is a highly conserved non-histone chromosomal protein that is evolutionarily conserved from fission yeast to plants and animals. There are three human homologs of HP1 proteins: HP1alpha (also known as Cbx5), HP1beta (also known as Cbx1), and HP1gamma (also known as Cbx3).


Pssm-ID: 349281  Cd Length: 50  Bit Score: 68.23  E-value: 3.24e-14
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 2037036700 1430 DFEVEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENLNCPDLIQEYE 1478
Cdd:cd18631      1 EYVVEKVLDRRVVKGKVEYLLKWKGYPDEDNTWEPEENLDCPDLIAEFE 49
CD_HP1beta_Cbx1 cd18650
chromodomain of heterochromatin protein 1 homolog beta; CHRomatin Organization Modifier ...
1430-1477 6.87e-13

chromodomain of heterochromatin protein 1 homolog beta; CHRomatin Organization Modifier (chromo) domain of heterochromatin protein 1 homolog beta (also known as HP1beta, CBX1, and chromobox 1), and related proteins. HP1beta is a highly conserved non-histone protein, which is a member of the heterochromatin protein family, and is enriched in the heterochromatin and associated with centromeres. HP1 has two conserved protein-protein interaction domains, a single N-terminal chromodomain (CD) which can bind to histone proteins via methylated lysine residues, and a related C-terminal chromo shadow domain (CSD) which is responsible for the homodimerization and interaction with a number of chromatin-associated non-histone proteins; a flexible hinge region separates the CD and CSD and may bind nucleic acid. HP1 is a highly conserved non-histone chromosomal protein that is evolutionarily conserved from fission yeast to plants and animals. There are three human homologs of HP1 proteins: HP1alpha (also known as Cbx5), HP1beta, and HP1gamma (also known as Cbx3).


Pssm-ID: 349297  Cd Length: 50  Bit Score: 64.58  E-value: 6.87e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 2037036700 1430 DFEVEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENLNCPDLIQEY 1477
Cdd:cd18650      1 EYVVEKVLDRRVVKGKVEYLLKWKGFSDEDNTWEPEENLDCPDLIAEF 48
Chromo pfam00385
Chromo (CHRomatin Organization MOdifier) domain;
1431-1478 1.44e-12

Chromo (CHRomatin Organization MOdifier) domain;


Pssm-ID: 459793 [Multi-domain]  Cd Length: 52  Bit Score: 63.37  E-value: 1.44e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 2037036700 1431 FEVEALIDKR-SHNGTTEYKVLWRGYSEEAASWEPVENL-NCPDLIQEYE 1478
Cdd:pfam00385    1 YEVERILDHRkDKGGKEEYLVKWKGYPYDENTWEPEENLsKCPELIEEFK 50
retropepsin_like cd00303
Retropepsins; pepsin-like aspartate proteases; The family includes pepsin-like aspartate ...
314-404 1.74e-11

Retropepsins; pepsin-like aspartate proteases; The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements, as well as eukaryotic DNA-damage-inducible proteins (DDIs), and bacterial aspartate peptidases. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-terminal lobes, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.


Pssm-ID: 133136  Cd Length: 92  Bit Score: 61.97  E-value: 1.74e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700  314 LSGMVQDHPARILADTGAGLSIVSDSFISKYQIPTKPIKT-RSIHGVTGHQLSINSSASmQVSIGTHNL-GVVEASVADT 391
Cdd:cd00303      1 LKGKINGVPVRALVDSGASVNFISESLAKKLGLPPRLLPTpLKVKGANGSSVKTLGVIL-PVTIGIGGKtFTVDFYVLDL 79
                           90
                   ....*....|...
gi 2037036700  392 ADYDLILGFTELR 404
Cdd:cd00303     80 LSYDVILGRPWLE 92
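
The retropepsin description above places the active site aspartate within an Asp-Thr/Ser-Gly motif. As a sketch, the query row of this alignment can be scanned for that motif and the match mapped back to protein coordinates. The `find_motif` helper and the regular expression `D[TS]G` are illustrative assumptions. A pattern match alone locates a candidate site and is not evidence of catalytic activity.

```python
import re


def find_motif(aligned_row, start, pattern=r"D[TS]G"):
    """Return (protein_position, matched_text) for each motif match in an alignment row.

    aligned_row: one query row copied from an alignment block (gaps as "-").
    start: protein coordinate of the first residue in that row.
    """
    # Drop gap characters so that string offsets equal residue offsets,
    # and uppercase the lowercase residues shown in the alignment.
    ungapped = aligned_row.replace("-", "").upper()
    return [(start + m.start(), m.group()) for m in re.finditer(pattern, ungapped)]


# First query row of the cd00303 alignment, which starts at residue 314.
row = "LSGMVQDHPARILADTGAGLSIVSDSFISKYQIPTKPIKT-RSIHGVTGHQLSINSSASmQVSIGTHNL-GVVEASVADT"
print(find_motif(row, 314))  # -> [(328, 'DTG')]
```
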
CD_HP1alpha_Cbx5 cd18651
chromodomain of heterochromatin protein 1 homolog alpha; CHRomatin Organization Modifier ...
1430-1477 5.73e-11

chromodomain of heterochromatin protein 1 homolog alpha; CHRomatin Organization Modifier (chromo) domain of heterochromatin protein 1 homolog alpha (also known as HP1alpha, Cbx5, and Chromobox 5), and related proteins. HP1alpha has diverse functions in heterochromatin formation, gene regulation, and mitotic progression, and forms complex networks of gene, RNA, and protein interactions. HP1 has two conserved protein-protein interaction domains, a single N-terminal chromodomain (CD) which can bind to histone proteins via methylated lysine residues, and a related C-terminal chromo shadow domain (CSD) which is responsible for the homodimerization and interaction with a number of chromatin-associated non-histone proteins; a flexible hinge region separates the CD and CSD and may bind nucleic acid. HP1 is a highly conserved non-histone chromosomal protein that is evolutionarily conserved from fission yeast to plants and animals. There are three human homologs of HP1 proteins: HP1alpha, HP1beta (also known as Cbx1), and HP1gamma (also known as Cbx3).


Pssm-ID: 349298  Cd Length: 50  Bit Score: 58.85  E-value: 5.73e-11
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 2037036700 1430 DFEVEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENLNCPDLIQEY 1477
Cdd:cd18651      1 EYVVEKVLDRRVVKGQVEYLLKWKGFSEEHNTWEPEKNLDCPELISEF 48
rve pfam00665
Integrase core domain; Integrase mediates integration of a DNA copy of the viral genome into ...
1135-1229 5.95e-11

Integrase core domain; Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain (pfam02022). This domain is the central catalytic domain. The carboxyl-terminal domain is a non-specific DNA binding domain (pfam00552). The catalytic domain acts as an endonuclease when two nucleotides are removed from the 3' ends of the blunt-ended viral DNA made by reverse transcription. This domain also catalyzes the DNA strand transfer reaction of the 3' ends of the viral DNA to the 5' ends of the integration site.


Pssm-ID: 459897 [Multi-domain]  Cd Length: 98  Bit Score: 60.41  E-value: 5.95e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700 1135 DFIEGlpPSKKYDSKTYdsILVIVDRLTKFAILAPTHKTVTAKQT-AVLLYghMVRLFGY-PDHMVSDRGRQFISGAWKA 1212
Cdd:pfam00665    8 DFTYI--RIPGGGGKLY--LLVIVDDFSREILAWALSSEMDAELVlDALER--AIAFRGGvPLIIHSDNGSEYTSKAFRE 81
                           90
                   ....*....|....*..
gi 2037036700 1213 FAEQMGVKHSLSTAYHP 1229
Cdd:pfam00665   82 FLKDLGIKPSFSRPGNP 98
CD_HP1gamma_Cbx3 cd18652
chromodomain of heterochromatin protein 1 homolog gamma; CHRomatin Organization Modifier ...
1430-1477 8.05e-11

chromodomain of heterochromatin protein 1 homolog gamma; CHRomatin Organization Modifier (chromo) domain of heterochromatin protein 1 homolog gamma (also known as HP1gamma, Cbx3, and Chromobox 3), and related proteins. HP1gamma is a highly conserved non-histone protein, which is a member of the heterochromatin protein family, and is enriched in the heterochromatin and associated with centromeres. HP1 has two conserved protein-protein interaction domains, a single N-terminal chromodomain (CD) which can bind to histone proteins via methylated lysine residues, and a related C-terminal chromo shadow domain (CSD) which is responsible for the homodimerization and interaction with a number of chromatin-associated non-histone proteins; a flexible hinge region separates the CD and CSD and may bind nucleic acid. In addition to being involved in transcriptional silencing in heterochromatin-like complexes, HP1gamma also binds lamin B receptor, an integral membrane protein found in the inner nuclear membrane. The dual binding functions of the protein may explain the association of heterochromatin with the inner nuclear membrane. HP1gamma is also recruited to sites of ultraviolet-induced DNA damage and double-strand breaks. HP1 is a highly conserved non-histone chromosomal protein that is evolutionarily conserved from fission yeast to plants and animals. There are three human homologs of HP1 proteins: HP1alpha (also known as Cbx5), HP1beta (also known as Cbx1), and HP1gamma.


Pssm-ID: 349299  Cd Length: 50  Bit Score: 58.48  E-value: 8.05e-11
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 2037036700 1430 DFEVEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENLNCPDLIQEY 1477
Cdd:cd18652      1 EFVVEKVLDRRVVNGKVEYFLKWKGFTDADNTWEPEENLDCPELIEAF 48
CD_HP1_like cd18960
chromodomain of heterochromatin protein 1 proteins, including HP1alpha, HP1beta, and HP1gamma; ...
1431-1478 4.03e-10

chromodomain of heterochromatin protein 1 proteins, including HP1alpha, HP1beta, and HP1gamma; uncharacterized subgroup; CHRomatin Organization Modifier (chromo) domain of mammalian HP1alpha (Cbx5), HP1beta (Cbx1), HP1gamma (Cbx3), and similar proteins. HP1 has diverse functions in heterochromatin formation and impacts both gene expression and gene silencing. HP1 has two conserved protein-protein interaction domains, a single N-terminal chromodomain (CD) which can bind to histone proteins via methylated lysine residues, and a related C-terminal chromo shadow domain (CSD) which is responsible for the homodimerization and interaction with a number of chromatin-associated non-histone proteins; a flexible hinge region separates the CD and CSD and may bind nucleic acid. HP1 is a highly conserved non-histone chromosomal protein that is evolutionarily conserved from fission yeast to plants and animals. There are three human homologs of HP1 proteins: HP1alpha (also known as Cbx5), HP1beta (also known as Cbx1), and HP1gamma (also known as Cbx3).


Pssm-ID: 349316  Cd Length: 51  Bit Score: 56.41  E-value: 4.03e-10
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 2037036700 1431 FEVEALIDKR-SHNGTTEYKVLWRGYSEEAASWEPVENLNCPDLIQEYE 1478
Cdd:cd18960      2 FVVERILDKRlGRNGGEEFLIKWQGFPESDSSWEPRENLQCDEMLEEFE 50
CD_CMT3_like cd18635
chromodomain of chromomethylase 3, and similar proteins; CHRomatin Organization Modifier ...
1431-1477 6.98e-10

chromodomain of chromomethylase 3, and similar proteins; CHRomatin Organization Modifier (chromo) domain of DNA (cytosine-5)-methyltransferase chromomethylase 3 (CMT3, EC:2.1.1.37), and similar proteins. CMT3 is primarily a CHG (where H is either A, T or C) methyltransferase and is predominantly expressed in actively replicating cells. The protein is involved in preferentially methylating transposon-related sequences, reducing their mobility. Studies suggest that in order to target DNA methylation, CMT3 associates with H3K9me2-containing nucleosomes through binding of its BAH- and chromo-domains to H3K9me2. A chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in binding the proteins in which it is found to methylated histone tails and possibly RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349285  Cd Length: 57  Bit Score: 56.17  E-value: 6.98e-10
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2037036700 1431 FEVEALIDKRSH--NGTTE----YKVLWRGYSEEAASWEPVENL-NCPDLIQEY 1477
Cdd:cd18635      2 FEVEKLVGICYGdpKKTGErglyFKVRWKGYGPEEDTWEPIEGLsNCPEKIKEF 55
chromodomain cd18966
CHROMO (CHRomatin Organization MOdifier) domain; uncharacterized subgroup; The chromodomain ...
1431-1478 7.64e-10

CHROMO (CHRomatin Organization MOdifier) domain; uncharacterized subgroup; The chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins. The chromodomain is implicated in binding the proteins in which it is found to methylated histone tails and possibly RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain. Chromodomains belong to the chromo-like superfamily of SH3-fold-beta-barrel domains which includes chromo shadow domains and chromo barrel domains. Chromodomains differ from these in that they lack the first strand of the SH3-fold-beta-barrel. This first strand is altered by insertion in the chromo shadow domains, and chromo barrel domains are typical SH3-fold-beta-barrel domains with sequence similarity to the canonical chromo domain.


Pssm-ID: 349322  Cd Length: 49  Bit Score: 55.75  E-value: 7.64e-10
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 2037036700 1431 FEVEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENLNCPDLIQEYE 1478
Cdd:cd18966      1 YEVERILAERRDDGGKRYLVKWEGYPLEEATWEPEENIGDEELLKEWE 48
CD_MarY1_POL_like cd18975
chromodomain of Tricholoma matsutake polyprotein, and similar proteins; This subgroup includes ...
1431-1477 1.31e-09

chromodomain of Tricholoma matsutake polyprotein, and similar proteins; This subgroup includes the CHROMO (CHRomatin Organization MOdifier) domain found in the polyprotein from the MarY1 Ty3/Gypsy long terminal repeat (LTR) retroelement from the ectomycorrhizal basidiomycete Tricholoma matsutake. The pol gene in Ty3/Gypsy elements generally encodes domains in the following order: prt-reverse transcriptase-RNase H-integrase; in MarY1 POL the chromodomain is found at the C-terminus of the integrase domain. The chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and implicated in binding the proteins in which it is found to methylated histone tails and possibly RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349331  Cd Length: 49  Bit Score: 55.24  E-value: 1.31e-09
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 2037036700 1431 FEVEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENLNCPDLIQEY 1477
Cdd:cd18975      1 YEVESILNSRLHRGKLQYLIQWKGYPLEEASWELEDNIKNPRLIEEF 47
CHROMO smart00298
Chromatin organization modifier domain;
1430-1478 1.34e-09

Chromatin organization modifier domain;


Pssm-ID: 214605 [Multi-domain]  Cd Length: 55  Bit Score: 55.30  E-value: 1.34e-09
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|.
gi 2037036700  1430 DFEVEALIDKRSH-NGTTEYKVLWRGYSEEAASWEPVENL-NCPDLIQEYE 1478
Cdd:smart00298    1 EYEVEKILDHRWKkKGELEYLVKWKGYSYSEDTWEPEENLlNCSKKLDNYK 51
CD_DDE_transposase_like cd18978
chromodomain of Rhizopus microsporus putative DDE transposases, and similar proteins; This ...
1428-1479 2.14e-09

chromodomain of Rhizopus microsporus putative DDE transposases, and similar proteins; This subgroup includes the CHROMO (CHRomatin Organization MOdifier) domain found in Rhizopus microsporus putative DDE transposases, and similar proteins. The chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and implicated in binding the proteins in which it is found to methylated histone tails and possibly RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349334  Cd Length: 52  Bit Score: 54.63  E-value: 2.14e-09
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2037036700 1428 DLDFEVEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENLNCPDLIQEYEV 1479
Cdd:cd18978      1 DESYEVEKIINHRGEKNRRKYLVKWKGYDDTDNSWVTQEDFNDKDMIDEYEN 52
CD_CDY cd18634
chromodomain of the Chromodomain Y-like protein family; This group includes the chromodomain ...
1431-1477 2.57e-09

chromodomain of the Chromodomain Y-like protein family; This group includes the chromodomain found in the mammalian chromodomain Y-like (CDY) protein family, and similar proteins. The human CDY family includes six proteins: the genes encoding four of these, two copies of CDY1 (CDY1a and CDY1b) and two copies of CDY2 (CDY2a and CDY2b), are located on chromosome Y, and the genes encoding the other two members (CDYL and CDYL2) are located on autosomes. The Y-chromosomal genes are only present in primates, whereas the CDYL and CDYL2 genes exist in most mammalian species. The CDY family proteins contain two functional domains: a chromodomain involved in chromatin binding and a catalytic domain found in many coenzyme A (CoA)-dependent acylation enzymes. CDYL is ubiquitously expressed, whereas CDYL2 shows selective expression in testis, prostate, spleen, and leukocytes; the CDY genes are expressed only in the testis. Deletion of the CDY1b gene has been shown to be a risk factor for male infertility. Impairments in CDY2 expression could be implicated in the pathogenesis of maturation arrest (a failure of germ cell development).


Pssm-ID: 349284  Cd Length: 52  Bit Score: 54.37  E-value: 2.57e-09
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 2037036700 1431 FEVEALIDKR-SHNGTTEYKVLWRGYSEEAASWEPVENL-NCPDLIQEY 1477
Cdd:cd18634      2 YEVERIVDKRkNKKGKTEYLVRWKGYDSEDDTWEPEQHLlNCEEFIHDF 50
transpos_IS481 NF033577
IS481 family transposase
1145-1254 3.94e-09

IS481 family transposase


Pssm-ID: 468094 [Multi-domain]  Cd Length: 283  Bit Score: 59.53  E-value: 3.94e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700 1145 KYDSKTYdsILVIVDRLTKFAILA--PTHKTVTAKQTAVLLYghmvRLFGYPDHMV-SDRGRQFIS--GAWKAFAEQMGV 1219
Cdd:NF033577   142 PDVGRLY--LHTAIDDHSRFAYAElyPDETAETAADFLRRAF----AEHGIPIRRVlTDNGSEFRSraHGFELALAELGI 215
                           90       100       110
                   ....*....|....*....|....*....|....*...
gi 2037036700 1220 KHSLSTAYHPQTDGQTERVNQVIEQ---YLRMYCNYEQ 1254
Cdd:NF033577   216 EHRRTRPYHPQTNGKVERFHRTLKDefaYARPYESLAE 253
CD_Clr4_like cd18632
N-terminal chromodomain of the fission yeast histone methyltransferase Clr4, and similar ...
1430-1478 9.99e-09

N-terminal chromodomain of the fission yeast histone methyltransferase Clr4, and similar proteins; N-terminal CHRomatin Organization Modifier (chromo) domain of cryptic loci regulator 4 (Clr4), a histone H3 lysine methyltransferase which targets H3K9. Clr4 regulates silencing and switching at the mating-type loci and affects chromatin structure at centromeres. Clr4 is a catalytic component of the rik1-associated E3 ubiquitin ligase complex that shows ubiquitin ligase activity and is required for histone H3K9 methylation. H3K9me represents a specific tag for epigenetic transcriptional repression by recruiting swi6/HP1 to methylated histones which leads to transcriptional silencing within centromeric heterochromatin, telomeric regions and at the silent mating-type loci. A chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in binding the proteins in which it is found to methylated histone tails and possibly RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349282  Cd Length: 55  Bit Score: 52.89  E-value: 9.99e-09
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2037036700 1430 DFEVEALIDKRSHNGTTE--YKVLWRGYSEEAASWEPVENLN-CPDLIQEYE 1478
Cdd:cd18632      1 EYEVEKIVDEKTDRNTAEplYLVRWKNYSKNHDTWEPAENLSgCQAVLEKWK 52
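As a rough illustration of how the alignment rows above can be read, the sketch below counts aligned columns and identical residues for the cd18632 hit. It assumes that "-" marks a gap and that lowercase letters mark residues outside the aligned columns; that reading of the display, and the function name count_identities, are assumptions made for this example.

```python
def count_identities(query_row, subject_row):
    """Count (identical, aligned) columns in two equal-length alignment rows.

    Assumption: a column counts as aligned only when both rows show an
    uppercase residue; '-' (gap) and lowercase letters are skipped.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, s in zip(query_row, subject_row):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


# Rows copied from the cd18632 alignment (query residues 1430-1478).
QUERY = "DFEVEALIDKRSHNGTTE--YKVLWRGYSEEAASWEPVENLN-CPDLIQEYE"
SUBJECT = "EYEVEKIVDEKTDRNTAEplYLVRWKNYSKNHDTWEPAENLSgCQAVLEKWK"

identical, aligned = count_identities(QUERY, SUBJECT)
print("identical:", identical, "aligned:", aligned,
      "percent identity: %.1f" % (100.0 * identical / aligned))
```

The number of aligned columns should equal the length of the reported query interval, which gives a quick consistency check on a copied alignment.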
CD_polycomb cd18644
chromodomain of polycomb; CHRomatin Organization Modifier (chromo) domain of the PcG ...
1428-1481 1.82e-08

chromodomain of polycomb; CHRomatin Organization Modifier (chromo) domain of the PcG (polycomb-group) chromodomain protein Polycomb (Pc) from Drosophila melanogaster, arthropod, worm, and sea cucumber, and similar proteins. Pc is a component of the Polycomb-group (PcG) multiprotein PRC1 complex, a complex class required to maintain the transcriptionally repressive state of many genes, including Hox genes, throughout development. The core subunits of PRC1 are polycomb (Pc), polyhomeotic (Ph), posterior sex combs (Psc), and sex comb extra (Sce, also known as dRing). Polycomb (Pc) plays a role in modulating life span in flies; it negatively regulates longevity.


Pssm-ID: 349291  Cd Length: 54  Bit Score: 52.08  E-value: 1.82e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2037036700 1428 DLDFEVEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENLNCPDLIQEYEVSE 1481
Cdd:cd18644      1 DLVYAAEKILKKRVRKGKVEYLVKWKGWSNKHNTWEPEENILDRRLIEIFERTN 54
CD_SUV39H1_like cd18639
chromodomain of histone methyltransferase SUV39H1, and similar proteins; CHRomatin ...
1431-1477 2.43e-08

chromodomain of histone methyltransferase SUV39H1, and similar proteins; CHRomatin Organization Modifier (chromo) domain of human SUV39H1, a histone lysine methyltransferase (HMT) which catalyzes di- and tri-methylation of lysine 9 of histone H3 (H3K9me2/3), leading to heterochromatin formation and gene silencing. H3K9me2/3 represents a specific mark for epigenetic transcriptional repression by recruiting HP1 (CBX1, CBX3, and/or CBX5) proteins to methylated histones. SUV39H1 mainly functions in heterochromatin regions. The human SUV39H1/2, histone H3K9 methyltransferases, are the mammalian homologs of Drosophila Su(var)3-9 and Schizosaccharomyces pombe Clr4. SUV39H1 contains a chromodomain at its N-terminus and a SET domain at its C-terminus. Although the SET domain performs the catalytic activity, the chromodomain of SUV39H1 is essential for the catalytic activity of SUV39H1. A chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349289  Cd Length: 49  Bit Score: 51.36  E-value: 2.43e-08
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 2037036700 1431 FEVEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENLNCPDLIQEY 1477
Cdd:cd18639      1 YEVEYLCDYKKIREQEYYLVKWKGYPDSENTWEPRQNLKCSRLLKQF 47
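Many of the chromodomain hits in this list cover nearly the same query residues. The sketch below shows one way to collapse overlapping hit intervals into distinct regions of the query. The intervals are copied from hits on this page; the function name merge_intervals is hypothetical, and treating any overlap as "the same region" is a simplifying assumption.

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) intervals; ends are inclusive."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if needed.
            previous_start, previous_end = merged[-1]
            merged[-1] = (previous_start, max(previous_end, end))
        else:
            merged.append((start, end))
    return merged


# Query intervals of four hits listed above.
hits = {
    "NF033577": (1145, 1254),
    "CD_Clr4_like": (1430, 1478),
    "CD_polycomb": (1428, 1481),
    "CD_SUV39H1_like": (1431, 1477),
}

regions = merge_intervals(hits.values())
print(regions)
```

With these four hits, the three chromodomain models collapse into a single region, separate from the NF033577 region.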
CD_Cbx2 cd18647
chromodomain of chromobox homolog 2; CHRomatin Organization Modifier (chromo) domain of ...
1431-1478 2.45e-08

chromodomain of chromobox homolog 2; CHRomatin Organization Modifier (chromo) domain of chromobox homolog 2 (CBX2), a component of the PcG repressive complex PRC1, one of the two classes of PRCs. PcG proteins form large multiprotein complexes (PcG bodies) which are involved in the stable repression of genes involved in development, signaling or cancer via chromatin-based epigenetic modifications. Mammalian PRC1 includes canonical (cPRC1) and non-canonical complexes; cPRC1 contains four core subunits including one CBX protein (CBX2, CBX4, and CBX6-CBX8) that binds H3K27me3. CBX family members have different affinity for H3K27me3, with CBX7 having the highest binding capability. The human CBX proteins show distinct nuclear localizations and contribute differently to transcriptional repression. Some CBX proteins of the PRC1 complex have been implicated in transcriptional activation as well as in PRC1-independent roles in embryonic stem cells and in somatic cells.


Pssm-ID: 349294  Cd Length: 53  Bit Score: 51.60  E-value: 2.45e-08
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 2037036700 1431 FEVEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENLNCPDLIQEYE 1478
Cdd:cd18647      4 FAAECILSKRLRKGKLEYLVKWRGWSSKHNSWEPEENILDPRLLLAFQ 51
CD_Tf2-1_POL_like cd18973
chromodomain of Rhizoctonia solani AG-1 IB retrotransposable element Tf2 155 kDa protein type ...
1431-1478 2.46e-08

chromodomain of Rhizoctonia solani AG-1 IB retrotransposable element Tf2 155 kDa protein type 1, and similar proteins; This subgroup includes the CHROMO (CHRomatin Organization Modifier) domain found in Rhizoctonia solani AG-1 IB retrotransposable element Tf2 155 kDa protein type 1 (Tf2-1), and similar proteins. It belongs to the Ty3/gypsy family of long terminal repeat (LTR) retrotransposons. The pol gene in Ty3/gypsy elements generally encodes domains in the following order: an aspartyl protease, a reverse transcriptase, RNase H, and an integrase; here the chromodomain is found at the C-terminus of the integrase domain. The chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349329  Cd Length: 50  Bit Score: 51.48  E-value: 2.46e-08
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 2037036700 1431 FEVEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENL-NCPDLIQEYE 1478
Cdd:cd18973      1 YVVEAILDNKRRKGKWLYLVKWKGYGPEHNTWEPRENLeHAQKLLKKYY 49
CD_EhHp1_like cd18638
chromodomain of Entamoeba histolytica heterochromatin protein 1, and similar proteins; This ...
1431-1478 3.52e-08

chromodomain of Entamoeba histolytica heterochromatin protein 1, and similar proteins; This subgroup includes the N-terminal CHRomatin Organization Modifier (chromo) domain of heterochromatin protein 1 (HP1)-like protein from Entamoeba histolytica, and similar proteins. A chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349288  Cd Length: 52  Bit Score: 51.10  E-value: 3.52e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 2037036700 1431 FEVEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENL--NCPDLIQEYE 1478
Cdd:cd18638      2 FEVEKIVKKKTVKGGTEYFVKWKGYSAKENTWETEDNLekSYKEMIDEFE 51
CD_POL_like cd18976
chromodomain of uncharacterized putative retroelement polyprotein proteins; This subgroup ...
1433-1478 5.73e-08

chromodomain of uncharacterized putative retroelement polyprotein proteins; This subgroup includes the CHROMO (CHRomatin Organization Modifier) domain found in uncharacterized putative retrotransposon proteins, and similar proteins. The chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349332  Cd Length: 51  Bit Score: 50.64  E-value: 5.73e-08
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 2037036700 1433 VEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENL--NCPDLIQEYE 1478
Cdd:cd18976      3 VESLLDRRKVRGQVQYLVKWRGFPRSEATWEPREELmrRCAELVAAYD 50
CD_MMP8 cd18633
chromodomain of M-phase phosphoprotein 8; The chromodomain of M-phase phosphoprotein 8 (MPP8), ...
1431-1478 6.68e-08

chromodomain of M-phase phosphoprotein 8; The chromodomain of M-phase phosphoprotein 8 (MPP8), a component of the RanBPM-containing large protein complex, binds methylated H3K9. This may in turn recruit the H3K9 methyltransferases GLP and ESET, and DNA methyltransferase 3A to the promoter of the E-cadherin gene, mediating the E-cadherin gene silencing and promoting tumor cell motility and invasion. A chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349283  Cd Length: 51  Bit Score: 50.36  E-value: 6.68e-08
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 2037036700 1431 FEVEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENL-NCPDLIQEYE 1478
Cdd:cd18633      2 FEVEKILDMKTEGGKVLYKVRWKGYTSDDDTWEPEVHLeDCKEVLLEFR 50
CD_Rhino cd18630
chromodomain of Drosophila melanogaster Rhino, and similar proteins; N-terminal CHRomatin ...
1433-1478 3.33e-07

chromodomain of Drosophila melanogaster Rhino, and similar proteins; N-terminal CHRomatin Organization Modifier (chromo) domain of Drosophila melanogaster Rhino (also known as heterochromatin protein 1-like), and similar proteins. Rhino is a female-specific protein that affects chromosome structure and egg polarity and is required for germline PIWI-interacting RNA (piRNA) production. In Drosophila, the RDC (rhino, deadlock, and cutoff) complex, composed of rhino, the protein deadlock (Del) and the Rai1-like transcription termination cofactor cutoff (Cuff), binds to chromatin of dual-strand piRNA clusters, special genomic regions which encode piRNA precursors. The RDC complex is anchored to H3K9me3-marked chromatin in part via the H3K9me3-binding activity of Rhino, and is required for transcription of piRNA precursors. A chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349280  Cd Length: 51  Bit Score: 48.28  E-value: 3.33e-07
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 2037036700 1433 VEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENL-NCPDLIQEYE 1478
Cdd:cd18630      4 VEKILGKRFVNGRPQVLVKWSGFPNENNTWEPLENLgNCMKLVADYE 50
CD_polycomb_like cd18627
chromodomain of polycomb and chromobox family proteins; CHRomatin Organization Modifier ...
1431-1478 4.08e-07

chromodomain of polycomb and chromobox family proteins; CHRomatin Organization Modifier (chromo) domain of Polycomb and Polycomb-group (PcG) chromobox (CBX) family proteins such as CBX2, CBX4, CBX6, CBX7, and CBX8. These CBX proteins are components of the PcG repressive complex PRC1, one of the two classes of PRCs. PcG proteins form large multiprotein complexes (PcG bodies) which are involved in the stable repression of genes involved in development, signaling or cancer via chromatin-based epigenetic modifications. Mammalian PRC1 includes canonical (cPRC1) and non-canonical complexes; cPRC1 contains four core subunits including one CBX protein (CBX2, CBX4, and CBX6-CBX8) that binds H3K27me3. CBX family members have different affinity for H3K27me3, with CBX7 having the highest binding capability. The human CBX proteins show distinct nuclear localizations and contribute differently to transcriptional repression. Some CBX proteins of the PRC1 complex have been implicated in transcriptional activation as well as in PRC1-independent roles in embryonic stem cells and in somatic cells.


Pssm-ID: 349277  Cd Length: 49  Bit Score: 48.16  E-value: 4.08e-07
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 2037036700 1431 FEVEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENLNCPDLIQEYE 1478
Cdd:cd18627      1 FAAECILKKRIRKGKVEYLVKWKGWSQKYNTWEPEENILDPRLLAAFE 48
chromodomain cd18968
CHROMO (CHRomatin Organization Modifier) domain; uncharacterized subgroup; The chromodomain ...
1430-1477 4.33e-07

CHROMO (CHRomatin Organization Modifier) domain; uncharacterized subgroup; The chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain. Chromodomains belong to the chromo-like superfamily of SH3-fold-beta-barrel domains which includes chromo shadow domains and chromo barrel domains. Chromodomains differ from these in that they lack the first strand of the SH3-fold-beta-barrel. This first strand is altered by insertion in the chromo shadow domains, and chromo barrel domains are typical SH3-fold-beta-barrel domains with sequence similarity to the canonical chromo domain.


Pssm-ID: 349324  Cd Length: 57  Bit Score: 48.11  E-value: 4.33e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2037036700 1430 DFEVEALI------DKRSHNGTTEYKVLWRGYSEEAASWEPVENLN-CPDLIQEY 1477
Cdd:cd18968      1 EYEVEVILaarvvkDAESRKKGWKYLVKWAGYPDEENTWEPEESFDgCDDLLERF 55
gag-asp_proteas pfam13975
gag-polyprotein putative aspartyl protease; This family of putative aspartyl proteases is ...
314-405 8.27e-07

gag-polyprotein putative aspartyl protease; This family of putative aspartyl proteases is found predominantly in retroviral proteins.


Pssm-ID: 464060  Cd Length: 92  Bit Score: 48.34  E-value: 8.27e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700  314 LSGMVQDHPARILADTGAGLSIVSDSFISKYQIPTKP-IKTRSIHG----VTGHQLSINSsasmqVSIGTHNLGVVEASV 388
Cdd:pfam13975    1 VDVTINGRPVRFLVDTGASVTVISEALAERLGLDRLVdAYPVTVRTangtVRAARVRLDS-----VKIGGIELRNVPAVV 75
                           90
                   ....*....|....*..
gi 2037036700  389 ADTADYDLILGFTELRR 405
Cdd:pfam13975   76 LPGDLDDVLLGMDFLKR 92
RT_DIRS1 cd03714
RT_DIRS1: Reverse transcriptases (RTs) occurring in the DIRS1 group of retrotransposons. Members ...
635-749 1.15e-06

RT_DIRS1: Reverse transcriptases (RTs) occurring in the DIRS1 group of retrotransposons. Members of the subfamily include the Dictyostelium DIRS-1, Volvox carteri kangaroo, and Panagrellus redivivus PAT elements. These elements differ from LTR and conventional non-LTR retrotransposons. They contain split direct repeat (SDR) termini, and have been proposed to integrate via double-stranded closed-circle DNA intermediates assisted by an encoded recombinase which is similar to gamma-site-specific integrase.


Pssm-ID: 239684 [Multi-domain]  Cd Length: 119  Bit Score: 48.88  E-value: 1.15e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700  635 LDLRAAYNLIRIAKgDEWK-TAFGTQLGLYEYLVMPFGLANAPAHFQSFINDIFRDI--IGIYVVVYLDDFLIFSDTEea 711
Cdd:cd03714      1 VDLKDAYFHIPILP-RSRDlLGFAWQGETYQFKALPFGLSLAPRVFTKVVEALLAPLrlLGVRIFSYLDDLLIIASSI-- 77
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 2037036700  712 hvKHVTEVLTRLRSNRLFA-----KLSKCE-FHTKTVEFLGYII 749
Cdd:cd03714     78 --KTSEAVLRHLRATLLANlgftlNLEKSKlGPTQRITFLGLEL 119
CD_Cbx8 cd18649
chromodomain of chromobox homolog 8; CHRomatin Organization Modifier (chromo) domain of ...
1431-1478 1.63e-05

chromodomain of chromobox homolog 8; CHRomatin Organization Modifier (chromo) domain of chromobox homolog 8 (CBX8), a component of the PcG repressive complex PRC1, one of the two classes of PRCs. PcG proteins form large multiprotein complexes (PcG bodies) which are involved in the stable repression of genes involved in development, signaling or cancer via chromatin-based epigenetic modifications. Mammalian PRC1 includes canonical (cPRC1) and non-canonical complexes; cPRC1 contains four core subunits including one CBX protein (CBX2, CBX4, and CBX6-CBX8) that binds H3K27me3. CBX family members have different affinity for H3K27me3, with CBX7 having the highest binding capability. The human CBX proteins show distinct nuclear localizations and contribute differently to transcriptional repression. Some CBX proteins of the PRC1 complex have been implicated in transcriptional activation as well as in PRC1-independent roles in embryonic stem cells and in somatic cells. CBX proteins may act as an oncogene or tumor suppressor in a cell-type-dependent manner; for example, CBX8 promotes proliferation while suppressing metastasis in colorectal carcinoma progression.


Pssm-ID: 349296  Cd Length: 55  Bit Score: 43.55  E-value: 1.63e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 2037036700 1431 FEVEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENLNCPDLIQEYE 1478
Cdd:cd18649      5 FAAEALLKRRIRKGRMEYLVKWKGWSQKYSTWEPEENILDARLLAAFE 52
CD_Cbx4 cd18645
chromodomain of chromobox homolog 4; CHRomatin Organization Modifier (chromo) domain of ...
1431-1481 2.29e-05

chromodomain of chromobox homolog 4; CHRomatin Organization Modifier (chromo) domain of chromobox homolog 4 (CBX4), a component of the PcG repressive complex PRC1, one of the two classes of PRCs. PcG proteins form large multiprotein complexes (PcG bodies) which are involved in the stable repression of genes involved in development, signaling or cancer via chromatin-based epigenetic modifications. Mammalian PRC1 includes canonical (cPRC1) and non-canonical complexes; cPRC1 contains four core subunits including one CBX protein (CBX2, CBX4, and CBX6-CBX8) that binds H3K27me3. CBX family members have different affinity for H3K27me3, with CBX7 having the highest binding capability. The human CBX proteins show distinct nuclear localizations and contribute differently to transcriptional repression. Some CBX proteins of the PRC1 complex have been implicated in transcriptional activation as well as in PRC1-independent roles in embryonic stem cells and in somatic cells. In addition to a chromodomain with H3K27me3-binding activity, CBX4 contains two SUMO-interacting motifs responsible for its small ubiquitin-related modifier (SUMO) E3 ligase activity. CBX proteins may act as an oncogene or tumor suppressor in a cell-type-dependent manner; for example, CBX8 promotes proliferation while suppressing metastasis in colorectal carcinoma progression. CBX4 may serve as a tumor suppressor in colorectal carcinoma, and has been shown to be an oncogene in osteosarcoma and breast cancer.


Pssm-ID: 349292  Cd Length: 55  Bit Score: 43.12  E-value: 2.29e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2037036700 1431 FEVEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENLNCPDLIQEYEVSE 1481
Cdd:cd18645      4 FAVESIEKKRIRKGRVEYLVKWRGWSPKYNTWEPEENILDPRLLIAFQNRE 54
CD_MT_like cd18962
chromodomain of a putative Coemansia reversa NRRL 1564 methyltransferase, and similar proteins; ...
1433-1477 3.01e-05

chromodomain of a putative Coemansia reversa NRRL 1564 methyltransferase, and similar proteins; This subgroup includes the CHROMO (CHRomatin Organization Modifier) domain found in a Coemansia reversa NRRL 1564 SET (Su(var)3-9, enhancer-of-zeste, trithorax) domain-containing protein, and similar proteins. The SU(VAR)3-9 protein is the main chromocenter-specific histone H3-K9 methyltransferase (HMTase) in Drosophila where it plays a role in heterochromatic gene silencing. A chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349318  Cd Length: 52  Bit Score: 42.94  E-value: 3.01e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 2037036700 1433 VEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENLNCPDLIQEY 1477
Cdd:cd18962      6 VEAIVNDVLIDGKHMYEVKWEGYPSDHNNWVAEWDLNDKEILRKY 50
CD_Cbx7 cd18646
chromodomain of chromobox homolog 7; CHRomatin Organization Modifier (chromo) domain of ...
1431-1481 3.08e-05

chromodomain of chromobox homolog 7; CHRomatin Organization Modifier (chromo) domain of chromobox homolog 7 (CBX7), a component of the PcG repressive complex PRC1, one of the two classes of PRCs. PcG proteins form large multiprotein complexes (PcG bodies) which are involved in the stable repression of genes involved in development, signaling or cancer via chromatin-based epigenetic modifications. Mammalian PRC1 includes canonical (cPRC1) and non-canonical complexes; cPRC1 contains four core subunits including one CBX protein (CBX2, CBX4, and CBX6-CBX8) that binds H3K27me3. CBX family members have different affinity for H3K27me3, with CBX7 having the highest binding capability. The human CBX proteins show distinct nuclear localizations and contribute differently to transcriptional repression. Some CBX proteins of the PRC1 complex have been implicated in transcriptional activation as well as in PRC1-independent roles in embryonic stem cells and in somatic cells. CBX proteins may act as an oncogene or tumor suppressor in a cell-type-dependent manner; for example, CBX8 promotes proliferation while suppressing metastasis in colorectal carcinoma progression. CBX7 has been shown to function as a tumor suppressor in lung carcinoma and an oncogene in gastric cancer and lymphoma.


Pssm-ID: 349293  Cd Length: 56  Bit Score: 43.15  E-value: 3.08e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2037036700 1431 FEVEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENLNCPDLIQEYEVSE 1481
Cdd:cd18646      5 FAVESIRKKRVRKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEEKE 55
chromodomain cd18965
CHROMO (CHRomatin Organization Modifier) domain; uncharacterized subgroup; The chromodomain ...
1433-1469 3.93e-05

CHROMO (CHRomatin Organization Modifier) domain; uncharacterized subgroup; The chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain. Chromodomains belong to the chromo-like superfamily of SH3-fold-beta-barrel domains which includes chromo shadow domains and chromo barrel domains. Chromodomains differ from these in that they lack the first strand of the SH3-fold-beta-barrel. This first strand is altered by insertion in the chromo shadow domains, and chromo barrel domains are typical SH3-fold-beta-barrel domains with sequence similarity to the canonical chromo domain.


Pssm-ID: 349321  Cd Length: 53  Bit Score: 42.46  E-value: 3.93e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 2037036700 1433 VEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENLN 1469
Cdd:cd18965      3 IEALLKKRQFNRKLEYLVKWHGLPESENTWEREKDIK 39
CD_POL_like cd18974
chromodomain of Penicillium solitum protein PENSOL_c198G03123; This subgroup includes the ...
1431-1478 4.51e-05

chromodomain of Penicillium solitum protein PENSOL_c198G03123; This subgroup includes the CHROMO (CHRomatin Organization Modifier) domain found in Penicillium solitum protein PENSOL_c198G03123, a putative polyprotein from a Ty3/gypsy long terminal repeat (LTR) retroelement. The pol gene in Ty3/gypsy elements generally encodes domains in the following order: an aspartyl protease, a reverse transcriptase, RNase H, and an integrase; here the chromodomain is found at the C-terminus of the integrase domain. The chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349330  Cd Length: 50  Bit Score: 42.08  E-value: 4.51e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 2037036700 1431 FEVEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENL-NCPDLIQEYE 1478
Cdd:cd18974      1 WEVEEIVDEKMIDDELHYLVKWKGWPAEYNQWEPEDDMeNAPKAIQSYE 49
CD_NC-like cd18980
chromodomain of a Trichosporon asahii var. asahii CBS 8904 retrotransposon nucleocapsid protein, and ...
1428-1477 7.02e-05

chromodomain of a Trichosporon asahii var. asahii CBS 8904 retrotransposon nucleocapsid protein, and similar proteins; This subgroup includes the CHROMO (CHRomatin Organization Modifier) domain found in Trichosporon asahii var. asahii CBS 8904 retrotransposon nucleocapsid protein, and similar proteins. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349336  Cd Length: 56  Bit Score: 41.79  E-value: 7.02e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2037036700 1428 DLDFEVEALIDKRSHNGTTE---YKVLWRGYSEEAASWEPVENL-NCPDLIQEY 1477
Cdd:cd18980      1 QPEYEVEAILDHKVDRRYRDpnfYLVRWRGYGPSHDSWEPTSALeNAQDLLREF 54
CD_Cbx6 cd18648
chromodomain of chromobox homolog 6; CHRomatin Organization Modifier (chromo) domain of ...
1431-1484 8.04e-05

chromodomain of chromobox homolog 6; CHRomatin Organization Modifier (chromo) domain of chromobox homolog 6 (CBX6), a component of the PcG repressive complex PRC1, one of the two classes of PRCs. PcG proteins form large multiprotein complexes (PcG bodies) which are involved in the stable repression of genes involved in development, signaling or cancer via chromatin-based epigenetic modifications. Mammalian PRC1 includes canonical (cPRC1) and non-canonical complexes; cPRC1 contains four core subunits including one CBX protein (CBX2, CBX4, and CBX6-CBX8) that binds H3K27me3. CBX family members have different affinity for H3K27me3, with CBX7 having the highest binding capability. The human CBX proteins show distinct nuclear localizations and contribute differently to transcriptional repression. Some CBX proteins of the PRC1 complex have been implicated in transcriptional activation as well as in PRC1-independent roles in embryonic stem cells and in somatic cells.


Pssm-ID: 349295  Cd Length: 58  Bit Score: 41.97  E-value: 8.04e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2037036700 1431 FEVEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENLNCPDLIQEYEVSEGGR 1484
Cdd:cd18648      4 FAAESIIKRRIRKGRIEYLVKWKGWAIKYSTWEPEENILDSRLIAAFEQKERER 57
chromodomain cd18964
CHROMO (CHRomatin Organization Modifier) domain; uncharacterized subgroup; The chromodomain ...
1431-1479 1.37e-04

CHROMO (CHRomatin Organization Modifier) domain; uncharacterized subgroup; The chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain. Chromodomains belong to the chromo-like superfamily of SH3-fold-beta-barrel domains which includes chromo shadow domains and chromo barrel domains. Chromodomains differ from these in that they lack the first strand of the SH3-fold-beta-barrel. This first strand is altered by insertion in the chromo shadow domains, and chromo barrel domains are typical SH3-fold-beta-barrel domains with sequence similarity to the canonical chromo domain.


Pssm-ID: 349320  Cd Length: 54  Bit Score: 41.16  E-value: 1.37e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2037036700 1431 FEVEALIDKR--SHNGTT--EYKVLWRGYSEEAASWEPVENLN-CPDLIQEYEV 1479
Cdd:cd18964      1 FFVERIIGRRpsARDGPGkfLWLVKWDGYPIEDATWEPPENLGeHAKLIEDFEK 54
CD_POL_like cd18977
chromodomain of a Rhizoctonia solani AG-3 Rhs1AP polyprotein, and similar proteins; This ...
1430-1479 1.97e-04

chromodomain of a Rhizoctonia solani AG-3 Rhs1AP polyprotein, and similar proteins; This subgroup includes the CHROMO (CHRomatin Organization Modifier) domain found in a Rhizoctonia solani AG-3 Rhs1AP, a putative Ty3/Gypsy polyprotein/retrotransposon which includes a protease, a reverse transcriptase, a ribonuclease H, and an integrase domain, in that order, with a chromodomain at the C-terminus of the integrase domain. The chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349333  Cd Length: 57  Bit Score: 40.54  E-value: 1.97e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2037036700 1430 DFEVEALID----KRSHNGTTEYKVLWRGYSEEAASWEPVENL-NCPDLIQEYEV 1479
Cdd:cd18977      3 EYEVEKIVGekwkKRKNRRVKLYKVRFKGYGPEEDEWLTKEELkNAPEILAEWKL 57
CD_CEC-4_like cd18961
chromodomain of Caenorhabditis elegans chromodomain protein 4, and similar proteins; CHRomatin ...
1431-1477 2.57e-04

chromodomain of Caenorhabditis elegans chromodomain protein 4, and similar proteins; CHRomatin Organization Modifier (chromo) domain of Caenorhabditis elegans CEC-4, and similar proteins. CEC-4 is a perinuclear heterochromatin anchor; it mediates the anchoring of H3K9 methylation-bearing chromatin at the nuclear periphery in early to mid-stage embryos. It is necessary for anchoring, but does not affect transcriptional repression. CEC-4 contributes to the efficiency with which muscle differentiation is induced following ectopic expression of the master regulator, HLH-1 (MyoD in mammals). A chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349317  Cd Length: 51  Bit Score: 40.16  E-value: 2.57e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2037036700 1431 FEVEALIDKRSHNGTTEYKVLWRGY---SEEAASWEpvENL-NCPDLIQEY 1477
Cdd:cd18961      1 YEVEKILSHRIVNGKPLYLVMWVGYpgpVENSEMWE--EDLkNCGELLKAY 49
Tra5 COG2801
Transposase InsO and inactivated derivatives [Mobilome: prophages, transposons];
1154-1280 2.93e-04

Transposase InsO and inactivated derivatives [Mobilome: prophages, transposons];


Pssm-ID: 442053 [Multi-domain]  Cd Length: 309  Bit Score: 44.76  E-value: 2.93e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700 1154 ILVIVDRLTKFAI---LAPTHKTVTAKQT---AVLLYGHMVRLFgypdhMVSDRGRQFISGAWKAFAEQMGVKHSLSTAY 1227
Cdd:COG2801    168 LAAVIDLFSREIVgwsVSDSMDAELVVDAlemAIERRGPPKPLI-----LHSDNGSQYTSKAYQELLKKLGITQSMSRPG 242
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700 1228 HPQTDGQTERVNQVIEQ---YLRMYCNYEQndwANlLDTAAFV--YNNT-VHNSIG-VSP 1280
Cdd:COG2801    243 NPQDNAFIESFFGTLKYellYRRRFESLEE---AR-EAIEEYIefYNHErPHSSLGyLTP 298
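
Each hit in this report carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The sketch below shows one way to pull those fields out of such a line. The regular expression and the name parse_summary are assumptions based only on the lines visible in this report, not on a documented NCBI format.

```python
import re

# Pattern inferred from the summary lines in this report (an assumption).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?P<multi>\s*\[Multi-domain\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_summary(line: str) -> dict:
    """Parse one per-hit summary line into typed fields."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("multi") is not None,
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "evalue": float(match.group("evalue")),
    }


# Summary line copied from the COG2801 hit above.
hit = parse_summary(
    "Pssm-ID: 442053 [Multi-domain]  Cd Length: 309  Bit Score: 44.76  E-value: 2.93e-04"
)
print(hit)
```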
CD_POL_like cd18972
chromodomain of a Moniliophthora perniciosa FA553 putative retrotransposon polyprotein, and ...
1431-1477 3.61e-04

chromodomain of a Moniliophthora perniciosa FA553 putative retrotransposon polyprotein, and similar proteins; This subgroup includes the CHROMO (CHRomatin Organization Modifier) domain found in a Moniliophthora perniciosa FA553 putative retroelement polyprotein, which includes domains in the following order: a reverse transcriptase, RNase H, and an integrase; here the chromodomain is found at the C-terminus of the integrase domain. The chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related "chromo shadow" domain.


Pssm-ID: 349328  Cd Length: 50  Bit Score: 39.80  E-value: 3.61e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 2037036700 1431 FEVEALIDKRSHNGTTEYKVLWRGYSEEAASWEPVENL-NCPDLIQEY 1477
Cdd:cd18972      1 YEVEAIVGHKPKKKPRQFLVSWLGYDSSHNEWKQKEELeNARELLQDY 48
Tra8 COG2826
Transposase and inactivated derivatives, IS30 family [Mobilome: prophages, transposons];
1154-1246 3.93e-04

Transposase and inactivated derivatives, IS30 family [Mobilome: prophages, transposons];


Pssm-ID: 442074 [Multi-domain]  Cd Length: 325  Bit Score: 44.49  E-value: 3.93e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700 1154 ILVIVDRLTKFAILA--PTHKTVTAKQTavllyghMVRLF-GYPDHMV----SDRGRQFisGAWKAFAEQMGVKHSLSTA 1226
Cdd:COG2826    189 LLTLVERKSRFVILLklPDKTAESVADA-------LIRLLrKLPAFLRksitTDNGKEF--ADHKEIEAALGIKVYFADP 259
                           90       100
                   ....*....|....*....|
gi 2037036700 1227 YHPQTDGQTERVNQVIEQYL 1246
Cdd:COG2826    260 YSPWQRGTNENTNGLLRQYF 279
Asp_protease_2 pfam13650
Aspartyl protease; This family consists of predicted aspartic proteases, typically from 180 to ...
314-399 5.17e-04

Aspartyl protease; This family consists of predicted aspartic proteases, typically from 180 to 230 amino acids in length, in MEROPS clan AA. This model describes the well-conserved 121-residue C-terminal region. The poorly conserved, variable length N-terminal region usually contains a predicted transmembrane helix.


Pssm-ID: 433378  Cd Length: 90  Bit Score: 40.35  E-value: 5.17e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700  314 LSGMVQDHPARILADTGAGLSIVSDSFISKYQIPTK----PIKTRSIHG-VTGHQLSINSsasmqVSIGTHNLGVVEASV 388
Cdd:pfam13650    1 VPVTINGKPVRFLVDTGASGTVISPSLAERLGLKVRglayTVRVSTAGGrVSAARVRLDS-----LRLGGLTLENVPALV 75
                           90
                   ....*....|..
gi 2037036700  389 ADTADY-DLILG 399
Cdd:pfam13650   76 LDLGDLiDGLLG 87
chromodomain cd18969
CHROMO (CHRomatin Organization Modifier) domain; uncharacterized subgroup; for most members ...
1428-1477 5.84e-04

CHROMO (CHRomatin Organization Modifier) domain; uncharacterized subgroup; for most members of this subgroup, the chromodomain is followed by a chromo shadow domain; The chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain. Chromodomains belong to the chromo-like superfamily of SH3-fold-beta-barrel domains which includes chromo shadow domains and chromo barrel domains. Chromodomains differ from these in that they lack the first strand of the SH3-fold-beta-barrel. This first strand is altered by insertion in the chromo shadow domains, and chromo barrel domains are typical SH3-fold-beta-barrel domains with sequence similarity to the canonical chromo domain. For the majority of members of this subgroup, the chromodomain is followed by a chromo shadow domain (CSD).


Pssm-ID: 349325  Cd Length: 56  Bit Score: 39.43  E-value: 5.84e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2037036700 1428 DLDFEVEALID---KRSHNGTTEYKVLWRGYSEEAASWEPVENL-NCPDLIQEY 1477
Cdd:cd18969      1 EEEYEIEEILDvkkGGFEDGKLAYFVKWKGYPSSENSWVTEEDAaNAQEMIEEY 54
transpos_IS30 NF033563
IS30 family transposase;
1149-1246 1.27e-03

IS30 family transposase;


Pssm-ID: 468088 [Multi-domain]  Cd Length: 267  Bit Score: 42.59  E-value: 1.27e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2037036700 1149 KTYDSILVIVDRLTKFAILA--PTHKTVTAKQTAVLLyghmvrLFGYPDHMV----SDRGRQFisGAWKAFAEQMGVKHS 1222
Cdd:NF033563   145 KHKSALLTLVERKSRFVILVklPDKTAESVNKALIKL------LKPLPKHLRksitADNGKEF--ARHSEIEEALGIDVY 216
                           90       100
                   ....*....|....*....|....
gi 2037036700 1223 LSTAYHPQTDGQTERVNQVIEQYL 1246
Cdd:NF033563   217 FADPYSPWQRGTNENTNGLLRQYL 240
CD2_tandem cd18659
repeat 2 of paired tandem chromodomains; Repeat 2 of tandem CHRomatin Organization Modifier ...
1432-1478 2.56e-03

repeat 2 of paired tandem chromodomains; Repeat 2 of tandem CHRomatin Organization Modifier (chromo) domains, found in CHD (chromodomain helicase DNA-binding) proteins such as mammalian helicase DNA-binding proteins CHD1 to CHD9, and yeast protein CHD1. The CHD proteins belong to the SNF2 superfamily of ATP-dependent chromatin remodelers and contain two signature motifs: a pair of chromodomains located in the N-terminal region, and the SNF2-like ATPase domain located in the central region of the protein. CHD chromatin remodelers are important regulators of transcription and play critical roles during developmental processes. The N-terminal chromodomains of CHD1 have been shown to guard against sliding hexasomes. Mutations in the chromodomains of mouse CHD1 result in nuclear redistribution, suggesting that the chromodomain is essential for proper association with chromatin; also, deletion of the chromodomains in the Drosophila melanogaster CHD3-4 homolog impaired nucleosome binding, mobilization, and ATPase functions. A chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349306 [Multi-domain]  Cd Length: 54  Bit Score: 37.56  E-value: 2.56e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2037036700 1432 EVEALIDKRSH-NGTTEYKVLWRG--YSEeaASWEPVENLN--CPDLIQEYE 1478
Cdd:cd18659      4 IVERIIAHREDdEGVTEYLVKWKGlpYDE--CTWESEEDISdiFQEAIDEYK 53
CD_Chp1_like cd18636
chromodomain of chromodomain-containing protein 1, and similar proteins; CHRomatin ...
1431-1468 4.39e-03

chromodomain of chromodomain-containing protein 1, and similar proteins; CHRomatin Organization Modifier (chromo) domain of chromodomain-containing protein 1 (Chp1), and similar proteins. Chp1 is needed for RNA interference-dependent heterochromatin formation in fission yeast. Chp1 is a member of the RNA-induced transcriptional silencing (RITS) complex which maintains the heterochromatin regions. The chromodomain of the Chp1 component binds the histone H3 lysine 9 methylated tail (H3K9me) and the core of the nucleosome. A chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349286  Cd Length: 52  Bit Score: 36.66  E-value: 4.39e-03
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 2037036700 1431 FEVEALI-DKRSHNGTTEYKVLWRGYSEEAASWEPVENL 1468
Cdd:cd18636      2 YEVEDILaDRVNKNGINEYYIKWAGYDWYDNTWEPEQNL 40
CD_POL_like cd18971
chromodomain of a Magnaporthe grisea putative retrotransposon polyprotein, and similar ...
1431-1478 5.08e-03

chromodomain of a Magnaporthe grisea putative retrotransposon polyprotein, and similar proteins; This subgroup includes the CHROMO (CHRomatin Organization Modifier) domain found in a Magnaporthe grisea putative retrotransposon polyprotein which includes domains in the following order: an aspartyl protease, a reverse transcriptase, RNase H, and an integrase; here the chromodomain is found at the C-terminus of the integrase domain. The chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349327  Cd Length: 50  Bit Score: 36.60  E-value: 5.08e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2037036700 1431 FEVEALIDKRSHNGT---TEYKVLWRGYSEEaaSWEPVENLNCPDLIQEYE 1478
Cdd:cd18971      1 YEVEEILAARRRRIRgkgREVLVKWVGYAEP--TWEPLDNLADTAALDRFE 49
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
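
To illustrate how the E-value threshold of 0.01 in the search parameters relates to the hits listed in this part of the report, the sketch below filters the accessions and E-values shown above by a cutoff. The helper name filter_hits is hypothetical, and treating the threshold as inclusive is an assumption; the report does not say whether the comparison is strict.

```python
# Accession and E-value pairs copied from the hits listed above.
HITS = [
    ("cd18977", 1.97e-04),
    ("cd18961", 2.57e-04),
    ("COG2801", 2.93e-04),
    ("cd18972", 3.61e-04),
    ("COG2826", 3.93e-04),
    ("pfam13650", 5.17e-04),
    ("cd18969", 5.84e-04),
    ("NF033563", 1.27e-03),
    ("cd18659", 2.56e-03),
    ("cd18636", 4.39e-03),
    ("cd18971", 5.08e-03),
]


def filter_hits(hits, threshold):
    """Keep hits whose E-value is at or below the threshold (assumed inclusive)."""
    return [(acc, ev) for acc, ev in hits if ev <= threshold]


reported = filter_hits(HITS, 0.01)   # the threshold used for this report
stricter = filter_hits(HITS, 1e-3)   # a stricter cutoff, for comparison

print(len(reported), "hits at 0.01;", len(stricter), "hits at 1e-3")
```

A stricter cutoff of this kind is one way to set aside the weakest chromodomain and transposase matches before interpreting the domain architecture.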
