Protein

Protein accession
7rkg4 [EnVhog]
Representative
72gmg
Source
EnVhog (cluster: phalp2_32215)
Protein name
7rkg4
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKYDSRNRKNLNNLGDKTKVAAYKWYQYCLNNGIEVLIYETIRTVETQREYVNKGASQTMKSYHIVGQALDFVPIKSDGLEDWNGYKKEPWASAIRYAKQIGFEWGGDWKGFVDSPHLQFNLSGYGTDTFHNKGQAPSGGTPSTPAPNPTPPVDSSGSIYGVATVTADVLNVRSAPSTSATIVGKVRRNEPYKVWAIQDGWYCVGGNQWISGDYVSYAPPVGVITGDVWLHSTPDFNDSSRVRVLKAGEPYVVWGEANGMYAIGGYVSKKYMKLT
Physico-chemical properties
protein length: 275 AA
molecular weight: 30478.8 Da
isoelectric point: 8.80
hydropathy: -0.49
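The page does not say how the hydropathy value is computed. A minimal sketch, assuming it is the GRAVY score (mean Kyte-Doolittle hydropathy over all residues): the function name `gravy` and the truncated example sequence are illustrative only and are not part of the record.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence.

    Assumption: the page's "hydropathy" is this GRAVY score.
    Raises KeyError on non-standard residue letters.
    """
    if not sequence:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


# Illustration only: the first 20 residues of the protein sequence above.
# Substitute the full sequence to compare against the listed values.
seq = "MKYDSRNRKNLNNLGDKTKV"
print(len(seq), round(gravy(seq), 2))
```

A negative score indicates a protein that is hydrophilic on average, which is what the listed value suggests under this assumption.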
Representative Protein Details
Accession
72gmg
Protein name
72gmg
Sequence length
282 AA
Molecular weight
31609.68850 Da
Isoelectric point
9.23904
Sequence
MKYHERNVRNLNQLADNTKAAAFKWYQYCVDNGIDVLIYETIRTKEKQREYVNKGASQTMKSYHIVGQALDFVPIKSNGTEDWNGYNKEPWASAIRYAKQIGFEWGGDWKGFVDSPHLQYNYKGYGTDTFGKGAQNVPSPPPVSNDSARIAYINGNNVNLRKGPGTGYGVIRQLGKGEAYQVFAESNGWLNLGGDQWVYNDPSYIRYTGKSTPVTPQPSNDGVGVVTITADVLRVRTGPGTNYGIVKNVYQGEKYQSFGNKNGWYNVGGNQWVSGEYVNFKK
Other Proteins in cluster: phalp2_32215
Total (incl. this protein): 11    Avg length: 266.1    Avg pI: 7.84

Protein ID    Length (AA)    pI
72gmg         282            9.23904
4MHyD         213            9.40344
78iqY         283            4.81053
7KBNL         279            8.99497
7ouZR         281            7.56387
7rpPy         279            8.86423
8IE7S         213            6.10840
e6uU          221            8.92354
A0A4Y5NXW3    291            6.95552
A0A9E7PJ93    310            6.53480
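The table lists ten proteins, while the total is given as 11 including this protein. A short sketch of that arithmetic, assuming the averages are plain arithmetic means over all 11 members and taking this protein's length and pI (275 AA, 8.80) from its physico-chemical properties; the variable names are illustrative.

```python
# (protein ID, length in AA, pI) for the ten other cluster members in the table.
others = [
    ("72gmg", 282, 9.23904),
    ("4MHyD", 213, 9.40344),
    ("78iqY", 283, 4.81053),
    ("7KBNL", 279, 8.99497),
    ("7ouZR", 281, 7.56387),
    ("7rpPy", 279, 8.86423),
    ("8IE7S", 213, 6.10840),
    ("e6uU", 221, 8.92354),
    ("A0A4Y5NXW3", 291, 6.95552),
    ("A0A9E7PJ93", 310, 6.53480),
]

# This protein (7rkg4), which is counted in the total but not listed in the table.
this_protein = ("7rkg4", 275, 8.80)

members = others + [this_protein]
avg_len = sum(length for _, length, _ in members) / len(members)
avg_pi = sum(pi for _, _, pi in members) / len(members)

print(len(members), round(avg_len, 1), round(avg_pi, 2))  # prints: 11 266.1 7.84
```

Under these assumptions the result reproduces the summary line, which supports reading the averages as covering all 11 members rather than only the ten listed rows.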
Similar Clusters (pHMM search)
#    Cluster                 # Members    Identity (%)    Alignment Length    E-value
1    phalp2_1844 (3xTtn)     8            25.6            242                 1.280E-15
2    phalp2_33303 (6crLy)    183          25.2            214                 3.076E-10

Domains

Representative sequence (used for alignment): 72gmg (282 AA)
Member sequence: 7rkg4 (275 AA)
Axis: residues 1 to 282 (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF08239, PF13539

Taxonomy

         Name                       Taxonomy ID       Lineage
Phage    Unknown from Metagenome    UNKNOWN_ENVHOG    No lineage information
Host     No host information

Coding sequence (CDS)


No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (72gmg) rather than this protein.
PDB ID
72gmg
Method AlphaFoldv2
Resolution 83.61
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
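The confidence bands above can be sketched as a small lookup. The legend does not say which band the exact boundary values 90, 70 and 50 belong to; assigning them to the lower band is an assumption made here, and the function name `plddt_band` is hypothetical.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band named in the legend.

    Assumption: scores exactly equal to 90, 70 or 50 fall into the lower band,
    because the legend uses strict inequalities and leaves these cases open.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


# Illustrative scores, not taken from the record.
for score in (95.0, 80.0, 60.0, 30.0):
    print(score, plddt_band(score))
```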