Protein

Protein accession: CVym [EnVhog]
Representative: 1GQff
Source: EnVhog (cluster: phalp2_21310)
Protein name: CVym
Lysin probability: 99%
PhaLP type: endolysin (probability: 99%, predicted by ML model)
Protein sequence
LPVKTYNLTISGEAKISQNFKVKEFRSYSSTYNKLYSNEVKISTELVAMLEKLRTFLGGKIVIINGYRSPAHNKAVGGSSNSTHLKGYAADICCYDASGKVISAKKVCCVAEDLGFSGIGYMRSNHVHVDMSPSRIWRGDETKKENGTYYSLTRHGLSFYKYFNIKEKYQPYKKPVVTKPQIDYSVPSDWAADIWKEATNKGWTDGTNPRKDVTREQAITLIYRVIFNKPNETIENSVNICKEKKYTDGTNMKNYATREQVVALIYRMYKNNPDATLEECWTWGIESKITDGTVPKNNCTREQVIVMLDRVINLSK
Physico-chemical properties
protein length: 316 AA
molecular weight: 35996.6 Da
isoelectric point: 9.28
hydropathy: -0.57
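The page does not say how the hydropathy value is derived. A minimal sketch, assuming it is a GRAVY score (the mean Kyte-Doolittle hydropathy over all residues), is shown below; the function name `gravy` is a hypothetical helper, not something the database exposes.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence.

    Assumption: this is how the page's "hydropathy" value is defined.
    Non-standard residue codes raise KeyError rather than being skipped.
    """
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # Short illustrative fragment (the first ten residues of the sequence above).
    print(round(gravy("LPVKTYNLTI"), 2))
```

Running `gravy` on the full 316 AA sequence and comparing the result with the listed value is a quick way to check whether the GRAVY assumption holds for this record.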
Representative Protein Details
Accession: 1GQff
Protein name: 1GQff
Sequence length: 247 AA
Molecular weight: 27634.82050 Da
Isoelectric point: 8.31283
Sequence
MAIKTVYVSKLGRDYKISPHFKLREFQSKDGADKVLYSEELLAKLEELRSYGGFIITVNSGYRSPSHNKKVGGAANSSHTRGLAADIKARKEKDGQYVSAKLLCALCQTLGFDGVAYINANSVHVDMAGRNYRGDERKGYGNNVGGDFYKYFGITKAQVEALRVVPAQEEEEEEEMTQEQFNMMMEKWIAQQAAKEPGEWSKESRAWAEANKIMNGTGAGNSYGAPATREQVIEFIYRYAKQVDPIK
Other Proteins in cluster: phalp2_21310
Total (incl. this protein): 14; Avg length: 254.3 AA; Avg pI: 8.73

Protein ID  Length (AA)  pI
1GQff       247          8.31283
14GXr       263          8.60764
1gCrO       241          8.75437
3eoqR       218          5.43190
3iWVd       247          9.34348
3nHso       226          9.62495
4k9uq       246          9.22628
60sKZ       266          9.32408
6cW1y       271          9.06743
6vYT        263          7.55569
71F0k       217          9.98185
8kVxB       269          8.87435
Hqik        270          8.82297
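The cluster summary (14 members, average length and average pI) can be reproduced from the 13 rows listed above plus this protein. A minimal sketch, assuming the averages are plain arithmetic means and using the rounded pI of 9.28 reported for CVym; the names `MEMBERS` and `cluster_summary` are illustrative only.

```python
# (protein ID, length in AA, isoelectric point) for each cluster member.
# The first entry is this protein (CVym); the rest come from the table.
MEMBERS = [
    ("CVym", 316, 9.28),
    ("1GQff", 247, 8.31283),
    ("14GXr", 263, 8.60764),
    ("1gCrO", 241, 8.75437),
    ("3eoqR", 218, 5.43190),
    ("3iWVd", 247, 9.34348),
    ("3nHso", 226, 9.62495),
    ("4k9uq", 246, 9.22628),
    ("60sKZ", 266, 9.32408),
    ("6cW1y", 271, 9.06743),
    ("6vYT", 263, 7.55569),
    ("71F0k", 217, 9.98185),
    ("8kVxB", 269, 8.87435),
    ("Hqik", 270, 8.82297),
]


def cluster_summary(members):
    """Return (member count, mean length, mean pI) for a list of members."""
    count = len(members)
    mean_length = sum(length for _, length, _ in members) / count
    mean_pi = sum(pi for _, _, pi in members) / count
    return count, mean_length, mean_pi


if __name__ == "__main__":
    count, mean_length, mean_pi = cluster_summary(MEMBERS)
    print(f"Total: {count}  Avg length: {mean_length:.1f}  Avg pI: {mean_pi:.2f}")
```

With these inputs the means round to the figures given in the summary line, which suggests the totals do include the query protein alongside the 13 listed members.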
Similar Clusters (pHMM search)
#  Cluster               # Members  Identity (%)  Alignment Length  E-value
1  phalp2_24303 (3o50S)  84         42.1%         166               1.343E-48
2  phalp2_28984 (6dKhV)  2          32.9%         188               7.981E-39
3  phalp2_30939 (1345B)  93         37.5%         176               2.182E-36
4  phalp2_31110 (21jrV)  1          30.4%         187               1.505E-26

Domains

Domains shown: PET_M15, Unannotated
Representative sequence (used for alignment): 1GQff (247 AA)
Member sequence: CVym (316 AA)
Axis: 1 to 247 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF08291

Taxonomy

       Name                     Taxonomy ID            Lineage
Phage  Unknown from Metagenome  [NCBI] UNKNOWN_ENVHOG  No lineage information
Host   No host information

Coding sequence (CDS)


No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID: CVym
Method: AlphaFoldv2
Resolution: 93.84
Chain position: -
Model Confidence:
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
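The confidence bands above amount to a simple threshold lookup. A minimal sketch, assuming that the number reported under "Resolution" for these AlphaFold models is a mean pLDDT score (the page does not state this) and that a value sitting exactly on a boundary belongs to the higher band; `plddt_band` is a hypothetical helper name.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence label used in the legend.

    Assumption: boundary values (90, 70, 50) are assigned to the higher band,
    since the legend itself leaves them unspecified.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt >= 90:
        return "Very high"
    if plddt >= 70:
        return "High"
    if plddt >= 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # Values listed on this page for CVym and for the representative 1GQff.
    for name, score in [("CVym", 93.84), ("1GQff", 88.87)]:
        print(name, plddt_band(score))
```

Under that reading, the CVym model falls in the "Very high" band and the 1GQff model in the "High" band.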

The structures below correspond to the cluster representative (1GQff) rather than this protein.
PDB ID: 1GQff
Method: AlphaFoldv2
Resolution: 88.87
Chain position: -