Protein

Protein accession
7KCGA [EnVhog]
Representative
7CDJC
Source
EnVhog (cluster: phalp2_40590)
Protein name
7KCGA
Lysin probability
99%
PhaLP type
endolysin
Probability: 91% (predicted by ML model)
Protein sequence
MRNFFLSFIITISFFLMAFKAHEVILEWQTPIEIPQPKAIKVIMPKLEVPKVEINLKDHTSFLRNIGNFESGNRYEIVNRWGYMGRYQFHLSTLESIGIKTTKKKFLSSPTLQEEAMTRLLKSNKRTLRRYIRKYNNTILHGVYVTESGVLAAAHLGGAGNVIEWFRKGEVFKDGNGTPITRYMKVFSGYELNLE
Physico‐chemical
properties
protein length:195 AA
molecular weight:22612,1 Da
isoelectric point:9,85
hydropathy:-0,22
Representative Protein Details
Accession
7CDJC
Protein name
7CDJC
Sequence length
155 AA
Molecular weight
18152,97290 Da
Isoelectric point
9,62753
Sequence
MKTLLRITMPLLAVMLMAFTYNYVKEIATPIKVERKIIPVIEPVEITAEIPTIKLYNHQDFLDDLGWYESSNRYNITNRWGYMGRYQFHISTLQSLGINTTKKEFLSNPDLQEEAMDRLLKNNYRTLRRFIRKYEGTKRHGVLVTKSGVLAAAHL
Other Proteins in cluster: phalp2_40590
Total (incl. this protein): 15 Avg length: 188,1 Avg pI: 9,59

Protein ID Length (AA) pI
7CDJC 155 9,62753
1PrDd 193 9,99345
2FMC7 196 9,84305
2QGiP 194 9,96464
2Ta9s 195 9,81629
2TbFy 191 9,25187
2ew4h 195 9,99229
3xka0 197 9,83931
7IWCa 197 9,38590
7LrZN 193 9,96264
82kUh 193 9,31583
8dssC 193 8,47754
8fSI0 197 9,62283
8tghJ 138 8,88634
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_33734
X5tL
570 51,8% 106 1.128E-46
2 phalp2_8701
8a8Xi
2 40,3% 114 1.823E-25
3 phalp2_37648
4hMfu
36 32,0% 131 1.195E-18
4 phalp2_8134
6LgxQ
11 33,0% 112 2.833E-12
5 phalp2_34723
37lMR
9 28,9% 169 9.824E-12
6 phalp2_7798
7szmD
9 24,6% 142 9.027E-07

Domains

Domains
Disordered region
Unannotated
Representative sequence (used for alignment): 7CDJC (155 AA)
Member sequence: 7KCGA (195 AA)
1 155 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (7CDJC) rather than this protein.
PDB ID
7CDJC
Method AlphaFoldv2
Resolution 88.62
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50