Protein

Protein accession
723FW [EnVhog]
Representative
6D0Fq
Source
EnVhog (cluster: phalp2_5972)
Protein name
723FW
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MLAWSEGTDNGRQKTRNHGYDVIVGGELFTDYSDHPRKLVTLNPKLKSTAAGRYQLLSRWWDAYRKQLGLKDFSPKSQDAVALQQLLPNQGM
Physico-chemical properties
protein length: 92 AA
molecular weight: 10478,7 Da
isoelectric point: 9,57
hydropathy: -0,75
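The length and hydropathy values above can be checked directly from the sequence. The page does not say how hydropathy is computed, so the following is a minimal sketch that assumes it is the GRAVY score (mean Kyte-Doolittle hydropathy over all residues). The names `KYTE_DOOLITTLE`, `gravy`, and `SEQ_723FW` are illustrative and not part of the database.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence."""
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


# Sequence of 723FW as listed on this page.
SEQ_723FW = (
    "MLAWSEGTDNGRQKTRNHGYDVIVGGELFTDYSDHPRKLVTLNPKLKSTAAGRYQLLSRWW"
    "DAYRKQLGLKDFSPKSQDAVALQQLLPNQGM"
)

length = len(SEQ_723FW)
hydropathy = gravy(SEQ_723FW)
print(length, round(hydropathy, 2))
```

Under this assumption the sketch gives a length of 92 AA and a hydropathy of -0.75, in line with the listed values. Molecular weight and isoelectric point depend on the mass table and pKa set used, which the page does not specify, so they are left out here.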
Representative Protein Details
Accession
6D0Fq
Protein name
6D0Fq
Sequence length
83 AA
Molecular weight
8865,99730 Da
Isoelectric point
5,39313
Sequence
MARIDAGTAGGQNRIAFLDMIAVSEIGAALLAESDDGYNVLVGSTPAAPLLFKSYADHPNVLNAALHSTAAGRYQILYRWWCI
Other Proteins in cluster: phalp2_5972
Total (incl. this protein): 29; Avg length: 80,6; Avg pI: 7,97

Protein ID Length (AA) pI
6D0Fq 83 5,39313
1cgPH 73 9,98772
1f1fN 100 9,60013
1kkmF 63 4,90932
1lUDM 49 4,90932
3ObbP 96 9,62644
3rT1y 82 9,69206
4OUin 106 5,78242
5Q7Am 78 5,46600
736z9 88 9,56951
7AWw9 108 9,25761
7Sf0E 106 10,06392
7XDCW 71 4,77649
7YLrE 114 9,64210
7cCxX 96 9,56951
7eO5i 60 8,13967
7enGu 41 5,49823
7kjFd 82 8,97891
7kvHA 57 5,99836
7mBu4 85 9,62650
7mBvC 85 9,20507
7oEZa 90 9,62650
7u1U4 90 9,39467
7vmgp 76 9,03391
7vtDX 90 10,15005
86qhX 54 5,44651
8jDjV 70 6,70566
8tSFj 52 5,59292
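The cluster summary line can be reproduced from the rows above. The table lists 28 proteins, so this sketch assumes that the 29th member is 723FW itself (92 AA, pI 9,57), which the "incl. this protein" note suggests but does not state outright. It also converts the decimal commas used on this page. `ROWS`, `THIS_PROTEIN`, and `parse_row` are hypothetical names used only for illustration.

```python
from statistics import mean

# Rows copied from the cluster table: protein ID, length (AA), pI.
ROWS = """\
6D0Fq 83 5,39313
1cgPH 73 9,98772
1f1fN 100 9,60013
1kkmF 63 4,90932
1lUDM 49 4,90932
3ObbP 96 9,62644
3rT1y 82 9,69206
4OUin 106 5,78242
5Q7Am 78 5,46600
736z9 88 9,56951
7AWw9 108 9,25761
7Sf0E 106 10,06392
7XDCW 71 4,77649
7YLrE 114 9,64210
7cCxX 96 9,56951
7eO5i 60 8,13967
7enGu 41 5,49823
7kjFd 82 8,97891
7kvHA 57 5,99836
7mBu4 85 9,62650
7mBvC 85 9,20507
7oEZa 90 9,62650
7u1U4 90 9,39467
7vmgp 76 9,03391
7vtDX 90 10,15005
86qhX 54 5,44651
8jDjV 70 6,70566
8tSFj 52 5,59292
"""

# Assumption: the protein described on this page is the member not listed above.
THIS_PROTEIN = ("723FW", 92, 9.57)


def parse_row(line: str) -> tuple[str, int, float]:
    """Split one table row and convert the decimal comma in the pI value."""
    protein_id, length, pi = line.split()
    return protein_id, int(length), float(pi.replace(",", "."))


members = [parse_row(line) for line in ROWS.splitlines() if line.strip()]
members.append(THIS_PROTEIN)

total = len(members)
avg_length = mean(m[1] for m in members)
avg_pi = mean(m[2] for m in members)
print(total, round(avg_length, 1), round(avg_pi, 2))
```

With 723FW included, the sketch yields 29 members, an average length of 80.6 AA, and an average pI of 7.97, consistent with the summary line.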
Similar Clusters (pHMM search)
#  Cluster               # Members  Identity (%)  Alignment Length  E-value
1  phalp2_23584 (34ms)   2          50,9%         51                1.366E-17
2  phalp2_4055 (1eWXQ)   1          42,6%         75                1.732E-16
3  phalp2_19758 (7FSJU)  1          40,0%         75                1.260E-12
4  phalp2_2115 (3fIb8)   3          29,7%         94                5.754E-07
5  phalp2_19461 (34JzZ)  1          35,2%         68                1.494E-06
6  phalp2_15867 (4uLvo)  1          31,0%         74                4.564E-04

Domains

Unannotated
Representative sequence (used for alignment): 6D0Fq (83 AA)
Member sequence: 723FW (92 AA)
Axis: positions 1 to 83 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

       Name                            Taxonomy ID     Lineage
Phage  Unknown from Metagenome [NCBI]  UNKNOWN_ENVHOG  No lineage information
Host   No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (6D0Fq) rather than this protein.
PDB ID 6D0Fq
Method AlphaFoldv2
Resolution 97.42
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50