Protein

Protein accession
6XrSZ [EnVhog]
Representative
4Aahq
Source
EnVhog (cluster: phalp2_4655)
Protein name
6XrSZ
Lysin probability
99%
PhaLP type
endolysin
Probability: 96% (predicted by ML model)
Protein sequence
VKKLITLALCTGVFIISIWSYNSNVQSAKITSYETQKEIDALKFEKGKQLLKSIQEEKRKEVINNKIIQAQIQVQKEIPLYDIPLNIDIQKYIYKISKEKNIPHELILGVIKTESDFRPNQTHKNKNGSIDIGLMQVNSCHVDICKELGITNLYDPYQNIKIGATLLANIYNSYPNIHKAMMVYNMGLGGARRNWKVGRSSTDYSRKVVNNINIILACKQE
Physico‐chemical
properties
protein length:221 AA
molecular weight:25272,1 Da
isoelectric point:9,39
hydropathy:-0,33
Representative Protein Details
Accession
4Aahq
Protein name
4Aahq
Sequence length
176 AA
Molecular weight
20352,53960 Da
Isoelectric point
8,54395
Sequence
MKYLFLISLLLLSITAIIVSIENKKLQDIIKYDYYLQDSKRVKKLIEFVDNKQILETVLNICQIENVEPELVLSVIKVESNFKIYATGKNKNGTIDRGLMQINSSNIDCDTVYTPEKNIIEGVKILKWCLDKADNDVILALSYYNAGYGKVKQYKTGESTYKYISKIIKEYEELKK
Other Proteins in cluster: phalp2_4655
Total (incl. this protein): 2 Avg length: 198,5 Avg pI: 8,97

Protein ID Length (AA) pI
4Aahq 176 8,54395
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_664
1m3Kq
134 34,3% 131 8.193E-23
2 phalp2_600
139Z6
763 34,3% 128 2.862E-22
3 phalp2_36276
81fJ
27 28,0% 164 3.331E-18
4 phalp2_4832
5l1Nn
14 33,9% 159 3.539E-16
5 phalp2_16046
3Q4hp
22 26,9% 141 5.780E-15
6 phalp2_28379
1lDNy
2 32,5% 132 1.074E-14
7 phalp2_31447
3o77A
181 28,1% 153 1.465E-14
8 phalp2_712
1GxsY
3 29,6% 162 1.997E-14
9 phalp2_21911
4JxwN
4 31,3% 134 3.234E-13
10 phalp2_3949
7gIqH
9 32,1% 143 5.999E-13

Domains

Domains
Representative sequence (used for alignment): 4Aahq (176 AA)
Member sequence: 6XrSZ (221 AA)
1 176 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01464

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4Aahq) rather than this protein.
PDB ID
4Aahq
Method AlphaFoldv2
Resolution 91.22
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50