Protein

Protein accession
4kB8Y [EnVhog]
Representative
3hPAn
Source
EnVhog (cluster: phalp2_9997)
Protein name
4kB8Y
Lysin probability
87%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MTEILYRKVELPDGTIQFVPAPERPADHRLAVPWVGQNTTRSDDDYTRSDCGAAVVTGWLHYRGKTSVSVDDVSRATGKPPNYPYTVFADLDKAANAFGLDLVHQFGTLILPLVEYEIDIARPVIALVHYPSLPIKFDPTYQSSHWILITGYGPDLYYYNDPYWPTAERGADILISASQLTDALRNVNLNGNTPFQGATQK
Physico‐chemical
properties
protein length:201 AA
molecular weight:22416,9 Da
isoelectric point:5,09
hydropathy:-0,28
Representative Protein Details
Accession
3hPAn
Protein name
3hPAn
Sequence length
212 AA
Molecular weight
22783,56850 Da
Isoelectric point
5,55223
Sequence
MTDETVLDEVYAWANAVKADAQTLIVKIDAARATPPTKRILSFPWLGQNTPAPTDDFSGNDCGPAALAMWLNGLGQSLTVDDVSRATGLAANYASTAYWDLSKAARAWGITLSRKAGLTIDALKAEIDKGTPLIVLVHYGSLPQRATSFEKGHWILVVGYDADNIYYHDPLWVDSRGANIACPYAAFAKAMADCAIDKNTPNQGLVRPLSGK
Other Proteins in cluster: phalp2_9997
Total (incl. this protein): 7 Avg length: 204,9 Avg pI: 5,71

Protein ID Length (AA) pI
3hPAn 212 5,55223
2DaCG 215 6,05656
2wTVl 207 6,58391
4PysF 201 5,89633
4RlOk 199 5,25217
4T43K 199 5,52278
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_5263
87L2J
7 36,8% 228 1.207E-42
2 phalp2_26558
87KwA
10 33,1% 172 1.638E-28
3 phalp2_28609
8i3l6
4 31,4% 159 5.700E-28
4 phalp2_28536
40oDG
10 33,3% 174 6.445E-24
5 phalp2_13431
4ByY8
1 28,8% 163 4.331E-21
6 phalp2_32894
3niS6
1 27,9% 168 2.086E-18
7 phalp2_8836
2ApLb
1 28,9% 138 3.331E-17
8 phalp2_23162
4JP44
1 26,0% 184 5.955E-13
9 phalp2_16156
4uGeP
55 25,6% 164 6.764E-12
10 phalp2_6277
6MCzG
1588 26,5% 158 1.393E-10

Domains

Domains
Representative sequence (used for alignment): 3hPAn (212 AA)
Member sequence: 4kB8Y (201 AA)
1 212 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF13529

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (3hPAn) rather than this protein.
PDB ID
3hPAn
Method AlphaFoldv2
Resolution 92.71
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50