Protein

Protein accession
5fT1M [EnVhog]
Representative
4n5QN
Source
EnVhog (cluster: phalp2_21839)
Protein name
5fT1M
Lysin probability
90%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MDETRIDHRRTAAGCVAVIALLGAAFALGRCSAPAAAQETQPTPEPVVSLGYDNTIAAHAAIAELLTTLTTTTTAVPQYRNTPAPPTSSDASRWDQLAQCEAGGNWSANTGNGFGGGLQFMHQRSYSTWLSFGGGEFAPHPWDASREQQIDIAERVLASSGWRAWPGCARKNGWL
Physico‐chemical
properties
protein length:175 AA
molecular weight:18615,4 Da
isoelectric point:5,75
hydropathy:-0,29
Representative Protein Details
Accession
4n5QN
Protein name
4n5QN
Sequence length
177 AA
Molecular weight
19261,33850 Da
Isoelectric point
5,66363
Sequence
VGKKIMTRDRILAATLTALFLVAAFFFFDAIALSASAGGDSVPVETTTTTLQVVETTTTTIDLSFLNTTTTTQTPKPVPQYRNTEPVTVETTTDRWDQLAGCESGGNWATNTGNGFGGGLQFMHQRSYSTWLSYGGGEFAPHPWEASREQQIVIAERVLAGSGWKAWPGCSRRFGWL
Other Proteins in cluster: phalp2_21839
Total (incl. this protein): 23 Avg length: 178,7 Avg pI: 5,68

Protein ID Length (AA) pI
4n5QN 177 5,66363
1JKON 170 7,64936
2RYXX 171 5,05346
37Edp 151 4,45631
4YqCB 177 5,66363
4acHY 184 5,50710
4oaoq 178 6,17291
4scfL 184 5,92270
4schg 176 5,26672
5BMXo 178 5,16282
5fWr2 176 5,61532
5n5B3 184 5,47350
5xYpN 184 5,27178
6LKpB 170 7,69307
8mU1D 177 4,52986
8sGe8 177 4,93876
CbKI 173 6,81979
DyYk 173 5,21898
Ujkw 196 5,12332
jUrT 213 6,64632
xfLQ 182 6,72021
A0A6J7WNG6 183 4,43670
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_21660
31bly
46 37,6% 109 1.573E-10
2 phalp2_37599
4700Z
2 33,0% 130 4.563E-09

Domains

Domains
Disordered region
Transgly
Representative sequence (used for alignment): 4n5QN (177 AA)
Member sequence: 5fT1M (175 AA)
1 177 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF06737

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4n5QN) rather than this protein.
PDB ID
4n5QN
Method AlphaFoldv2
Resolution 71.59
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50