Protein

Protein accession
Slzc [EnVhog]
Representative
30Fm6
Source
EnVhog (cluster: phalp2_14220)
Protein name
Slzc
Lysin probability
99%
PhaLP type
VAL
Probability: 90% (predicted by ML model)
Protein sequence
VARSRSSIFDNVDPALVRLIQSMPQFGKYGVRATSGYRAGDTRYHGKGGIGRALDVELFDPASGTALANYQNPENFAAYQQLANALYQQALKTDPALAQKLRWGGYFSGEPGKYGALDLMHFDVAGDETGMAGGSWAGGLTPEQAKIWGLTAGGGIGGDMTGGAAGAQQLGPAMPKSVTNFSPEQRRNAIASIESAGSGDYGALGPMTGGDPSSRDRAYGRYQVMGRNVPVWTKQVLGRAMTPQEFLKDPKAQDAVFDKIFGEYVQKHGEEGAASMWFTGRPDAFDAKDVLGTSGGSYIKKYMNALTGQAPGGDTRTYPDQSPATAVAGAGPSPDQGGKEKDFGLGDLLESAGGIFGAGKGGQQRSTGMMRTLMPTQPITTAGIQSSTAMTPEAANARLQLALQRLNTGKLWG
Physico-chemical properties
Protein length: 413 AA
Molecular weight: 43271,9 Da
Isoelectric point: 8,52
Hydropathy: -0,44
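The record does not say how these values were derived. As a rough sketch, assuming the hydropathy figure is a GRAVY score (the mean Kyte-Doolittle hydropathy per residue), length and hydropathy could be recomputed from a sequence string as shown here. The function name `gravy` and the short demo peptide are illustrative and not part of the database.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy per residue (GRAVY).

    Assumes the sequence contains only the 20 standard residues;
    any other character raises KeyError.
    """
    seq = seq.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(KD[aa] for aa in seq) / len(seq)


# First ten residues of the Slzc sequence, used only as a short demo input.
demo = "VARSRSSIFD"
print(len(demo), round(gravy(demo), 2))
```

Passing the full protein sequence instead of `demo` gives a value that can be compared with the hydropathy listed above, provided the GRAVY assumption holds.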
Representative Protein Details
Accession
30Fm6
Protein name
30Fm6
Sequence length
419 AA
Molecular weight
44332,78160 Da
Isoelectric point
6,15012
Sequence
MASRSALFDNVDPRLARLIESFKYDKYGVTPTSGYRPGDKRQHGLRNAMDVQLNDLKTGAGLANYQDPTTFGAYQEYANALYRHALQTDPELAKQLRWGGYFSGGKGKYGALDLMHFDIAGDKIPMGGGSWEGGLNPEQAKIWNLQAGGGVGGAGGDMQGPTAGTQVQQFTPEQRRNAIASIESAGSGDYGALGVWTGDPESGRDRAYGRYQIMGKNIPVWSKEVLGRAITPQEFMADPKLQDAIFDKKFGDYVAKHGEGGAAQAWLGGEGSIGKTDRQDALGTTIGSYANKYLTALGKPAGDTSNPGAETYQPGGSGAPTDPTVTSGVDATKDKPGWGDTLGDMFSGMGSIAGGGKEGLPKIQPLKQLQAPGNVTTEGGPLASNTMSKQMIDQYRQQLAALQNQAPPQAPAGNPWRLF
Other Proteins in cluster: phalp2_14220
Total (incl. this protein): 12; Avg length: 403,5 AA; Avg pI: 6,85

Protein ID Length (AA) pI
30Fm6 419 6,15012
1lKoP 402 6,76443
1mW6V 404 8,73593
4HOwW 404 6,02109
4HRvh 404 5,83449
4IfIB 401 5,83449
5JgR9 332 7,80656
5nbfT 417 6,46711
6FgmU 413 9,19134
6TcPk 311 5,64993
kRaz 522 5,25263
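The summary line for this cluster can be checked against the member table. The sketch below assumes the averages are plain arithmetic means over the 11 listed members plus this protein (Slzc, 413 AA, pI 8,52, taken from its physico-chemical properties). Decimal commas are written as points in the code.

```python
from statistics import mean

# (protein ID, length in AA, pI) copied from the member table.
members = [
    ("30Fm6", 419, 6.15012),
    ("1lKoP", 402, 6.76443),
    ("1mW6V", 404, 8.73593),
    ("4HOwW", 404, 6.02109),
    ("4HRvh", 404, 5.83449),
    ("4IfIB", 401, 5.83449),
    ("5JgR9", 332, 7.80656),
    ("5nbfT", 417, 6.46711),
    ("6FgmU", 413, 9.19134),
    ("6TcPk", 311, 5.64993),
    ("kRaz", 522, 5.25263),
]

# This protein is not listed in the table; its values come from the
# physico-chemical properties section of the same record.
this_protein = ("Slzc", 413, 8.52)

rows = members + [this_protein]
total = len(rows)
avg_length = mean(length for _, length, _ in rows)
avg_pi = mean(pi for _, _, pi in rows)

print(total, round(avg_length, 1), round(avg_pi, 2))  # 12 403.5 6.85
```

Under that assumption the recomputed values agree with the reported total of 12, average length of 403,5 and average pI of 6,85.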
Similar Clusters (pHMM search)
# Cluster Protein ID # Members Identity (%) Alignment Length E-value
1 phalp2_20031 1p331 1 44,1% 265 7.926E-72
2 phalp2_30361 4Fnl6 5 38,3% 435 1.296E-70
3 phalp2_37203 1Yz8X 1 28,6% 349 1.057E-28

Domains

No domain annotations available.

Taxonomy

Phage
Name: Unknown from Metagenome
Taxonomy ID: UNKNOWN_ENVHOG
Lineage: No lineage information
Host
No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (30Fm6) rather than this protein.
PDB ID: 30Fm6
Method: AlphaFold v2
Resolution: 70.91
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
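
The confidence legend can be read as a simple threshold lookup. The sketch below maps a pLDDT score to the band names used in the legend. The function name `plddt_band` is hypothetical, and because the legend uses strict inequalities, the handling of scores exactly equal to 90, 70 or 50 (assigned here to the lower band) is an assumption.

```python
def plddt_band(score: float) -> str:
    """Return the legend's confidence band for a pLDDT score in [0, 100].

    Boundary values (exactly 90, 70, 50) are placed in the lower band;
    the legend itself leaves them undefined.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Sample scores chosen only to exercise each band.
for sample in (95.0, 82.5, 63.0, 41.0):
    print(sample, plddt_band(sample))
```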