Protein

Protein accession
kRfj [EnVhog]
Representative
4vSxN
Source
EnVhog (cluster: phalp2_31610)
Protein name
kRfj
Lysin probability
99%
PhaLP type
endolysin
Probability: 79% (predicted by ML model)
Protein sequence
MQGRIQKVASPRQQQLVSTLQKVGFQGKGLKTAYAIAMRESGGNPEAFNPDRTTGDQSYGLFQINMLGDMGPARRRQFGISGNEQLLDPLTNARAAYKMSRGGKDFGAWGLGPNAYRAGAGFDTIEKFYRTFPGTTLKGDKSPVVPRMAAQGAANAVVNAREPVSTDDVGSYFANVSGLQANQQQITDEAAMHLQQLRRVNQAIQASAPSETTRSLLGRLGETGAKVAPRLGQDLSLPDIPSVVYRPRTPHPADEARVNQQFMGSLIPPISSKTGNVPGVDDPTTSKQMQKVLAIAHDQIGTPYVWGAEDPKHGFDCSGLIEYCYEQAGIPTPGRLTTQSALTLGKSVKNQKYLPGDMLITNGGKHMVMYVGKGKVIAAPHTGEVVQYQPVSRFKGDIVDVRRFH
Physico-chemical properties
Protein length: 405 AA
Molecular weight: 43742,1 Da
Isoelectric point: 9,58
Hydropathy: -0,46
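The record does not say how the hydropathy value is derived. A common convention is the GRAVY score, the mean Kyte-Doolittle hydropathy over all residues. The sketch below assumes that convention; the function name `gravy` and the example fragment (the first 20 residues of the sequence above) are illustrative only and not part of the database.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy (GRAVY) of a protein sequence.

    Assumption: the sequence contains only the 20 standard residues;
    any other character raises KeyError.
    """
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # Illustrative fragment only: the first 20 residues of the kRfj sequence.
    fragment = "MQGRIQKVASPRQQQLVSTL"
    print(len(fragment), round(gravy(fragment), 2))
```

Passing the full 405-residue sequence to `gravy` gives a value that can be compared with the hydropathy listed above. Agreement would support the GRAVY assumption but would not prove it.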
Representative Protein Details
Accession
4vSxN
Protein name
4vSxN
Sequence length
335 AA
Molecular weight
34430,30120 Da
Isoelectric point
9,99371
Sequence
MAQLTPEALLQQLKNAGFRGNGLRTAWAIATRESSGNPQAFNGNAGTGDKSYGLFQINMLGGLGPARLKQYGLSSNDQLFDPSTNAKVAFRMSHGGTDFGAWGLGPNAYKGAPAAAKTRFDQIYQRYPGDTGTTGVAPRVAPLASATSAGGSAIAVPQPRSSPFVQALLNQTSKILGTPAPQLPAFSAPRPLPMAPQPHPAAGGPTATPAVPGSTVGAKIAKTALTQLGTPYQWGGPAKLGSRTDCSGLLQASAAANGIRIGRTTYEQWKQGTPVPLNQLQPGDAVFSHADSRGPGHVSIYIGNGQIVEDPHTGEVVSISDLAGRAGVIGARRYG
Other Proteins in cluster: phalp2_31610
Total (incl. this protein): 15
Avg length: 342,3 AA
Avg pI: 9,61

Protein ID Length (AA) pI
4vSxN 335 9,99371
16hrp 355 7,87489
1FTiK 396 9,46694
1ZcTI 364 9,64449
1bkyZ 239 9,68194
1jpKv 332 9,65454
1p4gP 397 9,43561
4EVaL 328 9,49428
4IAjC 338 10,38181
5GCas 340 9,76143
5t2Fl 303 9,91364
6FgHx 320 9,62714
QSpA 343 9,77684
hovo 340 9,88025
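The cluster summary can be rechecked from the table. The table lists 14 members. Adding this protein (kRfj, 405 AA, pI 9,58) gives the stated total of 15. The sketch below recomputes the two averages, writing the decimal commas of the record as decimal points. The variable names are illustrative.

```python
# (protein ID, length in AA, pI) for the 14 listed members plus this protein.
CLUSTER = [
    ("4vSxN", 335, 9.99371), ("16hrp", 355, 7.87489), ("1FTiK", 396, 9.46694),
    ("1ZcTI", 364, 9.64449), ("1bkyZ", 239, 9.68194), ("1jpKv", 332, 9.65454),
    ("1p4gP", 397, 9.43561), ("4EVaL", 328, 9.49428), ("4IAjC", 338, 10.38181),
    ("5GCas", 340, 9.76143), ("5t2Fl", 303, 9.91364), ("6FgHx", 320, 9.62714),
    ("QSpA", 343, 9.77684), ("hovo", 340, 9.88025),
    ("kRfj", 405, 9.58),  # this protein, taken from the properties above
]

n_members = len(CLUSTER)
avg_length = sum(length for _, length, _ in CLUSTER) / n_members
avg_pi = sum(pi for _, _, pi in CLUSTER) / n_members

print(n_members, round(avg_length, 1), round(avg_pi, 2))
# prints: 15 342.3 9.61
```

Both averages match the summary line only when kRfj is included, which is consistent with the "incl. this protein" wording.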
Similar Clusters (pHMM search)
#  Cluster               # Members  Identity (%)  Alignment Length  E-value
1  phalp2_14723 (6DHC0)  1          30,8%         318               4.045E-43
2  phalp2_2590 (6EE0m)   7          34,7%         331               2.648E-36
3  phalp2_30389 (4LmZ5)  29         26,1%         375               9.429E-29
4  phalp2_28557 (7UTPg)  9          26,5%         347               2.636E-27
5  phalp2_10964 (4LUfr)  2          28,4%         344               2.435E-25

Domains

No domain annotations available.

Taxonomy

       Name                     Taxonomy ID     Lineage
Phage  Unknown from Metagenome  UNKNOWN_ENVHOG  No lineage information
Host   No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4vSxN) rather than this protein.
PDB ID
4vSxN
Method AlphaFoldv2
Resolution 73.75
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
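The confidence legend above maps a per-residue pLDDT score to one of four bands. A minimal sketch of that mapping follows. The legend uses strict inequalities and does not say where the exact values 90, 70, and 50 belong, so placing them in the lower band is an assumption of this sketch, as is the function name `plddt_band`.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Assumption: scores exactly equal to 90, 70, or 50 fall into the lower
    band, because the legend only specifies strict inequalities.
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.0, 80.0, 60.0, 30.0):
        print(score, plddt_band(score))
```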