Protein

Protein accession
1ZeFv [EnVhog]
Representative
4G6cQ
Source
EnVhog (cluster: phalp2_19690)
Protein name
1ZeFv
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
VGGWENRSNSHGGYDAVRGVVIHHDAGGSSDTASVNFQCFVDADRPNAALHVGRKGEVWIMAAGATNTQGLGGPIAGVPLNQGNFQLIGIEMGNNGVGEPYPAPQQNTILWLCRTLVAVYGARFGFGPGSVISHFEWAPARKNDPSGPSRWSPGGGRWDMAAFRSDVAVPPPPPGDDFMTPEDKTYLDTKFKTLFDAYFVDIRGVAFGQQMVDTIKAGVRAELETSFGLDALRTALVAAIGEALADATVTVDAEAIAKATLDAMAARLAA
Physico‐chemical
properties
protein length:270 AA
molecular weight:28417,6 Da
isoelectric point:5,08
hydropathy:-0,08
Representative Protein Details
Accession
4G6cQ
Protein name
4G6cQ
Sequence length
274 AA
Molecular weight
28770,08860 Da
Isoelectric point
6,65144
Sequence
MPSWLIEHHTASNKNSGNAPSLNIVTNGRPDLPGPLANYLTARDGTIYVVASGRANHAGIGAYPDGMTGNSRSFGNEAENDGVGEPWSLVQMNAINRAAYAICRHLGWAADRVVGHKEYALPRGRKIDPTYDMNVHRATVAALLGQPPAPGPDPVPDPSRSKQMYVLIQRGAGGPIATFDGTTKVFVPNDTLVGGHKIVLGSLGLKNDVLVVDAVFYDSIPDRNDTAGDLRVGRSIIDETVKGVLAGIPAAGNVDNQAIATAVAVELAKRLGNG
Other Proteins in cluster: phalp2_19690
Total (incl. this protein): 62 Avg length: 304,2 Avg pI: 6,25

Protein ID Length (AA) pI
4G6cQ 274 6,65144
11k2b 224 5,88462
15HAd 252 5,85660
1DU7Q 289 7,67817
1Iefb 314 7,75513
1IsoI 282 6,08373
1NV5N 310 6,23441
1QWyH 303 5,22836
1a69x 245 5,65471
1dJFW 245 5,65471
1heQb 292 7,21965
2QxNG 307 5,54978
2S8M0 269 5,97556
2qP2a 323 5,40149
3373p 325 6,53668
3OMS7 309 6,59909
4IeF5 244 5,27877
4JiRG 295 5,67488
4MkVQ 352 5,51943
4Ml1W 297 5,55035
4N6SJ 298 5,10621
4N7dc 301 6,25186
4N7hO 284 9,37540
4N7jl 324 5,29656
4N8ly 288 8,66734
4NFr1 347 5,22972
4QK8B 283 6,58027
4QTET 271 9,60922
4QkdN 347 5,32606
4Qr0Y 319 5,66454
4QxPP 344 5,22972
4QxoO 323 6,66434
4Qy1y 277 6,15432
4Qy2g 309 5,93583
4YwxO 335 5,71740
4lGtT 318 6,89630
4n0or 308 9,86419
5B8it 372 4,74750
5Hg51 347 5,39512
5klTp 295 5,55507
6HVJk 266 5,42479
6SJi7 282 5,32447
6T7Nw 287 7,06016
6UfIg 282 5,70887
6UxXh 295 5,13321
6x2j3 300 5,64141
7cvzo 339 5,42877
7m9Dc 330 7,24086
7p23F 295 8,53808
7udUF 288 5,99369
7ujSe 292 6,45864
7wJLL 294 6,04729
7zjzC 321 5,36255
80LSy 307 7,76201
866Wr 314 6,65797
8hNRH 316 5,70569
RzyH 298 4,83407
dgSp 406 10,83863
fomk 299 6,48888
trmD 370 4,75887
ttfi 370 4,75887
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_11030
4TmCr
48 31,3% 252 1.170E-34
2 phalp2_29173
7xGLk
1 33,1% 175 1.697E-30
3 phalp2_12630
1Iuq4
5 31,4% 181 9.318E-29
4 phalp2_32798
2yYOK
118 34,4% 209 1.091E-27
5 phalp2_9416
8Lrse
65 28,8% 253 1.484E-27
6 phalp2_10690
2ZSmu
3 27,9% 204 4.932E-23
7 phalp2_40432
4fxAl
3 29,8% 194 1.037E-21
8 phalp2_6936
2Qzwv
1 24,3% 263 1.633E-14
9 phalp2_18883
1PV8a
2 26,0% 219 9.732E-14
10 phalp2_10221
T1NY
7 25,2% 194 1.233E-04

Domains

Domains
Unannotated
Unannotated
Representative sequence (used for alignment): 4G6cQ (274 AA)
Member sequence: 1ZeFv (270 AA)
1 274 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
1ZeFv
Method AlphaFoldv2
Resolution 79.96
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4G6cQ) rather than this protein.
PDB ID
4G6cQ
Method AlphaFoldv2
Resolution 69.04
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50