Protein

Protein accession
7W5oo [EnVhog]
Representative
2ICDq
Source
EnVhog (cluster: phalp2_30054)
Protein name
7W5oo
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKVSPKRLRNHLNRHKITHSLVPGWDSPAIDPYKGKSDFAGILLHHTAGRDSLNYIVNSNKYAPVRACHFLIARDGTVMVVSGSGAYHAGSGGPWSFTKNVRVPKDSGNSRLYGIEIESLGTTAKINGKPEGMTVDQVVATARLCAALLDAMRLGPLSLKAGRVIRHRDWAPRRKIDTCQDLDWWQHAVRIALRNAKNPNTAEAALRAFIRANPNGVLVA
Physico‐chemical
properties
protein length:220 AA
molecular weight:24114,4 Da
isoelectric point:10,28
hydropathy:-0,31
Representative Protein Details
Accession
2ICDq
Protein name
2ICDq
Sequence length
217 AA
Molecular weight
24236,53900 Da
Isoelectric point
10,38594
Sequence
MRVKPARLLNKLRAHDVNYKLVKGWNSASIDPYNGRSDFKGVVLHHTAGINSLNYIVNTNPFAPVRACHFLVQRDGTVQVVSGVGAYHAGKGGAYKFNRLVTIPKDQGNRYLYGIEIESMGRTATIGNGEGAINIEQVVSTALLTAALLNAMRPTWKSLPVSRVIRHRDWTTRKPDVKQDLDWWHQVVGIARRNRKDSAKAEREIRAFVKANPKGTL
Other Proteins in cluster: phalp2_30054
Total (incl. this protein): 63 Avg length: 221,7 Avg pI: 9,88

Protein ID Length (AA) pI
2ICDq 217 10,38594
14aGr 219 9,89121
15Spu 217 10,16417
1I8SY 155 9,75673
1MK41 217 10,13928
26mSa 234 10,38813
2LSTe 217 10,44854
2M1JV 222 10,15437
2PF0C 220 10,39122
2RUuF 211 9,56719
2a1YT 217 10,33127
2dLeI 217 10,51629
2dlr3 222 10,29291
340Gc 218 10,10331
34J4c 219 10,18344
3QSZ 219 10,06682
3Xd3o 216 9,37797
3hxoZ 220 10,91406
4443N 219 9,89282
4AeQN 220 10,71518
4CQgf 223 10,45666
4EQr 220 11,04326
4KLFt 217 10,62138
4WX7i 224 10,32676
4ZXv8 211 9,85839
4aQTz 199 9,48796
4afZ 220 10,74715
4bg7z 219 9,91906
4eJLB 318 9,97734
4eTGM 315 9,90732
4eyt4 282 6,89277
4fVVq 266 6,72533
4oADw 219 9,82564
4zNau 211 10,21542
55jx 221 10,17094
56wOH 220 9,88399
56wQD 211 9,53650
5Aiaf 220 10,63375
5BrWi 206 9,46333
5D6RZ 217 9,64281
5aBWE 211 9,63250
5ast5 211 9,53682
5cH94 210 9,72262
5wFON 218 10,12639
5ydPT 220 10,43713
6AgsV 220 9,78097
6PFOS 223 10,00403
6U7iq 175 5,90144
6y8g3 219 9,94929
7IiSa 220 10,90671
7VJM7 210 9,76478
83IHD 211 9,72275
8A4Cd 221 10,28272
8ar5s 282 6,58476
8dZsw 210 9,80527
8hKoH 210 9,80579
8mtjw 213 10,39361
ihFH 223 10,61461
itB3 223 10,44834
kjgo 217 10,30019
sFHn 222 9,39106
tGvS 220 10,18963
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_26945
4CQCh
13 51,1% 219 5.801E-87
2 phalp2_32384
yRbL
80 28,8% 201 1.027E-23
3 phalp2_13355
486o0
7 26,7% 202 6.596E-23
4 phalp2_24508
4NNU3
22 24,7% 222 3.684E-21
5 phalp2_7405
4FizF
295 24,1% 211 5.933E-20
6 phalp2_22551
1qQEX
38 29,6% 192 5.133E-19
7 phalp2_12497
QM59
22 26,6% 180 1.759E-18
8 phalp2_39600
6zgRa
59 29,2% 205 2.393E-18
9 phalp2_35081
5cImK
1 33,1% 163 3.802E-17
10 phalp2_19071
1jII3
225 25,9% 227 1.762E-16

Domains

Domains
Representative sequence (used for alignment): 2ICDq (217 AA)
Member sequence: 7W5oo (220 AA)
1 217 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (2ICDq) rather than this protein.
PDB ID
2ICDq
Method AlphaFoldv2
Resolution 97.58
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50