Protein

Protein accession
1locp [EnVhog]
Representative
7G0GZ
Source
EnVhog (cluster: phalp2_20670)
Protein name
1locp
Lysin probability
98%
PhaLP type
endolysin
Probability: 97% (predicted by ML model)
Protein sequence
MANIPETSFIPRAGTDIAKNMSAGFAKASAAMRGMFGAQPGKTAANNQGYAAKAPQATFSSIAPFKAKDIGTVTTRPNESTRFEKSHPGIDIANKEGTAIPAFTPGKVTKVVSGQKPGTKGYGNYVVVTDSQGNRHQYSHLLKTYVNVGDAVGAASPVGAMGRTGSVYSTSGGTGTHLDYRVADAFGKWIDPTKYVSI
Physico‐chemical
properties
protein length:198 AA
molecular weight:20608,9 Da
isoelectric point:9,80
hydropathy:-0,35
Representative Protein Details
Accession
7G0GZ
Protein name
7G0GZ
Sequence length
196 AA
Molecular weight
21311,49310 Da
Isoelectric point
8,70899
Sequence
MPLTEELRGQQFGANQDENANFQNIASALSIPLRIRNQIQPQGSNIQKRSPKTYGNIEQFISDMGTITTPFMGSTRSEAEHPGIDIANKIGTAIKAFAPGVVKEVVTGKKQGDKAYGNYVVVEDPYGAKHRYSHLSQSYVRVGDRINAGDDIASMGATGNTYSTTGGTGSHLDYRIRDAAGIYLNPYSYLAKFLNS
Other Proteins in cluster: phalp2_20670
Total (incl. this protein): 5 Avg length: 197,2 Avg pI: 9,53

Protein ID Length (AA) pI
7G0GZ 196 8,70899
2eeMs 206 9,79837
2umwu 193 9,77562
4Bu7w 193 9,57944
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_18865
1IU43
9 53,3% 133 1.767E-52
2 phalp2_21998
4ZChj
22 35,7% 154 7.070E-19
3 phalp2_20604
4H0aV
4 34,0% 138 1.314E-18
4 phalp2_5933
67ENK
29 33,3% 138 2.130E-17
5 phalp2_34915
4pbkX
159 31,9% 141 2.972E-15
6 phalp2_18623
3ebV
83 32,6% 144 1.385E-14
7 phalp2_27905
l4Bb
39 28,8% 142 1.617E-13
8 phalp2_21800
4e6HN
71 31,4% 127 1.617E-13
9 phalp2_18491
6LzOo
5 29,5% 132 6.372E-12
10 phalp2_39519
5tggt
99 32,8% 137 3.974E-11

Domains

Domains
Disordered region
PET_M23
Representative sequence (used for alignment): 7G0GZ (196 AA)
Member sequence: 1locp (198 AA)
1 196 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01551

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
1locp
Method AlphaFoldv2
Resolution 77.20
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (7G0GZ) rather than this protein.
PDB ID
7G0GZ
Method AlphaFoldv2
Resolution 77.97
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50