Protein

Protein accession
21nXz [EnVhog]
Representative
1GsF1
Source
EnVhog (cluster: phalp2_25147)
Protein name
21nXz
Lysin probability
94%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MNNVIELNPATVTGYKFXVLLDNGHAKSTPGKRSPVFEDGKQFFEYEFARDIVNRISKELEKLNISYKIVTPEVDKDIALSTRANRVNRYCQKLGKNNCLLVSVHANASGNGKQWMPARGWSVXTTKGKTKSDAYADIFYKEAEKLLPLYGMNVRKDLSDGDYDYESDFTILYKSXCPALITENLFQDNKIDCEFLMSDKGRDVITQIHINAIKQILNIK
Physico‐chemical
properties
protein length:220 AA
molecular weight: Da
isoelectric point:8,52
hydropathy:
Representative Protein Details
Accession
1GsF1
Protein name
1GsF1
Sequence length
182 AA
Molecular weight
20222,33360 Da
Isoelectric point
10,01125
Sequence
MNVVLKKKLKKGMKNIMSKKLVILDSGHAKTTPGKRSPDSSLLEWKFNNEMQYLIKKRLEAHGIIVHLTNPDPGTVKDIGLTARANDANKKWKDLGKPEALFVSLHANAAGSCSQWLNARGVEVYHANNASQKSKNAALIICNEIFNGVYKNIDKGFKNRGRKAANFTVIYKAACPSINKKY
Other Proteins in cluster: phalp2_25147
Total (incl. this protein): 21 Avg length: 223,8 Avg pI: 7,85

Protein ID Length (AA) pI
1GsF1 182 10,01125
23Iyq 225 7,63441
23s8r 272 4,83355
2fZre 194 6,96843
38J6e 260 6,44727
3iciT 266 5,44571
4f7dX 198 5,82983
60Btg 205 9,16439
6j1rU 251 9,36785
7JCml 277 4,80695
7VmOP 203 6,89260
7w9HE 211 9,12461
84Srx 226 9,26109
85l51 251 9,35767
85mLg 251 9,36785
8gd9F 196 8,96286
8lVFB 196 8,49366
8pYW5 203 6,43715
8qZZT 196 8,71060
RHU8 217 9,25774
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_7949
zu48
832 35,9% 164 1.394E-46
2 phalp2_28969
5Zx99
42 39,4% 114 1.437E-40
3 phalp2_38776
1gGIw
71 30,8% 191 4.129E-38
4 phalp2_37494
3aZmq
1971 28,3% 187 6.306E-36
5 phalp2_20954
7po03
228 36,7% 158 9.606E-34
6 phalp2_5499
3iB6U
6 30,2% 129 1.316E-19
7 phalp2_22711
8625G
10 25,7% 171 3.349E-19
8 phalp2_30021
2qQ9h
7 27,3% 161 6.241E-19
9 phalp2_30498
5dbwi
1 26,5% 177 2.166E-18
10 phalp2_38987
84xdy
117 29,2% 164 2.602E-17

Domains

Domains
Representative sequence (used for alignment): 1GsF1 (182 AA)
Member sequence: 21nXz (220 AA)
1 182 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01520

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1GsF1) rather than this protein.
PDB ID
1GsF1
Method AlphaFoldv2
Resolution 91.22
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50