Protein

Protein accession
6oqHh [EnVhog]
Representative
1cCmC
Source
EnVhog (cluster: phalp2_38437)
Protein name
6oqHh
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MYGIDISKHNGNINLEQYKGQFVIIRVGYGNFHLDEKFERNVNECKRLGIPFGVYHYSYALNEAEAEAEARGVLNAIAKYKNDIKVGVWFDMEDADGYKKKHGFKFSNGTIAPICYKFCKIIEDAGYYSGIYTSSSWLDYVKGLNDRFDKWVANWGKNDGTQHTNTSQYGTLQQYTSKPLDKNVMYADLSRYSRGNTTQPQPKPIYQIADEVIAGQWGNGDDRKKRLTDAGYDYNAVQDIVNKKVAPMRKSNDQIASEVIAGQWGDGNDRKNRLEQAGYNYDAVQKAVNKKMGAKKQPAHVYYVVKRGDTLSGIASKYGTTWQKLQAMNGIANPNKIYTGQCLRVK
Physico‐chemical
properties
protein length:346 AA
molecular weight:39205,6 Da
isoelectric point:9,21
hydropathy:-0,75
Representative Protein Details
Accession
1cCmC
Protein name
1cCmC
Sequence length
307 AA
Molecular weight
35306,30120 Da
Isoelectric point
8,43203
Sequence
MFGIDISEHNGNIDLSNYVGQFVIIRVGWGSFTKDKKFERNVAECKRLGIPFGVYHYSYALNPETAKREAEAVLKAIEPYKHDIKVGVWFDMEDADHWKAKHGFKIVRETIEPICYTFCKIIEDAGYYTGIYCSESWLRYLGESNKRFDKWVASWGTNNGTLQNNTQEYGTLHQYTSKPLDRNIMYAEISRYDTFAEHKPQEGVNGKPSAENGSNGVVLKPIDEIAREVMSGAWGNGWNRENALKQAGYDYNAVQKRVNELAHEKKIEAVARDVIAGKYGNGWRRKRNLKRAGYDYNEVQKKVNELM
Other Proteins in cluster: phalp2_38437
Total (incl. this protein): 101 Avg length: 323,2 Avg pI: 6,61

Protein ID Length (AA) pI
1cCmC 307 8,43203
11vto 321 4,52384
13DdX 375 8,96737
13rF0 338 9,31376
1kmTn 333 4,49815
1kn5r 321 4,41351
1lGQv 297 4,31655
21AwE 245 5,32572
21jRf 289 4,37532
21zMV 343 8,29993
23cKW 298 9,86104
23s1V 246 5,09263
23sGV 245 5,19329
24G2N 246 5,65834
2VcSr 307 8,15591
2lYrC 312 9,17754
3WNdw 320 9,06582
3c5D0 293 4,44495
3c70p 321 4,43477
3dXkN 289 4,45370
3vOTw 352 8,51919
4LcYr 343 8,29568
5KoX6 344 9,15350
5LD9g 323 4,37680
5M81o 354 8,33043
5MvWe 344 4,90318
5NzfV 296 8,50739
5OIgp 344 9,31641
5PC1o 301 9,48796
5PoWy 344 9,34658
5QbEn 343 9,21023
5S9Ii 352 8,04586
5Sj7Y 297 4,31513
5T0CD 293 4,39737
60M60 343 9,21738
60OZ2 343 4,62609
60mdy 344 8,41688
62DF7 321 4,46916
64FVU 343 8,19762
64Kvb 345 9,19153
65Qeb 282 8,82090
65l1e 321 4,44432
66zEJ 333 4,77132
67Z4d 333 4,66008
68Cid 344 9,25026
68P4x 344 9,41633
698Me 348 9,25800
6YIqE 332 4,59574
6YvLy 323 4,48706
6asV5 321 4,42374
6bL2U 345 9,13054
6dx23 333 4,63638
6gRYi 301 9,42168
6hHzR 245 4,78922
6j0YM 365 8,33507
6j2WB 359 8,33984
6kB2a 323 4,34713
6kDXO 321 4,52384
6lRzP 344 4,84265
6lynM 344 9,40769
6mJGw 344 9,20423
6mvLi 319 9,35051
6ohwl 346 9,31654
6pQoD 343 9,21738
6qnT5 343 9,27592
6r5OQ 346 9,33053
6rd8i 334 4,67770
6sEU5 301 9,39951
6sZNy 344 9,44973
6uCtu 346 9,36328
6uXwX 344 9,30790
70Wvg 321 4,46404
70gUM 334 4,55004
72z9p 323 4,46217
7BYs4 331 4,62609
7N7tn 369 8,62763
7NAb2 323 4,40527
7XuNa 312 9,10418
83xIc 296 4,29586
8552p 297 4,28466
88KcM 321 4,47194
8fkrm 307 8,43203
8oAdV 326 4,46001
DDqz 333 4,38572
HB2O 321 4,42107
HKET 295 4,32928
HkX4 321 4,43477
N0AI 321 4,44790
N5tN 317 9,05924
NBX6 321 4,47513
NoGL 323 4,40840
O4eA 321 4,38697
oQ2j 321 4,43619
qhn0 322 4,42374
qjSI 321 4,51435
wPsb 321 4,40425
wYkq 323 4,47155
yg1r 333 4,48911
z350 317 8,57270
z6Sj 321 4,47194
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_22096
5ZSom
2292 29,7% 319 1.016E-73
2 phalp2_33310
6kV0o
68 36,6% 314 3.189E-72
3 phalp2_26548
858CP
728 37,8% 285 2.509E-67
4 phalp2_31512
3WNwM
280 31,7% 280 4.576E-64
5 phalp2_5443
2VksN
6 41,6% 197 9.713E-57
6 phalp2_36883
7sfSC
39 36,4% 225 2.477E-56
7 phalp2_3425
3TK4D
923 33,7% 246 7.768E-50
8 phalp2_6199
5X5xO
102 34,0% 226 8.252E-48
9 phalp2_2537
6a7No
192 29,7% 249 4.688E-46
10 phalp2_13529
7Jyqc
660 40,8% 203 5.620E-45

Domains

Domains
Representative sequence (used for alignment): 1cCmC (307 AA)
Member sequence: 6oqHh (346 AA)
1 307 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01183, PF08230

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1cCmC) rather than this protein.
PDB ID
1cCmC
Method AlphaFoldv2
Resolution 90.34
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50