Protein

Protein accession
6SQz3 [EnVhog]
Representative
4Ij1v
Source
EnVhog (cluster: phalp2_1998)
Protein name
6SQz3
Lysin probability
74%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
VATITSLGFNIFSHYNAAGVRAARRDINDLSTTLERNEKALIATAARFQPLITAAVALTPALIPISTALAGVGAASAAMAVTTGAALGAYGLAAKGAIERTLEMAKAGQKLSPVQTTFVRSVNGMKSAWTGFIKSTEGMTLKTATVAVQGLTTGLGKMTPLVRAVHPEILKVAEAFRSWMGRDNGFQRFIDNVIKFGVPALRNLIAAGRDVLATLGIGFRTFLPLSVSVSKSLRDGAAAMRAWAEGGGFARFLEKVKESSGPVREFFKALVAALGNVLRAMSGLGPLSLGLATTLLRIVAALPPGWIQAIVIGFVAWKAAILGLLVIRLVTIAVTQLRLAWAVLNFVFAASPIGVIVIAIAALVTAIVLIATKTTWFQTAWTYAWNFIKTVAMAVWNFLTQGLGQLTLLLMGPIGVLALLALNWSTIWNAMLAVARFVWTAMQVGWQAFIMALQLVWTTVSTALVTAWNAVWNAIALVARTIWTGMQVAWSAFIMALQLVWTTVSTALVTAWNAVWNAIATVARTIWTGMQVAWSAFIMALQVVWVTVSSALSAAWNAVWNAMRVSAQAIWTAMQVAWNAFITALQTIWTAVSGALSAAWNAVWNALSTAARAIWTALQVAWQAFMTAMQTAWSTFSSAFSAAWNAAWNAVSTAARAIWTALQAAWQAFVTAVQNIWNTFIAAFRTSWQNGWNFVKTAAETIWNALKAAWTAFTQAIQNIWNAFIAAFRTSWSNGWNFVKNVADTVWTAIRGLWNAFTKGVQDTWNAFIAAFRTSWSNGWNAVRDIASDIWEKIKSVIKAAINAVIGFINKVTGGFNKVADFLNINVSISAIPALAEGGVIGFEYGGIAGKPPVAFAHGGTVPGYAPGKDRVPAVLSPGEGVLVPEAVKGLGGPGFVHSANYHYSKGRAGRKGGPTGGWRNRRWNKRSHGRDRKGNNAGMQRYAEGGVVPGGLNSTLQSFAMGGITLAALARAGVPASAIIQPEYNPGVAASAGTHDRGGVIDIVPNAAYLQALINAGFAAWMRGPEQGMSPHIHAVLMSHPDLSPAAAAQVASFKAGGTGLGVGGGGGGGILSLLQPILKKIGKILSDIAQGKSLSQAFAGVLELNIGDDDGGGLFGTGIGPDFGPDLTPGDNLGDALNMAIGGVLGAIIPGGNLKGLGKMLLGLIGAGPFKEAFDWAFKLLGNLDIGAGNFGKIMVGMGKKAVQGTIDFLINKDKEKQAEAMAAVSAPVAGAQSVQAWSALAAQALQIAGLSASQLPAFLALMAAESGGNPNAINRTDSNAAAGIPSQGLMQVIPPTFAAYRDPSLPNNILDPLANMVAAANYIQARYGGNVPGSPYALGTPGATRGWHMVGERGPEMVRFRGGEKVWPHGDKGYEHDRRWQRGGETKPWNDAKYEGRDHDCGAVNLSLPITVQGNMDQDAVKRLESELVPKLRMMLQQKVGRRG
Physico‐chemical
properties
protein length:1447 AA
molecular weight:152848,0 Da
isoelectric point:10,17
hydropathy:0,31
Representative Protein Details
Accession
4Ij1v
Protein name
4Ij1v
Sequence length
1104 AA
Molecular weight
115971,02480 Da
Isoelectric point
9,61025
Sequence
MRALGFSIFSTYDGSGVRSARRDMDDFSGSTNRSTSAMNSWNGRIILMTKAALIFGPALVPIAAHLAAIGGAAIAMGASAGIGLAAYGLAMKNAIDTTNAMAKAGKTLSASQKDFLKSQEAYNKAVSNFGSSFRDESLKAASATLKGFTSILKGLEPVAKALAPEVTKVAVSFEKWAKSSTGFKAYIGLIKDSAVPIFRDLVAAGKAVINVLGDGFRAFLPNGVALADTLRKGAEALKAWSDGGGFTRFLTYVQGNSGAVREFFKALVDALQTIGTVMKDLGPLSLTITTTILKLVAALPPAWIEAIVKGFIAWKVAMMGLLVINTVVTAIRAMAAAWFVLNLAFTASTIGLVVLAIAALVAGIILLIQHWDTVSGALKTAWDATWNAMKIAVEAVWNALKIGWSAFTGAFVIAWQAVSGALTAAWNATWNALKIATDAVWNALKLAWETVVNAFSTAWTAVGAALQIAWTTVWNAIQLAATTVWNVLKTAWEAFINGLQLIWTTVSTALTAAWQVFWNLIQTTAQTIWTALQVAWQAFITALQTIWTTVSTALTSAWQVFWNLIQTTAQAIWTALQVAWQAFITALQTIWTTVSTALTAAWQTFWNLIQTTAQAIWTAMQVAWQAFITALQTIWTTVSTALTSAWQAFWTALQTAAQTVWTALQAAWQAFLTAVQTVWTTVSTALQAAWQAFWTAIQTAAQTVWTAIQTAWQTFLTAVTTAWNTFKDALTAAWQAFWTAIQTAAQTIWTAIQTAWQTFLTAVTTAWNTFKDALTAAWKTFWEAIKTAAETIWTALQGAWDKFLKAIQTAWNTASAAVKKAWEETWDAMSGVAKKIWNTIGGIIEKAINGIIGIINMLTGGFNNVADFLSINIKIGKIDEVKLPGLANGGMVTFAYGGVAGMPATFANGGMANLSNGGALSGYAPGRDTVPAMLSKGEGVLTPEAVRGLGGAGFVHGANREFAGHRGAGKSGTGFSMGGIQHFATGGMVDSLTAAALAKAGVSLGLVSQGSHSDGALSGGTHLGGGVVDLSTTDPAVVAALRAAGFAAWARGPAEGFSPHIHAVLMSAPDLSAAAQAQVASFKAGGNGLGVGTAGGSGGGGGIP
Other Proteins in cluster: phalp2_1998
Total (incl. this protein): 11 Avg length: 1250,1 Avg pI: 10,00

Protein ID Length (AA) pI
4Ij1v 1104 9,61025
1zFp1 1472 9,93311
2S9ZC 1552 9,93659
4Inh4 995 9,73622
5HfAu 942 10,65670
5nHlr 1176 9,45605
6STsW 1024 11,07517
6SV9T 1437 10,16629
6ThF7 1348 10,09718
6Uf8I 1254 9,21829
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_5099
1ICO0
13 32,1% 1020 1.322E-189
2 phalp2_26448
1QZgj
1 24,0% 969 2.324E-103
3 phalp2_16197
4Fg3d
1 25,4% 748 1.495E-87
4 phalp2_2495
5Et7E
3 20,0% 1060 1.562E-55
5 phalp2_31864
4EoQg
2 20,1% 1002 1.932E-37

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4Ij1v) rather than this protein.
PDB ID
4Ij1v
Method AlphaFoldv2
Resolution 52.07
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50