Hi all,
I am currently exploring protein generator on the HuggingFace space. I am trying out the AA compositional bias conditioning and I ran the following example: 'W0.2,E0.1', with 40 diffusion steps and a protein length of 250 residues. This resulted in the following protein sequence:
AAPPPAAAVAAAAAAAPPAPAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAPAAAAAAAAAAAAAAAAAAAAAAAPAAAALAAAAPAPAAAAAAAPAAAVAAAAAAAAAAAAAAAAAAAAAAAPAAAPAAAAAAAAAAAAAVAAAAAAAAAAAAPAAVPAAAAAAAAAAAAAAAAAAAAAPAAAAAAAAAAAPAAAAPAAAAAAAAAAAAPAAAAAAAAAALAAAAAAAAAVA
As you can see, the sequence is almost exclusively composed out of alanines and no tryptophans or glutamic acids occur, although explicitly conditioned on these residues. When I change residues and/or bias, the results are similar and I never obtain a sequence coming close to the conditions.
Can you verify something is going wrong here?
Best,
Alex
Hi all,
I am currently exploring protein generator on the HuggingFace space. I am trying out the AA compositional bias conditioning and I ran the following example: 'W0.2,E0.1', with 40 diffusion steps and a protein length of 250 residues. This resulted in the following protein sequence:
AAPPPAAAVAAAAAAAPPAPAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAPAAAAAAAAAAAAAAAAAAAAAAAPAAAALAAAAPAPAAAAAAAPAAAVAAAAAAAAAAAAAAAAAAAAAAAPAAAPAAAAAAAAAAAAAVAAAAAAAAAAAAPAAVPAAAAAAAAAAAAAAAAAAAAAPAAAAAAAAAAAPAAAAPAAAAAAAAAAAAPAAAAAAAAAALAAAAAAAAAVA
As you can see, the sequence is almost exclusively composed out of alanines and no tryptophans or glutamic acids occur, although explicitly conditioned on these residues. When I change residues and/or bias, the results are similar and I never obtain a sequence coming close to the conditions.
Can you verify something is going wrong here?
Best,
Alex