
how is alpha fold 3 trained #1031

Open
joepareti54 opened this issue Oct 16, 2024 · 0 comments

Comments


joepareti54 commented Oct 16, 2024

The paper says that the loss function is a weighted sum of L_distogram, L_diffusion, and L_confidence. But how is this implemented — is the derivative of the combined loss taken to update the weights? While processing is in the trunk, there is no diffusion yet; that happens later. And while processing is in the diffusion module, the trunk does not seem to execute again. Is there any backpropagation through the whole model? Moreover, training the diffusion module involves predicting the noise injected at a given time step, so the only loss term available for those weight updates would seem to be L_diffusion. How are the attention layers and MLPs trained, then?
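For context on the "weighted sum" part of the question: in most frameworks a combined loss is implemented by summing the weighted terms into a single scalar and backpropagating once; since gradients are linear, each term contributes its own gradient (scaled by its weight) to every parameter it depends on. Here is a toy, framework-free sketch of that linearity — all numbers, names, and the one-parameter model are hypothetical, not taken from the AlphaFold 3 code:

```python
# Toy illustration (not AlphaFold 3 code): the gradient of a weighted sum of
# losses, w_a*L_a + w_b*L_b, with respect to a shared parameter theta is just
# w_a*dL_a/dtheta + w_b*dL_b/dtheta. Any parameter touched by any loss term
# therefore receives a gradient contribution from that term.

def grad_total(theta, x, target_a, target_b, w_a, w_b):
    """Analytic gradient of w_a*(theta*x - target_a)**2 + w_b*(theta*x - target_b)**2."""
    g_a = 2.0 * (theta * x - target_a) * x   # dL_a/dtheta
    g_b = 2.0 * (theta * x - target_b) * x   # dL_b/dtheta
    return w_a * g_a + w_b * g_b             # gradients of the terms simply add

# One SGD step on the shared parameter theta (hypothetical numbers):
theta, lr = 1.0, 0.01
g = grad_total(theta, x=2.0, target_a=1.0, target_b=3.0, w_a=0.8, w_b=0.2)
theta -= lr * g
print(theta)  # 1.0 - 0.01 * 2.4 = 0.976
```

The same principle is why a single `backward()` call on the summed loss can update trunk, diffusion, attention, and MLP parameters at once, provided the computation graph connects them to at least one loss term.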
