This is an animation of a molecular dynamics trajectory of a hemoglobin. See https://www.rcsb.org/structure/5EUI for the original PDB file, and watch the animation above.

Continue reading

I predicted the aqueous solubility of chemical compounds listed in a public dataset using a basic linear model and an untuned random forest in my previous post. The random forest showed the two models’ best performance, achieving an RMSE of 0.866 and an R^2 of 0.828. In this post, I train an XGBoost model on the same data. Load the libraries library(tidyverse) library(tidymodels) library(ranger) library(usemodels) library(gridExtra) library(vip) theme_set(theme_classic()) Load, split, and bootstrap sample the data.

Continue reading

Author's picture

Alicia Key

I am passionate about data and science.

PhD student

Aurora, Colorado, USA