I noticed the function used for splitting the training and validation
split <- rsample::initial_split(screening_decisions, prop = training_prop, strata = "LABEL")
is splitting the data properly in terms of the proportion size, but the strata argument doesn't seem to work very well