Data preparation strategies for advanced predictive analytics