Module 1: Data Visualization

Reflective Piece, Using Driscoll’s Model, Polynomial Regression Models in Python

What?

For Unit 3, I put together a project showing how to build and compare a simple linear regression model and a polynomial regression model in Python, using the Diabetes dataset from sklearn.datasets. I decided to focus on BMI as the single independent variable and see how each model performed in predicting diabetes progression. This meant loading and preparing the dataset, splitting it into training and testing sets, fitting both models, and then checking how well they did using R², MSE, and MAE. I also made visualisations to see the differences in the fitted lines. To make sure I was not just following steps blindly, I read Deep Learning: Foundations and Concepts by C and H Bishop, and watched a couple of YouTube tutorials on regression and polynomial regression in Python.

So what?

Before doing this, I thought adding more complexity to a model was always a good thing. Polynomial regression seemed like an upgrade, it can bend and twist to fit the data better, right? But actually, I learned that more flexibility is not always better. If you are not careful, you can end up overfitting and losing the ability to make accurate predictions on new data. Seeing this play out with real metrics in Python made the idea click for me in a way that reading about it never did. Another thing that stuck with me was the ethical side, this dataset relates to a medical condition, so getting predictions wrong could have serious consequences. It reminded me that building models is not just about getting the highest score, it is also about being responsible and transparent.

Now what?

Working on this project made me think about how regression models could fit into my work at the aviation maintenance college. I deal with grades, exam results, and student and staff records every day. While I do not work with medical datasets, the idea of spotting trends and predicting outcomes is very relevant. For example, a similar approach could help identify students who might be at risk of failing based on their past results, attendance, or other factors. It could also be used to track staff compliance or forecast when training updates are likely to be needed. The key takeaway for me is that the same care I used in this project, avoiding over complicated models, checking accuracy, and explaining results clearly, applies just as much in my current role as it does in academic exercises.

References

Bishop, C.M. and Bishop, H., 2023. Deep Learning: Foundations and Concepts. Springer Nature.
McKendrick, I., 2021. Polynomial Regression in Python. YouTube. Available at: https://youtu.be/Q8UfQ_aoCeQ [Accessed 9 Aug. 2025].
Davies, J., 2021. Polynomial Regression with Scikit-Learn. YouTube. Available at: https://youtu.be/aV_sRopNTrw [Accessed 9 Aug. 2025].
Driscoll, J., 2007. Practising Clinical Supervision, A Reflective Approach for Healthcare Professionals. 2nd ed. Edinburgh, Elsevier.
Atkins, S. and Murphy, K., 1994. Reflective practice. Nursing Standard, 8(39), pp.49–56.