Skip to content

Interview with a Data Scientist - The Aerospace Engineer who Loved Spreadsheets

Welcome to the first installment of our “Interview with a Data Scientist” series, where we explore the careers and work of the talented members of the NHS England Data Science team. We aim to showcase the fantastic individuals who contribute to the NHS England Data Science Profession and provide valuable insights for those considering a career in Data Science within the healthcare sector.

The first interview is with Joe Wilson, a former Engineer whose desire to make the perfect F1 Fantasy Team during the COVID pandemic drove him to become a Data Scientist who won’t stop talking about Reproducible Analytical Pipelines.

This interview orginally was published in the March edition of the Data Science Community for Health and Care Newsletter. You can subscribe and join the community here.

How did you end up in data science at the NHS? What did you do before, and what really sparked your interest in this field?

My journey to data science at the NHS is a blend of engineering roots, a pandemic pivot, and a strong personal connection to healthcare.

My academic background began with an Integrated Master's in Aerospace Engineering. I was captivated by the problem-solving inherent in understanding how complex systems, particularly aircraft, function. This fascination with engineering persists, as evidenced by my continued interest in aerospace content.

A pivotal moment that hinted at my future direction occurred during a second-year Business Simulation module. We were tasked with managing car companies within a simulated European market. I developed a comprehensive spreadsheet to optimise production, enabling us to create cost-effective cars with popular features. I would go on to make many more spreadsheets, including one that calculated my mark margins for upcoming exams to allow me to still graduate with first class honours.

Following graduation, I joined a small engineering firm, where I contributed to the development of control software for automated test rigs. We built custom rigs to test components like aircraft actuators and truck brake assemblies, using the visual programming language LabVIEW. This involved wiring code sections together, and I still recognise LabVIEW's distinctive interface in various applications, including airport airbridge controls.

Unfortunately, I lost my job around the onset of the COVID-19 pandemic, coinciding with a severe contraction of the job market. During the lockdown, I used the time to learn Python, which sparked a realisation that I wanted to pursue a career in data science. While I'd always enjoyed numbers, spreadsheets, problem-solving, and programming, the pandemic provided the catalyst to connect these interests.

I enrolled in a Data Science conversion Master's program at Loughborough University, joining the inaugural cohort of the degree. The decision to apply to the NHS was deeply personal. My father was a GP and healthcare consultant, and my then-girlfriend (now wife), was training to be a doctor. The NHS's mission resonated with me, and I discovered the NHS Digital Graduate Scheme. I applied and was fortunate enough to be accepted.

I began my role at the NHS just two weeks after submitting my Master's project.

Once you joined the NHS, what was that experience like? What different roles and teams have you been a part of, and how have they shaped your career?

Joining the NHS was a real whirlwind, but in the best way possible! I've been here for three years now, starting at NHS Digital and then transitioning into NHS England after the merger. My first two years were spent on the NHS Digital Graduate Scheme, and since then, I've been working as a full-fledged Data Scientist.

That Graduate Scheme was incredible. We were the first cohort of its kind, a group of just under twenty, and we kicked things off with an intense eight-week coding bootcamp. It was a fantastic bonding experience – we'd be hammering away at code and then decompressing with board games during breaks.

My first placement on the NHS Spine project, which is this massive, intricate system connecting all sorts of different NHS systems. Because it's so complex, everyone was constantly learning, and that made it a really supportive environment for a newbie like me. Plus, I got to see firsthand what a well-oiled DevOps team looked like, which really shaped my approach moving forward, especially when I landed in the Business Intelligence Development Team.

That team was all about building Tableau dashboards and trying to standardise things. My manager asked me to dig into the Tableau API. They had this super slow, manual process for checking dashboards, and I realised we could automate it with Python. So, I built a proof-of-concept testing framework, which was a great chance to put my NHS Spine lessons to work in a totally different area.

My final placement was with the Reproducible Analytical Pipelines (RAP) Squad. We were all about promoting good coding practices for analytics and helping teams automate their workflows. Basically, we were bringing in the best of data and software engineering into the data science and analytics space. In that role, I got to write articles, create training guides, and even run workshops. We also did these 'Engagements' with other analytical teams, helping them improve their processes. After a year, I passed my end-of-scheme interview and got promoted to a Band 7 Data Scientist, staying with the RAP Squad and leading those Engagements.

More recently, I moved to the National Secure Data Environment Service Team, or the SDE Data Wranglers. We make sure researchers get the right, pseudonymised data. While there, I also got to work on a side project developing an ML-powered tool to detect potentially sensitive information in huge datasets. It was a fun challenge, working with Docker containers, PySpark, and figuring out how to deploy ML models in a clustered computing environment.

Overall, it's been a really varied and rewarding experience. Each role has built on the last, and I've learned so much about the NHS and the power of data.

What are you currently working on? Are there any projects that you're particularly excited about, or that you feel are making a real difference? What impact are you having?

Right now, I'm embedded within an analytical team focused on commissioning specialised NHS services. It's a really interesting area, and I'm helping them boost their Reproducible Analytical Pipelines (RAP) compliance, automate some of their manual processes, and bring in some Python expertise to diversify their toolkit.

One project I'm particularly excited about is automating their monthly reporting. We've managed to cut the processing time from a week down to just a few minutes, which is a huge win! It's incredibly rewarding to free up my colleagues' time, allowing them to focus on more impactful, data-driven insights rather than repetitive tasks.

I'm also personally driven to achieve something new for me; building a pipeline that meets Gold RAP compliance standard; essentially, a fully automated and packaged analytical process. We're facing some interesting challenges, but I'm confident that overcoming them will pave the way for more efficient and robust projects in the future.

If you could give someone just starting out in data science a few pieces of advice, what would they be? And what resources have you found particularly helpful along the way that you can share?

If I could give some advice to someone starting out in data science, I'd say:

  • Consider a conversion master's if it's feasible. Mine was a game-changer. It accelerated my learning so much, and it gave me a strong portfolio to show potential employers right away.
  • Dive into online resources, but don't get lost in them. There's a wealth of information out there! Kaggle Learn is a fantastic starting point.
  • Master Git and GitHub. Seriously, this is crucial. I even helped create a workshop for the NHS-R Community and some training materials on it, which you can find on the RAP Community of Practice website. Learning Git and GitHub will also allow you to use GitHub codespaces, and avoid the pain of setting up your local environment.
  • Don't just stick to tutorials. It's easy to get stuck in tutorial hell. Find a simple, personal project that excites you. Mine was building a tool to optimise my Fantasy F1 team. Passion projects are a great way to build a portfolio and showcase your skills.

Basically, combine structured learning with practical application, and don't be afraid to explore your own interests!