Artificial intelligence is making waves in the medical field, with a new model called Foresight claiming to revolutionize healthcare predictions. Developed by Chris Tomlinson and his team at University College London, Foresight is trained on the medical data of 57 million people who have used the National Health Service in England. This vast dataset includes eight different sets of medical information collected by the NHS between November 2018 to December 2023, totaling 10 billion health events for 57 million individuals.
The potential of Foresight is immense, with claims that it could assist doctors in predicting diseases, forecasting hospitalization rates, and even identifying disease complications before they occur. This could lead to early interventions and a shift towards more preventative healthcare on a large scale. However, concerns have been raised regarding the privacy and data protection implications of using such a large-scale AI model with sensitive medical data.
While the researchers behind Foresight assure that all records used to train the AI were de-identified, there is always a risk of re-identification, especially with large datasets. Luc Rocher from the University of Oxford highlights the challenges of anonymizing rich health data for AI models, emphasizing the need for strict control over the use of such models. Michael Chapman from NHS Digital acknowledges this risk of re-identification and ensures that the AI operates within a secure NHS data environment to prevent data leakage.
Yves-Alexandre de Montjoye from Imperial College London suggests testing the AI model to verify if it can memorize sensitive information, a crucial step in protecting patient privacy. Caroline Green from the University of Oxford emphasizes the importance of public trust and transparency in using medical data for AI models, as people want to know how their data is being utilized.
Under the General Data Protection Regulation (GDPR), individuals should have the option to withdraw consent for the use of their personal data. However, the nature of large language models like Foresight makes it impossible to remove single records once they have been trained. This raises legal questions about data protection laws and the use of de-identified data in AI models.
Despite the legal complexities and ethical considerations surrounding the use of medical data for AI, Foresight is currently being used for research related to covid-19 under exceptions to data protection laws enacted during the pandemic. The need for a balance between technological advancements and ethical considerations is emphasized, with a call for human-centered approaches to AI development in healthcare.
In conclusion, Foresight represents a significant advancement in healthcare predictions, but the challenges of privacy, data protection, and ethical considerations must be carefully navigated to ensure the responsible use of AI in the medical field.