Date: 27th January – 31st January 2020
Venue: Main Buliding, Institute of Psychiatry, Psychology & Neurosciences (IoPPN). View Map.
This 5 day course provides a comprehensive introduction to the fundamentals of clinical prediction modelling using modern statistical modelling techniques for health research. It will cover all steps of developing and accessing a prediction model. Computer based teaching introduces students the theory and practical implementation of cutting-edge predictive statistical and machine learning modelling techniques using the R statistical software.
Clinical prediction research develops models that try to predict the chances of a clinical outcome (such as death, diagnosis, treatment success or other future outcomes) based on characteristics related to the patient. Such models can be used to help clinician communicate the chances of clinical outcomes to their patients and to improve their management. It is therefore of crucial importance that such models are developed and tested appropriately. This 5 day course is aimed to PhD students and researchers in health research and will provide an introduction to key components of prognosis and stratified medicine research using cutting edge statistical and machine learning modelling techniques.
The course covers all major steps of developing and accessing a clinical prediction model, including study design and data preparation, the problem of over-fitting in regression models, how to overcome over-fitting using penalized regression and cross-validation methods, how to deal with missing data, feature variable selection, performance assessment and clinical usefulness of a model. An introduction to other machine learning techniques for prediction modelling, such as random forests and support vector machines, will be provided.Each day a short presentation of an application in prediction modelling will be presented. Teaching will be through lecturers and practical computer lab session interspersed with short presentations of prediction modelling researchers on current work. Practical sessions will involve the analyses and interpretation of practice datasets using the software R. Syntax of all procedures will be provided and explained but some familiarity with a syntax-based software (R, STATA, SAS) is advised. A short 1.5 h introduction to R will be provided at the beginning of the course
This workshop will assume that participants have a good knowledge of regression analyses (as can be obtained from the BHI Introduction to Statistical Modelling Course in January) and some experience with R or any other syntax based statistical software, such as STATA (An introduction to R can be obtained from the BHI Introduction to Programming course running in October or the Intro to R course running in February 2019). Participants will need to bring their own laptop computer with R installed (http://www.r-project.org). We recommend to further install RStudio, a very handy user interface for R (free download from http://www.rstudio.com/)
Subject-specific: Knowledge, Understanding and Skills
At the end of the course the students should be able to demonstrate subject-specific knowledge, understanding and skills and have the ability to:
- Have a good understanding of core clinical prediction concepts, such as prognosis, prognostic factors, prognostic models, and stratified medicine and will be able to apply this understanding to the design, conduct, and interpretation of clinical prediction modelling research studies;
- Be able to describe how modern statistical concepts, regression and machine learning methods can be applied in medical prediction problems;
- Be familiar with the principles that play a role in internal validation such as over-fitting, optimism and shrinkage and understand key components of internal validation methods such as cross-validation or bootstrapping;
- Be able to develop simple prediction models, assess their quality and validate them using R software;
- Be able to critically assess the general applicability of a developed model to predict future outcomes;
- Be equipped with a range of statistical and machine learning skills, including problem -solving, project work and presentation, which will enable students to take prominent roles in a wide spectrum of employment and research.
General: Knowledge, Understanding and Skills.
On successful completion of this course the student should be able to:
- to show initiative and the ability to work autonomously and independently with minimal guidance from others;
- to effectively communicate and critically assess own work in discussion groups;
- to successfully work in a team during computer group lab sessions;
- to show confidence in the use of programming software to implement prediction models.
Cost and Booking
Booking / Application
- External Early bird: £855 (till 29/11/19, price thereafter £950)
- KCL Staff Early bird: £641.25 (till 29/11/19, price thereafter £712.5)
- KCL Student Early bird: £427.5 (till 29/11/19, price thereafter £475)
- Other Student Early bird: £641.25 (till 29/11/19, price thereafter £712.5)
That is, 50% discount to King's College London students, 25% discount to other students and staff at King's College London and King's Health Partners.
Booking for this course is open
Last booking date: 20th January 2020
To apply please email firstname.lastname@example.org with the following details:
Subject: Application for Prediction Modelling 2020
Contact Phone Number:
- Are you affiliated with KCL and/or King's Health Partners?
- If Yes, indicate how you are affiliated with KCL and/or King's Health Partners
- Indicate your education/employee status: KCL PhD, KCL student, KCL staff, King's Health Partners affiliate, External Student or External
- In 100 words, state why you wish to enrol/participate in this course:
In 100 words, state which skills you hope to acquire:
Once your application has been approved, you will be sent a link to payment and a discount code if one is to be applied.
Professor Daniel Stahl (Academic Lead)
Dr Cedric Ginestet
Daniel is a Professor of Medical Statistics and Statistical Learning and lead of the Precision Medicine and Statistical Learning Group.
I started my academic career as a behavioural biologist at the German Primate Center in Germany. During my PhD, I became aware of the importance of statistics and data science. I attended an MSc in Biostatistics and worked since then as a statistician in academic research institutions in Germany, Scotland and - since 2006 – at King’s College in London. I am now lead of the "Precision Medicine and Statistical Learning Group". A primary focus of the group is to develop tools to aid clinical decision using predictors which can be easily, reliably and cost-effectively collected from mental health service users.
My interest is applying statistical and machine learning methods to identify predictors, mediators, and moderators of treatment success and using model-based cluster analysis methods to identify subgroups among psychiatric patients. My methodological research concerns the correct treatment of missing data in machine learning procedures and the assessment of subgrouping in prediction modelling.
As a Lead Trial Statistician, I have been responsible for overseeing the statistical aspects of a number of clinical trials within the IoPPN. I am further interested in model selection problem, improving the low reproducibility of medical studies and- a blast from my past - in the evolution of social system in primates.
See Daniel's research profile here.
Dr Raquel Iniesta
Cedric has received a PhD in Biostatistics from Imperial College London.
Cedric has been affiliated to the Neuroimaging Department in King's College London, as well as to the Mathematics and Statistics Department in Boston University, before joining the Department of Biostatistics and Health Informatics in the Institute of Psychiatry and Neuroscience in 2014.
Causal inference; network analysis; object data analysis; statistical learning.
See Cedric's research profile here.
Dr Daniel Stamate, Goldsmith University
Raquel is a BRC Lecturer in statistical learning and precision medicine. Her main research is focused on identifying clinical and genetic predictors of risk to complex disorders and response to treatment.
She has been doing research in big data analysis and personalised medicine since 2003. After getting graduates in mathematics and statistics by the Autònoma University of Barcelona, she got a PhD in Biomedical research by the Catalan Institute of Oncology in 2010. Her activity since then has also included consultancy and teaching.
Computational statistics & machine learning; High-dimensional data modelling; Bioinformatics; Genetics and Pharmacogenetics of complex diseases (Cancer, Schizophrenia, Major Depression, Hypertension).
She has also designed a website for Statistical Learning & Prediction Modelling Research Group.
See Raquel's research profile here.
Dr Mizanur Khondoker, University of East Anglia
I am a Machine Learning scientist, Data Science team leader, Director of Data Science MSc Programme, and industry AI – Machine Learning expert speaker and consultant. I established and lead the Data Science & Soft Computing Lab which has collaborations with various research groups at King’s College London, University of Manchester, Imperial College London, Maastricht University, and National Research Tomsk State University, and with companies in the City of London such as Santander Bank, Mizuho Investment Bank, etc.
At Goldsmiths, I initiated, designed and run the MSc in Data Science - which inspired and was mostly replicated into similar online programme to come at University of London. I have a background in Computer Science and Mathematics, holding an MSc degree in Computer Science & Mathematics from University of Iasi - Faculty of Mathematics, and a PhD in Computer Science from University of Paris-Sud - LRI Computer Science Laboratory.
See Daniel's research profile here.
Mr Dominic Stringer
Mizanur is a Senior Lecturer in Medical Statistics, Norwich Medical School, University of East Anglia (UEA).
See Mizanur's research profile here.
Dr Deborah Agbedjro
Dominic is a statistician working primarily on the set up, conduct and analysis of clinical trials supported by the King’s Clinical Trials Unit.
He works on trials across several health domains including Psychosis and Renal failure. He has a Bachelor’s degree in Mathematics from the University of Bath and a Master’s degree in Medical Statistics from the London School of Hygiene and Tropical Medicine.
He also has a background in data management in the clinical trials setting. Dominic's other research interests include predictive modelling using statistical learning methods.
See Dominic's research profile here.
Deborah's project aims to develop a personalized medicine prediction model for people with schizophrenia treated with Cognitive Remediation Therapy (CRT) by combining statistical learning methods, missing data imputation techniques and model validation procedures.
The model is trained and validated on several randomised controlled trials individual participant data on the use of CRT.
See Deborah's research profile here.
Your place will not be confirmed until payment has been made. Failure cancel without sufficient notice will forfeit your course fee and access to future courses. If you would like to pay by internal transfer, please contact email@example.com