BMS-Aned seminar
Department of Data Science Methods, Julius Center, University Medical Center Utrecht
2024-09-26
What is AI?
Artificial Intelligence is the branch of computer science that focuses on creating systems capable of performing tasks that typically require human intelligence. (Russell and Norvig 2020)
What is Machine Learning?
Machine Learning is a subset of artificial intelligence that involves the use of algorithms and statistical models to enable computers to perform tasks without explicit instructions. Instead, they rely on patterns and inference from data. (Samuel 1959)
What tasks can we perform with machine learning?
| i | length (cm) | weight (kg) | sex |
|---|---|---|---|
| 1 | 137 | 30 | boy |
| 2 | 122 | 24 | girl |
| 3 | 101 | 18 | girl |
| … | … | … | … |
We typically assume these data are independent and identically distributed (i.i.d.) samples from some unknown distribution \(p(l,w,s)\):
\[l_i,w_i,s_i \sim p(l,w,s)\]
use samples to learn a model for the joint distribution \(p\) \[ l_j,w_j,s_j \sim p_{\theta}(l,w,s) \]
task | |
---|---|
generation | \(l_j,w_j,s_j \sim p_{\theta}(l,w,s)\) |
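As a toy illustration of the generation task, here is a minimal Python sketch that fits a deliberately simple parametric model \(p_{\theta}\) to the three rows above (Bernoulli for sex, independent Gaussians per variable within each sex; this model choice is an assumption for illustration, not something the slides prescribe) and samples new rows:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy training data from the table above: length (cm), weight (kg), sex.
length = np.array([137.0, 122.0, 101.0])
weight = np.array([30.0, 24.0, 18.0])
sex = np.array(["boy", "girl", "girl"])

# Hypothetical model p_theta(l, w, s): Bernoulli for sex,
# independent Gaussians for length and weight within each sex.
theta = {"p_boy": float(np.mean(sex == "boy"))}
for s in ("boy", "girl"):
    mask = sex == s
    theta[s] = {
        "mean": np.array([length[mask].mean(), weight[mask].mean()]),
        "sd": np.array([length[mask].std(), weight[mask].std()]) + 1.0,  # avoid sd = 0 on tiny data
    }

def generate(n):
    """Sample n new (length, weight, sex) rows from p_theta."""
    rows = []
    for _ in range(n):
        s = "boy" if rng.random() < theta["p_boy"] else "girl"
        l, w = rng.normal(theta[s]["mean"], theta[s]["sd"])
        rows.append((round(l), round(w), s))
    return rows

print(generate(3))
```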
use samples to learn a model for the conditional distribution \(p\) \[ l_j,w_j \sim p_{\theta}(l,w|s=\text{boy}) \]
task | |
---|---|
generation | \(l_j,w_j,s_j \sim p_{\theta}(l,w,s)\) |
conditional generation | \(l_j,w_j \sim p_{\theta}(l,w|s=\text{boy})\) |
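Conditional generation then only requires sampling from the fitted conditional \(p_{\theta}(l,w|s)\); continuing the sketch above (same hypothetical `theta` and `rng`):

```python
def generate_conditional(n, s="boy"):
    """Sample n new (length, weight) pairs from p_theta(l, w | s)."""
    return [tuple(rng.normal(theta[s]["mean"], theta[s]["sd"])) for _ in range(n)]

print(generate_conditional(3, s="boy"))
```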
use samples to learn a model for the conditional distribution \(p\) of one variable \[ s_j \sim p_{\theta}(s|l=l',w=w') \]
task | |
---|---|
generation | \(l_j,w_j,s_j \sim p_{\theta}(l,w,s)\) |
conditional generation | \(l_j,w_j \sim p_{\theta}(l,w|s=\text{boy})\) |
call this one variable the outcome, and either

- classify as the majority class among the generated samples
- or: use a model that outputs expected values (probabilities) and classify with a threshold

\[ \hat{s}_j = \text{boy} \quad \text{if} \quad p_{\theta}(s=\text{boy}|l=l',w=w') > 0.5 \]
task | |
---|---|
generation | \(l_j,w_j,s_j \sim p_{\theta}(l,w,s)\) |
conditional generation | \(l_j,w_j \sim p_{\theta}(l,w|s=\text{boy})\) |
discrimination | \(p_{\theta}(s|l=l_i,w=w_i) > 0.5\) |
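For discrimination we can model \(p_{\theta}(s|l,w)\) directly. A minimal sketch on the toy table, using logistic regression via scikit-learn (one possible model choice, assumed here) and the 0.5 threshold from above:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Toy training data from the table above: columns are length (cm) and weight (kg).
X = np.array([[137.0, 30.0], [122.0, 24.0], [101.0, 18.0]])
y = np.array([1, 0, 0])  # 1 = boy, 0 = girl

# Model p_theta(s | l, w) directly with logistic regression.
model = LogisticRegression().fit(X, y)

# Discriminate: predict boy when p_theta(s = boy | l, w) > 0.5.
p_boy = model.predict_proba([[130.0, 28.0]])[0, 1]
print(p_boy, "boy" if p_boy > 0.5 else "girl")
```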
\[y = \sum_{i=0}^5 x_i \beta_i\]
\[\begin{align} a_i &= w_{0i} + w_{1i} x_1 + \ldots \\ h_i &= g(a_i) \\ y &= \sum_{i=1}^3 h_i w_i \end{align}\]
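A minimal numpy sketch of this one-hidden-layer network, with shapes chosen for illustration (2 inputs, 3 hidden units) and ReLU assumed for the unspecified activation \(g\):

```python
import numpy as np

def g(a):
    """Activation function; the slides leave g unspecified, ReLU assumed here."""
    return np.maximum(a, 0.0)

def forward(x, W, b, w):
    """One-hidden-layer network matching the equations above:
    a_i = w_0i + w_1i x_1 + ...; h_i = g(a_i); y = sum_i h_i w_i."""
    a = b + W @ x   # pre-activations, one per hidden unit
    h = g(a)        # nonlinear transformation
    return h @ w    # weighted sum over the 3 hidden units

rng = np.random.default_rng(0)
W = rng.normal(size=(3, 2))  # 3 hidden units, 2 inputs (illustrative sizes)
b = rng.normal(size=3)
w = rng.normal(size=3)
print(forward(np.array([1.0, 2.0]), W, b, w))
```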
\[L(\theta) = \frac{1}{n} \sum_{i=1}^n \ell(y_i, f(x_i;\theta))\]
\[\nabla L(\theta) \approx \frac{1}{m} \sum_{i=1}^m \nabla \ell(y_i, f(x_i;\theta))\]
\[\theta_{t+1} = \theta_t - \alpha \nabla L(\theta_t)\]
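Putting the three equations together, a minimal minibatch SGD sketch, with a linear model standing in for \(f(x;\theta)\), squared error for \(\ell\), and synthetic data (all illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic data (illustrative): y = x @ beta + noise.
X = rng.normal(size=(200, 2))
y = X @ np.array([1.5, -2.0]) + rng.normal(scale=0.1, size=200)

theta = np.zeros(2)   # parameters of the stand-in linear model f(x; theta)
alpha, m = 0.1, 16    # learning rate and minibatch size

for t in range(500):
    idx = rng.choice(len(X), size=m, replace=False)  # draw a minibatch
    resid = X[idx] @ theta - y[idx]                  # f(x_i; theta) - y_i
    grad = 2 * X[idx].T @ resid / m                  # minibatch gradient of the squared error
    theta = theta - alpha * grad                     # the update rule from the slide
print(theta)  # should end up close to [1.5, -2.0]
```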
Parameter counting is a bad proxy for model complexity in neural networks
Whereas in regression models model complexity is well captured by the number of parameters, this is not the case for neural networks.
\[\begin{align} \text{word}_1 &\sim p_{\text{chatGPT}}(\text{word}|\text{prompt}) \end{align}\]
Prompt=“Frank went to the bar and”
\[\begin{align} \color{green}{had} &\sim p_{\text{chatGPT}}(\text{word}|\text{Frank went to the bar and}) \end{align}\]
Prompt=“Frank went to the bar and”
\[\begin{align} \color{green}{had} &\sim p_{\text{chatGPT}}(\text{word}|\text{Frank went to the bar and})\\ \color{orange}{a} &\sim p_{\text{chatGPT}}(\text{word}|\text{Frank went to the bar and } \color{green}{had}) \end{align}\]
Prompt=“Frank went to the bar and”
\[\begin{align} \color{green}{had} &\sim p_{\text{chatGPT}}(\text{word}|\text{Frank went to the bar and})\\ \color{orange}{a} &\sim p_{\text{chatGPT}}(\text{word}|\text{Frank went to the bar and } \color{green}{had})\\ \color{red}{drink} &\sim p_{\text{chatGPT}}(\text{word}|\text{Frank went to the bar and } \color{green}{had} \ \color{orange}{a}) \end{align}\]
Prompt=“Frank went to the bar and”
\[\begin{align} \color{green}{had} &\sim p_{\text{chatGPT}}(\text{word}|\text{Frank went to the bar and})\\ \color{orange}{a} &\sim p_{\text{chatGPT}}(\text{word}|\text{Frank went to the bar and } \color{green}{had})\\ \color{red}{drink} &\sim p_{\text{chatGPT}}(\text{word}|\text{Frank went to the bar and } \color{green}{had} \ \color{orange}{a})\\ \text{STOP} &\sim p_{\text{chatGPT}}(\text{word}|\text{Frank went to the bar and } \color{green}{had} \ \color{orange}{a} \ \color{red}{drink}) \end{align}\]
Prompt=“Frank went to the bar and”
\[\begin{align} \color{green}{met} &\sim p_{\text{chatGPT}}(\text{word}|\text{Frank went to the bar and}) \end{align}\]
Prompt=“Frank went to the bar and”
\[\begin{align} \color{green}{met} &\sim p_{\text{chatGPT}}(\text{word}|\text{Frank went to the bar and})\\ \color{orange}{a} &\sim p_{\text{chatGPT}}(\text{word}|\text{Frank went to the bar and } \color{green}{met}) \end{align}\]
Prompt=“Frank went to the bar and”
\[\begin{align} \color{green}{met} &\sim p_{\text{chatGPT}}(\text{word}|\text{Frank went to the bar and})\\ \color{orange}{a} &\sim p_{\text{chatGPT}}(\text{word}|\text{Frank went to the bar and } \color{green}{met})\\ \color{red}{friend} &\sim p_{\text{chatGPT}}(\text{word}|\text{Frank went to the bar and } \color{green}{met} \ \color{orange}{a}) \end{align}\]
Prompt=“Frank went to the bar and”
\[\begin{align} \color{green}{met} &\sim p_{\text{chatGPT}}(\text{word}|\text{Frank went to the bar and})\\ \color{orange}{a} &\sim p_{\text{chatGPT}}(\text{word}|\text{Frank went to the bar and } \color{green}{met})\\ \color{red}{friend} &\sim p_{\text{chatGPT}}(\text{word}|\text{Frank went to the bar and } \color{green}{met} \ \color{orange}{a})\\ \text{STOP} &\sim p_{\text{chatGPT}}(\text{word}|\text{Frank went to the bar and } \color{green}{met} \ \color{orange}{a} \ \color{red}{friend}) \end{align}\]
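The loop behind both continuations is the same: repeatedly draw the next word from \(p(\text{word}|\text{context})\) until STOP. A minimal sketch, with a hypothetical lookup table standing in for the model's next-word distribution (a real LLM computes these probabilities with a neural network):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-in for p_chatGPT(word | context): a tiny lookup table
# covering only the two continuations shown above.
NEXT = {
    "Frank went to the bar and": (["had", "met"], [0.5, 0.5]),
    "Frank went to the bar and had": (["a"], [1.0]),
    "Frank went to the bar and had a": (["drink"], [1.0]),
    "Frank went to the bar and had a drink": (["STOP"], [1.0]),
    "Frank went to the bar and met": (["a"], [1.0]),
    "Frank went to the bar and met a": (["friend"], [1.0]),
    "Frank went to the bar and met a friend": (["STOP"], [1.0]),
}

def generate(prompt):
    """Sample one word at a time from p(word | context) until STOP."""
    context = prompt
    while True:
        words, probs = NEXT[context]
        word = rng.choice(words, p=probs)
        if word == "STOP":
            return context
        context = context + " " + word

print(generate("Frank went to the bar and"))
```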
- more compute resources
- bigger data
- bigger models (enabled by data and compute)
©Wouter van Amsterdam — WvanAmsterdam — wvanamsterdam.com/talks