Healthcare - AlgoAnalytics

Transcription

Healthcare - AlgoAnalytics
Healthcare Analytics
October 12, 2016
Outline
About AlgoAnalytics
Areas of Application for Healthcare
Our experience
Healthcare datasets
Page 2
About AlgoAnalytics
Analytics Consultancy
• Work at the intersection of mathematics and other domains
• Harness data to provide insight and solutions to our clients
Expertise in Mathematics and Computer Science
• Develop advanced mathematical models or solutions for a wide
range of industries:
• Retail, economics, healthcare, BFSI, telecom, …
Working with Domain Specialists
• Work closely with domain experts – either from the clients side or
our own – to effectively model the problem to be solved
Page 3
Healthcare: Areas of Analytics Application
Patient Aid
Resource Allocation
Revenue Optimization
Page 4
Patient Care Aid
• Analytics helps detect serious
disorders or diseases through
medico-genomic data as well
as through image analytics
• Classification technique
enables deciding whether a
particular pathology exists in a
particular patient or not
Clinical Decision
support
• Uses huge amount of past
patient data and genomics
data for diagnosing high risk
group through risk
stratification score.
• Offers better data driven care,
coordination and follow up in
prevention of lifestyle
diseases.
Chronic Disease
Management
Population health
management
• Generating risk score through
algorithm indicating high
chance of heart failure
amongst given patient
• Reducing readmission rates –
using scoring techniques to
locate high risk customers and
preventing readmissions
through higher monitoring
• Targeting cardiac patients
most in need of intensive
follow up
Adverse event
avoidance
Page
Page5 ‹#›
• Improved disease surveillance
• Identification of patients
needing more proactive
approach to manage their
conditions.
• Longitudinal patient analysis
• Improved patient compliance
Reducing
Readmissions
Resource Allocation
• Using historical data for staffing to
reduce costs
• Having the right clinician at right
time at right place
• Predict the number of days a
patient will spend in the hospital
over the next year
• Predict readmission for the same
illness
• Predict likelihood that an elderly
insurance policy holder will pass
away within 10 months in order to
trigger end-of-life counseling
• Risk of death in surgery based on
aspects of you and your condition
• Prevent bottlenecks in urgent care/
OR by analyzing patient flow during
peak times
Work Flow Optimization
Page 6
Efficient Use of Hospital
Resources
Death
Revenue Optimization
• Forecasting of cash flows
based on claims history
• Using historical data, it uses
reimbursement analysis and
potential denials to forecast
cash.
Cash Flow
Forecasting
• Identify opportunities to
collect missing income,
including claims that are
wrongfully rejected by payers
or overdue monies from
patients.
• Insurance coverage
Billing Errors
Patient Selection
• Patients expense of
treatment
• Pricing model based on past
data of previous illnesses,
costs, etc.
• Can quote min-max range,
with and without insurance
Expected Expense
of Treatment
Page
Page7 ‹#›
• Identifying patients those are
not likely to pay in full
IMAGE ANALYTICS
Page 8
MRI Image Analytics Overview
Goal
Value of Recommendation
Use image analytics on MRI scans to detect
whether a particular pathology exists
•
•
•
•
•
Automatic segmentation
Volumetric analysis
Classification
Feature selection
Analyzing voxel intensities – texture
features
Image Normalization and
Segmentation
Volumetric Analysis,
Image Processing – Feature
Extraction using voxel values
Classifier
Feature Extraction from
segmented part based on
pathology
Page 9
Probability
(Disease)
Data Analysis &
Results
Probability
(Normal)
MRI Analytics – Aim
 Image segmentation and registration
- Automatic registration to a default map
- Automatic segmentation of brain parts
 Image processing
- Volumetric Analysis of the images
- Extract intensity based features from
the image
- Texture features like entropy, energy,
correlation, etc.
 Machine learning techniques
- Combine volumetric, textural features
and other possible features of the image
to classify
 Generalize the above process for multiple
pathologies
Page 10
Further Work:
 Consult domain expert for improving
the performance accuracy of automated
segmentation module
 Better feature selection using domain
knowledge of the pathology
 Addition of textural features to the
predictive model
 Combining unsupervised and supervised
methods
Number of features
used : 58
Diabetic Retinopathy Overview
Image
Processing
Goal
Use image analytics on fundus images to
detect whether diabetic retinopathy exists
Work done till date
• Image processing
• Feature extraction
• Blood-vessels
• Exudates
• Classification
Image-Texture
Based
Features
Blood Vessels
and Exudates
Input
Data
Ensemble Learning
Classify each image as retinopathy or not-retinopathy
retinopathy
using multiple classifying algorithms
Retinopathy grades 0 and 1 are considered
as class 0 (No-Retinopathy) and
grades 2 and 3 as class 1 (Retinopathy)
Page 11
Retinopathy – Future Work
Image processing & Feature computation
• Improvements in blood vessels extraction, exudates extraction, etc.
• Features based on hemorrhages, macula, optical disk, etc.
Feature selection
• Optimize features used for classification
Model selection & Ensemble Learning
• Model tuning
• Different techniques of ensemble learning
Further Analysis
• Trying the model on different dataset / Addition of new images as input
• Similar analysis for multiclass problem
• Change in threshold for categorizing as retinopathy
Page 12
REVENUE INTELLIGENCE
Page 13
Healthcare Revenue Intelligence – Cash Forecasting Model
Goal:
1.
Project N-day ahead cash using revenue and important KPIs
2.
Project target cash if all KPIs are at target level
3.
Find revenue leakage: (N-day Target cash – N-day Forecasted cash)
Read data
dates, cash, revenue,
KPI values, target values
Page 14
Lag the data
Lag the data by n days,
Forecast for n days then
Regression Model
Predict the cash values
for n days ahead using
data remaining after
taking lag
Three Level Optimization Problem
Team Level
Optimization
User Level
Optimization
• Determine teams best using how team performed in past n days
• Objective : Maximise Total Amount Resolved
• Optimized values : Average Amount Worked, Productivity, Effectiveness Ratio
• Determine teams best using how user and team performs in past n days
• Objective : Maximise User Level Amount Resolved
• Optimized values : Average Amount Worked, Productivity, Effectiveness Ratio
• Study the external actionable influencers affecting Productivity and ER combined
• Give actionable recommendation to achieve desirable Productivity and ER
Specific
Recommenda • Influencers divided in to two subtypes : Private and Government
tion
Each level can be used on it’s own as a separate recommendation system !!
Page 15
Daily Work Allocation
 Input : Available users and
number of Pvt. and Govt.
accounts
 Output : Accounts allocation
Total 200
accounts to
work on a
given day.
100 Private ,
100 Govt.
Divide Pvt.
and Govt.
accounts in
the proportion
determined
Find the work
allocation
percentage
from this
optimization
Page 16
Total 7 users
available for
that day to do
a job
Run 3rd level
optimization
for the
available
users
User
Govt.
Accounts
Private
Accounts
Total
Accounts
21842
7
10
17
21843
0
37
37
21844
9
17
26
21845
8
7
15
21852
35
9
44
21853
39
11
50
21867
2
9
11
Total
100
100
200
1. Available Online Datasets
2. Potential Sources of Data
HEALTHCARE DATA
Page 17
Available Healthcare Data Sets Online
Acute Inflammations
- Diagnose two diseases of the
urinary system
Arrhythmia
- Distinguish/classify presence or
absence of cardiac arrhythmia
Breast Cancer Wisconsin
- Classify cancer as benign or
malignant based on FNA
samples
Cardiotocography
- Fetal cardiotocograms
processed and classified
Dermatology
- Determine type of ErythematoSquamous Disease
Page 18
Liver Disorders
- Data sensitive to liver disorders
which may arise from excessive
alcohol consumption
Lung Cancer
- Classify three types of
pathological lung cancers
Thoracic Surgery Data
- Determine post-operative life
expectancy of lung cancer
patients
Vertebral Column assessment
- Classify orthopaedic patients
into 3 classes: normal, disk
hernia or spondylolisthesis
Mammographic Mass
- Determine benign and
malignant mammographic
masses based on BI-RADS
attributes and patient’s age
Diabetes
- Analyze factors related to
readmission and other
outcomes
Parkinson’s
E. Coli Genes
- Predict clinician’s Parkinson’s
disease symptom score on the
UPDRS scale
- Data gives characteristics of
each ORF in the E. coli genome
Post-Operative Patient
Echocardiogram
- Determine where patients in a
postoperative recovery areas
should be sent to next
- Classify if patients will survive a
year after a heart attack
Example Data sets are available online and have been studied
Healthcare Data Assets
Page 19
The Analytics Process
Once a client requirement comes in:
Define and outline the
problem statement
Business Requirement
Understand data
Data preparation
Data Situation
Value-adding Results
Explainable results
Operational outcomes
Page 20
Develop models
Evaluate performance
Analytics Solution
Seamless Integration
Work with clients to
integrate solution
Machine Learning Techniques
Page 21
Decision Trees
Kernel Learning
• Flow-chart like structure
• Maps observations of an item to
conclusion on item’s target value
• SVM extension – different kernel
functions for feature subsets
• Effective when data comes from
a variety of sources
Random Forests
Deep Learning
• Extension of classification trees
• High accuracy and efficient on
large databases
• Model high level abstractions
• Architecture composed of
multiple non-linear
transformations
Artificial Neural Networks
Clustering
• Idea analogous to biological
neural networks
• Used to discover complex
patterns in data
• Unsupervised learning to group
data into 2+ classes
• Clustering based on similarity or
dissimilarity between data points
Logistic Regression
Optimization
• Probabilistic statistical
classification model
• Binary predictor
• Modifying a system to make
some aspect of it work more
efficiently or use fewer resources
Technology:
Page 22
Analytics Projects Overview
News / Social Media
Analytics
Risk Score Modeling
Algorithmic Trading
Network Modeling of Risk
Quantitative strategies
in the Indian markets
Event-based or real-time score
that is updated in real time
Strategy Development
Brokerage Analytics
Recommender Systems
Multilanguage Sentiment
Dormancy prediction for
the upcoming quarter
Machine learning methods to
recommend items to users
Calculating sentiments of news
articles in 7 different languages
Improvements to pre-existing
algorithmic strategies
Event identification based
on topic detection
Country risk scores on
pre-decided risk factors
Cash Flow Forecasting
Work Flow Optimization
Contracts Management
Performance Manager
Cash forecasting for 90
days in future
Work Allocation for team
and team members
Automate existing system to
analyze legal contracts’ text
Forecast various KPIs concerning
operational performance
Predictive Maintenance
IoT Analytics
Assisted Living
Retinopathy Analytics
Fault detection in ovens
for energy optimization
Analytics on signals obtained
from a cloud network
Outlier detection based on
sensor input
Automated detection of
diabetic retinopathy
Network Failure Model
Analytics Engine
Clickstream Analytics
MRI Analytics
Monitor communication
of a network of routers
Connect to an infrastructure
and provide required analytics
Compute inherent features of
users based on website logs
Evidence-based decision
support system
Page 23
AlgoAnalytics
 Areas of expertise
•
Predictive analytics consultancy
•
Algorithmic and quantitative trading in financial markets
 About us
•
Founded by Dr. Aniruddha Pant in 2009
•
Firm focused on the intersection of mathematics, computer science and other domains
 Team
Dr. Aniruddha Pant
CEO
Ph.D. (Berkeley, USA)
Control theory, Machine
Learning, Analytics,
Statistics
+ 30 Quantitative Analysts
with a solid foundation in mathematics and engineering
Page 24