HEALTHCARE DOES HADOOP: AN ACADEMIC MEDICAL

2y ago
23 Views
2 Downloads
5.22 MB
33 Pages
Last View : 1d ago
Last Download : 3m ago
Upload by : Kian Swinton
Transcription

HEALTHCARE DOES HADOOP:AN ACADEMIC MEDICAL CENTER’SFIVE-YEAR JOURNEYCharles Boicey, MS, RN-BC, CPHIMSChief Innovation OfficerClearsense

The doctor of the future will give no medicine, but insteadwill interest his patients in the care of human frame, indiet, and in the cause and prevention of disease.Thomas Edison (1847 – 1931)

PHR Centric HealthHIEModernHDPEMR

Early Days - 2010Naveen Ashish, PhD

NowTrending 2012

Current EnvironmentElectronic Medical Record Not designed to process highvolume/velocity data Not intended to handlecomplex operations Such as: Anomaly detection Machine learning Building complex algorithms Pattern set recognitionEnterprise Data Warehouse Suffer from a latency factor ofup to 24 hours The EDW serves all of thefollowing retrospectively asopposed to in real time Clinicians Operations Quality and research

Big Data Interoperability Big Data Ecosystem thatSupports: Neo 4j (Graph Database) Relational Data Base Hadoop (HDFS) R Hbase Spark Hive Storm Pig Weka MapReduce Mahout MongoDB (NoSQL)

Big Data Complete Data The Electronic Medical Record is primarily transactional takingfeeds from source systems via an interface engine The Enterprise Data Warehouse is a collection of data fromthe EMR and various source systems in the enterprise In both cases decisions are made concerning data acquisition A Big Data system is capable of ingesting and storinghealthcare data in total and in real time

Modern Healthcare Data PlatformA healthcare information ecosystem built on “Big Data”technologies should:Be capable of serving the needs of clinicians, operations, quality and researchAnd should do so in real time and in one environmentShould be:Able to ingest all healthcare generated data both internal and external in nativeformatShould be:A platform for advanced analytics such as early detection of sepsis & hospitalacquired conditionsBe enabled to predict potential readmissionsLeverage complex algorithms and be a machine learning platform

Architecture Guiding Principles Architecture to minimize encumbrance on IT staff Ability to store all healthcare date in native form andcomplete Use of supported open source code Ensure architectural compatibility with commercialapplications

Infrastructure Low Cost of Entry & Scalable Open Source Commodity Hardware UCI Hadoop Ecosystem 10 nodes 5 terabytes Yahoo Hadoop Ecosystem 60K nodes 160 petabytes Cloud Ready

Data Sources Legacy Systems Print to Text or Delimited String Smart Pumps All HL7 Feeds (EMR source Healthcare Organization systems)All EMR Initiated Data (StoredProcedures)Device Data (in one minuteintervals)Physiological Monitors (HL7)Ventilators (HL7) Social Media (POC) Sentiment AnalysisPatient EngagementHome Monitoring (POC)Real Time Location System(RFID)Hospital Sensors

Newer Data Sources External Streaming Device Open DataData www.data.gov Wearables Adverse Drug Event www.researchae.com Home Devices Social Media Geographic InformationSystem (GIS) Data Omic Data Internet of Things (IoT) Telematics 5G

Use Cases Legacy System Retirement Cohort Discovery Patient Condition Changes RRT Data Science Clinician Aware Applications Early Sepsis Detection Patient Monitoring External toTraditional Healthcare Setting Event Driven Care & Real Real Time Nursing UnitTime Quality MonitoringUtilization Staffing and Resource Allocation Personal Health Record Social Media SentimentAnalysis Research Environmental Response

Future Use Cases Ventilator Management Vent dashboard in EMR Combining Phenotype Datawith Genotype DataHospital Acquired Infections Patient Threat Analysis(HAI) Edge and Vertices AnalysisVTE Surveillance Patient caregivers andoutcomesSensium Vitals Digital PatchPatient-Generated Data Home Devices (Scale, VitalSigns, Glucose) Exercise & Diet (Fit Bit,Jawbone, Nike)

Imaging Analytics NIH Funded U24 Grant Joel Saltz, PhD This project is to develop, deploy, and disseminate a suite of opensource tools and integrated informatics platform that will facilitate multiscale, correlative analyses of high resolution whole slide tissue imagedata, spatially mapped genetics and molecular data for cancerresearch.

Patient Persona Surveys Questionnaires Clinic Notes External Sources IoT Social Media Credit Telemetrics

FOSS Driven Protean Is a centrally-hosted, instrumented “Smart and Connected” platform servicing real time business event streams using high-speed MPPCompute and Storage GridsPrimarily based on the concepts and principles of Event DrivenArchitecture (EDA), Complex Event Processing (CEP) and Multi-AgentSystems (MAS)Support for high speed data ingestion - Structured and Unstructured(Textual)Core Advanced Analytics enabled through Model Building, Data Miningand Machine Learning techniques (Supervised and Unsupervised)Context modeling creation across Time-Space-Value dimensionsEnables creation of a Central Enterprise Data Refinery to enable “Sourceof Truth” for transactional information within the Healthcare Enterprise

FHIR – The “Public API” for Healthcare?FHIR Fast Health Interoperability Resource Emerging HL7 Standard (DSTU 2 soon) More powerful & less complex than HL7 V3ReSTful API ReST Representational State Transfer – basis for Internet Scale Resource-oriented rather than Remote Procedure Call (nouns verbs) Easy for developers to understand and useFHIR Resources Well-defined, simple snippets of data that capture core clinical entities Build on top of existing HL7 data types Resources are the “objects” in a network of URI reference linksHuff, S., McCallie, D HIMSS 2015

SMART Platform – Open Specificationfor Apps “Substitutable Medical Apps” Kohane/Mandl – NEJM (2009) A SMART App is a Web App HTML5 JavaScript Remote or embedded in EHR URL passes context & FHIR li nk EHR Data Access via FHIR OAuth2 / OIDC for securityHuff, S., McCallie, D HIMSS 2015

Some SMART HotbedsHuff, S., McCallie, D HIMSS 2015

Boston Childrens: SMART Growth ChartHuff, S., McCallie, D HIMSS 2015

DSRIP 8 billion dollar grant (Medicaid waiver) from CMS to NYState 25% reduction over five years in avoidable hospitalizationsand ER visits in the Medicaid and uninsured population Collaborative effort to implement innovative projectsfocused on System transformation Clinical improvement Population health improvement

5 Year Goals Create integrated Suffolk County care delivery system for387K lives anchored by safety net providers Engage partners across the care delivery spectrum tocreate a countywide network of care After five years, transition this network to an ACO which willcontract with insurance providers on an at risk basis

Suffolk Care Collaborative IT ArchitectureSuffolk County ProvidersStony Brook MedicineEMRs or clinical InformationSystemEMRs or clinical Information SystemSuffolk County PPS Population Management ToolsClinical Data for PatientCareRegistriesCare PlansWorkflowMed AdherenceMobilitySuffolk County PPS Patient PortaleFormsPatient WellnessAlertsMobile MonitoringPatient EducationClinical RecordsCollaborationSuffolk County Big Data PlatformPredictive AnalyticsEvent EngineStructured DataFinancial DataLegacy DataMachine LearningNLPUnstructured DataWearables DataSocial DataAnomaly DetectionRulesDevice DataHL7/CCDOpen DataSuffolk county PPS Master Patient Index (MPI)Suffolk county PPS Health Information Exchange (HIE)E-HNLI RHIO (HIE)

Gavin Stone, edico genome 5G Summit May, 14, 2015

New Team Members Data ScientistDevelopersCognitive and Behavioral PsychologyUser ExperienceHuman & Computer Interaction Devices Wearables Patients & Family

Trends: Big Data Definition: Evolving Creation & Management: Distributed and augmented Information Governance: Shared Meaningful Analysis: Beyond PnL, Reporting, Connections,Correlations, Pattern Recognition, Machine Learning, NaturalLanguage Processing Business Requirements: Blank Page; We don’t know whatwe want we will figure it out once we look at the data, the datawill lead the way, AKA, Data Science

Trends: Healthcare Content Analytics – Suggestive Analytics* – PrescriptiveAnalytics Imaging Analytics Moving Analytics out of the EMR Environment Graph Data Mart Edge and Vertices Analysis Omic & Phenotype Combines Sentiment AnalysisDale Sanders

Takeaways Underpinning platforms may change but concept is here tostay, abstract where possible. Machine learning will lead to the evolution of Data Science andeventual use of AI in Healthcare. Get used to source now, ask questions later: Healthcareevolves with data and it is not a point in time construct anylonger. Get used to working with constant change, disruptive trendsand something new that will make your “frameworks” obsolete.

Contact Me @Charles cine.educboicey@clearsense.com1 904-373-0831@N2InformaticsRN

Suffolk Care Collaborative IT Architecture Suffolk County Providers Suffolk county PPS Master Patient Index (MPI) Suffolk county PPS Health Information Exchange (HIE) E-HNLI RHIO (HIE) Suffolk County PPS Patient Portal Stony Brook Medicine Suffolk County Big Data Suffolk County Big Data PlatformPlatform

Related Documents:

1: hadoop 2 2 Apache Hadoop? 2 Apache Hadoop : 2: 2 2 Examples 3 Linux 3 Hadoop ubuntu 5 Hadoop: 5: 6 SSH: 6 hadoop sudoer: 8 IPv6: 8 Hadoop: 8 Hadoop HDFS 9 2: MapReduce 13 13 13 Examples 13 ( Java Python) 13 3: Hadoop 17 Examples 17 hoods hadoop 17 hadoop fs -mkdir: 17: 17: 17 hadoop fs -put: 17: 17

2006: Doug Cutting implements Hadoop 0.1. after reading above papers 2008: Yahoo! Uses Hadoop as it solves their search engine scalability issues 2010: Facebook, LinkedIn, eBay use Hadoop 2012: Hadoop 1.0 released 2013: Hadoop 2.2 („aka Hadoop 2.0") released 2017: Hadoop 3.0 released HADOOP TIMELINE Daimler TSS Data Warehouse / DHBW 12

The hadoop distributed file system Anatomy of a hadoop cluster Breakthroughs of hadoop Hadoop distributions: Apache hadoop Cloudera hadoop Horton networks hadoop MapR hadoop Hands On: Installation of virtual machine using VMPlayer on host machine. and work with some basics unix commands needs for hadoop.

The In-Memory Accelerator for Hadoop is a first-of-its-kind Hadoop extension that works with your choice of Hadoop distribution, which can be any commercial or open source version of Hadoop available, including Hadoop 1.x and Hadoop 2.x distributions. The In-Memory Accelerator for Hadoop is designed to provide the same performance

Configuring SSH: 6 Add hadoop user to sudoer's list: 8 Disabling IPv6: 8 Installing Hadoop: 8 Hadoop overview and HDFS 9 Chapter 2: Debugging Hadoop MR Java code in local eclipse dev environment. 12 Introduction 12 Remarks 12 Examples 12 Steps for configuration 12 Chapter 3: Hadoop commands 14 Syntax 14 Examples 14 Hadoop v1 Commands 14 1 .

-Type "sudo tar -xvzf hadoop-2.7.3.tar.gz" 6. I renamed the download to something easier to type-out later. -Type "sudo mv hadoop-2.7.3 hadoop" 7. Make this hduser an owner of this directory just to be sure. -Type "sudo chown -R hduser:hadoop hadoop" 8. Now that we have hadoop, we have to configure it before it can launch its daemons (i.e .

Hadoop and Pig Overview Lavanya Ramakrishnan Shane Canon . Source: Hadoop: The Definitive Guide Zoo Keeper 13 Constantly evolving! Google Vs Hadoop Google Hadoop MapReduce Hadoop MapReduce GFS HDFS Sawzall Pig, Hive . Hadoop on Amazon – Elastic MapReduce 19 .

Hadoop FS APIs Higher-level languages: Hive, BigSQL JAQL, Pig Applications Supported Hadoop versions: 2.7.1 HDFS Client Spectrum Scale HDFS RPC Hadoop client Hadoop FileSystem API Connector on libgpfs,posix API Hadoop FileSystem Connector on libgpfs,posix API GPFS node Hadoop client Hadoop