This section starts by discussing general aspects of Big Data and Health including data sizes, different areas including genomics, EBI, radiology and the Quantified Self movement. We review current state of health care and trends associated with it including increased use of Telemedicine. We summarize an industry survey by GE and Accenture and an impressive exemplar Cloud-based medicine system from Potsdam. We give some details of big data in medicine. Some remarks on Cloud computing and Health focus on security and privacy issues.
We survey an April 2013 McKinsey report on the Big Data revolution in US health care; a Microsoft report in this area and a European Union report on how Big Data will allow patient centered care in the future. Examples are given of the Internet of Things, which will have great impact on health including wearables. A study looks at 4 scenarios for healthcare in 2032. Two are positive, one middle of the road and one negative. The final topic is Genomics, Proteomics and Information Visualization.
Slides: (131 pages) Health Informatics (PDF)
This section starts by discussing general aspects of Big Data and Health including data sizes, different areas including genomics, EBI, radiology and the Quantified Self movement. We review current state of health care and trends associated with it including increased use of Telemedicine. We summarize an industry survey by GE and Accenture and an impressive exemplar Cloud-based medicine system from Potsdam. We give some details of big data in medicine. Some remarks on Cloud computing and Health focus on security and privacy issues.
We survey an April 2013 McKinsey report on the Big Data revolution in US health care; a Microsoft report in this area and a European Union report on how Big Data will allow patient centered care in the future. Examples are given of the Internet of Things, which will have great impact on health including wearables. A study looks at 4 scenarios for healthcare in 2032. Two are positive, one middle of the road and one negative. The final topic is Genomics, Proteomics and Information Visualization.
This lesson starts with general aspects of Big Data and Health including listing subareas where Big data important. Data sizes are given in radiology, genomics, personalized medicine, and the Quantified Self movement, with sizes and access to European Bioinformatics Institute.
Video: 10:02 : Big Data and Health: (streamed with optional CC) https://www.youtube.com/watch?v=ZkM-yZJQ1Cg (unstreamed without CC) https://drive.google.com/file/d/0B5plU-u0wqMoRlVwZlk0UERxVUk/view?usp=sharing
This covers trends of costs and type of healthcare with low cost genomes and an aging population. Social media and government Brain initiative.
Video: 16:09 : Status of Healthcare Today: (streamed with optional CC) https://www.youtube.com/watch?v=x9TpdMBqYrk (unstreamed without CC) https://drive.google.com/file/d/0B5plU-u0wqMoOEYxVzQxQWtpZXM/view?usp=sharing
This describes increasing use of telemedicine and how we tried and failed to do this in 1994.
Video: 8:21: Telemedicine: (streamed with optional CC) https://www.youtube.com/watch?v=Pe4CVXQaL_U (unstreamed without CC) https://drive.google.com/file/d/0B5plU-u0wqMoVWxfeHVaWWR4c0E/view?usp=sharing
Summary of an industry survey by GE and Accenture.
Video: 10:02: Big Data and Healthcare Indusry: (streamed with optional CC) https://youtu.be/SJhXdV4WfoI (unstreamed without CC) https://drive.google.com/open?id=0B5plU-u0wqMoSXRVYVV3ZlVvMnM
An impressive exemplar Cloud-based medicine system from Potsdam.
Video: 15:02: Medical Big Data in the Clouds (streamed with optional CC) https://www.youtube.com/watch?v=GldSVijkJcM (unstreamed without CC) https://drive.google.com/file/d/0B5plU-u0wqMoSXk3cFd0Z0Yyems/view?usp=sharing
Video: 6:33: Midical Image Big Data: (streamed with optional CC) https://www.youtube.com/watch?v=GOcVtwx2R2k (unstreamed without CC) https://drive.google.com/file/d/0B5plU-u0wqMoT1JsYnJXRFFpdWM/view?usp=sharing
This lesson covers 9 aspects of the McKinsey report. These are the convergence of multiple positive changes has created a tipping point for innovation; Primary data pools are at the heart of the big data revolution in healthcare; Big data is changing the paradigm: these are the value pathways; Applying early successes at scale could reduce US healthcare costs by $300 billion to $450 billion; Most new big-data applications target consumers and providers across pathways; Innovations are weighted towards influencing individual decision-making levers; Big data innovations use a range of public, acquired, and proprietary data types; Organizations implementing a big data transformation should provide the leadership required for the associated cultural transformation; Companies must develop a range of big data capabilities.
This lesson identifies data sources as Clinical Data, Pharma & Life Science Data, Patient & Consumer Data, Claims & Cost Data and Correlational Data. Three approaches are Live data feed, Advanced analytics and Social analytics.
This lesson summarizes an EU Report on Redesigning health in Europe for 2020. The power of data is seen as a lever for change in My Data, My decisions; Liberate the data; Connect up everything; Revolutionize health; and Include Everyone removing the current correlation between health and wealth.
The Internet of Things will have great impact on health including telemedicine and wearables. Examples are given.
A study looks at 4 scenarios for healthcare in 2032. Two are positive, one middle of the road and one negative.
A study of an Azure application with an Excel frontend and a cloud BLAST backend starts this lesson. This is followed by a big data analysis of personal genomics and an analysis of a typical DNA sequencing analytics pipeline. The Protein Sequence Universe is defined and used to motivate Multi dimensional Scaling MDS. Sammon’s method is defined and its use illustrated by a metagenomics example. Subtleties in use of MDS include a monotonic mapping of the dissimilarity function. The application to the COG Proteomics dataset is discussed. We note that the MDS approach is related to the well known chisq method and some aspects of nonlinear minimization of chisq (Least Squares) are discussed.
Next we continue the discussion of the COG Protein Universe introduced in the last lesson. It is shown how Proteomics clusters are clearly seen in the Universe browser. This motivates a side remark on different clustering methods applied to metagenomics. Then we discuss the Generative Topographic Map GTM method that can be used in dimension reduction when original data is in a metric space and is in this case faster than MDS as GTM computational complexity scales like N not N squared as seen in MDS.
Examples are given of GTM including an application to topic models in Information Retrieval. Indiana University has developed a deterministic annealing improvement of GTM. 3 separate clusterings are projected for visualization and show very different structure emphasizing the importance of visualizing results of data analytics. The final slide shows an application of MDS to generate and visualize phylogenetic trees.