Health Care Data Analysis Essay

Improved Essays
Predictive Analysis of Health Care Data using SPARK

Vengamamba Pachava
Computer Information Systems
University of Houston – Clear Lake
Houston, Texas
Pachavav2901@uhcl.edu

Abstract—the main objective of this research is to provide a thorough review on how to handle the rapid growth of health care data using Apache Spark framework. Healthcare data is not very structured and unique. This data is more diversified and complex and using linear analysis techniques on it is not helpful.
Keywords—component; formatting; style; styling; insert (key words)
I. INTRODUCTION
In recent years, Internet has changed the way of life. New generation of health care data has evolved which includes Electronic Health Records (EHR’s), Electronic Patient Records (EPR’s). With this large-scale growth of structured and unstructured data, it has become difficult to store, manage, retrieve and distribute data. This research mainly concentrates on reviewing the recent developments in “Predictive Analysis of Health Care Data using SPARK”. With the increase in importance of public health surveillance, it is essential to consider an effective, scalable and secure framework. [1] The modern health care systems produce a vast amount of electronically stored data. Analyzing such data will create a great potential that improve patient care and health outcomes. There are different sources of healthcare data which makes it unique: A. Multiple Places: Healthcare data comes from different sources like Electronic Health Record (EHR) or Electronic Medical Record (EMR) which includes range of data such as demographics, laboratory results, personal statistics, medication etc. Healthcare data also occurs in different formats like digital, pictures, text, numeric, paper, multimedia and today’s EMR’s can hold numerous textual and numeric data. Also, there are few complex formats of heath data such as a patient’s broken arm might be an image in medical record but appears as a Diagnosis code in the claims data. [1] B. Structured and Unstructured data: One of the most important clinical fact is that health data is captured in whatever way is most convenient giving least priority on how to analyze and aggregate the data. As users become trained to
…show more content…
Managing such an amalgam o of data and turning it into usable information requires more advanced set of tools. [4]
II. V’S OF BIG DATA
• Volume: Big data implies enormous volumes of data. It used to be employees created data. Now that data is generated by machines, networks and human interaction on systems like social media the volume of data to be analyzed is massive. Yet, the volume of data is not as much the problem as other V’s like veracity.
• Variety: Variety refers to the many sources and types of data both structured and unstructured. We used to store data from sources like spreadsheets and databases. Now data comes in the form of emails, photos, videos, monitoring devices, PDFs, audio, etc. This variety of unstructured data creates problems for storage, mining and analyzing data.
• Velocity: Big Data Velocity deals with the pace at which data flows in from sources like business processes, machines, networks and human interaction with things like social media sites, mobile devices, etc. The flow of data is massive and continuous. This real-time data can help researchers and businesses make valuable decisions that provide strategic competitive advantages and ROI if you are able to handle the velocity. Professionals suggest that sampling data can help deal with issues like volume and

Related Documents

  • Improved Essays

    Nt1310 Unit 3 Study Guide

    • 703 Words
    • 3 Pages

    1) What is BI? Business intelligence, or BI, is an umbrella term that refers to a variety of software applications used to analyze an organization’s raw data. Companies use BI and several related activities, including data mining, online analytical processing, querying and reporting to improve decision making, cut costs, identify new business opportunities and identify inefficient business processes that are ripe for re-engineering. 2) What is Visual Analytics, dashboards, data warehouse, data dictionary, meta data, ETL, schema, attributes, hierarchy, cube, OLAP, “drill down”, data mining, data mapping.…

    • 703 Words
    • 3 Pages
    Improved Essays
  • Improved Essays

    College to Career: A recent survey commissioned by Bentley University in 2014 found that “compared with business decision-makers and corporate recruiters, high school and college students were far less likely to define preparedness in terms of personal traits or work ethic” (Chase, 2015). This means skills and experience rank high for college students when they evaluate preparedness. Unfortunately, the same survey found “thirty-seven percent of recent college graduates give themselves a grade of “C” or lower on their individual level of preparedness” (Chase, 2015).…

    • 899 Words
    • 4 Pages
    Improved Essays
  • Improved Essays

    Unit 7 D2

    • 693 Words
    • 3 Pages

    2. Data warehouses- collects data from multiple databases and data silos; then organizes them for complex analysis, to gain more knowledge, and to use in their decision process. 3. Data marts-a small-smaller versions of data…

    • 693 Words
    • 3 Pages
    Improved Essays
  • Superior Essays

    In chapter 6, we discuss the economic issues that occurs and how exponential growth of things will not yield more signal, but more noise. The danger in big data is losing sight of this underlying data…

    • 1038 Words
    • 5 Pages
    Superior Essays
  • Improved Essays

    Cookie-Cutter Strategies

    • 384 Words
    • 2 Pages

    There is no silver bullet that will transform student retention rates in a few days. Student success will happen over time, not overnight. We cannot dismiss the urgency of the moment. We must be fully committed to perform every function, take advantage of every opportunity, and be accountable for every responsibility so that student success becomes a reality. Academic institutions have developed best practices that we can follow.…

    • 384 Words
    • 2 Pages
    Improved Essays
  • Improved Essays

    Description of the Electronic Health Record (EHR) The Electronic…

    • 977 Words
    • 4 Pages
    Improved Essays
  • Great Essays

    Case Study Assignment – I Campbellsville University MASSIVE DATABASE MASTERING - MASTERCARD INTERNATIONAL Various affiliations are endeavoring to address the open entryways and limit challenges related with "huge data." Industry masters gage that the total volume of data is increasing at general interims and most by a wide margin of new data being delivered is prepared to go spaces. MasterCard Universal (www.mastercard.com) is not any more impossible to miss to think about the issues identified with monstrous databases. MasterCard has amassed a data circulation focus that is more than 100-terabytes in size. Insiders expect that it will create to more than 1.8 petabytes.…

    • 933 Words
    • 4 Pages
    Great Essays
  • Decent Essays

    Following is an annotated bibliography that helps to broaden the context of the issues and concepts discussed by FLC members. Teaching and Learning. Mellody, M., & National Research Council. (2014).…

    • 598 Words
    • 3 Pages
    Decent Essays
  • Superior Essays

    According to Darzi (2014), the introduction of records sharing schemes is already transforming the medical landscape by reducing errors and improving care. Electronic health records are crucial because ACO’s mange the health of the patient, thus requiring patient care records on every patient in the system. Health information technology provides clinicians with accurate and complete information about a patient’s health and reduces the amount of paperwork for patient and physicians. Additionally, electronic records are an essential part of data recording for the physicians to see the past history of each patient. This would allow the health care staff to take data from all the sources and use it specifically to track and manage the patients.…

    • 976 Words
    • 4 Pages
    Superior Essays
  • Great Essays

    Fully custom solution, not turn-key Every project that Fluent Analytics undertakes is completely customized to the needs of the client. Customization ensures that the analytics application we build develop a thorough understanding of your organization and your patients. Our customized solutions allows us to dig deeply into the data, both after the initial implementation, and during each subsequent implementation, allowing for further customizations based on new rules and definitions. Designing applications in this manner allows our clients to continuously improve their organizations through additional queries of the data, and getting answers that are accurate, and actionable.…

    • 1128 Words
    • 5 Pages
    Great Essays
  • Great Essays

    Dataclear is a business data analytics company, based out of Baton Rouge, Louisiana. The company was founded in 1998 by Greg McNally, a graduate of UC Berkeley, with a PhD in computer science. McNally developed his skill working as a software developer for 15 years, “at Borland and Oracle” (Case Study, para.5). DataClear established itself in a market that was wide open and offered very rapid customer and profit growth. Within the first year of operation, the company’s sales hit $2.2 million (Case Study).…

    • 1259 Words
    • 6 Pages
    Great Essays
  • Great Essays

    Many of the health care industries today have codes of ethics that guide the behavior of the professional in the medical field. Ethics refers to standards that provide criteria for what is right or wrong. Ethics is shaped by individuals, society and nations. In the past we have had ethical dilemmas, which is a situation in which a person is faced with a decision to test these morals. These decisions have helped shaped the way informatics is structured in our modern day.…

    • 1246 Words
    • 5 Pages
    Great Essays
  • Decent Essays

    E-Health services have several benefits however along with the advances the services face barriers that can impede the delivery of care for patients. Interoperability is a significant obstacle that can have an adverse reaction on patient care. Interoperability issues can be overcome if the technology is designed to communicate with other systems. According to Adebesin, Foster, Kotzé, & Van Greunen, (2013), healthcare information systems that share and exchange data is indispensable to facilitate the quality and effectiveness of health services. The development of new technology is in the works which will allow systems to communicate effectively other systems, until there is a standard system available health care providers will work with the…

    • 142 Words
    • 1 Pages
    Decent Essays
  • Superior Essays

    Amazon Research Paper

    • 1604 Words
    • 7 Pages

    This assignment has presented three main points why big data is so crucial. Firstly, Amazon is widely praised for having pioneered web-based customer relations, setting an industry benchmark in market-leading technological infrastructure. Secondly Amaxon offer anonline personal service References Kotler, P., Armstrong, G., Wong, V., and Saunders, J., 2008. Principles of Marketing.…

    • 1604 Words
    • 7 Pages
    Superior Essays
  • Improved Essays

    Social Media Data Analysis

    • 1277 Words
    • 5 Pages

    Having the necessary information is crucial in almost any area, especially for research and development. The purpose of collecting data is primarily for knowledge discovery, then for further consideration. In other words, all websites gather people’s information and base on them to explore their behaviours and trends. From there, they will shape the browsing experience that match each person’s preference in the future. As mentioned above, social media, especially Facebook and Google, track all users’ activities online.…

    • 1277 Words
    • 5 Pages
    Improved Essays