Big Data Fundamentals: Concepts, Drivers & Techniques (The by Thomas Erl, Wajid Khattak, Paul Buhler

By Thomas Erl, Wajid Khattak, Paul Buhler

“This textual content could be required analyzing for everybody in modern business.”
--Peter Woodhull, CEO, Modus21

“The one booklet that basically describes and hyperlinks titanic info ideas to enterprise utility.”
--Dr. Christopher Starr, PhD

“Simply, this is often the simplest tremendous information publication at the market!”
--Sam Rostam, Cascadian IT Group

“...one of the main modern ways I’ve visible to special information fundamentals...”
--Joshua M. Davis, PhD

The Definitive Plain-English consultant to special facts for company and expertise pros

Big information basics provides a practical, no-nonsense creation to special information. Best-selling IT writer Thomas Erl and his crew truly clarify key colossal info suggestions, idea and terminology, in addition to primary applied sciences and methods. All assurance is supported with case examine examples and diverse easy diagrams.

The authors start via explaining how mammoth info can propel a firm ahead via fixing a spectrum of formerly intractable enterprise difficulties. subsequent, they demystify key research suggestions and applied sciences and convey how a tremendous facts resolution surroundings could be outfitted and built-in to supply aggressive advantages.

  • Discovering substantial Data’s basic suggestions and what makes it diversified from past kinds of information research and knowledge science
  • Understanding the enterprise motivations and drivers in the back of gigantic info adoption, from operational advancements via innovation
  • Planning strategic, business-driven enormous facts initiatives
  • Addressing concerns resembling information administration, governance, and security
  • Recognizing the five “V” features of datasets in enormous information environments: quantity, speed, kind, veracity, and value
  • Clarifying immense Data’s relationships with OLTP, OLAP, ETL, facts warehouses, and information marts
  • Working with great info in based, unstructured, semi-structured, and metadata formats
  • Increasing worth by way of integrating gigantic facts assets with company functionality monitoring
  • Understanding how vast facts leverages disbursed and parallel processing
  • Using NoSQL and different applied sciences to fulfill colossal Data’s specific facts processing requirements
  • Leveraging statistical techniques of quantitative and qualitative analysis
  • Applying computational research tools, together with desktop learning

Show description

Read Online or Download Big Data Fundamentals: Concepts, Drivers & Techniques (The Prentice Hall Service Technology Series from Thomas Erl) PDF

Similar data mining books

The Role of Systems Methodology in Social Science Research, 1st Edition

Whereas basic structures learn has had a substantial influence on examine within the social sciences, this impression has been usually conceptual and has no longer served to supply the operational and methodological aids for learn that are attainable. furthermore, lots of these systems-oriented instructions and effects which do influence social technology study have constructed inde­ pendently and in piecemeal style in fresh many years.

Advances in Intelligent Data Analysis XIII: 13th International Symposium, IDA 2014, Leuven, Belgium, October 30 -- November 1, 2014. Proceedings (Lecture Notes in Computer Science)

This publication constitutes the refereed convention court cases of the thirteenth overseas convention on clever info research, which used to be held in October/November 2014 in Leuven, Belgium. The 33 revised complete papers including three invited papers have been conscientiously reviewed and chosen from 70 submissions dealing with all types of modeling and research tools, without reference to self-discipline.

Process Mining Techniques in Business Environments: Theoretical Aspects, Algorithms, Techniques and Open Challenges in Process Mining (Lecture Notes in Business Information Processing)

After a quick presentation of the cutting-edge of process-mining strategies, Andrea Burratin proposes diversified eventualities for the deployment of process-mining initiatives, and particularly a characterization of businesses when it comes to their technique know-how. The ways proposed during this ebook belong to 2 diverse computational paradigms: first to vintage "batch technique mining," and moment to more moderen "online strategy mining.

Real-World Machine Learning

Precis Real-World computing device studying is a pragmatic consultant designed to coach operating builders the artwork of ML undertaking execution. with out overdosing you on educational thought and intricate arithmetic, it introduces the daily perform of computing device studying, getting ready you to effectively construct and install strong ML structures.

Additional resources for Big Data Fundamentals: Concepts, Drivers & Techniques (The Prentice Hall Service Technology Series from Thomas Erl)

Sample text

The IT team attributes this to the data validation performed at multiple stages including validation at the time of data entry, validation at various points when an application is processing data, such as function-level input validation, and validation performed by the database when data is persisted. Looking outside ETI’s boundary, a study of a few samples taken from the social media data and weather data demonstrates further decline in veracity indicating that such data will require an increased level of data validation and cleansing to make it high veracity data.

In response to this, the team prepared a feasibility report that highlights the following obstacles: • Acquiring, storing and processing unstructured data from internal and external data sources – Currently, only structured data is stored and processed, because the existing technology does not support the storage and processing of unstructured data. • Processing large amounts of data in a timely manner – Although the EDW is used to generate reports based on historical data, the amount of data processed cannot be classified as large, and the reports take a long time to generate.

7 The cloud can be used to complete on-demand data analysis at the end of each month or enable the scaling out of systems with an increase in load. It makes sense for enterprises already using cloud computing to reuse the cloud for their Big Data initiatives because: • personnel already possesses the required cloud computing skills • the input data already exists in the cloud Migrating to the cloud is logical for enterprises planning to run analytics on datasets that are available via data markets, as many data markets make their datasets available in a cloud environment, such as Amazon S3.

Download PDF sample

Rated 4.31 of 5 – based on 36 votes

Categories: Data Mining