What are the three sources of big data?

Jul 24, 2020 pm 04:02 PM
Big Data

What are the three sources of big data?

1. Transaction data, including POS machine data, credit card swipe data, etc.;

2. Human data, including emails, documents, pictures, and the data streams generated through WeChat, blogs, Twitter, etc.;

3. Machine and sensor data, such as data from sensors, meters, and other facilities.

What is big data?

Gartner, a research firm, gives this definition: "big data" refers to massive, high-growth, and diversified information assets that require new processing models to deliver stronger decision-making, insight discovery, and process optimization capabilities.

The definition given by McKinsey Global Institute is: a data collection so large that its acquisition, storage, management, and analysis greatly exceed the capabilities of traditional database software tools. It has four major characteristics: massive data scale, rapid data flow, diverse data types, and low value density.

The strategic significance of big data technology lies not in holding huge volumes of data, but in processing the meaningful data professionally. In other words, if big data is compared to an industry, the key to making that industry profitable is to improve its data "processing capabilities" and to add value to the data through that processing.

Technically, big data and cloud computing are as inseparable as the two sides of the same coin. Big data cannot be processed by a single computer and must use a distributed architecture. Its distinguishing feature is distributed data mining over massive data sets, which in turn relies on the distributed processing, distributed databases, cloud storage, and virtualization technology that cloud computing provides.
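To make the idea of a distributed architecture concrete, here is a minimal Python sketch of hash partitioning, one common way to spread records across several machines. It is only an illustration under stated assumptions: the function names (shard_for, partition, local_sum), the three-node setup, and the sample records are all hypothetical, and the "nodes" are simulated inside a single process.

```python
import hashlib
from collections import defaultdict


def shard_for(key: str, num_nodes: int) -> int:
    """Map a key to a node index using a stable hash (hypothetical scheme)."""
    digest = hashlib.md5(key.encode("utf-8")).hexdigest()
    return int(digest, 16) % num_nodes


def partition(records, num_nodes):
    """Group (key, value) records by the node that would store them."""
    shards = defaultdict(list)
    for key, value in records:
        shards[shard_for(key, num_nodes)].append((key, value))
    return shards


def local_sum(shard):
    """The work one node does on its own shard: sum the values it holds."""
    return sum(value for _, value in shard)


# Hypothetical sample records: (source id, amount in cents).
records = [
    ("pos-001", 1250),
    ("card-17", 4000),
    ("sensor-9", 320),
    ("pos-001", 750),
]

shards = partition(records, num_nodes=3)

# Each simulated node computes a partial result; a coordinator combines them.
partial_sums = [local_sum(shard) for shard in shards.values()]
total = sum(partial_sums)
print(total)
```

Because the hash is stable, records with the same key always land on the same node, so each node can work on its shard independently and only the small partial results need to be combined.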

With the advent of the cloud era, big data has attracted more and more attention. Analysts generally use the term to describe the large amounts of unstructured and semi-structured data a company creates, which would take too much time and money to load into a relational database for analysis. Big data analytics is often associated with cloud computing because real-time analysis of large data sets requires frameworks like MapReduce to distribute work across tens, hundreds, or even thousands of computers.
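The MapReduce pattern mentioned above can be sketched as a word count in plain Python. This is a simplified illustration, not how a production framework is implemented: the function names and sample text are hypothetical, and a thread pool stands in for the many computers a real cluster would use.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def map_phase(chunk):
    """Map step: emit a (word, 1) pair for every word in one chunk of text."""
    return [(word.lower(), 1) for word in chunk.split()]


def shuffle(mapped_chunks):
    """Shuffle step: group all emitted values by key across every chunk."""
    grouped = defaultdict(list)
    for pairs in mapped_chunks:
        for key, value in pairs:
            grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Reduce step: collapse each key's list of values into a single count."""
    return {key: sum(values) for key, values in grouped.items()}


def word_count(chunks, workers=4):
    """Run the map step concurrently, then shuffle and reduce the results."""
    # Threads are an assumption standing in for separate worker machines.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        mapped = list(pool.map(map_phase, chunks))
    return reduce_phase(shuffle(mapped))


# Hypothetical input, already split into chunks as a framework would do.
chunks = [
    "big data needs distributed processing",
    "distributed data mining of big data",
]
counts = word_count(chunks)
print(counts["data"])
```

Each map call touches only its own chunk, which is what allows the work to be spread over many machines; only the shuffle step needs to bring values for the same key together.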

Big data requires special techniques to handle large amounts of data efficiently within a tolerable amount of time. Technologies applicable to big data include massively parallel processing (MPP) databases, data mining, distributed file systems, distributed databases, cloud computing platforms, the Internet, and scalable storage systems.
