Data Science & Big Data Analytics Discovering, Analyzing, Visualizing and Presenting Data
Data Science & Big Data Analytics |
Big Data Overview
Data is created constantly, and at an ever-increasing rate. Mobile phones, social media, imaging technologies to determine a medical diagnosis—all these and more create new data, and that must be stored somewhere for some purpose. Devices and sensors automatically generate diagnostic information that needs to be stored and processed in real time. Merely keeping up with this huge influx of data is difficult, but substantially more challenging is analyzing vast amounts of it, especially when it does not conform to traditional notions of data structure, to identify meaningful patterns and extract useful information. These challenges of the data deluge present the opportunity to transform business, government, science, and everyday life.
Several industries have led the way in developing their ability to gather and exploit data:
- Credit card companies monitor every purchase their customers make and can identify fraudulent purchases with a high degree of accuracy using rules derived by processing billions of transactions.
- Mobile phone companies analyze subscribers’ calling patterns to determine, for example, whether acaller’s frequent contacts are on a rival network. If that rival network is offering an attractive promotion that might cause the subscriber to defect, the mobile phone company can proactively offer the subscriber an incentive to remain in her contract.
- For companies such as LinkedIn and Facebook, data itself is their primary product. The valuations of these companies are heavily derived from the data they gather and host, which contains more and more intrinsic value as the data grows.
- Three attributes stand out as defining Big Data characteristics:
- Huge volume of data: Rather than thousands or millions of rows, Big Data can be billions of rows andmillions of columns.
- Complexity of data types and structures: Big Data reflects the variety of new data sources, formats, and structures, including digital traces being left on the web and other digital repositories for subsequent analysis.
- Speed of new data creation and growth: Big Data can describe high velocity data, with rapid data ingestion and near real time analysis.
Introduction to Artificial Intelligence
Table of Contents
1.1 Big Data Overview
1.2 State of the Practice in Analytics
1.3 Key Roles for the New Big Data Ecosystem
1.4 Examples of Big Data Analytics
2 Data analytics lifecycle
2.1 Data Analytics Lifecycle Overview
2.2 Phase 1: Discovery
2.3 Phase 2: Data Preparation
2.4 Phase 3: Model Planning
2.5 Phase 4: Model Building
3. Review of Basic Data analytics using R
3.1 Introduction to R
3.2 Exploratory Data Analysis
3.3 Statistical Methods for Evaluation
4.2 K-means
4.3 Additional Algorithms
5. Advanced analytical theory and methods: Associations Rules
5.1 Overview
5.2 Apriori Algorithm
5.3 Evaluation of Candidate Rules
5.4 Applications of Association Rules
5.5 An Example: Transactions in a Grocery Store
5.6 Validation and Testing
5.7 Diagnostics
advertisement
Data Science & Big Data Analytics
ReplyDelete