DATA SCIENCE

Language: English

Instructors: Online Quiz

₹99 including GST

Why this course?

Description

Syllabus of the DATA SCIENCE Test

UNIT – I INTRODUCTION: DATA SCIENCE AND BIG DATA

Introduction to Data science and Big Data, Defining Data science and Big Data, Big Data examples, Data explosion, Data volume, Data Velocity, Big data infrastructure and challenges, Big Data Processing Architectures, Data Warehouse, Re-Engineering the Data Warehouse, Shared everything and shared nothing architecture, Big data learning approaches.

 

UNIT – II MATHEMATICAL FOUNDATION OF BIG DATA

Probability theory, Tail bounds with applications, Markov chains and random walks, Pair wise independence and universal hashing, Approximate counting, Approximate median, The streaming models, Flajolet Martin Distance sampling, Bloom filters, Local search and testing connectivity, Enforce test techniques, Random walks and testing, Boolean functions, BLR test for linearity.

 

UNIT - III BIG DATA PROCESSING 

Big Data technologies, Introduction to Google file system, Hadoop Architecture, Hadoop Storage: HDFS, Common Hadoop Shell commands, Anatomy of File Write and Read, NameNode, Secondary NameNode, and DataNode, Hadoop MapReduce paradigm, Map Reduce tasks, Job, Task trackers - Cluster Setup – SSH & Hadoop Configuration, Introduction to: NOSQL, Textual ETL processing.

 

UNIT – IV BIG DATA ANALYTICS 

Data analytics life cycle, Data cleaning , Data transformation, Comparing reporting and analysis, Types of analysis, Analytical approaches, Data analytics using R, Exploring basic features of R, Exploring R GUI, Reading data sets, Manipulating and processing data in R, Functions and packages in R, Performing graphical analysis in R, Integrating R and Hadoop, Hive, Data analytics.

 

UNIT – V Big Data Visualization 

Introduction to Data visualization, Challenges to Big data visualization, Conventional data visualization tools, Techniques for visual data representations, Types of data visualization, Visualizing Big Data, Tools used in data visualization, Propriety Data Visualization tools, Open –source data visualization tools, Analytical techniques used in Big data visualization, Data visualization with Tableau, Introduction to: Pentaho, Flare, Jasper Reports, Dygraphs, Datameer Analytics Solution and Cloudera, Platfora, NodeBox, Gephi, Google Chart API, Flot, D3, and Visually.

 

UNIT – VI BIG DATA TECHNOLOGIES APPLICATION AND IMPACT 

Social media analytics, Text mining, Mogile analytics , Roles and responsibilities of Big data person, Organizational impact, Data analytics life cycle, Data Scientist roles and responsibility, Understanding decision theory, creating big data strategy, big data value creation drivers, Michael Porter’s valuation creation models, Big data user experience ramifications, Identifying big data use cases.

Course Curriculum

How to Use

After successful purchase, this item would be added to your courses.You can access your courses in the following ways :

  • From the computer, you can access your courses after successful login
  • For other devices, you can access your library using this web app through browser of your device.