Instructors: Online Quiz
Why this course?
Syllabus of the DATA SCIENCE Test
UNIT – I INTRODUCTION: DATA SCIENCE AND BIG DATA
Introduction to Data science and Big Data, Defining Data science and Big Data, Big Data examples, Data explosion, Data volume, Data Velocity, Big data infrastructure and challenges, Big Data Processing Architectures, Data Warehouse, Re-Engineering the Data Warehouse, Shared everything and shared nothing architecture, Big data learning approaches.
UNIT – II MATHEMATICAL FOUNDATION OF BIG DATA
Probability theory, Tail bounds with applications, Markov chains and random walks, Pair wise independence and universal hashing, Approximate counting, Approximate median, The streaming models, Flajolet Martin Distance sampling, Bloom filters, Local search and testing connectivity, Enforce test techniques, Random walks and testing, Boolean functions, BLR test for linearity.
UNIT - III BIG DATA PROCESSING
Big Data technologies, Introduction to Google file system, Hadoop Architecture, Hadoop Storage: HDFS, Common Hadoop Shell commands, Anatomy of File Write and Read, NameNode, Secondary NameNode, and DataNode, Hadoop MapReduce paradigm, Map Reduce tasks, Job, Task trackers - Cluster Setup – SSH & Hadoop Configuration, Introduction to: NOSQL, Textual ETL processing.
UNIT – IV BIG DATA ANALYTICS
Data analytics life cycle, Data cleaning , Data transformation, Comparing reporting and analysis, Types of analysis, Analytical approaches, Data analytics using R, Exploring basic features of R, Exploring R GUI, Reading data sets, Manipulating and processing data in R, Functions and packages in R, Performing graphical analysis in R, Integrating R and Hadoop, Hive, Data analytics.
UNIT – V Big Data Visualization
Introduction to Data visualization, Challenges to Big data visualization, Conventional data visualization tools, Techniques for visual data representations, Types of data visualization, Visualizing Big Data, Tools used in data visualization, Propriety Data Visualization tools, Open –source data visualization tools, Analytical techniques used in Big data visualization, Data visualization with Tableau, Introduction to: Pentaho, Flare, Jasper Reports, Dygraphs, Datameer Analytics Solution and Cloudera, Platfora, NodeBox, Gephi, Google Chart API, Flot, D3, and Visually.
UNIT – VI BIG DATA TECHNOLOGIES APPLICATION AND IMPACT
Social media analytics, Text mining, Mogile analytics , Roles and responsibilities of Big data person, Organizational impact, Data analytics life cycle, Data Scientist roles and responsibility, Understanding decision theory, creating big data strategy, big data value creation drivers, Michael Porter’s valuation creation models, Big data user experience ramifications, Identifying big data use cases.
After successful purchase, this item would be added to your courses.You can access your courses in the following ways :