NCC Expert Short Course in Data Science

NCC Expert Short Course

in Data Science

Introduction

The NCC Expert Short Course in Data Science offers advanced knowledge in data processing, predictive analytics, and machine learning, empowering learners to tackle complex data challenges and excel in high-demand roles.

NCC Expert Short Course in Data Science

Course Title and Duration

Course Title

NCC Expert Short Course in Data Science

Awarding Institution and Language of Study

Awarding Institution:

NCC Education

Language of Study:

English

Course Overview

The Expert Short Course in Data Science is designed to provide an in-depth understanding of advanced data science methodologies and techniques. This course aims to equip participants with comprehensive knowledge and skills in data science, including data processing, optimisation, big data concepts, predictive analytics, and machine learning. The course is suitable for individuals aiming to advance their careers in data science or related fields.

Entry Requirements

Participants should have a solid foundation in data science principles and basic programming skills. Previous completion of intermediate-level data science courses is recommended.

Syllabus Content

  1. Data Science Process
    • What is Data Science?
    • What do Data Scientists do?
    • How to be a Data Scientist?
    • What should you learn?
    • Data Science Process
    • Data Science Ethics
    • Data Scientific Method
      Learning Outcomes: 1, 2
  2. Optimisation and How to Formulate the Problem
    • Optimisation & Operation Research
    • Mathematical Optimisation Problem
    • Classification of Optimisation Problems
    • Integer Programming Problem
    • Stochastic Programming Problem
    • Linear Programming I: Simplex Method
    • Standard Form of a Linear Programming Problem
    • Characteristics of a Linear Programming Problem
    • Transformation of LP Problems into Standard Form
      Learning Outcomes: 3, 4
  3. Big Data Concepts
    • What is Big Data?
    • Big Data Characteristics
    • Big Data: 6V’s
    • Cloud Computing
    • Why Study Big Data Technologies?
    • Big Data Open Source Tools
    • Philosophy to Scale for Big Data Processing
    • What is Hadoop?
    • Why use Hadoop?
    • Core Parts of a Hadoop Distribution
    • Common Hadoop Distributions
      Learning Outcomes: 3
  4. Data Mashups and Big Data Infrastructure
    • Data Analytics and Mash-up
    • Data/Web Mashup
    • Architecture of Web Mashup
    • Implementation Architecture
    • Why Cloud Computing?
    • Advantages and Disadvantages of Cloud Computing
    • How Cloud Computing Works
    • Challenges of Cloud Computing
    • Layers of Cloud Computing
    • Components of Cloud Computing
    • Big Data
    • Hadoop Architecture
    • Hadoop with Big Data
    • Map Reduce
    • Why Data Analytics?
    • Types of Data Analytics
    • Big Data Analytics
      Learning Outcomes: 5
  5. Data Collection with Web Scraping Tools
    • What’s Wrong with Scraping?
    • Before Writing a Scraper
    • Alternatives to Scraping
    • Why Scrape Web?
    • Harvesting Options
    • Watch Out for Spider Traps!
    • Web Scraping, the Easy Way
    • Document Object Model (DOM)
    • Regular Expressions
      Learning Outcomes: 6
  6. Introduction to Predictive Analysis and Prediction to Multiple Regression Model
    • Data Mining Foundations
    • Problem Types and Model Paradigms
    • Model Performance Considerations
    • Recognising a Strong Model
    • K-Nearest Neighbours Classification
    • Decision Tree Models
    • K-Means Clustering
    • Multiple Regression
    • Formal Definition of the Model
    • Estimating the Parameters of the Model
    • Analysis of the Variance Table
    • F-Test for the Overall Fit of the Model
    • Interval Estimation
    • Selecting the Best Regression Equation
    • Example: Sales Forecasting
    • Interpreting the Final Model
      Learning Outcomes: 7
  7. Introduction to Logistic Regression
    • What is Logistic Regression?
    • Questions
    • Assumptions
    • Background
    • Plain Old Regression
    • An Alternative – The Ogive Function
    • The Logistics Function
    • The Logit
    • Conversion
      Learning Outcomes: 7
  8. Time Series & Predictive Modelling Techniques
    • Objective of Time Series Analysis
    • Classical Decomposition: An Example
    • Trend
    • Residuals
    • Trend and Residual Variation
    • Time Series Models
    • Gaussian White Noise
    • Random Walk
    • Time Series Modelling
    • Nonlinear Transformations
    • Differencing and Trend
    • Differencing and Seasonal Variation
    • What are Predictive Analytics?
    • Why Predictive Analytics?
    • What Do Predictive Analytics Do?
    • Predictive Modelling Techniques
      Learning Outcomes: 7
  9. Data Mining, Data Exploration, Cleaning, and Visualisation
    • What is Data Mining?
    • Data Mining Definitions
    • Why Mine Data? Commercial Viewpoint
    • Why Mine Data? Scientific Viewpoint
    • Database Processing vs. Data Mining Processing
    • Data Mining: Classification Schemes
    • Decisions in Data Mining
    • Data Mining Tasks
    • Data Mining Models and Tasks
    • Classification: Definition
    • Classification: Applications
    • Classification Techniques
    • Why Do We Need Data Mining?
    • The Data Analysis Pipeline
    • Data Quality
    • Sampling
    • Types of Sampling
    • A Data Mining Challenge
    • Data Collection
    • Mining Task
    • Exploratory Analysis of Data
    • Post Processing
      Learning Outcomes: 8
  10. Data Transformation and Reduction
  • Data Transformation
  • Data Transformation Challenges
  • Data Transformation Strategies
  • Normalisation
  • Normalisation Methods
  • Data Reduction Strategies
  • The Curse of Dimensionality
  • The Curse of Dimensionality: Principal Components Analysis
  • The Curse of Dimensionality: Factor Analysis
  • Data Sampling
  • Binning and Reduction of Cardinality
    Learning Outcomes: 9
  1. Cluster Analysis
  • What is Cluster Analysis?
  • Types of Data in Cluster Analysis
  • A Categorisation of Major Clustering Methods
  • Partitioning Methods
  • Hierarchical Methods
  • Grid-Based Methods
  • Model-Based Clustering Methods
  • Outlier Analysis
    Learning Outcomes: 8
  1. Decision Tree and Model Evaluation & Deployment
  • Decision Tree and Classification Task
  • Building Decision Tree
  • Built Decision Tree Algorithm
  • Node Splitting in BuildDT Algorithm
  • Entropy and Its Meaning
  • Decision Tree Induction Techniques
  • CRISP-DM Phases
  • Modelling
    • Select Modelling Technique
    • General Test Design
    • Build Models
    • Access Models
  • Evaluation
    • Evaluate Results
    • Review Process
    • Determine Next Steps
  • Deployment
    • Plan Deployment
    • Plan Monitoring and Maintenance
    • Produce Final Report
    • Review Project
      Learning Outcomes: 9

Assessment Strategy

Assessment includes:

Coursework

Various practical assignments and projects.

Examinations

Formal exams to test understanding of theoretical concepts.

Formative Assessments

Continuous feedback throughout the course for improvement.

Summative Assessments

Contribute to the final grade and assess overall knowledge and skills gained.

Career and Professional Development

Graduates can pursue roles such as Data Scientist, Machine Learning Engineer, Big Data Analyst, and Data Mining Specialist. This course also provides a strong foundation for further academic research or professional certifications in data science.

Support for Student Learning:

Students will have access to:

Personal Tutors

Available for guidance and support throughout the course.

Learning Resources

Online libraries, tutorials, and study materials.

Discussion Forums and Live Sessions

For interaction with peers and instructors.

Total Qualification Time:

Approximately 200 hours (including guided learning and self-study).

Conclusion

The NCC Expert Short Course in Data Science equips learners with advanced methodologies, tools, and skills to excel in data-driven environments. With comprehensive knowledge in predictive analytics, big data, and machine learning, graduates are well-prepared to tackle complex data challenges, pursue advanced roles in data science, or continue with further academic research, making a significant impact in the rapidly evolving data science landscape.

Leave A Comment

Your email address will not be published. Required fields are marked *