DCU Home | Our Courses | Loop | Registry | Library | Search DCU
<< Back to Module List

Latest Module Specifications

Current Academic Year 2025 - 2026

Module Title Foundations of Statistical Analysis & Machine Learning
Module Code CSC1181
Faculty Engineering & Computing School Computing
NFQ level 9 Credit Rating 7.5
Description

This module aims to review and complement foundation statistical knowledge and to establish the context for a range of methods, used in the analysis of simple and complex systems. The emphasis is on an intuitive understanding of the principles and a practical ability to apply these to data examples drawn from diverse systems. This module will cover introductory topics and conventional algorithms needed to understand the field of Machine Learning. It will prepare students to understand and distinguish between main groups of methods used in Machine Learning, supervised and unsupervised, and how and when they are applicable. Key topics will include Regression, Decision Trees, Naive Bayes, Neural Networks and Clustering.

Learning Outcomes

1. Statistical Data Analysis: Distinguish between descriptive and inferential statistical quantities in the theory and practice of statistics and in data analytics, use a range of analytical statistical techniques and interpret outcomes.
2. Fundamentals and applications of machine learning: Different levels of measurement and data types, Purpose and objectives of Machine Learning, basic terminology and concepts, real-world machine learning applications, supervised vs. unsupervised learning.
3. Probability and Linear Algebra: Simple approaches to prediction, Linear algebra review, Probability Distributions (Gaussian), Understanding and application of underlying probability principles and distribution examples, Least Squares, Nearest Neighbours, Decision Theory, Bayesian Methods.
4. Supervised Learning: Main methods used in supervised learning including Regression, Decision Trees, Naive Bayes, Support Vector Machines, validation and model evaluation, Basics of Neural Networks, Ensembles, Perceptron.
5. Unsupervised Learning: Principal techniques used in unsupervised learning including Clustering, K-means, Principal Component Analysis.


WorkloadFull time hours per semester
TypeHoursDescription
Lecture36Principles and methods
Tutorial12Examples
Assessment Feedback56Practical application/formal analysis
Independent Study83.5Reading, understanding, applying concepts and reviewing examples
Total Workload: 187.5
Section Breakdown
CRN12208Part of TermSemester 1
Coursework50%Examination Weight50%
Grade Scale40PASSPass Both ElementsN
Resit CategoryRC1Best MarkN
Module Co-ordinatorMarija BezbradicaModule Teacher
Assessment Breakdown
TypeDescription% of totalAssessment Date
In Class TestSupervised Lab exam20%Sem 2 End
Laboratory PortfolioThree supervised lab submissions assessing the material covered in the previous weeks each comprising some independent coding work and some commentary.30%As required
Formal ExaminationEnd of Module Formal Exam50%End-of-Semester
Reassessment Requirement Type
Resit arrangements are explained by the following categories;
RC1: A resit is available for both* components of the module.
RC2: No resit is available for a 100% coursework module.
RC3: No resit is available for the coursework component where there is a coursework and summative examination element.

* ‘Both’ is used in the context of the module having a coursework/summative examination split; where the module is 100% coursework, there will also be a resit of the assessment

Pre-requisite None
Co-requisite None
Compatibles None
Incompatibles None

All module information is indicative and subject to change. For further information,students are advised to refer to the University's Marks and Standards and Programme Specific Regulations at: http://www.dcu.ie/registry/examinations/index.shtml

Indicative Content and Learning Activities

Topic 1
Summary Statistics, Conditional Probability, Data Types

Topic 2
Statistical Distributions

Topic 3
Hypothesis Testing and Confidence Intervals

Topic 4
Linear Regression and Logistic regression

Topic 5
Naïve Bayes, Decision Tree

Topic 6
Basics of Neural Networks and Perceptron

Topic 7
Categorical Variables, Regression Problem, hyperparameters

Topic 8
Introduction to feature Engineering – Clustering/ K-means clustering, Principal Component Analysis – Dimensional

Indicative Reading List

Books:
  • John D. Kelleher,Brian Mac Namee,Aoife D'Arcy: 2015, Fundamentals of Machine Learning for Predictive Data Analytics, MIT Press, 619, 9780262029445
  • Trevor Hastie, Robert Tibshirani, Jerome Friedman: 2009, The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2, 0387848576
  • Tom Michael Mitchell: 1997, Machine Learning, McGraw-Hill Science/Engineering/Math, 414, 9780070428072
  • Michael J. Crawley: 2005, Statistics, John Wiley & Sons, 348, 0470022981
  • Thomas H. Davenport,Jeanne G. Harris: 2007, Competing on Analytics, Harvard Business Press, 243, 1422103323
  • Thomas H. Davenport,Jeanne G. Harris,Robert Morison: 2010, Analytics at Work, Harvard Business Press, 231, 1422177696
  • Sam L. Savage: 2009, The Flaw of Averages, Wiley, 0, 0471381977


Articles:
None
Other Resources

None

<< Back to Module List View 2024/25 Module Record for CSC1181