Module: Data Analysis & Machine Learning II

Latest Module Specifications

Current Academic Year 2025 - 2026

Module Title	Data Analysis & Machine Learning II
Module Code	EEN1072 (ITS: EE514)
Faculty	Engineering & Computing	School	Electronic Engineering
NFQ level	9	Credit Rating	7.5

Description

This module will provide students with fundamental and advanced skills required for data analysis and machine learning. The module will cover a large variety of machine learning algorithms including deep learning techniques. It is focused on providing students with a strong theoretical foundation, along with the ability to make practical use of the advanced techniques in the field. The Python programming language will be used for demonstrating the use of various techniques throughout the module, giving students practical tools for solving relatively sophisticated and broadly-defined real world problems in a well-established and widely-used programming environment.

Learning Outcomes

1. describe supervised machine learning theory, including problem types, best practices for data preparation, model selection, overfitting and underfitting, and bias-variance tradeoff
2. apply fundamental and advanced classification and regression algorithms
3. perform various types of generic unsupervised data analytics
4. describe the principles of modern representation learning and deep learning techniques and evaluate the merits of several state-of-the-art models
5. apply a variety of deep learning techniques using the artificial neural networks architectures
6. demonstrate a critical appreciation of available software packages for data analysis
7. demonstrate the ability to implement a predictive analytics pipeline

*Type*	*Hours*	*Description*
Workload	Full time hours per semester
Lecture	36	Classroom Lectures
Independent Study	24	Regular Homeworks
Independent Study	36	Assignment Work
Independent Study	92	Self-directed study of materials and study for the examination.
Total Workload: 188

Section Breakdown
CRN	21207	Part of Term	Semester 2
Coursework	25%	Examination Weight	75%
Grade Scale	40PASS	Pass Both Elements	N
Resit Category	RC1	Best Mark	N
Module Co-ordinator	Muhammad Intizar Ali	Module Teacher	Jennifer Bruton, Kevin Mcguinness

Section Breakdown
CRN	21151	Part of Term	Semester 2
Coursework	25%	Examination Weight	75%
Grade Scale	40PASS	Pass Both Elements	N
Resit Category	RC1	Best Mark	N
Module Co-ordinator	Muhammad Intizar Ali	Module Teacher

Type	Description	% of total	Assessment Date
Assessment Breakdown
Assignment	Students will be asked to apply at least two types of modern artificial neural networks based algorithms to solve a set of given problems. Students will demonstrate their practical machine learning skills to handle a given dataset, analyze the problem statement, train the machine learning model, fine tune it and present the outcomes of their analysis.	25%	Week 10
Formal Examination	n/a	75%	End-of-Semester

Reassessment Requirement Type
Resit arrangements are explained by the following categories; RC1: A resit is available for both^* components of the module. RC2: No resit is available for a 100% coursework module. RC3: No resit is available for the coursework component where there is a coursework and summative examination element. ^* ‘Both’ is used in the context of the module having a coursework/summative examination split; where the module is 100% coursework, there will also be a resit of the assessment

Pre-requisite	None
Co-requisite	None
Compatibles	None
Incompatibles	None

All module information is indicative and subject to change. For further information,students are advised to refer to the University's Marks and Standards and Programme Specific Regulations at: http://www.dcu.ie/registry/examinations/index.shtml

Indicative Content and Learning Activities

Supervised Machine Learning Principles
Overview and objectives of supervised learning. Introduction to standard notation and conventions, problem types (regression, classification, structured prediction), training and tests sets, black box learning principles, training error, test error, generalization error, and out-of-sample error. Discussion of bias-variance tradeoff, overfitting and underfitting, the no free lunch theorem, model selection, cross-validation, data hygiene, and data snooping.

Supervised Machine Learning Algorithms
Discussion of several important classes of machine learning algorithms including linear regression, decision trees, ensemble methods, logistic regression, support vector machines, and a range of neural network types. Algorithm for optimizing loss functions (gradient descent and stochastic gradient descent). Types of loss functions (convex and non-convex). Kernel methods.

Representation Learning and Deep Learning
Principles of representation learning. Introduction to multi-layer perceptrons, stacked autoencoders, convolutional neural networks, and recurrent neural networks. Practical optimization methods, GPU-based optimization, and software packages. Illustration of several real-world applications (natural language processing, image classification, speech recognition, information retrieval, recommender systems). Comparative discussion of several industry standard technologies (Tensorflow, Caffe, Torch).

Artificial Neural Networks
Learn about the concept of artificial neural networks and understand the underlying architectures of neural network algorithms. Understand the idea of input, hidden and output layers of a neural network. Learn about Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN). Apply different algorithms for real-world data problems solving.

Indicative Reading List

Books:

Trevor Hastie, Robert Tibshirani, Jerome Friedman: 2009, The elements of statistical learning, Springer, New York, N.Y., 9780387848570
Yaser S. Abu-Mostafa, Malik Magdon-Ismail, Hsuan-Tien Lin: 0, Learning From Data, AMLBook, 213, 1600490069
Edward R. Tufte: 2001, The visual display of quantitative information, Graphics Press, Cheshire, Conn., 0961392142
Mark Pilgrim: 0, Dive Into Python, Apress, 413, 1590593561
Wes McKinney: 0, Python for Data Analysis, O'Reilly Media, 470, 1449319793
0: Machine Learning in Python: Essential Techniques for Predictive Analysis, Chichester; John Wiley & Sons, 1118961749,

Articles:
None

Other Resources

Website: Scikit-learn: Machine Learning in Python, http://scikit-learn.org/stable/
Website: Pandas: Python Data Analysis Library, http://pandas.pydata.org/
Website: Seaborn: Statistical Data Visualization, http://stanford.edu/~mwaskom/software/seaborn/
Website: TensorFlow: Open Source Software Library for Machine Intelligence, https://www.tensorflow.org/