DCU Home | Our Courses | Loop | Registry | Library | Search DCU
<< Back to Module List

Latest Module Specifications

Current Academic Year 2025 - 2026

Module Title Data Analysis & Machine Learning II
Module Code EEN1072 (ITS: EE514)
Faculty Electronic Engineering School Engineering & Computing
NFQ level 9 Credit Rating 7.5
Description

This module will provide students with fundamental and advanced skills required for data analysis and machine learning. The module will cover a large variety of machine learning algorithms including deep learning techniques. It is focused on providing students with a strong theoretical foundation, along with the ability to make practical use of the advanced techniques in the field. The Python programming language will be used for demonstrating the use of various techniques throughout the module, giving students practical tools for solving relatively sophisticated and broadly-defined real world problems in a well-established and widely-used programming environment.

Learning Outcomes

1. describe supervised machine learning theory, including problem types, best practices for data preparation, model selection, overfitting and underfitting, and bias-variance tradeoff
2. apply fundamental and advanced classification and regression algorithms
3. perform various types of generic unsupervised data analytics
4. describe the principles of modern representation learning and deep learning techniques and evaluate the merits of several state-of-the-art models
5. apply a variety of deep learning techniques using the artificial neural networks architectures
6. demonstrate a critical appreciation of available software packages for data analysis
7. demonstrate the ability to implement a predictive analytics pipeline


WorkloadFull time hours per semester
TypeHoursDescription
Lecture36Classroom Lectures
Independent Study24Regular Homeworks
Independent Study36Assignment Work
Independent Study92Self-directed study of materials and study for the examination.
Total Workload: 188
Section Breakdown
CRN21207Part of TermSemester 2
Coursework0%Examination Weight0%
Grade Scale40PASSPass Both ElementsY
Resit CategoryRC1Best MarkN
Module Co-ordinatorMuhammad Intizar AliModule TeacherJennifer Bruton, Kevin Mcguinness
Section Breakdown
CRN21151Part of TermSemester 2
Coursework0%Examination Weight0%
Grade Scale40PASSPass Both ElementsY
Resit CategoryRC1Best MarkN
Module Co-ordinatorMuhammad Intizar AliModule Teacher
Assessment Breakdown
TypeDescription% of totalAssessment Date
AssignmentStudents will be asked to apply at least two types of modern artificial neural networks based algorithms to solve a set of given problems. Students will demonstrate their practical machine learning skills to handle a given dataset, analyze the problem statement, train the machine learning model, fine tune it and present the outcomes of their analysis.25%Week 10
Formal Examinationn/a75%End-of-Semester
Reassessment Requirement Type
Resit arrangements are explained by the following categories;
RC1: A resit is available for both* components of the module.
RC2: No resit is available for a 100% coursework module.
RC3: No resit is available for the coursework component where there is a coursework and summative examination element.

* ‘Both’ is used in the context of the module having a coursework/summative examination split; where the module is 100% coursework, there will also be a resit of the assessment

Pre-requisite None
Co-requisite None
Compatibles None
Incompatibles None

All module information is indicative and subject to change. For further information,students are advised to refer to the University's Marks and Standards and Programme Specific Regulations at: http://www.dcu.ie/registry/examinations/index.shtml

Indicative Content and Learning Activities

Supervised Machine Learning Principles
Overview and objectives of supervised learning. Introduction to standard notation and conventions, problem types (regression, classification, structured prediction), training and tests sets, black box learning principles, training error, test error, generalization error, and out-of-sample error. Discussion of bias-variance tradeoff, overfitting and underfitting, the no free lunch theorem, model selection, cross-validation, data hygiene, and data snooping.

Supervised Machine Learning Algorithms
Discussion of several important classes of machine learning algorithms including linear regression, decision trees, ensemble methods, logistic regression, support vector machines, and a range of neural network types. Algorithm for optimizing loss functions (gradient descent and stochastic gradient descent). Types of loss functions (convex and non-convex). Kernel methods.

Representation Learning and Deep Learning
Principles of representation learning. Introduction to multi-layer perceptrons, stacked autoencoders, convolutional neural networks, and recurrent neural networks. Practical optimization methods, GPU-based optimization, and software packages. Illustration of several real-world applications (natural language processing, image classification, speech recognition, information retrieval, recommender systems). Comparative discussion of several industry standard technologies (Tensorflow, Caffe, Torch).

Artificial Neural Networks
Learn about the concept of artificial neural networks and understand the underlying architectures of neural network algorithms. Understand the idea of input, hidden and output layers of a neural network. Learn about Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN). Apply different algorithms for real-world data problems solving.

Indicative Reading List

Books:
  • Trevor Hastie, Robert Tibshirani, Jerome Friedman: 2009, The elements of statistical learning, Springer, New York, N.Y., 9780387848570
  • Yaser S. Abu-Mostafa, Malik Magdon-Ismail, Hsuan-Tien Lin: 0, Learning From Data, AMLBook, 213, 1600490069
  • Edward R. Tufte: 2001, The visual display of quantitative information, Graphics Press, Cheshire, Conn., 0961392142
  • Mark Pilgrim: 0, Dive Into Python, Apress, 413, 1590593561
  • Wes McKinney: 0, Python for Data Analysis, O'Reilly Media, 470, 1449319793
  • 0: Machine Learning in Python: Essential Techniques for Predictive Analysis, Chichester; John Wiley & Sons, 1118961749,


Articles:
None
Other Resources

  • Website: Scikit-learn: Machine Learning in Python, http://scikit-learn.org/stable/
  • Website: Pandas: Python Data Analysis Library, http://pandas.pydata.org/
  • Website: Seaborn: Statistical Data Visualization, http://stanford.edu/~mwaskom/software/seaborn/
  • Website: TensorFlow: Open Source Software Library for Machine Intelligence, https://www.tensorflow.org/

<< Back to Module List View 2024/25 Module Record for EE514