DCU Home | Our Courses | Loop | Registry | Library | Search DCU
<< Back to Module List

Latest Module Specifications

Current Academic Year 2025 - 2026

Module Title Advanced Analytics Techniques
Module Code BAA1053 (ITS: SB5003)
Faculty DCU Business School School DCU Business School
NFQ level 9 Credit Rating 10
Description

Today's business environment is characterised by enormous amounts of data with different sources and in different formats. This data has the potential to provide valuable information, leading to better decision-making and competitive advantage. However, managing, analysing and communicating the value of such a large amount of information requires specific skills. This module builds on the first-semester modules, BAA1088 and BAA1089, and provides students with a variety of advanced data analytics and statistical techniques to enable and support the analysis of such a vast amount of data.

Learning Outcomes

1. Evaluate, select and apply appropriate project management methodologies (e.g. PRINCE2) and techniques to conduct a data analytics project (e.g. CRISP-DM).
2. Understand the role of advanced analytics and Big Data, using Hadoop, and how to adopt it in different business environments.
3. Understand how to pursue business objectives with data mining and the best practices in different business environments.
4. Understand and demonstrate the ability to collect, clean, and prepare unstructured data types.
5. Understand the various unsupervised learning methods.
6. Understand the different Artificial Intelligence technologies, such as Natural language processing (NLP), Bidirectional Encoder Representations from Transformers (BERT), Deep learning, Generative AI/LLMs and the potential business implications of adopting such techniques in different business environments.
7. Critically analyse the outputs of the analysis to support and improve the decision-making process.
8. Build collaborative skills by teaming up to compare BERT and LLMs for a business application, and by jointly reporting and presenting performance, ethics, and business relevance with clear code-backed storytelling and team reflections.


WorkloadFull time hours per semester
TypeHoursDescription
Lecture44Lectures + Workshops/Tutorials
Tutorial11Additional Tutorials
Independent Study100Preparation for, and reading after the lectures
Online activity50Online training using Kubicle, Knime, and RapidMiner
Assignment Completion45Group assignment work: research and completion
Total Workload: 250
Section Breakdown
CRN10126Part of TermSemester 1
Coursework100%Examination Weight0%
Grade Scale40PASSPass Both ElementsN
Resit CategoryRC1Best MarkN
Module Co-ordinatorMathieu MercadierModule TeacherAnderson Augusto Simiscuka, Michael Farayola, Theodore Lynn
Section Breakdown
CRN21513Part of TermSemester 2
Coursework100%Examination Weight0%
Grade Scale40PASSPass Both ElementsN
Resit CategoryRC1Best MarkN
Module Co-ordinatorMathieu MercadierModule TeacherAnderson Augusto Simiscuka, Michael Farayola, Theodore Lynn
Assessment Breakdown
TypeDescription% of totalAssessment Date
Report(s)Using a CBL approach, students will apply the CRISP-DM framework to a Kaggle dataset, completing all project stages from business understanding to deployment. Students should align their project with at least one of the UN SDGs. Students will deliver a report, Python notebook, and presentation demonstrating actionable insights and real-world problem-solving skills.30%Week 24
Report(s)Build and compare three deep learning architectures (e.g., CNN, RNN, Transformer) while optimizing hyperparameters. Evaluate performance metrics, analyse trade-offs, and recommend the best configuration for the selected dataset and task.40%Week 28
Group project Compare BERT and Generative AI/LLMs for a business application. Analyse model performance, ethical considerations, and business relevance, presenting insights and code in a report and a presentation covering project scope, methodology, results, and team reflections.30%Week 30
Reassessment Requirement Type
Resit arrangements are explained by the following categories;
RC1: A resit is available for both* components of the module.
RC2: No resit is available for a 100% coursework module.
RC3: No resit is available for the coursework component where there is a coursework and summative examination element.

* ‘Both’ is used in the context of the module having a coursework/summative examination split; where the module is 100% coursework, there will also be a resit of the assessment

Pre-requisite None
Co-requisite None
Compatibles None
Incompatibles None

All module information is indicative and subject to change. For further information,students are advised to refer to the University's Marks and Standards and Programme Specific Regulations at: http://www.dcu.ie/registry/examinations/index.shtml

Indicative Content and Learning Activities

Introduction to Project Management
Key concepts in project management including starting and initiating a project, controlling a stage, managing product delivery, managing a stage boundary, directing and closing a project.

Managing data analytics projects
Deploying a data analytics management framework (CRISP-DM, Data Lifecycle Management).

Big Data
Exploration of Data Mining (pattern discovery and predictions) and Data Analytics (analysis and decision-making) in the context of Big Data, using Hadoop.

Unsupervised Machine Learning
Concepts of dimensionality Reduction Techniques (Principal component analysis, Fisher linear discriminant analysis, t-Distributed Stochastic Neighbor Embedding) and clustering analysis (Hierarchical and partitional clustering).

Text Analysis and Bidirectional Encoder Representations from Transformers (BERT)
Data collections and preparation, information retrieval models, and Natural Language Processing. Pre-training and fine-tuning for NLP tasks (e.g., text classification, question answering). Contextual embeddings and self-attention mechanisms.

Deep Learning
Neural networks, activation functions, loss functions, and backpropagation. Types of architectures: Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), and Transformers. Deep Reinforcement Learning. Applications: Image recognition, sequence prediction, and feature extraction.

Generative AI and Large Language Models (LLMs)
Key concepts of Generative AI and the architecture of LLMs (e.g., GPT, BERT variants). Pre-training, fine-tuning, and transfer learning. Applications: Text generation, summarization, machine translation, and conversational AI.

Indicative Reading List

Books:
  • Axelos: 2015, Prince2 Agile, stationery Office Limited TSO, 0, 9780113314676
  • Fenner, M.: 2020, Machine Learning with Python for Everyone., Pearson,
  • IIBA: 2015, A Guide to the Business Analysis Body of Knowledge, 5, 6, 8, 9, 13, 978-1-927584
  • Raschka, S.: 2024, Build a Large Language Model (From Scratch), Manning Publications,
  • Ravichandiran, S.: 2021, Getting Started with Google BERT: Build and train state-of-the-art natural language processing models using BERT, Packt Publishing Ltd.,
  • Siegel, E.: 2024, The AI Playbook: Mastering the Rare Art of Machine Learning Deployment, MIT Press,
  • Vajjala, S., Majumder, B., Gupta, A. and Surana, H.: 2020, Practical natural language processing: a comprehensive guide to building real-world NLP systems., O'Reilly Media,


Articles:
  • Abdi, H., & Williams, L. J.: 2010, Principal component analysis, Wiley interdisciplinary reviews: computational statistics, 2, 506045
  • 2017: A brief survey of text mining: Classification, clustering and extraction techniques., arXiv preprint, 506047, 1
  • Multimodal machine learning: A survey and taxonomy.: IEEE transactions on pattern analysis and machine intelligence, 41, 506042, 2, Chaudhuri, S., Dayal, U., and Narasayya, V.
  • Communications of the ACM: 54, http://dl.acm.org/citation.cfm?id=1978562, 506041, 1, Chen, H., Chiang, R. H. L., and Storey, V. C., 2012
  • 36: 506046, 1, Kobayashi, V. B., Mol, S. T., Berkers, H. A., Kismihók, G., & Den Hartog, D. N., 2018, Text Mining in Organizational Research
  • 523496: 1, Plotnikova, V., Dumas, M., Nolte, A., & Milani, F., 2023, Designing a data mining process for the financial services domain, Journal of Business Analytics, 6(2), 140,
  • 1: Siegel, E., 2024, Getting Machine Learning Projects from Idea to Execution, Harvard Business Review,
  • Wu, X., Zhu, X., Wu, G. Q., & Ding, W.: 2013, Data mining with big data, IEEE transactions on knowledge and data engineering, 26,
Other Resources

  • 1: Website, Kubicle - AI Fundamentals,
  • 404806: 1, Website, RapidMiner - Machine Learning,
  • https://academy.rapidminer.com/learning-paths/machine-learning: 419849, 1, Website, Knime
  • https://www.knime.com/learning:

<< Back to Module List View 2024/25 Module Record for BAA1053