DCU Home | Our Courses | Loop | Registry | Library | Search DCU
<< Back to Module List

Module Specifications.

Current Academic Year 2024 - 2025

All Module information is indicative, and this portal is an interim interface pending the full upgrade of Coursebuilder and subsequent integration to the new DCU Student Information System (DCU Key).

As such, this is a point in time view of data which will be refreshed periodically. Some fields/data may not yet be available pending the completion of the full Coursebuilder upgrade and integration project. We will post status updates as they become available. Thank you for your patience and understanding.

Date posted: September 2024

Module Title Data Analytics & Data Mining
Module Code CA683 (ITS) / CSC1144 (Banner)
Faculty Engineering & Computing School Computing
Module Co-ordinatorAndrew Mccarren
Module Teachers-
NFQ level 9 Credit Rating 7.5
Pre-requisite Not Available
Co-requisite Not Available
Compatibles Not Available
Incompatibles Not Available
Repeat examination
Description

This module aims to review and complement foundation statistical knowledge and to establish the context for a range of methods, used in the analysis of simple and complex systems. The emphasis is on an intuitive understanding of the principles and a practical ability to apply these to data examples drawn from diverse systems, rather than mathematical sophistication.

Learning Outcomes

1. Explain the different levels of measurement and data types within the context of data analytics and data mining.
2. Apply underlying probability principles and distribution examples for data analytics and data mining.
3. Distinguish between descriptive and inferential statistical quantities in the theory and practice of statistics and in data analytics.
4. Appraise the scope and robustness of common analytical methods for one to many samples.
5. Implement a range of analytical statistical techniques and interpret outcomes.
6. Select an appropriate statistical software, having been exposed to several examples.
7. Apply data analysis and data mining techniques to a range of illustrative examples and case studies in big data and other real world systems.



Workload Full-time hours per semester
Type Hours Description
Lecture36Lecture Delivery.
Assignment Completion25Working on practical assignments.
Independent Study126.5Independent course reading and working through in class example.
Total Workload: 187.5

All module information is indicative and subject to change. For further information,students are advised to refer to the University's Marks and Standards and Programme Specific Regulations at: http://www.dcu.ie/registry/examinations/index.shtml

Indicative Content and Learning Activities

Measurement type, data types and quality. Consistency.

Review of basis probability, probability distributions and their uses.
Practical examples from relevant application : e.g. business , decision-making, queuing and service; bio-ict, - e.g. recombination and mapping functions, segregation, rare events and motif detection; social interaction, e.g. networks, and introduction to stochastic time series basis for event pattern recognition.

Bayes Methods and revised probabilities from new data feedback/incorporation

Estimation
Classical vs MLE, Bayesian and outline of when to use Non-parametrics.

Estimation and Hypothesis Testing for one to many-samples.
ANOVA (basic Experimental Design principles), and equivalent for discrete data – e.g. Contingencies (for several populations), as well as extensions to linear regression.

Applications
Examples for above, inc. time series features for time-conditioned data in e.g. bio-ict, social nets, business and financial series.

Software options

Assessment Breakdown
Continuous Assessment50% Examination Weight50%
Course Work Breakdown
TypeDescription% of totalAssessment Date
AssignmentIn Class Test In class supervised lab exam on data analysis25%Once per semester
AssignmentIn Class Test In class supervised lab exam on data analysis25%Once per semester
Reassessment Requirement Type
Resit arrangements are explained by the following categories:
Resit category 1: A resit is available for both* components of the module.
Resit category 2: No resit is available for a 100% continuous assessment module.
Resit category 3: No resit is available for the continuous assessment component where there is a continuous assessment and examination element.
* ‘Both’ is used in the context of the module having a Continuous Assessment/Examination split; where the module is 100% continuous assessment, there will also be a resit of the assessment
This module is category 1
Indicative Reading List

    Other Resources

    None
    CA/Exam split change agreed by PB 50/50 from 25/75

    << Back to Module List