Registry

Module Specifications

Archived Version 2017 - 2018

Module Title	Computer Vision
Module Code	EE544
School	School of Electronic Engineering
Online Module Resources
Module Co-ordinator	Prof Paul F. Whelan	Office Number	S362
NFQ level	9	Credit Rating	7.5
Pre-requisite	None
Co-requisite	None
Compatibles	None
Incompatibles	None

Description

Computer vision applications have significantly expanded over the last decade and this core skill set is always in high demand by employers. This module will build on the basic concepts with a view to delving deeper into core computer vision, machine learning and deep learning topics. As well as examining traditional computer vision concepts (i.e. feature extraction and machine learning) a key focus of the module will be on deep learning as applied to computer vision. We will examine the core concepts behind deep learning for computer vision with a specific focus on Convolutional Neural Networks (CNN). Students will learn how to design and tune such networks in a range of practical applications and assignments. In addition we will examine a range of deep learning architectures ranging from AlexNet upto the current state of the art in this ever expanding field. Deep learning based computer vision forms the core of many of the recent developments in this field and has been widely adopted as a core AI tool by all the key industrial players such as Google, Facebook, IBM, Apple, Baidu ... as well as a wide range of highly innovative startups. All computer vision and deep learning concepts will be reinforced by guided practical work and case studies. This module is primarily aimed at those who aim to undertake research in computer vision or require a deeper understanding of the subject to address commercial computer vision development. Computer vision applications span a wide range of disciplines including industrial/machine vision, video data processing, biomedical engineering, healthcare, astronomy, imaging science, sensor technology, multimedia and enhanced reality systems. Please refer to the modules summary syllabus for a breakdown of the course content. This module will require basic programming skills. See the EE544 module website (http://www.eeng.dcu.ie/~whelanp/ee544/) for details on the computer vision & deep learning development environment.

Learning Outcomes

1. Recall, review and analyse the advanced theories, algorithms, methodologies and techniques involved in computer vision.
2. Illustrate their ability to comprehend and interpret issues relating to the design of advanced computer vision.
3. Synthesize and evaluate the relevant merits of competing advanced computer vision techniques.
4. Apply computer vision techniques in a range of application scenarios.
5. Develop an deep understanding of the issues involved in the evaluating computer vision research.
6. Demonstrate the ability to implement a computer vision pipeline.

*Type*	*Hours*	*Description*
Workload	Full-time hours per semester
Lecture	36	This module is presented in a traditional format (lecture and continuous assessment) with significant practical support: Including: pdf versions of the class slides, screencast videos (when available), computer vision, ML, DL development environment (used for the assignments and to illustrate all module concepts) and selected examples and case studies.
Assignment Completion	60	A significant element of this module consists of a workbook of practical computer vision design tasks.
Independent Study	91.5	General revision and practice, Online activity with module material
Total Workload: 187.5

All module information is indicative and subject to change. For further information,students are advised to refer to the University's Marks and Standards and Programme Specific Regulations at: http://www.dcu.ie/registry/examinations/index.shtml

Indicative Content and Learning Activities

Shape, Colour & Texture
Introduction to Python / scikit-image, Basic Morphology, Conditional Dilation, Geodesic Transformations, Reconstruction by Dilation, Applications, Double [Hysteresis] Threshold, Skeleton Influence By Zones, Watershed, Marker Controlled Segmentation, H-Extrema, Regional Maxima/Minima, Minima Imposition Technique, Opening/Closing by Reconstruction - Colour Spaces (CIE-Lab, CIE-Luv, Spatial SCIE-Lab), Colour Image Processing, Decorrelation Stretching, Computational Colour Constancy, Colour Segmentation (Feature-Based, Area Based, Physics Based), Integration of Colour-Texture Descriptors, Example Applications - Texture Spectrum Method, Local Binary Patterns (LBP), Rotation Invariance, Gabor Filtering, Multi-Channel Filtering, GIST, Texture based deep learning - Example applications and case studies.

Interest Point Detection & Feature Extraction
Interest Points Review, Moravec Corner Detector, Harris/Plessey Corner Detector, Scale Invariant Harris Corner Detector, SIFT – Scale Invariant Feature Transform, SIFT Matching, SIFT Variants, SIFT Applications - Histogram of Oriented Gradients (HOG), RHOG + SIFT, HOG + SVM, Deformable Part Models, Face Detection (Viola Jones), Boosting, Classifier Cascade, Bag of Visual Words - Example applications and case studies.

Classification & Machine Learning
Introduction to scikit-learn, Statistics Review, Principal Component Analysis (First Principals to Practical Applications), PCA Applied to Images: Eigenvalues and Eigenvectors, Training and Testing, Automated Selection of Eigenvalues - Metrics, Feature Normalisation, VC Dimension, Hughes Phenomenon, Feature Selection, Supervised & Unsupervised Classification, Hierarchical Clustering / Non-Hierarchical Clustering, KMeans Classification, Randomized Decision Forests, Evaluation of Classifier Performance, Parametric vs Non-Parametric Classifiers, Decision Trees, Gaussian Mixture Models, Expectation Maximization - Shallow vs Deep Learning, Support Vector Machines (SVM), SVM Classification, Kernels, SVM Multi-Class Classification, SVM Implementation - Example applications and case studies.

Deep Learning for Computer Vision
Introduction to Keras/Tensorflow, Feature vs Data Driven Approaches, Artificial Neural Networks (ANN), Logistic (Linear) Classifier, softmax, Stochastic Gradient Descent, Multi-Layer Perceptron, Gradient Decent, Backpropagation Learning Algorithm, Overfitting / Regularization, From ANN to Deep Learning, Sparse Coding, Supervised Deep Learning (DL), Convolution Neural Networks (CNN – Convnets), Network Architectures, Transfer Learning, DL APIs, DL Network Architectures (Alexnet to current SOA), Fully Convolutional Networks, Classification + Localization, R-CNN, RNN, Unsupervised Learning, Autoencoders, Style Transfer Network, Generative Adversarial Networks, DL Visualization - Example applications and case studies.

Systems Engineering
Computer Vision Design Factors, Video Standards and Techniques, Optical Terminology, Lenses, Lighting Design, Optical Material Characteristics, Typical Machine Vision Lamps, Lighting Techniques, Reliability, Solid State (Digital) Sensors, Sensor Characteristics, Colour Cameras, Pixel Level Effects, Monochrome Aberrations, Chromatic Aberrations, Vignetting, Extended Focus - Example applications and case studies.

3D Imaging (Spatial & Temporal)
Passive Stereoscopic Methods, Camera Calibration/Rectification, Epipolar Geometry, Shape from Stereo, Fundamental Matrix, Rectification Procedure, Stereo Feature Constraints & Matching, Depth Estimation Techniques (Window-based techniques, Dynamic programming, Deep Learning), Colour Stereo Vision, Multiple View Stereo Vision, Active Stereoscopic Methods, Structure (3D) from Motion (SfM), Depth From Focus, Laser Ranging Systems, Time of Flight Measurement, Structured Light, Moiré, Infrared Scanning, Shape from X, RGBD, Industry/Consumer Applications - Motion Tracking, Motion Field & Optical Flow, Optical Flow: 2D, Optical Flow: Local Approach, Histogram of Optical Flow (HOF), Feature Matching (Motion Correspondence), Kanade Lucas Tomasi Tracking.

Assessment Breakdown
Continuous Assessment	40%	Examination Weight	60%

Type	Description	% of total	Assessment Date
Course Work Breakdown

Reassessment Requirement
Resit arrangements are explained by the following categories; 1 = A resit is available for all components of the module 2 = No resit is available for 100% continuous assessment module 3 = No resit is available for the continuous assessment component
Unavailable

Indicative Reading List

Paul F Whelan: 0, Online course notes (slides),
Richard Szeliski: 0, Computer Vision: Algorithms and Applications, Springer, 9781848829343
Christopher M. Bishop: 2006, Pattern recognition and machine learning, Springer, New York, 9780387310732
Ian Goodfellow and Yoshua Bengio and Aaron Courville: 2016, Deep Learning, MIT Press, http://www.deeplearningbook.org,
Pierre Soille: 0, Morphological Image Analysis, Springer, 3540429883
Richard O. Duda, Peter E. Hart, David G. Strok: 0, Pattern classification, New York [etc.] John Wiley & sons cop. 2001, 0471056693
David A. Forsyth, Jean Ponce: 0, Computer vision, Upper Saddle River, N.J. Pearson Education 2003, 0131911937
Richard Hartley, Andrew Zisserman: 0, Multiple view geometry in computer vision, Cambridge ; Cambridge University Press, 2003., 0521540518

Other Resources

27166, Module Website, Paul F Whelan, 2017, EE544, http://www.eeng.dcu.ie/~whelanp/ee544/ee544_notes.html, 27162, Lecturer Website, Paul F Whelan, 2017, http://paulwhelan.eu/, http://paulwhelan.eu/,

Programme or List of Programmes

3UMHCT	MEng in Digital Health & Medical Tech.
BMEDV	M.Eng. in Biomedical Engineering
CAPD	PhD
CAPT	PhD-track
ECSAO	Study Abroad (Engineering & Computing)
EEPT	PhD-track
GCES	Grad Cert. in Electronic Systems
GCTC	Grad Cert. in Telecommunications Eng.
GDE	Graduate Diploma in Electronic Systems
GTC	Grad Dip in Telecommunications Eng
MECE	MEng Electronic & Computer Engineering
MEQ	Masters Engineering Qualifier Course
MTC	MEng in Telecommunications Engineering
SMPEC	Single Module Programme (Eng & Comp)

Archives: