Module Specifications.
Current Academic Year 2024 - 2025
All Module information is indicative, and this portal is an interim interface pending the full upgrade of Coursebuilder and subsequent integration to the new DCU Student Information System (DCU Key).
As such, this is a point in time view of data which will be refreshed periodically. Some fields/data may not yet be available pending the completion of the full Coursebuilder upgrade and integration project. We will post status updates as they become available. Thank you for your patience and understanding.
Date posted: September 2024
| |||||||||||||||||||||||||||||||||||||||||||
None Students will undertake laboratories on a self-study basis. Project will be undertaken and assessed as an individual assignment. |
|||||||||||||||||||||||||||||||||||||||||||
Description The use of search technologies to locate relevant information from within increasingly voluminous archives of online digital media is rapidly becoming a ubiquitous and vital technology for daily life both in social and working environments. These archives include formally published text materials, heterogeneous web content, social media, audio-visual content, and various forms of enterprise content. The efficient location and delivery of content from these archives is enabling many exciting opportunities, increasing social engagement, creative exploitation of information, improved efficiency in business operations. However, realizing systems to perform reliable search and discovery of information, and effective delivery to users poses many challenges. This module introduces relevant search technologies and explores applications such as web search, image and video search, enterprise search and mobile search applications. The module covers key search topics, including content indexing, file structures, algorithms to support retrieval; related technologies such as content summarisation, speech and video processing; and user interaction in search and evaluation of search systems. | |||||||||||||||||||||||||||||||||||||||||||
Learning Outcomes 1. Explain the process of content indexing in information retrieval including stop word removal, conflation (stemming, string-comparison), and the language dependency of these methods. 2. Demonstrate an understanding of the importance and application of data structures in efficient information retrieval, in particular inverted file structures. 3. Have knowledge of the importance and operation of standard algorithms for ranked information retrieval, including the term weighting and ranking models, e.g. tf-idf weighting, vector-space model, probabilistic model, language modeling. 4. Describe the process of relevance feedback for improved ranking in information retrieval, and apply standard relevance feedback algorithms, e.g Roochio, and probabilistic methods. 5. Explain the principles of content summarization, be able to describe and apply standard extractive summarization methods, e.g. to form document snippets for web retrieval. 6. Describe the need for indexing in multimedia content including spoken and visual content, including explaining the impact of recognition errors on information retrieval behaviour. 7. Understand the importance of evaluation in development of search engines, and the application of standard evaluation metrics such as precision and recall and test collections in measuring effectiveness of information retrieval systems. 8. Appreciate the application and operation of search engines in diverse environments, e.g. web search, audio-visual search, context-aware and mobile search, enterprise search, patent search, search in lifelogging. 9. Be able to begin to combine technologies relevant to search systems in novel ways to synthesise new information retrieval applications. | |||||||||||||||||||||||||||||||||||||||||||
All module information is indicative and subject to change. For further information,students are advised to refer to the University's Marks and Standards and Programme Specific Regulations at: http://www.dcu.ie/registry/examinations/index.shtml |
|||||||||||||||||||||||||||||||||||||||||||
Indicative Content and Learning Activities
indexingtokenisation, stop word removal, conflation (e.g. stemming), data structures (e.g. inverted files)information retrieval algorithmsBoolean search, term weighting (e.g. tf-idf), vector-space model probabilistic model, language model, relevance feedback methods (e.g. Rocchio, probabilistic approaches)multimedia indexingspeech recognition, processing of image and videoinformation retrieval evaluationevaluation metrics (e.g. precision, recall), evaluation task and test collection developmentInformation retrieval applicationse.g. web search (including the learning-to-rank approach), audio-visual search, patent search,.lifelog search, mobile search, enterprise search | |||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||
Indicative Reading List
| |||||||||||||||||||||||||||||||||||||||||||
Other Resources None | |||||||||||||||||||||||||||||||||||||||||||