204553 Information Storage and Retrieval

Thursday evening, 18-21h

Asanee K., Arnon R.



 NEW: Reading and writing assignment.

Course Objective

This course intends to give students a strong background concerning the automated storage and retrieval of information. Since text is the primary way that human knowledge is stored, and after speech, the primary way it is transmitted to one and another, we will mainly focus on the techniques for storing and searching for textual documents. During this course, we intend to cover both the theoretical aspects of information retrieval design and evaluation and the practical aspects of how these theories have been implemented in actual systems.

Important Note: This is an interactive course. Students taking this course are expected to propose, investigate and report on possible solutions in IR problems. Note that each instructer will give a talk of approximately one to one and a half hour, then students have to fill the second and the third hour with prepared talk.

Course Syllabus

04-jun Introduction to Information Storage and Retrieval [1] (Ar)

11-jun Retrieval Models I [2,3] (Ar)

18-jun Retrieval Models II [2,3] (Ar)

25-jun Invited Speaker from Simon Fraser U. (Prof. Paul McFETRIDGE)

02-jul Retrieval Model III [11] (Ar)


16-jul NLP and IR system [3] (As)


23-jul Text Analysis and Language Processing I [3] (As)


30-jul ----- Mid-term Break----


06-aug Text Analysis and Language Processing II [3] (As)

13-aug Theory of Term Importance [5,2,3,12] (Ar)

20-aug Automatic Text Transformation [3] (As)

27-aug Performance Evaluation in IR [2,10] (Ar)

03-sep Selected readings (As)

10-sep Selected readings (Ar)

17-sep Selected readings (As)

Grading policy


References

Recommended text:

[1] William B. Frakes and Ricardo Baeza-Yates. Information retrieval: data structures & algorithms. Englewood Cliffs, N.J. : Prentice Hall, 1992. Gerarld

[2] Salton and McGrill. Introduction to Modern Information Retrieval. McGraw-Hill, 1983.

[3] Gerald Salton. Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer. Addison-Wesley, 1989.

Further reading:

4] Peter Willett, Karen Sparck Jones, Ray R. Larson. Readings In Information Retrieval. San Mateo, CA : Morgan Kaufman, 1997.

[5] C. J. van Rijsbergen. Information retrieval. London : Butterworths, 1975.

[6] William R. Hersh. Information Retrieval: A Health Care Perspective. Springer-Verlag, 1996.

[7] Witten, Moffat and Bell. Managing Gigabytes. Van Nostrand Reinhold, 1994.

[8] D.C. Blair. Language and Representation in Information Retrieval. Elsevier, 1990.

[9] Proceedings of the annual international ACM/SIGIR Conference on research and development in IR, ACM.

[10] The Text REtrieval Conference, NIST.

Some selected papers:

[11] Dumais, S. T., Furnas, G. W., Landauer, T. K. and Deerwester, S. (1988), "Using latent semantic analysis to improve information retrieval." In Proceedings of CHI'88: Conference on Human Factors in Computing, New York: ACM, 281-285.

[12] G. Salton, A. Wong and C.T. Yu, Automatic Indexing using Term Discrimination and Term Precision Measurement, Information Processing & Management, 1976.

[13] A. Singhal, C. Buckley and M. Mitra, Pivoted Document Length Normalization, ACM/SIGIR-96, Zurich-Switzerland, 1996.


Last modified: Mon May 25 10:35:55 ICT 1998